Homology
BLAST of IVF0006653 vs. ExPASy Swiss-Prot
Match:
P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)
HSP 1 Score: 199.5 bits (506), Expect = 2.7e-49
Identity = 186/760 (24.47%), Positives = 341/760 (44.87%), Query Frame = 0
Query: 748 TFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPL-VCENSNTKLSWGPVPFRLNSIAL 807
TFS+ID + + T N + +P SDH L + N+N ++LN+ L
Sbjct: 207 TFSKIDHIIGHKTGLNRYK--NIEIVPCILSDHHGLRLIFNNNINNGKPTFTWKLNNTLL 266
Query: 808 SDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAK-------- 867
+D K + + ++ ++ + +L + +K + + KL +L+ +K
Sbjct: 267 NDTLVKEGIKKEIKDFLEFNE---NEATTYPNLWDTMKAFLRGKLIALSASKKKRETAHT 326
Query: 868 DSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDEN 927
S+ + +++KKE ++P + + L+ +++++ + + + + + + ++
Sbjct: 327 SSLTTHLKALEKKEANSP-KRSRRQEIIKLRGEINQVETRRTIQRINQTRSWFFEKINKI 386
Query: 928 SSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPL----- 987
R+ + + I++I++E+G I I F+ ++Y STK + L
Sbjct: 387 DKPLARLTKGHRDKILINKIRNEKGDITTDPEEIQNTIRSFYKRLY--STKLENLDEMDK 446
Query: 988 FIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKED 1047
F+D + + HL +P EI+ VINSL KK+PG DGF F++T+ KED
Sbjct: 447 FLDRYQVPKLNQDQVDHLNSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQTF----KED 506
Query: 1048 ILDIFKDFYDKGVINKNMNNTY----IALIPK-KKDYSHPKDFRPISLTTSIYKIIAKTL 1107
++ I + K + + N++ I LIPK +KD + ++FRPISL KI+ K L
Sbjct: 507 LIPILHKLFHKIEVEGTLPNSFYEATITLIPKPQKDPTKIENFRPISLMNIDAKILNKIL 566
Query: 1108 SNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFW-KVKKIKGFILKLDIEKAFDK 1167
+NR++ + I +Q+ F+ Q I + + + K+K I+ LD EKAFDK
Sbjct: 567 ANRIQEHIKAIIHPDQVGFIPGMQGWFNIRKSINVIHYINKLKDKNHMIISLDAEKAFDK 626
Query: 1168 LNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFL 1227
+ F+ VLE+ + I+ S ++ VNG I G RQG PLSP+L
Sbjct: 627 IQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIKVNGEKLEAIPLKSGTRQGCPLSPYL 686
Query: 1228 FVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALS 1287
F I ++ L+R + + IKG+ + + L ADD++++I D L ++
Sbjct: 687 FNIVLEVLARAIRQQKE---IKGIQIGKE-EVKISLLADDMIVYISDPKNSTRELLNLIN 746
Query: 1288 LFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKS--NL 1347
F KIN KS + KE + + YLGV L K +
Sbjct: 747 SFGEVVGYKINSNKSMAFLYTKNKQAEKEIRETTPFSIVTNNIKYLGVTLTKEVKDLYDK 806
Query: 1348 FWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNIEKS 1407
+++++ +I++ L WK S GR+ ++K + IY+ + + P+ +E +
Sbjct: 807 NFKSLKKEIKEDLRRWKDLPCSWIGRINIVKMAILPKAIYRFNAIPIKIPTQFFNELEGA 866
Query: 1408 WRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALL--TKWLWRYLSEPNA 1467
KF+W + L+ + S GG+ L + +A++ T W W Y
Sbjct: 867 ICKFVWNNKKPRIAKSLLKDKRTS-----GGITMPDLKLYYRAIVIKTAWYW-YRDRQVD 926
Query: 1468 LWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIIDNIDW 1482
W R+ + + G + + + T + SI +N W
Sbjct: 927 QWNRIEDPEMNPHTYGHLIFDKGAKTIQWKKDSIFNNWCW 944
BLAST of IVF0006653 vs. ExPASy Swiss-Prot
Match:
O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)
HSP 1 Score: 197.2 bits (500), Expect = 1.4e-48
Identity = 188/727 (25.86%), Positives = 325/727 (44.70%), Query Frame = 0
Query: 748 TFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWG-PVPFRLNSIAL 807
T+S+ID + + L T + SDH + E L+ ++LN++ L
Sbjct: 200 TYSKIDHIVGSKAL--LSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLL 259
Query: 808 SD----PEFKRNMGRWWE------NSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTH 867
+D E K + ++E + Q+ F + R K +A + +++++ S
Sbjct: 260 NDYWVHNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIA--LNAYKRKQERSKID 319
Query: 868 AKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK---ESQFW-YQRAKKLWL 927
S L+E++ ++ QE + R LK ++ +L+ ES+ W ++R K+
Sbjct: 320 TLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKI-- 379
Query: 928 REGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRS---STK 987
R+ +++++ I I++++G I I T +++ +Y + + +
Sbjct: 380 ------DRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLE 439
Query: 988 SDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYL 1047
F+D + E L P EI +INSL KK+PG DGF F++ Y
Sbjct: 440 EMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEE 499
Query: 1048 LKEDILDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPISLTTSIYKIIAKTL 1107
L +L +F+ +G++ + I LIPK +D + ++FRPISL KI+ K L
Sbjct: 500 LVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKIL 559
Query: 1108 SNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKG-FILKLDIEKAFDK 1167
+NR++ + I +Q+ F+ Q I + + K K I+ +D EKAFDK
Sbjct: 560 ANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHVIISIDAEKAFDK 619
Query: 1168 LNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFL 1227
+ F+ L K ++ K IR T ++I+NG+ G RQG PLSP L
Sbjct: 620 IQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLL 679
Query: 1228 FVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALS 1287
F I ++ L+R + IKG+ L + LFADD+++++E+ NL +S
Sbjct: 680 FNIVLEVLARA---IRQEKEIKGIQLGKE-EVKLSLFADDMIVYLENPIVSAQNLLKLIS 739
Query: 1288 LFERASSLKINLLKSALVPMNVSVNRAKECASIWGIP--CHSLPLSYLGVPLGGNPKSNL 1347
F + S KIN+ KS N NR E + +P S + YLG+ L + K +L
Sbjct: 740 NFSKVSGYKINVQKSQAFLYN--NNRQTESQIMGELPFTIASKRIKYLGIQLTRDVK-DL 799
Query: 1348 FWRNVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV--FQAPSMTCKNI 1407
F N + +I++ N WK S GR+ ++K + IY+ + + P +
Sbjct: 800 FKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTEL 859
Query: 1408 EKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTK--WLWRYLSE 1446
EK+ KF+W + ++ S+ + GG+ + KA +TK W W Y +
Sbjct: 860 EKTTLKFIWNQKRARIAKSIL-----SQKNKAGGITLPDFKLYYKATVTKTAWYW-YQNR 901
BLAST of IVF0006653 vs. ExPASy Swiss-Prot
Match:
P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)
HSP 1 Score: 189.1 bits (479), Expect = 3.7e-46
Identity = 206/800 (25.75%), Positives = 356/800 (44.50%), Query Frame = 0
Query: 681 TVTDSSGATTSTNVLINQLNSGLA---SKGIGALGTSILQNVEQFHHQQSSDRSSLIN-- 740
T+TD S +ST++++ N+ LA L IL H +D +
Sbjct: 127 TLTDMSNLISSTSIVVGDFNTPLAVLDRSSKKKLSKEILDLNSTIQHLDLTDIYRTFHPN 186
Query: 741 -NRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCE-NSNTKLS 800
+T+ + + T+S+ID L + + NL +P SDH + E N+N L
Sbjct: 187 KTEYTFFSSAH-GTYSKIDHILGHKS--NLSKFKKIEIIPCIFSDHHGIKVELNNNRNLH 246
Query: 801 WGPVPFRLNSIALSD----PEFKRNMGRWWE-NSIQDGH-----PGFSFIQRLK--SLAN 860
++LN++ L D E K+ + ++ E N+ QD + + R K +L
Sbjct: 247 THTKTWKLNNLMLKDTWVIDEIKKEITKFLEQNNNQDTNYQNLWDTAKAVLRGKFIALQA 306
Query: 861 FIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRR--LALKADLSELSLKE 920
F+K ++E++++L + + ++K+E P + S R+ ++A+L+E+ K
Sbjct: 307 FLKKTEREEVNNL-------MGHLKQLEKEEHSNP---KPSRRKEITKIRAELNEIENKR 366
Query: 921 SQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKF 980
++K + + ++ + ++ +S I I++ I I ++
Sbjct: 367 IIQQINKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTDPSEIQKILNEY 426
Query: 981 FSKIYR---SSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLD 1040
+ K+Y + K +++ + E L P EI I +L KK+PG D
Sbjct: 427 YKKLYSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQNLPKKKSPGPD 486
Query: 1041 GFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKK-KDYSHPKDFRPIS 1100
GF F++T+ L +L++F++ +G++ I LIPK KD + +++RPIS
Sbjct: 487 GFTSEFYQTFKEELVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKDPTRKENYRPIS 546
Query: 1101 LTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMA-NEAVDFWKVKKIKG 1160
L KI+ K L+NR++ + I +Q+ F+ Q I + N K+K
Sbjct: 547 LMNIDAKILNKILTNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVIQHINKLKNKDH 606
Query: 1161 FILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKAN 1220
IL +D EKAFD + F+ L+K + K I S T ++I+NG
Sbjct: 607 MILSIDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIILNGVKLKSFPLR 666
Query: 1221 RGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIED 1280
G RQG PLSP LF I M+ L+ + AIKG+ + S I LFADD+++++E+
Sbjct: 667 SGTRQGCPLSPLLFNIVMEVLA---IAIREEKAIKGIHIGSE-EIKLSLFADDMIVYLEN 726
Query: 1281 NDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLP--LSY 1340
L + + S KIN KS V + N E IP +P + Y
Sbjct: 727 TRDSTTKLLEVIKEYSNVSGYKINTHKS--VAFIYTNNNQAEKTVKDSIPFTVVPKKMKY 786
Query: 1341 LGVPLGGNPKSNLFWRNVE---DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSV 1400
LGV L + K +L+ N E +I + +N WK S GR+ ++K ++ IY +
Sbjct: 787 LGVYLTKDVK-DLYKENYETLRKEIAEDVNKWKNIPCSWLGRINIVKMSILPKAIYNFNA 846
Query: 1401 --FQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEG-GLGTSRLHVTNKA 1447
+AP K++EK F+W + L++ +K+K G L RL+ +
Sbjct: 847 IPIKAPLSYFKDLEKIILHFIWNQKKPQIAKTLLS----NKNKAGGITLPDLRLYYKSIV 901
BLAST of IVF0006653 vs. ExPASy Swiss-Prot
Match:
P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)
HSP 1 Score: 153.3 bits (386), Expect = 2.2e-35
Identity = 171/719 (23.78%), Positives = 311/719 (43.25%), Query Frame = 0
Query: 750 SRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDP 809
SRIDR +S + T R P SDH + S + N+ L D
Sbjct: 202 SRIDRIYISSHLMSRAQSSTIRLAP--FSDHNCVSLRMSIAPSLPKAAYWHFNNSLLEDE 261
Query: 810 EFKRNMGRWWE--NSIQDGHPGFSFIQRLKSLAN-FIKPWQKEKLHSLTHAKDSILREVD 869
F +++ W + QD F+ + + + +K +E S++ +++ E++
Sbjct: 262 GFAKSVRDTWRGWRAFQD---EFATLNQWWDVGKVHLKLLCQEYTKSVSGQRNA---EIE 321
Query: 870 SIDKKELDTPLTQEESNRR------LALKADLSELSLKESQFWYQRAKKLWLREGDENSS 929
+++ + LD S + L K L + ++++ + R++ L + D S
Sbjct: 322 ALNGEVLDLEQRLSGSEDQALQCEYLERKEALRNMEQRQARGAFVRSRMQLLCDMDRGSR 381
Query: 930 FFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDW 989
FF+ + + R I + E+G +I F+ ++ S + D
Sbjct: 382 FFYALEKKKGNRKQITCLFAEDGTPLEDPEAIRDRARSFYQNLFSPDPISPDACEELWDG 441
Query: 990 NP-IEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFK 1049
P + L P DE+ + + K+PGLDG I FF+ +W L D +
Sbjct: 442 LPVVSERRKERLETPITLDELSQALRLMPHNKSPGLDGLTIEFFQFFWDTLGPDFHRVLT 501
Query: 1050 DFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDT 1109
+ + KG + + ++L+PKK D K++RP+SL ++ YKI+AK +S RLK+ L +
Sbjct: 502 EAFKKGELPLSCRRAVLSLLPKKGDLRLIKNWRPVSLLSTDYKIVAKAISLRLKSVLAEV 561
Query: 1110 ISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEK 1169
I +Q V R I D + + + + F + + L LD EKAFD+++ ++ L+
Sbjct: 562 IHPDQSYTVPGRTIFDNVFLIRDLLHFARRTGLSLAFLSLDQEKAFDRVDHQYLIGTLQA 621
Query: 1170 KNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLL 1229
+F + +++ ++ V +N + RG+RQG PLS L+ +A++ LL
Sbjct: 622 YSFGPQFVGYLKTMYASAECLVKINWSLTAPLAFGRGVRQGCPLSGQLYSLAIEPFLCLL 681
Query: 1230 SHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINL 1289
+ +K + + +ADD++L +D L + ++ ASS +IN
Sbjct: 682 RKRLTGLVLK----EPDMRVVLSAYADDVILVAQDL-VDLERAQECQEVYAAASSARINW 741
Query: 1290 LKSA-LVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGN--PKSNLFWRNVEDKIQK 1349
KS+ L+ ++ V+ + I S + YLGV L P S F +E+ +
Sbjct: 742 SKSSGLLEGSLKVDFLP--PAFRDISWESKIIKYLGVYLSAEEYPVSQNF-IELEECVLT 801
Query: 1350 KLNNWK-YAQI-SKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNG 1409
+L WK +A++ S GR +I ++S Y+L I++ FLW
Sbjct: 802 RLGKWKGFAKVLSMRGRALVINQLVASQIWYRLICLSPTQEFIAKIQRRLLDFLW----- 861
Query: 1410 SVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYL-SEPNALWRRLIQCKYK 1453
+G H ++ S +EGG G + + + RYL ++P+ W L Y+
Sbjct: 862 -IGKHWVSAGVSSLPLKEGGQGVVCIRSQVHTFRLQQIQRYLYADPSPQWCTLASSFYR 898
BLAST of IVF0006653 vs. ExPASy Swiss-Prot
Match:
P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)
HSP 1 Score: 109.0 bits (271), Expect = 4.8e-22
Identity = 57/173 (32.95%), Positives = 86/173 (49.71%), Query Frame = 0
Query: 1332 DKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKG 1391
+++ +++ W+ +S GRLTL K+ LSS+P++ +S P +++ R FLW
Sbjct: 18 ERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQSILNRLDQLSRTFLWGS 77
Query: 1392 NNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKY 1451
HL+ W+KV K+EGGLG N+AL++K WR L E N+LW ++Q KY
Sbjct: 78 TAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQKKY 137
Query: 1452 KGNYPGDIPSNISSITSKAPWRSIIDNI-DWFKSNQSWELNNGDQISFWYSNW 1504
D I + + WRSI + D W +G QI FW W
Sbjct: 138 HVGEIRDSRWLIPKGSWSSTWRSIAIGLRDVVSHGVGWIPGDGQQIRFWTDRW 190
BLAST of IVF0006653 vs. ExPASy TrEMBL
Match:
A0A5D3BLV7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005290 PE=4 SV=1)
HSP 1 Score: 2552.7 bits (6615), Expect = 0.0e+00
Identity = 1321/1687 (78.30%), Positives = 1378/1687 (81.68%), Query Frame = 0
Query: 21 SSLLLAVKRSLSSPVLFTAFHLPSLVFPTISQTMAYFKSLPRSCKVERKEFVLHLDKYSK 80
+ LLL VKRSLS PVLF AFHLPSL F +FKSLPRSCKVERKEFVLHLDKYSK
Sbjct: 22 TDLLLVVKRSLSPPVLFIAFHLPSLAFSYNLPNNGHFKSLPRSCKVERKEFVLHLDKYSK 81
Query: 81 HTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTR 140
HTHYWLTETGAHKAFSIEVSP+DLDWIRCTLKSLIATPNTNRFFLETRDSEQ IWIRKTR
Sbjct: 82 HTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTR 141
Query: 141 NSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPD 200
NSKGCTAEIFRVDQKNRKSCILVPEGP+KSGWVSFLSMITPKVEVKAKTRPTFLPR+SPD
Sbjct: 142 NSKGCTAEIFRVDQKNRKSCILVPEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPD 201
Query: 201 GRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVV 260
RLSPPIDYHKRSYA+AVTEGR ATSDSSDSYD+SDSSHSS NSFCDSPSSDLLENTVV
Sbjct: 202 CRLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSSNSFCDSPSSDLLENTVV 261
Query: 261 IVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKY 320
IVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGW+TVGKY
Sbjct: 262 IVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKY 321
Query: 321 SVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEETRSA 380
SV+FEKWS YHATPKLIPSYGGWTTFRGIPLHLWNM TFQQ+GKAC GLIKVAEETRSA
Sbjct: 322 SVRFEKWSPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACEGLIKVAEETRSA 381
Query: 381 KNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQA 440
KNL++ARIKVRYNYSGFLPANVRIFDNEGNKF +QVVTHPEGKWLIERNVRLHGTFKRQA
Sbjct: 382 KNLIEARIKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHPEGKWLIERNVRLHGTFKRQA 441
Query: 441 AAAFDEFNPESEQFFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPS 500
AA+FD+FNPESEQFFFEG EAISPDFLSTSSDGRKS+TPDQP ALKSVIIK DR AT PS
Sbjct: 442 AASFDDFNPESEQFFFEGSEAISPDFLSTSSDGRKSSTPDQPSALKSVIIKPDRNATLPS 501
Query: 501 FLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVS 560
FLNEE+VNDSNLHATANKSK EIL GISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVS
Sbjct: 502 FLNEELVNDSNLHATANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVS 561
Query: 561 FNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRER----------------------- 620
FNSP NKTNIFNPDSAPANHSPSL+SPEKKQKVSRER
Sbjct: 562 FNSPSNKTNIFNPDSAPANHSPSLNSPEKKQKVSRERSIKKKSSSTQPNSKANQNKGVFI 621
Query: 621 -------------------------------------NHHSSDNAEVIDITNTEVVPETP 680
+HH+SDNAEV+DITNTEVVPETP
Sbjct: 622 TQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHNSDNAEVVDITNTEVVPETP 681
Query: 681 EMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLK 740
EMKM VNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDS+AFKKQL SWLK+NGLK
Sbjct: 682 EMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKKNGLK 741
Query: 741 ISTVTDSSGATTSTNVLINQLNSGL----------------------------------- 800
+ST TDSSGATTSTNVL+NQ+NSGL
Sbjct: 742 LSTDTDSSGATTSTNVLLNQMNSGLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILW 801
Query: 801 ---------ASKGIGALGTSILQNVE----------------------QFHHQQ------ 860
+G+ +L + L N + H+ Q
Sbjct: 802 DAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFP 861
Query: 861 --------------------SSDRSS----------------LINNRFTWSNLRNPPTFS 920
SS +S L NNRFTWSNLRNPPTFS
Sbjct: 862 WILGGDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFS 921
Query: 921 RIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPE 980
RIDRFLYNS+WENLFSPHTTRTLPRSTSDHFPLVCE+SN KLSWGP+PFRLNSI LSDPE
Sbjct: 922 RIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPE 981
Query: 981 FKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDK 1040
FKRNMGRWWENSIQ G+PGFSFIQRLKSLANFIKPWQKEKLHSLT+AK++I+REVDSIDK
Sbjct: 982 FKRNMGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDK 1041
Query: 1041 KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ 1100
KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ
Sbjct: 1042 KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ 1101
Query: 1101 KRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSH 1160
KRSFIHEIQDEEG IQNTN SISTAFIKFFS+IYRSSTKSDPLFI+NLDWNPI SEWSH
Sbjct: 1102 KRSFIHEIQDEEGSIQNTNNSISTAFIKFFSRIYRSSTKSDPLFIENLDWNPIASSEWSH 1161
Query: 1161 LCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKN 1220
LCAPFLE EIKGVINS DGKKTPG DGFPISFFK++W
Sbjct: 1162 LCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHW----------------------- 1221
Query: 1221 MNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKN 1280
LKTTLP+TISGNQLAFVKN
Sbjct: 1222 -----------------------------------------LKTTLPNTISGNQLAFVKN 1281
Query: 1281 RQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWI 1340
RQITDAILMANEAVD+WKVKKIKGFILKLDIEKAFD LN DFID VLEKKNFP WRKWI
Sbjct: 1282 RQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNLDFIDNVLEKKNFPNPWRKWI 1341
Query: 1341 RGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKG 1400
RGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKG
Sbjct: 1342 RGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKG 1401
Query: 1401 VSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVS 1460
VSLN NCNISHILFADDILLFIEDND FL NLRMALSLFERAS LKINLLKSALVP+NVS
Sbjct: 1402 VSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVS 1461
Query: 1461 VNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG 1520
+ RAKECAS WGI CHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG
Sbjct: 1462 LKRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG 1521
Query: 1521 RLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSK 1538
RLTLIKSTLSSLPIYQLSVFQAPS+TCKNIEK WRKFLWKGNNGS GSHLINWTKVSKSK
Sbjct: 1522 RLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGNNGSEGSHLINWTKVSKSK 1581
BLAST of IVF0006653 vs. ExPASy TrEMBL
Match:
A0A5D3BL61 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G001020 PE=4 SV=1)
HSP 1 Score: 2205.6 bits (5714), Expect = 0.0e+00
Identity = 1132/1625 (69.66%), Positives = 1257/1625 (77.35%), Query Frame = 0
Query: 54 MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKS 113
MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKS
Sbjct: 1 MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
Query: 114 LIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWV 173
LI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILVPEG EKS WV
Sbjct: 61 LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120
Query: 174 SFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSY 233
SFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR+ +SDSSDSY
Sbjct: 121 SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180
Query: 234 DTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAE 293
+SDSS SSGNS CDSP LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAE
Sbjct: 181 ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240
Query: 294 KALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH 353
K LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Sbjct: 241 KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
Query: 354 LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFS 413
LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF
Sbjct: 301 LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
Query: 414 IQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDG 473
+QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQF F+G+EAISPD L+T S
Sbjct: 361 VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420
Query: 474 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 533
RKS +P+QP ALKSVIIK + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG L
Sbjct: 421 RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480
Query: 534 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 593
DKGKQKVDI Q SA K KRKVSFNSP NKT FNPDSAPANH SPEKK++V
Sbjct: 481 DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANH-----SPEKKKRV 540
Query: 594 SRER-------------------------------------------------------- 653
SRER
Sbjct: 541 SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600
Query: 654 ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRK 713
+HHSSDNAEVIDITNTEVVPETPE+KM E SNSS E NYRK KH H+R++YYRK
Sbjct: 601 KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660
Query: 714 KEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNS---------- 773
KE+KEKD +S+AFK QL +WLKENGLK+S TDSSGATTSTN L +QL S
Sbjct: 661 KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSGATTSTNALFSQLGSSAGGILILWD 720
Query: 774 ----GLASKGIGALGTS--------------------------ILQNVEQFHHQQS---- 833
L S+ G S + +++ HH S
Sbjct: 721 AQHHSLLSQEEGKFSLSANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWI 780
Query: 834 -------------------SDRSS----------------LINNRFTWSNLRNPPTFSRI 893
S SS L NNR+TWSNLRNPPTFSR+
Sbjct: 781 IGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRL 840
Query: 894 DRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK 953
DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Sbjct: 841 DRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFK 900
Query: 954 RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKE 1013
RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK SLT AK++I+REVDSIDK E
Sbjct: 901 RNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNE 960
Query: 1014 LDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKR 1073
LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR
Sbjct: 961 LDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKR 1020
Query: 1074 SFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLC 1133
+ IHEIQDEEG IQNTN +IS AF+ FS+IYR STK DPLFI+NL+WNPI++S+WS LC
Sbjct: 1021 NLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLC 1080
Query: 1134 APFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMN 1193
APF E+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMN
Sbjct: 1081 APFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMN 1140
Query: 1194 NTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ 1253
NTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Sbjct: 1141 NTYIALIEKKKDYSHPKDFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQ 1200
Query: 1254 ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRG 1313
ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P WRKWIRG
Sbjct: 1201 ITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRG 1260
Query: 1314 CISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVS 1373
CISNVTYS+IVNG+PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLES+GAIKG
Sbjct: 1261 CISNVTYSIIVNGKPQGRIKANRGLRQGDPLSLFLFVIAMDYLSRLLSHLESTGAIKG-- 1320
Query: 1374 LNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVN 1433
Sbjct: 1321 ------------------------------------------------------------ 1380
Query: 1434 RAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRL 1493
GI CH+LPL+YLGVPLGGNPKSNLFWRN+ED+IQKKL+NWKYA ISKGGRL
Sbjct: 1381 ---------GILCHTLPLTYLGVPLGGNPKSNLFWRNIEDRIQKKLSNWKYAHISKGGRL 1440
Query: 1494 TLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEE 1538
TLIKSTLSSLPIY+LSVFQAPS T KNIEK WR FLWKG+ G GSHLINW+ V+K KEE
Sbjct: 1441 TLIKSTLSSLPIYKLSVFQAPSSTYKNIEKLWRNFLWKGSCGLKGSHLINWSIVTKPKEE 1500
BLAST of IVF0006653 vs. ExPASy TrEMBL
Match:
A0A5A7TDG1 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G001050 PE=4 SV=1)
HSP 1 Score: 1934.8 bits (5011), Expect = 0.0e+00
Identity = 997/1387 (71.88%), Positives = 1108/1387 (79.88%), Query Frame = 0
Query: 54 MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKS 113
MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKS
Sbjct: 1 MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
Query: 114 LIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWV 173
LI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILVPEG EKS WV
Sbjct: 61 LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120
Query: 174 SFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSY 233
SFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR+ +SDSSDSY
Sbjct: 121 SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180
Query: 234 DTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAE 293
+SDSS SSGNS CDSP LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAE
Sbjct: 181 ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240
Query: 294 KALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH 353
K LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Sbjct: 241 KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
Query: 354 LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFS 413
LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF
Sbjct: 301 LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
Query: 414 IQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDG 473
+QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQF F+G+EAISPD L+T S
Sbjct: 361 VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420
Query: 474 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 533
RKS +P+QP ALKSVIIK + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG L
Sbjct: 421 RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480
Query: 534 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 593
DKGKQKVDI Q SA K KRKVSFNSP NKT FNPDSAPANH SPEKK++V
Sbjct: 481 DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANH-----SPEKKKRV 540
Query: 594 SRER-------------------------------------------------------- 653
SRER
Sbjct: 541 SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600
Query: 654 ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRK 713
+HHSSDNAEVIDITNTEVVPETPE+KM E SNSS E NYRK KH H+R++YYRK
Sbjct: 601 KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660
Query: 714 KEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNS---------- 773
KE+KEKD +S+AFK QL +WLKENGLK+S TDSSGATTSTN L +QL S
Sbjct: 661 KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSGATTSTNALFSQLGSSAGGILILWD 720
Query: 774 ----GLASKGIGALGTS--------------------------ILQNVEQFHHQQS---- 833
L S+ G S + +++ HH S
Sbjct: 721 AQHHSLLSQEEGKFSLSANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWI 780
Query: 834 -------------------SDRSS----------------LINNRFTWSNLRNPPTFSRI 893
S SS L NNR+TWSNLRNPPTFSR+
Sbjct: 781 IGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRL 840
Query: 894 DRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK 953
DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Sbjct: 841 DRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFK 900
Query: 954 RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKE 1013
RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK SLT AK++I+REVDSIDK E
Sbjct: 901 RNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNE 960
Query: 1014 LDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKR 1073
LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR
Sbjct: 961 LDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKR 1020
Query: 1074 SFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLC 1133
+ IHEIQDEEG IQNTN +IS AF+ FS+IYR STK DPLFI+NL+WNPI++S+WS LC
Sbjct: 1021 NLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLC 1080
Query: 1134 APFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMN 1193
APF E+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMN
Sbjct: 1081 APFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMN 1140
Query: 1194 NTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ 1253
NTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Sbjct: 1141 NTYIALIEKKKDYSHPKDFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQ 1200
Query: 1254 ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRG 1300
ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P WRKWIRG
Sbjct: 1201 ITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRG 1260
BLAST of IVF0006653 vs. ExPASy TrEMBL
Match:
A0A5D3C3M3 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G003160 PE=4 SV=1)
HSP 1 Score: 1895.9 bits (4910), Expect = 0.0e+00
Identity = 1014/1582 (64.10%), Positives = 1129/1582 (71.37%), Query Frame = 0
Query: 54 MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKS 113
MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKS
Sbjct: 1 MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
Query: 114 LIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWV 173
LI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILVPEGPEKSG V
Sbjct: 61 LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGRV 120
Query: 174 SFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRA--TSDSSDSY 233
SFLSMITPKVEVKAKTRPTFLPRSSP+ RLSPPIDYHKRSY +AV++GR+ +SDSSDSY
Sbjct: 121 SFLSMITPKVEVKAKTRPTFLPRSSPEFRLSPPIDYHKRSYEKAVSKGRSSISSDSSDSY 180
Query: 234 DTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAE 293
+SDSS SSGNS CDSP LLENTVV+
Sbjct: 181 TSSDSSQSSGNSPCDSPFPVLLENTVVL-------------------------------- 240
Query: 294 KALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH 353
AL+HF+SN+PANLLCQNKGWTTV KY V+
Sbjct: 241 -ALIHFNSNVPANLLCQNKGWTTVEKYMVR------------------------------ 300
Query: 354 LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFS 413
Sbjct: 301 ------------------------------------------------------------ 360
Query: 414 IQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDG 473
+ F+G+EAISPD L+T S
Sbjct: 361 ---------------------------------------KSLFDGLEAISPDLLNTISGS 420
Query: 474 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 533
RKSN+ +QP ALKSVIIK R ATSP+ LNEEVVND++LHAT KS+ +IL GISNDG L
Sbjct: 421 RKSNSREQPSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTIKSELKILSGISNDGSL 480
Query: 534 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 593
DKGKQKVDI Q SA DK KRKVSFNSP NKT FN DSAP NHSP LSSPEKKQ+V
Sbjct: 481 DKGKQKVDIPSQLTSAFIYDKPKRKVSFNSPSNKTTFFNSDSAPTNHSPPLSSPEKKQRV 540
Query: 594 SRER-------------------------------------------------------- 653
SRER
Sbjct: 541 SRERSVKKKSSTIQPKSRANQGKGELITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600
Query: 654 ----NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRK 713
+HHSSDNAEVIDITNTEVVPETPE+KM E SNSS E NYRK KH H+R++YYRK
Sbjct: 601 KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660
Query: 714 KEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSGLASKGIGAL 773
KE+KEKD +S+AFK QL +WLKENGLK+ST TDSSGATTSTN L +QL S ++ A+
Sbjct: 661 KEDKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGATTSTNALFSQLGSSISWIVKNAI 720
Query: 774 GTS--ILQNVEQFHHQ------------------QSSDRSS----------------LIN 833
+S IL + HH SS SS L N
Sbjct: 721 DSSGGILILWDAQHHSLLRGDLNVVRMREESTAVTSSSHSSNMLNNFISNNLLIDPPLTN 780
Query: 834 NRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWG 893
NR+TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTL R TSDHFPLVCE+S + L WG
Sbjct: 781 NRYTWSNLRNPPTFSRLDRFLYNSRWETLFNPHITRTLSRPTSDHFPLVCEDSTSTLRWG 840
Query: 894 PVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLT 953
P PFRLNSIAL+DP+FKRNM RWWE S+Q+GHPGFSFI+RLKSLAN IKPWQKEK HSLT
Sbjct: 841 PAPFRLNSIALNDPKFKRNMERWWELSVQNGHPGFSFIRRLKSLANLIKPWQKEKFHSLT 900
Query: 954 HAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREG 1013
AK++I+REVDSIDK ELDTPL+QEESNRRLALKA+LS+LSLKESQFW+QRAKKLWL+EG
Sbjct: 901 SAKENIIREVDSIDKNELDTPLSQEESNRRLALKAELSDLSLKESQFWFQRAKKLWLKEG 960
Query: 1014 DENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFI 1073
DENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS AF+ FS IYR STK DPLFI
Sbjct: 961 DENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSSIYRCSTKKDPLFI 1020
Query: 1074 DNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDIL 1133
+NL+WNPI++S+WS LCAPFLE+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDIL
Sbjct: 1021 ENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDIL 1080
Query: 1134 DIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTT 1193
DIFKDF++KG IIAKTLSNRLK T
Sbjct: 1081 DIFKDFFEKG-------------------------------------IIAKTLSNRLKLT 1140
Query: 1194 LPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDF 1253
LPDTISGNQLAF+KNRQITDAIL ANEA+D+WKVKKIK FILKLDIEKAFD LNWDFIDF
Sbjct: 1141 LPDTISGNQLAFIKNRQITDAILRANEALDYWKVKKIKSFILKLDIEKAFDNLNWDFIDF 1200
Query: 1254 VLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYL 1313
VL+KKN+P WRKWIRGCISNVTYS+IVN +PQ RIKANRGLRQGDPLSPFLFV AMDYL
Sbjct: 1201 VLKKKNYPNSWRKWIRGCISNVTYSIIVNEKPQDRIKANRGLRQGDPLSPFLFVSAMDYL 1260
Query: 1314 SRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSL 1373
SRLLSHLESSGAIKGV L ++CNISHILFADDILLF+EDND+FLNNLRMALSLFE+AS L
Sbjct: 1261 SRLLSHLESSGAIKGVCLANDCNISHILFADDILLFVEDNDHFLNNLRMALSLFEKASGL 1320
Query: 1374 KINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQ 1433
KINL KSA+VP+NVS +RA ECAS WGI CH+LPL+YLGVPLGGNPKSN+FWRN+ED+IQ
Sbjct: 1321 KINLSKSAMVPVNVSWSRALECASSWGISCHTLPLTYLGVPLGGNPKSNIFWRNIEDRIQ 1380
Query: 1434 KKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGS 1493
KKLNNWKYA ISKGGRLTLIKSTLSSL IYQLSVFQAP T KNIEK WR FLWKG+ G
Sbjct: 1381 KKLNNWKYAHISKGGRLTLIKSTLSSLSIYQLSVFQAPPSTYKNIEKLWRNFLWKGSFGL 1383
Query: 1494 VGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNY 1538
GSHLINW+ V+K KEEGGLG SRL V N+ALL+KWLWRY SEPN+LWRRLI KYKG +
Sbjct: 1441 KGSHLINWSIVTKLKEEGGLGISRLQVINQALLSKWLWRYYSEPNSLWRRLIHIKYKGKH 1383
BLAST of IVF0006653 vs. ExPASy TrEMBL
Match:
A0A5A7UV84 (Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold98G001710 PE=4 SV=1)
HSP 1 Score: 1700.3 bits (4402), Expect = 0.0e+00
Identity = 953/1539 (61.92%), Positives = 1029/1539 (66.86%), Query Frame = 0
Query: 178 MITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSD 237
MITPKVEVK KTRPTFLPRSSP+ RLSPPIDYHKRSYA+ VTEGR TSDSSDSY +SD
Sbjct: 1 MITPKVEVKEKTRPTFLPRSSPEYRLSPPIDYHKRSYAKVVTEGRPFTTSDSSDSYVSSD 60
Query: 238 SSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALV 297
SSHSSGNSFCDSPS DLLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAEKALV
Sbjct: 61 SSHSSGNSFCDSPSPDLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAEKALV 120
Query: 298 HFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNM 357
HF+SNIP NLLCQNKGWTTVGKYSV+FEKWS AYHATPKLIPSYGGWTTF+
Sbjct: 121 HFNSNIPENLLCQNKGWTTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFQ--------- 180
Query: 358 TTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVV 417
R++ LV+
Sbjct: 181 --------------------RNSATLVE-------------------------------- 240
Query: 418 THPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQ----FFFEGMEAISPDFLSTSSDG 477
+D+F+ E F+G EAISPDFLSTSS
Sbjct: 241 --------------------------YDDFSTNCESLRRIILFDGSEAISPDFLSTSSRS 300
Query: 478 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 537
RKS+TPDQP ALKSVIIK D+ ATSP++LNEEVVNDSNLHATANKS+ EIL GI NDGVL
Sbjct: 301 RKSSTPDQPSALKSVIIKPDKAATSPTYLNEEVVNDSNLHATANKSRLEILSGIPNDGVL 360
Query: 538 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 597
DKGKQKVDIQL PNSALNL+K KRKVSFNSP NKTNIFNPDSAPANHS SLSSPEKKQKV
Sbjct: 361 DKGKQKVDIQLHPNSALNLNKPKRKVSFNSPSNKTNIFNPDSAPANHSLSLSSPEKKQKV 420
Query: 598 SRER-------------------------------------------------------- 657
SRER
Sbjct: 421 SRERSIKKKSSSIQPIQNKGVLITQPIQVVAHDLEASKKGLSLIVNLGDLPVLDPSKSFE 480
Query: 658 NHHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK 717
+HHSS NAEVIDITNTEVVPETPEMKM VNENSNSSSEANYRKPKHVH+R+YYYRKK K
Sbjct: 481 DHHSSHNAEVIDITNTEVVPETPEMKMPVNENSNSSSEANYRKPKHVHRRRYYYRKKRSK 540
Query: 718 EKDPDSKA--------FKKQLASW---------------------------LKENGLKIS 777
+ + K +L +W L E LKI+
Sbjct: 541 GEGSGLRGEWGSGDIYCKMKLLTWNARGLGSPSKRALIKNAIISYSPDFVILTETMLKIT 600
Query: 778 T----------------VTDSSGATTSTNVLINQLNSGLAS--KGIGALGTSILQNVE-- 837
V ++SG++ +L + + L S + I +L + N
Sbjct: 601 NKRIIKSFWPSNSINWIVKNASGSSGGILILWDAQSHSLLSQEEAIFSLSANFFLNNNSS 660
Query: 838 --------------------QFHHQQ--------------------------SSDRSS-- 897
H+ Q SS SS
Sbjct: 661 WWLTGLYGPDKRRKRIHFWADLHNLQHLNSFPWSLERDLNVIRMREETTSILSSSHSSRM 720
Query: 898 --------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTS 957
L NNRFTWSNLRNP TFSRIDRFLYNS+WENLFSPHTTRTLPR TS
Sbjct: 721 LNNFISNNLLIDPPLTNNRFTWSNLRNPSTFSRIDRFLYNSSWENLFSPHTTRTLPRPTS 780
Query: 958 DHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKS 1017
DHFPLVCE+SN KL WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKS
Sbjct: 781 DHFPLVCEDSNPKLRWGPAPFRLNSIALNDPEFKRNMERWWENSVQNGHPGFSFIQRLKS 840
Query: 1018 LANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK 1077
LAN IKPWQKEKLHSL +AK++I+REVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLK
Sbjct: 841 LANHIKPWQKEKLHSLNYAKETIIREVDSIDKKELDTPLSQKESNRRLALKAELSDLSLK 900
Query: 1078 ESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIK 1137
ESQF C
Sbjct: 901 ESQF-----------------------C-------------------------------- 960
Query: 1138 FFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGF 1197
IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE+EIKGVINS DGKK P DGF
Sbjct: 961 ----IYKSSTKSDPLFIENLDWNPIEFSEWPHLCAPFLEEEIKGVINSFDGKKAPSPDGF 1020
Query: 1198 PISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTT 1257
PISFFK+YW+LLKEDI+DIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTT
Sbjct: 1021 PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIGKKKDYSHPKDFRPISLTT 1080
Query: 1258 SIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILK 1317
SIYKIIAKTLSNRLKTTLP TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILK
Sbjct: 1081 SIYKIIAKTLSNRLKTTLPGTISGNQLAFIKNRQITDAILMANEAVDYWKVKKIKGFILK 1140
Query: 1318 LDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLR 1377
LDIEK F LNWDFID+VL KKNFP WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLR
Sbjct: 1141 LDIEKVFYNLNWDFIDYVLGKKNFPNSWRKWIRGCISNVTYSVIINGRPQGRIKANRGLR 1200
Query: 1378 QGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYF 1437
QGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN+NCNISHILFADDILLF+EDND F
Sbjct: 1201 QGDPLSPFLFVIAMDYFSRLLSHLEASGAIKGVSLNNNCNISHILFADDILLFVEDNDCF 1260
Query: 1438 LNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLG 1497
LNNL MALSLFE+AS LKINLLKSALVP+NVS+NRAKECAS WGI CHSL LSYLGVPLG
Sbjct: 1261 LNNLIMALSLFEKASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLLLSYLGVPLG 1320
Query: 1498 GNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCK 1538
Sbjct: 1321 ------------------------------------------------------------ 1321
BLAST of IVF0006653 vs. NCBI nr
Match:
TYJ99315.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 2540 bits (6583), Expect = 0.0
Identity = 1321/1687 (78.30%), Positives = 1378/1687 (81.68%), Query Frame = 0
Query: 21 SSLLLAVKRSLSSPVLFTAFHLPSLVFPTISQTMAYFKSLPRSCKVERKEFVLHLDKYSK 80
+ LLL VKRSLS PVLF AFHLPSL F +FKSLPRSCKVERKEFVLHLDKYSK
Sbjct: 22 TDLLLVVKRSLSPPVLFIAFHLPSLAFSYNLPNNGHFKSLPRSCKVERKEFVLHLDKYSK 81
Query: 81 HTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKSLIATPNTNRFFLETRDSEQCIWIRKTR 140
HTHYWLTETGAHKAFSIEVSP+DLDWIRCTLKSLIATPNTNRFFLETRDSEQ IWIRKTR
Sbjct: 82 HTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTR 141
Query: 141 NSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWVSFLSMITPKVEVKAKTRPTFLPRSSPD 200
NSKGCTAEIFRVDQKNRKSCILVPEGP+KSGWVSFLSMITPKVEVKAKTRPTFLPR+SPD
Sbjct: 142 NSKGCTAEIFRVDQKNRKSCILVPEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPD 201
Query: 201 GRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSDSSHSSGNSFCDSPSSDLLENTVV 260
RLSPPIDYHKRSYA+AVTEGR ATSDSSDSYD+SDSSHSS NSFCDSPSSDLLENTVV
Sbjct: 202 CRLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHSSSNSFCDSPSSDLLENTVV 261
Query: 261 IVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWTTVGKY 320
IVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGW+TVGKY
Sbjct: 262 IVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKY 321
Query: 321 SVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNMTTFQQLGKACGGLIKVAEETRSA 380
SV+FEKWS YHATPKLIPSYGGWTTFRGIPLHLWNM TFQQ+GKAC GLIKVAEETRSA
Sbjct: 322 SVRFEKWSPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACEGLIKVAEETRSA 381
Query: 381 KNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVVTHPEGKWLIERNVRLHGTFKRQA 440
KNL++ARIKVRYNYSGFLPANVRIFDNEGNKF +QVVTHPEGKWLIERNVRLHGTFKRQA
Sbjct: 382 KNLIEARIKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHPEGKWLIERNVRLHGTFKRQA 441
Query: 441 AAAFDEFNPESEQFFFEGMEAISPDFLSTSSDGRKSNTPDQPPALKSVIIKSDRVATSPS 500
AA+FD+FNPESEQFFFEG EAISPDFLSTSSDGRKS+TPDQP ALKSVIIK DR AT PS
Sbjct: 442 AASFDDFNPESEQFFFEGSEAISPDFLSTSSDGRKSSTPDQPSALKSVIIKPDRNATLPS 501
Query: 501 FLNEEVVNDSNLHATANKSKSEILPGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVS 560
FLNEE+VNDSNLHATANKSK EIL GISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVS
Sbjct: 502 FLNEELVNDSNLHATANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVS 561
Query: 561 FNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKVSRERN---------------------- 620
FNSP NKTNIFNPDSAPANHSPSL+SPEKKQKVSRER+
Sbjct: 562 FNSPSNKTNIFNPDSAPANHSPSLNSPEKKQKVSRERSIKKKSSSTQPNSKANQNKGVFI 621
Query: 621 --------------------------------------HHSSDNAEVIDITNTEVVPETP 680
HH+SDNAEV+DITNTEVVPETP
Sbjct: 622 TQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKSLEDHHNSDNAEVVDITNTEVVPETP 681
Query: 681 EMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSKAFKKQLASWLKENGLK 740
EMKM VNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDS+AFKKQL SWLK+NGLK
Sbjct: 682 EMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKKNGLK 741
Query: 741 ISTVTDSSGATTSTNVLINQLNSGLA---------------------------------- 800
+ST TDSSGATTSTNVL+NQ+NSGL
Sbjct: 742 LSTDTDSSGATTSTNVLLNQMNSGLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILW 801
Query: 801 ----------SKGIGALGTSILQNVE----------------------QFHHQQ------ 860
+G+ +L + L N + H+ Q
Sbjct: 802 DAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFP 861
Query: 861 --------------------SSDRSS----------------LINNRFTWSNLRNPPTFS 920
SS +S L NNRFTWSNLRNPPTFS
Sbjct: 862 WILGGDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFS 921
Query: 921 RIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPE 980
RIDRFLYNS+WENLFSPHTTRTLPRSTSDHFPLVCE+SN KLSWGP+PFRLNSI LSDPE
Sbjct: 922 RIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPE 981
Query: 981 FKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDK 1040
FKRNMGRWWENSIQ G+PGFSFIQRLKSLANFIKPWQKEKLHSLT+AK++I+REVDSIDK
Sbjct: 982 FKRNMGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDK 1041
Query: 1041 KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ 1100
KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ
Sbjct: 1042 KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ 1101
Query: 1101 KRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSH 1160
KRSFIHEIQDEEG IQNTN SISTAFIKFFS+IYRSSTKSDPLFI+NLDWNPI SEWSH
Sbjct: 1102 KRSFIHEIQDEEGSIQNTNNSISTAFIKFFSRIYRSSTKSDPLFIENLDWNPIASSEWSH 1161
Query: 1161 LCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKN 1220
LCAPFLE EIKGVINS DGKKTPG DGFPISFFK++W
Sbjct: 1162 LCAPFLEGEIKGVINSFDGKKTPGPDGFPISFFKSHW----------------------- 1221
Query: 1221 MNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKN 1280
LKTTLP+TISGNQLAFVKN
Sbjct: 1222 -----------------------------------------LKTTLPNTISGNQLAFVKN 1281
Query: 1281 RQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWI 1340
RQITDAILMANEAVD+WKVKKIKGFILKLDIEKAFD LN DFID VLEKKNFP WRKWI
Sbjct: 1282 RQITDAILMANEAVDYWKVKKIKGFILKLDIEKAFDNLNLDFIDNVLEKKNFPNPWRKWI 1341
Query: 1341 RGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKG 1400
RGCISNVTYSVI+NGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKG
Sbjct: 1342 RGCISNVTYSVIINGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKG 1401
Query: 1401 VSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVS 1460
VSLN NCNISHILFADDILLFIEDND FL NLRMALSLFERAS LKINLLKSALVP+NVS
Sbjct: 1402 VSLNGNCNISHILFADDILLFIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVS 1461
Query: 1461 VNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG 1520
+ RAKECAS WGI CHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG
Sbjct: 1462 LKRAKECASFWGISCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGG 1521
Query: 1521 RLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSK 1537
RLTLIKSTLSSLPIYQLSVFQAPS+TCKNIEK WRKFLWKGNNGS GSHLINWTKVSKSK
Sbjct: 1522 RLTLIKSTLSSLPIYQLSVFQAPSLTCKNIEKLWRKFLWKGNNGSEGSHLINWTKVSKSK 1581
BLAST of IVF0006653 vs. NCBI nr
Match:
TYK00493.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 2196 bits (5691), Expect = 0.0
Identity = 1132/1625 (69.66%), Positives = 1257/1625 (77.35%), Query Frame = 0
Query: 54 MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKS 113
MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKS
Sbjct: 1 MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
Query: 114 LIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWV 173
LI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILVPEG EKS WV
Sbjct: 61 LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120
Query: 174 SFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSY 233
SFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR++ SDSSDSY
Sbjct: 121 SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180
Query: 234 DTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAE 293
+SDSS SSGNS CDSP LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAE
Sbjct: 181 ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240
Query: 294 KALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH 353
K LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Sbjct: 241 KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
Query: 354 LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFS 413
LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF
Sbjct: 301 LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
Query: 414 IQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDG 473
+QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQF F+G+EAISPD L+T S
Sbjct: 361 VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420
Query: 474 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 533
RKS +P+QP ALKSVIIK + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG L
Sbjct: 421 RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480
Query: 534 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 593
DKGKQKVDI Q SA K KRKVSFNSP NKT FNPDSAPANHSP EKK++V
Sbjct: 481 DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANHSP-----EKKKRV 540
Query: 594 SRERN------------------------------------------------------- 653
SRER+
Sbjct: 541 SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600
Query: 654 -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRK 713
HHSSDNAEVIDITNTEVVPETPE+KM E SNSS E NYRK KH H+R++YYRK
Sbjct: 601 KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660
Query: 714 KEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSG--------- 773
KE+KEKD +S+AFK QL +WLKENGLK+S TDSSGATTSTN L +QL S
Sbjct: 661 KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSGATTSTNALFSQLGSSAGGILILWD 720
Query: 774 -----LASKGIGALGTS--------------------------ILQNVEQFHHQQSS--- 833
L S+ G S + +++ HH SS
Sbjct: 721 AQHHSLLSQEEGKFSLSANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWI 780
Query: 834 --------------------DRSS----------------LINNRFTWSNLRNPPTFSRI 893
SS L NNR+TWSNLRNPPTFSR+
Sbjct: 781 IGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRL 840
Query: 894 DRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK 953
DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Sbjct: 841 DRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFK 900
Query: 954 RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKE 1013
RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK SLT AK++I+REVDSIDK E
Sbjct: 901 RNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNE 960
Query: 1014 LDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKR 1073
LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR
Sbjct: 961 LDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKR 1020
Query: 1074 SFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLC 1133
+ IHEIQDEEG IQNTN +IS AF+ FS+IYR STK DPLFI+NL+WNPI++S+WS LC
Sbjct: 1021 NLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLC 1080
Query: 1134 APFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMN 1193
APF E+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMN
Sbjct: 1081 APFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMN 1140
Query: 1194 NTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ 1253
NTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Sbjct: 1141 NTYIALIEKKKDYSHPKDFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQ 1200
Query: 1254 ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRG 1313
ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P WRKWIRG
Sbjct: 1201 ITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRG 1260
Query: 1314 CISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVS 1373
CISNVTYS+IVNG+PQGRIKANRGLRQGDPLS FLFVIAMDYLSRLLSHLES+GAIKG
Sbjct: 1261 CISNVTYSIIVNGKPQGRIKANRGLRQGDPLSLFLFVIAMDYLSRLLSHLESTGAIKG-- 1320
Query: 1374 LNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSLKINLLKSALVPMNVSVN 1433
Sbjct: 1321 ------------------------------------------------------------ 1380
Query: 1434 RAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRL 1493
GI CH+LPL+YLGVPLGGNPKSNLFWRN+ED+IQKKL+NWKYA ISKGGRL
Sbjct: 1381 ---------GILCHTLPLTYLGVPLGGNPKSNLFWRNIEDRIQKKLSNWKYAHISKGGRL 1440
Query: 1494 TLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEE 1537
TLIKSTLSSLPIY+LSVFQAPS T KNIEK WR FLWKG+ G GSHLINW+ V+K KEE
Sbjct: 1441 TLIKSTLSSLPIYKLSVFQAPSSTYKNIEKLWRNFLWKGSCGLKGSHLINWSIVTKPKEE 1500
BLAST of IVF0006653 vs. NCBI nr
Match:
KAA0039309.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 1925 bits (4986), Expect = 0.0
Identity = 997/1387 (71.88%), Positives = 1108/1387 (79.88%), Query Frame = 0
Query: 54 MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKS 113
MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKS
Sbjct: 1 MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
Query: 114 LIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWV 173
LI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILVPEG EKS WV
Sbjct: 61 LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120
Query: 174 SFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSY 233
SFLSMITPKVEVKAKTRP FLPRSSP+ RLSPPIDYHKRSYA+AV+EGR++ SDSSDSY
Sbjct: 121 SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180
Query: 234 DTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAE 293
+SDSS SSGNS CDSP LLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAE
Sbjct: 181 ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240
Query: 294 KALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH 353
K LVHF+SN+PANLLCQNKGWTTVGKY+V+FEKW+ A HA+PKLIPSYGGWTTFRGIPLH
Sbjct: 241 KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
Query: 354 LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFS 413
LWNM TFQQ+GKACGGLIKVAEET++A+NL++A++K+RYNYSGFLPA V+IFD EGNKF
Sbjct: 301 LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
Query: 414 IQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDG 473
+QVVTH EGKWL+ERNVRLHGTFKRQAAA+FD+FNP+SEQF F+G+EAISPD L+T S
Sbjct: 361 VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420
Query: 474 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 533
RKS +P+QP ALKSVIIK + ATSP+ LNEEVVND++LHATANKSK +IL GISNDG L
Sbjct: 421 RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480
Query: 534 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 593
DKGKQKVDI Q SA K KRKVSFNSP NKT FNPDSAPANHSP EKK++V
Sbjct: 481 DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANHSP-----EKKKRV 540
Query: 594 SRERN------------------------------------------------------- 653
SRER+
Sbjct: 541 SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600
Query: 654 -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRK 713
HHSSDNAEVIDITNTEVVPETPE+KM E SNSS E NYRK KH H+R++YYRK
Sbjct: 601 KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660
Query: 714 KEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSG--------- 773
KE+KEKD +S+AFK QL +WLKENGLK+S TDSSGATTSTN L +QL S
Sbjct: 661 KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSGATTSTNALFSQLGSSAGGILILWD 720
Query: 774 -----LASKGIGALGTS--------------------------ILQNVEQFHHQQSS--- 833
L S+ G S + +++ HH SS
Sbjct: 721 AQHHSLLSQEEGKFSLSANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWI 780
Query: 834 --------------------DRSS----------------LINNRFTWSNLRNPPTFSRI 893
SS L NNR+TWSNLRNPPTFSR+
Sbjct: 781 IGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRL 840
Query: 894 DRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFK 953
DRFLYNS WE LF+PH TRTLPR TSDHFPLVCE+S + L WGP PFRLNSIAL+DPEFK
Sbjct: 841 DRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFK 900
Query: 954 RNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKE 1013
RNM RWWE S+Q+GHPGF FIQRLKSLAN IKPWQKEK SLT AK++I+REVDSIDK E
Sbjct: 901 RNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNE 960
Query: 1014 LDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKR 1073
LDTPL+ EESNRRLALKA+L++LSLKESQFW+QRAKKLWL+EGDENS+FFHRICSSRQKR
Sbjct: 961 LDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKR 1020
Query: 1074 SFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLC 1133
+ IHEIQDEEG IQNTN +IS AF+ FS+IYR STK DPLFI+NL+WNPI++S+WS LC
Sbjct: 1021 NLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLC 1080
Query: 1134 APFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMN 1193
APF E+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDILDIFKDF++KGVINKNMN
Sbjct: 1081 APFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMN 1140
Query: 1194 NTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQ 1253
NTYIALI KKKDYSHPKDFRPISLTTSIYK IAKTLSNRLK TLPDTISGNQLAF+KNRQ
Sbjct: 1141 NTYIALIEKKKDYSHPKDFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQ 1200
Query: 1254 ITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRG 1299
ITDAILMANEA+D+WKVKKIKGFILKLDIEKAFD LNW+FID VL+K N+P WRKWIRG
Sbjct: 1201 ITDAILMANEALDYWKVKKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRG 1260
BLAST of IVF0006653 vs. NCBI nr
Match:
TYK05808.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 1885 bits (4884), Expect = 0.0
Identity = 1016/1582 (64.22%), Positives = 1131/1582 (71.49%), Query Frame = 0
Query: 54 MAYFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPKDLDWIRCTLKS 113
MAYFKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSP+DLDWIR TLKS
Sbjct: 1 MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
Query: 114 LIATPNTNRFFLETRDSEQCIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPEKSGWV 173
LI TP++NRFFLE RD E CIWIRKTRN KGCTAEIFRVD KNRKSCILVPEGPEKSG V
Sbjct: 61 LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGRV 120
Query: 174 SFLSMITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGRAT--SDSSDSY 233
SFLSMITPKVEVKAKTRPTFLPRSSP+ RLSPPIDYHKRSY +AV++GR++ SDSSDSY
Sbjct: 121 SFLSMITPKVEVKAKTRPTFLPRSSPEFRLSPPIDYHKRSYEKAVSKGRSSISSDSSDSY 180
Query: 234 DTSDSSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAE 293
+SDSS SSGNS CDSP LLENTVV+
Sbjct: 181 TSSDSSQSSGNSPCDSPFPVLLENTVVL-------------------------------- 240
Query: 294 KALVHFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLH 353
AL+HF+SN+PANLLCQNKGWTTV KY V+
Sbjct: 241 -ALIHFNSNVPANLLCQNKGWTTVEKYMVR------------------------------ 300
Query: 354 LWNMTTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFS 413
K+L
Sbjct: 301 ---------------------------KSL------------------------------ 360
Query: 414 IQVVTHPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQFFFEGMEAISPDFLSTSSDG 473
F+G+EAISPD L+T S
Sbjct: 361 ------------------------------------------FDGLEAISPDLLNTISGS 420
Query: 474 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 533
RKSN+ +QP ALKSVIIK R ATSP+ LNEEVVND++LHAT KS+ +IL GISNDG L
Sbjct: 421 RKSNSREQPSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTIKSELKILSGISNDGSL 480
Query: 534 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 593
DKGKQKVDI Q SA DK KRKVSFNSP NKT FN DSAP NHSP LSSPEKKQ+V
Sbjct: 481 DKGKQKVDIPSQLTSAFIYDKPKRKVSFNSPSNKTTFFNSDSAPTNHSPPLSSPEKKQRV 540
Query: 594 SRERN------------------------------------------------------- 653
SRER+
Sbjct: 541 SRERSVKKKSSTIQPKSRANQGKGELITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600
Query: 654 -----HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRK 713
HHSSDNAEVIDITNTEVVPETPE+KM E SNSS E NYRK KH H+R++YYRK
Sbjct: 601 KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660
Query: 714 KEEKEKDPDSKAFKKQLASWLKENGLKISTVTDSSGATTSTNVLINQLNSGLASKGIGAL 773
KE+KEKD +S+AFK QL +WLKENGLK+ST TDSSGATTSTN L +QL S ++ A+
Sbjct: 661 KEDKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGATTSTNALFSQLGSSISWIVKNAI 720
Query: 774 GTS--ILQNVEQFHHQ------------------QSSDRSS----------------LIN 833
+S IL + HH SS SS L N
Sbjct: 721 DSSGGILILWDAQHHSLLRGDLNVVRMREESTAVTSSSHSSNMLNNFISNNLLIDPPLTN 780
Query: 834 NRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTSDHFPLVCENSNTKLSWG 893
NR+TWSNLRNPPTFSR+DRFLYNS WE LF+PH TRTL R TSDHFPLVCE+S + L WG
Sbjct: 781 NRYTWSNLRNPPTFSRLDRFLYNSRWETLFNPHITRTLSRPTSDHFPLVCEDSTSTLRWG 840
Query: 894 PVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKSLANFIKPWQKEKLHSLT 953
P PFRLNSIAL+DP+FKRNM RWWE S+Q+GHPGFSFI+RLKSLAN IKPWQKEK HSLT
Sbjct: 841 PAPFRLNSIALNDPKFKRNMERWWELSVQNGHPGFSFIRRLKSLANLIKPWQKEKFHSLT 900
Query: 954 HAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREG 1013
AK++I+REVDSIDK ELDTPL+QEESNRRLALKA+LS+LSLKESQFW+QRAKKLWL+EG
Sbjct: 901 SAKENIIREVDSIDKNELDTPLSQEESNRRLALKAELSDLSLKESQFWFQRAKKLWLKEG 960
Query: 1014 DENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIKFFSKIYRSSTKSDPLFI 1073
DENS+FFHRICSSRQKR+ IHEIQDEEG IQNTN +IS AF+ FS IYR STK DPLFI
Sbjct: 961 DENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSSIYRCSTKKDPLFI 1020
Query: 1074 DNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDIL 1133
+NL+WNPI++S+WS LCAPFLE+EIKGVI S DG K PG DGFPISFFK+YW+LLKEDIL
Sbjct: 1021 ENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDIL 1080
Query: 1134 DIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKTT 1193
DIFKDF++KG IIAKTLSNRLK T
Sbjct: 1081 DIFKDFFEKG-------------------------------------IIAKTLSNRLKLT 1140
Query: 1194 LPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILKLDIEKAFDKLNWDFIDF 1253
LPDTISGNQLAF+KNRQITDAIL ANEA+D+WKVKKIK FILKLDIEKAFD LNWDFIDF
Sbjct: 1141 LPDTISGNQLAFIKNRQITDAILRANEALDYWKVKKIKSFILKLDIEKAFDNLNWDFIDF 1200
Query: 1254 VLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYL 1313
VL+KKN+P WRKWIRGCISNVTYS+IVN +PQ RIKANRGLRQGDPLSPFLFV AMDYL
Sbjct: 1201 VLKKKNYPNSWRKWIRGCISNVTYSIIVNEKPQDRIKANRGLRQGDPLSPFLFVSAMDYL 1260
Query: 1314 SRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYFLNNLRMALSLFERASSL 1373
SRLLSHLESSGAIKGV L ++CNISHILFADDILLF+EDND+FLNNLRMALSLFE+AS L
Sbjct: 1261 SRLLSHLESSGAIKGVCLANDCNISHILFADDILLFVEDNDHFLNNLRMALSLFEKASGL 1320
Query: 1374 KINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLGGNPKSNLFWRNVEDKIQ 1433
KINL KSA+VP+NVS +RA ECAS WGI CH+LPL+YLGVPLGGNPKSN+FWRN+ED+IQ
Sbjct: 1321 KINLSKSAMVPVNVSWSRALECASSWGISCHTLPLTYLGVPLGGNPKSNIFWRNIEDRIQ 1380
Query: 1434 KKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGS 1493
KKLNNWKYA ISKGGRLTLIKSTLSSL IYQLSVFQAP T KNIEK WR FLWKG+ G
Sbjct: 1381 KKLNNWKYAHISKGGRLTLIKSTLSSLSIYQLSVFQAPPSTYKNIEKLWRNFLWKGSFGL 1383
Query: 1494 VGSHLINWTKVSKSKEEGGLGTSRLHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNY 1537
GSHLINW+ V+K KEEGGLG SRL V N+ALL+KWLWRY SEPN+LWRRLI KYKG +
Sbjct: 1441 KGSHLINWSIVTKLKEEGGLGISRLQVINQALLSKWLWRYYSEPNSLWRRLIHIKYKGKH 1383
BLAST of IVF0006653 vs. NCBI nr
Match:
KAA0058980.1 (uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa])
HSP 1 Score: 1691 bits (4380), Expect = 0.0
Identity = 953/1539 (61.92%), Positives = 1029/1539 (66.86%), Query Frame = 0
Query: 178 MITPKVEVKAKTRPTFLPRSSPDGRLSPPIDYHKRSYARAVTEGR--ATSDSSDSYDTSD 237
MITPKVEVK KTRPTFLPRSSP+ RLSPPIDYHKRSYA+ VTEGR TSDSSDSY +SD
Sbjct: 1 MITPKVEVKEKTRPTFLPRSSPEYRLSPPIDYHKRSYAKVVTEGRPFTTSDSSDSYVSSD 60
Query: 238 SSHSSGNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKALV 297
SSHSSGNSFCDSPS DLLENTVV+VRRFFHDDW KILQNLRKQTEESFTYNAFHAEKALV
Sbjct: 61 SSHSSGNSFCDSPSPDLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAEKALV 120
Query: 298 HFSSNIPANLLCQNKGWTTVGKYSVKFEKWSSAYHATPKLIPSYGGWTTFRGIPLHLWNM 357
HF+SNIP NLLCQNKGWTTVGKYSV+FEKWS AYHATPKLIPSYGGWTTF+
Sbjct: 121 HFNSNIPENLLCQNKGWTTVGKYSVRFEKWSPAYHATPKLIPSYGGWTTFQ--------- 180
Query: 358 TTFQQLGKACGGLIKVAEETRSAKNLVKARIKVRYNYSGFLPANVRIFDNEGNKFSIQVV 417
R++ LV+
Sbjct: 181 --------------------RNSATLVE-------------------------------- 240
Query: 418 THPEGKWLIERNVRLHGTFKRQAAAAFDEFNPESEQF----FFEGMEAISPDFLSTSSDG 477
+D+F+ E F+G EAISPDFLSTSS
Sbjct: 241 --------------------------YDDFSTNCESLRRIILFDGSEAISPDFLSTSSRS 300
Query: 478 RKSNTPDQPPALKSVIIKSDRVATSPSFLNEEVVNDSNLHATANKSKSEILPGISNDGVL 537
RKS+TPDQP ALKSVIIK D+ ATSP++LNEEVVNDSNLHATANKS+ EIL GI NDGVL
Sbjct: 301 RKSSTPDQPSALKSVIIKPDKAATSPTYLNEEVVNDSNLHATANKSRLEILSGIPNDGVL 360
Query: 538 DKGKQKVDIQLQPNSALNLDKSKRKVSFNSPCNKTNIFNPDSAPANHSPSLSSPEKKQKV 597
DKGKQKVDIQL PNSALNL+K KRKVSFNSP NKTNIFNPDSAPANHS SLSSPEKKQKV
Sbjct: 361 DKGKQKVDIQLHPNSALNLNKPKRKVSFNSPSNKTNIFNPDSAPANHSLSLSSPEKKQKV 420
Query: 598 SRERN------------------------------------------------------- 657
SRER+
Sbjct: 421 SRERSIKKKSSSIQPIQNKGVLITQPIQVVAHDLEASKKGLSLIVNLGDLPVLDPSKSFE 480
Query: 658 -HHSSDNAEVIDITNTEVVPETPEMKMQVNENSNSSSEANYRKPKHVHKRKYYYRKKEEK 717
HHSS NAEVIDITNTEVVPETPEMKM VNENSNSSSEANYRKPKHVH+R+YYYRKK K
Sbjct: 481 DHHSSHNAEVIDITNTEVVPETPEMKMPVNENSNSSSEANYRKPKHVHRRRYYYRKKRSK 540
Query: 718 EKDPDSKA--------FKKQLASW---------------------------LKENGLKIS 777
+ + K +L +W L E LKI+
Sbjct: 541 GEGSGLRGEWGSGDIYCKMKLLTWNARGLGSPSKRALIKNAIISYSPDFVILTETMLKIT 600
Query: 778 T----------------VTDSSGATTSTNVLINQLNSGLASK--GIGALGTSILQNVEQ- 837
V ++SG++ +L + + L S+ I +L + N
Sbjct: 601 NKRIIKSFWPSNSINWIVKNASGSSGGILILWDAQSHSLLSQEEAIFSLSANFFLNNNSS 660
Query: 838 ---------------------FHHQQ--------------------------SSDRSS-- 897
H+ Q SS SS
Sbjct: 661 WWLTGLYGPDKRRKRIHFWADLHNLQHLNSFPWSLERDLNVIRMREETTSILSSSHSSRM 720
Query: 898 --------------LINNRFTWSNLRNPPTFSRIDRFLYNSTWENLFSPHTTRTLPRSTS 957
L NNRFTWSNLRNP TFSRIDRFLYNS+WENLFSPHTTRTLPR TS
Sbjct: 721 LNNFISNNLLIDPPLTNNRFTWSNLRNPSTFSRIDRFLYNSSWENLFSPHTTRTLPRPTS 780
Query: 958 DHFPLVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSIQDGHPGFSFIQRLKS 1017
DHFPLVCE+SN KL WGP PFRLNSIAL+DPEFKRNM RWWENS+Q+GHPGFSFIQRLKS
Sbjct: 781 DHFPLVCEDSNPKLRWGPAPFRLNSIALNDPEFKRNMERWWENSVQNGHPGFSFIQRLKS 840
Query: 1018 LANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESNRRLALKADLSELSLK 1077
LAN IKPWQKEKLHSL +AK++I+REVDSIDKKELDTPL+Q+ESNRRLALKA+LS+LSLK
Sbjct: 841 LANHIKPWQKEKLHSLNYAKETIIREVDSIDKKELDTPLSQKESNRRLALKAELSDLSLK 900
Query: 1078 ESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGLIQNTNISISTAFIK 1137
ESQF C
Sbjct: 901 ESQF-----------------------C-------------------------------- 960
Query: 1138 FFSKIYRSSTKSDPLFIDNLDWNPIEHSEWSHLCAPFLEDEIKGVINSLDGKKTPGLDGF 1197
IY+SSTKSDPLFI+NLDWNPIE SEW HLCAPFLE+EIKGVINS DGKK P DGF
Sbjct: 961 ----IYKSSTKSDPLFIENLDWNPIEFSEWPHLCAPFLEEEIKGVINSFDGKKAPSPDGF 1020
Query: 1198 PISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIALIPKKKDYSHPKDFRPISLTT 1257
PISFFK+YW+LLKEDI+DIFKDF++KGVINKNMNNTYIALI KKKDYSHPKDFRPISLTT
Sbjct: 1021 PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIGKKKDYSHPKDFRPISLTT 1080
Query: 1258 SIYKIIAKTLSNRLKTTLPDTISGNQLAFVKNRQITDAILMANEAVDFWKVKKIKGFILK 1317
SIYKIIAKTLSNRLKTTLP TISGNQLAF+KNRQITDAILMANEAVD+WKVKKIKGFILK
Sbjct: 1081 SIYKIIAKTLSNRLKTTLPGTISGNQLAFIKNRQITDAILMANEAVDYWKVKKIKGFILK 1140
Query: 1318 LDIEKAFDKLNWDFIDFVLEKKNFPILWRKWIRGCISNVTYSVIVNGRPQGRIKANRGLR 1377
LDIEK F LNWDFID+VL KKNFP WRKWIRGCISNVTYSVI+NGRPQGRIKANRGLR
Sbjct: 1141 LDIEKVFYNLNWDFIDYVLGKKNFPNSWRKWIRGCISNVTYSVIINGRPQGRIKANRGLR 1200
Query: 1378 QGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNCNISHILFADDILLFIEDNDYF 1437
QGDPLSPFLFVIAMDY SRLLSHLE+SGAIKGVSLN+NCNISHILFADDILLF+EDND F
Sbjct: 1201 QGDPLSPFLFVIAMDYFSRLLSHLEASGAIKGVSLNNNCNISHILFADDILLFVEDNDCF 1260
Query: 1438 LNNLRMALSLFERASSLKINLLKSALVPMNVSVNRAKECASIWGIPCHSLPLSYLGVPLG 1497
LNNL MALSLFE+AS LKINLLKSALVP+NVS+NRAKECAS WGI CHSL LSYLGVPLG
Sbjct: 1261 LNNLIMALSLFEKASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLLLSYLGVPLG 1320
Query: 1498 GNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVFQAPSMTCK 1537
G+
Sbjct: 1321 GS---------------------------------------------------------- 1321
BLAST of IVF0006653 vs. TAIR 10
Match:
AT1G43760.1 (DNAse I-like superfamily protein)
HSP 1 Score: 116.7 bits (291), Expect = 1.6e-25
Identity = 101/387 (26.10%), Positives = 175/387 (45.22%), Query Frame = 0
Query: 711 LGTSI-LQNVEQFHH-QQSSDRSSLINN--RFTWSNLRNP-PTFSRIDRFLYNSTWENLF 770
L TSI ++ +E+F + + SD + + +TWSN ++ P ++DR + N W + F
Sbjct: 240 LQTSIPMRGLEEFQNCLRDSDLVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSF 299
Query: 771 SPHTTRTLPRSTSDHFP--LVCENSNTKLSWGPVPFRLNSIALSDPEFKRNMGRWWENSI 830
SDH P ++ EN + FR S + P F ++ WE I
Sbjct: 300 PSAIAVFELSGVSDHSPCIIILENLPKR---SKKCFRYFSFLSTHPTFLVSLTVAWEEQI 359
Query: 831 QDGHPGFSFIQRLKSLANFIKPWQKEKLHSLTHAKDSILREVDSIDKKELDTPLTQEESN 890
G FS + LK+ K ++ ++ H L ++SI + L P
Sbjct: 360 PVGSHMFSLGEHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRV 419
Query: 891 RRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEG 950
+A K + ES F+ Q+++ WL++GD N+ FFH++ + Q ++ I ++ ++
Sbjct: 420 EHVARKKWNFFAAALES-FYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDD 479
Query: 951 LIQNTNISISTAFIKFFSKIYRSSTKSDPLFIDNL----DWNPIEHSEW--SHLCAPFLE 1010
+ + + +++ + S SD L D++ D +P ++ S L A +
Sbjct: 480 VRVENVTQVKEMIVAYYTHLLGSD--SDILTPDSVQRIKDIHPFRCNDTLASRLSALPSD 539
Query: 1011 DEIKGVINSLDGKKTPGLDGFPISFFKTYWYLLKEDILDIFKDFYDKGVINKNMNNTYIA 1070
EI + ++ K PG D F FF W+++K+ + K+F+ G + K N T I
Sbjct: 540 KEITAAVFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAIT 599
Query: 1071 LIPKKKDYSHPKDFRPISLTTSIYKII 1085
LIPK FRP+S T +YKII
Sbjct: 600 LIPKVTGVDQLSMFRPVSCCTVVYKII 620
BLAST of IVF0006653 vs. TAIR 10
Match:
AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein)
HSP 1 Score: 85.1 bits (209), Expect = 5.3e-16
Identity = 56/212 (26.42%), Positives = 86/212 (40.57%), Query Frame = 0
Query: 1307 SLPLSYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQ 1366
+LP+ YLG+PL + + + +KI+ ++ W +S GRL LI S + SL +
Sbjct: 22 ALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFW 81
Query: 1367 LSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRLHVTNK- 1426
+S F+ PS K I+ FLW G + + W+ V K+EGGLG L NK
Sbjct: 82 MSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEANKG 141
Query: 1427 --------ALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKAPWRSIID 1486
L W+W+ + + AL ++
Sbjct: 142 SFWSISGNTTLGSWMWKKILKHRALASGFVK----------------------------- 194
Query: 1487 NIDWFKSNQSWELNNGDQISFWYSNWSLEGRL 1510
+++NG SFW+ NWS GRL
Sbjct: 202 ----------HDIHNGSNTSFWFDNWSKIGRL 194
BLAST of IVF0006653 vs. TAIR 10
Match:
ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase))
HSP 1 Score: 72.0 bits (175), Expect = 4.7e-12
Identity = 32/67 (47.76%), Positives = 47/67 (70.15%), Query Frame = 0
Query: 1182 IVNGRPQGRIKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNSNC-NIS 1241
I+NG PQG + +RGLRQGDPLSP+LF++ + LS L + G + G+ +++N I+
Sbjct: 13 IINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRIN 72
Query: 1242 HILFADD 1248
H+LFADD
Sbjct: 73 HLLFADD 79
BLAST of IVF0006653 vs. TAIR 10
Match:
AT4G29090.1 (Ribonuclease H-like superfamily protein)
HSP 1 Score: 70.9 bits (172), Expect = 1.0e-11
Identity = 47/162 (29.01%), Positives = 73/162 (45.06%), Query Frame = 0
Query: 1361 SLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKEEGGLGTSRL 1420
+LP Y ++ F P CK I F W+ + G H W +S K EGG+G +
Sbjct: 2 ALPTYTMACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDI 61
Query: 1421 HVTNKALLTKWLWRYLSEPNALWRRLIQCKY-----KGNYP-GDIPSNISSITSKAPWRS 1480
N ALL K +WR LS P +L ++ + +Y N P G PS + W+S
Sbjct: 62 EAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFV--------WKS 121
Query: 1481 IIDNIDWFKSNQSWELNNGDQISFWYSNWSLEGRLSTAYPRL 1517
I + + + + NG+ I W W L+ + ++A R+
Sbjct: 122 IHASQEILRQGARAVVGNGEDIIIWRHKW-LDSKPASAALRM 154
BLAST of IVF0006653 vs. TAIR 10
Match:
ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein)
HSP 1 Score: 67.0 bits (162), Expect = 1.5e-10
Identity = 42/148 (28.38%), Positives = 68/148 (45.95%), Query Frame = 0
Query: 1361 SLPIYQLSVFQAPSMTCKNIEKSWRKFLWKGNNGSVGSHLINWTKVSKSKE-EGGLGTSR 1420
+LP+Y +S F+ + CK + + +F W + W K+ KSKE +GGLG
Sbjct: 2 ALPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRD 61
Query: 1421 LHVTNKALLTKWLWRYLSEPNALWRRLIQCKYKGNYPGDIPSNISSITSKA-PWRSIIDN 1480
L N+ALL K +R + +P+ L RL++ +Y +P S T + WRSII
Sbjct: 62 LGWFNQALLAKQSFRIIHQPHTLLSRLLRSRY---FPHSSMMECSVGTRPSYAWRSIIHG 121
Query: 1481 IDWFKSNQSWELNNGDQISFWYSNWSLE 1507
+ + +G W W ++
Sbjct: 122 RELLSRGLLRTIGDGIHTKVWLDRWIMD 146
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
P11369 | 2.7e-49 | 24.47 | LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE...
O00370 | 1.4e-48 | 25.86 | LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1
P08548 | 3.7e-46 | 25.75 | LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1
P14381 | 2.2e-35 | 23.78 | Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV...
P0C2F6 | 4.8e-22 | 32.95 | Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1...

Match Name | E-value | Identity (%) | Description
A0A5D3BLV7 | 0.0e+00 | 78.30 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3BL61 | 0.0e+00 | 69.66 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5A7TDG1 | 0.0e+00 | 71.88 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3C3M3 | 0.0e+00 | 64.10 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5A7UV84 | 0.0e+00 | 61.92 | Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1...

Match Name | E-value | Identity (%) | Description
TYJ99315.1 | 0.0 | 78.30 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYK00493.1 | 0.0 | 69.66 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
KAA0039309.1 | 0.0 | 71.88 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYK05808.1 | 0.0 | 64.22 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
KAA0058980.1 | 0.0 | 61.92 | uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]

Match Name | E-value | Identity (%) | Description
AT1G43760.1 | 1.6e-25 | 26.10 | DNAse I-like superfamily protein
AT3G24255.1 | 5.3e-16 | 26.42 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein
ATMG01250.1 | 4.7e-12 | 47.76 | RNA-directed DNA polymerase (reverse transcriptase)
AT4G29090.1 | 1.0e-11 | 29.01 | Ribonuclease H-like superfamily protein
ATMG00310.1 | 1.5e-10 | 28.38 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein