Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGGTAAACTATTACTCTTCTGACTCTGATGCAACAAAAAGAACATATGCAAGTGTGGTATCAGGCAGCATATCCTCCAACTCAAGTGAATCAGCATTCTCTTCTGACTTAGACAAAGCCATGCAAAAGAAAAAAAGGATTGAAAAAAGCTTTGAGTTTTCTGGGAAAGACTCAAAGATGCGCTTAGAGAATTCTGTTGTCATCACAAGGAGATGTTTCCATGATGACTAGCATAAAATTAAGCAGAATCTCAAAAAACAGATTGAAGTCAACTTCTCTGTTAAACCATTCCACCCAGAAAAAGCCATACTAAACCTTCAGGATACAGAGCAAGCAAATCTCCTATGTAGCAACAATGGAAGCAAAGGATGGTCCACAGTGGGAAATTTTTTTGTTAAGTTCGAAAAATGGTCATCAACAAACCATACGTCTCAGAAACTCTTCCTTAGCTATGGTGGATGGAATTCCTTTAGAGGAGTTCCCCTTCATCTTTGGAATTACAATACTTTTAAGCAGATTGGAAATGCATGTGGGGGTTTTGTCGCTGTTGCTAAAGATACAATGGAGAAAAAGGACCTGATGGAAGCAAAAATCAAAGTCAAGTATAACTATACAGGATTCATTCCAGCAAACATTGGAATAACTGACGATAATGGAGAACTTTTTGTGGTCCAAACTGTTTGCAAGACGGAGAGTAAGTGGCTCAAAGAGAGAAATGTTGACATGCATGGAACTTTCAAAAGGCAAGCGACTGCTAGCTTCAATGAATACAATTCAGAATCGGAAATGTATCACTTCACCGGAAATGTTGCAGTTACACCGGATTTTTTTGAACTTTCAAAATCCGAAGCATTAAATTTGGAAGTAACTACACCTGAAGCAAATCTCACAAATCTCACAAAAACCAAGACTCCAAATCCTAGCCATTATGGAAAGACTGACAAAACCACAAAAAAAACAGAAAAACGGCTGGACAAAGGGAAACAGTTAATGGTGGATGATGATGATACTGACAGCCAGCAAAACAAATTCAGCTCGAAAAGAAAGGTATCATTTACCTCACCAAAAAACAAAACTTTTTTTTTCAATCCTGAAAATGCTCCAGCTAAGCACCTCTGCATTAAAAGTTCGTTGGAAAAAAATAGCAGTGGGCCCTCTGAAGATTCTTCAATGCAAAAAAGAAAATTTGTGAAAAAATACTACAGAATCAAAAACAAATTTAACCAAGCTCCTGACACATGTAAAGAAAAGGCAGCAAAGAAAGAAGGCTATCATCTAACAGTGGACTTAGGACAGCTGTCTCCTTTAAAGAAGGATCAAAATCAACAAATTGTTAACAGTCAGGAAGAAACAAAAGGAGATTCAAGTCCTGAAGAAACAAATACTTTGCAAGCACAAGAATCACAAGCTCAAACAGAATTGATCAAGGACAGCTCTGACCCTATGATGACAAGTGATGAAGATGAATGCAGGATCACAAGAGTAAAAGGAAAATGTGAGGAAGAGGAGGAAAACTTCAAAAAACAGTTGATAATCTGGCTTAAAGAAAATAACTTAAAGTTAGCTCCATCTCTAAACCAAAACTTGCCAAGCTCATCAAACAGTGAACAAGTGAGGATCAGTCGAGGAGAAGAGCAAAGATCTTCAGGAAGTAATGTTGTTAAATGAAAATTTTATGTTGGAATATTAGAGGCATAGGCTCAGTCTCAAAAAGAGCCAGAATAAAAAATCAGTTATTTCCTCTTACTCGCCGGATTTTGTTATTTTATCTGAAACGAAACGCATTTCAACAAACAAAAAAGTAATCAATTCTCTATGGTCTCTCAAAAGCATAAAGTGGTTAAATGTCAATTCAAGAGGAAGAACTGGAGGCATTCTAATTATGTGGAACGACCAAAGACATAGGCTGTTAAATAGTTTTGAAGGAGATTTCACAATTTCTGCAAATATACAAGACTCCTTAGGCAACACTTGGTGGATCACAGGCTTATATGGTCATGCTAAAAGAAAACAGAGAAACAAATTATGGATTGAACTTCAAAATCTTCACAATCTTTGTTCTACAAACTGGCTGATTGGAGGAGACTTTAATGTGGTGAGATGGAACAATGAGACTACAACACTGAACCCAGGGAAACACAGTATGCAAGAACTAAATGGTTTCATTCAGAAAATAATTTGATTGATCCTCCTCTCACAAACAACAGATTCACTTGGTCAAATCTTAGAAGTCAACCAACTTGTTCTAGGCTGGATAGATTCCTTTATACCAGCTCTTGGGAATTATGCTTTAAAGAACATTATACAAGGACTCTATCAAGATCCACTTCAGATCACTTTCCTCTTGTCCTGGAAGCTTCAAACATTCGTTGGGGCCCATCACCCTTCAGGCTAAACAATAACTCCCTCTCTGATCCAGATTTCAATAGAAATATAAGGGGTTGGTGGGAAGGATCAAAACATCTTGGTCATCCTGGTTTTGCTTTTGTACAAAGACTAAAGACTCTCTCCAGAACTTTAAAGAATTGGCAGTTCAGTCTTTTAAATAAACAGGAGGATGAAAAAAAGAGAATTATTCAGCAGATTGACAACATTGACAAGCTAGAAAAGCAGAATCTTTTAACTTTGGAGGACAGCAACAGAAGAACGGCTTTCAAATCTAAGCTTTGTTCTATTGATTTCAAACAAGCTCAATGTTGGGCCCAACGAACAAAGAAGCAATGGCTCAACGAAGGGGATGAAAATACAACTTATTTTCATAAGGTGTGCTCAGCAAATCAAAGAACAAATTGTATATCAGAGATTCAAGATGAAAATGGTTTGACTCATTGTTCAAGTGATTCCATTGCTGGAGTCTTAACCAATCATTTCAGCCATATTTACTCTGAAGACAAGAGAGGCACAATGTTGATTGAAAACCTGAACTGGAAACCAATTGACTCATTGCATCACAATGAGTTATGTGCTCCCTTTGATGAAATTGAAGTATTACAAGCCATCAACTCTATTAGTGACAAAAAGGCCCCTGGACCTGATGGTTACACAGTGAAATTCTACAAAAAATATTGGCCCATGGTCAAAGAGGAGGTGATGCAAATTTTCAACGACTTTCATAAAAAAGGCATCATCAACAAATGA
mRNA sequence
ATGGAGCATAAAATTAAGCAGAATCTCAAAAAACAGATTGAAGTCAACTTCTCTGTTAAACCATTCCACCCAGAAAAAGCCATACTAAACCTTCAGGATACAGAGCAAGCAAATCTCCTATGTAGCAACAATGGAAGCAAAGGATGGTCCACAGTGGGAAATTTTTTTGTTAAGTTCGAAAAATGGTCATCAACAAACCATACGTCTCAGAAACTCTTCCTTAGCTATGGTGGATGGAATTCCTTTAGAGGAGTTCCCCTTCATCTTTGGAATTACAATACTTTTAAGCAGATTGGAAATGCATGTGGGGGTTTTGTCGCTGTTGCTAAAGATACAATGGAGAAAAAGGACCTGATGGAAGCAAAAATCAAAGTCAAGTATAACTATACAGGATTCATTCCAGCAAACATTGGAATAACTGACGATAATGGAGAACTTTTTGTGGTCCAAACTGTTTGCAAGACGGAGAGTAAGTGGCTCAAAGAGAGAAATGTTGACATGCATGGAACTTTCAAAAGGCAAGCGACTGCTAGCTTCAATGAATACAATTCAGAATCGGAAATGTATCACTTCACCGGAAATGTTGCAGTTACACCGGATTTTTTTGAACTTTCAAAATCCGAAGCATTAAATTTGGAAGTAACTACACCTGAAGCAAATCTCACAAATCTCACAAAAACCAAGACTCCAAATCCTAGCCATTATGGAAAGACTGACAAAACCACAAAAAAAACAGAAAAACGGCTGGACAAAGGGAAACAGTTAATGGTGGATGATGATGATACTGACAGCCAGCAAAACAAATTCAGCTCGAAAAGAAAGGTATCATTTACCTCACCAAAAAACAAAACTTTTTTTTTCAATCCTGAAAATGCTCCAGCTAAGCACCTCTGCATTAAAAGTTCGTTGGAAAAAAATAGCAGTGGGCCCTCTGAAGATTCTTCAATGCAAAAAAGAAAATTTGTGAAAAAATACTACAGAATCAAAAACAAATTTAACCAAGCTCCTGACACATGTAAAGAAAAGGCAGCAAAGAAAGAAGGCTATCATCTAACAGTGGACTTAGGACAGCTGTCTCCTTTAAAGAAGGATCAAAATCAACAAATTGTTAACAGTCAGGAAGAAACAAAAGGAGATTCAAGTCCTGAAGAAACAAATACTTTGCAAGCACAAGAATCACAAGCTCAAACAGAATTGATCAAGGACAGCTCTGACCCTATGATGACAAGTGATGAAGATGAATGCAGGATCACAAGAGTAAAAGGAAAATGTGAGGAAGAGGAGGAAAACTTCAAAAAACAGTTGATAATCTGGCTTAAAGAAAATAACTTAAAGTTAGCTCCATCTCTAAACCAAAACTTGCCAAGCTCATCAAACAGTGAACAAGTGAGGATCATTATTTCCTCTTACTCGCCGGATTTTGTTATTTTATCTGAAACGAAACGCATTTCAACAAACAAAAAAGTAATCAATTCTCTATGGTCTCTCAAAAGCATAAAGTGGTTAAATGTCAATTCAAGAGGAAGAACTGGAGGCATTCTAATTATGTGGAACGACCAAAGACATAGGCTGTTAAATAGTTTTGAAGGAGATTTCACAATTTCTGCAAATATACAAGACTCCTTAGGCAACACTTGGTGGATCACAGGCTTATATGGTCATGCTAAAAGAAAACAGAGAAACAAATTATGGATTGAACTTCAAAATCTTCACAATCTTTGTTCTACAAACTGGCTGATTGGAGGAGACTTTAATGTGGTGAGATGGAACAATGAGACTACAACACTGAACCCAGGGAAACACAAAAATAATTTGATTGATCCTCCTCTCACAAACAACAGATTCACTTGGTCAAATCTTAGAAGTCAACCAACTTGTTCTAGGCTGGATAGATTCCTTTATACCAGCTCTTGGGAATTATGCTTTAAAGAACATTATACAAGGACTCTATCAAGATCCACTTCAGATCACTTTCCTCTTGTCCTGGAAGCTTCAAACATTCGTTGGGGCCCATCACCCTTCAGGCTAAACAATAACTCCCTCTCTGATCCAGATTTCAATAGAAATATAAGGGGTTGGTGGGAAGGATCAAAACATCTTGGTCATCCTGGTTTTGCTTTTGTACAAAGACTAAAGACTCTCTCCAGAACTTTAAAGAATTGGCAGTTCAGTCTTTTAAATAAACAGGAGGATGAAAAAAAGAGAATTATTCAGCAGATTGACAACATTGACAAGCTAGAAAAGCAGAATCTTTTAACTTTGGAGGACAGCAACAGAAGAACGGCTTTCAAATCTAAGCTTTGTTCTATTGATTTCAAACAAGCTCAATGTTGGGCCCAACGAACAAAGAAGCAATGGCTCAACGAAGGGGATGAAAATACAACTTATTTTCATAAGGTGTGCTCAGCAAATCAAAGAACAAATTGTATATCAGAGATTCAAGATGAAAATGGTTTGACTCATTGTTCAAGTGATTCCATTGCTGGAGTCTTAACCAATCATTTCAGCCATATTTACTCTGAAGACAAGAGAGGCACAATGTTGATTGAAAACCTGAACTGGAAACCAATTGACTCATTGCATCACAATGAGTTATGTGCTCCCTTTGATGAAATTGAAGTATTACAAGCCATCAACTCTATTAGTGACAAAAAGGCCCCTGGACCTGATGGTTACACAGTGAAATTCTACAAAAAATATTGGCCCATGGTCAAAGAGGAGGTGATGCAAATTTTCAACGACTTTCATAAAAAAGGCATCATCAACAAATGA
Coding sequence (CDS)
ATGGAGCATAAAATTAAGCAGAATCTCAAAAAACAGATTGAAGTCAACTTCTCTGTTAAACCATTCCACCCAGAAAAAGCCATACTAAACCTTCAGGATACAGAGCAAGCAAATCTCCTATGTAGCAACAATGGAAGCAAAGGATGGTCCACAGTGGGAAATTTTTTTGTTAAGTTCGAAAAATGGTCATCAACAAACCATACGTCTCAGAAACTCTTCCTTAGCTATGGTGGATGGAATTCCTTTAGAGGAGTTCCCCTTCATCTTTGGAATTACAATACTTTTAAGCAGATTGGAAATGCATGTGGGGGTTTTGTCGCTGTTGCTAAAGATACAATGGAGAAAAAGGACCTGATGGAAGCAAAAATCAAAGTCAAGTATAACTATACAGGATTCATTCCAGCAAACATTGGAATAACTGACGATAATGGAGAACTTTTTGTGGTCCAAACTGTTTGCAAGACGGAGAGTAAGTGGCTCAAAGAGAGAAATGTTGACATGCATGGAACTTTCAAAAGGCAAGCGACTGCTAGCTTCAATGAATACAATTCAGAATCGGAAATGTATCACTTCACCGGAAATGTTGCAGTTACACCGGATTTTTTTGAACTTTCAAAATCCGAAGCATTAAATTTGGAAGTAACTACACCTGAAGCAAATCTCACAAATCTCACAAAAACCAAGACTCCAAATCCTAGCCATTATGGAAAGACTGACAAAACCACAAAAAAAACAGAAAAACGGCTGGACAAAGGGAAACAGTTAATGGTGGATGATGATGATACTGACAGCCAGCAAAACAAATTCAGCTCGAAAAGAAAGGTATCATTTACCTCACCAAAAAACAAAACTTTTTTTTTCAATCCTGAAAATGCTCCAGCTAAGCACCTCTGCATTAAAAGTTCGTTGGAAAAAAATAGCAGTGGGCCCTCTGAAGATTCTTCAATGCAAAAAAGAAAATTTGTGAAAAAATACTACAGAATCAAAAACAAATTTAACCAAGCTCCTGACACATGTAAAGAAAAGGCAGCAAAGAAAGAAGGCTATCATCTAACAGTGGACTTAGGACAGCTGTCTCCTTTAAAGAAGGATCAAAATCAACAAATTGTTAACAGTCAGGAAGAAACAAAAGGAGATTCAAGTCCTGAAGAAACAAATACTTTGCAAGCACAAGAATCACAAGCTCAAACAGAATTGATCAAGGACAGCTCTGACCCTATGATGACAAGTGATGAAGATGAATGCAGGATCACAAGAGTAAAAGGAAAATGTGAGGAAGAGGAGGAAAACTTCAAAAAACAGTTGATAATCTGGCTTAAAGAAAATAACTTAAAGTTAGCTCCATCTCTAAACCAAAACTTGCCAAGCTCATCAAACAGTGAACAAGTGAGGATCATTATTTCCTCTTACTCGCCGGATTTTGTTATTTTATCTGAAACGAAACGCATTTCAACAAACAAAAAAGTAATCAATTCTCTATGGTCTCTCAAAAGCATAAAGTGGTTAAATGTCAATTCAAGAGGAAGAACTGGAGGCATTCTAATTATGTGGAACGACCAAAGACATAGGCTGTTAAATAGTTTTGAAGGAGATTTCACAATTTCTGCAAATATACAAGACTCCTTAGGCAACACTTGGTGGATCACAGGCTTATATGGTCATGCTAAAAGAAAACAGAGAAACAAATTATGGATTGAACTTCAAAATCTTCACAATCTTTGTTCTACAAACTGGCTGATTGGAGGAGACTTTAATGTGGTGAGATGGAACAATGAGACTACAACACTGAACCCAGGGAAACACAAAAATAATTTGATTGATCCTCCTCTCACAAACAACAGATTCACTTGGTCAAATCTTAGAAGTCAACCAACTTGTTCTAGGCTGGATAGATTCCTTTATACCAGCTCTTGGGAATTATGCTTTAAAGAACATTATACAAGGACTCTATCAAGATCCACTTCAGATCACTTTCCTCTTGTCCTGGAAGCTTCAAACATTCGTTGGGGCCCATCACCCTTCAGGCTAAACAATAACTCCCTCTCTGATCCAGATTTCAATAGAAATATAAGGGGTTGGTGGGAAGGATCAAAACATCTTGGTCATCCTGGTTTTGCTTTTGTACAAAGACTAAAGACTCTCTCCAGAACTTTAAAGAATTGGCAGTTCAGTCTTTTAAATAAACAGGAGGATGAAAAAAAGAGAATTATTCAGCAGATTGACAACATTGACAAGCTAGAAAAGCAGAATCTTTTAACTTTGGAGGACAGCAACAGAAGAACGGCTTTCAAATCTAAGCTTTGTTCTATTGATTTCAAACAAGCTCAATGTTGGGCCCAACGAACAAAGAAGCAATGGCTCAACGAAGGGGATGAAAATACAACTTATTTTCATAAGGTGTGCTCAGCAAATCAAAGAACAAATTGTATATCAGAGATTCAAGATGAAAATGGTTTGACTCATTGTTCAAGTGATTCCATTGCTGGAGTCTTAACCAATCATTTCAGCCATATTTACTCTGAAGACAAGAGAGGCACAATGTTGATTGAAAACCTGAACTGGAAACCAATTGACTCATTGCATCACAATGAGTTATGTGCTCCCTTTGATGAAATTGAAGTATTACAAGCCATCAACTCTATTAGTGACAAAAAGGCCCCTGGACCTGATGGTTACACAGTGAAATTCTACAAAAAATATTGGCCCATGGTCAAAGAGGAGGTGATGCAAATTTTCAACGACTTTCATAAAAAAGGCATCATCAACAAATGA
Protein sequence
MEHKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKTFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPDTCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEEEENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK
Homology
BLAST of PI0011905 vs. ExPASy Swiss-Prot
Match:
O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)
HSP 1 Score: 77.4 bits (189), Expect = 9.4e-13
Identity = 90/388 (23.20%), Postives = 169/388 (43.56%), Query Frame = 0
Query: 567 LQNLHNLCSTNWLIGGDFNV----------VRWNNETTTLNPGKHKNNLIDPPLT-NNRF 626
L +L ++ LI GDFN + N +T LN H+ +LID T + +
Sbjct: 129 LSDLQRDLDSHTLIMGDFNTPLSILDRSTRQKVNKDTQELNSALHQTDLIDIYRTLHPKS 188
Query: 627 TWSNLRSQP--TCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLE--ASNIRWGP 686
T S P T S++D + S L K T ++ SDH + LE N+
Sbjct: 189 TEYTFFSAPHHTYSKIDHIV--GSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSR 248
Query: 687 S-PFRLNNNSLSD----PDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLL 746
S ++LNN L+D + I+ ++E +++ Q L + + +F L
Sbjct: 249 STTWKLNNLLLNDYWVHNEMKAEIKMFFETNENKD----TTYQNLWDAFKAVCRGKFIAL 308
Query: 747 N--KQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRR--TAFKSKLCSIDFKQAQCWAQRT 806
N K++ E+ +I + +LEKQ + S R+ T +++L I+ ++ +
Sbjct: 309 NAYKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINES 368
Query: 807 KKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSED 866
+ + ++ ++ + N I I+++ G I + ++ H+Y+
Sbjct: 369 RSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANK 428
Query: 867 KRG---------TMLIENLNWKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYT 922
T + LN + ++SL+ P E++ INS+ KK+PGPDG+T
Sbjct: 429 LENLEEMDTFLDTYTLPRLNQEEVESLNR-----PITGSEIVAIINSLPTKKSPGPDGFT 488
BLAST of PI0011905 vs. ExPASy Swiss-Prot
Match:
P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)
HSP 1 Score: 53.5 bits (127), Expect = 1.4e-05
Identity = 56/282 (19.86%), Postives = 120/282 (42.55%), Query Frame = 0
Query: 655 SDHFPLVLEAS---NIRWGPSPFRLNNNSLSD----PDFNRNIRGWWEGSKHLGHPGFAF 714
SDH + +E + N+ ++LNN L D + + I + E + +
Sbjct: 227 SDHHGIKVELNNNRNLHTHTKTWKLNNLMLKDTWVIDEIKKEITKFLEQNNNQD----TN 286
Query: 715 VQRLKTLSRTLKNWQFSLLNK--QEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRR--TAF 774
Q L ++ + +F L ++ E++ + + ++ +LEK+ + S R+ T
Sbjct: 287 YQNLWDTAKAVLRGKFIALQAFLKKTEREEVNNLMGHLKQLEKEEHSNPKPSRRKEITKI 346
Query: 775 KSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCS 834
+++L I+ K+ ++K + + ++ + + + IS I++ N
Sbjct: 347 RAELNEIENKRIIQQINKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTD 406
Query: 835 SDSIAGVLTNH----FSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN 894
I +L + +SH Y K +E + + L P E+ I
Sbjct: 407 PSEIQKILNEYYKKLYSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQ 466
Query: 895 SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGII 922
++ KK+PGPDG+T +FY+ + + ++ +F + K+GI+
Sbjct: 467 NLPKKKSPGPDGFTSEFYQTFKEELVPILLNLFQNIEKEGIL 504
BLAST of PI0011905 vs. ExPASy TrEMBL
Match:
A0A5D3BLV7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005290 PE=4 SV=1)
HSP 1 Score: 718.0 bits (1852), Expect = 5.0e-203
Identity = 416/945 (44.02%), Postives = 559/945 (59.15%), Query Frame = 0
Query: 3 HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKW 62
HKI QNL+KQ E +F+ FH EKA+++ ANLLC N KGWSTVG + V+FEKW
Sbjct: 272 HKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQN---KGWSTVGKYSVRFEKW 331
Query: 63 SSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAK 122
S H + KL SYGGW +FRG+PLHLWN TF+QIG AC G + VA++T K+L+EA+
Sbjct: 332 SPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACEGLIKVAEETRSAKNLIEAR 391
Query: 123 IKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEY 182
IKV+YNY+GF+PAN+ I D+ G F VQ V E KWL ERNV +HGTFKRQA ASF+++
Sbjct: 392 IKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHPEGKWLIERNVRLHGTFKRQAAASFDDF 451
Query: 183 NSESEMYHFTGNVAVTPDFFELS----------KSEALNLEVTTPEANLT--NLTKTKTP 242
N ESE + F G+ A++PDF S + AL + P+ N T + +
Sbjct: 452 NPESEQFFFEGSEAISPDFLSTSSDGRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELV 511
Query: 243 NPSHYGKTDKTTK-------KTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNK 302
N S+ T +K + LDKGKQ + +S N SKRKVSF SP NK
Sbjct: 512 NDSNLHATANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNK 571
Query: 303 TFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVK---KYYRIKNKFNQAP--DT 362
T FNP++APA H +S EK E S +K + K + K F P
Sbjct: 572 TNIFNPDSAPANHSPSLNSPEKKQKVSRERSIKKKSSSTQPNSKANQNKGVFITQPIQIV 631
Query: 363 CKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQAQTE 422
++ A K+G LTVDLG L L D N+ + + + + TNT E+
Sbjct: 632 AHDRDAAKKGLSLTVDLGDLPAL--DPNKSLEDHHNSDNAE-VVDITNTEVVPETPEMKM 691
Query: 423 LIKDSSDPMMTSDEDECRITRVK----GKCEEEE-----ENFKKQLIIWLKENNLKLAPS 482
+ ++S+ ++ + + + K EE+E E FKKQL+ WLK+N LKL+
Sbjct: 692 PVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKKNGLKLSTD 751
Query: 483 LNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGR 542
+ SS + ++++ + I TNK++I SLW SI W+ N+ G
Sbjct: 752 TD----SSGATTSTNVLLNQMNSGLKI--------TNKRIIKSLWPSNSINWIAKNASGS 811
Query: 543 TGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQN 602
+GGILI+W+ Q H LL+ EG F++SAN + ++WW+TGLYG KR++R W EL N
Sbjct: 812 SGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHN 871
Query: 603 LHNLCSTNWLIGGDFNVVRWNNETTTLNPGKH----------KNNLIDPPLTNNRFTWSN 662
L +L S W++GGD NV+R E+T++ H N LIDPPLTNNRFTWSN
Sbjct: 872 LQHLNSFPWILGGDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSN 931
Query: 663 LRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASN--IRWGPSPFRLN 722
LR+ PT SR+DRFLY SSWE F H TRTL RSTSDHFPLV E SN + WGP PFRLN
Sbjct: 932 LRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLN 991
Query: 723 NNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRII 782
+ +LSDP+F RN+ WWE S G+PGF+F+QRLK+L+ +K WQ L+ K+ II
Sbjct: 992 SITLSDPEFKRNMGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAII 1051
Query: 783 QQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYF 842
+++D+IDK E LT E+SNRR A K+ L + K++Q W QR KK WL EGDEN+++F
Sbjct: 1052 REVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFF 1111
Query: 843 HKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKP 902
H++CS+ Q+ + I EIQDE G +++SI+ FS IY S K + IENL+W P
Sbjct: 1112 HRICSSRQKRSFIHEIQDEEGSIQNTNNSISTAFIKFFSRIYRSSTKSDPLFIENLDWNP 1171
BLAST of PI0011905 vs. ExPASy TrEMBL
Match:
A0A5A7TDG1 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G001050 PE=4 SV=1)
HSP 1 Score: 704.1 bits (1816), Expect = 7.5e-199
Identity = 410/966 (42.44%), Postives = 554/966 (57.35%), Query Frame = 0
Query: 4 KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWS 63
KI QNL+KQ E +F+ FH EK +++ ANLLC N KGW+TVG + V+FEKW+
Sbjct: 219 KILQNLRKQTEESFTYNAFHAEKVLVHFNSNVPANLLCQN---KGWTTVGKYTVRFEKWA 278
Query: 64 STNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAKI 123
+H S KL SYGGW +FRG+PLHLWN TF+QIG ACGG + VA++T ++L+EAK+
Sbjct: 279 PASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKL 338
Query: 124 KVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYN 183
K++YNY+GF+PA + I D G FVVQ V +E KWL ERNV +HGTFKRQA ASF+++N
Sbjct: 339 KIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFN 398
Query: 184 SESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEA-------------NLTNLTKTKTP 243
+SE + F G A++PD ++ P A + T L +
Sbjct: 399 PDSEQFLFDGLEAISPDLLNTISGSRKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVN 458
Query: 244 NPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT 303
+ S + +K+ K + LDKGKQ + S + KRKVSF SP NKT
Sbjct: 459 DNSLHATANKSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKT 518
Query: 304 FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA 363
FFNP++APA H S EK E S +K ++ R + K N + A
Sbjct: 519 TFFNPDSAPANH-----SPEKKKRVSRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVA 578
Query: 364 ----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQ--AQT 423
A K+G LTVDLG L L ++ + +S + + + TNT E+ T
Sbjct: 579 HDLDASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAE---VIDITNTEVVPETPELKMT 638
Query: 424 ELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSL 483
+ K +S P + + R K E++E E FK QL+ WLKEN LKL+
Sbjct: 639 DPEKSNSSPEVNYRKQKHSHRRRHYYRKKEDKEKDTNSEAFKNQLVTWLKENGLKLSIDT 698
Query: 484 NQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRT 543
+ + ++S N+L+S
Sbjct: 699 DSSGATTST-------------------------------NALFS---------QLGSSA 758
Query: 544 GGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL 603
GGILI+W+ Q H LL+ EG F++SAN S N+WW+TGLYG KR++R +W +L NL
Sbjct: 759 GGILILWDAQHHSLLSQEEGKFSLSANF-SSFNNSWWLTGLYGPVKRRERLNVWEDLHNL 818
Query: 604 HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNL 663
H+L S+ W+IGGD NVVR E+T + H +N LIDPPLTNNR+TWSNL
Sbjct: 819 HHLNSSPWIIGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNL 878
Query: 664 RSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLE--ASNIRWGPSPFRLNN 723
R+ PT SRLDRFLY S WE+ F H TRTL R TSDHFPLV E S +RWGP+PFRLN+
Sbjct: 879 RNPPTFSRLDRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNS 938
Query: 724 NSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQ 783
+L+DP+F RN+ WWE S GHPGF F+QRLK+L+ +K WQ K+ II+
Sbjct: 939 IALNDPEFKRNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIR 998
Query: 784 QIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFH 843
++D+IDK E L+LE+SNRR A K++L + K++Q W QR KK WL EGDEN+ +FH
Sbjct: 999 EVDSIDKNELDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFH 1058
Query: 844 KVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI 903
++CS+ Q+ N I EIQDE G ++++I+ NHFS IY K+ + IENL W PI
Sbjct: 1059 RICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPI 1118
Query: 904 DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHK 924
D + LCAPF E E+ I S KAPGPDG+ + F+K YW ++KE+++ IF DF +
Sbjct: 1119 DYSDWSLLCAPFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFE 1132
BLAST of PI0011905 vs. ExPASy TrEMBL
Match:
A0A5D3BL61 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G001020 PE=4 SV=1)
HSP 1 Score: 704.1 bits (1816), Expect = 7.5e-199
Identity = 410/966 (42.44%), Postives = 554/966 (57.35%), Query Frame = 0
Query: 4 KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWS 63
KI QNL+KQ E +F+ FH EK +++ ANLLC N KGW+TVG + V+FEKW+
Sbjct: 219 KILQNLRKQTEESFTYNAFHAEKVLVHFNSNVPANLLCQN---KGWTTVGKYTVRFEKWA 278
Query: 64 STNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAKI 123
+H S KL SYGGW +FRG+PLHLWN TF+QIG ACGG + VA++T ++L+EAK+
Sbjct: 279 PASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKL 338
Query: 124 KVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYN 183
K++YNY+GF+PA + I D G FVVQ V +E KWL ERNV +HGTFKRQA ASF+++N
Sbjct: 339 KIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFN 398
Query: 184 SESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEA-------------NLTNLTKTKTP 243
+SE + F G A++PD ++ P A + T L +
Sbjct: 399 PDSEQFLFDGLEAISPDLLNTISGSRKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVN 458
Query: 244 NPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT 303
+ S + +K+ K + LDKGKQ + S + KRKVSF SP NKT
Sbjct: 459 DNSLHATANKSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKT 518
Query: 304 FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA 363
FFNP++APA H S EK E S +K ++ R + K N + A
Sbjct: 519 TFFNPDSAPANH-----SPEKKKRVSRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVA 578
Query: 364 ----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQ--AQT 423
A K+G LTVDLG L L ++ + +S + + + TNT E+ T
Sbjct: 579 HDLDASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAE---VIDITNTEVVPETPELKMT 638
Query: 424 ELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSL 483
+ K +S P + + R K E++E E FK QL+ WLKEN LKL+
Sbjct: 639 DPEKSNSSPEVNYRKQKHSHRRRHYYRKKEDKEKDTNSEAFKNQLVTWLKENGLKLSIDT 698
Query: 484 NQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRT 543
+ + ++S N+L+S
Sbjct: 699 DSSGATTST-------------------------------NALFS---------QLGSSA 758
Query: 544 GGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL 603
GGILI+W+ Q H LL+ EG F++SAN S N+WW+TGLYG KR++R +W +L NL
Sbjct: 759 GGILILWDAQHHSLLSQEEGKFSLSANF-SSFNNSWWLTGLYGPVKRRERLNVWEDLHNL 818
Query: 604 HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNL 663
H+L S+ W+IGGD NVVR E+T + H +N LIDPPLTNNR+TWSNL
Sbjct: 819 HHLNSSPWIIGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNL 878
Query: 664 RSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLE--ASNIRWGPSPFRLNN 723
R+ PT SRLDRFLY S WE+ F H TRTL R TSDHFPLV E S +RWGP+PFRLN+
Sbjct: 879 RNPPTFSRLDRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNS 938
Query: 724 NSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQ 783
+L+DP+F RN+ WWE S GHPGF F+QRLK+L+ +K WQ K+ II+
Sbjct: 939 IALNDPEFKRNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIR 998
Query: 784 QIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFH 843
++D+IDK E L+LE+SNRR A K++L + K++Q W QR KK WL EGDEN+ +FH
Sbjct: 999 EVDSIDKNELDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFH 1058
Query: 844 KVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI 903
++CS+ Q+ N I EIQDE G ++++I+ NHFS IY K+ + IENL W PI
Sbjct: 1059 RICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPI 1118
Query: 904 DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHK 924
D + LCAPF E E+ I S KAPGPDG+ + F+K YW ++KE+++ IF DF +
Sbjct: 1119 DYSDWSLLCAPFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFE 1132
BLAST of PI0011905 vs. ExPASy TrEMBL
Match:
A0A5A7US62 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold280G003960 PE=4 SV=1)
HSP 1 Score: 667.2 bits (1720), Expect = 1.0e-187
Identity = 386/943 (40.93%), Postives = 558/943 (59.17%), Query Frame = 0
Query: 3 HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKW 62
++I +L+KQ E+ FS KPF +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W
Sbjct: 195 NRIMFSLRKQSEIAFSYKPFQADKAILFL-NPDHAKLLCSNKGANGWSTVGNYQVKFESW 254
Query: 63 SSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAK 122
S H+ + SYGGW FRG+PLHLWNYNTF+ IG+ACGGF+ VAK+TM+ L++AK
Sbjct: 255 DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDAK 314
Query: 123 IKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEY 182
IKV+YNY GF+PA+I ITD+ GE F+V TV E++WL ERNV +HG+F+ +A F+++
Sbjct: 315 IKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQH 374
Query: 183 NSESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTT 242
N +E Y + G A+ P+ +++ + + +++ T+ K N S
Sbjct: 375 NHLAETYTYNGFQAIPPEPTRTHGDYSIH---NSDKHSISYHTQAKKNNSSESEYDPFDQ 434
Query: 243 KKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL 302
+ +++R +KGK +++ +D S+++K S RKVSF SP ++ N E N K L
Sbjct: 435 QLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSL 494
Query: 303 CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTV 362
I + N S QK K K YRIK ++ + + KE + +L+V
Sbjct: 495 EISTI---NDQFEKRWSPRQKTK-TKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSV 554
Query: 363 DLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQESQAQTELIKDSSDPMMTSDE 422
D+G +SPL + Q N +T + +P+ + + + E++ T +K+ +D ++
Sbjct: 555 DMGPISPL-ESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASR 614
Query: 423 DECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS 482
K E + FK++L+IWLKEN LKL+P ++PS SS
Sbjct: 615 STAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPS-----------SS 674
Query: 483 YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFE 542
Y P VI+S+ +++ G GGIL++W+D ++ +
Sbjct: 675 YFP--VIVSD-----------------QNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKV 734
Query: 543 GDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW 602
G+++IS NI ++ GN WW+T +YG K R KLW EL+ L +LC NWLI GDFN+VRW
Sbjct: 735 GNYSISLNILNTNGN-WWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRW 794
Query: 603 NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWE 662
ET + K N LIDPP NN FTWSNLR PT SRLDRFL + WE
Sbjct: 795 ERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKGWE 854
Query: 663 LCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKH 722
F H +RTL R+ SDHFP++LE+ I+WGP PFRLNN+SL D +F +N WW SK
Sbjct: 855 NAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQ 914
Query: 723 LGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNR 782
G PG+AF+Q L +LS+ +K WQ + +N + KK ++++ID IDKLE Q ++ +
Sbjct: 915 AGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQK 974
Query: 783 RTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGL 842
R + KS L SI+ QAQ W QR +++W GDEN +YFH++C+ NQR N I I D G
Sbjct: 975 RISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGT 1034
Query: 843 THCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN 902
+ S D I+ +HF +IY+++ +LI+NL+W PI L +ELC PFDE E+ I
Sbjct: 1035 SLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIM 1094
Query: 903 SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN 923
S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Sbjct: 1095 SFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVN 1097
BLAST of PI0011905 vs. ExPASy TrEMBL
Match:
A0A5D3CA17 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1503G00050 PE=4 SV=1)
HSP 1 Score: 667.2 bits (1720), Expect = 1.0e-187
Identity = 386/943 (40.93%), Postives = 558/943 (59.17%), Query Frame = 0
Query: 3 HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKW 62
++I +L+KQ E+ FS KPF +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W
Sbjct: 195 NRIMFSLRKQSEIAFSYKPFQADKAILFL-NPDHAKLLCSNKGANGWSTVGNYQVKFESW 254
Query: 63 SSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAK 122
S H+ + SYGGW FRG+PLHLWNYNTF+ IG+ACGGF+ VAK+TM+ L++AK
Sbjct: 255 DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDAK 314
Query: 123 IKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEY 182
IKV+YNY GF+PA+I ITD+ GE F+V TV E++WL ERNV +HG+F+ +A F+++
Sbjct: 315 IKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQH 374
Query: 183 NSESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTT 242
N +E Y + G A+ P+ +++ + + +++ T+ K N S
Sbjct: 375 NHLAETYTYNGFQAIPPEPTRTHGDYSIH---NSDKHSISYHTQAKKNNSSESEYDPFDQ 434
Query: 243 KKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL 302
+ +++R +KGK +++ +D S+++K S RKVSF SP ++ N E N K L
Sbjct: 435 QLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSL 494
Query: 303 CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTV 362
I + N S QK K K YRIK ++ + + KE + +L+V
Sbjct: 495 EISTI---NDQFEKRWSPRQKTK-TKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSV 554
Query: 363 DLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQESQAQTELIKDSSDPMMTSDE 422
D+G +SPL + Q N +T + +P+ + + + E++ T +K+ +D ++
Sbjct: 555 DMGPISPL-ESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASR 614
Query: 423 DECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS 482
K E + FK++L+IWLKEN LKL+P ++PS SS
Sbjct: 615 STAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPS-----------SS 674
Query: 483 YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFE 542
Y P VI+S+ +++ G GGIL++W+D ++ +
Sbjct: 675 YFP--VIVSD-----------------QNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKV 734
Query: 543 GDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW 602
G+++IS NI ++ GN WW+T +YG K R KLW EL+ L +LC NWLI GDFN+VRW
Sbjct: 735 GNYSISLNILNTNGN-WWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRW 794
Query: 603 NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWE 662
ET + K N LIDPP NN FTWSNLR PT SRLDRFL + WE
Sbjct: 795 ERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKGWE 854
Query: 663 LCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKH 722
F H +RTL R+ SDHFP++LE+ I+WGP PFRLNN+SL D +F +N WW SK
Sbjct: 855 NAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQ 914
Query: 723 LGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNR 782
G PG+AF+Q L +LS+ +K WQ + +N + KK ++++ID IDKLE Q ++ +
Sbjct: 915 AGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQK 974
Query: 783 RTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGL 842
R + KS L SI+ QAQ W QR +++W GDEN +YFH++C+ NQR N I I D G
Sbjct: 975 RISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGT 1034
Query: 843 THCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN 902
+ S D I+ +HF +IY+++ +LI+NL+W PI L +ELC PFDE E+ I
Sbjct: 1035 SLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIM 1094
Query: 903 SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN 923
S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Sbjct: 1095 SFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVN 1097
BLAST of PI0011905 vs. NCBI nr
Match:
TYJ99315.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 718.0 bits (1852), Expect = 1.0e-202
Identity = 416/945 (44.02%), Postives = 559/945 (59.15%), Query Frame = 0
Query: 3 HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKW 62
HKI QNL+KQ E +F+ FH EKA+++ ANLLC N KGWSTVG + V+FEKW
Sbjct: 272 HKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQN---KGWSTVGKYSVRFEKW 331
Query: 63 SSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAK 122
S H + KL SYGGW +FRG+PLHLWN TF+QIG AC G + VA++T K+L+EA+
Sbjct: 332 SPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACEGLIKVAEETRSAKNLIEAR 391
Query: 123 IKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEY 182
IKV+YNY+GF+PAN+ I D+ G F VQ V E KWL ERNV +HGTFKRQA ASF+++
Sbjct: 392 IKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHPEGKWLIERNVRLHGTFKRQAAASFDDF 451
Query: 183 NSESEMYHFTGNVAVTPDFFELS----------KSEALNLEVTTPEANLT--NLTKTKTP 242
N ESE + F G+ A++PDF S + AL + P+ N T + +
Sbjct: 452 NPESEQFFFEGSEAISPDFLSTSSDGRKSSTPDQPSALKSVIIKPDRNATLPSFLNEELV 511
Query: 243 NPSHYGKTDKTTK-------KTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNK 302
N S+ T +K + LDKGKQ + +S N SKRKVSF SP NK
Sbjct: 512 NDSNLHATANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSKRKVSFNSPSNK 571
Query: 303 TFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVK---KYYRIKNKFNQAP--DT 362
T FNP++APA H +S EK E S +K + K + K F P
Sbjct: 572 TNIFNPDSAPANHSPSLNSPEKKQKVSRERSIKKKSSSTQPNSKANQNKGVFITQPIQIV 631
Query: 363 CKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQAQTE 422
++ A K+G LTVDLG L L D N+ + + + + TNT E+
Sbjct: 632 AHDRDAAKKGLSLTVDLGDLPAL--DPNKSLEDHHNSDNAE-VVDITNTEVVPETPEMKM 691
Query: 423 LIKDSSDPMMTSDEDECRITRVK----GKCEEEE-----ENFKKQLIIWLKENNLKLAPS 482
+ ++S+ ++ + + + K EE+E E FKKQL+ WLK+N LKL+
Sbjct: 692 PVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKKNGLKLSTD 751
Query: 483 LNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGR 542
+ SS + ++++ + I TNK++I SLW SI W+ N+ G
Sbjct: 752 TD----SSGATTSTNVLLNQMNSGLKI--------TNKRIIKSLWPSNSINWIAKNASGS 811
Query: 543 TGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQN 602
+GGILI+W+ Q H LL+ EG F++SAN + ++WW+TGLYG KR++R W EL N
Sbjct: 812 SGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHN 871
Query: 603 LHNLCSTNWLIGGDFNVVRWNNETTTLNPGKH----------KNNLIDPPLTNNRFTWSN 662
L +L S W++GGD NV+R E+T++ H N LIDPPLTNNRFTWSN
Sbjct: 872 LQHLNSFPWILGGDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSN 931
Query: 663 LRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASN--IRWGPSPFRLN 722
LR+ PT SR+DRFLY SSWE F H TRTL RSTSDHFPLV E SN + WGP PFRLN
Sbjct: 932 LRNPPTFSRIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLN 991
Query: 723 NNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRII 782
+ +LSDP+F RN+ WWE S G+PGF+F+QRLK+L+ +K WQ L+ K+ II
Sbjct: 992 SITLSDPEFKRNMGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAII 1051
Query: 783 QQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYF 842
+++D+IDK E LT E+SNRR A K+ L + K++Q W QR KK WL EGDEN+++F
Sbjct: 1052 REVDSIDKKELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFF 1111
Query: 843 HKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKP 902
H++CS+ Q+ + I EIQDE G +++SI+ FS IY S K + IENL+W P
Sbjct: 1112 HRICSSRQKRSFIHEIQDEEGSIQNTNNSISTAFIKFFSRIYRSSTKSDPLFIENLDWNP 1171
BLAST of PI0011905 vs. NCBI nr
Match:
KAA0039309.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 704.1 bits (1816), Expect = 1.6e-198
Identity = 410/966 (42.44%), Postives = 554/966 (57.35%), Query Frame = 0
Query: 4 KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWS 63
KI QNL+KQ E +F+ FH EK +++ ANLLC N KGW+TVG + V+FEKW+
Sbjct: 219 KILQNLRKQTEESFTYNAFHAEKVLVHFNSNVPANLLCQN---KGWTTVGKYTVRFEKWA 278
Query: 64 STNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAKI 123
+H S KL SYGGW +FRG+PLHLWN TF+QIG ACGG + VA++T ++L+EAK+
Sbjct: 279 PASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKL 338
Query: 124 KVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYN 183
K++YNY+GF+PA + I D G FVVQ V +E KWL ERNV +HGTFKRQA ASF+++N
Sbjct: 339 KIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFN 398
Query: 184 SESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEA-------------NLTNLTKTKTP 243
+SE + F G A++PD ++ P A + T L +
Sbjct: 399 PDSEQFLFDGLEAISPDLLNTISGSRKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVN 458
Query: 244 NPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT 303
+ S + +K+ K + LDKGKQ + S + KRKVSF SP NKT
Sbjct: 459 DNSLHATANKSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKT 518
Query: 304 FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA 363
FFNP++APA H S EK E S +K ++ R + K N + A
Sbjct: 519 TFFNPDSAPANH-----SPEKKKRVSRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVA 578
Query: 364 ----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQ--AQT 423
A K+G LTVDLG L L ++ + +S + + + TNT E+ T
Sbjct: 579 HDLDASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAE---VIDITNTEVVPETPELKMT 638
Query: 424 ELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSL 483
+ K +S P + + R K E++E E FK QL+ WLKEN LKL+
Sbjct: 639 DPEKSNSSPEVNYRKQKHSHRRRHYYRKKEDKEKDTNSEAFKNQLVTWLKENGLKLSIDT 698
Query: 484 NQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRT 543
+ + ++S N+L+S
Sbjct: 699 DSSGATTST-------------------------------NALFS---------QLGSSA 758
Query: 544 GGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL 603
GGILI+W+ Q H LL+ EG F++SAN S N+WW+TGLYG KR++R +W +L NL
Sbjct: 759 GGILILWDAQHHSLLSQEEGKFSLSANF-SSFNNSWWLTGLYGPVKRRERLNVWEDLHNL 818
Query: 604 HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNL 663
H+L S+ W+IGGD NVVR E+T + H +N LIDPPLTNNR+TWSNL
Sbjct: 819 HHLNSSPWIIGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNL 878
Query: 664 RSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLE--ASNIRWGPSPFRLNN 723
R+ PT SRLDRFLY S WE+ F H TRTL R TSDHFPLV E S +RWGP+PFRLN+
Sbjct: 879 RNPPTFSRLDRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNS 938
Query: 724 NSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQ 783
+L+DP+F RN+ WWE S GHPGF F+QRLK+L+ +K WQ K+ II+
Sbjct: 939 IALNDPEFKRNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIR 998
Query: 784 QIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFH 843
++D+IDK E L+LE+SNRR A K++L + K++Q W QR KK WL EGDEN+ +FH
Sbjct: 999 EVDSIDKNELDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFH 1058
Query: 844 KVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI 903
++CS+ Q+ N I EIQDE G ++++I+ NHFS IY K+ + IENL W PI
Sbjct: 1059 RICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPI 1118
Query: 904 DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHK 924
D + LCAPF E E+ I S KAPGPDG+ + F+K YW ++KE+++ IF DF +
Sbjct: 1119 DYSDWSLLCAPFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFE 1132
BLAST of PI0011905 vs. NCBI nr
Match:
TYK00493.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 704.1 bits (1816), Expect = 1.6e-198
Identity = 410/966 (42.44%), Postives = 554/966 (57.35%), Query Frame = 0
Query: 4 KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWS 63
KI QNL+KQ E +F+ FH EK +++ ANLLC N KGW+TVG + V+FEKW+
Sbjct: 219 KILQNLRKQTEESFTYNAFHAEKVLVHFNSNVPANLLCQN---KGWTTVGKYTVRFEKWA 278
Query: 64 STNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAKI 123
+H S KL SYGGW +FRG+PLHLWN TF+QIG ACGG + VA++T ++L+EAK+
Sbjct: 279 PASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKL 338
Query: 124 KVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYN 183
K++YNY+GF+PA + I D G FVVQ V +E KWL ERNV +HGTFKRQA ASF+++N
Sbjct: 339 KIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFN 398
Query: 184 SESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEA-------------NLTNLTKTKTP 243
+SE + F G A++PD ++ P A + T L +
Sbjct: 399 PDSEQFLFDGLEAISPDLLNTISGSRKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVN 458
Query: 244 NPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT 303
+ S + +K+ K + LDKGKQ + S + KRKVSF SP NKT
Sbjct: 459 DNSLHATANKSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKT 518
Query: 304 FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA 363
FFNP++APA H S EK E S +K ++ R + K N + A
Sbjct: 519 TFFNPDSAPANH-----SPEKKKRVSRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVA 578
Query: 364 ----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQ--AQT 423
A K+G LTVDLG L L ++ + +S + + + TNT E+ T
Sbjct: 579 HDLDASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAE---VIDITNTEVVPETPELKMT 638
Query: 424 ELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSL 483
+ K +S P + + R K E++E E FK QL+ WLKEN LKL+
Sbjct: 639 DPEKSNSSPEVNYRKQKHSHRRRHYYRKKEDKEKDTNSEAFKNQLVTWLKENGLKLSIDT 698
Query: 484 NQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRT 543
+ + ++S N+L+S
Sbjct: 699 DSSGATTST-------------------------------NALFS---------QLGSSA 758
Query: 544 GGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL 603
GGILI+W+ Q H LL+ EG F++SAN S N+WW+TGLYG KR++R +W +L NL
Sbjct: 759 GGILILWDAQHHSLLSQEEGKFSLSANF-SSFNNSWWLTGLYGPVKRRERLNVWEDLHNL 818
Query: 604 HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNL 663
H+L S+ W+IGGD NVVR E+T + H +N LIDPPLTNNR+TWSNL
Sbjct: 819 HHLNSSPWIIGGDLNVVRMREESTAVTFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNL 878
Query: 664 RSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLE--ASNIRWGPSPFRLNN 723
R+ PT SRLDRFLY S WE+ F H TRTL R TSDHFPLV E S +RWGP+PFRLN+
Sbjct: 879 RNPPTFSRLDRFLYNSRWEILFNPHITRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNS 938
Query: 724 NSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQ 783
+L+DP+F RN+ WWE S GHPGF F+QRLK+L+ +K WQ K+ II+
Sbjct: 939 IALNDPEFKRNMERWWELSVQNGHPGFFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIR 998
Query: 784 QIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFH 843
++D+IDK E L+LE+SNRR A K++L + K++Q W QR KK WL EGDEN+ +FH
Sbjct: 999 EVDSIDKNELDTPLSLEESNRRLALKAELNDLSLKESQFWFQRAKKLWLKEGDENSAFFH 1058
Query: 844 KVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI 903
++CS+ Q+ N I EIQDE G ++++I+ NHFS IY K+ + IENL W PI
Sbjct: 1059 RICSSRQKRNLIHEIQDEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPI 1118
Query: 904 DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHK 924
D + LCAPF E E+ I S KAPGPDG+ + F+K YW ++KE+++ IF DF +
Sbjct: 1119 DYSDWSLLCAPFSEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFE 1132
BLAST of PI0011905 vs. NCBI nr
Match:
TYK08190.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 667.2 bits (1720), Expect = 2.1e-187
Identity = 386/943 (40.93%), Postives = 558/943 (59.17%), Query Frame = 0
Query: 3 HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKW 62
++I +L+KQ E+ FS KPF +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W
Sbjct: 195 NRIMFSLRKQSEIAFSYKPFQADKAILFL-NPDHAKLLCSNKGANGWSTVGNYQVKFESW 254
Query: 63 SSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAK 122
S H+ + SYGGW FRG+PLHLWNYNTF+ IG+ACGGF+ VAK+TM+ L++AK
Sbjct: 255 DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDAK 314
Query: 123 IKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEY 182
IKV+YNY GF+PA+I ITD+ GE F+V TV E++WL ERNV +HG+F+ +A F+++
Sbjct: 315 IKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQH 374
Query: 183 NSESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTT 242
N +E Y + G A+ P+ +++ + + +++ T+ K N S
Sbjct: 375 NHLAETYTYNGFQAIPPEPTRTHGDYSIH---NSDKHSISYHTQAKKNNSSESEYDPFDQ 434
Query: 243 KKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL 302
+ +++R +KGK +++ +D S+++K S RKVSF SP ++ N E N K L
Sbjct: 435 QLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSL 494
Query: 303 CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTV 362
I + N S QK K K YRIK ++ + + KE + +L+V
Sbjct: 495 EISTI---NDQFEKRWSPRQKTK-TKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSV 554
Query: 363 DLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQESQAQTELIKDSSDPMMTSDE 422
D+G +SPL + Q N +T + +P+ + + + E++ T +K+ +D ++
Sbjct: 555 DMGPISPL-ESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASR 614
Query: 423 DECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS 482
K E + FK++L+IWLKEN LKL+P ++PS SS
Sbjct: 615 STAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPS-----------SS 674
Query: 483 YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFE 542
Y P VI+S+ +++ G GGIL++W+D ++ +
Sbjct: 675 YFP--VIVSD-----------------QNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKV 734
Query: 543 GDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW 602
G+++IS NI ++ GN WW+T +YG K R KLW EL+ L +LC NWLI GDFN+VRW
Sbjct: 735 GNYSISLNILNTNGN-WWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRW 794
Query: 603 NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWE 662
ET + K N LIDPP NN FTWSNLR PT SRLDRFL + WE
Sbjct: 795 ERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKGWE 854
Query: 663 LCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKH 722
F H +RTL R+ SDHFP++LE+ I+WGP PFRLNN+SL D +F +N WW SK
Sbjct: 855 NAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQ 914
Query: 723 LGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNR 782
G PG+AF+Q L +LS+ +K WQ + +N + KK ++++ID IDKLE Q ++ +
Sbjct: 915 AGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQK 974
Query: 783 RTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGL 842
R + KS L SI+ QAQ W QR +++W GDEN +YFH++C+ NQR N I I D G
Sbjct: 975 RISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGT 1034
Query: 843 THCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN 902
+ S D I+ +HF +IY+++ +LI+NL+W PI L +ELC PFDE E+ I
Sbjct: 1035 SLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIM 1094
Query: 903 SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN 923
S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Sbjct: 1095 SFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVN 1097
BLAST of PI0011905 vs. NCBI nr
Match:
KAA0057507.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])
HSP 1 Score: 667.2 bits (1720), Expect = 2.1e-187
Identity = 386/943 (40.93%), Postives = 558/943 (59.17%), Query Frame = 0
Query: 3 HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKW 62
++I +L+KQ E+ FS KPF +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W
Sbjct: 195 NRIMFSLRKQSEIAFSYKPFQADKAILFL-NPDHAKLLCSNKGANGWSTVGNYQVKFESW 254
Query: 63 SSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAKDTMEKKDLMEAK 122
S H+ + SYGGW FRG+PLHLWNYNTF+ IG+ACGGF+ VAK+TM+ L++AK
Sbjct: 255 DSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIGSACGGFLDVAKETMQMDKLIDAK 314
Query: 123 IKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEY 182
IKV+YNY GF+PA+I ITD+ GE F+V TV E++WL ERNV +HG+F+ +A F+++
Sbjct: 315 IKVRYNYIGFVPASILITDNQGENFIVTTVQPAEARWLVERNVRVHGSFRTKAADEFDQH 374
Query: 183 NSESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTT 242
N +E Y + G A+ P+ +++ + + +++ T+ K N S
Sbjct: 375 NHLAETYTYNGFQAIPPEPTRTHGDYSIH---NSDKHSISYHTQAKKNNSSESEYDPFDQ 434
Query: 243 KKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL 302
+ +++R +KGK +++ +D S+++K S RKVSF SP ++ N E N K L
Sbjct: 435 QLSDRRKEKGKAILIINDQDHGHYSKRSKRISNRKVSFLSPGGIQSNSSNTEINTKGKSL 494
Query: 303 CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTV 362
I + N S QK K K YRIK ++ + + KE + +L+V
Sbjct: 495 EISTI---NDQFEKRWSPRQKTK-TKLTYRIKKDPQESTEDHKLSLKETGEGSKQMNLSV 554
Query: 363 DLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQESQAQTELIKDSSDPMMTSDE 422
D+G +SPL + Q N +T + +P+ + + + E++ T +K+ +D ++
Sbjct: 555 DMGPISPL-ESMIQSENNHGLDTFNNQTPDGNSKSTDSAEAKNLTVSVKEGADQNKSASR 614
Query: 423 DECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS 482
K E + FK++L+IWLKEN LKL+P ++PS SS
Sbjct: 615 STAEGNSKDAKTGSELEIDRAFKEKLVIWLKENELKLSPKYTNDVPS-----------SS 674
Query: 483 YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFE 542
Y P VI+S+ +++ G GGIL++W+D ++ +
Sbjct: 675 YFP--VIVSD-----------------QNMDIAGHGPLGDKGGILVLWDDTNFKVNDIKV 734
Query: 543 GDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW 602
G+++IS NI ++ GN WW+T +YG K R KLW EL+ L +LC NWLI GDFN+VRW
Sbjct: 735 GNYSISLNILNTNGN-WWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAGDFNIVRW 794
Query: 603 NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWE 662
ET + K N LIDPP NN FTWSNLR PT SRLDRFL + WE
Sbjct: 795 ERETNAKSLDKRNMANFNNFISVNELIDPPPLNNNFTWSNLRVNPTYSRLDRFLLSKGWE 854
Query: 663 LCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKH 722
F H +RTL R+ SDHFP++LE+ I+WGP PFRLNN+SL D +F +N WW SK
Sbjct: 855 NAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKEFQKNFINWWNSSKQ 914
Query: 723 LGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNR 782
G PG+AF+Q L +LS+ +K WQ + +N + KK ++++ID IDKLE Q ++ +
Sbjct: 915 AGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEMSTTHHQK 974
Query: 783 RTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGL 842
R + KS L SI+ QAQ W QR +++W GDEN +YFH++C+ NQR N I I D G
Sbjct: 975 RISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKSICDPAGT 1034
Query: 843 THCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN 902
+ S D I+ +HF +IY+++ +LI+NL+W PI L +ELC PFDE E+ I
Sbjct: 1035 SLDSIDDISRTFISHFQNIYTKESYEEILIDNLSWNPISRLCQSELCKPFDESEIKSTIM 1094
Query: 903 SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN 923
S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Sbjct: 1095 SFSNEKAPGPDGYTMLFYKKHWPDLKDDLLNVFKDFHKAGIVN 1097
BLAST of PI0011905 vs. TAIR 10
Match:
AT1G43760.1 (DNAse I-like superfamily protein )
HSP 1 Score: 97.4 bits (241), Expect = 6.2e-20
Identity = 80/333 (24.02%), Postives = 154/333 (46.25%), Query Frame = 0
Query: 603 NNLIDPPLTNNRFTWSNLR-SQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLV 662
++L+D P +TWSN + P +LDR + W F SDH P +
Sbjct: 259 SDLVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSFPSAIAVFELSGVSDHSPCI 318
Query: 663 LEASNI-RWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKN 722
+ N+ + FR + + P F ++ WE +G F+ + LK + K
Sbjct: 319 IILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLGEHLKAAKKCCK- 378
Query: 723 WQFSLLNKQ--EDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQA--Q 782
LLN+Q + + + + +D+++ ++ Q L DS R ++ F A
Sbjct: 379 ----LLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVARKKWNFFAAALES 438
Query: 783 CWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFS 842
+ Q+++ +WL +GD NT +FHKV ANQ N I ++ ++ + + + ++ +++
Sbjct: 439 FYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYT 498
Query: 843 HIYSEDK-----RGTMLIENLN-WKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPD 902
H+ D I++++ ++ D+L P D+ E+ A+ ++ KAPGPD
Sbjct: 499 HLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDK-EITAAVFAMPRNKAPGPD 558
Query: 903 GYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK 924
+T +F+ + W +VK+ + +F + G + K
Sbjct: 559 SFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLK 585
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
O00370 | 9.4e-13 | 23.20 | LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1 | [more] |
P08548 | 1.4e-05 | 19.86 | LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A5D3BLV7 | 5.0e-203 | 44.02 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119... | [more] |
A0A5A7TDG1 | 7.5e-199 | 42.44 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119... | [more] |
A0A5D3BL61 | 7.5e-199 | 42.44 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119... | [more] |
A0A5A7US62 | 1.0e-187 | 40.93 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119... | [more] |
A0A5D3CA17 | 1.0e-187 | 40.93 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119... | [more] |
Match Name | E-value | Identity | Description | |
TYJ99315.1 | 1.0e-202 | 44.02 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | [more] |
KAA0039309.1 | 1.6e-198 | 42.44 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | [more] |
TYK00493.1 | 1.6e-198 | 42.44 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | [more] |
TYK08190.1 | 2.1e-187 | 40.93 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | [more] |
KAA0057507.1 | 2.1e-187 | 40.93 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
AT1G43760.1 | 6.2e-20 | 24.02 | DNAse I-like superfamily protein | [more] |