IVF0006035 (gene) Melon (IVF77) v1

Overview
Name: IVF0006035
Type: gene
Organism: Cucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Location: chr05: 711257 .. 715470 (+)
RNA-Seq Expression: IVF0006035
Synteny: IVF0006035
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTTATGCAAAAATGGAAGCTAAATCAAAAAGGCCTACTCAGTCCATAAAGAAAAAAGTCTATAGAGTCAAGAGCCGAAGCATGGAAAGGGAAACTCCTCAAACAAGCAGGCAAAAAGATAAAGAAAAAATTGACCCGAATGAGTTTGAACTGGTGGTAGACCTTGGCCACATCTCCCCCCTCTCAGATACTGATTTCTCTTGTCCTGAAAGTCCTTCGTACATCCCTTCACCAACATCTCCTACTGAGTCAGACATTGTAAAGGACAGCCTGGCCTCTATGATGACTTGTGCTCATGAAGATAGAGAAAAGAAGAAAAAGGAGAACTTAAGGGAAGAGACCGAAGATGATGAAGTTAGCTTCAAGAGGAAACTTACAGATTGGCTGAAAGAAAACAACCTCAGACTGGCGGCAGATTTTAATTCACAGTTTAATTCTGTTACAAATGATAGGATGATTTCTATTTTAAATGGGCCACCAAATGTAGGTATTGAAAATGTGTCTGAGGATGTAATTGATGGCAAGGAAAATGACACCATGGGGCCAATTGTGTTCCCATGAAGCTTTTATCCTGGAATGTAAGAGGGCTAAGCTCGGCCCAAAAAAGAGCCCAAATAAAAGCAACTATCTTTGCCTACGATCCAGATTTTGTCATTCTCACTGAAACTAAACTTACATCAGTCTCTAAGAAAATAATAAAATCGATTTGGAGCCCAATTAGCATCAATTGGGCCTTTCTCAATTCTTCCGGCCGATCAGGAGGTATTATTGTGATGTGGAATGATCAAAATTTTTCTATCCTGAGTGTTTTTAAAGGGGCTTTCTCTGTCTCCATCCAAGTTGGCTCCAACAATGGTGCTTCTTGGTGGCTTTCTGCCATTTACGGCCCAGCTAAAAGAAAAAATAGGCCTTTATTTTGGGAGGAACTTGAAAATCTAAAATCCATTTGCTTTCCAACCTGGATTCTTGGTGGAGATTTTAACGTTATCAGATGGAAGGAGGAGACGTCCACCAAAAATCCAGCCTCGCTAAGCATGAAAAGATTCAACACTTTCATAAGCAATTGTAATCTGATTGATCCTCCCCTCACCAATGCAAAGTTTACTTGGTCAAATCTCAGAGCTCAGGCCACCCTCTCCAGACTGGACAGATTTCTTTTCTCTACCCATTGGGAAAATATTTTCCCGGGCCATACTTCAAAAGTGTTAACCCGAACTACTTCAGACCATTTTCCCATTGTTCTCGAGTCGTCTACGATCTCTTGGGGTCCTTCTCCTTTTAGATTCACAAATGCCTACCTAAAAGATCCAGACTACAAGAAAAACATTGAGTTTTGGTGGGGAAACACCAGTCAGCCAGGCTATGCAGGTTACTCCTTTATGCACAGACTAAAGCAGCTGGCTTTGAAAATCAAAGCTTGGGGAAGAGAAAAAAAGGAAAAAATGAAGCTTCTAAAAAGGCCTGGATCAAAGAAATCGATCTAATTGACAAACTAGAGGCTGAAGGATCTGCAACTGAGATTCACAGAGAGAAAAGGATTGCTCTAAAAGCCGACCTTTCCCAAATTACTCTCACTGAAGCTCAAATATGGGCCCAAAAATGCAAAAGAATATGGGTCCATGAAGGTGATGAAAATTCTTCCTTTTTCCACAAAATTTGCACAGCAAGGCAAAAAAAGTGTTTGATCTCCAAGATAATAAACAACAGTGGACAGAATTGCCTAAATGACAGTGACATTGCCGATGCCTTCATTCAACATTTTGAAGAAATCTATACAGACAACAGAAACAGCCATCTGTTTATTGATAATCTCGATTGGTGCCCCATCTCCAACACCAACAGTGACTTGCTGGACAAACCCTTTAATGAAGCTGAAATTTGGCTCACTTTAAAGTCTTTTGCAAAGAATAAAGCTCCAGGTCCAGATGGTTATACGATGGATTTCCTACAAAAGTCTTGGTCTTTTATGAAGCAAAACATTTGTGATATCTTCAAGG
ATTTTCACAGCACCCATACCATCAATAAAGTTGTCAATGAAACTCTCATTACCCTTATAGCCAAAAAAGAAAATTGTGAGACAGTTGCAGACTTTCGGCCCATCAGCCTCACCACGGCTATCTACAAATTAATCGCAAAGGCTTTGGCTGATAGATTGAAACAAACTCTCCCCGATACGATCTCTGAGTCTCAAATGGCCTTCGTTAAAGGAAGAAAAATTACAGAGGCCATTCTTATTGCAAATGAAGCTTTGGATTTCTGGAGAAATAAAAAAGAAAGAGGTTTTGTGATAAAACTGGACATTGAAAAGGCCTTCGATAAGCTAAATTGGCGCTTCATAGACTTTGTGCTTATGAAAAAGAACTACTCCCAGAAATGGAGGAAAATGATTGCCAGTTGCATCTCTAGTGTCCAATACTCTATTCTTATCAATGGTAGACCGAGAGGCAGAATCAAACCTTCTAGAGGAATCCGACAGGGTGACCCCCTTTCACCCTTCATCTTTGTTTTGGCTATGGACTATCTCAGCCGTCTTTTGAACAACTTAGCAGATAAAAGAAAAATCAATGGAGTCAATTTCAGTCCCAACCTTAATCTTACCCACATCCTATTTGCGGATGACATCCTCATCTTTGTAGAGGATAGGGATGACTACGTATCAAACCTCAAAATGATCCTTCATCTCTTTGAATCAGCCTCGGGCCTTAACATCAATCTGTCCAAGTCTACTATCTTTCCCATAAACGTCCCAACAGATCGTGCAAAGTCTATAGCGGACAGTTGGGGAATAAGCAAGGGCCATCTTCCGACATCTTACCTTGGTATGCCCTTAGGAGGGAAGCCTTCCTCATCAAACTTCTGGGACAATGTGCTTCAGAAAATCCAGAAAAAATTGAGCAGCTGGAAATACTCTCAGTTATCCAAAGGCGGCAGAATCACTCTGATAAACTCAACTCTTGAAAGCCTTCCATATATATCAAATGTCGGTCTTCAAGGTCCCCAAAGGTATAGCTCAGAAAATTGAAGCTTCTTGGAGAAATTTCCTTTGGAATGGTACATCGAATGGCCACAACATTAGCCTCATCAGATGGAACCAAATTGTCTCCCCAAAAGAGAAAGGAGGCCTCGGTATTCACTCTGTCAATAGCACAAATTTTGCCCTCCTCTGTAAATGGCTCTGGAAATTTCTAACTGAAAAAGATCCTTTATGGAAACGCCTGATCATTTCCAAATATGATCAGGAGAAAATGGGCAGATTTCCTTCTCGTGGAAAATTCAGCAGCAATAATAGCCCTTGGAAAGCAGTGACAGAGTGTATCAGTTGGTTCTATAAAAACATCAGCTGGAAGGTAAATGATGGAGAAGATATCTCCTTTTGGCTTGACAACTGGAATGGAAATGCTCCTTTATCTTTGGCCGTCCCCCGTCTTTTTGCTCTATCTACAAACAAAAAGGGGTCTGTTAAAGATTTTTGGAATCCCTCATCTAATGACTGGCATCTCCATATCAATCGGCCCCTCCGTGACCATGAAAAAAATTTGTGGCACAATATTAAAGCCTCTCTTCCAACTCCCTTACCGAATAGGGGCCTCCCAAAGCCTTTATGGAAACTAAATTCAAACAACATCTTCGATACCGCTTCCGTAAAAAGGATCCTATCTGAAGCTCCAATCTCTCCAGCAAACTTTCATCCTAATCTCTACAAAACTCTGTGGAAGGTGGAGTTTCCAAAAAAGTGTAAATTTTTCATCTGGACGCTCATCCATGGTTGCATTAATACAGCTGATCGCCTGCAGAAACGTTTACCAAATTGGGCCCTCAGTCCCAACTGGTGTTACATGTGCAACAAGAGCCAAGAAGACATAAATCATCTCTTCATCCATTGCCCCTATAGTCAGCAGTTATGGAGTAAGGCCAAAGCTCTCCTCAAATGGAATAGAACTCCAACTGATGTGCAGTCCCTTGTTCAGAACATTTGCTCCCTTAACATAAGAAATCA
AAAAGGGCTGATAACATTCAATACCAGTGCTACCCTCCTTTGGAAGATTTGGCTGGAAAGAAACAATAGAATCTTCAAGCAACAGGGAAAAGATTCTCAAGATCTTTGGGAAGACATTCTCGCTCAAACCGGTTTATGGAGCTGCAAATCTAAATTATTTTCAAATTATGATTGTTGCTCCATAGCGTTAAACATCTCTGCTTTTGTAAAATAG

mRNA sequence

ATGAGTTATGCAAAAATGGAAGCTAAATCAAAAAGGCCTACTCAGTCCATAAAGAAAAAAGTCTATAGAGTCAAGAGCCGAAGCATGGAAAGGGAAACTCCTCAAACAAGCAGGCAAAAAGATAAAGAAAAAATTGACCCGAATGAGTTTGAACTGGTGGTAGACCTTGGCCACATCTCCCCCCTCTCAGATACTGATTTCTCTTGTCCTGAAAGTCCTTCGTACATCCCTTCACCAACATCTCCTACTGAGTCAGACATTGTAAAGGACAGCCTGGCCTCTATGATGACTTGTGCTCATGAAGATAGAGAAAAGAAGAAAAAGGAGAACTTAAGGGAAGAGACCGAAGATGATGAAGTTAGCTTCAAGAGGAAACTTACAGATTGGCTGAAAGAAAACAACCTCAGACTGGCGGCAGATTTTAATTCACAGTTTAATTCTGTTACAAATGATAGGATGATTTCTATTTTAAATGGGCCACCAAATGTAGGGGCTTTCTCTGTCTCCATCCAAGTTGGCTCCAACAATGGTGCTTCTTGGTGGCTTTCTGCCATTTACGGCCCAGCTAAAAGAAAAAATAGGCCTTTATTTTGGGAGGAACTTGAAAATCTAAAATCCATTTGCTTTCCAACCTGGATTCTTGGTGGAGATTTTAACGTTATCAGATGGAAGGAGGAGACGTCCACCAAAAATCCAGCCTCGCTAAGCATGAAAAGATTCAACACTTTCATAAGCAATTGTAATCTGATTGATCCTCCCCTCACCAATGCAAAGTTTACTTGGTCAAATCTCAGAGCTCAGGCCACCCTCTCCAGACTGGACAGATTTCTTTTCTCTACCCATTGGGAAAATATTTTCCCGGGCCATACTTCAAAAGTGTTAACCCGAACTACTTCAGACCATTTTCCCATTGTTCTCGAGTCGTCTACGATCTCTTGGGGTCCTTCTCCTTTTAGATTCACAAATGCCTACCTAAAAGATCCAGACTACAAGAAAAACATTGAGTTTTGGTGGGGAAACACCAGTCAGCCAGGCTATGCAGCTGGCTTTGAAAATCAAAGCTTGGGGAAGAGAAAAAAAGGAAAAAATGAAGCTTCTAAAAAGGCCTGGATCAAAGAAATCGATCTAATTGACAAACTAGAGGCTGAAGGATCTGCAACTGAGATTCACAGAGAGAAAAGGATTGCTCTAAAAGCCGACCTTTCCCAAATTACTCTCACTGAAGCTCAAATATGGGCCCAAAAATGCAAAAGAATATGGGTCCATGAAGGTGATGAAAATTCTTCCTTTTTCCACAAAATTTGCACAGCAAGGCAAAAAAAGTGTTTGATCTCCAAGATAATAAACAACAGTGGACAGAATTGCCTAAATGACAGTGACATTGCCGATGCCTTCATTCAACATTTTGAAGAAATCTATACAGACAACAGAAACAGCCATCTGTTTATTGATAATCTCGATTGGTGCCCCATCTCCAACACCAACAGTGACTTGCTGGACAAACCCTTTAATGAAGCTGAAATTTGGCTCACTTTAAAGTCTTTTGCAAAGAATAAAGCTCCAGGTCCAGATGGTTATACGATGGATTTCCTACAAAAGTCTTGGTCTTTTATGAAGCAAAACATTTGTGATATCTTCAAGGATTTTCACAGCACCCATACCATCAATAAAGTTGTCAATGAAACTCTCATTACCCTTATAGCCAAAAAAGAAAATTGTGAGACAGTTGCAGACTTTCGGCCCATCAGCCTCACCACGGCTATCTACAAATTAATCGCAAAGGCTTTGGCTGATAGATTGAAACAAACTCTCCCCGATACGATCTCTGAGTCTCAAATGGCCTTCGTTAAAGGAAGAAAAATTACAGAGGCCATTCTTATTGCAAATGAAGCTTTGGATTTCTGGAGAAATAAAAAAGAAAGAGGTTTTGTGATAAAACTGGACATTGAAAAGGCCTTCGATAAGCTAAATTGGCGCTTCATAGACTTTGTGCTTATGAA
AAAGAACTACTCCCAGAAATGGAGGAAAATGATTGCCAGTTGCATCTCTAGTGTCCAATACTCTATTCTTATCAATGGTAGACCGAGAGGCAGAATCAAACCTTCTAGAGGAATCCGACAGGGTGACCCCCTTTCACCCTTCATCTTTGTTTTGGCTATGGACTATCTCAGCCGTCTTTTGAACAACTTAGCAGATAAAAGAAAAATCAATGGAGTCAATTTCAGTCCCAACCTTAATCTTACCCACATCCTATTTGCGGATGACATCCTCATCTTTGTAGAGGATAGGGATGACTACGTATCAAACCTCAAAATGATCCTTCATCTCTTTGAATCAGCCTCGGGCCTTAACATCAATCTGTCCAAGTCTACTATCTTTCCCATAAACGTCCCAACAGATCGTGCAAAGTCTATAGCGGACAGTTGGGGAATAAGCAAGGGCCATCTTCCGACATCTTACCTTGGTATGCCCTTAGGAGGGAAGCCTTCCTCATCAAACTTCTGGGACAATGTGCTTCAGAAAATCCAGAAAAAATTGAGCAGCTGGAAATACTCTCAGTTATCCAAAGGCGGCAGAATCACTCTGATAAACTCAACTCTTGAAAGCCTTCCATATATATCAAATGTCCCCAAAGGTATAGCTCAGAAAATTGAAGCTTCTTGGAGAAATTTCCTTTGGAATGGTACATCGAATGGCCACAACATTAGCCTCATCAGATGGAACCAAATTGTCTCCCCAAAAGAGAAAGGAGGCCTCGGTATTCACTCTGTCAATAGCACAAATTTTGCCCTCCTCTGTAAATGGCTCTGGAAATTTCTAACTGAAAAAGATCCTTTATGGAAACGCCTGATCATTTCCAAATATGATCAGGAGAAAATGGGCAGATTTCCTTCTCGTGGAAAATTCAGCAGCAATAATAGCCCTTGGAAAGCAGTGACAGAGTGTATCAGTTGGTTCTATAAAAACATCAGCTGGAAGGTAAATGATGGAGAAGATATCTCCTTTTGGCTTGACAACTGGAATGGAAATGCTCCTTTATCTTTGGCCGTCCCCCGTCTTTTTGCTCTATCTACAAACAAAAAGGGGTCTGTTAAAGATTTTTGGAATCCCTCATCTAATGACTGGCATCTCCATATCAATCGGCCCCTCCGTGACCATGAAAAAAATTTGTGGCACAATATTAAAGCCTCTCTTCCAACTCCCTTACCGAATAGGGGCCTCCCAAAGCCTTTATGGAAACTAAATTCAAACAACATCTTCGATACCGCTTCCGTAAAAAGGATCCTATCTGAAGCTCCAATCTCTCCAGCAAACTTTCATCCTAATCTCTACAAAACTCTGTGGAAGGTGGAGTTTCCAAAAAAGTGTAAATTTTTCATCTGGACGCTCATCCATGGTTGCATTAATACAGCTGATCGCCTGCAGAAACGTTTACCAAATTGGGCCCTCAGTCCCAACTGGTGTTACATGTGCAACAAGAGCCAAGAAGACATAAATCATCTCTTCATCCATTGCCCCTATAGTCAGCAGTTATGGAGTAAGGCCAAAGCTCTCCTCAAATGGAATAGAACTCCAACTGATGTGCAGTCCCTTGTTCAGAACATTTGCTCCCTTAACATAAGAAATCAAAAAGGGCTGATAACATTCAATACCAGTGCTACCCTCCTTTGGAAGATTTGGCTGGAAAGAAACAATAGAATCTTCAAGCAACAGGGAAAAGATTCTCAAGATCTTTGGGAAGACATTCTCGCTCAAACCGGTTTATGGAGCTGCAAATCTAAATTATTTTCAAATTATGATTGTTGCTCCATAGCGTTAAACATCTCTGCTTTTGTAAAATAG

Coding sequence (CDS)

ATGAGTTATGCAAAAATGGAAGCTAAATCAAAAAGGCCTACTCAGTCCATAAAGAAAAAAGTCTATAGAGTCAAGAGCCGAAGCATGGAAAGGGAAACTCCTCAAACAAGCAGGCAAAAAGATAAAGAAAAAATTGACCCGAATGAGTTTGAACTGGTGGTAGACCTTGGCCACATCTCCCCCCTCTCAGATACTGATTTCTCTTGTCCTGAAAGTCCTTCGTACATCCCTTCACCAACATCTCCTACTGAGTCAGACATTGTAAAGGACAGCCTGGCCTCTATGATGACTTGTGCTCATGAAGATAGAGAAAAGAAGAAAAAGGAGAACTTAAGGGAAGAGACCGAAGATGATGAAGTTAGCTTCAAGAGGAAACTTACAGATTGGCTGAAAGAAAACAACCTCAGACTGGCGGCAGATTTTAATTCACAGTTTAATTCTGTTACAAATGATAGGATGATTTCTATTTTAAATGGGCCACCAAATGTAGGGGCTTTCTCTGTCTCCATCCAAGTTGGCTCCAACAATGGTGCTTCTTGGTGGCTTTCTGCCATTTACGGCCCAGCTAAAAGAAAAAATAGGCCTTTATTTTGGGAGGAACTTGAAAATCTAAAATCCATTTGCTTTCCAACCTGGATTCTTGGTGGAGATTTTAACGTTATCAGATGGAAGGAGGAGACGTCCACCAAAAATCCAGCCTCGCTAAGCATGAAAAGATTCAACACTTTCATAAGCAATTGTAATCTGATTGATCCTCCCCTCACCAATGCAAAGTTTACTTGGTCAAATCTCAGAGCTCAGGCCACCCTCTCCAGACTGGACAGATTTCTTTTCTCTACCCATTGGGAAAATATTTTCCCGGGCCATACTTCAAAAGTGTTAACCCGAACTACTTCAGACCATTTTCCCATTGTTCTCGAGTCGTCTACGATCTCTTGGGGTCCTTCTCCTTTTAGATTCACAAATGCCTACCTAAAAGATCCAGACTACAAGAAAAACATTGAGTTTTGGTGGGGAAACACCAGTCAGCCAGGCTATGCAGCTGGCTTTGAAAATCAAAGCTTGGGGAAGAGAAAAAAAGGAAAAAATGAAGCTTCTAAAAAGGCCTGGATCAAAGAAATCGATCTAATTGACAAACTAGAGGCTGAAGGATCTGCAACTGAGATTCACAGAGAGAAAAGGATTGCTCTAAAAGCCGACCTTTCCCAAATTACTCTCACTGAAGCTCAAATATGGGCCCAAAAATGCAAAAGAATATGGGTCCATGAAGGTGATGAAAATTCTTCCTTTTTCCACAAAATTTGCACAGCAAGGCAAAAAAAGTGTTTGATCTCCAAGATAATAAACAACAGTGGACAGAATTGCCTAAATGACAGTGACATTGCCGATGCCTTCATTCAACATTTTGAAGAAATCTATACAGACAACAGAAACAGCCATCTGTTTATTGATAATCTCGATTGGTGCCCCATCTCCAACACCAACAGTGACTTGCTGGACAAACCCTTTAATGAAGCTGAAATTTGGCTCACTTTAAAGTCTTTTGCAAAGAATAAAGCTCCAGGTCCAGATGGTTATACGATGGATTTCCTACAAAAGTCTTGGTCTTTTATGAAGCAAAACATTTGTGATATCTTCAAGGATTTTCACAGCACCCATACCATCAATAAAGTTGTCAATGAAACTCTCATTACCCTTATAGCCAAAAAAGAAAATTGTGAGACAGTTGCAGACTTTCGGCCCATCAGCCTCACCACGGCTATCTACAAATTAATCGCAAAGGCTTTGGCTGATAGATTGAAACAAACTCTCCCCGATACGATCTCTGAGTCTCAAATGGCCTTCGTTAAAGGAAGAAAAATTACAGAGGCCATTCTTATTGCAAATGAAGCTTTGGATTTCTGGAGAAATAAAAAAGAAAGAGGTTTTGTGATAAAACTGGACATTGAAAAGGCCTTCGATAAGCTAAATTGGCGCTTCATAGACTTTGTGCTTATGAA
AAAGAACTACTCCCAGAAATGGAGGAAAATGATTGCCAGTTGCATCTCTAGTGTCCAATACTCTATTCTTATCAATGGTAGACCGAGAGGCAGAATCAAACCTTCTAGAGGAATCCGACAGGGTGACCCCCTTTCACCCTTCATCTTTGTTTTGGCTATGGACTATCTCAGCCGTCTTTTGAACAACTTAGCAGATAAAAGAAAAATCAATGGAGTCAATTTCAGTCCCAACCTTAATCTTACCCACATCCTATTTGCGGATGACATCCTCATCTTTGTAGAGGATAGGGATGACTACGTATCAAACCTCAAAATGATCCTTCATCTCTTTGAATCAGCCTCGGGCCTTAACATCAATCTGTCCAAGTCTACTATCTTTCCCATAAACGTCCCAACAGATCGTGCAAAGTCTATAGCGGACAGTTGGGGAATAAGCAAGGGCCATCTTCCGACATCTTACCTTGGTATGCCCTTAGGAGGGAAGCCTTCCTCATCAAACTTCTGGGACAATGTGCTTCAGAAAATCCAGAAAAAATTGAGCAGCTGGAAATACTCTCAGTTATCCAAAGGCGGCAGAATCACTCTGATAAACTCAACTCTTGAAAGCCTTCCATATATATCAAATGTCCCCAAAGGTATAGCTCAGAAAATTGAAGCTTCTTGGAGAAATTTCCTTTGGAATGGTACATCGAATGGCCACAACATTAGCCTCATCAGATGGAACCAAATTGTCTCCCCAAAAGAGAAAGGAGGCCTCGGTATTCACTCTGTCAATAGCACAAATTTTGCCCTCCTCTGTAAATGGCTCTGGAAATTTCTAACTGAAAAAGATCCTTTATGGAAACGCCTGATCATTTCCAAATATGATCAGGAGAAAATGGGCAGATTTCCTTCTCGTGGAAAATTCAGCAGCAATAATAGCCCTTGGAAAGCAGTGACAGAGTGTATCAGTTGGTTCTATAAAAACATCAGCTGGAAGGTAAATGATGGAGAAGATATCTCCTTTTGGCTTGACAACTGGAATGGAAATGCTCCTTTATCTTTGGCCGTCCCCCGTCTTTTTGCTCTATCTACAAACAAAAAGGGGTCTGTTAAAGATTTTTGGAATCCCTCATCTAATGACTGGCATCTCCATATCAATCGGCCCCTCCGTGACCATGAAAAAAATTTGTGGCACAATATTAAAGCCTCTCTTCCAACTCCCTTACCGAATAGGGGCCTCCCAAAGCCTTTATGGAAACTAAATTCAAACAACATCTTCGATACCGCTTCCGTAAAAAGGATCCTATCTGAAGCTCCAATCTCTCCAGCAAACTTTCATCCTAATCTCTACAAAACTCTGTGGAAGGTGGAGTTTCCAAAAAAGTGTAAATTTTTCATCTGGACGCTCATCCATGGTTGCATTAATACAGCTGATCGCCTGCAGAAACGTTTACCAAATTGGGCCCTCAGTCCCAACTGGTGTTACATGTGCAACAAGAGCCAAGAAGACATAAATCATCTCTTCATCCATTGCCCCTATAGTCAGCAGTTATGGAGTAAGGCCAAAGCTCTCCTCAAATGGAATAGAACTCCAACTGATGTGCAGTCCCTTGTTCAGAACATTTGCTCCCTTAACATAAGAAATCAAAAAGGGCTGATAACATTCAATACCAGTGCTACCCTCCTTTGGAAGATTTGGCTGGAAAGAAACAATAGAATCTTCAAGCAACAGGGAAAAGATTCTCAAGATCTTTGGGAAGACATTCTCGCTCAAACCGGTTTATGGAGCTGCAAATCTAAATTATTTTCAAATTATGATTGTTGCTCCATAGCGTTAAACATCTCTGCTTTTGTAAAATAG

Protein sequence

MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHISPLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKENLREETEDDEVSFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAGFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPYISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Homology
BLAST of IVF0006035 vs. ExPASy Swiss-Prot
Match: P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)

HSP 1 Score: 181.0 bits (458), Expect = 8.3e-44
Identity = 197/816 (24.14%), Positives = 352/816 (43.14%), Query Frame = 0

Query: 185 IYGPAKRKNRPLFWEE-LENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTF 244
           IY P    N P F  E L ++ ++   T I+ GDFN      + S+K   S  +   N+ 
Sbjct: 113 IYAP--NHNAPQFIRETLTDMSNLISSTSIVVGDFNTPLAVLDRSSKKKLSKEILDLNST 172

Query: 245 ISNCNLID------PPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRT 304
           I + +L D      P  T   F  S   A  T S++D  L   H  N+      +++   
Sbjct: 173 IQHLDLTDIYRTFHPNKTEYTFFSS---AHGTYSKIDHIL--GHKSNLSKFKKIEIIPCI 232

Query: 305 TSDHFPIVLE--------SSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAAG 364
            SDH  I +E        + T +W  +     + ++ D   K+  +F   N +Q      
Sbjct: 233 FSDHHGIKVELNNNRNLHTHTKTWKLNNLMLKDTWVIDEIKKEITKFLEQNNNQD---TN 292

Query: 365 FEN-QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALK-ADLSQITLT 424
           ++N     K        + +A++K+ +  +     G   ++ +E+    K +   +IT  
Sbjct: 293 YQNLWDTAKAVLRGKFIALQAFLKKTEREEVNNLMGHLKQLEKEEHSNPKPSRRKEITKI 352

Query: 425 EAQIWAQKCKRIWVHEGDENSSFFHKI----------CTARQKKCLISKIINNSGQNCLN 484
            A++   + KRI        S FF KI             ++ K LIS I N + +   +
Sbjct: 353 RAELNEIENKRIIQQINKSKSWFFEKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTD 412

Query: 485 DSDIADAFIQHFEEIYTDNRNSHLFIDN-LDWC---PISNTNSDLLDKPFNEAEIWLTLK 544
            S+I     ++++++Y+    +   ID  L+ C    +S    ++L++P + +EI  T++
Sbjct: 413 PSEIQKILNEYYKKLYSHKYENLKEIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQ 472

Query: 545 SFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENC 604
           +  K K+PGPDG+T +F Q     +   + ++F++      +     E  ITLI K    
Sbjct: 473 NLPKKKSPGPDGFTSEFYQTFKEELVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKD 532

Query: 605 ET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKG-------RKITEAI 664
            T   ++RPISL     K++ K L +R++Q +   I   Q+ F+ G       RK    I
Sbjct: 533 PTRKENYRPISLMNIDAKILNKILTNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVI 592

Query: 665 LIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSV 724
              N+       K +   ++ +D EKAFD +   F+   L K      + K+I +  S  
Sbjct: 593 QHINKL------KNKDHMILSIDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKP 652

Query: 725 QYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNF-SPN 784
             +I++NG          G RQG PLSP +F + M+ L+  +    +++ I G++  S  
Sbjct: 653 TANIILNGVKLKSFPLRSGTRQGCPLSPLLFNIVMEVLAIAIR---EEKAIKGIHIGSEE 712

Query: 785 LNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKS 844
           + L+  LFADD+++++E+  D  + L  ++  + + SG  IN  KS  F         K+
Sbjct: 713 IKLS--LFADDMIVYLENTRDSTTKLLEVIKEYSNVSGYKINTHKSVAFIYTNNNQAEKT 772

Query: 845 IADSWGISKGHLPTSYLGMPL--GGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITL 904
           + DS   +       YLG+ L    K      ++ + ++I + ++ WK    S  GRI +
Sbjct: 773 VKDSIPFTVVPKKMKYLGVYLTKDVKDLYKENYETLRKEIAEDVNKWKNIPCSWLGRINI 832

Query: 905 IN-STLESLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK 951
           +  S L    Y  N      P    + +E    +F+WN          I    + +  + 
Sbjct: 833 VKMSILPKAIYNFNAIPIKAPLSYFKDLEKIILHFIWN-----QKKPQIAKTLLSNKNKA 892

BLAST of IVF0006035 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 171.8 bits (434), Expect = 5.1e-41
Identity = 189/811 (23.30%), Positives = 337/811 (41.55%), Query Frame = 0

Query: 185 IYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFI 244
           IY P  R       + L  LK+   P  I+ GDFN     ++ S K   +    +    +
Sbjct: 121 IYAPNARA-ATFIRDTLVKLKAYIAPHTIIVGDFNTPLSSKDRSWKQKLNRDTVKLTEVM 180

Query: 245 SNCNLID-----PPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTS 304
              +L D      P T     +S      T S++D  +   H   +      +++    S
Sbjct: 181 KQMDLTDIYRTFYPKTKGYTFFS--APHGTFSKIDHII--GHKTGLNRYKNIEIVPCILS 240

Query: 305 DHFPI-VLESSTISWGPSPF--RFTNAYLKDPDYKKNI-----EFWWGNTSQPGYAAGFE 364
           DH  + ++ ++ I+ G   F  +  N  L D   K+ I     +F   N ++   A  + 
Sbjct: 241 DHHGLRLIFNNNINNGKPTFTWKLNNTLLNDTLVKEGIKKEIKDFLEFNENE---ATTYP 300

Query: 365 NQ-------------SLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALK 424
           N              +L   KK +  A   +    +  ++K EA  S     R++ I L+
Sbjct: 301 NLWDTMKAFLRGKLIALSASKKKRETAHTSSLTTHLKALEKKEA-NSPKRSRRQEIIKLR 360

Query: 425 ADLSQITLTEAQIWAQKCKRIWVHEG-DENSSFFHKICTARQKKCLISKIINNSGQNCLN 484
            +++Q+  T   I      R W  E  ++      ++    + K LI+KI N  G    +
Sbjct: 361 GEINQVE-TRRTIQRINQTRSWFFEKINKIDKPLARLTKGHRDKILINKIRNEKGDITTD 420

Query: 485 DSDIADAFIQHFEEIYTDNRNS----HLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLK 544
             +I +     ++ +Y+    +      F+D      ++    D L+ P +  EI   + 
Sbjct: 421 PEEIQNTIRSFYKRLYSTKLENLDEMDKFLDRYQVPKLNQDQVDHLNSPISPKEIEAVIN 480

Query: 545 SFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVN---ETLITLIAKK 604
           S    K+PGPDG++ +F Q   +F +  I  + K FH       + N   E  ITLI K 
Sbjct: 481 SLPTKKSPGPDGFSAEFYQ---TFKEDLIPILHKLFHKIEVEGTLPNSFYEATITLIPKP 540

Query: 605 ENCET-VADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIAN 664
           +   T + +FRPISL     K++ K LA+R+++ +   I   Q+ F+ G +    I  + 
Sbjct: 541 QKDPTKIENFRPISLMNIDAKILNKILANRIQEHIKAIIHPDQVGFIPGMQGWFNIRKSI 600

Query: 665 EALDFWRNKKERG-FVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYS 724
             + +    K++   +I LD EKAFDK+   F+  VL +      +  MI +  S    +
Sbjct: 601 NVIHYINKLKDKNHMIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVAN 660

Query: 725 ILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLT 784
           I +NG     I    G RQG PLSP++F + ++ L+R +     +++I G+       + 
Sbjct: 661 IKVNGEKLEAIPLKSGTRQGCPLSPYLFNIVLEVLARAIR---QQKEIKGIQIGKE-EVK 720

Query: 785 HILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADS 844
             L ADD+++++ D  +    L  +++ F    G  IN +KS  F         K I ++
Sbjct: 721 ISLLADDMIVYISDPKNSTRELLNLINSFGEVVGYKINSNKSMAFLYTKNKQAEKEIRET 780

Query: 845 WGISKGHLPTSYLGMPLGG--KPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLIN-S 904
              S       YLG+ L    K      + ++ ++I++ L  WK    S  GRI ++  +
Sbjct: 781 TPFSIVTNNIKYLGVTLTKEVKDLYDKNFKSLKKEIKEDLRRWKDLPCSWIGRINIVKMA 840

Query: 905 TLESLPYISN-----VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLG 951
            L    Y  N     +P     ++E +   F+WN        SL++       +  GG+ 
Sbjct: 841 ILPKAIYRFNAIPIKIPTQFFNELEGAICKFVWNNKKPRIAKSLLK-----DKRTSGGIT 900

BLAST of IVF0006035 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 163.3 bits (412), Expect = 1.8e-38
Identity = 197/822 (23.97%), Positives = 350/822 (42.58%), Query Frame = 0

Query: 176 NGASWWLSAIYGPAKRKNRPLFWEEL----ENLKSICFPTWILGGDFNVIRWKEETSTKN 235
           +G ++ L  +Y P     R  F+E L    E + S      I+GGDFN      + +   
Sbjct: 100 SGRTYNLMNVYAPTTGPERARFFESLSAYMETIDS--DEALIIGGDFNYTLDARDRNVPK 159

Query: 236 PASLSMKRFNTFISNCNLID-----PPLTNAKFTWSNLR-AQATLSRLDRFLFSTHWENI 295
               S       I++ +L+D      P T A FT+  +R    + SR+DR   S+H   +
Sbjct: 160 KRDSSESVLRELIAHFSLVDVWREQNPETVA-FTYVRVRDGHVSQSRIDRIYISSHL--M 219

Query: 296 FPGHTSKVLTRTTSDHFPIVLESSTISWGPSP--FRFTNAYLKDPDYKKNIEFWWGN--- 355
               +S +     SDH  + L  S     P    + F N+ L+D  + K++   W     
Sbjct: 220 SRAQSSTIRLAPFSDHNCVSLRMSIAPSLPKAAYWHFNNSLLEDEGFAKSVRDTWRGWRA 279

Query: 356 ------TSQPGYAAGFEN-----QSLGKRKKGKNEASKKAWIKEI-DLIDKLEAEGSATE 415
                 T    +  G  +     Q   K   G+  A  +A   E+ DL  +L   GS  +
Sbjct: 280 FQDEFATLNQWWDVGKVHLKLLCQEYTKSVSGQRNAEIEALNGEVLDLEQRL--SGSEDQ 339

Query: 416 IHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKII 475
             + + +  K  L  +   +A+    + +   + + D  S FF+ +   +  +  I+ + 
Sbjct: 340 ALQCEYLERKEALRNMEQRQARGAFVRSRMQLLCDMDRGSRFFYALEKKKGNRKQITCLF 399

Query: 476 NNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNL-DWCP-ISNTNSDLLDKPFNEA 535
              G    +   I D     ++ +++ +  S    + L D  P +S    + L+ P    
Sbjct: 400 AEDGTPLEDPEAIRDRARSFYQNLFSPDPISPDACEELWDGLPVVSERRKERLETPITLD 459

Query: 536 EIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITL 595
           E+   L+    NK+PG DG T++F Q  W  +  +   +  +      +       +++L
Sbjct: 460 ELSQALRLMPHNKSPGLDGLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSL 519

Query: 596 IAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAIL 655
           + KK +   + ++RP+SL +  YK++AKA++ RLK  L + I   Q   V GR I + + 
Sbjct: 520 LPKKGDLRLIKNWRPVSLLSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVF 579

Query: 656 IANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQ 715
           +  + L F R        + LD EKAFD+++ +++   L   ++  ++   + +  +S +
Sbjct: 580 LIRDLLHFARRTGLSLAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAE 639

Query: 716 YSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLN 775
             + IN      +   RG+RQG PLS  ++ LA++    LL     KR    V   P++ 
Sbjct: 640 CLVKINWSLTAPLAFGRGVRQGCPLSGQLYSLAIEPFLCLLR----KRLTGLVLKEPDMR 699

Query: 776 LTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKST-IFPINVPTDRAKSI 835
           +    +ADD+++  +D  D +   +    ++ +AS   IN SKS+ +   ++  D     
Sbjct: 700 VVLSAYADDVILVAQDLVD-LERAQECQEVYAAASSARINWSKSSGLLEGSLKVDFLPPA 759

Query: 836 ADSWGISKGHLPTSYLGMPLGGK--PSSSNFWDNVLQKIQKKLSSWK--YSQLSKGGRIT 895
                IS       YLG+ L  +  P S NF + + + +  +L  WK     LS  GR  
Sbjct: 760 FRD--ISWESKIIKYLGVYLSAEEYPVSQNFIE-LEECVLTRLGKWKGFAKVLSMRGRAL 819

Query: 896 LINSTLES-----LPYISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK 955
           +IN  + S     L  +S   + IA KI+    +FLW G    H +S         P ++
Sbjct: 820 VINQLVASQIWYRLICLSPTQEFIA-KIQRRLLDFLWIGK---HWVSA---GVSSLPLKE 879

Query: 956 GGLGIHSVNSTNFALLCKWLWKFL-TEKDPLWKRLIISKYDQ 958
           GG G+  + S       + + ++L  +  P W  L  S Y Q
Sbjct: 880 GGQGVVCIRSQVHTFRLQQIQRYLYADPSPQWCTLASSFYRQ 899

BLAST of IVF0006035 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 6.8e-38
Identity = 185/788 (23.48%), Positives = 331/788 (42.01%), Query Frame = 0

Query: 213 ILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLT----NAKFTWSNLRAQA 272
           ++ GDFN      + ST+   +   +  N+ +   +LID   T    + ++T+ +     
Sbjct: 141 LIMGDFNTPLSILDRSTRQKVNKDTQELNSALHQTDLIDIYRTLHPKSTEYTFFS-APHH 200

Query: 273 TLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLE--------SSTISWGPSPFRF 332
           T S++D  + S     +     ++++T   SDH  I LE        S + +W  +    
Sbjct: 201 TYSKIDHIVGSK--ALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNL-L 260

Query: 333 TNAYLKDPDYKKNIEFWW-----GNTSQPGYAAGFENQSLGK-----RKKGKNEASK--- 392
            N Y    + K  I+ ++      +T+       F+    GK       K K E SK   
Sbjct: 261 LNDYWVHNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDT 320

Query: 393 -KAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKC--KRIWVHEG 452
             + +KE++  ++  ++ S     R++   ++A+L +I   E Q   QK    R W  E 
Sbjct: 321 LTSQLKELEKQEQTHSKAS----RRQEITKIRAELKEI---ETQKTLQKINESRSWFFER 380

Query: 453 -DENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDN----RNS 512
            ++      ++   +++K  I  I N+ G    + ++I     ++++ +Y +        
Sbjct: 381 INKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEM 440

Query: 513 HLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMK 572
             F+D      ++    + L++P   +EI   + S    K+PGPDG+T +F Q+    + 
Sbjct: 441 DTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELV 500

Query: 573 QNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVAD-FRPISLTTAIYKLIAKALAD 632
             +  +F+       +     E  I LI K     T  + FRPISL     K++ K LA+
Sbjct: 501 PFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAN 560

Query: 633 RLKQTLPDTISESQMAFVKG-------RKITEAILIANEALDFWRNKKERGFVIKLDIEK 692
           R++Q +   I   Q+ F+ G       RK    I   N A      K +   +I +D EK
Sbjct: 561 RIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRA------KDKNHVIISIDAEK 620

Query: 693 AFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPL 752
           AFDK+   F+   L K      + K+I +       +I++NG+         G RQG PL
Sbjct: 621 AFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPL 680

Query: 753 SPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLK 812
           SP +F + ++ L+R +     +++I G+       +   LFADD+++++E+      NL 
Sbjct: 681 SPLLFNIVLEVLARAIR---QEKEIKGIQLGKE-EVKLSLFADDMIVYLENPIVSAQNLL 740

Query: 813 MILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPL--GGKP 872
            ++  F   SG  IN+ KS  F  N        I      +       YLG+ L    K 
Sbjct: 741 KLISNFSKVSGYKINVQKSQAFLYNNNRQTESQIMGELPFTIASKRIKYLGIQLTRDVKD 800

Query: 873 SSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLIN-STLESLPYISN-----VPKGIAQK 932
                +  +L++I++  + WK    S  GRI ++  + L  + Y  N     +P     +
Sbjct: 801 LFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTE 860

Query: 933 IEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK-GGLGIHSVNSTNFALLCKWLWKFLTE 950
           +E +   F+WN       I+      I+S K K GG+ +        A + K  W +   
Sbjct: 861 LEKTTLKFIWN--QKRARIA----KSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQN 901

BLAST of IVF0006035 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 1.0e-33
Identity = 96/385 (24.94%), Positives = 171/385 (44.42%), Query Frame = 0

Query: 823  MPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPYISN----VPK 882
            MP+  K  + + +  +L+++  ++S W+   LS  GR+TL  + L S+P  S     +P+
Sbjct: 1    MPVLQKRINKDTFGEILERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQ 60

Query: 883  GIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWK 942
             I  +++   R FLW  T+      L++W+++ SPK++GGLG+ +  S N AL+ K  W+
Sbjct: 61   SILNRLDQLSRTFLWGSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWR 120

Query: 943  FLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVT----ECISWFYKNISWKV 1002
             L EK+ LW  ++  KY   ++          S +S W+++     + +S     + W  
Sbjct: 121  LLQEKNSLWTLVLQKKYHVGEIRDSRWLIPKGSWSSTWRSIAIGLRDVVS---HGVGWIP 180

Query: 1003 NDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHE 1062
             DG+ I FW D W    PL L +      +       KD W P    W      P   + 
Sbjct: 181  GDGQQIRFWTDRWVSGKPL-LELDNGERPTDCDTVVAKDLWIP-GRGWDFAKIDPYTTNN 240

Query: 1063 KNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTL 1122
              L   ++A +   L      +  WK + +  F   S   +L+   +   N   + +  L
Sbjct: 241  TRL--ELRA-VVLDLVTGARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNM-ASFFNCL 300

Query: 1123 WKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPY 1182
            WKV  P++ K F+W + +  + T +   +R      + N C +C    E + H+   CP 
Sbjct: 301  WKVRVPERVKTFLWLVGNQAVMTEEERHRR---HLSASNVCQVCKGGVESMLHVLRDCPA 360

Query: 1183 SQQLWSK-----------AKALLKW 1189
               +W +           +K+L +W
Sbjct: 361  QLGIWVRVVPQRRQQGFFSKSLFEW 373

BLAST of IVF0006035 vs. ExPASy TrEMBL
Match: A0A5A7T9I7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold266G00980 PE=4 SV=1)

HSP 1 Score: 2190.2 bits (5674), Expect = 0.0e+00
Identity = 1054/1144 (92.13%), Positives = 1084/1144 (94.76%), Query Frame = 0

Query: 150  NDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICF 209
            ND+  SIL+     GAFSVSIQVGSNNGA WWLSAIYGPAKRKNRPLFWEELE+LKSIC 
Sbjct: 3    NDQNFSILS--VFKGAFSVSIQVGSNNGAFWWLSAIYGPAKRKNRPLFWEELEHLKSICL 62

Query: 210  PTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQAT 269
            PTWILGGDFNVIRWKEET+TKNPA LSM+RFN+FISNCNLIDPPL+NAK+TWSNLRAQAT
Sbjct: 63   PTWILGGDFNVIRWKEETTTKNPALLSMRRFNSFISNCNLIDPPLSNAKYTWSNLRAQAT 122

Query: 270  LSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD 329
            LSRLDRFLF++ WENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD
Sbjct: 123  LSRLDRFLFTSQWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD 182

Query: 330  YKKNIEFWWGNTSQPGYAA----------GFENQSLGKRKKGKNEASKKAWIKEIDLIDK 389
            YKKNIEFWWGNTSQPGYA               ++ G+ KKGKNEASKKA IKEID IDK
Sbjct: 183  YKKNIEFWWGNTSQPGYAGYSFMRRLKQLALIIKTWGRDKKGKNEASKKACIKEIDQIDK 242

Query: 390  LEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ 449
            LEAEGSATEIHREKR ALKADLSQI LTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ
Sbjct: 243  LEAEGSATEIHREKRTALKADLSQINLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ 302

Query: 450  KKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLL 509
            KKCLISKIINNSGQNCLNDSDIADAFIQHFE+IYTDNRNS LFI+NLDWCPISN NS+LL
Sbjct: 303  KKCLISKIINNSGQNCLNDSDIADAFIQHFEDIYTDNRNSQLFIENLDWCPISNINSELL 362

Query: 510  DKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVV 569
            DKPFNEAEIWLTLKSFAKNKAPGPDGY MDFLQKSWSFMKQNICDIFKDFHSTH INKVV
Sbjct: 363  DKPFNEAEIWLTLKSFAKNKAPGPDGYAMDFLQKSWSFMKQNICDIFKDFHSTHIINKVV 422

Query: 570  NETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGR 629
            NETLITLIAKKE+CET ADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR
Sbjct: 423  NETLITLIAKKEHCETAADFRPISLTTAIYKLIAKTLADRLKQTLPDTISESQMAFVKGR 482

Query: 630  KITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA 689
            +ITEAILIANEALDFWR+KKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA
Sbjct: 483  QITEAILIANEALDFWRSKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA 542

Query: 690  SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV 749
            SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV
Sbjct: 543  SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV 602

Query: 750  NFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT 809
             FSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT
Sbjct: 603  KFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT 662

Query: 810  DRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGR 869
            DRAKSIADSWGISKGHLPTSYLGMPLGG+PSSSNFWDNVLQKIQKKLS+WKYSQLSKGGR
Sbjct: 663  DRAKSIADSWGISKGHLPTSYLGMPLGGRPSSSNFWDNVLQKIQKKLSNWKYSQLSKGGR 722

Query: 870  ITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKE 929
            ITLINSTLESLP     +  VPKGIAQKIEASWRNFLWNG SNGHNISLIRWNQIVSPKE
Sbjct: 723  ITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGASNGHNISLIRWNQIVSPKE 782

Query: 930  KGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSP 989
            KGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYD+EKMG FPS GKFSSNNSP
Sbjct: 783  KGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDKEKMGSFPSHGKFSSNNSP 842

Query: 990  WKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFW 1049
            WKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVK+FW
Sbjct: 843  WKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKEFW 902

Query: 1050 NPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRI 1109
            NPSSNDWHLHINRPLRDHE+NLWHNIKASLPTPLPNRG PKPLW LNSNNIFDTASVKR 
Sbjct: 903  NPSSNDWHLHINRPLRDHEENLWHNIKASLPTPLPNRGHPKPLWNLNSNNIFDTASVKRA 962

Query: 1110 LSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWC 1169
            ++EAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW LSPNWC
Sbjct: 963  IAEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWTLSPNWC 1022

Query: 1170 YMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLIT 1229
            YMCNKSQEDINHLFIHCPYSQQLWSKAKALL WN TPTDVQSL+QNICSLNIRNQKGLIT
Sbjct: 1023 YMCNKSQEDINHLFIHCPYSQQLWSKAKALLNWNSTPTDVQSLIQNICSLNIRNQKGLIT 1082

Query: 1230 FNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNI 1280
            FNT+AT+LWKIWLERNNRIFKQQ K  QDLWED LAQ GLWSCKSKLFSNYDCCSIALNI
Sbjct: 1083 FNTNATILWKIWLERNNRIFKQQEKAPQDLWEDTLAQIGLWSCKSKLFSNYDCCSIALNI 1142

BLAST of IVF0006035 vs. ExPASy TrEMBL
Match: A0A5A7TIB8 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold206G00430 PE=4 SV=1)

HSP 1 Score: 1924.4 bits (4984), Expect = 0.0e+00
Identity = 918/1032 (88.95%), Positives = 960/1032 (93.02%), Query Frame = 0

Query: 263  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTN 322
            NLRAQATLSRLDRFLFS  WEN FPGHTSK LTRTTSDHFPIVLESS+ISWGP PFRFTN
Sbjct: 6    NLRAQATLSRLDRFLFSPQWENTFPGHTSKTLTRTTSDHFPIVLESSSISWGPPPFRFTN 65

Query: 323  AYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQSLGKRKKGKNEASKKAWIK 382
            AYLKDPDYK+NIEFWWGNTSQPG+A             + ++ GK KKGK+E SKKAWIK
Sbjct: 66   AYLKDPDYKRNIEFWWGNTSQPGFAGYSFMRRLKQLAMKIKAWGKEKKGKDEVSKKAWIK 125

Query: 383  EIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFH 442
            EI+LIDKLEAEG+ATEIHR KR+ALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFH
Sbjct: 126  EINLIDKLEAEGTATEIHRVKRLALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFH 185

Query: 443  KICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPIS 502
            KICTARQKKCLISK+INN GQNCLNDSDI DAFIQHFEEIYTDN+NS LFIDNLDWCPIS
Sbjct: 186  KICTARQKKCLISKVINNCGQNCLNDSDIVDAFIQHFEEIYTDNKNSQLFIDNLDWCPIS 245

Query: 503  NTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST 562
            NTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQKSWSFMK NICDIFKDFHS 
Sbjct: 246  NTNRCLLDKPFNESEIWLTLKSFTKNKAPGPDGFTMDFLQKSWSFMKHNICDIFKDFHSN 305

Query: 563  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQ 622
            HTINKVVNETLITLIAKK+NCETV+DFRPISLTTAIYKLIAK LADRLKQTLP TISE Q
Sbjct: 306  HTINKVVNETLITLIAKKDNCETVSDFRPISLTTAIYKLIAKVLADRLKQTLPYTISELQ 365

Query: 623  MAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQ 682
            MAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDF+LMKKNYS 
Sbjct: 366  MAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFMLMKKNYSP 425

Query: 683  KWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLAD 742
            KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLSPFIFVLAMDYLS LL NLA+
Sbjct: 426  KWRNMIASCISSVQYSILINGRPRGRIKPTRGIRQGDPLSPFIFVLAMDYLSHLLINLAE 485

Query: 743  KRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTI 802
            K KINGVNF PNLNLTHILFADDILIFVED++DYVSNLKMILHLFESASGLNINLSKSTI
Sbjct: 486  KGKINGVNFGPNLNLTHILFADDILIFVEDKEDYVSNLKMILHLFESASGLNINLSKSTI 545

Query: 803  FPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS 862
            FPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSSNFWDN+LQKIQKKLSSWKYS
Sbjct: 546  FPINVPTDRANSIVDSWGISKGQLPTTYLGMPLGGKPSSSNFWDNILQKIQKKLSSWKYS 605

Query: 863  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWN 922
            QLSKGGRITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNISLIRWN
Sbjct: 606  QLSKGGRITLINSTLESLPIYQLSVFKVPKGIAQKIEAYWRNFLWNGTSNGHNISLIRWN 665

Query: 923  QIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGK 982
            Q+VSPKEKGGLGIHSV+STNFALLCKWLWKFLTEK+PLWKRLIISKYDQEKMGRFPSRGK
Sbjct: 666  QVVSPKEKGGLGIHSVHSTNFALLCKWLWKFLTEKEPLWKRLIISKYDQEKMGRFPSRGK 725

Query: 983  FSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKK 1042
            +SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWNGN+PLSL VPRLFALSTNKK
Sbjct: 726  YSSNNSPWKAVTNCISWFYKNIGWKVNDGEDISFWLDNWNGNSPLSLVVPRLFALSTNKK 785

Query: 1043 GSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFD 1102
            GSVKD WNPS  DW++H+NRPLRDHEKNLWHNIKASLPTPLP+RG  KPLWKLNSNNIFD
Sbjct: 786  GSVKDLWNPSLKDWNIHVNRPLRDHEKNLWHNIKASLPTPLPDRGPSKPLWKLNSNNIFD 845

Query: 1103 TASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW 1162
            TAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW
Sbjct: 846  TASIKKDLSEASASPTNFHPSLYKTLWKVDFPKKCKFFIWTLIHGCINTADRLQKRLPNW 905

Query: 1163 ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIR 1222
             LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKWN TP DV+SL QNICSLNI+
Sbjct: 906  TLSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAQALLKWNSTPNDVKSLAQNICSLNIK 965

Query: 1223 NQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDC 1281
             QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWEDILAQTGLWSCKSKLFSNYDC
Sbjct: 966  TQKGLITFNTIAILLWKIWLERNNRIFKQQKKEFQDLWEDILAQTGLWSCKSKLFSNYDC 1025

BLAST of IVF0006035 vs. ExPASy TrEMBL
Match: A0A5A7TR15 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G003050 PE=4 SV=1)

HSP 1 Score: 1760.3 bits (4558), Expect = 0.0e+00
Identity = 837/929 (90.10%), Positives = 875/929 (94.19%), Query Frame = 0

Query: 356  GKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQK 415
            GK KKGK+E SKKAWIKEIDLIDKLEAEG+ATEIHR+KR+ALKADLSQITLT+AQ+WAQK
Sbjct: 15   GKEKKGKDEVSKKAWIKEIDLIDKLEAEGTATEIHRDKRLALKADLSQITLTKAQMWAQK 74

Query: 416  CKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTD 475
            CKRIWVHEGDENSSFFHKICT RQKKCLISK+INN GQNCLNDSDI DAFIQHFEEIYTD
Sbjct: 75   CKRIWVHEGDENSSFFHKICTTRQKKCLISKVINNCGQNCLNDSDIVDAFIQHFEEIYTD 134

Query: 476  NRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSW 535
            N+NS LFIDN DWCPISNTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQKSW
Sbjct: 135  NKNSQLFIDNRDWCPISNTNRCLLDKPFNESEIWLTLKSFTKNKAPGPDGFTMDFLQKSW 194

Query: 536  SFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKA 595
            SFMK NICDIFKDFHS HTINKVVNETLITLIAKK NCETV+DF+PISLTTAIYKLIAK 
Sbjct: 195  SFMKHNICDIFKDFHSNHTINKVVNETLITLIAKKNNCETVSDFQPISLTTAIYKLIAKV 254

Query: 596  LADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDK 655
            LADRLKQTLPDTISE QMAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKAFDK
Sbjct: 255  LADRLKQTLPDTISELQMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKAFDK 314

Query: 656  LNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFI 715
            LNWRFIDF+LMKKNYS KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLS FI
Sbjct: 315  LNWRFIDFMLMKKNYSPKWRNMIASCISSVQYSILINGRPRGRIKPTRGIRQGDPLSSFI 374

Query: 716  FVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILH 775
            FVLAMDYLS LL NLA+K KINGVNF PNLNLTHILFADDILIFVED++DYVSNLKMILH
Sbjct: 375  FVLAMDYLSHLLINLAEKGKINGVNFGPNLNLTHILFADDILIFVEDKEDYVSNLKMILH 434

Query: 776  LFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFW 835
            LFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSSNFW
Sbjct: 435  LFESASGLNINLSKSTIFPINVPTDRANSIVDSWGISKGQLPTTYLGMPLGGKPSSSNFW 494

Query: 836  DNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNF 895
            DN+LQKIQKKLSSWKYSQLSKGGRITLINSTLESLP     +  VPKGIAQKIEA WRNF
Sbjct: 495  DNILQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQLSVFKVPKGIAQKIEAYWRNF 554

Query: 896  LWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLI 955
            LWNGTSNGHNISLIRWNQ+VSPKEKGGLGIH V+STNFALLCKWLWKFLTEK+PLWKRLI
Sbjct: 555  LWNGTSNGHNISLIRWNQVVSPKEKGGLGIHFVHSTNFALLCKWLWKFLTEKEPLWKRLI 614

Query: 956  ISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNA 1015
            ISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWNGN+
Sbjct: 615  ISKYDQEKMGRFPSRGKYSSNNSPWKAVTNCISWFYKNIGWKVNDGEDISFWLDNWNGNS 674

Query: 1016 PLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPN 1075
            PLSLAVPRLFALSTNKKGSVKD WNPS  DW++H+NRPLRDHEKNLWHNIKASLPTPLP+
Sbjct: 675  PLSLAVPRLFALSTNKKGSVKDLWNPSLKDWNIHVNRPLRDHEKNLWHNIKASLPTPLPD 734

Query: 1076 RGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLI 1135
            RG  KPLWKLNSNNIFDTAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIWTLI
Sbjct: 735  RGPSKPLWKLNSNNIFDTASIKKDLSEASASPTNFHPSLYKTLWKVDFPKKCKFFIWTLI 794

Query: 1136 HGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRT 1195
            HGCINTADRLQKRLPNW LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKWN T
Sbjct: 795  HGCINTADRLQKRLPNWTLSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAQALLKWNST 854

Query: 1196 PTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILA 1255
            P DV+SL QNICSLNI+ QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWEDILA
Sbjct: 855  PNDVKSLAQNICSLNIKTQKGLITFNTIAILLWKIWLERNNRIFKQQKKEFQDLWEDILA 914

Query: 1256 QTGLWSCKSKLFSNYDCCSIALNISAFVK 1281
            QTGLWSCKSKLFSNYDCCSIALNISAFVK
Sbjct: 915  QTGLWSCKSKLFSNYDCCSIALNISAFVK 943

BLAST of IVF0006035 vs. ExPASy TrEMBL
Match: A0A5D3BJP3 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005410 PE=4 SV=1)

HSP 1 Score: 1425.2 bits (3688), Expect = 0.0e+00
Identity = 701/767 (91.40%), Positives = 719/767 (93.74%), Query Frame = 0

Query: 164 GAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRW 223
           G FSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRW
Sbjct: 15  GNFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRW 74

Query: 224 KEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWE 283
           KEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFST WE
Sbjct: 75  KEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTQWE 134

Query: 284 NIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQ 343
           NIFPGHTSKVLTRTTSDHFPIVLESS+ISWGPSPFRFTNAYLKDPDYK+NIEFWWGNTSQ
Sbjct: 135 NIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKRNIEFWWGNTSQ 194

Query: 344 PGYAA----------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREK 403
           PG+A             + ++ G+ KKGK+EASKKAWIKEIDLI+KLEAEG++TEIHREK
Sbjct: 195 PGFAGYSFMHRLKQLAMKIKAWGREKKGKDEASKKAWIKEIDLINKLEAEGTSTEIHREK 254

Query: 404 RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQ 463
           RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIIN  GQ
Sbjct: 255 RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINICGQ 314

Query: 464 NCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLK 523
           NCLNDSDI DAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNS LLDKPFNEAEIWLTLK
Sbjct: 315 NCLNDSDIVDAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSGLLDKPFNEAEIWLTLK 374

Query: 524 SFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENC 583
           SFAKNKAPGPDG+TMDFLQKSWSFMKQNICDIFKDFHS HTINKVVNETLIT IAKKENC
Sbjct: 375 SFAKNKAPGPDGFTMDFLQKSWSFMKQNICDIFKDFHSNHTINKVVNETLITFIAKKENC 434

Query: 584 ETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALD 643
           ETVADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR+ITEAILIANEALD
Sbjct: 435 ETVADFRPISLTTAIYKLIAKVLADRLKQTLPDTISESQMAFVKGRQITEAILIANEALD 494

Query: 644 FWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILING 703
            WRNKKERGFVIKLDIEKAFDKLNWRFIDF+LMKKNYSQKWRKMIASCISSVQYSILING
Sbjct: 495 LWRNKKERGFVIKLDIEKAFDKLNWRFIDFMLMKKNYSQKWRKMIASCISSVQYSILING 554

Query: 704 RPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFA 763
           RPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADK KINGVNF PNLNLTHILFA
Sbjct: 555 RPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKGKINGVNFGPNLNLTHILFA 614

Query: 764 DDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISK 823
           DDILIFVED+DDYVSNLKMILHLFESASGLNINLSKSTIFPINVP DRA SIADSWGISK
Sbjct: 615 DDILIFVEDKDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPADRANSIADSWGISK 674

Query: 824 GHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPY- 883
           GHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKG RITLINSTLESLP  
Sbjct: 675 GHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGRRITLINSTLESLPIY 734

Query: 884 ---ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK 917
              +  VPKGIAQKIEA WRNFLWNGTSNGHNIS     ++   K K
Sbjct: 735 QLSVFKVPKGIAQKIEAYWRNFLWNGTSNGHNISSSDGTKLSPQKRK 781

BLAST of IVF0006035 vs. ExPASy TrEMBL
Match: A0A5D3E0F6 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G001830 PE=4 SV=1)

HSP 1 Score: 1331.2 bits (3444), Expect = 0.0e+00
Identity = 663/683 (97.07%), Positives = 667/683 (97.66%), Query Frame = 0

Query: 1   MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHIS 60
           MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHIS
Sbjct: 56  MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHIS 115

Query: 61  PLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKENLREETEDDEV 120
           PLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKENLREETEDDEV
Sbjct: 116 PLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKENLREETEDDEV 175

Query: 121 SFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASW 180
           SFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASW
Sbjct: 176 SFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASW 235

Query: 181 WLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRF 240
           WLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRF
Sbjct: 236 WLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRF 295

Query: 241 NTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD 300
           NTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD
Sbjct: 296 NTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD 355

Query: 301 HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GF 360
           HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYA             
Sbjct: 356 HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAGYSFMHRLKQLAL 415

Query: 361 ENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQ 420
           + ++ G+ KKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQ
Sbjct: 416 KIKAWGREKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQ 475

Query: 421 IWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFE 480
           IWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFE
Sbjct: 476 IWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFE 535

Query: 481 EIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDF 540
           EIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDF
Sbjct: 536 EIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDF 595

Query: 541 LQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK 600
           LQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK
Sbjct: 596 LQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK 655

Query: 601 LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIE 660
           LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIE
Sbjct: 656 LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIE 715

Query: 661 KAFDKLNWRFIDFVLMKKNYSQK 674
           KAFDKLNWRFIDFVLMKKNYSQK
Sbjct: 716 KAFDKLNWRFIDFVLMKKNYSQK 738

BLAST of IVF0006035 vs. NCBI nr
Match: KAA0039950.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] >TYK24553.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 2179 bits (5646), Expect = 0.0
Identity = 1055/1144 (92.22%), Positives = 1084/1144 (94.76%), Query Frame = 0

Query: 150  NDRMISILNGPPNVGAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICF 209
            ND+  SIL+     GAFSVSIQVGSNNGA WWLSAIYGPAKRKNRPLFWEELE+LKSIC 
Sbjct: 3    NDQNFSILS--VFKGAFSVSIQVGSNNGAFWWLSAIYGPAKRKNRPLFWEELEHLKSICL 62

Query: 210  PTWILGGDFNVIRWKEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQAT 269
            PTWILGGDFNVIRWKEET+TKNPA LSM+RFN+FISNCNLIDPPL+NAK+TWSNLRAQAT
Sbjct: 63   PTWILGGDFNVIRWKEETTTKNPALLSMRRFNSFISNCNLIDPPLSNAKYTWSNLRAQAT 122

Query: 270  LSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD 329
            LSRLDRFLF++ WENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD
Sbjct: 123  LSRLDRFLFTSQWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPD 182

Query: 330  YKKNIEFWWGNTSQPGYAAGFENQSL----------GKRKKGKNEASKKAWIKEIDLIDK 389
            YKKNIEFWWGNTSQPGYA     + L          G+ KKGKNEASKKA IKEID IDK
Sbjct: 183  YKKNIEFWWGNTSQPGYAGYSFMRRLKQLALIIKTWGRDKKGKNEASKKACIKEIDQIDK 242

Query: 390  LEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ 449
            LEAEGSATEIHREKR ALKADLSQI LTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ
Sbjct: 243  LEAEGSATEIHREKRTALKADLSQINLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQ 302

Query: 450  KKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLL 509
            KKCLISKIINNSGQNCLNDSDIADAFIQHFE+IYTDNRNS LFI+NLDWCPISN NS+LL
Sbjct: 303  KKCLISKIINNSGQNCLNDSDIADAFIQHFEDIYTDNRNSQLFIENLDWCPISNINSELL 362

Query: 510  DKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVV 569
            DKPFNEAEIWLTLKSFAKNKAPGPDGY MDFLQKSWSFMKQNICDIFKDFHSTH INKVV
Sbjct: 363  DKPFNEAEIWLTLKSFAKNKAPGPDGYAMDFLQKSWSFMKQNICDIFKDFHSTHIINKVV 422

Query: 570  NETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGR 629
            NETLITLIAKKE+CET ADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR
Sbjct: 423  NETLITLIAKKEHCETAADFRPISLTTAIYKLIAKTLADRLKQTLPDTISESQMAFVKGR 482

Query: 630  KITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA 689
            +ITEAILIANEALDFWR+KKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA
Sbjct: 483  QITEAILIANEALDFWRSKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIA 542

Query: 690  SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV 749
            SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV
Sbjct: 543  SCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGV 602

Query: 750  NFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT 809
             FSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT
Sbjct: 603  KFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPT 662

Query: 810  DRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGR 869
            DRAKSIADSWGISKGHLPTSYLGMPLGG+PSSSNFWDNVLQKIQKKLS+WKYSQLSKGGR
Sbjct: 663  DRAKSIADSWGISKGHLPTSYLGMPLGGRPSSSNFWDNVLQKIQKKLSNWKYSQLSKGGR 722

Query: 870  ITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKE 929
            ITLINSTLESLP     +  VPKGIAQKIEASWRNFLWNG SNGHNISLIRWNQIVSPKE
Sbjct: 723  ITLINSTLESLPIYQMSVFKVPKGIAQKIEASWRNFLWNGASNGHNISLIRWNQIVSPKE 782

Query: 930  KGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSP 989
            KGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYD+EKMG FPS GKFSSNNSP
Sbjct: 783  KGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDKEKMGSFPSHGKFSSNNSP 842

Query: 990  WKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFW 1049
            WKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVK+FW
Sbjct: 843  WKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKEFW 902

Query: 1050 NPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTASVKRI 1109
            NPSSNDWHLHINRPLRDHE+NLWHNIKASLPTPLPNRG PKPLW LNSNNIFDTASVKR 
Sbjct: 903  NPSSNDWHLHINRPLRDHEENLWHNIKASLPTPLPNRGHPKPLWNLNSNNIFDTASVKRA 962

Query: 1110 LSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWC 1169
            ++EAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW LSPNWC
Sbjct: 963  IAEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWTLSPNWC 1022

Query: 1170 YMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIRNQKGLIT 1229
            YMCNKSQEDINHLFIHCPYSQQLWSKAKALL WN TPTDVQSL+QNICSLNIRNQKGLIT
Sbjct: 1023 YMCNKSQEDINHLFIHCPYSQQLWSKAKALLNWNSTPTDVQSLIQNICSLNIRNQKGLIT 1082

Query: 1230 FNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDCCSIALNI 1279
            FNT+AT+LWKIWLERNNRIFKQQ K  QDLWED LAQ GLWSCKSKLFSNYDCCSIALNI
Sbjct: 1083 FNTNATILWKIWLERNNRIFKQQEKAPQDLWEDTLAQIGLWSCKSKLFSNYDCCSIALNI 1142

BLAST of IVF0006035 vs. NCBI nr
Match: KAA0041397.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 1916 bits (4964), Expect = 0.0
Identity = 918/1032 (88.95%), Positives = 960/1032 (93.02%), Query Frame = 0

Query: 263  NLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTN 322
            NLRAQATLSRLDRFLFS  WEN FPGHTSK LTRTTSDHFPIVLESS+ISWGP PFRFTN
Sbjct: 6    NLRAQATLSRLDRFLFSPQWENTFPGHTSKTLTRTTSDHFPIVLESSSISWGPPPFRFTN 65

Query: 323  AYLKDPDYKKNIEFWWGNTSQPGYAA----------GFENQSLGKRKKGKNEASKKAWIK 382
            AYLKDPDYK+NIEFWWGNTSQPG+A             + ++ GK KKGK+E SKKAWIK
Sbjct: 66   AYLKDPDYKRNIEFWWGNTSQPGFAGYSFMRRLKQLAMKIKAWGKEKKGKDEVSKKAWIK 125

Query: 383  EIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFH 442
            EI+LIDKLEAEG+ATEIHR KR+ALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFH
Sbjct: 126  EINLIDKLEAEGTATEIHRVKRLALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFH 185

Query: 443  KICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPIS 502
            KICTARQKKCLISK+INN GQNCLNDSDI DAFIQHFEEIYTDN+NS LFIDNLDWCPIS
Sbjct: 186  KICTARQKKCLISKVINNCGQNCLNDSDIVDAFIQHFEEIYTDNKNSQLFIDNLDWCPIS 245

Query: 503  NTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHST 562
            NTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQKSWSFMK NICDIFKDFHS 
Sbjct: 246  NTNRCLLDKPFNESEIWLTLKSFTKNKAPGPDGFTMDFLQKSWSFMKHNICDIFKDFHSN 305

Query: 563  HTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQ 622
            HTINKVVNETLITLIAKK+NCETV+DFRPISLTTAIYKLIAK LADRLKQTLP TISE Q
Sbjct: 306  HTINKVVNETLITLIAKKDNCETVSDFRPISLTTAIYKLIAKVLADRLKQTLPYTISELQ 365

Query: 623  MAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQ 682
            MAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDF+LMKKNYS 
Sbjct: 366  MAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKAFDKLNWRFIDFMLMKKNYSP 425

Query: 683  KWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLAD 742
            KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLSPFIFVLAMDYLS LL NLA+
Sbjct: 426  KWRNMIASCISSVQYSILINGRPRGRIKPTRGIRQGDPLSPFIFVLAMDYLSHLLINLAE 485

Query: 743  KRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTI 802
            K KINGVNF PNLNLTHILFADDILIFVED++DYVSNLKMILHLFESASGLNINLSKSTI
Sbjct: 486  KGKINGVNFGPNLNLTHILFADDILIFVEDKEDYVSNLKMILHLFESASGLNINLSKSTI 545

Query: 803  FPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYS 862
            FPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSSNFWDN+LQKIQKKLSSWKYS
Sbjct: 546  FPINVPTDRANSIVDSWGISKGQLPTTYLGMPLGGKPSSSNFWDNILQKIQKKLSSWKYS 605

Query: 863  QLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWN 922
            QLSKGGRITLINSTLESLP     +  VPKGIAQKIEA WRNFLWNGTSNGHNISLIRWN
Sbjct: 606  QLSKGGRITLINSTLESLPIYQLSVFKVPKGIAQKIEAYWRNFLWNGTSNGHNISLIRWN 665

Query: 923  QIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGK 982
            Q+VSPKEKGGLGIHSV+STNFALLCKWLWKFLTEK+PLWKRLIISKYDQEKMGRFPSRGK
Sbjct: 666  QVVSPKEKGGLGIHSVHSTNFALLCKWLWKFLTEKEPLWKRLIISKYDQEKMGRFPSRGK 725

Query: 983  FSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKK 1042
            +SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWNGN+PLSL VPRLFALSTNKK
Sbjct: 726  YSSNNSPWKAVTNCISWFYKNIGWKVNDGEDISFWLDNWNGNSPLSLVVPRLFALSTNKK 785

Query: 1043 GSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFD 1102
            GSVKD WNPS  DW++H+NRPLRDHEKNLWHNIKASLPTPLP+RG  KPLWKLNSNNIFD
Sbjct: 786  GSVKDLWNPSLKDWNIHVNRPLRDHEKNLWHNIKASLPTPLPDRGPSKPLWKLNSNNIFD 845

Query: 1103 TASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW 1162
            TAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIWTLIHGCINTADRLQKRLPNW
Sbjct: 846  TASIKKDLSEASASPTNFHPSLYKTLWKVDFPKKCKFFIWTLIHGCINTADRLQKRLPNW 905

Query: 1163 ALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQNICSLNIR 1222
             LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKWN TP DV+SL QNICSLNI+
Sbjct: 906  TLSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAQALLKWNSTPNDVKSLAQNICSLNIK 965

Query: 1223 NQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWEDILAQTGLWSCKSKLFSNYDC 1280
             QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWEDILAQTGLWSCKSKLFSNYDC
Sbjct: 966  TQKGLITFNTIAILLWKIWLERNNRIFKQQKKEFQDLWEDILAQTGLWSCKSKLFSNYDC 1025

BLAST of IVF0006035 vs. NCBI nr
Match: KAA0044556.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 1752 bits (4537), Expect = 0.0
Identity = 837/932 (89.81%), Positives = 877/932 (94.10%), Query Frame = 0

Query: 353  QSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQIW 412
            ++ GK KKGK+E SKKAWIKEIDLIDKLEAEG+ATEIHR+KR+ALKADLSQITLT+AQ+W
Sbjct: 12   KAWGKEKKGKDEVSKKAWIKEIDLIDKLEAEGTATEIHRDKRLALKADLSQITLTKAQMW 71

Query: 413  AQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEI 472
            AQKCKRIWVHEGDENSSFFHKICT RQKKCLISK+INN GQNCLNDSDI DAFIQHFEEI
Sbjct: 72   AQKCKRIWVHEGDENSSFFHKICTTRQKKCLISKVINNCGQNCLNDSDIVDAFIQHFEEI 131

Query: 473  YTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQ 532
            YTDN+NS LFIDN DWCPISNTN  LLDKPFNE+EIWLTLKSF KNKAPGPDG+TMDFLQ
Sbjct: 132  YTDNKNSQLFIDNRDWCPISNTNRCLLDKPFNESEIWLTLKSFTKNKAPGPDGFTMDFLQ 191

Query: 533  KSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLI 592
            KSWSFMK NICDIFKDFHS HTINKVVNETLITLIAKK NCETV+DF+PISLTTAIYKLI
Sbjct: 192  KSWSFMKHNICDIFKDFHSNHTINKVVNETLITLIAKKNNCETVSDFQPISLTTAIYKLI 251

Query: 593  AKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIEKA 652
            AK LADRLKQTLPDTISE QMAFVKGR+ITEAILIANEALDFWRNKKERGFVIKLDIEKA
Sbjct: 252  AKVLADRLKQTLPDTISELQMAFVKGRQITEAILIANEALDFWRNKKERGFVIKLDIEKA 311

Query: 653  FDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILINGRPRGRIKPSRGIRQGDPLS 712
            FDKLNWRFIDF+LMKKNYS KWR MIASCISSVQYSILINGRPRGRIKP+RGIRQGDPLS
Sbjct: 312  FDKLNWRFIDFMLMKKNYSPKWRNMIASCISSVQYSILINGRPRGRIKPTRGIRQGDPLS 371

Query: 713  PFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFADDILIFVEDRDDYVSNLKM 772
             FIFVLAMDYLS LL NLA+K KINGVNF PNLNLTHILFADDILIFVED++DYVSNLKM
Sbjct: 372  SFIFVLAMDYLSHLLINLAEKGKINGVNFGPNLNLTHILFADDILIFVEDKEDYVSNLKM 431

Query: 773  ILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISKGHLPTSYLGMPLGGKPSSS 832
            ILHLFESASGLNINLSKSTIFPINVPTDRA SI DSWGISKG LPT+YLGMPLGGKPSSS
Sbjct: 432  ILHLFESASGLNINLSKSTIFPINVPTDRANSIVDSWGISKGQLPTTYLGMPLGGKPSSS 491

Query: 833  NFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPY----ISNVPKGIAQKIEASW 892
            NFWDN+LQKIQKKLSSWKYSQLSKGGRITLINSTLESLP     +  VPKGIAQKIEA W
Sbjct: 492  NFWDNILQKIQKKLSSWKYSQLSKGGRITLINSTLESLPIYQLSVFKVPKGIAQKIEAYW 551

Query: 893  RNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKWLWKFLTEKDPLWK 952
            RNFLWNGTSNGHNISLIRWNQ+VSPKEKGGLGIH V+STNFALLCKWLWKFLTEK+PLWK
Sbjct: 552  RNFLWNGTSNGHNISLIRWNQVVSPKEKGGLGIHFVHSTNFALLCKWLWKFLTEKEPLWK 611

Query: 953  RLIISKYDQEKMGRFPSRGKFSSNNSPWKAVTECISWFYKNISWKVNDGEDISFWLDNWN 1012
            RLIISKYDQEKMGRFPSRGK+SSNNSPWKAVT CISWFYKNI WKVNDGEDISFWLDNWN
Sbjct: 612  RLIISKYDQEKMGRFPSRGKYSSNNSPWKAVTNCISWFYKNIGWKVNDGEDISFWLDNWN 671

Query: 1013 GNAPLSLAVPRLFALSTNKKGSVKDFWNPSSNDWHLHINRPLRDHEKNLWHNIKASLPTP 1072
            GN+PLSLAVPRLFALSTNKKGSVKD WNPS  DW++H+NRPLRDHEKNLWHNIKASLPTP
Sbjct: 672  GNSPLSLAVPRLFALSTNKKGSVKDLWNPSLKDWNIHVNRPLRDHEKNLWHNIKASLPTP 731

Query: 1073 LPNRGLPKPLWKLNSNNIFDTASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIW 1132
            LP+RG  KPLWKLNSNNIFDTAS+K+ LSEA  SP NFHP+LYKTLWKV+FPKKCKFFIW
Sbjct: 732  LPDRGPSKPLWKLNSNNIFDTASIKKDLSEASASPTNFHPSLYKTLWKVDFPKKCKFFIW 791

Query: 1133 TLIHGCINTADRLQKRLPNWALSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALLKW 1192
            TLIHGCINTADRLQKRLPNW LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKA+ALLKW
Sbjct: 792  TLIHGCINTADRLQKRLPNWTLSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAQALLKW 851

Query: 1193 NRTPTDVQSLVQNICSLNIRNQKGLITFNTSATLLWKIWLERNNRIFKQQGKDSQDLWED 1252
            N TP DV+SL QNICSLNI+ QKGLITFNT A LLWKIWLERNNRIFKQQ K+ QDLWED
Sbjct: 852  NSTPNDVKSLAQNICSLNIKTQKGLITFNTIAILLWKIWLERNNRIFKQQKKEFQDLWED 911

Query: 1253 ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK 1280
            ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK
Sbjct: 912  ILAQTGLWSCKSKLFSNYDCCSIALNISAFVK 943

BLAST of IVF0006035 vs. NCBI nr
Match: TYJ99326.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 1419 bits (3674), Expect = 0.0
Identity = 701/767 (91.40%), Positives = 719/767 (93.74%), Query Frame = 0

Query: 164 GAFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRW 223
           G FSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRW
Sbjct: 15  GNFSVSIQVGSNNGASWWLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRW 74

Query: 224 KEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWE 283
           KEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFST WE
Sbjct: 75  KEETSTKNPASLSMKRFNTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTQWE 134

Query: 284 NIFPGHTSKVLTRTTSDHFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQ 343
           NIFPGHTSKVLTRTTSDHFPIVLESS+ISWGPSPFRFTNAYLKDPDYK+NIEFWWGNTSQ
Sbjct: 135 NIFPGHTSKVLTRTTSDHFPIVLESSSISWGPSPFRFTNAYLKDPDYKRNIEFWWGNTSQ 194

Query: 344 PGYAA----------GFENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREK 403
           PG+A             + ++ G+ KKGK+EASKKAWIKEIDLI+KLEAEG++TEIHREK
Sbjct: 195 PGFAGYSFMHRLKQLAMKIKAWGREKKGKDEASKKAWIKEIDLINKLEAEGTSTEIHREK 254

Query: 404 RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQ 463
           RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIIN  GQ
Sbjct: 255 RIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINICGQ 314

Query: 464 NCLNDSDIADAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLK 523
           NCLNDSDI DAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNS LLDKPFNEAEIWLTLK
Sbjct: 315 NCLNDSDIVDAFIQHFEEIYTDNRNSHLFIDNLDWCPISNTNSGLLDKPFNEAEIWLTLK 374

Query: 524 SFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENC 583
           SFAKNKAPGPDG+TMDFLQKSWSFMKQNICDIFKDFHS HTINKVVNETLIT IAKKENC
Sbjct: 375 SFAKNKAPGPDGFTMDFLQKSWSFMKQNICDIFKDFHSNHTINKVVNETLITFIAKKENC 434

Query: 584 ETVADFRPISLTTAIYKLIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALD 643
           ETVADFRPISLTTAIYKLIAK LADRLKQTLPDTISESQMAFVKGR+ITEAILIANEALD
Sbjct: 435 ETVADFRPISLTTAIYKLIAKVLADRLKQTLPDTISESQMAFVKGRQITEAILIANEALD 494

Query: 644 FWRNKKERGFVIKLDIEKAFDKLNWRFIDFVLMKKNYSQKWRKMIASCISSVQYSILING 703
            WRNKKERGFVIKLDIEKAFDKLNWRFIDF+LMKKNYSQKWRKMIASCISSVQYSILING
Sbjct: 495 LWRNKKERGFVIKLDIEKAFDKLNWRFIDFMLMKKNYSQKWRKMIASCISSVQYSILING 554

Query: 704 RPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNLNLTHILFA 763
           RPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADK KINGVNF PNLNLTHILFA
Sbjct: 555 RPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKGKINGVNFGPNLNLTHILFA 614

Query: 764 DDILIFVEDRDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPTDRAKSIADSWGISK 823
           DDILIFVED+DDYVSNLKMILHLFESASGLNINLSKSTIFPINVP DRA SIADSWGISK
Sbjct: 615 DDILIFVEDKDDYVSNLKMILHLFESASGLNINLSKSTIFPINVPADRANSIADSWGISK 674

Query: 824 GHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLINSTLESLPY- 883
           GHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKG RITLINSTLESLP  
Sbjct: 675 GHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGRRITLINSTLESLPIY 734

Query: 884 ---ISNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEK 916
              +  VPKGIAQKIEA WRNFLWNGTSNGHNIS     ++   K K
Sbjct: 735 QLSVFKVPKGIAQKIEAYWRNFLWNGTSNGHNISSSDGTKLSPQKRK 781

BLAST of IVF0006035 vs. NCBI nr
Match: TYK29577.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 1325 bits (3429), Expect = 0.0
Identity = 663/683 (97.07%), Positives = 667/683 (97.66%), Query Frame = 0

Query: 1   MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHIS 60
           MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHIS
Sbjct: 56  MSYAKMEAKSKRPTQSIKKKVYRVKSRSMERETPQTSRQKDKEKIDPNEFELVVDLGHIS 115

Query: 61  PLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKENLREETEDDEV 120
           PLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKENLREETEDDEV
Sbjct: 116 PLSDTDFSCPESPSYIPSPTSPTESDIVKDSLASMMTCAHEDREKKKKENLREETEDDEV 175

Query: 121 SFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASW 180
           SFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASW
Sbjct: 176 SFKRKLTDWLKENNLRLAADFNSQFNSVTNDRMISILNGPPNVGAFSVSIQVGSNNGASW 235

Query: 181 WLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRF 240
           WLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRF
Sbjct: 236 WLSAIYGPAKRKNRPLFWEELENLKSICFPTWILGGDFNVIRWKEETSTKNPASLSMKRF 295

Query: 241 NTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD 300
           NTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD
Sbjct: 296 NTFISNCNLIDPPLTNAKFTWSNLRAQATLSRLDRFLFSTHWENIFPGHTSKVLTRTTSD 355

Query: 301 HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAA----------GF 360
           HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYA             
Sbjct: 356 HFPIVLESSTISWGPSPFRFTNAYLKDPDYKKNIEFWWGNTSQPGYAGYSFMHRLKQLAL 415

Query: 361 ENQSLGKRKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQ 420
           + ++ G+ KKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQ
Sbjct: 416 KIKAWGREKKGKNEASKKAWIKEIDLIDKLEAEGSATEIHREKRIALKADLSQITLTEAQ 475

Query: 421 IWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFE 480
           IWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFE
Sbjct: 476 IWAQKCKRIWVHEGDENSSFFHKICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFE 535

Query: 481 EIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDF 540
           EIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDF
Sbjct: 536 EIYTDNRNSHLFIDNLDWCPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDF 595

Query: 541 LQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK 600
           LQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK
Sbjct: 596 LQKSWSFMKQNICDIFKDFHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYK 655

Query: 601 LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIE 660
           LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIE
Sbjct: 656 LIAKALADRLKQTLPDTISESQMAFVKGRKITEAILIANEALDFWRNKKERGFVIKLDIE 715

Query: 661 KAFDKLNWRFIDFVLMKKNYSQK 673
           KAFDKLNWRFIDFVLMKKNYSQK
Sbjct: 716 KAFDKLNWRFIDFVLMKKNYSQK 738
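
The percentages in an HSP summary line can be recomputed from the raw counts it reports. The following is a minimal sketch, not part of the database's own tooling: the function name `check_hsp_stats` and the returned dictionary keys are hypothetical, and the regular expression assumes the summary-line layout shown on this page (it accepts both the "Postives" and "Positives" spellings).

```python
import re

# Matches the HSP summary line layout used on this page (assumed format).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Recompute identity/positive percentages and compare with the reported ones."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("line does not look like an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    ident_calc = 100.0 * int(ident) / int(ident_len)
    pos_calc = 100.0 * int(pos) / int(pos_len)
    return {
        "aligned_length": int(ident_len),
        # Allow for rounding to two decimals in the reported values.
        "identity_ok": abs(ident_calc - float(ident_pct)) < 0.01,
        "positives_ok": abs(pos_calc - float(pos_pct)) < 0.01,
    }


result = check_hsp_stats(
    "Identity = 663/683 (97.07%), Postives = 667/683 (97.66%), Query Frame = 0"
)
print(result)
```

For the TYK29577.1 hit above, both reported percentages agree with the counts 663/683 and 667/683 to within rounding.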

BLAST of IVF0006035 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 114.4 bits (285), Expect = 6.8e-25
Identity = 99/404 (24.50%), Positives = 174/404 (43.07%), Query Frame = 0

Query: 213 ILGGDFNVIRWKEETSTKNPASLSMK---RFNTFISNCNLIDPPLTNAKFTWSNLRAQAT 272
           IL GDF+ I    +  +    S+ M+    F   + + +L+D P     +TWSN +    
Sbjct: 222 ILVGDFDQIAATSDHYSVLQTSIPMRGLEEFQNCLRDSDLVDIPSRGVHYTWSNHQDDNP 281

Query: 273 LSR-LDRFLFSTHWENIFPGHTSKVLTRTTSDHFP-IVLESSTISWGPSPFRFTNAYLKD 332
           + R LDR + +  W + FP   +       SDH P I++  +        FR+ +     
Sbjct: 282 IIRKLDRAIANGDWFSSFPSAIAVFELSGVSDHSPCIIILENLPKRSKKCFRYFSFLSTH 341

Query: 333 PDYKKNIEFWWGNTSQPGYAAGFENQSLGKRKKGKNEASK--------KAWIKEIDLIDK 392
           P +  ++   W    +     G    SLG+  K   +  K            K  + +D 
Sbjct: 342 PTFLVSLTVAW----EEQIPVGSHMFSLGEHLKAAKKCCKLLNRQGFGNIQHKTKEALDS 401

Query: 393 LEAEGS------ATEIHREKRIALKADLSQITLTEAQIWAQKCKRIWVHEGDENSSFFHK 452
           LE+  S      +  + R + +A K   +         + QK +  W+ +GD N+ FFHK
Sbjct: 402 LESIQSQLLTNPSDSLFRVEHVARK-KWNFFAAALESFYRQKSRIKWLQDGDANTRFFHK 461

Query: 453 ICTARQKKCLISKIINNSGQNCLNDSDIADAFIQHFEEIYTDNR-----NSHLFIDNLDW 512
           +  A Q K LI  +  +      N + + +  + ++  +   +      +S   I ++  
Sbjct: 462 VILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLGSDSDILTPDSVQRIKDIHP 521

Query: 513 CPISNTNSDLLDKPFNEAEIWLTLKSFAKNKAPGPDGYTMDFLQKSWSFMKQNICDIFKD 572
              ++T +  L    ++ EI   + +  +NKAPGPD +T +F  +SW  +K +     K+
Sbjct: 522 FRCNDTLASRLSALPSDKEITAAVFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKE 581

Query: 573 FHSTHTINKVVNETLITLIAKKENCETVADFRPISLTTAIYKLI 593
           F  T  + K  N T ITLI K    + ++ FRP+S  T +YK+I
Sbjct: 582 FFRTGHLLKRFNATAITLIPKVTGVDQLSMFRPVSCCTVVYKII 620

BLAST of IVF0006035 vs. TAIR 10
Match: AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 107.8 bits (268), Expect = 6.4e-23
Identity = 88/383 (22.98%), Positives = 157/383 (40.99%), Query Frame = 0

Query: 805  IADSWGISKGHLPTSYLGMPLGGKPSSSNFWDNVLQKIQKKLSSWKYSQLSKGGRITLIN 864
            I  S+  + G LP  YLG+PL  K  +++ +  +++KI+ ++  W    LS  GR+ LI+
Sbjct: 12   ILHSFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLIS 71

Query: 865  STLESLPYI----SNVPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLG 924
            S + SL         +P    ++I++   +FLW+G       + + W+ + +PK++GGLG
Sbjct: 72   SVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLG 131

Query: 925  IHSVNSTNFALLCKWLWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNSPWKAVT 984
            I S+   N              K   W                   G  +  +  WK + 
Sbjct: 132  IRSLKEAN--------------KGSFWS----------------ISGNTTLGSWMWKKIL 191

Query: 985  ECISWFYKNISWKVNDGEDISFWLDNWNGNAPLSLAVPRLFALSTNKKGSVKDFWNPSSN 1044
            +  +     +   +++G + SFW DNW+        + RL  + T  +G +       ++
Sbjct: 192  KHRALASGFVKHDIHNGSNTSFWFDNWS-------KIGRLIDV-TGHRGCIDMGITLHAS 251

Query: 1045 DWHLHINRPLRDHEKNLWHNIKASLPTPLPNRGLPK----PLWKLNSNNIFDTASVKRIL 1104
                 +N   R H  +    I+  +   + ++GL        WK N +      + K   
Sbjct: 252  VAEAVVNHRPRRHRHDTLLRIE-DVIAEVRHQGLTSGEDTVRWKGNGDIFKPCFNTKE-- 311

Query: 1105 SEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNW-ALSPNWC 1164
            + A         N YK +W      K     W  I   + T DR+     +W A + + C
Sbjct: 312  TWAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGDRML----SWNAGADSSC 349

Query: 1165 YMCNKSQEDINHLFIHCPYSQQL 1179
             +C+   E  +HLF  CPYS ++
Sbjct: 372  VLCHHLVETRDHLFFTCPYSAEV 349

BLAST of IVF0006035 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 104.4 bits (259), Expect = 7.0e-22
Identity = 91/419 (21.72%), Positives = 165/419 (39.38%), Query Frame = 0

Query: 876  VPKGIAQKIEASWRNFLWNGTSNGHNISLIRWNQIVSPKEKGGLGIHSVNSTNFALLCKW 935
            +PK + ++I +   +F W        +    W+ +   K +GG+G   + + N ALL K 
Sbjct: 13   LPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDIEAFNLALLGKQ 72

Query: 936  LWKFLTEKDPLWKRLIISKYDQEKMGRFPSRGKFSSNNS-PWKAVTECISWFYKNISWKV 995
            +W+ L+  + L  ++  S+Y  +     P      S  S  WK++        +     V
Sbjct: 73   MWRMLSRPESLMAKVFKSRYFHKS---DPLNAPLGSRPSFVWKSIHASQEILRQGARAVV 132

Query: 996  NDGEDISFWLDNWNGNAPLSLA-----VPRLFALSTNKKGSVKDFWNPSSNDWHLHINRP 1055
             +GEDI  W   W  + P S A     VP     S +    V D  + S  +W   +   
Sbjct: 133  GNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGREWRKDVIEM 192

Query: 1056 L-RDHEKNLWHNIKASLPTPLPNRGLPKPLWKLNSNNIFDTAS--------VKRILSEAP 1115
            L  + E+ L   ++     P   R L    W   S+  +   S        + +  S   
Sbjct: 193  LFPEVERKLIGELR-----PGGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKRSSPQE 252

Query: 1116 ISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWALSPNWCYMCNK 1175
            +S  + +P +Y+ +WK +   K + F+W  +   +  A  L  R        + C  C  
Sbjct: 253  VSEPSLNP-IYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYR---HLSKESACIRCPS 312

Query: 1176 SQEDINHLFIHCPYSQQLWSKAKALLKWNRTPTDVQSLVQN---ICSLNIRNQKGLITFN 1235
             +E +NHL   C +++  W+ +   +       D  S+  N   + +L   N +      
Sbjct: 313  CKETVNHLLFKCTFARLTWAISSIPIPLGGEWAD--SIYVNLYWVFNLGNGNPQWEKASQ 372

Query: 1236 TSATLLWKIWLERNNRIFKQQGKDSQDL-------WEDILAQTGLWSCKSKLFSNYDCC 1270
                LLW++W  RN  +F+ +  ++Q++        E+   +T   SC +K   N   C
Sbjct: 373  LVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRSSC 417

BLAST of IVF0006035 vs. TAIR 10
Match: AT2G02650.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 71.6 bits (174), Expect = 5.1e-12
Identity = 38/151 (25.17%), Positives = 71/151 (47.02%), Query Frame = 0

Query: 1090 ASVKRILSEAPISPANFHPNLYKTLWKVEFPKKCKFFIWTLIHGCINTADRLQKRLPNWA 1149
            A+ + +L E  I P      + + +WK+    K K F+W  + G + T  RL+ R  N  
Sbjct: 13   ATHEDLLEEEAIQPPPGSTEVKQAIWKLHVAPKIKHFLWRCVTGALATNTRLRSR--NID 72

Query: 1150 LSPNWCYMCNKSQEDINHLFIHCPYSQQLWSKAKALL--KWNRTPTDVQSLVQNICSLNI 1209
              P  C  C   +E I+H+  +CPY+Q +W  A  ++  +W   P+  +  +  +  L+ 
Sbjct: 73   ADP-ICQRCCIEEETIHHIMFNCPYTQSVWRSANIIIGNQWG-PPSSFEDNLNRLIQLSK 132

Query: 1210 RNQKGLITFNTSATLLWKIWLERNNRIFKQQ 1239
                  +       ++W++W  RN  +F+Q+
Sbjct: 133  TQTTNSLDRFLPFWIMWRLWKSRNVFLFQQK 159

BLAST of IVF0006035 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 70.9 bits (172), Expect = 8.6e-12
Identity = 31/67 (46.27%), Positives = 46/67 (68.66%), Query Frame = 0

Query: 690 LINGRPRGRIKPSRGIRQGDPLSPFIFVLAMDYLSRLLNNLADKRKINGVNFSPNL-NLT 749
           +ING P+G + PSRG+RQGDPLSP++F+L  + LS L     ++ ++ G+  S N   + 
Sbjct: 13  IINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRIN 72

Query: 750 HILFADD 756
           H+LFADD
Sbjct: 73  HLLFADD 79

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
P08548	8.3e-44	24.14	LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1
P11369	5.1e-41	23.30	LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE...
P14381	1.8e-38	23.97	Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV...
O00370	6.8e-38	23.48	LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1
P0C2F6	1.0e-33	24.94	Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1...

Match Name	E-value	Identity	Description
A0A5A7T9I7	0.0e+00	92.13	LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5A7TIB8	0.0e+00	88.95	LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5A7TR15	0.0e+00	90.10	LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3BJP3	0.0e+00	91.40	LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3E0F6	0.0e+00	97.07	LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...

Match Name	E-value	Identity	Description
KAA0039950.1	0.0	92.22	LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] >TYK245...
KAA0041397.1	0.0	88.95	LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
KAA0044556.1	0.0	89.81	LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYJ99326.1	0.0	91.40	LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYK29577.1	0.0	97.07	LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]

Match Name	E-value	Identity	Description
AT1G43760.1	6.8e-25	24.50	DNAse I-like superfamily protein
AT3G24255.1	6.4e-23	22.98	RNA-directed DNA polymerase (reverse transcriptase)-related family protein
AT4G29090.1	7.0e-22	21.72	Ribonuclease H-like superfamily protein
AT2G02650.1	5.1e-12	25.17	Ribonuclease H-like superfamily protein
ATMG01250.1	8.6e-12	46.27	RNA-directed DNA polymerase (reverse transcriptase)

InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 98..118
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..48
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 23..48
None	No IPR available	PANTHER	PTHR33116:SF38	OS01G0158850 PROTEIN	coord: 261..1185
None	No IPR available	PANTHER	PTHR33116	REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATED	coord: 261..1185
None	No IPR available	CDD	cd01650	RT_nLTR_like	coord: 561..825; e-value: 5.20064E-54; score: 185.958
IPR000477	Reverse transcriptase domain	PFAM	PF00078	RVT_1	coord: 569..824; e-value: 7.1E-45; score: 153.2
IPR000477	Reverse transcriptase domain	PROSITE	PS50878	RT_POL	coord: 548..825; score: 17.587692
IPR036691	Endonuclease/exonuclease/phosphatase superfamily	GENE3D	3.60.10.10	Endonuclease/exonuclease/phosphatase	coord: 150..307; e-value: 2.7E-21; score: 78.4
IPR036691	Endonuclease/exonuclease/phosphatase superfamily	SUPERFAMILY	56219	DNase I-like	coord: 159..307
IPR026960	Reverse transcriptase zinc-binding domain	PFAM	PF13966	zf-RVT	coord: 1108..1179; e-value: 1.7E-17; score: 63.9
IPR043502	DNA/RNA polymerase superfamily	SUPERFAMILY	56672	DNA/RNA polymerases	coord: 512..853
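
The "coord: start..end" values in the Alignment column give the position of each match on the protein. As a minimal sketch, assuming these coordinates are 1-based and inclusive (an assumption, not stated on this page), the length of a match in residues can be computed as below; the helper name `span_length` is hypothetical.

```python
import re

# Matches an InterPro alignment coordinate such as "coord: 569..824".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment):
    """Return the match length in residues, assuming 1-based inclusive coordinates."""
    match = COORD.search(alignment)
    if match is None:
        raise ValueError("no 'coord: start..end' found")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


# PFAM PF00078 (RVT_1) and PFAM PF13966 (zf-RVT) matches from the table above.
print(span_length("coord: 569..824"))
print(span_length("coord: 1108..1179"))
```

Under that assumption the RVT_1 match covers 256 residues and the zf-RVT match covers 72.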

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
IVF0006035.2	IVF0006035.2	mRNA