Pay0007691 (gene) Melon (Payzawat) v1

Overview
Name: Pay0007691
Type: gene
Organism: Cucumis melo L. var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Location: chr02: 21669223 .. 21675067 (-)
RNA-Seq Expression: Pay0007691
Synteny: Pay0007691
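
As a rough illustration of how the Location field can be read, the sketch below parses the coordinate string and computes the span of the feature. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr02: 21669223 .. 21675067 (-)'.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("chr02: 21669223 .. 21675067 (-)")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the feature spans 5845 bp on the minus strand of chr02.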
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCTACTTCAAATCACTTCCTAGATCATGCAAAATAGAAAGAAAGGAATTTGTCCTTCTCCTTGATAAATACGCGAAACACACTCACTACTGGCTAACCGAAACAGGGGCTCACAAAGCTTTTTCCATTGAAGTCTCCCCAAGGGATTTAGATTGGATAAGAAGCACCCTTAAATCACTGATTGAAACCCCAAGCTCGAATCGCTTCTTTCTTGAAAATCGTGATTCTGAGCATTGCATTTGGATCAGGAAAACAAGGAATGGCAAAGGATGTACTGCAGAAATCTTCAGAGTAGATCATAAAAATAGAAAATCTTGCATCCTAGTCCCGGAAGGCCCTGAGAAAAGTGGTTGGGTGTCATTCTTGTCTATGATCACTCCAAAAGTGGAAGTAAAAGCAAAGACAAGACCAACCTTCCTACCAAGAAGCAGCCCTGAGTTTCGTTTATCACCTCTCATAGATTACCACAAACGCTAATATGCAAAGGCAGTCTCTGAAGGAAGATCTTCCATTTCTAGTGACTCAAGCGACTCTTATGCCTCAAACGATTCAAGTCAATCTTCAGGTAATAGCCCCTGCGACTCCCCCTTTCCTGTCGTTCTAGAAAATACAGTGGTGTTAGTAAGACGTTTTTTTCATGATGATTGGCAAAAAATCCTACAAAACTTGAGGAAACAAACAGAGGAATCTTTTACATACCATGCCTCCATGCTGAAAAGGCCCTGGTTCATTTCAACTCAAATGTACCCGCGAATCTTCTTTGTCAAAACAAAGGGTGGACAACCGTTGGGAAATACACGGTCAGGTTTGAAAAATGGGCCCCTGCCTCCCATGCCTCTCCAAAACTCATTCCTAGCTATGGGGGATGGACAACCTTTAGAGGAATTCCACTACACTTGTGGAATATGATGACCTTTCAACAAATTGGGAAAGCGTGTGGTAGTCTGGTTAAAGTGGCTGAGGAGACAAAAACAGCAAGAAACCTGATAGAAGCAAAATTAAAAATCAGATACAACTACTCCGGCTTCTTACCAGCTTACGTGAAGATTTTTGACCAAGAAGGAAACAAATTTGTTGTTCAAGTAGTCACTCACTCAGAAGGAAAATGGTTAATGGAAAGAAATATTAGATTACATGGTACCTTCAAGAGACAAGCTGCGGCTTCTTTTGATGATTTCAATCCTGATGCAGAGCAATTCCTGTTTGATGGTCTGGAGGCCATTTCACCGGATCTCCTGAACACCATCTCCGGCAGCCGTAAAAGCATTTCTCCGGAACAGCAATCTGCATTAAAATCAGTCATCATTAAACCTGCCAGAGATGCCACGTCGCCAACTACTTTAAATGAAGAGGTAGTTAATGATAACAGTTTGCATGCAACGACTATTAATTCGAAGTTGAAGATCTTATCTGGGATATCAAATGATGGCTCTTTGGATAAAGGAAAGCAAAAGGTTGACATTCCGTCTCAACTAACTTCAGCATTTATTTTTGACAAACCCAAAAGAAAAGTCTCCTTTAATTCCCCCAATAATAAAACCACCTTTTTTAATCCGGATTCTGCCCCAACCAATCATTCCCCTCTATTGAGCTCCCCTGAGAAAAAACAAAGAGTCAGTAGAGAGAGAAGTGTTAAAAAGAAATCATTAACCATTCAGCCTAAATCAAGAGCCAATCAGGGCAAAGGTGATTTAATCACTCAACCTCTTCAAGTTGTGGCACATGATCTGGATGCTTCCAAAAAAGGTCTCTCTCTCACAGTGGATTTGGGAAATCTGCCAGCTTTAGATCCGAGTAAATCCTGCGAAGATCATCACAGCTCTGACAATGCAGAAGTCATAGACATAACTAACACCGAAGTGGTTACAGAGACACCTGAATTGAAGATGACAGATCCAAAGAAATCAAACTCCTCTCCGGAAGTCAACTATAGGAAGCAAAAACATTCTCACCGAAGAAGACACTACTATAGGAAAAAGGAAGGCAAGGAGAAGGAC
ACAAATTCAGAAGCCTTCAAAAATCAACTTGTTACTTGGCTAAAGGAAAATGGGCTGAAACTCTCTACAGATACAGACTCTTCTGGTGCAACAACTTCTACAAATGCTTTGTTTTCTCAATTGGGTTCCAGGTTAGATCCAAAGGGGGATGGGGCCCCGGGGACATCAAATGTAATATGAAACTTCTTACTTGGAATGCAAGAGGTTTAGGCTCTCCTTCTAAAAGAGCCTTAATAAAAAATACTATAATTTCCTATTCCCCCGACTTTGTGATTCTTACTGAAACAAGGCTCAAAATCACAAATAAGAGAATTATTAAGTCCCTTTGGCCTTCTAATAGCATTAATTGAATTGTTAAAAACCCTATCGATAGCTCAAGAGGAATTCTGATTTTATGGGACGCTCAACATCATTCTCTTTTAAGTCAAGAGGAAGGGAAGTTCAGCTTATCAGCAAATTTTTTGTCCTTCAATAATTCTTGGTGGTTAACTGGTTTATATGGTCCAGTCAAAAGAAGGGAAAGATTAAATTTTTGGGCAGATTTACATAATCTCCTCCATCTTAACTCTTCTCCTTGGATAATAGGGTGAGATCTAAATGTGGTCAGAATGAGAGAGGAATCTACAGCAGTCACTTCCTCTTCCCACAGCTCCAACATGCTAAATAACTTCATCTCCAACAATCTTCTGATTGACCCTCCTCTAACAAATAACAGATACACTTGGTCAAACTTGAGAAATCCTCCAACCTTTTCCCATTTAGACAGATTCTTGTACAACTCTAGTTGGGAGATTCTCTTCAATCCTCATATCACAAGGACTCTCCCGAGAACTACTTCTGACCACTTCCCCCTAGTCTGTGAAGATTCCACCTCCACTCTCAGATGGGGCCCTGCTCCATTTAGATTAAACTCCATAACTTTAAATGACCCTGAATTCAAAAGAAATATGGAAAGATGGTGGGAGCTCTCAATTCAAAATGGTCACCCCGGGTTTTCCTTCATTCAAAGACTCAAGTCTTTAGCAAACCTCATCAAACCATGGCAAAAGGAGAAATTTCACTCCCTCACTACTGCTAAAGAAAATATTATTAGGGAAGTGGATGCCATTGATAAGAATGAGTTGGATACTCCGTTGTCTCAGGAGGAAAGCAACCGTAGACTTGCTCTAAAAGCTGAGCTCAATGATCTTTCCCTTAAGGAATCCCAATTTTGGTTCCAAAGGGCAAAAAAGCTTTGGATTAAAGAAGGAGATGAAAATTATGCCTTCTTTCATAGAATTTGTTCATCAAGACAGAAGAGAAATCTCATTCACGAAATCCAGAATGAAGAAGGCTCGATTCAGAATACAAACAATAATATTTCTCTTGCCTTTGTCAATCACTTTTCAAGAATTTACAGATGTTCCACCAAAAAAGACCCTCTTTTCATTGAAAATCTTGAGTGGAATCCAATTGATTACTCAGATTGGTCTCTCCTTTGTGCCCCCTTCTTGGAGGAAGAAATTAAAGGGGTCATCAAATCTTTTGATGGAAATAAGGCTCCTGGTCCAGACGGATTTCCTATTTCATTCTTTAAGTCTTATTGGCATCTTCTAAAAGAGGATATCATGGACATCTTTAAGGATTTCTTTGAGAAGGGAGTGATCAATAAGAATATGAACAACACCTACATCGCGTTGATTGCAAAAAAGAAGGATTATTCTCATCCAAAAGATTTCAGACCAATCAGCCTAACAACGTCTATCTACAAGATCATTGCAAAGACTCTTTCCAATAGGCTAAAGCTCACCCTCCCAGATACTATCTCAGGTAACCAACTTGCTTTTATCAAAAATCGTCAAATAACAGATGCTATCTTAATAGCGAATGAAGCTTTAGACTATTGGAAAGTGAAAAAGATCAAAGGTTTTATCTTGAAGCTGGATATTGAAAAGGCTTTTGATAATCTGAACTGGGATTTCATTGATTTCGTTCTTGAGAAAAAAAATTACCCAACTTCC
TGGAGGAAATGGATAAGAGGCTGCATAAGCAATGTCACCTACTCAATCGAAGTCAATGGAAAACCACAAGGGCGAATTAAAGCCAATAGGGGTCTGAGACAAGGTGATCCCCTTTCTCCCTTCCTTTTTGTTATTGTTATGGATTACCTCAGCAGGCTCTTAAGTCATTTGGAGTCCACTGGTGCCATCAAAGGGGTGTGCCTCGCCAATGATTGCAACATTTCCCATATCCTCTTCGCTGATGATATTCTACTTTTTGTAGAAGATAATGACAACTTTCTGAATAATCTTAGAATGGCTATTTCTCTGTTTGAAAAGGCCTCTGGGCTCAAAATAAATTTGTCAAAATCAGCTATGGTTCCAGTTAATGTCTCCTGGTTAAGAGCTTTGGAGTGTACTTCATCTTGGGGTATTTCGTGCCACACTCTTCCTCTTACTTACTTAGGAGTCCCCCTTGGTGGCAACCAAAAATCCAACCTTTTCTGGAGGAATATTGAGGACAGAATCCAAAAAAAGCTTAGCAACTGGAAATATGCCCACATCTCAAAAGGTGGAAGACTCACGCTAATCAAGTCTACTCTAAGCAGTCTCCCAATTTATCAACTATCTGTCTTTCAAGCTCCTTCCTCCACGTATAAAAACATCGAAAAAATTTGGAGGAATTTCCTTTGGAAAGGTAGCGGTGGTCTAAAAGGGTCTCACTTAATCAACTGGTCAATAGTGACTAAGCCTAAGGAGGAGGGTGGGCTGGGTATATCGAGACTCCAAGTAACAAATCAAGCTCTCTTATCTAAGTGGCTTTGGCGCTACCATTCGGAGCCTAATTCCCTTTGGAGGAGGTTAATCCATATCAAGTATAAAGGTAAACACCCAGGGGACCTCCCATCAAACATTTCCTCTAGCTCCTCTAAAGCCCCGTGGAGATCTATCATCAACAACATTGACTGGTTCAAAAGTAATCAAGGTTGGAACTTGAACAATGGAGATCAAATCTCCTTCTGGTATTCTAACTGGTCTCCAGAAGGATGTCTCTCTACTGCCTATCCCAGACTTTTTGCCCTCTCTATCGACAAAGAATCCTCAATCAAAGATGTGTGGAACTCAAACAACAATCAATGGGAAATAACTTTTAGAAGAAAGTTGAATGATAGAGAACTTAGCACATGGCAGAAAATTTTAGAGAATCTTCCTATTCCGAGAACTAACAGAGGGCCAAGTAAACCTACCTGGATTCCCGACAGCAAGAAATTGTTCTCCATCGCCTCTGCTAAAAGTTGTATCTCCCACCAGCCGGATCGTCCGGTAGCGAATCCTCGAGTAAAGCTGCTTGAATTAATTTGGAAAACTCACGTTCCTATGAAGATAAAATTCTTCATGTGGTGCCTGGTTCAAAGAAAGTTAAACACTATGGAGGTCATTCAGCAAAGAATGCCAAATACGCTTCTACAACCAAATTGGTGCGTTCTCTGCAAAAAAGATAGCGAAACGGGAGCTCACCTTTTCCTTTACTGTGATCGGGTGAAGCCCCTGTGGTCCTTCCTCCATCGATCTCTCAATTTCGCACCCATTTCCGACGACTTTGAAGCGATGTTCTCCTTCTTCCTCTCCCTAAACCAATCCCTCCCGAAGCACAAGGTCGTTCTTTGTGGGCTGATAGCTATTCTTTGGGGTATTTGGACAGAGAGAAATAATAGAATTTTTTATACTTTAAGTTATCAAAAATCTATTGCTAACTTATGGGAAGGCTGCAAAATTCTGATAGGAAATTGGTGTAGTAGAGATCCTACTTTAAAAAATTATTCAGCAGCTACTATTGCTCTTAATCTTAACGCCTTCTGTAATTAG

mRNA sequence

ATGGCCTACTTCAAATCACTTCCTAGATCATGCAAAATAGAAAGAAAGGAATTTGTCCTTCTCCTTGATAAATACGCGAAACACACTCACTACTGGCTAACCGAAACAGGGGCTCACAAAGCTTTTTCCATTGAAGTCTCCCCAAGGGATTTAGATTGGATAAGAAGCACCCTTAAATCACTGATTGAAACCCCAAGCTCGAATCGCTTCTTTCTTGAAAATCGTGATTCTGAGCATTGCATTTGGATCAGGAAAACAAGGAATGGCAAAGGATGTACTGCAGAAATCTTCAGAGTAGATCATAAAAATAGAAAATCTTGCATCCTAGTCCCGGAAGGCCCTGAGAAAAGTGGTTGGGTGTCATTCTTGTCTATGATCACTCCAAAAGTGGAAGAAACAAACAGAGGAATCTTTTACATACCATGCCTCCATGCTGAAAAGGCCCTGGTTCATTTCAACTCAAATGTACCCGCGAATCTTCTTTGTCAAAACAAAGGGTGGACAACCGTTGGGAAATACACGGTCAGGTTTGAAAAATGGGCCCCTGCCTCCCATGCCTCTCCAAAACTCATTCCTAGCTATGGGGGATGGACAACCTTTAGAGGAATTCCACTACACTTGTGGAATATGATGACCTTTCAACAAATTGGGAAAGCGTGTGGTAGTCTGGTTAAAGTGGCTGAGGAGACAAAAACAGCAAGAAACCTGATAGAAGCAAAATTAAAAATCAGATACAACTACTCCGGCTTCTTACCAGCTTACGTGAAGATTTTTGACCAAGAAGGAAACAAATTTGTTGTTCAAGTAGTCACTCACTCAGAAGGAAAATGGTTAATGGAAAGAAATATTAGATTACATGGTACCTTCAAGAGACAAGCTGCGGCTTCTTTTGATGATTTCAATCCTGATGCAGAGCAATTCCTGTTTGATGGTCTGGAGGCCATTTCACCGGATCTCCTGAACACCATCTCCGGCAGCCGTAAAAGCATTTCTCCGGAACAGCAATCTGCATTAAAATCAGTCATCATTAAACCTGCCAGAGATGCCACGTCGCCAACTACTTTAAATGAAGAGGTAGTTAATGATAACAGTTTGCATGCAACGACTATTAATTCGAAGTTGAAGATCTTATCTGGGATATCAAATGATGGCTCTTTGGATAAAGGAAAGCAAAAGGTTGACATTCCGTCTCAACTAACTTCAGCATTTATTTTTGACAAACCCAAAAGAAAAGTCTCCTTTAATTCCCCCAATAATAAAACCACCTTTTTTAATCCGGATTCTGCCCCAACCAATCATTCCCCTCTATTGAGCTCCCCTGAGAAAAAACAAAGAGTCAGTAGAGAGAGAAGTGTTAAAAAGAAATCATTAACCATTCAGCCTAAATCAAGAGCCAATCAGGGCAAAGGTGATTTAATCACTCAACCTCTTCAAGTTGTGGCACATGATCTGGATGCTTCCAAAAAAGGTCTCTCTCTCACAGTGGATTTGGGAAATCTGCCAGCTTTAGATCCGAGTAAATCCTGCGAAGATCATCACAGCTCTGACAATGCAGAAGTCATAGACATAACTAACACCGAAGTGGTTACAGAGACACCTGAATTGAAGATGACAGATCCAAAGAAATCAAACTCCTCTCCGGAAGTCAACTATAGGAAGCAAAAACATTCTCACCGAAGAAGACACTACTATAGGAAAAAGGAAGGCAAGGAGAAGGACACAAATTCAGAAGCCTTCAAAAATCAACTTGTTACTTGGCTAAAGGAAAATGGGCTGAAACTCTCTACAGATACAGACTCTTCTGGGGGATGGGGCCCCGGGGACATCAAATGTAATATGAAACTTCTTACTTGGAATGCAAGAGGTTTAGGCTCTCCTTCTAAAAGAGCCTTAATAAAAAATACTATAATTTCCTATTCCCCCGACTTTGTGATTCTTACTGAAACAAGGCTCAAAATCACAAATAAGAGAATTATTAACTCAAGAGGAATTCTGATTTT
ATGGGACGCTCAACATCATTCTCTTTTAAGTCAAGAGGAAGGGAAGTTCAGCTTATCAGCAAATTTTTTGTCCTTCAATAATTCTTGGTGGTTAACTGGTTTATATGGTCCAGTCAAAAGAAGGGAAAGATTAAATTTTTGGGCAGATTTACATAATCTCCTCCATCTTAACTCTTCTCCTTGGATAATAGGCTCCAACATGCTAAATAACTTCATCTCCAACAATCTTCTGATTGACCCTCCTCTAACAAATAACAGATACACTTGGTCAAACTTGAGAAATCCTCCAACCTTTTCCCATTTAGACAGATTCTTGTACAACTCTAGTTGGGAGATTCTCTTCAATCCTCATATCACAAGGACTCTCCCGAGAACTACTTCTGACCACTTCCCCCTAGTCTGTGAAGATTCCACCTCCACTCTCAGATGGGGCCCTGCTCCATTTAGATTAAACTCCATAACTTTAAATGACCCTGAATTCAAAAGAAATATGGAAAGATGGTGGGAGCTCTCAATTCAAAATGGTCACCCCGGGTTTTCCTTCATTCAAAGACTCAAGTCTTTAGCAAACCTCATCAAACCATGGCAAAAGGAGAAATTTCACTCCCTCACTACTGCTAAAGAAAATATTATTAGGGAAGTGGATGCCATTGATAAGAATGAGTTGGATACTCCGTTGTCTCAGGAGGAAAGCAACCGTAGACTTGCTCTAAAAGCTGAGCTCAATGATCTTTCCCTTAAGGAATCCCAATTTTGGTTCCAAAGGGCAAAAAAGCTTTGGATTAAAGAAGGAGATGAAAATTATGCCTTCTTTCATAGAATTTGTTCATCAAGACAGAAGAGAAATCTCATTCACGAAATCCAGAATGAAGAAGGCTCGATTCAGAATACAAACAATAATATTTCTCTTGCCTTTGTCAATCACTTTTCAAGAATTTACAGATGTTCCACCAAAAAAGACCCTCTTTTCATTGAAAATCTTGAGTGGAATCCAATTGATTACTCAGATTGGTCTCTCCTTTGTGCCCCCTTCTTGGAGGAAGAAATTAAAGGGGTCATCAAATCTTTTGATGGAAATAAGGCTCCTGGTCCAGACGGATTTCCTATTTCATTCTTTAAGTCTTATTGGCATCTTCTAAAAGAGGATATCATGGACATCTTTAAGGATTTCTTTGAGAAGGGAGTGATCAATAAGAATATGAACAACACCTACATCGCGTTGATTGCAAAAAAGAAGGATTATTCTCATCCAAAAGATTTCAGACCAATCAGCCTAACAACGTCTATCTACAAGATCATTGCAAAGACTCTTTCCAATAGGCTAAAGCTCACCCTCCCAGATACTATCTCAGGTAACCAACTTGCTTTTATCAAAAATCGTCAAATAACAGATGCTATCTTAATAGCGAATGAAGCTTTAGACTATTGGAAAGTGAAAAAGATCAAAGGTTTTATCTTGAAGCTGGATATTGAAAAGGCTTTTGATAATCTGAACTGGGATTTCATTGATTTCGTTCTTGAGAAAAAAAATTACCCAACTTCCTGGAGGAAATGGATAAGAGGCTGCATAAGCAATGTCACCTACTCAATCGAAGTCAATGGAAAACCACAAGGGCGAATTAAAGCCAATAGGGGTCTGAGACAAGGTGATCCCCTTTCTCCCTTCCTTTTTGTTATTGTTATGGATTACCTCAGCAGGCTCTTAAGTCATTTGGAGTCCACTGGTGCCATCAAAGGGGTGTGCCTCGCCAATGATTGCAACATTTCCCATATCCTCTTCGCTGATGATATTCTACTTTTTGTAGAAGATAATGACAACTTTCTGAATAATCTTAGAATGGCTATTTCTCTGTTTGAAAAGGCCTCTGGGCTCAAAATAAATTTGTCAAAATCAGCTATGGTTCCAGTTAATGTCTCCTGGTTAAGAGCTTTGGAGTGTACTTCATCTTGGGGTATTTCGTGCCACACTCTTCCTCTTACTTACTTAGGAGTCCCCCTTG
GTGGCAACCAAAAATCCAACCTTTTCTGGAGGAATATTGAGGACAGAATCCAAAAAAAGCTTAGCAACTGGAAATATGCCCACATCTCAAAAGGTGGAAGACTCACGCTAATCAAGTCTACTCTAAGCAGTCTCCCAATTTATCAACTATCTGTCTTTCAAGCTCCTTCCTCCACGTATAAAAACATCGAAAAAATTTGGAGGAATTTCCTTTGGAAAGGTAGCGGTGGTCTAAAAGGGTCTCACTTAATCAACTGGTCAATAGTGACTAAGCCTAAGGAGGAGGGTGGGCTGGGTATATCGAGACTCCAAGTAACAAATCAAGCTCTCTTATCTAAGTGGCTTTGGCGCTACCATTCGGAGCCTAATTCCCTTTGGAGGAGGTTAATCCATATCAAGTATAAAGGTAAACACCCAGGGGACCTCCCATCAAACATTTCCTCTAGCTCCTCTAAAGCCCCGTGGAGATCTATCATCAACAACATTGACTGGTTCAAAAGTAATCAAGGTTGGAACTTGAACAATGGAGATCAAATCTCCTTCTGGTATTCTAACTGGTCTCCAGAAGGATGTCTCTCTACTGCCTATCCCAGACTTTTTGCCCTCTCTATCGACAAAGAATCCTCAATCAAAGATGTGTGGAACTCAAACAACAATCAATGGGAAATAACTTTTAGAAGAAAGTTGAATGATAGAGAACTTAGCACATGGCAGAAAATTTTAGAGAATCTTCCTATTCCGAGAACTAACAGAGGGCCAAGTAAACCTACCTGGATTCCCGACAGCAAGAAATTGTTCTCCATCGCCTCTGCTAAAAGTTGTATCTCCCACCAGCCGGATCGTCCGGTAGCGAATCCTCGAGTAAAGCTGCTTGAATTAATTTGGAAAACTCACGTTCCTATGAAGATAAAATTCTTCATGTGGTGCCTGGTTCAAAGAAAGTTAAACACTATGGAGGTCATTCAGCAAAGAATGCCAAATACGCTTCTACAACCAAATTGGTGCGTTCTCTGCAAAAAAGATAGCGAAACGGGAGCTCACCTTTTCCTTTACTGTGATCGGGTGAAGCCCCTGTGGTCCTTCCTCCATCGATCTCTCAATTTCGCACCCATTTCCGACGACTTTGAAGCGATGTTCTCCTTCTTCCTCTCCCTAAACCAATCCCTCCCGAAGCACAAGGTCGTTCTTTGTGGGCTGATAGCTATTCTTTGGGGTATTTGGACAGAGAGAAATAATAGAATTTTTTATACTTTAAGTTATCAAAAATCTATTGCTAACTTATGGGAAGGCTGCAAAATTCTGATAGGAAATTGGTGTAGTAGAGATCCTACTTTAAAAAATTATTCAGCAGCTACTATTGCTCTTAATCTTAACGCCTTCTGTAATTAG

Coding sequence (CDS)

ATGGCCTACTTCAAATCACTTCCTAGATCATGCAAAATAGAAAGAAAGGAATTTGTCCTTCTCCTTGATAAATACGCGAAACACACTCACTACTGGCTAACCGAAACAGGGGCTCACAAAGCTTTTTCCATTGAAGTCTCCCCAAGGGATTTAGATTGGATAAGAAGCACCCTTAAATCACTGATTGAAACCCCAAGCTCGAATCGCTTCTTTCTTGAAAATCGTGATTCTGAGCATTGCATTTGGATCAGGAAAACAAGGAATGGCAAAGGATGTACTGCAGAAATCTTCAGAGTAGATCATAAAAATAGAAAATCTTGCATCCTAGTCCCGGAAGGCCCTGAGAAAAGTGGTTGGGTGTCATTCTTGTCTATGATCACTCCAAAAGTGGAAGAAACAAACAGAGGAATCTTTTACATACCATGCCTCCATGCTGAAAAGGCCCTGGTTCATTTCAACTCAAATGTACCCGCGAATCTTCTTTGTCAAAACAAAGGGTGGACAACCGTTGGGAAATACACGGTCAGGTTTGAAAAATGGGCCCCTGCCTCCCATGCCTCTCCAAAACTCATTCCTAGCTATGGGGGATGGACAACCTTTAGAGGAATTCCACTACACTTGTGGAATATGATGACCTTTCAACAAATTGGGAAAGCGTGTGGTAGTCTGGTTAAAGTGGCTGAGGAGACAAAAACAGCAAGAAACCTGATAGAAGCAAAATTAAAAATCAGATACAACTACTCCGGCTTCTTACCAGCTTACGTGAAGATTTTTGACCAAGAAGGAAACAAATTTGTTGTTCAAGTAGTCACTCACTCAGAAGGAAAATGGTTAATGGAAAGAAATATTAGATTACATGGTACCTTCAAGAGACAAGCTGCGGCTTCTTTTGATGATTTCAATCCTGATGCAGAGCAATTCCTGTTTGATGGTCTGGAGGCCATTTCACCGGATCTCCTGAACACCATCTCCGGCAGCCGTAAAAGCATTTCTCCGGAACAGCAATCTGCATTAAAATCAGTCATCATTAAACCTGCCAGAGATGCCACGTCGCCAACTACTTTAAATGAAGAGGTAGTTAATGATAACAGTTTGCATGCAACGACTATTAATTCGAAGTTGAAGATCTTATCTGGGATATCAAATGATGGCTCTTTGGATAAAGGAAAGCAAAAGGTTGACATTCCGTCTCAACTAACTTCAGCATTTATTTTTGACAAACCCAAAAGAAAAGTCTCCTTTAATTCCCCCAATAATAAAACCACCTTTTTTAATCCGGATTCTGCCCCAACCAATCATTCCCCTCTATTGAGCTCCCCTGAGAAAAAACAAAGAGTCAGTAGAGAGAGAAGTGTTAAAAAGAAATCATTAACCATTCAGCCTAAATCAAGAGCCAATCAGGGCAAAGGTGATTTAATCACTCAACCTCTTCAAGTTGTGGCACATGATCTGGATGCTTCCAAAAAAGGTCTCTCTCTCACAGTGGATTTGGGAAATCTGCCAGCTTTAGATCCGAGTAAATCCTGCGAAGATCATCACAGCTCTGACAATGCAGAAGTCATAGACATAACTAACACCGAAGTGGTTACAGAGACACCTGAATTGAAGATGACAGATCCAAAGAAATCAAACTCCTCTCCGGAAGTCAACTATAGGAAGCAAAAACATTCTCACCGAAGAAGACACTACTATAGGAAAAAGGAAGGCAAGGAGAAGGACACAAATTCAGAAGCCTTCAAAAATCAACTTGTTACTTGGCTAAAGGAAAATGGGCTGAAACTCTCTACAGATACAGACTCTTCTGGGGGATGGGGCCCCGGGGACATCAAATGTAATATGAAACTTCTTACTTGGAATGCAAGAGGTTTAGGCTCTCCTTCTAAAAGAGCCTTAATAAAAAATACTATAATTTCCTATTCCCCCGACTTTGTGATTCTTACTGAAACAAGGCTCAAAATCACAAATAAGAGAATTATTAACTCAAGAGGAATTCTGATTTT
ATGGGACGCTCAACATCATTCTCTTTTAAGTCAAGAGGAAGGGAAGTTCAGCTTATCAGCAAATTTTTTGTCCTTCAATAATTCTTGGTGGTTAACTGGTTTATATGGTCCAGTCAAAAGAAGGGAAAGATTAAATTTTTGGGCAGATTTACATAATCTCCTCCATCTTAACTCTTCTCCTTGGATAATAGGCTCCAACATGCTAAATAACTTCATCTCCAACAATCTTCTGATTGACCCTCCTCTAACAAATAACAGATACACTTGGTCAAACTTGAGAAATCCTCCAACCTTTTCCCATTTAGACAGATTCTTGTACAACTCTAGTTGGGAGATTCTCTTCAATCCTCATATCACAAGGACTCTCCCGAGAACTACTTCTGACCACTTCCCCCTAGTCTGTGAAGATTCCACCTCCACTCTCAGATGGGGCCCTGCTCCATTTAGATTAAACTCCATAACTTTAAATGACCCTGAATTCAAAAGAAATATGGAAAGATGGTGGGAGCTCTCAATTCAAAATGGTCACCCCGGGTTTTCCTTCATTCAAAGACTCAAGTCTTTAGCAAACCTCATCAAACCATGGCAAAAGGAGAAATTTCACTCCCTCACTACTGCTAAAGAAAATATTATTAGGGAAGTGGATGCCATTGATAAGAATGAGTTGGATACTCCGTTGTCTCAGGAGGAAAGCAACCGTAGACTTGCTCTAAAAGCTGAGCTCAATGATCTTTCCCTTAAGGAATCCCAATTTTGGTTCCAAAGGGCAAAAAAGCTTTGGATTAAAGAAGGAGATGAAAATTATGCCTTCTTTCATAGAATTTGTTCATCAAGACAGAAGAGAAATCTCATTCACGAAATCCAGAATGAAGAAGGCTCGATTCAGAATACAAACAATAATATTTCTCTTGCCTTTGTCAATCACTTTTCAAGAATTTACAGATGTTCCACCAAAAAAGACCCTCTTTTCATTGAAAATCTTGAGTGGAATCCAATTGATTACTCAGATTGGTCTCTCCTTTGTGCCCCCTTCTTGGAGGAAGAAATTAAAGGGGTCATCAAATCTTTTGATGGAAATAAGGCTCCTGGTCCAGACGGATTTCCTATTTCATTCTTTAAGTCTTATTGGCATCTTCTAAAAGAGGATATCATGGACATCTTTAAGGATTTCTTTGAGAAGGGAGTGATCAATAAGAATATGAACAACACCTACATCGCGTTGATTGCAAAAAAGAAGGATTATTCTCATCCAAAAGATTTCAGACCAATCAGCCTAACAACGTCTATCTACAAGATCATTGCAAAGACTCTTTCCAATAGGCTAAAGCTCACCCTCCCAGATACTATCTCAGGTAACCAACTTGCTTTTATCAAAAATCGTCAAATAACAGATGCTATCTTAATAGCGAATGAAGCTTTAGACTATTGGAAAGTGAAAAAGATCAAAGGTTTTATCTTGAAGCTGGATATTGAAAAGGCTTTTGATAATCTGAACTGGGATTTCATTGATTTCGTTCTTGAGAAAAAAAATTACCCAACTTCCTGGAGGAAATGGATAAGAGGCTGCATAAGCAATGTCACCTACTCAATCGAAGTCAATGGAAAACCACAAGGGCGAATTAAAGCCAATAGGGGTCTGAGACAAGGTGATCCCCTTTCTCCCTTCCTTTTTGTTATTGTTATGGATTACCTCAGCAGGCTCTTAAGTCATTTGGAGTCCACTGGTGCCATCAAAGGGGTGTGCCTCGCCAATGATTGCAACATTTCCCATATCCTCTTCGCTGATGATATTCTACTTTTTGTAGAAGATAATGACAACTTTCTGAATAATCTTAGAATGGCTATTTCTCTGTTTGAAAAGGCCTCTGGGCTCAAAATAAATTTGTCAAAATCAGCTATGGTTCCAGTTAATGTCTCCTGGTTAAGAGCTTTGGAGTGTACTTCATCTTGGGGTATTTCGTGCCACACTCTTCCTCTTACTTACTTAGGAGTCCCCCTTG
GTGGCAACCAAAAATCCAACCTTTTCTGGAGGAATATTGAGGACAGAATCCAAAAAAAGCTTAGCAACTGGAAATATGCCCACATCTCAAAAGGTGGAAGACTCACGCTAATCAAGTCTACTCTAAGCAGTCTCCCAATTTATCAACTATCTGTCTTTCAAGCTCCTTCCTCCACGTATAAAAACATCGAAAAAATTTGGAGGAATTTCCTTTGGAAAGGTAGCGGTGGTCTAAAAGGGTCTCACTTAATCAACTGGTCAATAGTGACTAAGCCTAAGGAGGAGGGTGGGCTGGGTATATCGAGACTCCAAGTAACAAATCAAGCTCTCTTATCTAAGTGGCTTTGGCGCTACCATTCGGAGCCTAATTCCCTTTGGAGGAGGTTAATCCATATCAAGTATAAAGGTAAACACCCAGGGGACCTCCCATCAAACATTTCCTCTAGCTCCTCTAAAGCCCCGTGGAGATCTATCATCAACAACATTGACTGGTTCAAAAGTAATCAAGGTTGGAACTTGAACAATGGAGATCAAATCTCCTTCTGGTATTCTAACTGGTCTCCAGAAGGATGTCTCTCTACTGCCTATCCCAGACTTTTTGCCCTCTCTATCGACAAAGAATCCTCAATCAAAGATGTGTGGAACTCAAACAACAATCAATGGGAAATAACTTTTAGAAGAAAGTTGAATGATAGAGAACTTAGCACATGGCAGAAAATTTTAGAGAATCTTCCTATTCCGAGAACTAACAGAGGGCCAAGTAAACCTACCTGGATTCCCGACAGCAAGAAATTGTTCTCCATCGCCTCTGCTAAAAGTTGTATCTCCCACCAGCCGGATCGTCCGGTAGCGAATCCTCGAGTAAAGCTGCTTGAATTAATTTGGAAAACTCACGTTCCTATGAAGATAAAATTCTTCATGTGGTGCCTGGTTCAAAGAAAGTTAAACACTATGGAGGTCATTCAGCAAAGAATGCCAAATACGCTTCTACAACCAAATTGGTGCGTTCTCTGCAAAAAAGATAGCGAAACGGGAGCTCACCTTTTCCTTTACTGTGATCGGGTGAAGCCCCTGTGGTCCTTCCTCCATCGATCTCTCAATTTCGCACCCATTTCCGACGACTTTGAAGCGATGTTCTCCTTCTTCCTCTCCCTAAACCAATCCCTCCCGAAGCACAAGGTCGTTCTTTGTGGGCTGATAGCTATTCTTTGGGGTATTTGGACAGAGAGAAATAATAGAATTTTTTATACTTTAAGTTATCAAAAATCTATTGCTAACTTATGGGAAGGCTGCAAAATTCTGATAGGAAATTGGTGTAGTAGAGATCCTACTTTAAAAAATTATTCAGCAGCTACTATTGCTCTTAATCTTAACGCCTTCTGTAATTAG

Protein sequence

MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLIETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWVSFLSMITPKVEETNRGIFYIPCLHAEKALVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGSLVKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNIRLHGTFKRQAAASFDDFNPDAEQFLFDGLEAISPDLLNTISGSRKSISPEQQSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTINSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFDKPKRKVSFNSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRVSRERSVKKKSLTIQPKSRANQGKGDLITQPLQVVAHDLDASKKGLSLTVDLGNLPALDPSKSCEDHHSSDNAEVIDITNTEVVTETPELKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRKKEGKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGGWGPGDIKCNMKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRIINSRGILILWDAQHHSLLSQEEGKFSLSANFLSFNNSWWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWIIGSNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHITRTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPGFSFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALKAELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPKDFRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGRIKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILLFVEDNDNFLNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLPLTYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSVFQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSKWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWNLNNGDQISFWYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFRRKLNDRELSTWQKILENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANPRVKLLELIWKTHVPMKIKFFMWCLVQRKLNTMEVIQQRMPNTLLQPNWCVLCKKDSETGAHLFLYCDRVKPLWSFLHRSLNFAPISDDFEAMFSFFLSLNQSLPKHKVVLCGLIAILWGIWTERNNRIFYTLSYQKSIANLWEGCKILIGNWCSRDPTLKNYSAATIALNLNAFCN
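
The relationship between the coding sequence and the protein sequence above can be sketched as a codon-by-codon translation. The example below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCTACTTCAAATCA"))  # first six codons of the CDS
print(translate("TGTAATTAG"))           # last three codons, ending in a stop
```

The two fragments translate to MAYFKS and CN, which match the start and end of the protein sequence listed above.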
Homology
BLAST of Pay0007691 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 196.4 bits (498), Expect = 2.7e-48
Identity = 177/709 (24.96%), Positives = 328/709 (46.26%), Query Frame = 0

Query: 815  FRLNSITLNDPEFKRNMERWWELSIQ-NGHPGFSFIQRLKSLANLIKPWQKEKFHSLTTA 874
            ++LN+  LND   K  +++  +  ++ N +   ++     +L + +K + + K  +L+ +
Sbjct: 257  WKLNNTLLNDTLVKEGIKKEIKDFLEFNENEATTY----PNLWDTMKAFLRGKLIALSAS 316

Query: 875  KE--------NIIREVDAIDKNELDTPLSQEESNRRLALKAELNDLSLKESQFWFQRAKK 934
            K+        ++   + A++K E ++P  +      + L+ E+N +  + +     + + 
Sbjct: 317  KKKRETAHTSSLTTHLKALEKKEANSP-KRSRRQEIIKLRGEINQVETRRTIQRINQTRS 376

Query: 935  LWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNNNISLAFVNHFSRIYRCSTK 994
             + ++ ++      R+    + + LI++I+NE+G I      I     + + R+Y  STK
Sbjct: 377  WFFEKINKIDKPLARLTKGHRDKILINKIRNEKGDITTDPEEIQNTIRSFYKRLY--STK 436

Query: 995  KDPL-----FIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGFPISFFK 1054
             + L     F++  +   ++      L +P   +EI+ VI S    K+PGPDGF   F++
Sbjct: 437  LENLDEMDKFLDRYQVPKLNQDQVDHLNSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQ 496

Query: 1055 SYWHLLKEDIMDIFKDFFEKGVINKNMNNTY----IALIAK-KKDYSHPKDFRPISLTTS 1114
            ++    KED++ I    F K  +   + N++    I LI K +KD +  ++FRPISL   
Sbjct: 497  TF----KEDLIPILHKLFHKIEVEGTLPNSFYEATITLIPKPQKDPTKIENFRPISLMNI 556

Query: 1115 IYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYW-KVKKIKGFILK 1174
              KI+ K L+NR++  +   I  +Q+ FI   Q    I  +   + Y  K+K     I+ 
Sbjct: 557  DAKILNKILANRIQEHIKAIIHPDQVGFIPGMQGWFNIRKSINVIHYINKLKDKNHMIIS 616

Query: 1175 LDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGRIKANRGLR 1234
            LD EKAFD +   F+  VLE+      +   I+   S    +I+VNG+    I    G R
Sbjct: 617  LDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIKVNGEKLEAIPLKSGTR 676

Query: 1235 QGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILLFVEDNDNF 1294
            QG PLSP+LF IV++ L+R +   +    IKG+ +  +  +   L ADD+++++ D  N 
Sbjct: 677  QGCPLSPYLFNIVLEVLARAIRQQKE---IKGIQIGKE-EVKISLLADDMIVYISDPKNS 736

Query: 1295 LNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLPLTYLGVPLG 1354
               L   I+ F +  G KIN +KS       +     E   +   S  T  + YLGV L 
Sbjct: 737  TRELLNLINSFGEVVGYKINSNKSMAFLYTKNKQAEKEIRETTPFSIVTNNIKYLGVTLT 796

Query: 1355 GNQKS--NLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSV--FQAPS 1414
               K   +  +++++  I++ L  WK    S  GR+ ++K  +    IY+ +    + P+
Sbjct: 797  KEVKDLYDKNFKSLKKEIKEDLRRWKDLPCSWIGRINIVKMAILPKAIYRFNAIPIKIPT 856

Query: 1415 STYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSK--WL 1474
              +  +E     F+W           I  S++   +  GG+ +  L++  +A++ K  W 
Sbjct: 857  QFFNELEGAICKFVWNNK-----KPRIAKSLLKDKRTSGGITMPDLKLYYRAIVIKTAWY 916

Query: 1475 WRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDW 1498
            W Y       W R+   +      G L  +  + + +    SI NN  W
Sbjct: 917  W-YRDRQVDQWNRIEDPEMNPHTYGHLIFDKGAKTIQWKKDSIFNNWCW 944
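
The percentages in an HSP summary line are simply the identical (or positive-scoring) columns divided by the alignment length. The sketch below recomputes them from a summary line of the form shown above; the function name `hsp_fractions` is hypothetical, and the pattern also accepts the misspelling "Postives" in case a summary line carries it.

```python
import re

# Matches e.g. "Identity = 177/709" or "Positives = 328/709".
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_fractions(line):
    """Return percent identity and percent positives from an HSP summary line."""
    out = {}
    for name, num, den in STAT_RE.findall(line):
        key = "positives" if name.startswith("Pos") else "identity"
        out[key] = round(100 * int(num) / int(den), 2)
    return out

line = "Identity = 177/709 (24.96%), Positives = 328/709 (46.26%), Query Frame = 0"
print(hsp_fractions(line))
```

For the first Swiss-Prot hit this gives 24.96% identity and 46.26% positives over 709 aligned columns, in agreement with the reported values.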

Match: P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 3.0e-47
Identity = 185/728 (25.41%), Positives = 339/728 (46.57%), Query Frame = 0

Query: 764  TFSHLDRFLYNSSWEILFNPHITRTLPRTTSDHFPLVCE-DSTSTLRWGPAPFRLNSITL 823
            T+S +D  L + S    F       +P   SDH  +  E ++   L      ++LN++ L
Sbjct: 199  TYSKIDHILGHKSNLSKFKK--IEIIPCIFSDHHGIKVELNNNRNLHTHTKTWKLNNLML 258

Query: 824  ND----PEFKRNMERWWELSIQNGHPGFSFIQRLKSLANLIKPWQKEKFHSL-----TTA 883
             D     E K+ + ++ E   QN +   ++    ++L +  K   + KF +L      T 
Sbjct: 259  KDTWVIDEIKKEITKFLE---QNNNQDTNY----QNLWDTAKAVLRGKFIALQAFLKKTE 318

Query: 884  KE---NIIREVDAIDKNELDTPLSQEESNRR--LALKAELNDLSLKESQFWFQRAKKLWI 943
            +E   N++  +  ++K E   P   + S R+    ++AELN++  K       ++K  + 
Sbjct: 319  REEVNNLMGHLKQLEKEEHSNP---KPSRRKEITKIRAELNEIENKRIIQQINKSKSWFF 378

Query: 944  KEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNNNISLAFVNHFSRIYRC---STK 1003
            ++ ++       +   ++ ++LI  I+N    I    + I      ++ ++Y     + K
Sbjct: 379  EKINKIDKPLANLTRKKRVKSLISSIRNGNDEITTDPSEIQKILNEYYKKLYSHKYENLK 438

Query: 1004 KDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHL 1063
            +   ++E      +   +  +L  P    EI   I++    K+PGPDGF   F++++   
Sbjct: 439  EIDQYLEACHLPRLSQKEVEMLNRPISSSEIASTIQNLPKKKSPGPDGFTSEFYQTFKEE 498

Query: 1064 LKEDIMDIFKDFFEKGVINKNMNNTYIALIAKK-KDYSHPKDFRPISLTTSIYKIIAKTL 1123
            L   ++++F++  ++G++        I LI K  KD +  +++RPISL     KI+ K L
Sbjct: 499  LVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKDPTRKENYRPISLMNIDAKILNKIL 558

Query: 1124 SNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYW-KVKKIKGFILKLDIEKAFDN 1183
            +NR++  +   I  +Q+ FI   Q    I  +   + +  K+K     IL +D EKAFDN
Sbjct: 559  TNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVIQHINKLKNKDHMILSIDAEKAFDN 618

Query: 1184 LNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGRIKANRGLRQGDPLSPFL 1243
            +   F+   L+K     ++ K I    S  T +I +NG          G RQG PLSP L
Sbjct: 619  IQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIILNGVKLKSFPLRSGTRQGCPLSPLL 678

Query: 1244 FVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILLFVEDNDNFLNNLRMAIS 1303
            F IVM+ L+     +    AIKG+ + ++  I   LFADD+++++E+  +    L   I 
Sbjct: 679  FNIVMEVLA---IAIREEKAIKGIHIGSE-EIKLSLFADDMIVYLENTRDSTTKLLEVIK 738

Query: 1304 LFEKASGLKINLSKS-AMVPVNVSWLRALECTSSWGISCHTLP--LTYLGVPLGGNQKSN 1363
             +   SG KIN  KS A +  N +     E T    I    +P  + YLGV L  + K +
Sbjct: 739  EYSNVSGYKINTHKSVAFIYTNNN---QAEKTVKDSIPFTVVPKKMKYLGVYLTKDVK-D 798

Query: 1364 LFWRNIE---DRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSV--FQAPSSTYKN 1423
            L+  N E     I + ++ WK    S  GR+ ++K ++    IY  +    +AP S +K+
Sbjct: 799  LYKENYETLRKEIAEDVNKWKNIPCSWLGRINIVKMSILPKAIYNFNAIPIKAPLSYFKD 858

Query: 1424 IEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSKWLWRYHSEP 1463
            +EKI  +F+W      +    I  ++++   + GG+ +  L++  ++++ K  W +H   
Sbjct: 859  LEKIILHFIWN-----QKKPQIAKTLLSNKNKAGGITLPDLRLYYKSIVIKTAWYWHKNR 901

Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 3.0e-47
Identity = 183/725 (25.24%), Positives = 324/725 (44.69%), Query Frame = 0

Query: 764  TFSHLDRFLYNSSWEILFNPHITRTLPRTTSDHFPLVCEDSTSTLRWG-PAPFRLNSITL 823
            T+S +D  +   S  +L     T  +    SDH  +  E     L       ++LN++ L
Sbjct: 200  TYSKIDHIV--GSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLL 259

Query: 824  ND----PEFKRNMERWWE------LSIQNGHPGFSFIQRLKSLANLIKPWQKEKFHSLTT 883
            ND     E K  ++ ++E       + QN    F  + R K +A  +  +++++  S   
Sbjct: 260  NDYWVHNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIA--LNAYKRKQERSKID 319

Query: 884  AKENIIREVDAIDKNELDTPLSQEESNRRLALKAELNDLSLK---ESQFW-FQRAKKLWI 943
               + ++E++  ++        QE +  R  LK      +L+   ES+ W F+R  K+  
Sbjct: 320  TLTSQLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKI-- 379

Query: 944  KEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNNNISLAFVNHFSRIYRC---STK 1003
               D   A   R+   ++++N I  I+N++G I      I      ++  +Y     + +
Sbjct: 380  ---DRPLA---RLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLE 439

Query: 1004 KDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGFPISFFKSYWHL 1063
            +   F++      ++  +   L  P    EI  +I S    K+PGPDGF   F++ Y   
Sbjct: 440  EMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEE 499

Query: 1064 LKEDIMDIFKDFFEKGVINKNMNNTYIALIAKK-KDYSHPKDFRPISLTTSIYKIIAKTL 1123
            L   ++ +F+   ++G++  +     I LI K  +D +  ++FRPISL     KI+ K L
Sbjct: 500  LVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKIL 559

Query: 1124 SNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKVKKIKG-FILKLDIEKAFDN 1183
            +NR++  +   I  +Q+ FI   Q    I  +   + +    K K   I+ +D EKAFD 
Sbjct: 560  ANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHVIISIDAEKAFDK 619

Query: 1184 LNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGRIKANRGLRQGDPLSPFL 1243
            +   F+   L K      + K IR      T +I +NG+         G RQG PLSP L
Sbjct: 620  IQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLL 679

Query: 1244 FVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILLFVEDNDNFLNNLRMAIS 1303
            F IV++ L+R +   +    IKG+ L  +  +   LFADD+++++E+      NL   IS
Sbjct: 680  FNIVLEVLARAIRQEKE---IKGIQLGKE-EVKLSLFADDMIVYLENPIVSAQNLLKLIS 739

Query: 1304 LFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLPLTYLGVPLGGNQKSNLFW 1363
             F K SG KIN+ KS     N +     +       +  +  + YLG+ L  + K +LF 
Sbjct: 740  NFSKVSGYKINVQKSQAFLYNNNRQTESQIMGELPFTIASKRIKYLGIQLTRDVK-DLFK 799

Query: 1364 RNIE---DRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSV--FQAPSSTYKNIEK 1423
             N +     I++  + WK    S  GR+ ++K  +    IY+ +    + P + +  +EK
Sbjct: 800  ENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEK 859

Query: 1424 IWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSK--WLWRYHSEPN 1462
                F+W      +    I  SI+++  + GG+ +   ++  +A ++K  W W Y +   
Sbjct: 860  TTLKFIWN-----QKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYW-YQNRDI 901

Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 7.6e-35
Identity = 160/682 (23.46%), Positives = 294/682 (43.11%), Query Frame = 0

Query: 813  APFRLNSITLNDPEFKRNMERWWELSIQNGHPGFSFIQRLKSLANLIKPWQKEKFHSLTT 872
            A +  N+  L D  F +++   W         G+   Q     A L + W   K H    
Sbjct: 247  AYWHFNNSLLEDEGFAKSVRDTWR--------GWRAFQ--DEFATLNQWWDVGKVHLKLL 306

Query: 873  AKE-------NIIREVDAIDKNELDTP--LSQEESN----RRLALKAELNDLSLKESQFW 932
             +E           E++A++   LD    LS  E        L  K  L ++  ++++  
Sbjct: 307  CQEYTKSVSGQRNAEIEALNGEVLDLEQRLSGSEDQALQCEYLERKEALRNMEQRQARGA 366

Query: 933  FQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNNNISLAFVNHFSRI 992
            F R++   + + D    FF+ +   +  R  I  +  E+G    T      A  +     
Sbjct: 367  FVRSRMQLLCDMDRGSRFFYALEKKKGNRKQITCLFAEDG----TPLEDPEAIRDRARSF 426

Query: 993  YRCSTKKDPLFIENLE--WN---PIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGF 1052
            Y+     DP+  +  E  W+    +       L  P   +E+   ++    NK+PG DG 
Sbjct: 427  YQNLFSPDPISPDACEELWDGLPVVSERRKERLETPITLDELSQALRLMPHNKSPGLDGL 486

Query: 1053 PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPKDFRPISLTT 1112
             I FF+ +W  L  D   +  + F+KG +  +     ++L+ KK D    K++RP+SL +
Sbjct: 487  TIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGDLRLIKNWRPVSLLS 546

Query: 1113 SIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKVKKIKGFILK 1172
            + YKI+AK +S RLK  L + I  +Q   +  R I D + +  + L + +   +    L 
Sbjct: 547  TDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLLHFARRTGLSLAFLS 606

Query: 1173 LDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGRIKANRGLR 1232
            LD EKAFD ++  ++   L+  ++   +  +++   ++    +++N      +   RG+R
Sbjct: 607  LDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAFGRGVR 666

Query: 1233 QGDPLSPFLFVIVMD-YLSRLLSHLESTGAIKGVCLANDCNISHILFADDILLFVEDNDN 1292
            QG PLS  L+ + ++ +L  L   L  TG    V    D  +    +ADD++L  +D  +
Sbjct: 667  QGCPLSGQLYSLAIEPFLCLLRKRL--TGL---VLKEPDMRVVLSAYADDVILVAQDLVD 726

Query: 1293 FLNNLRMAISLFEKASGLKINLSKSAMV---PVNVSWLRALECTSSWGISCHTLPLTYLG 1352
             L   +    ++  AS  +IN SKS+ +    + V +L       SW     +  + YLG
Sbjct: 727  -LERAQECQEVYAAASSARINWSKSSGLLEGSLKVDFLPPAFRDISW----ESKIIKYLG 786

Query: 1353 VPLGGNQ-KSNLFWRNIEDRIQKKLSNWK-YAHI-SKGGRLTLIKSTLSSLPIYQLSVFQ 1412
            V L   +   +  +  +E+ +  +L  WK +A + S  GR  +I   ++S   Y+L    
Sbjct: 787  VYLSAEEYPVSQNFIELEECVLTRLGKWKGFAKVLSMRGRALVINQLVASQIWYRLICLS 846

Query: 1413 APSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSKW 1469
                    I++   +FLW       G H ++  + + P +EGG G+  ++        + 
Sbjct: 847  PTQEFIAKIQRRLLDFLW------IGKHWVSAGVSSLPLKEGGQGVVCIRSQVHTFRLQQ 898

Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 1.2e-32
Identity = 100/356 (28.09%), Positives = 159/356 (44.66%), Query Frame = 0

Query: 1346 IEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSVFQAPSSTYKNIEKIWRNFLW 1405
            I +R+  ++S W+   +S  GRLTL K+ LSS+P++ +S    P S    ++++ R FLW
Sbjct: 16   ILERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQSILNRLDQLSRTFLW 75

Query: 1406 KGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSKWLWRYHSEPNSLWRRLIHI 1465
              +   K  HL+ WS V  PK+EGGLG+   +  N+AL+SK  WR   E NSLW  ++  
Sbjct: 76   GSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQK 135

Query: 1466 KYKGKHPGDLPSN---ISSSSSKAPWRSI-INNIDWFKSNQGWNLNNGDQISFWYSNWSP 1525
            KY   H G++  +   I   S  + WRSI I   D      GW   +G QI FW   W  
Sbjct: 136  KY---HVGEIRDSRWLIPKGSWSSTWRSIAIGLRDVVSHGVGWIPGDGQQIRFWTDRW-- 195

Query: 1526 EGCLSTAYPRLFALSIDKESS-----IKDVWNSNNNQWEITFRRKLNDRELSTWQKILEN 1585
                 +  P L   + ++ +       KD+W      W+     K++    +  +  L  
Sbjct: 196  ----VSGKPLLELDNGERPTDCDTVVAKDLWIPGRG-WDFA---KIDPYTTNNTRLELRA 255

Query: 1586 LPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANPRVKLLELIWKTHVPMKI 1645
            + +        + +W       FS+ SA   ++   D             +WK  VP ++
Sbjct: 256  VVLDLVTGARDRLSWKFSQDGQFSVRSAYEMLT--VDEVPRPNMASFFNCLWKVRVPERV 315

Query: 1646 KFFMWCLVQRKLNTMEVIQQRMPNTLLQPNWCVLCKKDSETGAHLFLYCDRVKPLW 1693
            K F+W +  + + T E   +R    L   N C +CK   E+  H+   C     +W
Sbjct: 316  KTFLWLVGNQAVMTEE---ERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIW 353

BLAST of Pay0007691 vs. ExPASy TrEMBL
Match: A0A5D3BL61 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G001020 PE=4 SV=1)

HSP 1 Score: 3079.7 bits (7983), Expect = 0.0e+00
Identity = 1568/1910 (82.09%), Positives = 1595/1910 (83.51%), Query Frame = 0

Query: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
            MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS
Sbjct: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60

Query: 61   LIETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWV 120
            LIETPSSNRFFLENRD EHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEG EKS WV
Sbjct: 61   LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120

Query: 121  SFLSMITPKVE--ETNRGIFY--------------------------------------- 180
            SFLSMITPKVE     R IF                                        
Sbjct: 121  SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180

Query: 181  -----------IPC------------------------------------------LHAE 240
                        PC                                           HAE
Sbjct: 181  ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240

Query: 241  KALVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
            K LVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH
Sbjct: 241  KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300

Query: 301  LWNMMTFQQIGKACGSLVKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
            LWNMMTFQQIGKACG L+KVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV
Sbjct: 301  LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360

Query: 361  VQVVTHSEGKWLMERNIRLHGTFKRQAAASFDDFNPDAEQFLFDGLEAISPDLLNTISGS 420
            VQVVTHSEGKWLMERN+RLHGTFKRQAAASFDDFNPD+EQFLFDGLEAISPDLLNTISGS
Sbjct: 361  VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420

Query: 421  RKSISPEQQSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTINSKLKILSGISNDGSL 480
            RKSISPEQ SALKSVIIKPA+ ATSPTTLNEEVVNDNSLHAT   SKLKILSGISNDGSL
Sbjct: 421  RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480

Query: 481  DKGKQKVDIPSQLTSAFIFDKPKRKVSFNSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRV 540
            DKGKQKVDIPSQLTSAFIF KPKRKVSFNSP+NKTTFFNPDSAP NH     SPEKK+RV
Sbjct: 481  DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANH-----SPEKKKRV 540

Query: 541  SRERSVKKKSLTIQPKSRANQGKGDLITQPLQVVAHDLDASKKGLSLTVDLGNLPALDPS 600
            SRERSVKKKS TIQPK RANQGKG+LITQPLQVVAHDLDASKKGLSLTVDLGNLP LDPS
Sbjct: 541  SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600

Query: 601  KSCEDHHSSDNAEVIDITNTEVVTETPELKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRK 660
            KS EDHHSSDNAEVIDITNTEVV ETPELKMTDP+KSNSSPEVNYRKQKHSHRRRHYYRK
Sbjct: 601  KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660

Query: 661  KEGKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGGWGPGDIKCNMKLLTWNARGLGSP 720
            KE KEKDTNSEAFKNQLVTWLKENGLKLS DTDSSG                      + 
Sbjct: 661  KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSG---------------------ATT 720

Query: 721  SKRALIKNTIISYSPDFVILTETRLKITNKRIINSRGILILWDAQHHSLLSQEEGKFSLS 780
            S  AL      S                      + GILILWDAQHHSLLSQEEGKFSLS
Sbjct: 721  STNALFSQLGSS----------------------AGGILILWDAQHHSLLSQEEGKFSLS 780

Query: 781  ANFLSFNNSWWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWIIG--------------- 840
            ANF SFNNSWWLTGLYGPVKRRERLN W DLHNL HLNSSPWIIG               
Sbjct: 781  ANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWIIGGDLNVVRMREESTAV 840

Query: 841  ------SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHI 900
                  SNMLN+FISNNLLIDPPLTNNRYTWSNLRNPPTFS LDRFLYNS WEILFNPHI
Sbjct: 841  TFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWEILFNPHI 900

Query: 901  TRTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPG 960
            TRTLPR TSDHFPLVCEDSTSTLRWGPAPFRLNSI LNDPEFKRNMERWWELS+QNGHPG
Sbjct: 901  TRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFKRNMERWWELSVQNGHPG 960

Query: 961  FSFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALK 1020
            F FIQRLKSLANLIKPWQKEKF SLT+AKENIIREVD+IDKNELDTPLS EESNRRLALK
Sbjct: 961  FFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNELDTPLSLEESNRRLALK 1020

Query: 1021 AELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTN 1080
            AELNDLSLKESQFWFQRAKKLW+KEGDEN AFFHRICSSRQKRNLIHEIQ+EEGSIQNTN
Sbjct: 1021 AELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTN 1080

Query: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDG 1140
            NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPF EEEIKGVIKSFDG
Sbjct: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFSEEEIKGVIKSFDG 1140

Query: 1141 NKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPK 1200
            NKAPGPDGFPISFFKSYWHLLKEDI+DIFKDFFEKGVINKNMNNTYIALI KKKDYSHPK
Sbjct: 1141 NKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMNNTYIALIEKKKDYSHPK 1200

Query: 1201 DFRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKV 1260
            DFRPISLTTSIYK IAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAIL+ANEALDYWKV
Sbjct: 1201 DFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKV 1260

Query: 1261 KKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQG 1320
            KKIKGFILKLDIEKAFDNLNW+FID VL+K NYP SWRKWIRGCISNVTYSI VNGKPQG
Sbjct: 1261 KKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRGCISNVTYSIIVNGKPQG 1320

Query: 1321 RIKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDIL 1380
            RIKANRGLRQGDPLS FLFVI MDYLSRLLSHLESTGAIKG                   
Sbjct: 1321 RIKANRGLRQGDPLSLFLFVIAMDYLSRLLSHLESTGAIKG------------------- 1380

Query: 1381 LFVEDNDNFLNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLP 1440
                                                                GI CHTLP
Sbjct: 1381 ----------------------------------------------------GILCHTLP 1440

Query: 1441 LTYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSV 1500
            LTYLGVPLGGN KSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIY+LSV
Sbjct: 1441 LTYLGVPLGGNPKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYKLSV 1500

Query: 1501 FQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLS 1560
            FQAPSSTYKNIEK+WRNFLWKGS GLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLS
Sbjct: 1501 FQAPSSTYKNIEKLWRNFLWKGSCGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLS 1560

Query: 1561 KWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWN 1620
            KWLWRY+SEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGW+
Sbjct: 1561 KWLWRYYSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWD 1620

Query: 1621 LNNGDQISFWYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFRRKLNDR 1680
            LNNGDQISFWYSNWSPEGCLSTAYPRLFALS+DKESSIKDVWNSNNNQWEITFRRKLNDR
Sbjct: 1621 LNNGDQISFWYSNWSPEGCLSTAYPRLFALSMDKESSIKDVWNSNNNQWEITFRRKLNDR 1680

Query: 1681 ELSTWQKILENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANPRVKLLE 1740
            ELSTWQKILENLPI RTNRGPSKPTWIPDSKK FSIASAKSCISHQPDR VANPRVKLL 
Sbjct: 1681 ELSTWQKILENLPILRTNRGPSKPTWIPDSKKFFSIASAKSCISHQPDRSVANPRVKLLN 1740

Query: 1741 LIWKTHVPMKIKFFMWCLVQRKLNTMEVIQQRMPNTLLQPNWCVLCKKDSETGAHLFLYC 1796
            LIWKTHVPMKIKFFMWCLVQRKLNTMEV       TLLQPNWCVLCKK SETGAHLFL+C
Sbjct: 1741 LIWKTHVPMKIKFFMWCLVQRKLNTMEV------XTLLQPNWCVLCKKKSETGAHLFLHC 1785

BLAST of Pay0007691 vs. ExPASy TrEMBL
Match: A0A5D3BLV7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005290 PE=4 SV=1)

HSP 1 Score: 2469.1 bits (6398), Expect = 0.0e+00
Identity = 1252/1780 (70.34%), Positives = 1393/1780 (78.26%), Query Frame = 0

Query: 3    YFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLI 62
            +FKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI
Sbjct: 57   HFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLI 116

Query: 63   ETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWVSF 122
             TP++NRFFLE RDSE  IWIRKTRN KGCTAEIFRVD KNRKSCILVPEGP+KSGWVSF
Sbjct: 117  ATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPDKSGWVSF 176

Query: 123  LSMITPKVE--------------------------------------------------- 182
            LSMITPKVE                                                   
Sbjct: 177  LSMITPKVEVKAKTRPTFLPRTSPDCRLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDS 236

Query: 183  -------------------------------------------ETNRGIFYIPCLHAEKA 242
                                                       +     F     HAEKA
Sbjct: 237  SDSSHSSSNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKA 296

Query: 243  LVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLHLW 302
            LVHF+SN+PANLLCQNKGW+TVGKY+VRFEKW+P  HA+PKLIPSYGGWTTFRGIPLHLW
Sbjct: 297  LVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPVYHATPKLIPSYGGWTTFRGIPLHLW 356

Query: 303  NMMTFQQIGKACGSLVKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQ 362
            NMMTFQQIGKAC  L+KVAEET++A+NLIEA++K+RYNYSGFLPA V+IFD EGNKF VQ
Sbjct: 357  NMMTFQQIGKACEGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFFVQ 416

Query: 363  VVTHSEGKWLMERNIRLHGTFKRQAAASFDDFNPDAEQFLFDGLEAISPDLLNTISGSRK 422
            VVTH EGKWL+ERN+RLHGTFKRQAAASFDDFNP++EQF F+G EAISPD L+T S  RK
Sbjct: 417  VVTHPEGKWLIERNVRLHGTFKRQAAASFDDFNPESEQFFFEGSEAISPDFLSTSSDGRK 476

Query: 423  SISPEQQSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTINSKLKILSGISNDGSLDK 482
            S +P+Q SALKSVIIKP R+AT P+ LNEE+VND++LHAT   SKL+ILSGISNDG LDK
Sbjct: 477  SSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHATANKSKLEILSGISNDGVLDK 536

Query: 483  GKQKVDIPSQLTSAFIFDKPKRKVSFNSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRVSR 542
            GKQKVDI  Q  SA   DK KRKVSFNSP+NKT  FNPDSAP NHSP L+SPEKKQ+VSR
Sbjct: 537  GKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPSLNSPEKKQKVSR 596

Query: 543  ERSVKKKSLTIQPKSRANQGKGDLITQPLQVVAHDLDASKKGLSLTVDLGNLPALDPSKS 602
            ERS+KKKS + QP S+ANQ KG  ITQP+Q+VAHD DA+KKGLSLTVDLG+LPALDP+KS
Sbjct: 597  ERSIKKKSSSTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS 656

Query: 603  CEDHHSSDNAEVIDITNTEVVTETPELKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRKKE 662
             EDHH+SDNAEV+DITNTEVV ETPE+KM   + SNSS E NYRK KH H+R++YYRKKE
Sbjct: 657  LEDHHNSDNAEVVDITNTEVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKE 716

Query: 663  GKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGGWGPGDIKCNMKLLTWNARGLGSPSK 722
             KEKD +SEAFK QLV+WLK+NGLKLSTDTDSSG     ++     LL     GL   +K
Sbjct: 717  EKEKDPDSEAFKKQLVSWLKKNGLKLSTDTDSSGATTSTNV-----LLNQMNSGLKITNK 776

Query: 723  RALIKNTIISYSPDFVILTETRLKITNKRIINSRGILILWDAQHHSLLSQEEGKFSLSAN 782
            R +IK+   S S +++    +          +S GILILWDAQ+HSLLSQEEG FSLSAN
Sbjct: 777  R-IIKSLWPSNSINWIAKNASG---------SSGGILILWDAQNHSLLSQEEGLFSLSAN 836

Query: 783  FLSFNN-SWWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWIIG---------------- 842
            FL  NN SWWLTGLYGPVKRRER++FWA+LHNL HLNS PWI+G                
Sbjct: 837  FLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVL 896

Query: 843  -----SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHIT 902
                 S MLNNFISNNLLIDPPLTNNR+TWSNLRNPPTFS +DRFLYNSSWE LF+PH T
Sbjct: 897  SSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTT 956

Query: 903  RTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPGF 962
            RTLPR+TSDHFPLVCEDS   L WGP PFRLNSITL+DPEFKRNM RWWE SIQ G+PGF
Sbjct: 957  RTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPEFKRNMGRWWENSIQAGYPGF 1016

Query: 963  SFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALKA 1022
            SFIQRLKSLAN IKPWQKEK HSLT AKE IIREVD+IDK ELDTPL+QEESNRRLALKA
Sbjct: 1017 SFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRRLALKA 1076

Query: 1023 ELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNN 1082
            +L++LSLKESQFW+QRAKKLW++EGDEN +FFHRICSSRQKR+ IHEIQ+EEGSIQNTNN
Sbjct: 1077 DLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGSIQNTNN 1136

Query: 1083 NISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGN 1142
            +IS AF+  FSRIYR STK DPLFIENL+WNPI  S+WS LCAPFLE EIKGVI SFDG 
Sbjct: 1137 SISTAFIKFFSRIYRSSTKSDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGK 1196

Query: 1143 KAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPKD 1202
            K PGPDGFPISFFKS+W                                           
Sbjct: 1197 KTPGPDGFPISFFKSHW------------------------------------------- 1256

Query: 1203 FRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKVK 1262
                                 LK TLP+TISGNQLAF+KNRQITDAIL+ANEA+DYWKVK
Sbjct: 1257 ---------------------LKTTLPNTISGNQLAFVKNRQITDAILMANEAVDYWKVK 1316

Query: 1263 KIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGR 1322
            KIKGFILKLDIEKAFDNLN DFID VLEKKN+P  WRKWIRGCISNVTYS+ +NG+PQGR
Sbjct: 1317 KIKGFILKLDIEKAFDNLNLDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGR 1376

Query: 1323 IKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILL 1382
            IKANRGLRQGDPLSPFLFVI MDYLSRLLSHLES+GAIKGV L  +CNISHILFADDILL
Sbjct: 1377 IKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILL 1436

Query: 1383 FVEDNDNFLNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLPL 1442
            F+EDND FL NLRMA+SLFE+ASGLKINL KSA+VPVNVS  RA EC S WGISCH+LPL
Sbjct: 1437 FIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLKRAKECASFWGISCHSLPL 1496

Query: 1443 TYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSVF 1502
            +YLGVPLGGN KSNLFWRN+ED+IQKKL+NWKYA ISKGGRLTLIKSTLSSLPIYQLSVF
Sbjct: 1497 SYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVF 1556

Query: 1503 QAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSK 1562
            QAPS T KNIEK+WR FLWKG+ G +GSHLINW+ V+K KEEGGLGISRL VTN+ALLSK
Sbjct: 1557 QAPSLTCKNIEKLWRKFLWKGNNGSEGSHLINWTKVSKSKEEGGLGISRLNVTNKALLSK 1616

Query: 1563 WLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWNL 1622
            WLWRY SEPN+LWRRLI  KYKGK PGD+PSNISSS+SKAPWRSII++ DWFKSNQ W+L
Sbjct: 1617 WLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKAPWRSIIDSTDWFKSNQSWDL 1676

Query: 1623 NNGDQISFWYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFRRKLNDRE 1667
            NNGDQISFWYSNWS EG LSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR+LNDRE
Sbjct: 1677 NNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRE 1736

BLAST of Pay0007691 vs. ExPASy TrEMBL
Match: A0A5D3C3M3 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G003160 PE=4 SV=1)

HSP 1 Score: 2450.6 bits (6350), Expect = 0.0e+00
Identity = 1286/1705 (75.43%), Positives = 1325/1705 (77.71%), Query Frame = 0

Query: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
            MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS
Sbjct: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60

Query: 61   LIETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWV 120
            LIETPSSNRFFLENRD EHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSG V
Sbjct: 61   LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGRV 120

Query: 121  SFLSMITPKVE------------------------------------------------- 180
            SFLSMITPKVE                                                 
Sbjct: 121  SFLSMITPKVEVKAKTRPTFLPRSSPEFRLSPPIDYHKRSYEKAVSKGRSSISSDSSDSY 180

Query: 181  ---ETNRGIFYIPC---------LHAEKALVHFNSNVPANLLCQNKGWTTVGKYTVRFEK 240
               ++++     PC              AL+HFNSNVPANLLCQNKGWTTV KY VR   
Sbjct: 181  TSSDSSQSSGNSPCDSPFPVLLENTVVLALIHFNSNVPANLLCQNKGWTTVEKYMVR--- 240

Query: 241  WAPASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGSLVKVAEETKTARNLIEA 300
                                                                        
Sbjct: 241  ------------------------------------------------------------ 300

Query: 301  KLKIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNIRLHGTFKRQAAASFDD 360
                                                                        
Sbjct: 301  ------------------------------------------------------------ 360

Query: 361  FNPDAEQFLFDGLEAISPDLLNTISGSRKSISPEQQSALKSVIIKPARDATSPTTLNEEV 420
                  + LFDGLEAISPDLLNTISGSRKS S EQ SALKSVIIKPARDATSPTTLNEEV
Sbjct: 361  ------KSLFDGLEAISPDLLNTISGSRKSNSREQPSALKSVIIKPARDATSPTTLNEEV 420

Query: 421  VNDNSLHATTINSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFDKPKRKVSFNSPNN 480
            VNDNSLHATTI S+LKILSGISNDGSLDKGKQKVDIPSQLTSAFI+DKPKRKVSFNSP+N
Sbjct: 421  VNDNSLHATTIKSELKILSGISNDGSLDKGKQKVDIPSQLTSAFIYDKPKRKVSFNSPSN 480

Query: 481  KTTFFNPDSAPTNHSPLLSSPEKKQRVSRERSVKKKSLTIQPKSRANQGKGDLITQPLQV 540
            KTTFFN DSAPTNHSP LSSPEKKQRVSRERSVKKKS TIQPKSRANQGKG+LITQPLQV
Sbjct: 481  KTTFFNSDSAPTNHSPPLSSPEKKQRVSRERSVKKKSSTIQPKSRANQGKGELITQPLQV 540

Query: 541  VAHDLDASKKGLSLTVDLGNLPALDPSKSCEDHHSSDNAEVIDITNTEVVTETPELKMTD 600
            VAHDLDASKKGLSLTVDLGNLP LDPSKS EDHHSSDNAEVIDITNTEVV ETPELKMTD
Sbjct: 541  VAHDLDASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAEVIDITNTEVVPETPELKMTD 600

Query: 601  PKKSNSSPEVNYRKQKHSHRRRHYYRKKEGKEKDTNSEAFKNQLVTWLKENGLKLSTDTD 660
            P+KSNSSPEVNYRKQKHSHRRRHYYRKKE KEKDTNSEAFKNQLVTWLKENGLKLSTDTD
Sbjct: 601  PEKSNSSPEVNYRKQKHSHRRRHYYRKKEDKEKDTNSEAFKNQLVTWLKENGLKLSTDTD 660

Query: 661  SSGGWGPGDIKCNMKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRII 720
            SSG                      + S  AL      S S            I    I 
Sbjct: 661  SSG---------------------ATTSTNALFSQLGSSIS-----------WIVKNAID 720

Query: 721  NSRGILILWDAQHHSLLSQEEGKFSLSANFLSFNNSWWLTGLYGPVKRRERLNFWADLHN 780
            +S GILILWDAQHHSLL  +     +                     R E     +  H+
Sbjct: 721  SSGGILILWDAQHHSLLRGDLNVVRM---------------------REESTAVTSSSHS 780

Query: 781  LLHLNSSPWIIGSNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEI 840
                        SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFS LDRFLYNS WE 
Sbjct: 781  ------------SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWET 840

Query: 841  LFNPHITRTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSI 900
            LFNPHITRTL R TSDHFPLVCEDSTSTLRWGPAPFRLNSI LNDP+FKRNMERWWELS+
Sbjct: 841  LFNPHITRTLSRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPKFKRNMERWWELSV 900

Query: 901  QNGHPGFSFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESN 960
            QNGHPGFSFI+RLKSLANLIKPWQKEKFHSLT+AKENIIREVD+IDKNELDTPLSQEESN
Sbjct: 901  QNGHPGFSFIRRLKSLANLIKPWQKEKFHSLTSAKENIIREVDSIDKNELDTPLSQEESN 960

Query: 961  RRLALKAELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEG 1020
            RRLALKAEL+DLSLKESQFWFQRAKKLW+KEGDEN AFFHRICSSRQKRNLIHEIQ+EEG
Sbjct: 961  RRLALKAELSDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEG 1020

Query: 1021 SIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGV 1080
            SIQNTNNNISLAFVNHFS IYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGV
Sbjct: 1021 SIQNTNNNISLAFVNHFSSIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGV 1080

Query: 1081 IKSFDGNKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKK 1140
            IKSFDGNKAPGPDGFPISFFKSYWHLLKEDI+DIFKDFFEKG                  
Sbjct: 1081 IKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKG------------------ 1140

Query: 1141 DYSHPKDFRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEA 1200
                               IIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAIL ANEA
Sbjct: 1141 -------------------IIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILRANEA 1200

Query: 1201 LDYWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEV 1260
            LDYWKVKKIK FILKLDIEKAFDNLNWDFIDFVL+KKNYP SWRKWIRGCISNVTYSI V
Sbjct: 1201 LDYWKVKKIKSFILKLDIEKAFDNLNWDFIDFVLKKKNYPNSWRKWIRGCISNVTYSIIV 1260

Query: 1261 NGKPQGRIKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHIL 1320
            N KPQ RIKANRGLRQGDPLSPFLFV  MDYLSRLLSHLES+GAIKGVCLANDCNISHIL
Sbjct: 1261 NEKPQDRIKANRGLRQGDPLSPFLFVSAMDYLSRLLSHLESSGAIKGVCLANDCNISHIL 1320

Query: 1321 FADDILLFVEDNDNFLNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGI 1380
            FADDILLFVEDND+FLNNLRMA+SLFEKASGLKINLSKSAMVPVNVSW RALEC SSWGI
Sbjct: 1321 FADDILLFVEDNDHFLNNLRMALSLFEKASGLKINLSKSAMVPVNVSWSRALECASSWGI 1380

Query: 1381 SCHTLPLTYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLP 1440
            SCHTLPLTYLGVPLGGN KSN+FWRNIEDRIQKKL+NWKYAHISKGGRLTLIKSTLSSL 
Sbjct: 1381 SCHTLPLTYLGVPLGGNPKSNIFWRNIEDRIQKKLNNWKYAHISKGGRLTLIKSTLSSLS 1440

Query: 1441 IYQLSVFQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVT 1500
            IYQLSVFQAP STYKNIEK+WRNFLWKGS GLKGSHLINWSIVTK KEEGGLGISRLQV 
Sbjct: 1441 IYQLSVFQAPPSTYKNIEKLWRNFLWKGSFGLKGSHLINWSIVTKLKEEGGLGISRLQVI 1474

Query: 1501 NQALLSKWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFK 1560
            NQALLSKWLWRY+SEPNSLWRRLIHIKYKGKHPGD+PSNISSSSSKAPW+SIINNIDWFK
Sbjct: 1501 NQALLSKWLWRYYSEPNSLWRRLIHIKYKGKHPGDIPSNISSSSSKAPWKSIINNIDWFK 1474

Query: 1561 SNQGWNLNNGDQISFWYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFR 1620
            SNQGW+LNN DQISFWYSNWSPEGCLSTAYPRLFALSIDK+SSIKDVWNSNNNQWEITFR
Sbjct: 1561 SNQGWDLNNEDQISFWYSNWSPEGCLSTAYPRLFALSIDKKSSIKDVWNSNNNQWEITFR 1474

Query: 1621 RKLNDRELSTWQKILENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANP 1645
            RKLNDRELSTWQ ILENL IPRTNRGPSKPTWIPDSKK FSIASAKSCISHQPDR VANP
Sbjct: 1621 RKLNDRELSTWQNILENLSIPRTNRGPSKPTWIPDSKKFFSIASAKSCISHQPDRSVANP 1474

BLAST of Pay0007691 vs. ExPASy TrEMBL
Match: A0A5A7TDG1 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G001050 PE=4 SV=1)

HSP 1 Score: 2303.1 bits (5967), Expect = 0.0e+00
Identity = 1186/1431 (82.88%), Positives = 1208/1431 (84.42%), Query Frame = 0

Query: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
            MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS
Sbjct: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60

Query: 61   LIETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWV 120
            LIETPSSNRFFLENRD EHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEG EKS WV
Sbjct: 61   LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120

Query: 121  SFLSMITPKVE--ETNRGIFY--------------------------------------- 180
            SFLSMITPKVE     R IF                                        
Sbjct: 121  SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180

Query: 181  -----------IPC------------------------------------------LHAE 240
                        PC                                           HAE
Sbjct: 181  ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240

Query: 241  KALVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
            K LVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH
Sbjct: 241  KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300

Query: 301  LWNMMTFQQIGKACGSLVKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
            LWNMMTFQQIGKACG L+KVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV
Sbjct: 301  LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360

Query: 361  VQVVTHSEGKWLMERNIRLHGTFKRQAAASFDDFNPDAEQFLFDGLEAISPDLLNTISGS 420
            VQVVTHSEGKWLMERN+RLHGTFKRQAAASFDDFNPD+EQFLFDGLEAISPDLLNTISGS
Sbjct: 361  VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420

Query: 421  RKSISPEQQSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTINSKLKILSGISNDGSL 480
            RKSISPEQ SALKSVIIKPA+ ATSPTTLNEEVVNDNSLHAT   SKLKILSGISNDGSL
Sbjct: 421  RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480

Query: 481  DKGKQKVDIPSQLTSAFIFDKPKRKVSFNSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRV 540
            DKGKQKVDIPSQLTSAFIF KPKRKVSFNSP+NKTTFFNPDSAP NH     SPEKK+RV
Sbjct: 481  DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANH-----SPEKKKRV 540

Query: 541  SRERSVKKKSLTIQPKSRANQGKGDLITQPLQVVAHDLDASKKGLSLTVDLGNLPALDPS 600
            SRERSVKKKS TIQPK RANQGKG+LITQPLQVVAHDLDASKKGLSLTVDLGNLP LDPS
Sbjct: 541  SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600

Query: 601  KSCEDHHSSDNAEVIDITNTEVVTETPELKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRK 660
            KS EDHHSSDNAEVIDITNTEVV ETPELKMTDP+KSNSSPEVNYRKQKHSHRRRHYYRK
Sbjct: 601  KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660

Query: 661  KEGKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGGWGPGDIKCNMKLLTWNARGLGSP 720
            KE KEKDTNSEAFKNQLVTWLKENGLKLS DTDSSG                      + 
Sbjct: 661  KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSG---------------------ATT 720

Query: 721  SKRALIKNTIISYSPDFVILTETRLKITNKRIINSRGILILWDAQHHSLLSQEEGKFSLS 780
            S  AL      S                      + GILILWDAQHHSLLSQEEGKFSLS
Sbjct: 721  STNALFSQLGSS----------------------AGGILILWDAQHHSLLSQEEGKFSLS 780

Query: 781  ANFLSFNNSWWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWIIG--------------- 840
            ANF SFNNSWWLTGLYGPVKRRERLN W DLHNL HLNSSPWIIG               
Sbjct: 781  ANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWIIGGDLNVVRMREESTAV 840

Query: 841  ------SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHI 900
                  SNMLN+FISNNLLIDPPLTNNRYTWSNLRNPPTFS LDRFLYNS WEILFNPHI
Sbjct: 841  TFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWEILFNPHI 900

Query: 901  TRTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPG 960
            TRTLPR TSDHFPLVCEDSTSTLRWGPAPFRLNSI LNDPEFKRNMERWWELS+QNGHPG
Sbjct: 901  TRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFKRNMERWWELSVQNGHPG 960

Query: 961  FSFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALK 1020
            F FIQRLKSLANLIKPWQKEKF SLT+AKENIIREVD+IDKNELDTPLS EESNRRLALK
Sbjct: 961  FFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNELDTPLSLEESNRRLALK 1020

Query: 1021 AELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTN 1080
            AELNDLSLKESQFWFQRAKKLW+KEGDEN AFFHRICSSRQKRNLIHEIQ+EEGSIQNTN
Sbjct: 1021 AELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTN 1080

Query: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDG 1140
            NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPF EEEIKGVIKSFDG
Sbjct: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFSEEEIKGVIKSFDG 1140

Query: 1141 NKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPK 1200
            NKAPGPDGFPISFFKSYWHLLKEDI+DIFKDFFEKGVINKNMNNTYIALI KKKDYSHPK
Sbjct: 1141 NKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMNNTYIALIEKKKDYSHPK 1200

Query: 1201 DFRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKV 1260
            DFRPISLTTSIYK IAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAIL+ANEALDYWKV
Sbjct: 1201 DFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKV 1260

Query: 1261 KKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQG 1317
            KKIKGFILKLDIEKAFDNLNW+FID VL+K NYP SWRKWIRGCISNVTYSI VNGKPQG
Sbjct: 1261 KKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRGCISNVTYSIIVNGKPQG 1320

BLAST of Pay0007691 vs. ExPASy TrEMBL
Match: A0A5A7UV84 (Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold98G001710 PE=4 SV=1)

HSP 1 Score: 2041.5 bits (5288), Expect = 0.0e+00
Identity = 1086/1652 (65.74%), Positives = 1183/1652 (71.61%), Query Frame = 0

Query: 119  WVSFLSMITPKVEETNRGIFYIPCLHAEKALVHFNSNVPANLLCQNKGWTTVGKYTVRFE 178
            W   L  +  + EE+    F     HAEKALVHFNSN+P NLLCQNKGWTTVGKY+VRFE
Sbjct: 93   WQKILQNLRKQTEES----FTYNAFHAEKALVHFNSNIPENLLCQNKGWTTVGKYSVRFE 152

Query: 179  KWAPASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGSLVKVAEETKTARNLIE 238
            KW+PA HA+PKLIPSYGGWTTF+                             + +  L+E
Sbjct: 153  KWSPAYHATPKLIPSYGGWTTFQ-----------------------------RNSATLVE 212

Query: 239  AKLKIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNIRLHGTFKRQAAASFD 298
                                                                      +D
Sbjct: 213  ----------------------------------------------------------YD 272

Query: 299  DFNPDAEQ----FLFDGLEAISPDLLNTISGSRKSISPEQQSALKSVIIKPARDATSPTT 358
            DF+ + E      LFDG EAISPD L+T S SRKS +P+Q SALKSVIIKP + ATSPT 
Sbjct: 273  DFSTNCESLRRIILFDGSEAISPDFLSTSSRSRKSSTPDQPSALKSVIIKPDKAATSPTY 332

Query: 359  LNEEVVNDNSLHATTINSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFDKPKRKVSF 418
            LNEEVVND++LHAT   S+L+ILSGI NDG LDKGKQKVDI     SA   +KPKRKVSF
Sbjct: 333  LNEEVVNDSNLHATANKSRLEILSGIPNDGVLDKGKQKVDIQLHPNSALNLNKPKRKVSF 392

Query: 419  NSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRVSRERSVKKKSLTIQPKSRANQGKGDLIT 478
            NSP+NKT  FNPDSAP NHS  LSSPEKKQ+VSRERS+KKKS +IQP     Q KG LIT
Sbjct: 393  NSPSNKTNIFNPDSAPANHSLSLSSPEKKQKVSRERSIKKKSSSIQP----IQNKGVLIT 452

Query: 479  QPLQVVAHDLDASKKGLSLTVDLGNLPALDPSKSCEDHHSSDNAEVIDITNTEVVTETPE 538
            QP+QVVAHDL+ASKKGLSL V+LG+LP LDPSKS EDHHSS NAEVIDITNTEVV ETPE
Sbjct: 453  QPIQVVAHDLEASKKGLSLIVNLGDLPVLDPSKSFEDHHSSHNAEVIDITNTEVVPETPE 512

Query: 539  LKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRKKEGKEKDTNSEAFKNQLVTWLKENGLKL 598
            +KM   + SNSS E NYRK KH HRRR+YYRKK  K +                 +GL+ 
Sbjct: 513  MKMPVNENSNSSSEANYRKPKHVHRRRYYYRKKRSKGEG----------------SGLR- 572

Query: 599  STDTDSSGGWGPGDIKCNMKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKIT 658
                   G WG GDI C MKLLTWNARGLGSPSKRALIKN IISYSPDFVILTET LKIT
Sbjct: 573  -------GEWGSGDIYCKMKLLTWNARGLGSPSKRALIKNAIISYSPDFVILTETMLKIT 632

Query: 659  NKRII------------------NSRGILILWDAQHHSLLSQEEGKFSLSAN-FLSFNNS 718
            NKRII                  +S GILILWDAQ HSLLSQEE  FSLSAN FL+ N+S
Sbjct: 633  NKRIIKSFWPSNSINWIVKNASGSSGGILILWDAQSHSLLSQEEAIFSLSANFFLNNNSS 692

Query: 719  WWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWII---------------------GSNM 778
            WWLTGLYGP KRR+R++FWADLHNL HLNS PW +                      S M
Sbjct: 693  WWLTGLYGPDKRRKRIHFWADLHNLQHLNSFPWSLERDLNVIRMREETTSILSSSHSSRM 752

Query: 779  LNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHITRTLPRTTS 838
            LNNFISNNLLIDPPLTNNR+TWSNLRNP TFS +DRFLYNSSWE LF+PH TRTLPR TS
Sbjct: 753  LNNFISNNLLIDPPLTNNRFTWSNLRNPSTFSRIDRFLYNSSWENLFSPHTTRTLPRPTS 812

Query: 839  DHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPGFSFIQRLKS 898
            DHFPLVCEDS   LRWGPAPFRLNSI LNDPEFKRNMERWWE S+QNGHPGFSFIQRLKS
Sbjct: 813  DHFPLVCEDSNPKLRWGPAPFRLNSIALNDPEFKRNMERWWENSVQNGHPGFSFIQRLKS 872

Query: 899  LANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALKAELNDLSLK 958
            LAN IKPWQKEK HSL  AKE IIREVD+IDK ELDTPLSQ+ESNRRLALKAEL+DLSLK
Sbjct: 873  LANHIKPWQKEKLHSLNYAKETIIREVDSIDKKELDTPLSQKESNRRLALKAELSDLSLK 932

Query: 959  ESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNNNISLAFVN 1018
            ESQF                       C                                
Sbjct: 933  ESQF-----------------------C-------------------------------- 992

Query: 1019 HFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGF 1078
                IY+ STK DPLFIENL+WNPI++S+W  LCAPFLEEEIKGVI SFDG KAP PDGF
Sbjct: 993  ----IYKSSTKSDPLFIENLDWNPIEFSEWPHLCAPFLEEEIKGVINSFDGKKAPSPDGF 1052

Query: 1079 PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPKDFRPISLTT 1138
            PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALI KKKDYSHPKDFRPISLTT
Sbjct: 1053 PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIGKKKDYSHPKDFRPISLTT 1112

Query: 1139 SIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKVKKIKGFILK 1198
            SIYKIIAKTLSNRLK TLP TISGNQLAFIKNRQITDAIL+ANEA+DYWKVKKIKGFILK
Sbjct: 1113 SIYKIIAKTLSNRLKTTLPGTISGNQLAFIKNRQITDAILMANEAVDYWKVKKIKGFILK 1172

Query: 1199 LDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGRIKANRGLR 1258
            LDIEK F NLNWDFID+VL KKN+P SWRKWIRGCISNVTYS+ +NG+PQGRIKANRGLR
Sbjct: 1173 LDIEKVFYNLNWDFIDYVLGKKNFPNSWRKWIRGCISNVTYSVIINGRPQGRIKANRGLR 1232

Query: 1259 QGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILLFVEDNDNF 1318
            QGDPLSPFLFVI MDY SRLLSHLE++GAIKGV L N+CNISHILFADDILLFVEDND F
Sbjct: 1233 QGDPLSPFLFVIAMDYFSRLLSHLEASGAIKGVSLNNNCNISHILFADDILLFVEDNDCF 1292

Query: 1319 LNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLPLTYLGVPLG 1378
            LNNL MA+SLFEKASGLKINL KSA+VPVNVS  RA EC S WGISCH+L L+YLGVPLG
Sbjct: 1293 LNNLIMALSLFEKASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLLLSYLGVPLG 1352

Query: 1379 GNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSVFQAPSSTYK 1438
                                                                        
Sbjct: 1353 ------------------------------------------------------------ 1412

Query: 1439 NIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSKWLWRYHSE 1498
                        GS G KGSHLINW+ V K KEEGGLGISRLQVTN+ALLSKWLWRY SE
Sbjct: 1413 ------------GSNGSKGSHLINWTKVFKSKEEGGLGISRLQVTNKALLSKWLWRYFSE 1472

Query: 1499 PNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWNLNNGDQISF 1558
            PN+LWRRLI  KYKGKHPGD+PSN SSSSSKAPWRSII+NIDWFKSNQ W+LNNGDQISF
Sbjct: 1473 PNALWRRLIQCKYKGKHPGDIPSNNSSSSSKAPWRSIIDNIDWFKSNQSWDLNNGDQISF 1494

Query: 1559 WYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFRRKLNDRELSTWQKIL 1618
            WYSNWS EGCLSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR+LNDRE   W+KIL
Sbjct: 1533 WYSNWSQEGCLSTAYPRLFALTLDKEISVKDAWNTIDNQWAINFRRELNDRERCNWEKIL 1494

Query: 1619 ENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANPRVKLLELIWKTHVPM 1678
            E LP PR NRG SKPTWIPD  K FSIASAK  IS Q D+   + RVKLLE+IWK+++PM
Sbjct: 1593 EILPTPRPNRGSSKPTWIPDCNKSFSIASAKILISCQLDQTSGDSRVKLLEIIWKSNIPM 1494

Query: 1679 KIKFFMWCLVQRKLNTMEVIQQRMPNTLLQPNWCVLCKKDSETGAHLFLYCDRVKPLWSF 1727
            KIKFFMWCL+QR+++TMEVIQQRM NTLLQPNWCVLC KD+E+G HLFL CD VKPLWS 
Sbjct: 1653 KIKFFMWCLIQRRISTMEVIQQRMSNTLLQPNWCVLCNKDNESGNHLFLRCDAVKPLWSL 1494

BLAST of Pay0007691 vs. NCBI nr
Match: TYK00493.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 3079.7 bits (7983), Expect = 0.0e+00
Identity = 1568/1910 (82.09%), Positives = 1595/1910 (83.51%), Query Frame = 0

Query: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
            MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS
Sbjct: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60

Query: 61   LIETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWV 120
            LIETPSSNRFFLENRD EHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEG EKS WV
Sbjct: 61   LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120

Query: 121  SFLSMITPKVE--ETNRGIFY--------------------------------------- 180
            SFLSMITPKVE     R IF                                        
Sbjct: 121  SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180

Query: 181  -----------IPC------------------------------------------LHAE 240
                        PC                                           HAE
Sbjct: 181  ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240

Query: 241  KALVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
            K LVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH
Sbjct: 241  KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300

Query: 301  LWNMMTFQQIGKACGSLVKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
            LWNMMTFQQIGKACG L+KVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV
Sbjct: 301  LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360

Query: 361  VQVVTHSEGKWLMERNIRLHGTFKRQAAASFDDFNPDAEQFLFDGLEAISPDLLNTISGS 420
            VQVVTHSEGKWLMERN+RLHGTFKRQAAASFDDFNPD+EQFLFDGLEAISPDLLNTISGS
Sbjct: 361  VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420

Query: 421  RKSISPEQQSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTINSKLKILSGISNDGSL 480
            RKSISPEQ SALKSVIIKPA+ ATSPTTLNEEVVNDNSLHAT   SKLKILSGISNDGSL
Sbjct: 421  RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480

Query: 481  DKGKQKVDIPSQLTSAFIFDKPKRKVSFNSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRV 540
            DKGKQKVDIPSQLTSAFIF KPKRKVSFNSP+NKTTFFNPDSAP NH     SPEKK+RV
Sbjct: 481  DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANH-----SPEKKKRV 540

Query: 541  SRERSVKKKSLTIQPKSRANQGKGDLITQPLQVVAHDLDASKKGLSLTVDLGNLPALDPS 600
            SRERSVKKKS TIQPK RANQGKG+LITQPLQVVAHDLDASKKGLSLTVDLGNLP LDPS
Sbjct: 541  SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600

Query: 601  KSCEDHHSSDNAEVIDITNTEVVTETPELKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRK 660
            KS EDHHSSDNAEVIDITNTEVV ETPELKMTDP+KSNSSPEVNYRKQKHSHRRRHYYRK
Sbjct: 601  KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660

Query: 661  KEGKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGGWGPGDIKCNMKLLTWNARGLGSP 720
            KE KEKDTNSEAFKNQLVTWLKENGLKLS DTDSSG                      + 
Sbjct: 661  KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSG---------------------ATT 720

Query: 721  SKRALIKNTIISYSPDFVILTETRLKITNKRIINSRGILILWDAQHHSLLSQEEGKFSLS 780
            S  AL      S                      + GILILWDAQHHSLLSQEEGKFSLS
Sbjct: 721  STNALFSQLGSS----------------------AGGILILWDAQHHSLLSQEEGKFSLS 780

Query: 781  ANFLSFNNSWWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWIIG--------------- 840
            ANF SFNNSWWLTGLYGPVKRRERLN W DLHNL HLNSSPWIIG               
Sbjct: 781  ANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWIIGGDLNVVRMREESTAV 840

Query: 841  ------SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHI 900
                  SNMLN+FISNNLLIDPPLTNNRYTWSNLRNPPTFS LDRFLYNS WEILFNPHI
Sbjct: 841  TFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWEILFNPHI 900

Query: 901  TRTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPG 960
            TRTLPR TSDHFPLVCEDSTSTLRWGPAPFRLNSI LNDPEFKRNMERWWELS+QNGHPG
Sbjct: 901  TRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFKRNMERWWELSVQNGHPG 960

Query: 961  FSFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALK 1020
            F FIQRLKSLANLIKPWQKEKF SLT+AKENIIREVD+IDKNELDTPLS EESNRRLALK
Sbjct: 961  FFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNELDTPLSLEESNRRLALK 1020

Query: 1021 AELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTN 1080
            AELNDLSLKESQFWFQRAKKLW+KEGDEN AFFHRICSSRQKRNLIHEIQ+EEGSIQNTN
Sbjct: 1021 AELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTN 1080

Query: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDG 1140
            NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPF EEEIKGVIKSFDG
Sbjct: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFSEEEIKGVIKSFDG 1140

Query: 1141 NKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPK 1200
            NKAPGPDGFPISFFKSYWHLLKEDI+DIFKDFFEKGVINKNMNNTYIALI KKKDYSHPK
Sbjct: 1141 NKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMNNTYIALIEKKKDYSHPK 1200

Query: 1201 DFRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKV 1260
            DFRPISLTTSIYK IAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAIL+ANEALDYWKV
Sbjct: 1201 DFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKV 1260

Query: 1261 KKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQG 1320
            KKIKGFILKLDIEKAFDNLNW+FID VL+K NYP SWRKWIRGCISNVTYSI VNGKPQG
Sbjct: 1261 KKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRGCISNVTYSIIVNGKPQG 1320

Query: 1321 RIKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDIL 1380
            RIKANRGLRQGDPLS FLFVI MDYLSRLLSHLESTGAIKG                   
Sbjct: 1321 RIKANRGLRQGDPLSLFLFVIAMDYLSRLLSHLESTGAIKG------------------- 1380

Query: 1381 LFVEDNDNFLNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLP 1440
                                                                GI CHTLP
Sbjct: 1381 ----------------------------------------------------GILCHTLP 1440

Query: 1441 LTYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSV 1500
            LTYLGVPLGGN KSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIY+LSV
Sbjct: 1441 LTYLGVPLGGNPKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYKLSV 1500

Query: 1501 FQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLS 1560
            FQAPSSTYKNIEK+WRNFLWKGS GLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLS
Sbjct: 1501 FQAPSSTYKNIEKLWRNFLWKGSCGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLS 1560

Query: 1561 KWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWN 1620
            KWLWRY+SEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGW+
Sbjct: 1561 KWLWRYYSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWD 1620

Query: 1621 LNNGDQISFWYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFRRKLNDR 1680
            LNNGDQISFWYSNWSPEGCLSTAYPRLFALS+DKESSIKDVWNSNNNQWEITFRRKLNDR
Sbjct: 1621 LNNGDQISFWYSNWSPEGCLSTAYPRLFALSMDKESSIKDVWNSNNNQWEITFRRKLNDR 1680

Query: 1681 ELSTWQKILENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANPRVKLLE 1740
            ELSTWQKILENLPI RTNRGPSKPTWIPDSKK FSIASAKSCISHQPDR VANPRVKLL 
Sbjct: 1681 ELSTWQKILENLPILRTNRGPSKPTWIPDSKKFFSIASAKSCISHQPDRSVANPRVKLLN 1740

Query: 1741 LIWKTHVPMKIKFFMWCLVQRKLNTMEVIQQRMPNTLLQPNWCVLCKKDSETGAHLFLYC 1796
            LIWKTHVPMKIKFFMWCLVQRKLNTMEV       TLLQPNWCVLCKK SETGAHLFL+C
Sbjct: 1741 LIWKTHVPMKIKFFMWCLVQRKLNTMEV------XTLLQPNWCVLCKKKSETGAHLFLHC 1785

BLAST of Pay0007691 vs. NCBI nr
Match: TYJ99315.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 2469.1 bits (6398), Expect = 0.0e+00
Identity = 1252/1780 (70.34%), Positives = 1393/1780 (78.26%), Query Frame = 0

Query: 3    YFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKSLI 62
            +FKSLPRSCK+ERKEFVL LDKY+KHTHYWLTETGAHKAFSIEVSPRDLDWIR TLKSLI
Sbjct: 57   HFKSLPRSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLI 116

Query: 63   ETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWVSF 122
             TP++NRFFLE RDSE  IWIRKTRN KGCTAEIFRVD KNRKSCILVPEGP+KSGWVSF
Sbjct: 117  ATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPDKSGWVSF 176

Query: 123  LSMITPKVE--------------------------------------------------- 182
            LSMITPKVE                                                   
Sbjct: 177  LSMITPKVEVKAKTRPTFLPRTSPDCRLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDS 236

Query: 183  -------------------------------------------ETNRGIFYIPCLHAEKA 242
                                                       +     F     HAEKA
Sbjct: 237  SDSSHSSSNSFCDSPSSDLLENTVVIVRRFFHDDWHKILQNLRKQTEESFTYNAFHAEKA 296

Query: 243  LVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLHLW 302
            LVHF+SN+PANLLCQNKGW+TVGKY+VRFEKW+P  HA+PKLIPSYGGWTTFRGIPLHLW
Sbjct: 297  LVHFSSNIPANLLCQNKGWSTVGKYSVRFEKWSPVYHATPKLIPSYGGWTTFRGIPLHLW 356

Query: 303  NMMTFQQIGKACGSLVKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFVVQ 362
            NMMTFQQIGKAC  L+KVAEET++A+NLIEA++K+RYNYSGFLPA V+IFD EGNKF VQ
Sbjct: 357  NMMTFQQIGKACEGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFFVQ 416

Query: 363  VVTHSEGKWLMERNIRLHGTFKRQAAASFDDFNPDAEQFLFDGLEAISPDLLNTISGSRK 422
            VVTH EGKWL+ERN+RLHGTFKRQAAASFDDFNP++EQF F+G EAISPD L+T S  RK
Sbjct: 417  VVTHPEGKWLIERNVRLHGTFKRQAAASFDDFNPESEQFFFEGSEAISPDFLSTSSDGRK 476

Query: 423  SISPEQQSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTINSKLKILSGISNDGSLDK 482
            S +P+Q SALKSVIIKP R+AT P+ LNEE+VND++LHAT   SKL+ILSGISNDG LDK
Sbjct: 477  SSTPDQPSALKSVIIKPDRNATLPSFLNEELVNDSNLHATANKSKLEILSGISNDGVLDK 536

Query: 483  GKQKVDIPSQLTSAFIFDKPKRKVSFNSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRVSR 542
            GKQKVDI  Q  SA   DK KRKVSFNSP+NKT  FNPDSAP NHSP L+SPEKKQ+VSR
Sbjct: 537  GKQKVDIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHSPSLNSPEKKQKVSR 596

Query: 543  ERSVKKKSLTIQPKSRANQGKGDLITQPLQVVAHDLDASKKGLSLTVDLGNLPALDPSKS 602
            ERS+KKKS + QP S+ANQ KG  ITQP+Q+VAHD DA+KKGLSLTVDLG+LPALDP+KS
Sbjct: 597  ERSIKKKSSSTQPNSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALDPNKS 656

Query: 603  CEDHHSSDNAEVIDITNTEVVTETPELKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRKKE 662
             EDHH+SDNAEV+DITNTEVV ETPE+KM   + SNSS E NYRK KH H+R++YYRKKE
Sbjct: 657  LEDHHNSDNAEVVDITNTEVVPETPEMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKE 716

Query: 663  GKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGGWGPGDIKCNMKLLTWNARGLGSPSK 722
             KEKD +SEAFK QLV+WLK+NGLKLSTDTDSSG     ++     LL     GL   +K
Sbjct: 717  EKEKDPDSEAFKKQLVSWLKKNGLKLSTDTDSSGATTSTNV-----LLNQMNSGLKITNK 776

Query: 723  RALIKNTIISYSPDFVILTETRLKITNKRIINSRGILILWDAQHHSLLSQEEGKFSLSAN 782
            R +IK+   S S +++    +          +S GILILWDAQ+HSLLSQEEG FSLSAN
Sbjct: 777  R-IIKSLWPSNSINWIAKNASG---------SSGGILILWDAQNHSLLSQEEGLFSLSAN 836

Query: 783  FLSFNN-SWWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWIIG---------------- 842
            FL  NN SWWLTGLYGPVKRRER++FWA+LHNL HLNS PWI+G                
Sbjct: 837  FLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVL 896

Query: 843  -----SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHIT 902
                 S MLNNFISNNLLIDPPLTNNR+TWSNLRNPPTFS +DRFLYNSSWE LF+PH T
Sbjct: 897  SSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFSRIDRFLYNSSWENLFSPHTT 956

Query: 903  RTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPGF 962
            RTLPR+TSDHFPLVCEDS   L WGP PFRLNSITL+DPEFKRNM RWWE SIQ G+PGF
Sbjct: 957  RTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPEFKRNMGRWWENSIQAGYPGF 1016

Query: 963  SFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALKA 1022
            SFIQRLKSLAN IKPWQKEK HSLT AKE IIREVD+IDK ELDTPL+QEESNRRLALKA
Sbjct: 1017 SFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRRLALKA 1076

Query: 1023 ELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNN 1082
            +L++LSLKESQFW+QRAKKLW++EGDEN +FFHRICSSRQKR+ IHEIQ+EEGSIQNTNN
Sbjct: 1077 DLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQKRSFIHEIQDEEGSIQNTNN 1136

Query: 1083 NISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGN 1142
            +IS AF+  FSRIYR STK DPLFIENL+WNPI  S+WS LCAPFLE EIKGVI SFDG 
Sbjct: 1137 SISTAFIKFFSRIYRSSTKSDPLFIENLDWNPIASSEWSHLCAPFLEGEIKGVINSFDGK 1196

Query: 1143 KAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPKD 1202
            K PGPDGFPISFFKS+W                                           
Sbjct: 1197 KTPGPDGFPISFFKSHW------------------------------------------- 1256

Query: 1203 FRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKVK 1262
                                 LK TLP+TISGNQLAF+KNRQITDAIL+ANEA+DYWKVK
Sbjct: 1257 ---------------------LKTTLPNTISGNQLAFVKNRQITDAILMANEAVDYWKVK 1316

Query: 1263 KIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGR 1322
            KIKGFILKLDIEKAFDNLN DFID VLEKKN+P  WRKWIRGCISNVTYS+ +NG+PQGR
Sbjct: 1317 KIKGFILKLDIEKAFDNLNLDFIDNVLEKKNFPNPWRKWIRGCISNVTYSVIINGRPQGR 1376

Query: 1323 IKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILL 1382
            IKANRGLRQGDPLSPFLFVI MDYLSRLLSHLES+GAIKGV L  +CNISHILFADDILL
Sbjct: 1377 IKANRGLRQGDPLSPFLFVIAMDYLSRLLSHLESSGAIKGVSLNGNCNISHILFADDILL 1436

Query: 1383 FVEDNDNFLNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLPL 1442
            F+EDND FL NLRMA+SLFE+ASGLKINL KSA+VPVNVS  RA EC S WGISCH+LPL
Sbjct: 1437 FIEDNDCFLKNLRMALSLFERASGLKINLLKSALVPVNVSLKRAKECASFWGISCHSLPL 1496

Query: 1443 TYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSVF 1502
            +YLGVPLGGN KSNLFWRN+ED+IQKKL+NWKYA ISKGGRLTLIKSTLSSLPIYQLSVF
Sbjct: 1497 SYLGVPLGGNPKSNLFWRNVEDKIQKKLNNWKYAQISKGGRLTLIKSTLSSLPIYQLSVF 1556

Query: 1503 QAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSK 1562
            QAPS T KNIEK+WR FLWKG+ G +GSHLINW+ V+K KEEGGLGISRL VTN+ALLSK
Sbjct: 1557 QAPSLTCKNIEKLWRKFLWKGNNGSEGSHLINWTKVSKSKEEGGLGISRLNVTNKALLSK 1616

Query: 1563 WLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWNL 1622
            WLWRY SEPN+LWRRLI  KYKGK PGD+PSNISSS+SKAPWRSII++ DWFKSNQ W+L
Sbjct: 1617 WLWRYLSEPNALWRRLIQCKYKGKFPGDIPSNISSSTSKAPWRSIIDSTDWFKSNQSWDL 1676

Query: 1623 NNGDQISFWYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFRRKLNDRE 1667
            NNGDQISFWYSNWS EG LSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR+LNDRE
Sbjct: 1677 NNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDRE 1736

BLAST of Pay0007691 vs. NCBI nr
Match: TYK05808.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 2450.6 bits (6350), Expect = 0.0e+00
Identity = 1286/1705 (75.43%), Positives = 1325/1705 (77.71%), Query Frame = 0

Query: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
            MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS
Sbjct: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60

Query: 61   LIETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWV 120
            LIETPSSNRFFLENRD EHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSG V
Sbjct: 61   LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGRV 120

Query: 121  SFLSMITPKVE------------------------------------------------- 180
            SFLSMITPKVE                                                 
Sbjct: 121  SFLSMITPKVEVKAKTRPTFLPRSSPEFRLSPPIDYHKRSYEKAVSKGRSSISSDSSDSY 180

Query: 181  ---ETNRGIFYIPC---------LHAEKALVHFNSNVPANLLCQNKGWTTVGKYTVRFEK 240
               ++++     PC              AL+HFNSNVPANLLCQNKGWTTV KY VR   
Sbjct: 181  TSSDSSQSSGNSPCDSPFPVLLENTVVLALIHFNSNVPANLLCQNKGWTTVEKYMVR--- 240

Query: 241  WAPASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGSLVKVAEETKTARNLIEA 300
                                                                        
Sbjct: 241  ------------------------------------------------------------ 300

Query: 301  KLKIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNIRLHGTFKRQAAASFDD 360
                                                                        
Sbjct: 301  ------------------------------------------------------------ 360

Query: 361  FNPDAEQFLFDGLEAISPDLLNTISGSRKSISPEQQSALKSVIIKPARDATSPTTLNEEV 420
                  + LFDGLEAISPDLLNTISGSRKS S EQ SALKSVIIKPARDATSPTTLNEEV
Sbjct: 361  ------KSLFDGLEAISPDLLNTISGSRKSNSREQPSALKSVIIKPARDATSPTTLNEEV 420

Query: 421  VNDNSLHATTINSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFDKPKRKVSFNSPNN 480
            VNDNSLHATTI S+LKILSGISNDGSLDKGKQKVDIPSQLTSAFI+DKPKRKVSFNSP+N
Sbjct: 421  VNDNSLHATTIKSELKILSGISNDGSLDKGKQKVDIPSQLTSAFIYDKPKRKVSFNSPSN 480

Query: 481  KTTFFNPDSAPTNHSPLLSSPEKKQRVSRERSVKKKSLTIQPKSRANQGKGDLITQPLQV 540
            KTTFFN DSAPTNHSP LSSPEKKQRVSRERSVKKKS TIQPKSRANQGKG+LITQPLQV
Sbjct: 481  KTTFFNSDSAPTNHSPPLSSPEKKQRVSRERSVKKKSSTIQPKSRANQGKGELITQPLQV 540

Query: 541  VAHDLDASKKGLSLTVDLGNLPALDPSKSCEDHHSSDNAEVIDITNTEVVTETPELKMTD 600
            VAHDLDASKKGLSLTVDLGNLP LDPSKS EDHHSSDNAEVIDITNTEVV ETPELKMTD
Sbjct: 541  VAHDLDASKKGLSLTVDLGNLPVLDPSKSFEDHHSSDNAEVIDITNTEVVPETPELKMTD 600

Query: 601  PKKSNSSPEVNYRKQKHSHRRRHYYRKKEGKEKDTNSEAFKNQLVTWLKENGLKLSTDTD 660
            P+KSNSSPEVNYRKQKHSHRRRHYYRKKE KEKDTNSEAFKNQLVTWLKENGLKLSTDTD
Sbjct: 601  PEKSNSSPEVNYRKQKHSHRRRHYYRKKEDKEKDTNSEAFKNQLVTWLKENGLKLSTDTD 660

Query: 661  SSGGWGPGDIKCNMKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKITNKRII 720
            SSG                      + S  AL      S S            I    I 
Sbjct: 661  SSG---------------------ATTSTNALFSQLGSSIS-----------WIVKNAID 720

Query: 721  NSRGILILWDAQHHSLLSQEEGKFSLSANFLSFNNSWWLTGLYGPVKRRERLNFWADLHN 780
            +S GILILWDAQHHSLL  +     +                     R E     +  H+
Sbjct: 721  SSGGILILWDAQHHSLLRGDLNVVRM---------------------REESTAVTSSSHS 780

Query: 781  LLHLNSSPWIIGSNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEI 840
                        SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFS LDRFLYNS WE 
Sbjct: 781  ------------SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWET 840

Query: 841  LFNPHITRTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSI 900
            LFNPHITRTL R TSDHFPLVCEDSTSTLRWGPAPFRLNSI LNDP+FKRNMERWWELS+
Sbjct: 841  LFNPHITRTLSRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPKFKRNMERWWELSV 900

Query: 901  QNGHPGFSFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESN 960
            QNGHPGFSFI+RLKSLANLIKPWQKEKFHSLT+AKENIIREVD+IDKNELDTPLSQEESN
Sbjct: 901  QNGHPGFSFIRRLKSLANLIKPWQKEKFHSLTSAKENIIREVDSIDKNELDTPLSQEESN 960

Query: 961  RRLALKAELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEG 1020
            RRLALKAEL+DLSLKESQFWFQRAKKLW+KEGDEN AFFHRICSSRQKRNLIHEIQ+EEG
Sbjct: 961  RRLALKAELSDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEG 1020

Query: 1021 SIQNTNNNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGV 1080
            SIQNTNNNISLAFVNHFS IYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGV
Sbjct: 1021 SIQNTNNNISLAFVNHFSSIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGV 1080

Query: 1081 IKSFDGNKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKK 1140
            IKSFDGNKAPGPDGFPISFFKSYWHLLKEDI+DIFKDFFEKG                  
Sbjct: 1081 IKSFDGNKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKG------------------ 1140

Query: 1141 DYSHPKDFRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEA 1200
                               IIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAIL ANEA
Sbjct: 1141 -------------------IIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILRANEA 1200

Query: 1201 LDYWKVKKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEV 1260
            LDYWKVKKIK FILKLDIEKAFDNLNWDFIDFVL+KKNYP SWRKWIRGCISNVTYSI V
Sbjct: 1201 LDYWKVKKIKSFILKLDIEKAFDNLNWDFIDFVLKKKNYPNSWRKWIRGCISNVTYSIIV 1260

Query: 1261 NGKPQGRIKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHIL 1320
            N KPQ RIKANRGLRQGDPLSPFLFV  MDYLSRLLSHLES+GAIKGVCLANDCNISHIL
Sbjct: 1261 NEKPQDRIKANRGLRQGDPLSPFLFVSAMDYLSRLLSHLESSGAIKGVCLANDCNISHIL 1320

Query: 1321 FADDILLFVEDNDNFLNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGI 1380
            FADDILLFVEDND+FLNNLRMA+SLFEKASGLKINLSKSAMVPVNVSW RALEC SSWGI
Sbjct: 1321 FADDILLFVEDNDHFLNNLRMALSLFEKASGLKINLSKSAMVPVNVSWSRALECASSWGI 1380

Query: 1381 SCHTLPLTYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLP 1440
            SCHTLPLTYLGVPLGGN KSN+FWRNIEDRIQKKL+NWKYAHISKGGRLTLIKSTLSSL 
Sbjct: 1381 SCHTLPLTYLGVPLGGNPKSNIFWRNIEDRIQKKLNNWKYAHISKGGRLTLIKSTLSSLS 1440

Query: 1441 IYQLSVFQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVT 1500
            IYQLSVFQAP STYKNIEK+WRNFLWKGS GLKGSHLINWSIVTK KEEGGLGISRLQV 
Sbjct: 1441 IYQLSVFQAPPSTYKNIEKLWRNFLWKGSFGLKGSHLINWSIVTKLKEEGGLGISRLQVI 1474

Query: 1501 NQALLSKWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFK 1560
            NQALLSKWLWRY+SEPNSLWRRLIHIKYKGKHPGD+PSNISSSSSKAPW+SIINNIDWFK
Sbjct: 1501 NQALLSKWLWRYYSEPNSLWRRLIHIKYKGKHPGDIPSNISSSSSKAPWKSIINNIDWFK 1474

Query: 1561 SNQGWNLNNGDQISFWYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFR 1620
            SNQGW+LNN DQISFWYSNWSPEGCLSTAYPRLFALSIDK+SSIKDVWNSNNNQWEITFR
Sbjct: 1561 SNQGWDLNNEDQISFWYSNWSPEGCLSTAYPRLFALSIDKKSSIKDVWNSNNNQWEITFR 1474

Query: 1621 RKLNDRELSTWQKILENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANP 1645
            RKLNDRELSTWQ ILENL IPRTNRGPSKPTWIPDSKK FSIASAKSCISHQPDR VANP
Sbjct: 1621 RKLNDRELSTWQNILENLSIPRTNRGPSKPTWIPDSKKFFSIASAKSCISHQPDRSVANP 1474

BLAST of Pay0007691 vs. NCBI nr
Match: KAA0039309.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 2303.1 bits (5967), Expect = 0.0e+00
Identity = 1186/1431 (82.88%), Positives = 1208/1431 (84.42%), Query Frame = 0

Query: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60
            MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS
Sbjct: 1    MAYFKSLPRSCKIERKEFVLLLDKYAKHTHYWLTETGAHKAFSIEVSPRDLDWIRSTLKS 60

Query: 61   LIETPSSNRFFLENRDSEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGPEKSGWV 120
            LIETPSSNRFFLENRD EHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEG EKS WV
Sbjct: 61   LIETPSSNRFFLENRDYEHCIWIRKTRNGKGCTAEIFRVDHKNRKSCILVPEGLEKSCWV 120

Query: 121  SFLSMITPKVE--ETNRGIFY--------------------------------------- 180
            SFLSMITPKVE     R IF                                        
Sbjct: 121  SFLSMITPKVEVKAKTRPIFLPRSSPEFRLSPPIDYHKRSYAKAVSEGRSSISSDSSDSY 180

Query: 181  -----------IPC------------------------------------------LHAE 240
                        PC                                           HAE
Sbjct: 181  ASSDSSQSSGNSPCDSPFPVLLENTVVLVRRFFHDDWQKILQNLRKQTEESFTYNAFHAE 240

Query: 241  KALVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300
            K LVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH
Sbjct: 241  KVLVHFNSNVPANLLCQNKGWTTVGKYTVRFEKWAPASHASPKLIPSYGGWTTFRGIPLH 300

Query: 301  LWNMMTFQQIGKACGSLVKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360
            LWNMMTFQQIGKACG L+KVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV
Sbjct: 301  LWNMMTFQQIGKACGGLIKVAEETKTARNLIEAKLKIRYNYSGFLPAYVKIFDQEGNKFV 360

Query: 361  VQVVTHSEGKWLMERNIRLHGTFKRQAAASFDDFNPDAEQFLFDGLEAISPDLLNTISGS 420
            VQVVTHSEGKWLMERN+RLHGTFKRQAAASFDDFNPD+EQFLFDGLEAISPDLLNTISGS
Sbjct: 361  VQVVTHSEGKWLMERNVRLHGTFKRQAAASFDDFNPDSEQFLFDGLEAISPDLLNTISGS 420

Query: 421  RKSISPEQQSALKSVIIKPARDATSPTTLNEEVVNDNSLHATTINSKLKILSGISNDGSL 480
            RKSISPEQ SALKSVIIKPA+ ATSPTTLNEEVVNDNSLHAT   SKLKILSGISNDGSL
Sbjct: 421  RKSISPEQPSALKSVIIKPAKYATSPTTLNEEVVNDNSLHATANKSKLKILSGISNDGSL 480

Query: 481  DKGKQKVDIPSQLTSAFIFDKPKRKVSFNSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRV 540
            DKGKQKVDIPSQLTSAFIF KPKRKVSFNSP+NKTTFFNPDSAP NH     SPEKK+RV
Sbjct: 481  DKGKQKVDIPSQLTSAFIFYKPKRKVSFNSPSNKTTFFNPDSAPANH-----SPEKKKRV 540

Query: 541  SRERSVKKKSLTIQPKSRANQGKGDLITQPLQVVAHDLDASKKGLSLTVDLGNLPALDPS 600
            SRERSVKKKS TIQPK RANQGKG+LITQPLQVVAHDLDASKKGLSLTVDLGNLP LDPS
Sbjct: 541  SRERSVKKKSSTIQPKLRANQGKGNLITQPLQVVAHDLDASKKGLSLTVDLGNLPVLDPS 600

Query: 601  KSCEDHHSSDNAEVIDITNTEVVTETPELKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRK 660
            KS EDHHSSDNAEVIDITNTEVV ETPELKMTDP+KSNSSPEVNYRKQKHSHRRRHYYRK
Sbjct: 601  KSFEDHHSSDNAEVIDITNTEVVPETPELKMTDPEKSNSSPEVNYRKQKHSHRRRHYYRK 660

Query: 661  KEGKEKDTNSEAFKNQLVTWLKENGLKLSTDTDSSGGWGPGDIKCNMKLLTWNARGLGSP 720
            KE KEKDTNSEAFKNQLVTWLKENGLKLS DTDSSG                      + 
Sbjct: 661  KEDKEKDTNSEAFKNQLVTWLKENGLKLSIDTDSSG---------------------ATT 720

Query: 721  SKRALIKNTIISYSPDFVILTETRLKITNKRIINSRGILILWDAQHHSLLSQEEGKFSLS 780
            S  AL      S                      + GILILWDAQHHSLLSQEEGKFSLS
Sbjct: 721  STNALFSQLGSS----------------------AGGILILWDAQHHSLLSQEEGKFSLS 780

Query: 781  ANFLSFNNSWWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWIIG--------------- 840
            ANF SFNNSWWLTGLYGPVKRRERLN W DLHNL HLNSSPWIIG               
Sbjct: 781  ANFSSFNNSWWLTGLYGPVKRRERLNVWEDLHNLHHLNSSPWIIGGDLNVVRMREESTAV 840

Query: 841  ------SNMLNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHI 900
                  SNMLN+FISNNLLIDPPLTNNRYTWSNLRNPPTFS LDRFLYNS WEILFNPHI
Sbjct: 841  TFSSHSSNMLNDFISNNLLIDPPLTNNRYTWSNLRNPPTFSRLDRFLYNSRWEILFNPHI 900

Query: 901  TRTLPRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPG 960
            TRTLPR TSDHFPLVCEDSTSTLRWGPAPFRLNSI LNDPEFKRNMERWWELS+QNGHPG
Sbjct: 901  TRTLPRPTSDHFPLVCEDSTSTLRWGPAPFRLNSIALNDPEFKRNMERWWELSVQNGHPG 960

Query: 961  FSFIQRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALK 1020
            F FIQRLKSLANLIKPWQKEKF SLT+AKENIIREVD+IDKNELDTPLS EESNRRLALK
Sbjct: 961  FFFIQRLKSLANLIKPWQKEKFQSLTSAKENIIREVDSIDKNELDTPLSLEESNRRLALK 1020

Query: 1021 AELNDLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTN 1080
            AELNDLSLKESQFWFQRAKKLW+KEGDEN AFFHRICSSRQKRNLIHEIQ+EEGSIQNTN
Sbjct: 1021 AELNDLSLKESQFWFQRAKKLWLKEGDENSAFFHRICSSRQKRNLIHEIQDEEGSIQNTN 1080

Query: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDG 1140
            NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPF EEEIKGVIKSFDG
Sbjct: 1081 NNISLAFVNHFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFSEEEIKGVIKSFDG 1140

Query: 1141 NKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPK 1200
            NKAPGPDGFPISFFKSYWHLLKEDI+DIFKDFFEKGVINKNMNNTYIALI KKKDYSHPK
Sbjct: 1141 NKAPGPDGFPISFFKSYWHLLKEDILDIFKDFFEKGVINKNMNNTYIALIEKKKDYSHPK 1200

Query: 1201 DFRPISLTTSIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKV 1260
            DFRPISLTTSIYK IAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAIL+ANEALDYWKV
Sbjct: 1201 DFRPISLTTSIYKTIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILMANEALDYWKV 1260

Query: 1261 KKIKGFILKLDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQG 1317
            KKIKGFILKLDIEKAFDNLNW+FID VL+K NYP SWRKWIRGCISNVTYSI VNGKPQG
Sbjct: 1261 KKIKGFILKLDIEKAFDNLNWNFIDLVLKKNNYPNSWRKWIRGCISNVTYSIIVNGKPQG 1320

BLAST of Pay0007691 vs. NCBI nr
Match: KAA0058980.1 (uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa])

HSP 1 Score: 2041.5 bits (5288), Expect = 0.0e+00
Identity = 1086/1652 (65.74%), Positives = 1183/1652 (71.61%), Query Frame = 0

Query: 119  WVSFLSMITPKVEETNRGIFYIPCLHAEKALVHFNSNVPANLLCQNKGWTTVGKYTVRFE 178
            W   L  +  + EE+    F     HAEKALVHFNSN+P NLLCQNKGWTTVGKY+VRFE
Sbjct: 93   WQKILQNLRKQTEES----FTYNAFHAEKALVHFNSNIPENLLCQNKGWTTVGKYSVRFE 152

Query: 179  KWAPASHASPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACGSLVKVAEETKTARNLIE 238
            KW+PA HA+PKLIPSYGGWTTF+                             + +  L+E
Sbjct: 153  KWSPAYHATPKLIPSYGGWTTFQ-----------------------------RNSATLVE 212

Query: 239  AKLKIRYNYSGFLPAYVKIFDQEGNKFVVQVVTHSEGKWLMERNIRLHGTFKRQAAASFD 298
                                                                      +D
Sbjct: 213  ----------------------------------------------------------YD 272

Query: 299  DFNPDAEQ----FLFDGLEAISPDLLNTISGSRKSISPEQQSALKSVIIKPARDATSPTT 358
            DF+ + E      LFDG EAISPD L+T S SRKS +P+Q SALKSVIIKP + ATSPT 
Sbjct: 273  DFSTNCESLRRIILFDGSEAISPDFLSTSSRSRKSSTPDQPSALKSVIIKPDKAATSPTY 332

Query: 359  LNEEVVNDNSLHATTINSKLKILSGISNDGSLDKGKQKVDIPSQLTSAFIFDKPKRKVSF 418
            LNEEVVND++LHAT   S+L+ILSGI NDG LDKGKQKVDI     SA   +KPKRKVSF
Sbjct: 333  LNEEVVNDSNLHATANKSRLEILSGIPNDGVLDKGKQKVDIQLHPNSALNLNKPKRKVSF 392

Query: 419  NSPNNKTTFFNPDSAPTNHSPLLSSPEKKQRVSRERSVKKKSLTIQPKSRANQGKGDLIT 478
            NSP+NKT  FNPDSAP NHS  LSSPEKKQ+VSRERS+KKKS +IQP     Q KG LIT
Sbjct: 393  NSPSNKTNIFNPDSAPANHSLSLSSPEKKQKVSRERSIKKKSSSIQP----IQNKGVLIT 452

Query: 479  QPLQVVAHDLDASKKGLSLTVDLGNLPALDPSKSCEDHHSSDNAEVIDITNTEVVTETPE 538
            QP+QVVAHDL+ASKKGLSL V+LG+LP LDPSKS EDHHSS NAEVIDITNTEVV ETPE
Sbjct: 453  QPIQVVAHDLEASKKGLSLIVNLGDLPVLDPSKSFEDHHSSHNAEVIDITNTEVVPETPE 512

Query: 539  LKMTDPKKSNSSPEVNYRKQKHSHRRRHYYRKKEGKEKDTNSEAFKNQLVTWLKENGLKL 598
            +KM   + SNSS E NYRK KH HRRR+YYRKK  K +                 +GL+ 
Sbjct: 513  MKMPVNENSNSSSEANYRKPKHVHRRRYYYRKKRSKGEG----------------SGLR- 572

Query: 599  STDTDSSGGWGPGDIKCNMKLLTWNARGLGSPSKRALIKNTIISYSPDFVILTETRLKIT 658
                   G WG GDI C MKLLTWNARGLGSPSKRALIKN IISYSPDFVILTET LKIT
Sbjct: 573  -------GEWGSGDIYCKMKLLTWNARGLGSPSKRALIKNAIISYSPDFVILTETMLKIT 632

Query: 659  NKRII------------------NSRGILILWDAQHHSLLSQEEGKFSLSAN-FLSFNNS 718
            NKRII                  +S GILILWDAQ HSLLSQEE  FSLSAN FL+ N+S
Sbjct: 633  NKRIIKSFWPSNSINWIVKNASGSSGGILILWDAQSHSLLSQEEAIFSLSANFFLNNNSS 692

Query: 719  WWLTGLYGPVKRRERLNFWADLHNLLHLNSSPWII---------------------GSNM 778
            WWLTGLYGP KRR+R++FWADLHNL HLNS PW +                      S M
Sbjct: 693  WWLTGLYGPDKRRKRIHFWADLHNLQHLNSFPWSLERDLNVIRMREETTSILSSSHSSRM 752

Query: 779  LNNFISNNLLIDPPLTNNRYTWSNLRNPPTFSHLDRFLYNSSWEILFNPHITRTLPRTTS 838
            LNNFISNNLLIDPPLTNNR+TWSNLRNP TFS +DRFLYNSSWE LF+PH TRTLPR TS
Sbjct: 753  LNNFISNNLLIDPPLTNNRFTWSNLRNPSTFSRIDRFLYNSSWENLFSPHTTRTLPRPTS 812

Query: 839  DHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPGFSFIQRLKS 898
            DHFPLVCEDS   LRWGPAPFRLNSI LNDPEFKRNMERWWE S+QNGHPGFSFIQRLKS
Sbjct: 813  DHFPLVCEDSNPKLRWGPAPFRLNSIALNDPEFKRNMERWWENSVQNGHPGFSFIQRLKS 872

Query: 899  LANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALKAELNDLSLK 958
            LAN IKPWQKEK HSL  AKE IIREVD+IDK ELDTPLSQ+ESNRRLALKAEL+DLSLK
Sbjct: 873  LANHIKPWQKEKLHSLNYAKETIIREVDSIDKKELDTPLSQKESNRRLALKAELSDLSLK 932

Query: 959  ESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQNEEGSIQNTNNNISLAFVN 1018
            ESQF                       C                                
Sbjct: 933  ESQF-----------------------C-------------------------------- 992

Query: 1019 HFSRIYRCSTKKDPLFIENLEWNPIDYSDWSLLCAPFLEEEIKGVIKSFDGNKAPGPDGF 1078
                IY+ STK DPLFIENL+WNPI++S+W  LCAPFLEEEIKGVI SFDG KAP PDGF
Sbjct: 993  ----IYKSSTKSDPLFIENLDWNPIEFSEWPHLCAPFLEEEIKGVINSFDGKKAPSPDGF 1052

Query: 1079 PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPKDFRPISLTT 1138
            PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALI KKKDYSHPKDFRPISLTT
Sbjct: 1053 PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIGKKKDYSHPKDFRPISLTT 1112

Query: 1139 SIYKIIAKTLSNRLKLTLPDTISGNQLAFIKNRQITDAILIANEALDYWKVKKIKGFILK 1198
            SIYKIIAKTLSNRLK TLP TISGNQLAFIKNRQITDAIL+ANEA+DYWKVKKIKGFILK
Sbjct: 1113 SIYKIIAKTLSNRLKTTLPGTISGNQLAFIKNRQITDAILMANEAVDYWKVKKIKGFILK 1172

Query: 1199 LDIEKAFDNLNWDFIDFVLEKKNYPTSWRKWIRGCISNVTYSIEVNGKPQGRIKANRGLR 1258
            LDIEK F NLNWDFID+VL KKN+P SWRKWIRGCISNVTYS+ +NG+PQGRIKANRGLR
Sbjct: 1173 LDIEKVFYNLNWDFIDYVLGKKNFPNSWRKWIRGCISNVTYSVIINGRPQGRIKANRGLR 1232

Query: 1259 QGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDCNISHILFADDILLFVEDNDNF 1318
            QGDPLSPFLFVI MDY SRLLSHLE++GAIKGV L N+CNISHILFADDILLFVEDND F
Sbjct: 1233 QGDPLSPFLFVIAMDYFSRLLSHLEASGAIKGVSLNNNCNISHILFADDILLFVEDNDCF 1292

Query: 1319 LNNLRMAISLFEKASGLKINLSKSAMVPVNVSWLRALECTSSWGISCHTLPLTYLGVPLG 1378
            LNNL MA+SLFEKASGLKINL KSA+VPVNVS  RA EC S WGISCH+L L+YLGVPLG
Sbjct: 1293 LNNLIMALSLFEKASGLKINLLKSALVPVNVSLNRAKECASFWGISCHSLLLSYLGVPLG 1352

Query: 1379 GNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTLSSLPIYQLSVFQAPSSTYK 1438
                                                                        
Sbjct: 1353 ------------------------------------------------------------ 1412

Query: 1439 NIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRLQVTNQALLSKWLWRYHSE 1498
                        GS G KGSHLINW+ V K KEEGGLGISRLQVTN+ALLSKWLWRY SE
Sbjct: 1413 ------------GSNGSKGSHLINWTKVFKSKEEGGLGISRLQVTNKALLSKWLWRYFSE 1472

Query: 1499 PNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNIDWFKSNQGWNLNNGDQISF 1558
            PN+LWRRLI  KYKGKHPGD+PSN SSSSSKAPWRSII+NIDWFKSNQ W+LNNGDQISF
Sbjct: 1473 PNALWRRLIQCKYKGKHPGDIPSNNSSSSSKAPWRSIIDNIDWFKSNQSWDLNNGDQISF 1494

Query: 1559 WYSNWSPEGCLSTAYPRLFALSIDKESSIKDVWNSNNNQWEITFRRKLNDRELSTWQKIL 1618
            WYSNWS EGCLSTAYPRLFAL++DKE S+KD WN+ +NQW I FRR+LNDRE   W+KIL
Sbjct: 1533 WYSNWSQEGCLSTAYPRLFALTLDKEISVKDAWNTIDNQWAINFRRELNDRERCNWEKIL 1494

Query: 1619 ENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPDRPVANPRVKLLELIWKTHVPM 1678
            E LP PR NRG SKPTWIPD  K FSIASAK  IS Q D+   + RVKLLE+IWK+++PM
Sbjct: 1593 EILPTPRPNRGSSKPTWIPDCNKSFSIASAKILISCQLDQTSGDSRVKLLEIIWKSNIPM 1494

Query: 1679 KIKFFMWCLVQRKLNTMEVIQQRMPNTLLQPNWCVLCKKDSETGAHLFLYCDRVKPLWSF 1727
            KIKFFMWCL+QR+++TMEVIQQRM NTLLQPNWCVLC KD+E+G HLFL CD VKPLWS 
Sbjct: 1653 KIKFFMWCLIQRRISTMEVIQQRMSNTLLQPNWCVLCNKDNESGNHLFLRCDAVKPLWSL 1494
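Each alignment block above pairs a Query row with a Sbjct row; the middle row repeats a residue where the two agree and shows "+" for a conservative substitution. As a minimal sketch (the function name and the choice of block are illustrative assumptions, not part of the database output), the identical positions in one block can be recounted directly from the two sequence rows:

```python
def count_identities(query, sbjct, gap="-"):
    """Return (identical columns, aligned columns) for two equal-length rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # A column counts as identical only if both rows carry the same residue
    # and that residue is not the gap character.
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != gap)
    return identical, len(query)


# Residues copied from the block covering Query 1079..1138 above.
query = "PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPKDFRPISLTT"
sbjct = "PISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIGKKKDYSHPKDFRPISLTT"

identical, columns = count_identities(query, sbjct)
print(f"{identical}/{columns} identical ({100.0 * identical / columns:.2f}%)")
```

Counting from the sequence rows rather than from the middle row avoids depending on whitespace that is easily lost when the page is copied.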

BLAST of Pay0007691 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein)

HSP 1 Score: 120.6 bits (301), Expect = 1.3e-26
Identity = 99/375 (26.40%), Positives = 162/375 (43.20%), Query Frame = 0

Query: 731  GSNMLNNFISNNLLIDPPLTNNRYTWSNLRNP-PTFSHLDRFLYNSSWEILFNPHITRTL 790
            G     N + ++ L+D P     YTWSN ++  P    LDR + N  W   F   I    
Sbjct: 248  GLEEFQNCLRDSDLVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSFPSAIAVFE 307

Query: 791  PRTTSDHFPLVCEDSTSTLRWGPAPFRLNSITLNDPEFKRNMERWWELSIQNGHPGFSFI 850
                SDH P +        R     FR  S     P F  ++   WE  I  G   FS  
Sbjct: 308  LSGVSDHSPCIIILENLPKR-SKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFSLG 367

Query: 851  QRLKSLANLIKPWQKEKFHSLTTAKENIIREVDAIDKNELDTPLSQEESNRRLALKAELN 910
            + LK+     K   ++ F ++    +  +  +++I    L  P         +A K + N
Sbjct: 368  EHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVARK-KWN 427

Query: 911  DLSLKESQFWFQRAKKLWIKEGDENYAFFHRICSSRQKRNLIHEIQ-NEEGSIQNTNNNI 970
              +     F+ Q+++  W+++GD N  FFH++  + Q +NLI  ++ +++  ++N     
Sbjct: 428  FFAAALESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVK 487

Query: 971  SLAFVNHFSRIYRCSTKKDPLFIENL-EWNPIDYSDW--SLLCAPFLEEEIKGVIKSFDG 1030
             +    +   +   S    P  ++ + + +P   +D   S L A   ++EI   + +   
Sbjct: 488  EMIVAYYTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPR 547

Query: 1031 NKAPGPDGFPISFFKSYWHLLKEDIMDIFKDFFEKGVINKNMNNTYIALIAKKKDYSHPK 1090
            NKAPGPD F   FF   W ++K+  +   K+FF  G + K  N T I LI K        
Sbjct: 548  NKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVTGVDQLS 607

Query: 1091 DFRPISLTTSIYKII 1101
             FRP+S  T +YKII
Sbjct: 608  MFRPVSCCTVVYKII 620

BLAST of Pay0007691 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein)

HSP 1 Score: 102.1 bits (253), Expect = 4.9e-21
Identity = 93/395 (23.54%), Positives = 157/395 (39.75%), Query Frame = 0

Query: 1377 SLPIYQLSVFQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISRL 1436
            +LP Y ++ F  P +  K I  +  +F W+     KG H   W  ++  K EGG+G   +
Sbjct: 2    ALPTYTMACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDI 61

Query: 1437 QVTNQALLSKWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNID 1496
            +  N ALL K +WR  S P SL  ++   +Y   H  D  +    S     W+SI  + +
Sbjct: 62   EAFNLALLGKQMWRMLSRPESLMAKVFKSRY--FHKSDPLNAPLGSRPSFVWKSIHASQE 121

Query: 1497 WFKSNQGWNLNNGDQISFWYSNWSPEGCLSTAY------PRLFALSIDKESSIKDVWNSN 1556
              +      + NG+ I  W   W      S A       P+ +A S+     + D+ + +
Sbjct: 122  ILRQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYA-SVSSILKVSDLIDES 181

Query: 1557 NNQW-----EITF---RRKLNDRELSTWQKILENLPIPRTNRGP---SKPTWIPDSKKLF 1616
              +W     E+ F    RKL        ++IL++     T+ G        W+     L 
Sbjct: 182  GREWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWV-----LT 241

Query: 1617 SIASAKSCISHQPDRPVANPRVKLLELIWKTHVPMKIKFFMWCLVQRKLNTMEVIQQRMP 1676
             I + +S    +   P  NP   + + IWK+    KI+ F+W  +   L     +  R  
Sbjct: 242  QIINKRSS-PQEVSEPSLNP---IYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYR-- 301

Query: 1677 NTLLQPNWCVLCKKDSETGAHLFLYCDRVKPLWSFLHRSLNFAPI------SDDFEAMFS 1736
              L + + C+ C    ET  HL   C   +  W     +++  PI      +D       
Sbjct: 302  -HLSKESACIRCPSCKETVNHLLFKCTFARLTW-----AISSIPIPLGGEWADSIYVNLY 361

Query: 1737 FFLSLNQSLPKHKVVLCGLIAILWGIWTERNNRIF 1749
            +  +L    P+ +     +  +LW +W  RN  +F
Sbjct: 362  WVFNLGNGNPQWEKASQLVPWLLWRLWKNRNELVF 376

BLAST of Pay0007691 vs. TAIR 10
Match: AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 98.6 bits (244), Expect = 5.4e-20
Identity = 90/375 (24.00%), Positives = 153/375 (40.80%), Query Frame = 0

Query: 1316 SWGISCHTLPLTYLGVPLGGNQKSNLFWRNIEDRIQKKLSNWKYAHISKGGRLTLIKSTL 1375
            S+  +   LP+ YLG+PL   + +   +  + ++I+ ++  W   H+S  GRL LI S +
Sbjct: 15   SFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVI 74

Query: 1376 SSLPIYQLSVFQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKEEGGLGISR 1435
             SL  + +S F+ PS+  K I+ I  +FLW G         + WS V  PK+EGGLGI  
Sbjct: 75   HSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRS 134

Query: 1436 LQVTNQALLSKWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNI 1495
            L+  N+               S W               +  N +  S    W+ I+ + 
Sbjct: 135  LKEANK--------------GSFW--------------SISGNTTLGSWM--WKKILKHR 194

Query: 1496 DWFKSNQGWNLNNGDQISFWYSNWSPEGCL--STAYPRLFALSIDKESSIKDVWNSNNNQ 1555
                     +++NG   SFW+ NWS  G L   T +     + I   +S+ +   ++   
Sbjct: 195  ALASGFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAEAVVNHRP- 254

Query: 1556 WEITFRRKLNDRELSTWQKILENLPIPRTNRGPSKPTWIPDSKKLFSIASAKSCISHQPD 1615
                 RR  +D  L   + ++  +       G     W   +  +F     K C + +  
Sbjct: 255  -----RRHRHDTLLRI-EDVIAEVRHQGLTSGEDTVRW-KGNGDIF-----KPCFNTKET 314

Query: 1616 -RPVANPRVKL--LELIWKTHVPMKIKFFMWCLVQRKLNTMEVIQQRMPNTLLQPNWCVL 1675
                  P++K+   + +W +H   K     W  ++ +L T +   + +       + CVL
Sbjct: 315  WAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGD---RMLSWNAGADSSCVL 343

Query: 1676 CKKDSETGAHLFLYC 1686
            C    ET  HLF  C
Sbjct: 375  CHHLVETRDHLFFTC 343

BLAST of Pay0007691 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 69.7 bits (169), Expect = 2.7e-11
Identity = 31/66 (46.97%), Positives = 46/66 (69.70%), Query Frame = 0

Query: 1199 VNGKPQGRIKANRGLRQGDPLSPFLFVIVMDYLSRLLSHLESTGAIKGVCLANDC-NISH 1258
            +NG PQG +  +RGLRQGDPLSP+LF++  + LS L    +  G + G+ ++N+   I+H
Sbjct: 14   INGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRINH 73

Query: 1259 ILFADD 1264
            +LFADD
Sbjct: 74   LLFADD 79

BLAST of Pay0007691 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 59.7 bits (143), Expect = 2.8e-08
Identity = 40/144 (27.78%), Positives = 64/144 (44.44%), Query Frame = 0

Query: 1377 SLPIYQLSVFQAPSSTYKNIEKIWRNFLWKGSGGLKGSHLINWSIVTKPKE-EGGLGISR 1436
            +LP+Y +S F+      K +      F W      +    + W  + K KE +GGLG   
Sbjct: 2    ALPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRD 61

Query: 1437 LQVTNQALLSKWLWRYHSEPNSLWRRLIHIKYKGKHPGDLPSNISSSSSKAPWRSIINNI 1496
            L   NQALL+K  +R   +P++L  RL+  +Y   H   +  ++ +  S A WRSII+  
Sbjct: 62   LGWFNQALLAKQSFRIIHQPHTLLSRLLRSRY-FPHSSMMECSVGTRPSYA-WRSIIHGR 121

Query: 1497 DWFKSNQGWNLNNGDQISFWYSNW 1520
            +         + +G     W   W
Sbjct: 122  ELLSRGLLRTIGDGIHTKVWLDRW 143
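The percentages in each HSP summary line are the ratio of matching columns to alignment length, for example 99/375 = 26.40%. The sketch below is a hedged illustration of checking that relationship. The regular expression and function names are assumptions made for this example and are not part of any BLAST tool; the pattern tolerates a dropped "i" in the Positives label.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Posi?tives" tolerates a dropped "i" in the label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return the counts and reported percentages, or None if no match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, _pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, as printed in the summary lines."""
    return round(100.0 * count / total, 2)


line = "Identity = 31/66 (46.97%), Positives = 46/66 (69.70%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], percent(stats["identities"], stats["length"]))
print(stats["positive_pct"], percent(stats["positives"], stats["length"]))
```

Recomputing the percentage from the counts is a cheap consistency check when these lines are harvested from many feature pages.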

The following BLAST results are available for this feature:
Match Name    E-value  Identity  Description
P11369        2.7e-48  24.96     LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE...
P08548        3.0e-47  25.41     LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1
O00370        3.0e-47  25.24     LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1
P14381        7.6e-35  23.46     Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV...
P0C2F6        1.2e-32  28.09     Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1...

Match Name    E-value  Identity  Description
A0A5D3BL61    0.0e+00  82.09     LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3BLV7    0.0e+00  70.34     LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3C3M3    0.0e+00  75.43     LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5A7TDG1    0.0e+00  82.88     LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5A7UV84    0.0e+00  65.74     Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1...

Match Name    E-value  Identity  Description
TYK00493.1    0.0e+00  82.09     LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYJ99315.1    0.0e+00  70.34     LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYK05808.1    0.0e+00  75.43     LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
KAA0039309.1  0.0e+00  82.88     LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
KAA0058980.1  0.0e+00  65.74     uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]

Match Name    E-value  Identity  Description
AT1G43760.1   1.3e-26  26.40     DNAse I-like superfamily protein
AT4G29090.1   4.9e-21  23.54     Ribonuclease H-like superfamily protein
AT3G24255.1   5.4e-20  24.00     RNA-directed DNA polymerase (reverse transcriptase)-related family protein
ATMG01250.1   2.7e-11  46.97     RNA-directed DNA polymerase (reverse transcriptase)
ATMG00310.1   2.8e-08  27.78     RNA-directed DNA polymerase (reverse transcriptase)-related family protein
InterPro
Analysis Name: InterPro Annotations of Melon (Payzawat) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                   Source       Source Term     Source Description                    Alignment
None       No IPR available                                  COILS        Coil            Coil                                  coord: 947..967
None       No IPR available                                  MOBIDB_LITE  mobidb-lite     disorder_prediction                   coord: 532..575
None       No IPR available                                  MOBIDB_LITE  mobidb-lite     disorder_prediction                   coord: 442..456
None       No IPR available                                  MOBIDB_LITE  mobidb-lite     disorder_prediction                   coord: 415..469
None       No IPR available                                  MOBIDB_LITE  mobidb-lite     disorder_prediction                   coord: 415..441
None       No IPR available                                  PANTHER      PTHR33116:SF38  OS01G0158850 PROTEIN                  coord: 755..1695
None       No IPR available                                  PANTHER      PTHR33116       REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATED  coord: 755..1695
None       No IPR available                                  CDD          cd01650         RT_nLTR_like                          coord: 1070..1333; e-value: 1.17286E-56; score: 194.047
IPR036691  Endonuclease/exonuclease/phosphatase superfamily  GENE3D       3.60.10.10      Endonuclease/exonuclease/phosphatase  coord: 609..802; e-value: 2.6E-19; score: 71.9
IPR036691  Endonuclease/exonuclease/phosphatase superfamily  SUPERFAMILY  56219           DNase I-like                          coord: 612..802
IPR026960  Reverse transcriptase zinc-binding domain         PFAM         PF13966         zf-RVT                                coord: 1618..1692; e-value: 1.2E-14; score: 54.7
IPR025558  Domain of unknown function DUF4283                PFAM         PF14111         DUF4283                               coord: 146..243; e-value: 2.1E-13; score: 50.0
IPR000477  Reverse transcriptase domain                      PFAM         PF00078         RVT_1                                 coord: 1077..1333; e-value: 3.1E-47; score: 160.9
IPR000477  Reverse transcriptase domain                      PROSITE      PS50878         RT_POL                                coord: 1056..1333; score: 19.904079
IPR043502  DNA/RNA polymerase superfamily                    SUPERFAMILY  56672           DNA/RNA polymerases                   coord: 1018..1303
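
The "coord" fields above give the span of each signature on the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the signatures can be ordered along the protein and checked for nesting. The row selection and all function names below are illustrative only:

```python
import re

COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

# A hand-copied subset of the rows above: (source term and description, alignment field).
ROWS = [
    ("PF00078 RVT_1", "coord: 1077..1333"),
    ("PF13966 zf-RVT", "coord: 1618..1692"),
    ("PF14111 DUF4283", "coord: 146..243"),
    ("3.60.10.10 Endonuclease/exonuclease/phosphatase", "coord: 609..802"),
]


def parse_coord(text):
    """Extract (start, end) from a 'coord: start..end' field."""
    m = COORD.search(text)
    if m is None:
        raise ValueError(f"no coordinates in {text!r}")
    return int(m.group(1)), int(m.group(2))


def domain_layout(rows):
    """Return (name, start, end, length) tuples sorted by start position."""
    layout = []
    for name, field in rows:
        start, end = parse_coord(field)
        # Length assumes inclusive coordinates (see the note above).
        layout.append((name, start, end, end - start + 1))
    return sorted(layout, key=lambda d: d[1])


def contains(outer, inner):
    """True if the inner (start, end) span lies within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, start, end, length in domain_layout(ROWS):
    print(f"{start:>5}..{end:<5} {length:>4} aa  {name}")

# The Pfam RVT_1 hit lies inside the PROSITE RT_POL profile span.
print(contains(parse_coord("coord: 1056..1333"), parse_coord("coord: 1077..1333")))
```

Sorting by start position places the endonuclease-like region ahead of the reverse transcriptase domain, with the zinc-binding domain near the C-terminus.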

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Pay0007691.1  Pay0007691.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0007165      signal transduction
molecular_function  GO:0003953      NAD+ nucleosidase activity