IVF0023520 (gene) Melon (IVF77) v1

Overview
NameIVF0023520
Typegene
OrganismCucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Locationchr08: 10114889 .. 10119603 (+)
RNA-Seq ExpressionIVF0023520
SyntenyIVF0023520
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGAAACGATCCTTGCCGCTCGTCAGGTACGCTCTCATGAAAGTATATGTATAACTCTACATGCATTACCTTACCTAAGTTAAACTGCAAATTCAATTACCATTTGTGACTAAAGGAATCGTTGGTGGTTGTTAGGGAAATGCCACCAAGGAGAGGTGCACGAAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCAGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGCGCCAGTTACTCATGCGGACCTAGCCGCCATGGAGCAGAGGTTTAGAGATATGATTATGCAAATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGCACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCGGCAGAGGCTAAGCACCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTTGATGGGTCTTTGGAGGACCCCACCAGAGCTCAGATGTGGCTATCGTCCTTGGAGACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAAGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCCTGGTGGGAGACTACAGAGAGGATGCTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAATTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGACAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGACAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTCTGGATATGCTGGACTTTGATGTAATCCTGGGTATGGATTGGCTGGCCGCTAACCACGCCAGCATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGTTGGCCAGTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGCGTGGTGGATACTAGAGAGGCGGTGTATCCCTGTCATCAGAACCAGTAGTGAGGGACTATCCGGACGTTTTTCCTGAGGAACTTCCAGGGTTACCTCCGCACAGGGAGGTTGAGTTTGCCATAGAGTTGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTCTTATTCGTTAAGAAGAAGGACGGATCGATGCGTCTGTGCATTGACTATAGGGAGTTGAACAAAGTAACAGTAAAGAACAGATATCCCTTACCCAGGATAGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATATACCGAAGACAGCATTTCGATCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTTCTAGACACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTACGCAAAGTTCTCGAAGTGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGCAGTTTTCTGGGTTTAGCGGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTAGTTACCGCACCAGTTCTCACGGTACCCGATGGTTCTGGCAATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTCTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTGGAGTTGGCAGCAGTGGTTTTTGCTTTGAAGATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGACTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCCCTTAGTAGGAAAGTATCACATTCAGCAGCACTCATTACCCGGCAGGCCCCATTGCATCGGGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCTGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGACTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTGTCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGAGTTTATTGGTGGCGTAACATGAAGAGAGAGGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGCCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTACCGAGAACTCTGAGGGGTTTTACAGTGATCTGGGTTGTGGTGGACAGACTTACTAAGTCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGGGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTGCATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCCCTGTACGGCAGATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCAATTCAGAAGATTAGATCACGCATGCATACCGCTCAGAATAGACAGAAGAGTTATTCAGATGTGAGGCGGAAGGACCTTGAGTTTGAGATAGGGGATAAGGTGTTCTTAAAGGTAGCACCTATGAAAGGTGTCTTGCGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTACCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATGATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGACCCATCCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGTTGAACAACCTGTTGAGGTGCTGGCTAGAGAGGTGAAGACGTTGAGAAATAAACAAATTCCCCTAGTTAAAGTCTTATGGCGGAATCACCGGGTAGAAGAGGCTACATGGGAGCGTGAAGACGACATGAGATCCCGTTATCCCGAACTGTTCGGGGAATAA

mRNA sequence

ATGGCTGAAACGATCCTTGCCGCTCGTCAGGAATCGTTGGTGGTTGTTAGGGAAATGCCACCAAGGAGAGGTGCACGAAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCAGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGCGCCAGTTACTCATGCGGACCTAGCCGCCATGGAGCAGAGGTTTAGAGATATGATTATGCAAATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGCACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCGGCAGAGGCTAAGCACCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTTGATGGGTCTTTGGAGGACCCCACCAGAGCTCAGATGTGGCTATCGTCCTTGGAGACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAAGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCCTGGTGGGAGACTACAGAGAGGATGCTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAATTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGACAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGACAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTCTGGATATGCTGGACTTTGATTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGCGTGGTGGATACTAGAGAGGCGGTAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTCTTATTCGTTAAGAAGAAGGACGGATCGATGCGTCTGTGCATTGACTATAGGGAGTTGAACAAAGTAACAGTAAAGAACAGATATCCCTTACCCAGGATAGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATATACCGAAGACAGCATTTCGATCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTTCTAGACACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTACGCAAAGTTCTCGAAGTGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGCAGTTTTCTGGGTTTAGCGGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTAGTTACCGCACCAGTTCTCACGGTACCCGATGGTTCTGGCAATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTCTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTGGAGTTGGCAGCAGTGGTTTTTGCTTTGAAGATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGACTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCCCTTAGTAGGAAAGTATCACATTCAGCAGCACTCATTACCCGGCAGGCCCCATTGCATCGGGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCTGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGACTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTGTCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGAGTTTATTGGTGGCGTAACATGAAGAGAGAGGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGCCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTACCGAGAACTCTGAGGGGTTTTACAGTGATCTGGGTTGTGGTGGACAGACTTACTAAGTCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGGGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTGCATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCCCTGTACGGCAGATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCAATTCAGAAGATTAGATCACGCATGCATACCGCTCAGAATAGACAGAAGAGTTATTCAGATGTGAGGCGGAAGGACCTTGAGTTTGAGATAGGGGATAAGGTGTTCTTAAAGGTAGCACCTATGAAAGGTGTCTTGCGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTACCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATGATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGACCCATCCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGTTGAACAACCTGTTGAGGTGCTGGCTAGAGAGGTGAAGACGTTGAGAAATAAACAAATTCCCCTAGTTAAAGTCTTATGGCGGAATCACCGGGTAGAAGAGGCTACATGGGAGCGTGAAGACGACATGAGATCCCGTTATCCCGAACTGTTCGGGGAATAA

Coding sequence (CDS)

ATGGCTGAAACGATCCTTGCCGCTCGTCAGGAATCGTTGGTGGTTGTTAGGGAAATGCCACCAAGGAGAGGTGCACGAAGGGGTGGCCGAGGAGGCCGAGGAAGGGGAGCAGGACGCGTTCAGCCTGAGGTGCAGCCTGTAGCCCAAGCCCCTGACCCGGCTGCGCCAGTTACTCATGCGGACCTAGCCGCCATGGAGCAGAGGTTTAGAGATATGATTATGCAAATGCGGGAGCAGCAGAAGCCTGCCTCGCCAACTCCGGCGCCAGCTCCAGCACCAGTTCCTGCTCCAGCTCCGGCTCCGGTACCAGTTGCACCCCAGTTTGTGCCGGATCAGTTGTCGGCAGAGGCTAAGCACCTGAGGGATTTCAGGAAGTATAATCCCACGACGTTTGATGGGTCTTTGGAGGACCCCACCAGAGCTCAGATGTGGCTATCGTCCTTGGAGACCATATTCCGTTACATGAAATGCCCTGAGGATCAGAAAGTTCAGTGTGCTGTTTTTATGTTGACTGACAGAGGTACTGCCTGGTGGGAGACTACAGAGAGGATGCTAGGTGGTGATGTGAGTCAGATCACGTGGCAGCAGTTCAAGGAGAGTTTCTATGCGAAATTCTTCTCTGCCAGTTTGAGAGATGCCAAGCGGCAGGAGTTTCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAATTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGACAAGTTTGTTAGAGGCCTCAGACTGGACATTCAGGGTTTGGTCCGAGCTTTCAGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCCAACTCGTCTAAGACCGCTGGTAGAGGTTCGACGTCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAATTTCAGACCAGGTGGTGAGTTTCGCAGCTTCCAGCAGAAACCTTTTGAGGCAGGGGAGGCTGCCAGAGGAAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACCTGCTTTAAGTGCAGGCAAGAGGGTCATACAGCTGACAGATGCCCGTTGAGAGTCACGGGGATCGCGCAGAATCAGGGAGCAGGTGCTCCACATCAGGGTAGAGTCTTTGCTACCAACAGGACTGAGGCTGAGAAGGCAGGCACAGTAGTGACAGGTACGCTCCCAGTGTTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTCGTCACATTCTTTTATCTCTTCCGCATTTGTGTCGCATGCCCGCTTAGAGGTAGAGCCCTTACACCATGTTCTGTCAGTATCTACTCCTTCCGGGGAATGTATGTTGTCGAAGGAAAAGGTGAAGGCATGTCAGATTGAGATAGCAGGCCATGTGATTGAGGTAACGCTGATAGTCTGGATATGCTGGACTTTGATTTTTAAATTTAAGGGAGGAGGGTCAAAGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGCGTGGTGGATACTAGAGAGGCGGTAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTCTTATTCGTTAAGAAGAAGGACGGATCGATGCGTCTGTGCATTGACTATAGGGAGTTGAACAAAGTAACAGTAAAGAACAGATATCCCTTACCCAGGATAGACGATCTATTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATATACCGAAGACAGCATTTCGATCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTTCTAGACACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTACGCAAAGTTCTCGAAGTGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGCAGTTTTCTGGGTTTAGCGGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTAGTTACCGCACCAGTTCTCACGGTACCCGATGGTTCTGGCAATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTCTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTGGAGTTGGCAGCAGTGGTTTTTGCTTTGAAGATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGACTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCCCTTAGTAGGAAAGTATCACATTCAGCAGCACTCATTACCCGGCAGGCCCCATTGCATCGGGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCTGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGACTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTGTCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGAGTTTATTGGTGGCGTAACATGAAGAGAGAGGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGCCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTACCGAGAACTCTGAGGGGTTTTACAGTGATCTGGGTTGTGGTGGACAGACTTACTAAGTCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGGGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTGCATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCCCTGTACGGCAGATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCAATTCAGAAGATTAGATCACGCATGCATACCGCTCAGAATAGACAGAAGAGTTATTCAGATGTGAGGCGGAAGGACCTTGAGTTTGAGATAGGGGATAAGGTGTTCTTAAAGGTAGCACCTATGAAAGGTGTCTTGCGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTACCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATGATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGACCCATCCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGTTGAACAACCTGTTGAGGTGCTGGCTAGAGAGGTGAAGACGTTGAGAAATAAACAAATTCCCCTAGTTAAAGTCTTATGGCGGAATCACCGGGTAGAAGAGGCTACATGGGAGCGTGAAGACGACATGAGATCCCGTTATCCCGAACTGTTCGGGGAATAA

Protein sequence

MAETILAARQESLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMREQQKPASPTPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVWICWTLIFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREAVAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFGE
Homology
BLAST of IVF0023520 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 5.8e-142
Identity = 296/901 (32.85%), Postives = 483/901 (53.61%), Query Frame = 0

Query: 507  SKSLPQVISAIRASKLLSQGTWGILASVVDTREAVAPYRMAPAELKELKVQLQELLDKGF 566
            ++ LP+ I  +     L+Q  +         R  +  Y + P +++ +  ++ + L  G 
Sbjct: 390  TEKLPKPIKGLEFEVELTQENY---------RLPIRNYPLPPGKMQAMNDEINQGLKSGI 449

Query: 567  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSK 626
            IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K
Sbjct: 450  IRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTK 509

Query: 627  IDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF 686
            +DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ 
Sbjct: 510  LDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESH 569

Query: 687  VIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV 746
            V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G 
Sbjct: 570  VVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGF 629

Query: 747  SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK 806
            +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+ 
Sbjct: 630  TPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTP 689

Query: 807  ACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASR 866
                + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S 
Sbjct: 690  TQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSA 749

Query: 867  QLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMR 926
            ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R
Sbjct: 750  KMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKR 809

Query: 927  QRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG 986
              RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I     
Sbjct: 810  LARWQLFLQDFNFEINYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINF--- 869

Query: 987  AVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERR--L 1046
                 + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  +
Sbjct: 870  -----VNQISITDDFKNQVVTEYTNDTKLLNL--LNNEDKRVEENIQLKDGLLINSKDQI 929

Query: 1047 CVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQV 1106
             +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  
Sbjct: 930  LLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQIN 989

Query: 1107 KAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKS 1166
            K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   
Sbjct: 990  KSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTK 1049

Query: 1167 TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD 1226
            + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTD
Sbjct: 1050 SITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTD 1109

Query: 1227 GQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRC-CRS 1286
            GQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S
Sbjct: 1110 GQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS 1169

Query: 1287 PVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDL-EFEIGDKVF 1346
            P+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V 
Sbjct: 1170 PL---ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVM 1229

Query: 1347 LKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRK 1394
            +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L K
Sbjct: 1230 VK-RTKTGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1257

BLAST of IVF0023520 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 5.8e-142
Identity = 296/901 (32.85%), Postives = 483/901 (53.61%), Query Frame = 0

Query: 507  SKSLPQVISAIRASKLLSQGTWGILASVVDTREAVAPYRMAPAELKELKVQLQELLDKGF 566
            ++ LP+ I  +     L+Q  +         R  +  Y + P +++ +  ++ + L  G 
Sbjct: 390  TEKLPKPIKGLEFEVELTQENY---------RLPIRNYPLPPGKMQAMNDEINQGLKSGI 449

Query: 567  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSK 626
            IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K
Sbjct: 450  IRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTK 509

Query: 627  IDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF 686
            +DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ 
Sbjct: 510  LDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESH 569

Query: 687  VIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV 746
            V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G 
Sbjct: 570  VVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGF 629

Query: 747  SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK 806
            +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+ 
Sbjct: 630  TPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTP 689

Query: 807  ACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASR 866
                + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S 
Sbjct: 690  TQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSA 749

Query: 867  QLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMR 926
            ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R
Sbjct: 750  KMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKR 809

Query: 927  QRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG 986
              RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I     
Sbjct: 810  LARWQLFLQDFNFEINYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINF--- 869

Query: 987  AVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERR--L 1046
                 + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  +
Sbjct: 870  -----VNQISITDDFKNQVVTEYTNDTKLLNL--LNNEDKRVEENIQLKDGLLINSKDQI 929

Query: 1047 CVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQV 1106
             +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  
Sbjct: 930  LLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQIN 989

Query: 1107 KAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKS 1166
            K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   
Sbjct: 990  KSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTK 1049

Query: 1167 TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD 1226
            + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTD
Sbjct: 1050 SITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTD 1109

Query: 1227 GQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRC-CRS 1286
            GQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S
Sbjct: 1110 GQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS 1169

Query: 1287 PVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDL-EFEIGDKVF 1346
            P+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V 
Sbjct: 1170 PL---ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVM 1229

Query: 1347 LKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRK 1394
            +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L K
Sbjct: 1230 VK-RTKTGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1257

BLAST of IVF0023520 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 5.8e-142
Identity = 296/901 (32.85%), Postives = 483/901 (53.61%), Query Frame = 0

Query: 507  SKSLPQVISAIRASKLLSQGTWGILASVVDTREAVAPYRMAPAELKELKVQLQELLDKGF 566
            ++ LP+ I  +     L+Q  +         R  +  Y + P +++ +  ++ + L  G 
Sbjct: 390  TEKLPKPIKGLEFEVELTQENY---------RLPIRNYPLPPGKMQAMNDEINQGLKSGI 449

Query: 567  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSK 626
            IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K
Sbjct: 450  IRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTK 509

Query: 627  IDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF 686
            +DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ 
Sbjct: 510  LDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESH 569

Query: 687  VIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV 746
            V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G 
Sbjct: 570  VVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGF 629

Query: 747  SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK 806
            +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+ 
Sbjct: 630  TPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTP 689

Query: 807  ACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASR 866
                + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S 
Sbjct: 690  TQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSA 749

Query: 867  QLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMR 926
            ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R
Sbjct: 750  KMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKR 809

Query: 927  QRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG 986
              RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I     
Sbjct: 810  LARWQLFLQDFNFEINYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINF--- 869

Query: 987  AVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERR--L 1046
                 + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  +
Sbjct: 870  -----VNQISITDDFKNQVVTEYTNDTKLLNL--LNNEDKRVEENIQLKDGLLINSKDQI 929

Query: 1047 CVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQV 1106
             +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  
Sbjct: 930  LLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQIN 989

Query: 1107 KAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKS 1166
            K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   
Sbjct: 990  KSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTK 1049

Query: 1167 TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD 1226
            + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTD
Sbjct: 1050 SITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTD 1109

Query: 1227 GQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRC-CRS 1286
            GQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S
Sbjct: 1110 GQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS 1169

Query: 1287 PVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDL-EFEIGDKVF 1346
            P+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V 
Sbjct: 1170 PL---ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVM 1229

Query: 1347 LKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRK 1394
            +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L K
Sbjct: 1230 VK-RTKTGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1257

BLAST of IVF0023520 vs. ExPASy Swiss-Prot
Match: P0CT36 (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 5.8e-142
Identity = 296/901 (32.85%), Postives = 483/901 (53.61%), Query Frame = 0

Query: 507  SKSLPQVISAIRASKLLSQGTWGILASVVDTREAVAPYRMAPAELKELKVQLQELLDKGF 566
            ++ LP+ I  +     L+Q  +         R  +  Y + P +++ +  ++ + L  G 
Sbjct: 390  TEKLPKPIKGLEFEVELTQENY---------RLPIRNYPLPPGKMQAMNDEINQGLKSGI 449

Query: 567  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSK 626
            IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K
Sbjct: 450  IRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTK 509

Query: 627  IDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF 686
            +DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ 
Sbjct: 510  LDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESH 569

Query: 687  VIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV 746
            V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G 
Sbjct: 570  VVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGF 629

Query: 747  SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK 806
            +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+ 
Sbjct: 630  TPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTP 689

Query: 807  ACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASR 866
                + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S 
Sbjct: 690  TQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSA 749

Query: 867  QLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMR 926
            ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R
Sbjct: 750  KMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKR 809

Query: 927  QRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG 986
              RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I     
Sbjct: 810  LARWQLFLQDFNFEINYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINF--- 869

Query: 987  AVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERR--L 1046
                 + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  +
Sbjct: 870  -----VNQISITDDFKNQVVTEYTNDTKLLNL--LNNEDKRVEENIQLKDGLLINSKDQI 929

Query: 1047 CVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQV 1106
             +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  
Sbjct: 930  LLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQIN 989

Query: 1107 KAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKS 1166
            K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   
Sbjct: 990  KSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTK 1049

Query: 1167 TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD 1226
            + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTD
Sbjct: 1050 SITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTD 1109

Query: 1227 GQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRC-CRS 1286
            GQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S
Sbjct: 1110 GQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS 1169

Query: 1287 PVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDL-EFEIGDKVF 1346
            P+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V 
Sbjct: 1170 PL---ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVM 1229

Query: 1347 LKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRK 1394
            +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L K
Sbjct: 1230 VK-RTKTGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1257

BLAST of IVF0023520 vs. ExPASy Swiss-Prot
Match: P0CT37 (Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-4 PE=3 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 5.8e-142
Identity = 296/901 (32.85%), Postives = 483/901 (53.61%), Query Frame = 0

Query: 507  SKSLPQVISAIRASKLLSQGTWGILASVVDTREAVAPYRMAPAELKELKVQLQELLDKGF 566
            ++ LP+ I  +     L+Q  +         R  +  Y + P +++ +  ++ + L  G 
Sbjct: 390  TEKLPKPIKGLEFEVELTQENY---------RLPIRNYPLPPGKMQAMNDEINQGLKSGI 449

Query: 567  IRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSK 626
            IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K
Sbjct: 450  IRESKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIEQLLAKIQGSTIFTK 509

Query: 627  IDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTF 686
            +DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ 
Sbjct: 510  LDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQYFINTILGEAKESH 569

Query: 687  VIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV 746
            V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G 
Sbjct: 570  VVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQSQVKFIGYHISEKGF 629

Query: 747  SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK 806
            +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+ 
Sbjct: 630  TPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPLNNLLKKDVRWKWTP 689

Query: 807  ACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYASR 866
                + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S 
Sbjct: 690  TQTQAIENIKQCLVSPPVLRHFDFSKKILLETDASDVAVGAVLSQKHDDDKYYPVGYYSA 749

Query: 867  QLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMR 926
            ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R
Sbjct: 750  KMSKAQLNYSVSDKEMLAIIKSLKHWRHYLESTIEPFKILTDHRNLIGRITNESEPENKR 809

Query: 927  QRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG 986
              RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I     
Sbjct: 810  LARWQLFLQDFNFEINYRPGSANHIADALSR-------IVDETEPIPKDSEDNSINF--- 869

Query: 987  AVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERR--L 1046
                 + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  +
Sbjct: 870  -----VNQISITDDFKNQVVTEYTNDTKLLNL--LNNEDKRVEENIQLKDGLLINSKDQI 929

Query: 1047 CVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQV 1106
             +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  
Sbjct: 930  LLPNDTQLTRTIIKKYHEEGKLIHPGIELLTNIILRRFTWKGIRKQIQEYVQNCHTCQIN 989

Query: 1107 KAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKS 1166
            K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   
Sbjct: 990  KSRNHKPYGPLQPIPPSERPWESLSMDFITALPES-SGYNALFVVVDRFSKMAILVPCTK 1049

Query: 1167 TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD 1226
            + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTD
Sbjct: 1050 SITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTD 1109

Query: 1227 GQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRC-CRS 1286
            GQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      S
Sbjct: 1110 GQTERTNQTVEKLLRCVCSTHPNTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS 1169

Query: 1287 PVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDL-EFEIGDKVF 1346
            P+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V 
Sbjct: 1170 PL---ELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVM 1229

Query: 1347 LKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRK 1394
            +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L K
Sbjct: 1230 VK-RTKTGFL--HKSNKLAPSFAGPFYVLQKSGPNNYELDLPDSIKHMFSSTFHVSHLEK 1257

BLAST of IVF0023520 vs. ExPASy TrEMBL
Match: A0A5A7U330 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold4358G00040 PE=4 SV=1)

HSP 1 Score: 2702.9 bits (7005), Expect = 0.0e+00
Identity = 1376/1528 (90.05%), Postives = 1397/1528 (91.43%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMRE
Sbjct: 1    MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMRE 60

Query: 79   QQKPASPTPAPAPAPVPAPAP----APVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 138
            QQKPASPTPAPAPAP PAPAP    APVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61   QQKPASPTPAPAPAPAPAPAPAPALAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 139  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 198
            LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 199  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 258
            QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 259  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 318
            KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQP
Sbjct: 241  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGLTSGQKRKAEQQP 300

Query: 319  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 378
            VPVPQRNFR GGEFR FQQKPFEAGEAAR KPLCT CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301  VPVPQRNFRSGGEFRRFQQKPFEAGEAARWKPLCTICGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 379  TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 438
            TADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEKAGTVVTGTLPVLGHYALVLFD   
Sbjct: 361  TADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEKAGTVVTGTLPVLGHYALVLFD--- 420

Query: 439  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV-- 498
                                    SVSTPSGECMLSKEKVK CQIEIAGHVIEVTL+V  
Sbjct: 421  ------------------------SVSTPSGECMLSKEKVKTCQIEIAGHVIEVTLLVLD 480

Query: 499  -----------WIC------------------WTLIFKFKGGGSKSLPQVISAIRASKLL 558
                       W+                       FKFKGGGS+SLPQVISAIRASKLL
Sbjct: 481  MLDFDVILGMDWLAAHHASIDCSRKEVTFNPPSRASFKFKGGGSRSLPQVISAIRASKLL 540

Query: 559  SQGTWGILASVVDTREA------------------------------------------- 618
            SQGTWGILASVVDTREA                                           
Sbjct: 541  SQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI 600

Query: 619  -VAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 678
              APYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Sbjct: 601  SRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 660

Query: 679  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIV 738
            VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIV
Sbjct: 661  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIV 720

Query: 739  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 798
            MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Sbjct: 721  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 780

Query: 799  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 858
            LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR
Sbjct: 781  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 840

Query: 859  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSD 918
            RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG+FVIYSD
Sbjct: 841  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSD 900

Query: 919  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 978
            ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Sbjct: 901  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 960

Query: 979  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1038
            TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Sbjct: 961  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1020

Query: 1039 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTT 1098
            QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR RIIDAQSNDPYLVEKRGLAEAGQ  
Sbjct: 1021 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAV 1080

Query: 1099 EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1158
            EFSLSSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMK
Sbjct: 1081 EFSLSSDGGLLFERRLCVPSDSVVKTELLSEAHSSPFSMHPGSTKMYRDVKRVYWWRNMK 1140

Query: 1159 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1218
            REVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
Sbjct: 1141 REVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1200

Query: 1219 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1278
            VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWK LQTA
Sbjct: 1201 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKSLQTA 1260

Query: 1279 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1338
            MGTRLDFSTAFHPQTDGQTERLNQVLE MLRACALEFPGSWDSHLHLMEF YNNSYQATI
Sbjct: 1261 MGTRLDFSTAFHPQTDGQTERLNQVLEYMLRACALEFPGSWDSHLHLMEFVYNNSYQATI 1320

Query: 1339 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDV 1398
            GMAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQ+RQKSY+DV
Sbjct: 1321 GMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADV 1380

Query: 1399 RRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1458
            RRKDLEFE+GDKVFLKVAPM+GVLRFERRGKLSPRF+GPFEILERIGPVAYRLALPPSLS
Sbjct: 1381 RRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFIGPFEILERIGPVAYRLALPPSLS 1440

Query: 1459 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKV 1468
            TVHDVFHVSMLRKYVPDPSHVVDY+PL+IDENLSY EQPVEVLAREVKTLRNK+IPLVKV
Sbjct: 1441 TVHDVFHVSMLRKYVPDPSHVVDYKPLKIDENLSYTEQPVEVLAREVKTLRNKEIPLVKV 1500

BLAST of IVF0023520 vs. ExPASy TrEMBL
Match: A0A5A7UAA8 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold55G00800 PE=4 SV=1)

HSP 1 Score: 2695.6 bits (6986), Expect = 0.0e+00
Identity = 1379/1528 (90.25%), Postives = 1390/1528 (90.97%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPE                                   
Sbjct: 1    MPPRRGARRGGRGGRGRGAGRVQPE----------------------------------- 60

Query: 79   QQKPASPTPAPAPAPVPAPAPAP----VPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 138
              KPASPTPAPAPAP PAP PAP    VPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61   --KPASPTPAPAPAPAPAPVPAPAPALVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 139  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 198
            LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 199  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 258
            QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 259  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 318
            KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQE ANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 241  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQEMANSSKTAGRGSTSGQKRKAEQQP 300

Query: 319  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 378
            VPVPQRNFR GGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301  VPVPQRNFRSGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 379  TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 438
            TADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS
Sbjct: 361  TADRCPLRLTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 420

Query: 439  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV-- 498
            SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLS+EKVKACQIEIAGHVIEVTLIV  
Sbjct: 421  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSREKVKACQIEIAGHVIEVTLIVLD 480

Query: 499  -----------WICWT------------------LIFKFKGGGSKSLPQVISAIRASKLL 558
                       W+                       FKFKGGGSKSLPQVISAIRASKLL
Sbjct: 481  MLDFDVILGMDWLAANHASIDCSRKDVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLL 540

Query: 559  SQGTWGILASVVDTREA------------------------------------------- 618
            SQGTWGILASVVDTREA                                           
Sbjct: 541  SQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI 600

Query: 619  -VAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 678
              APYRMAPAELKELKVQLQELLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYRELNK
Sbjct: 601  SRAPYRMAPAELKELKVQLQELLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRELNK 660

Query: 679  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIV 738
            VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIKDED+PKTAFRSRYGHYEFIV
Sbjct: 661  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYNQLRIKDEDVPKTAFRSRYGHYEFIV 720

Query: 739  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 798
            MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Sbjct: 721  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 780

Query: 799  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 858
            LYAKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR
Sbjct: 781  LYAKFSKCEFWLKQVSFLGHVVSKARVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 840

Query: 859  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSD 918
            RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG+FVIYSD
Sbjct: 841  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSD 900

Query: 919  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 978
            ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Sbjct: 901  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 960

Query: 979  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1038
            TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Sbjct: 961  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1020

Query: 1039 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTT 1098
            QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR RIIDAQSNDPYLVEKRGLAEAGQ  
Sbjct: 1021 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAV 1080

Query: 1099 EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1158
            EFSLSSDGGL FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMK
Sbjct: 1081 EFSLSSDGGLSFEGRLCVPSDSAVKTELLFEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1140

Query: 1159 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1218
            REVAEFVSKCLVCQQVK PRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
Sbjct: 1141 REVAEFVSKCLVCQQVKEPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1200

Query: 1219 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1278
            VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA
Sbjct: 1201 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1260

Query: 1279 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1338
            MGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI
Sbjct: 1261 MGTRLDFSTAFHPQADGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1320

Query: 1339 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDV 1398
            GMAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQ+RQKSY+DV
Sbjct: 1321 GMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADV 1380

Query: 1399 RRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1458
            RRKDLEFEI DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS
Sbjct: 1381 RRKDLEFEIRDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1440

Query: 1459 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKV 1468
            TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNK+IPLVKV
Sbjct: 1441 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKV 1491

BLAST of IVF0023520 vs. ExPASy TrEMBL
Match: A0A5A7THE6 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold529G00030 PE=4 SV=1)

HSP 1 Score: 2695.6 bits (6986), Expect = 0.0e+00
Identity = 1371/1495 (91.71%), Postives = 1396/1495 (93.38%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMRE
Sbjct: 168  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMRE 227

Query: 79   QQKPASPTPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 138
            QQKPASPTPAPAPAP PAPA APVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP
Sbjct: 228  QQKPASPTPAPAPAPGPAPASAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 287

Query: 139  TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 198
            TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK
Sbjct: 288  TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 347

Query: 199  ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 258
            ESFYAKFFSASLRDAKRQEFLNL+QGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR
Sbjct: 348  ESFYAKFFSASLRDAKRQEFLNLKQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 407

Query: 259  GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 318
            GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP
Sbjct: 408  GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 467

Query: 319  QRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR 378
            QRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR
Sbjct: 468  QRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR 527

Query: 379  CPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSF 438
            CPLR+TG AQNQ AGAPHQGRVFATN+TEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSF
Sbjct: 528  CPLRLTGNAQNQRAGAPHQGRVFATNKTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSF 587

Query: 439  ISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV--WICW 498
            IS AFV HARLEVEPLHHVLSVSTPSGECMLSKEK+KACQIEIAGHVIEVTLIV   + +
Sbjct: 588  ISPAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKMKACQIEIAGHVIEVTLIVLDMLDF 647

Query: 499  TLIFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE----------------- 558
             +I    G   K + QVISAIRASKLLSQGT GILASVVDTRE                 
Sbjct: 648  NVIL---GRRVKVVAQVISAIRASKLLSQGTSGILASVVDTREVDVSLSSEPVVRDYPDV 707

Query: 559  ---------------------------AVAPYRMAPAELKELKVQLQELLDKGFIRPSVS 618
                                       + APYRMAPAELKELKVQLQELLDKGFIRPS+S
Sbjct: 708  FPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSMS 767

Query: 619  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG 678
            PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG
Sbjct: 768  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG 827

Query: 679  YHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID 738
            YHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Sbjct: 828  YHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID 887

Query: 739  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAK 798
            DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP K
Sbjct: 888  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPVK 947

Query: 799  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSF 858
            IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSF
Sbjct: 948  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSF 1007

Query: 859  QTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPT 918
            Q LKQKLVTAPVL VPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPT
Sbjct: 1008 QNLKQKLVTAPVLIVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPT 1067

Query: 919  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI 978
            HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Sbjct: 1068 HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI 1127

Query: 979  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL 1038
            LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Sbjct: 1128 LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL 1187

Query: 1039 RLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAH 1098
            R RIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSA+KTELLSEAH
Sbjct: 1188 RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAIKTELLSEAH 1247

Query: 1099 SSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIP 1158
            SSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPA         
Sbjct: 1248 SSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA--------- 1307

Query: 1159 EWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVR 1218
                         GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVR
Sbjct: 1308 -------------GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVR 1367

Query: 1219 LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRAC 1278
            LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT+GQTERLNQVLEDMLRAC
Sbjct: 1368 LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTNGQTERLNQVLEDMLRAC 1427

Query: 1279 ALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEVGEQRLMGPELV 1338
            ALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGEQRLMGPELV
Sbjct: 1428 ALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELV 1487

Query: 1339 QSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLS 1398
            +STNEAIQKIRSRMHTAQ+RQKSY+DVRRKDLEFE+GDKVFLKVAPM+GVLRFERRGKLS
Sbjct: 1488 RSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLS 1547

Query: 1399 PRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENL 1458
            PRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENL
Sbjct: 1548 PRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENL 1607

Query: 1459 SYVEQPVEVLAREVKTLRNKQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFGE 1468
            SY EQPVEVLAREVKTLRNK+IP+VKVLWRNHRV EATWEREDDMRSRYPELF E
Sbjct: 1608 SYTEQPVEVLAREVKTLRNKEIPMVKVLWRNHRVGEATWEREDDMRSRYPELFEE 1637

BLAST of IVF0023520 vs. ExPASy TrEMBL
Match: A0A5A7VJE2 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold21G005560 PE=4 SV=1)

HSP 1 Score: 2690.2 bits (6972), Expect = 0.0e+00
Identity = 1374/1526 (90.04%), Postives = 1391/1526 (91.15%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE
Sbjct: 1    MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 79   QQKPAS----PTPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 138
            QQKP S    P PAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61   QQKPVSPNPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 139  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 198
            LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 199  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 258
            QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 259  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 318
            KFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 241  KFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300

Query: 319  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 378
            VPVPQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301  VPVPQRNFRSGGEFRYFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 379  TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 438
            TADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGT+VTGTLPVLGHYALVLF SGS
Sbjct: 361  TADRCPLRLTGIAQNQGAGAPHQGRVFATNRTEAEKAGTIVTGTLPVLGHYALVLFYSGS 420

Query: 439  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV-- 498
            SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV  
Sbjct: 421  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLD 480

Query: 499  -----------WICWT------------------LIFKFKGGGSKSLPQVISAIRASKLL 558
                       W+                       FKFKGGGSKSLPQVISAIRASKLL
Sbjct: 481  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLL 540

Query: 559  SQGTWGILASVVDTREA------------------------------------------- 618
            SQGTWGILASVVDTREA                                           
Sbjct: 541  SQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESGTVPI 600

Query: 619  -VAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 678
              APYRMAPAELK+LKVQLQELLDKG                              ELNK
Sbjct: 601  SRAPYRMAPAELKDLKVQLQELLDKG------------------------------ELNK 660

Query: 679  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIV 738
            VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKT FRSRYGHYEFIV
Sbjct: 661  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTTFRSRYGHYEFIV 720

Query: 739  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 798
            MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRD K
Sbjct: 721  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDYK 780

Query: 799  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 858
            LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYR
Sbjct: 781  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYR 840

Query: 859  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSD 918
            RFVENFSRIATPLTQLTRKG PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG+FVIYSD
Sbjct: 841  RFVENFSRIATPLTQLTRKGVPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSD 900

Query: 919  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 978
            ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Sbjct: 901  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 960

Query: 979  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1038
            TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS ALITR
Sbjct: 961  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSVALITR 1020

Query: 1039 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTT 1098
            QAPLHRDLERAEIAVS+GAVTMQLA+L VQPTLR RIIDAQ NDPYLVEKRGL EAGQT 
Sbjct: 1021 QAPLHRDLERAEIAVSMGAVTMQLARLAVQPTLRQRIIDAQGNDPYLVEKRGLVEAGQTA 1080

Query: 1099 EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1158
            EFSLSSDGGLLFERRLCVPSDSAVK ELLSEAHSSPFSMHPGSTK+YQDLKRVYWWRNMK
Sbjct: 1081 EFSLSSDGGLLFERRLCVPSDSAVKIELLSEAHSSPFSMHPGSTKIYQDLKRVYWWRNMK 1140

Query: 1159 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1218
            REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
Sbjct: 1141 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1200

Query: 1219 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1278
            VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA
Sbjct: 1201 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1260

Query: 1279 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1338
            MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI
Sbjct: 1261 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1320

Query: 1339 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDV 1398
            GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQ+RQKSY+DV
Sbjct: 1321 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADV 1380

Query: 1399 RRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1458
            RRKDLEFE+GDKVFLKVAPM+GV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS
Sbjct: 1381 RRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1440

Query: 1459 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKV 1466
            TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY E+PV+VLAREVKTLRNK+IPLVKV
Sbjct: 1441 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTERPVKVLAREVKTLRNKEIPLVKV 1496

BLAST of IVF0023520 vs. ExPASy TrEMBL
Match: A0A5A7TGX4 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold529G00100 PE=4 SV=1)

HSP 1 Score: 2616.3 bits (6780), Expect = 0.0e+00
Identity = 1337/1509 (88.60%), Postives = 1364/1509 (90.39%), Query Frame = 0

Query: 10   QESLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRF 69
            + SL +VREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRF
Sbjct: 80   KRSLPLVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRF 139

Query: 70   RDMIMQMREQQKPASPTPAPAPAPVPAPA----PAPVPVAPQFVPDQLSAEAKHLRDFRK 129
            RD+IMQMREQQKPASPTPAP+PAP PAPA    PAPVPVAPQFVPDQLSAEAKHLRDFRK
Sbjct: 140  RDLIMQMREQQKPASPTPAPSPAPAPAPAPAPVPAPVPVAPQFVPDQLSAEAKHLRDFRK 199

Query: 130  YNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERML 189
            YNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERML
Sbjct: 200  YNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERML 259

Query: 190  GGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEM 249
            GGDVSQITWQQFKESFYAKFFSASLRDAK+QEFLNLEQGDMTVEQYDAEFDMLSRFAPEM
Sbjct: 260  GGDVSQITWQQFKESFYAKFFSASLRDAKQQEFLNLEQGDMTVEQYDAEFDMLSRFAPEM 319

Query: 250  IATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSG 309
            IATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSK AGRGSTSG
Sbjct: 320  IATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKAAGRGSTSG 379

Query: 310  QKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRT 369
            QKRKAEQ PVPVPQRNFRPGGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLF TRT
Sbjct: 380  QKRKAEQHPVPVPQRNFRPGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFETRT 439

Query: 370  CFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHY 429
            CFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TE EKAGTVVTGTLPVLGHY
Sbjct: 440  CFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEVEKAGTVVTGTLPVLGHY 499

Query: 430  ALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVS-----TPSGECMLSKEKVKACQIE 489
            ALVLFDSGSSHSFISSAFV HARLE+E   HV+ V+         + +L  + + A    
Sbjct: 500  ALVLFDSGSSHSFISSAFVLHARLEIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHAS 559

Query: 490  IAGHVIEVTLIVWICWTLIFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE- 549
            I     EVT       +  FKFKGGGSKSLPQVISAIRASKLL+QGTWGIL SVVDTRE 
Sbjct: 560  IDCSRKEVTFNPPSLAS--FKFKGGGSKSLPQVISAIRASKLLNQGTWGILVSVVDTREV 619

Query: 550  -------------------------------------------AVAPYRMAPAELKELKV 609
                                                       + APYRMAPAELKELKV
Sbjct: 620  DVSLSSEPVVRDYPDVFPDELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKV 679

Query: 610  QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD 669
            QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD
Sbjct: 680  QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD 739

Query: 670  QLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN 729
            QLQGATVFS+IDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN
Sbjct: 740  QLQGATVFSEIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN 799

Query: 730  RVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSF 789
            RVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL DNKLYAKFSKCEFWLKQVSF
Sbjct: 800  RVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLWDNKLYAKFSKCEFWLKQVSF 859

Query: 790  LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLT 849
            LGHVVSKAGVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLT
Sbjct: 860  LGHVVSKAGVSVDPAKIEAVTGWARPSTVSEVRSFLGLAGYYRQFVENFSRIATPLTQLT 919

Query: 850  RKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVV 909
            RKGAPFVWSKACEDSFQ LKQKLVTAPVLTV DGSG+FVIYSDASKKGLGCVLMQQGKVV
Sbjct: 920  RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVRDGSGSFVIYSDASKKGLGCVLMQQGKVV 979

Query: 910  AYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNM 969
            AYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNM
Sbjct: 980  AYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNM 1039

Query: 970  RQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSV 1029
            RQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSV
Sbjct: 1040 RQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSV 1099

Query: 1030 GAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERRLC 1089
            GAVTMQLAQLTVQPTLR RIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFER LC
Sbjct: 1100 GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAAEFSLSSDGGLLFERHLC 1159

Query: 1090 VPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVK 1149
            VPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVCQQVK
Sbjct: 1160 VPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREVAEFVSRCLVCQQVK 1219

Query: 1150 APRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKST 1209
            APRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTK AHFVPGKST
Sbjct: 1220 APRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKLAHFVPGKST 1279

Query: 1210 YTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDG 1269
            YTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDG
Sbjct: 1280 YTASKWAQLYMSEIVRLHGVPVSIVSNRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDG 1339

Query: 1270 QTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPV 1329
            QTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPV
Sbjct: 1340 QTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPV 1399

Query: 1330 CWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDLEFEIGDKVFLKV 1389
            CWGEV                                    DVRRKDLEFE+GDKVFLKV
Sbjct: 1400 CWGEV------------------------------------DVRRKDLEFEVGDKVFLKV 1459

Query: 1390 APMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPD 1449
            APM+GVLRFERRGKLSPRFVGPFEILERIGPVAYRL LPPSLSTVHDV HVSMLRKYVPD
Sbjct: 1460 APMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLVLPPSLSTVHDVIHVSMLRKYVPD 1519

Query: 1450 PSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRNHRVEEATWEREDD 1466
            PSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWRNHRVEEATWEREDD
Sbjct: 1520 PSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDD 1550

BLAST of IVF0023520 vs. NCBI nr
Match: KAA0048687.1 (pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 2695 bits (6986), Expect = 0.0
Identity = 1376/1528 (90.05%), Postives = 1397/1528 (91.43%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMRE
Sbjct: 1    MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMRE 60

Query: 79   QQKPASPTPAPAPAPVPAPAPAP----VPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 138
            QQKPASPTPAPAPAP PAPAPAP    VPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61   QQKPASPTPAPAPAPAPAPAPAPALAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 139  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 198
            LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 199  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 258
            QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 259  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 318
            KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQP
Sbjct: 241  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGLTSGQKRKAEQQP 300

Query: 319  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 378
            VPVPQRNFR GGEFR FQQKPFEAGEAAR KPLCT CGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301  VPVPQRNFRSGGEFRRFQQKPFEAGEAARWKPLCTICGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 379  TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 438
            TADRCPLR+TG AQNQGAGAPHQGRVFATN+TEAEKAGTVVTGTLPVLGHYALVLFDS  
Sbjct: 361  TADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEAEKAGTVVTGTLPVLGHYALVLFDS-- 420

Query: 439  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV-- 498
                                     VSTPSGECMLSKEKVK CQIEIAGHVIEVTL+V  
Sbjct: 421  -------------------------VSTPSGECMLSKEKVKTCQIEIAGHVIEVTLLVLD 480

Query: 499  -----------WICW------------------TLIFKFKGGGSKSLPQVISAIRASKLL 558
                       W+                       FKFKGGGS+SLPQVISAIRASKLL
Sbjct: 481  MLDFDVILGMDWLAAHHASIDCSRKEVTFNPPSRASFKFKGGGSRSLPQVISAIRASKLL 540

Query: 559  SQGTWGILASVVDTREAV------------------------------------------ 618
            SQGTWGILASVVDTREA                                           
Sbjct: 541  SQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI 600

Query: 619  --APYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 678
              APYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK
Sbjct: 601  SRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 660

Query: 679  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIV 738
            VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIV
Sbjct: 661  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIV 720

Query: 739  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 798
            MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Sbjct: 721  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 780

Query: 799  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 858
            LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR
Sbjct: 781  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 840

Query: 859  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSD 918
            RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG+FVIYSD
Sbjct: 841  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSD 900

Query: 919  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 978
            ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Sbjct: 901  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 960

Query: 979  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1038
            TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Sbjct: 961  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1020

Query: 1039 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTT 1098
            QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR RIIDAQSNDPYLVEKRGLAEAGQ  
Sbjct: 1021 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAV 1080

Query: 1099 EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1158
            EFSLSSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMK
Sbjct: 1081 EFSLSSDGGLLFERRLCVPSDSVVKTELLSEAHSSPFSMHPGSTKMYRDVKRVYWWRNMK 1140

Query: 1159 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1218
            REVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
Sbjct: 1141 REVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1200

Query: 1219 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1278
            VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWK LQTA
Sbjct: 1201 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKSLQTA 1260

Query: 1279 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1338
            MGTRLDFSTAFHPQTDGQTERLNQVLE MLRACALEFPGSWDSHLHLMEF YNNSYQATI
Sbjct: 1261 MGTRLDFSTAFHPQTDGQTERLNQVLEYMLRACALEFPGSWDSHLHLMEFVYNNSYQATI 1320

Query: 1339 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDV 1398
            GMAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQ+RQKSY+DV
Sbjct: 1321 GMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADV 1380

Query: 1399 RRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1458
            RRKDLEFE+GDKVFLKVAPM+GVLRFERRGKLSPRF+GPFEILERIGPVAYRLALPPSLS
Sbjct: 1381 RRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFIGPFEILERIGPVAYRLALPPSLS 1440

Query: 1459 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKV 1467
            TVHDVFHVSMLRKYVPDPSHVVDY+PL+IDENLSY EQPVEVLAREVKTLRNK+IPLVKV
Sbjct: 1441 TVHDVFHVSMLRKYVPDPSHVVDYKPLKIDENLSYTEQPVEVLAREVKTLRNKEIPLVKV 1500

BLAST of IVF0023520 vs. NCBI nr
Match: KAA0040689.1 (pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 2689 bits (6970), Expect = 0.0
Identity = 1371/1495 (91.71%), Postives = 1396/1495 (93.38%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRD+IMQMRE
Sbjct: 168  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDLIMQMRE 227

Query: 79   QQKPASPTPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 138
            QQKPASPTPAPAPAP PAPA APVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP
Sbjct: 228  QQKPASPTPAPAPAPGPAPASAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGSLEDP 287

Query: 139  TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 198
            TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK
Sbjct: 288  TRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITWQQFK 347

Query: 199  ESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 258
            ESFYAKFFSASLRDAKRQEFLNL+QGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR
Sbjct: 348  ESFYAKFFSASLRDAKRQEFLNLKQGDMTVEQYDAEFDMLSRFAPEMIATEAARADKFVR 407

Query: 259  GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 318
            GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP
Sbjct: 408  GLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVP 467

Query: 319  QRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR 378
            QRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR
Sbjct: 468  QRNFRSGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGHTADR 527

Query: 379  CPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSF 438
            CPLR+TG AQNQ AGAPHQGRVFATN+TEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSF
Sbjct: 528  CPLRLTGNAQNQRAGAPHQGRVFATNKTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSF 587

Query: 439  ISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV--WICW 498
            IS AFV HARLEVEPLHHVLSVSTPSGECMLSKEK+KACQIEIAGHVIEVTLIV   + +
Sbjct: 588  ISPAFVLHARLEVEPLHHVLSVSTPSGECMLSKEKMKACQIEIAGHVIEVTLIVLDMLDF 647

Query: 499  TLIFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE----------------- 558
             +I    G   K + QVISAIRASKLLSQGT GILASVVDTRE                 
Sbjct: 648  NVIL---GRRVKVVAQVISAIRASKLLSQGTSGILASVVDTREVDVSLSSEPVVRDYPDV 707

Query: 559  ---------------------------AVAPYRMAPAELKELKVQLQELLDKGFIRPSVS 618
                                       + APYRMAPAELKELKVQLQELLDKGFIRPS+S
Sbjct: 708  FPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSMS 767

Query: 619  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG 678
            PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG
Sbjct: 768  PWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG 827

Query: 679  YHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID 738
            YHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID
Sbjct: 828  YHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFID 887

Query: 739  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAK 798
            DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP K
Sbjct: 888  DILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPVK 947

Query: 799  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSF 858
            IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSF
Sbjct: 948  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSF 1007

Query: 859  QTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPT 918
            Q LKQKLVTAPVL VPDGSG+FVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPT
Sbjct: 1008 QNLKQKLVTAPVLIVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPT 1067

Query: 919  HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI 978
            HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Sbjct: 1068 HDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI 1127

Query: 979  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL 1038
            LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL
Sbjct: 1128 LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTL 1187

Query: 1039 RLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAH 1098
            R RIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSA+KTELLSEAH
Sbjct: 1188 RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAIKTELLSEAH 1247

Query: 1099 SSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIP 1158
            SSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGL       
Sbjct: 1248 SSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL------- 1307

Query: 1159 EWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVR 1218
                           PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVR
Sbjct: 1308 ---------------PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVR 1367

Query: 1219 LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRAC 1278
            LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT+GQTERLNQVLEDMLRAC
Sbjct: 1368 LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTNGQTERLNQVLEDMLRAC 1427

Query: 1279 ALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPVCWGEVGEQRLMGPELV 1338
            ALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGEQRLMGPELV
Sbjct: 1428 ALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELV 1487

Query: 1339 QSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLS 1398
            +STNEAIQKIRSRMHTAQ+RQKSY+DVRRKDLEFE+GDKVFLKVAPM+GVLRFERRGKLS
Sbjct: 1488 RSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLS 1547

Query: 1399 PRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENL 1458
            PRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENL
Sbjct: 1548 PRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENL 1607

Query: 1459 SYVEQPVEVLAREVKTLRNKQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFGE 1467
            SY EQPVEVLAREVKTLRNK+IP+VKVLWRNHRV EATWEREDDMRSRYPELF E
Sbjct: 1608 SYTEQPVEVLAREVKTLRNKEIPMVKVLWRNHRVGEATWEREDDMRSRYPELFEE 1637

BLAST of IVF0023520 vs. NCBI nr
Match: KAA0051357.1 (pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 2688 bits (6968), Expect = 0.0
Identity = 1379/1528 (90.25%), Postives = 1390/1528 (90.97%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPE                                   
Sbjct: 1    MPPRRGARRGGRGGRGRGAGRVQPE----------------------------------- 60

Query: 79   QQKPASPTPAPAPAPVPAPAPAP----VPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 138
              KPASPTPAPAPAP PAP PAP    VPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61   --KPASPTPAPAPAPAPAPVPAPAPALVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 139  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 198
            LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 199  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 258
            QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 259  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 318
            KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQE ANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 241  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQEMANSSKTAGRGSTSGQKRKAEQQP 300

Query: 319  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 378
            VPVPQRNFR GGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301  VPVPQRNFRSGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 379  TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 438
            TADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS
Sbjct: 361  TADRCPLRLTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 420

Query: 439  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV-- 498
            SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLS+EKVKACQIEIAGHVIEVTLIV  
Sbjct: 421  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSREKVKACQIEIAGHVIEVTLIVLD 480

Query: 499  -----------WICWT------------------LIFKFKGGGSKSLPQVISAIRASKLL 558
                       W+                       FKFKGGGSKSLPQVISAIRASKLL
Sbjct: 481  MLDFDVILGMDWLAANHASIDCSRKDVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLL 540

Query: 559  SQGTWGILASVVDTREAV------------------------------------------ 618
            SQGTWGILASVVDTREA                                           
Sbjct: 541  SQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPI 600

Query: 619  --APYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 678
              APYRMAPAELKELKVQLQELLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYRELNK
Sbjct: 601  SRAPYRMAPAELKELKVQLQELLDKGFIRPNVSPWGAPVLFVKKKDGSMRLCIDYRELNK 660

Query: 679  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIV 738
            VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIKDED+PKTAFRSRYGHYEFIV
Sbjct: 661  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYNQLRIKDEDVPKTAFRSRYGHYEFIV 720

Query: 739  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 798
            MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK
Sbjct: 721  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 780

Query: 799  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 858
            LYAKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR
Sbjct: 781  LYAKFSKCEFWLKQVSFLGHVVSKARVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 840

Query: 859  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSD 918
            RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG+FVIYSD
Sbjct: 841  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSD 900

Query: 919  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 978
            ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Sbjct: 901  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 960

Query: 979  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1038
            TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR
Sbjct: 961  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1020

Query: 1039 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTT 1098
            QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLR RIIDAQSNDPYLVEKRGLAEAGQ  
Sbjct: 1021 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAV 1080

Query: 1099 EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1158
            EFSLSSDGGL FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMK
Sbjct: 1081 EFSLSSDGGLSFEGRLCVPSDSAVKTELLFEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1140

Query: 1159 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1218
            REVAEFVSKCLVCQQVK PRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
Sbjct: 1141 REVAEFVSKCLVCQQVKEPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1200

Query: 1219 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1278
            VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA
Sbjct: 1201 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1260

Query: 1279 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1338
            MGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI
Sbjct: 1261 MGTRLDFSTAFHPQADGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1320

Query: 1339 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDV 1398
            GMAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQ+RQKSY+DV
Sbjct: 1321 GMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADV 1380

Query: 1399 RRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1458
            RRKDLEFEI DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS
Sbjct: 1381 RRKDLEFEIRDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1440

Query: 1459 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKV 1467
            TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNK+IPLVKV
Sbjct: 1441 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKV 1491

BLAST of IVF0023520 vs. NCBI nr
Match: KAA0066456.1 (pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 2685 bits (6960), Expect = 0.0
Identity = 1374/1526 (90.04%), Postives = 1391/1526 (91.15%), Query Frame = 0

Query: 19   MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 78
            MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE
Sbjct: 1    MPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRFRDMIMQMRE 60

Query: 79   QQKPASPTPAPAPAP----VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 138
            QQKP SP PAPAPAP    VPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS
Sbjct: 61   QQKPVSPNPAPAPAPAPAPVPAPAPAPVPVAPQFVPDQLSAEAKHLRDFRKYNPTTFDGS 120

Query: 139  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 198
            LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW
Sbjct: 121  LEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERMLGGDVSQITW 180

Query: 199  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 258
            QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD
Sbjct: 181  QQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEMIATEAARAD 240

Query: 259  KFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 318
            KFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP
Sbjct: 241  KFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQP 300

Query: 319  VPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 378
            VPVPQRNFR GGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH
Sbjct: 301  VPVPQRNFRSGGEFRYFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRQEGH 360

Query: 379  TADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGS 438
            TADRCPLR+TGIAQNQGAGAPHQGRVFATNRTEAEKAGT+VTGTLPVLGHYALVLF SGS
Sbjct: 361  TADRCPLRLTGIAQNQGAGAPHQGRVFATNRTEAEKAGTIVTGTLPVLGHYALVLFYSGS 420

Query: 439  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV-- 498
            SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIV  
Sbjct: 421  SHSFISSAFVSHARLEVEPLHHVLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLD 480

Query: 499  -----------WICWT------------------LIFKFKGGGSKSLPQVISAIRASKLL 558
                       W+                       FKFKGGGSKSLPQVISAIRASKLL
Sbjct: 481  MLDFDVILGMDWLAANHASIDCSRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLL 540

Query: 559  SQGTWGILASVVDTREAV------------------------------------------ 618
            SQGTWGILASVVDTREA                                           
Sbjct: 541  SQGTWGILASVVDTREADVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESGTVPI 600

Query: 619  --APYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNK 678
              APYRMAPAELK+LKVQLQELLDKG                              ELNK
Sbjct: 601  SRAPYRMAPAELKDLKVQLQELLDKG------------------------------ELNK 660

Query: 679  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIV 738
            VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDED+PKT FRSRYGHYEFIV
Sbjct: 661  VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTTFRSRYGHYEFIV 720

Query: 739  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 798
            MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRD K
Sbjct: 721  MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDYK 780

Query: 799  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 858
            LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYR
Sbjct: 781  LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYR 840

Query: 859  RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSD 918
            RFVENFSRIATPLTQLTRKG PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG+FVIYSD
Sbjct: 841  RFVENFSRIATPLTQLTRKGVPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSD 900

Query: 919  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 978
            ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF
Sbjct: 901  ASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIF 960

Query: 979  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITR 1038
            TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS ALITR
Sbjct: 961  TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSVALITR 1020

Query: 1039 QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTT 1098
            QAPLHRDLERAEIAVS+GAVTMQLA+L VQPTLR RIIDAQ NDPYLVEKRGL EAGQT 
Sbjct: 1021 QAPLHRDLERAEIAVSMGAVTMQLARLAVQPTLRQRIIDAQGNDPYLVEKRGLVEAGQTA 1080

Query: 1099 EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMK 1158
            EFSLSSDGGLLFERRLCVPSDSAVK ELLSEAHSSPFSMHPGSTK+YQDLKRVYWWRNMK
Sbjct: 1081 EFSLSSDGGLLFERRLCVPSDSAVKIELLSEAHSSPFSMHPGSTKIYQDLKRVYWWRNMK 1140

Query: 1159 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1218
            REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV
Sbjct: 1141 REVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWV 1200

Query: 1219 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1278
            VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA
Sbjct: 1201 VVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTA 1260

Query: 1279 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1338
            MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI
Sbjct: 1261 MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATI 1320

Query: 1339 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDV 1398
            GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQ+RQKSY+DV
Sbjct: 1321 GMAPFEALYGRCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADV 1380

Query: 1399 RRKDLEFEIGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1458
            RRKDLEFE+GDKVFLKVAPM+GV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS
Sbjct: 1381 RRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLS 1440

Query: 1459 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKV 1465
            TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY E+PV+VLAREVKTLRNK+IPLVKV
Sbjct: 1441 TVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTERPVKVLAREVKTLRNKEIPLVKV 1496

BLAST of IVF0023520 vs. NCBI nr
Match: KAA0040695.1 (pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 2610 bits (6765), Expect = 0.0
Identity = 1337/1509 (88.60%), Postives = 1364/1509 (90.39%), Query Frame = 0

Query: 10   QESLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRF 69
            + SL +VREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRF
Sbjct: 80   KRSLPLVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAQAPDPAAPVTHADLAAMEQRF 139

Query: 70   RDMIMQMREQQKPASPTPAPAPAPVPAPAPAPVP----VAPQFVPDQLSAEAKHLRDFRK 129
            RD+IMQMREQQKPASPTPAP+PAP PAPAPAPVP    VAPQFVPDQLSAEAKHLRDFRK
Sbjct: 140  RDLIMQMREQQKPASPTPAPSPAPAPAPAPAPVPAPVPVAPQFVPDQLSAEAKHLRDFRK 199

Query: 130  YNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERML 189
            YNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERML
Sbjct: 200  YNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWETTERML 259

Query: 190  GGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLSRFAPEM 249
            GGDVSQITWQQFKESFYAKFFSASLRDAK+QEFLNLEQGDMTVEQYDAEFDMLSRFAPEM
Sbjct: 260  GGDVSQITWQQFKESFYAKFFSASLRDAKQQEFLNLEQGDMTVEQYDAEFDMLSRFAPEM 319

Query: 250  IATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSG 309
            IATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSK AGRGSTSG
Sbjct: 320  IATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKAAGRGSTSG 379

Query: 310  QKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRT 369
            QKRKAEQ PVPVPQRNFRPGGEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLF TRT
Sbjct: 380  QKRKAEQHPVPVPQRNFRPGGEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFETRT 439

Query: 370  CFKCRQEGHTADRCPLRVTGIAQNQGAGAPHQGRVFATNRTEAEKAGTVVTGTLPVLGHY 429
            CFKCRQEGHTADRCPLR+TG AQNQGAGAPHQGRVFATN+TE EKAGTVVTGTLPVLGHY
Sbjct: 440  CFKCRQEGHTADRCPLRLTGNAQNQGAGAPHQGRVFATNKTEVEKAGTVVTGTLPVLGHY 499

Query: 430  ALVLFDSGSSHSFISSAFVSHARLEVEPLHHVLSVSTP-----SGECMLSKEKVKACQIE 489
            ALVLFDSGSSHSFISSAFV HARLE+E   HV+ V+         + +L  + + A    
Sbjct: 500  ALVLFDSGSSHSFISSAFVLHARLEIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHAS 559

Query: 490  IAGHVIEVTLIVWICWTLIFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTRE- 549
            I     EVT       +  FKFKGGGSKSLPQVISAIRASKLL+QGTWGIL SVVDTRE 
Sbjct: 560  IDCSRKEVTFNPPSLAS--FKFKGGGSKSLPQVISAIRASKLLNQGTWGILVSVVDTREV 619

Query: 550  -------------------------------------------AVAPYRMAPAELKELKV 609
                                                       + APYRMAPAELKELKV
Sbjct: 620  DVSLSSEPVVRDYPDVFPDELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKV 679

Query: 610  QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD 669
            QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD
Sbjct: 680  QLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD 739

Query: 670  QLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN 729
            QLQGATVFS+IDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN
Sbjct: 740  QLQGATVFSEIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN 799

Query: 730  RVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSF 789
            RVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL DNKLYAKFSKCEFWLKQVSF
Sbjct: 800  RVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLWDNKLYAKFSKCEFWLKQVSF 859

Query: 790  LGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLT 849
            LGHVVSKAGVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLT
Sbjct: 860  LGHVVSKAGVSVDPAKIEAVTGWARPSTVSEVRSFLGLAGYYRQFVENFSRIATPLTQLT 919

Query: 850  RKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVV 909
            RKGAPFVWSKACEDSFQ LKQKLVTAPVLTV DGSG+FVIYSDASKKGLGCVLMQQGKVV
Sbjct: 920  RKGAPFVWSKACEDSFQNLKQKLVTAPVLTVRDGSGSFVIYSDASKKGLGCVLMQQGKVV 979

Query: 910  AYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNM 969
            AYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNM
Sbjct: 980  AYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNM 1039

Query: 970  RQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSV 1029
            RQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSV
Sbjct: 1040 RQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSV 1099

Query: 1030 GAVTMQLAQLTVQPTLRLRIIDAQSNDPYLVEKRGLAEAGQTTEFSLSSDGGLLFERRLC 1089
            GAVTMQLAQLTVQPTLR RIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFER LC
Sbjct: 1100 GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAAEFSLSSDGGLLFERHLC 1159

Query: 1090 VPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVK 1149
            VPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVCQQVK
Sbjct: 1160 VPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREVAEFVSRCLVCQQVK 1219

Query: 1150 APRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKST 1209
            APRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTK AHFVPGKST
Sbjct: 1220 APRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKLAHFVPGKST 1279

Query: 1210 YTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDG 1269
            YTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDG
Sbjct: 1280 YTASKWAQLYMSEIVRLHGVPVSIVSNRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDG 1339

Query: 1270 QTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRSPV 1329
            QTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPV
Sbjct: 1340 QTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPV 1399

Query: 1330 CWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQNRQKSYSDVRRKDLEFEIGDKVFLKV 1389
            CWGEV                                    DVRRKDLEFE+GDKVFLKV
Sbjct: 1400 CWGEV------------------------------------DVRRKDLEFEVGDKVFLKV 1459

Query: 1390 APMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPD 1449
            APM+GVLRFERRGKLSPRFVGPFEILERIGPVAYRL LPPSLSTVHDV HVSMLRKYVPD
Sbjct: 1460 APMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLVLPPSLSTVHDVIHVSMLRKYVPD 1519

Query: 1450 PSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRNHRVEEATWEREDD 1465
            PSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWRNHRVEEATWEREDD
Sbjct: 1520 PSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDD 1550

BLAST of IVF0023520 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 122.1 bits (305), Expect = 3.7e-27
Identity = 58/125 (46.40%), Postives = 81/125 (64.80%), Query Frame = 0

Query: 707 HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPST 766
           HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P  
Sbjct: 3   HLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKN 62

Query: 767 VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPV 826
            +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F+ LK  + T PV
Sbjct: 63  TTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNS-LKWTEMAALAFKALKGAVTTLPV 122

Query: 827 LTVPD 830
           L +PD
Sbjct: 123 LALPD 126

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P0CT415.8e-14232.85Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24... [more]
P0CT345.8e-14232.85Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
P0CT355.8e-14232.85Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
P0CT365.8e-14232.85Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
P0CT375.8e-14232.85Transposon Tf2-4 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248... [more]
Match NameE-valueIdentityDescription
A0A5A7U3300.0e+0090.05Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43... [more]
A0A5A7UAA80.0e+0090.25Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold55... [more]
A0A5A7THE60.0e+0091.71Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold52... [more]
A0A5A7VJE20.0e+0090.04Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold21... [more]
A0A5A7TGX40.0e+0088.60Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold52... [more]
Match NameE-valueIdentityDescription
KAA0048687.10.090.05pol protein [Cucumis melo var. makuwa][more]
KAA0040689.10.091.71pol protein [Cucumis melo var. makuwa][more]
KAA0051357.10.090.25pol protein [Cucumis melo var. makuwa][more]
KAA0066456.10.090.04pol protein [Cucumis melo var. makuwa][more]
KAA0040695.10.088.60pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
ATMG00860.13.7e-2746.40DNA/RNA polymerases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 1292..1312
NoneNo IPR availableGENE3D1.10.340.70coord: 1006..1096
e-value: 8.0E-18
score: 66.5
NoneNo IPR availableGENE3D3.10.10.10HIV Type 1 Reverse Transcriptase, subunit A, domain 1coord: 541..666
e-value: 1.3E-90
score: 304.4
NoneNo IPR availablePFAMPF08284RVP_2coord: 397..492
e-value: 2.8E-24
score: 85.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 293..310
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 84..99
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 293..329
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 17..55
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 77..99
NoneNo IPR availablePANTHERPTHR34072ENZYMATIC POLYPROTEIN-RELATEDcoord: 541..1362
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 835..950
e-value: 2.97607E-58
score: 194.248
NoneNo IPR availableCDDcd01647RT_LTRcoord: 565..741
e-value: 7.65857E-95
score: 301.051
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 581..740
e-value: 1.5E-28
score: 99.9
IPR000477Reverse transcriptase domainPROSITEPS50878RT_POLcoord: 562..741
score: 11.437992
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 751..841
e-value: 1.9E-30
score: 106.7
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 606..741
e-value: 1.3E-90
score: 304.4
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 1106..1313
e-value: 2.4E-46
score: 159.6
IPR041588Integrase zinc-binding domainPFAMPF17921Integrase_H2C2coord: 1041..1097
e-value: 7.8E-17
score: 61.1
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 402..494
e-value: 4.3E-7
score: 31.6
IPR041373Reverse transcriptase, RNase H-like domainPFAMPF17917RT_RNaseHcoord: 832..928
e-value: 6.8E-34
score: 116.3
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 165..261
e-value: 3.6E-15
score: 56.0
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 428..439
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 366..380
score: 9.438442
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 1108..1271
score: 18.918982
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 540..934
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 1109..1267

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
IVF0023520.2IVF0023520.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0006508 proteolysis
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0003964 RNA-directed DNA polymerase activity
molecular_function GO:0008270 zinc ion binding