Tan0005870 (gene) Snake gourd v1

Overview
NameTan0005870
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
LocationLG07: 20102462 .. 20110416 (-)
RNA-Seq ExpressionTan0005870
SyntenyTan0005870
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAGATTTTGGTGGCATTAGGAACGGAAGACTCGAGCGATGAGATGGTAATATGAAAGAGGGAAAAAACGAGAGCCCCCAAATCAAATCCCTAGCCTTTTGCAATTCGCTAGCCCTAGGCCCCAAATCGCTATGCCGTCATTGGTAATATGTATTTTTCAGGTCTTACAAGCCCTAGCTGCAGCTTCCGCCATCTCTTCCTCTTCTTCCACCAGCAACTCCAATCTTCCTCCACCTCCTAAAGATGCATGAAGTCTTGTGTACCAACGACTACTTCCTCGATGGAAATCTCTCTCTCACTCCCACCTGGTACCTTCGTCTCTCTGCCTCTCCCTCTCGTACTAAACCTAATTTTATCTTCTTCTTTGGAACCATGTTTTATTTGTTCGTACTTCCTCGCTGGAAATCTCTCTCTCATTCCATTTGTGTTAATTAAGCTTCGCTTACTGTTTATTCTGTCAATCATGTCATGGCAAGAGTTAGATTTTATTGTAGGTGCCCCTTGATATTGTATATGTATGTAATTTGTTGTTCTAATTTGCGATACCTCATTTGTTCCTTTTTTTTTGTCTTATCTATTATGTAGTCTCCAATTCGAATTTCGATATCAAAAGTTGATCAAGTGGATGCAACGCGGTTAGATATTGAAATGTCGGCCATGCTGAAAGAACAACTGGTTAAGGTTTTTGCTTTGATGAAGGTATGGGTTACTTGTTATTTTGACATTAAAATGTAAATTCTTGATTGTAAAGTTATTTAGGATTCATTCAGTATTGTCTTCTACTTTGATGGATTAGCTAGGAATGTTGTTTCAATATGAAGCAGAGTTTGATGCTTTTTTGGAGTTCCTTATTTGACACTTTTCAATTTAGATAGGCAAGCCCACACCGAGAATTTCTCTCATTAATTTGCAGTATAGAGATGAGCGTGCAATAAAAATTCCACGAAAAGGTAAGGTGGCCTAAACAACTCCTACTATGTTGCTTCTTTAACTGATTGTTATGGTATGGATGGTGTAGGAATATAATTACATTTATCCAAATTTGACAAGTAAGAAAAATAATTATAATTCAATTTGAAGTTTCTAGAATTAAAATTCAAACAACCATGAAATTGCAATATTCATGATTCTTGATATGTTTTTTAGAGGGATCGCGCCCATGTGTATATGGTTGTGTGTAGTTGAGAGTTGGAACAGCTTGTTAGAACTTGGTTAAAGATCAATCTTGGAATGCATGAATGACTTTGATTATAATTTATGTTTAGACATGCATGAATGGATAGTTGGTTTAAAATAAATGAACGTAAGAGAATGGATCATTTGAGATGTTTGGTTAACTGCAAAATTTTGTATACTTGAGCTATTTGAATGAATGGTTTGATAACGTCTACTTGAGGTGTTGGGGTGTTTAATCGAGCACGAAATTCCTGAAATACTTGAATGATTGGATGAGGACAATAGAAATATCCATAAAGTATACCTTATAGGTTATCCAAAGTATACTAACTTATTGATAAGAGTATGCTGAAAAATAATATGTCAAAGATAATCCAATGGTTGTGCTTATAAGTGATTGAGTATATGGAAAAGGGAAAACTTTTTATCTTATGTTATAAGATATCATAGAGTTTTGTGCCATAGTAGATATATCAAATAGCTGCATATTAGTCTATATCGCATACATAAATTTTTTAATAATATTATACACTATTTATATCTATTAAATTATATATAATCTGATGTATTTGTGTAAAAATTTGTGATAGATGCAAGTTTCTTTGGAAATACTTTCAAGAGACGGAAAGACTAAATATGTTCAGAGTAGTAGAGTTCAAACAATCTTGCACCATGTTTTGGTACACTGATGAAACCACGATAGACAATACTAATTTCATACAATCCTAGGTAAGAATCTAAATAATTCACTTGGTTATATCTAATATTGTTTTATTTTGATAGTGATAATTGCACATATATTGGTTGGTAATTTTTTCCCCATGTGTCCCTCTCAATTTCTTATCTCGTCATTGTCATTGGTTGCAATTGGGCCATAGTTTTCAGTGACCCATTGGAATTGTGCTCATATGTTTGGTCTCGGTTTTCACATGTTCTAAGATGTTTAAGTTTAATTTCAAAAACTGTGTAAAATAGTAAATTCCCTACCTTGTAGGAGTCGAGCAAGAGCATACAGTTAGAAGCATTTCATGTTTTTAAGGTAATGGTCTCTCAAATTAACCTTATGATGAATTTGAAGTCTCATTGAATCATGTTTATACGTCAACAACTTCTCGTTGCAAATCAAAACAAACCCGCTGATATAGTTGGCATCCTTATTGAGGTTAAAAGCCCACCTTGTGAAACATGCAGACAAAGCCCACAAAGGATTGATCCACTCTCCAAGAGCCCTGTTATTGGTGAAAGGAGCTGGCCCAAAGGAATACATCTAGAATGTTCACACCAACAAATTTGTTAGGAGGATTCCTCTTTCTCCTCTAAGTTTTTGCCATCTGTCCCTTGTTTGTTACTCTCAGAATTATTCTTTTATTATTTTATTCTTTGGCTCTTCTGCCCTTATATAACCGATGTATTATTATCGAGAAATGGTGAATAGAAATTGAGGCTTCCTCACAATTTTCTCTGTGTTAATTCTCAAACTTGGTATCAAAGCTTCAATGGCCAACGCCTTATTGAATGAATCCTCGTCGTTCTCTACTGGTGCACCCCATTTTAACAGTCCACCACTCAACCAACTCTTAAATCAGGTAACTACTATCAAATTGGAAAGAGGGAATTTCCTTCTATGGAAAAATCTAGCATTACCAATCCTTCGTAGCTACAAACTCGAGAGTCATCTCCTGGGAACCAAGCCTTGCCCTCCTATGTTTCTATCTCAAGCTAGAACCGAAGGGAATGTAACAGTCGAAGGCGCCTCCTTTAGCTCAGAATCAACTACATCAATCAATCCTCTCTATGAAGCATGGATGACAGTTGATCAGCTACTGATCAGTTGGCTTTATAATTCAATGATTTCAGAAGTCGCAACACAGGTTATGGGGTGCAACACGGCCAAAGACCTGTGGGATGTTATTCAGCTCTTATTTGGAGTTCAATCACGAGCAGAGGAAGATTACCTGTGTCAAACATTTCAACAATCACGCAAAGGTAATATGAAAATGTCTGAATATTTAAGGATTATGAAGTGTCATGCTGACAATCTAGGGCAAGCTAGAAGCCCTGTTTCCACTCGATCTTTGATATCACAGGTTTTACTTGGACTTGACGAAGAATACAACCCAGTTGTGGTTGGCATTCAAGGTAAATATGAGATATCATGGGCAGATGCTCAAAACGATCTTTTTATGTTTGAAAAACGACTGGAGTTTCAGAATACTCAACGAAACAGTGTGTCCTTCAGTCACAATGCCTCAGTCAATATGGCAATAAATAGAGGAACACAATCGCAGAGGCAACAACAACAAAATTATTCTCCAAACAATTACAATCGGTCGAACAACAACAACAGTCAACGAGGAGGAACCCCAAATTCCAGAGGACGTGGTCGAGGTAGAGGATATAACTCTAACAATCGACCAATATGTCAAGTTTGTGGGAAAATAGGACATACTGCACTCGTTTGTTACAATCGTTTCAATAAGGAATATTCTCCAAGCACAAATCAGAACAGACAAAGTCAACCTAATCAACAAGGTTTTGGACCGAATACCTCCTCCTCATCATTTCAGGCTCCAAATGTTTTTGTGGCCAGTCCTAACACTCCGAGTAACCCCTTCTTGGCCACTCCAGAAACTATTGGTGACCCCTCTTGGTATGCTAATAGTGGGGCTTCACATCATGTGACAACTGACTTTGGAAACCTAGCCAATCCAATTGAATATGGAGGTATGGACTCAATCATTGTAGGTAATGGTTCACAGTCTCCTATTACCTTCACTGGCAATTCATGTTTAACTTCTGGTAAATACAATCTGCGTTTGCAAAATGTGTTATGTGCACCTAATATGGCTAAGAATTTGATTAGCATTTCGAGACTTGCTCAAGATAATGATATTTATATTGAGTTTCATGACACCTATTGTGTTGTTAAGGACAAGGGCACGGGCAAACAACTTTTGAAAGGGGATCTTAAAGAAGGTCTATACTGCCTTGCGAATACCTTAGTCAAACCAGTGGACATCTCTCAGCCAATATTGAGTAGTAATGAGTCCAAAATGTACAAAAATAATAGTGTTGCTTTTTCTGTTTGTCACAAACCCAACAAGGTGACCAAGACTTTTTGGCACAGACATCTTGGTCATCCTTCAACTAAAATTTTAGACTCTGTCATTCGTTCTTGTAATCTTCCTGTTTTGGTTAATGAAGAACACTATTTTTGTAACTCTTGTCAGTATGGTAAATCACATGCTCTACCGTTTTCGATATCAGAGTCTCGCGCATCTAAGAAATTTGAATTGGTTTATTCTGATGTATGGGGACCTGCACCTGTTTTATCTACCTCTGGTTTCCGATATTATGTGCTATTCCTTGATGATTACAGTAGATTTGTGTGGGTTTATCCCTTAAAACAAAAATCAGACACTGGTAATGCGTTTCAACACTTCTTGGCTATGGTCCAGACTCAATTCAATGGTAACATTCAGTCATTTCAGTCTGATAATGGCACAGAATTCTTGAGAGTTCATCAACTCTGTAGTCAGCTGGGAATTAAGTCACGGTATTCGTGTCCTTACACTTCTCAACAGAATGGCAGGGCTGAACGGAAGCATAGACACTTGGTTGAGACTTGCTTAACTCTGTTAGCTCAGGCTTCGATGCCTCTTGTATTTTGGTGGTGGGAGCTTCTTGGTCGCGAATCGATCGATCAATGGCCTCCCCACACCTACATTGCAAGGTCAGTCACCACGTTTTCTTCTCACAGGTAAACACTTAGACTTTGCTAATCTAAGGGTGTTTGGGTGTGCTTGCTTTCCAAATTTACGACCCTATCAGAGGCACAAGTTTGATTTTCACTCTCAGCGGTGTGTTTATCTTGGTCCCAATCCAACTCATAAAGGTTTCAGTGCAAGAACTCTGCTGGCAGGATTTTTGTTACTCACCATGTGATATTTAATGAGGTTGATTTCCCTTTTACAGATGCTACTTGGGCCACACCTTCAGCTCCTCTCACGAGTACCAGTCCTCCCTTGTCTCCTTCCCCTGCTACCTGGTTTCCCTCCTACCCTGTGCCTCTCCCCAATACCTCCTCTCAGCATACCGTCAGTATACCTGCCTCATCGACCTCACAACCCTCTCCCAACCTTCCGCATTCCTCCCCGCTACTTGCCTCTCGGTCCTCCAGCAGTCCTCTGACACCTTGCTCTGCTCCCACCTGCCCCTCTCTATCTCCTAGTCCTCATCCCTCCTTGTCTCCAAATCATCCTATCCTCTCTGTTGATTCTTCCTTTGACAGTTCTGCTCCCCCTTCATCCTCCTTCTATCCTGACATTCCTTCTTCTCCTTTACCCTTGTCAGCTCCGTCTACTATGCCCCAGCCTTCACATCCTATGGTTACTCGTGGGAAAGCTGGCATTTTTTAAGCCTAAAGCTTGGTTATCCACAAAGCCTCAGGTTGACTGGTCCCTTACCGAACCCACGCGTGTTCAGGTTGACTGGTCCCCAATGGAAGGCTGCTATGGACCAGGAATACACTGCTCTAATGCAAAATCATACCTGGGAGTTGGTCCCACCTGATCCTATATATAATATCATTGGAAACAAGTGGATCTTTCGGATCAAACGGAATGCTGATGGATCCATTCAAAGGTACAAAGCCCGCCTTGTAGCCAAGGGCTTTCATCAGAATCCTGGGGTTGACTTCTTTGAGACGTTCAGTCCGGTTGTCAAATTCTCCACCATTCGAGTTGTTCTCAACTTGGCTGTCACAAATAATTGGAGGTTGCGGCAACTCGACTTCAACAATACATTTCTTAATGGTTCTCTCACTGAGGATGTCTATATGCAACAACCACCTGGCTATGTGGATCCCGTTCATCCCTCACATATTTGCAAGCTGACCAAGGCCATTTATGGTTTGAAGCAAGCTCCGAGAGCTTGGAATTCTACCTTGAAATCTGTCTTACTTGACTGGGGATTCGTGAATTCCAAGGCTGATACATCCCTCTTCATATATAGTTCTGGTCAGTCCATTCTACTTCTTTTAGTGTATGTTGATGATGTGGTTCTCACGGGCAATGATGCTGTTTTGATGGATAATCTGGTCACCTCTCTGGATCAACGTTTTGCTCTTAAAGATTTGGGTCCGTTGAGCTATTTCTTGGGCATTCAGGTACAATATGTTGAATCTGGCATTATTTTAACTCAATCTCAGTATGCCACTGATCTTCTCCTTCGCCTTGACTGCCCTGCATTGAAACCGGCGCCCTCCCCAAGTGTTGTGGGCAAATTTCTATCTGTCAATAGTGGAACTCCTGCCTAATCCGTTCATCTATCGGAGCACTATTGGTGCGCTTGATGCCTCACGAACACACGCCGACGATCATCCTACATTGTCAACTATCTCGAGCGGTTTCTTGGTCTCCAACCGATGAACACTGACAGGCTGTGAAGCGTGTCTTGCGCTACATCTCAGGCACAAAACATTATGGTCTACTGATTCAACCCAGCTCTGATCAATCCATCCATGCCTTTTCCGATGCCGACTAGGCCTCTAATCCTGACGACAGACGTTCAGTGGTTGCTTACTGTGTTTTTATTGGTAATAGTCTCATCTCTTGGTCCTCGAAGAAGCAATCCGTCGTGGCGAGGTCAAGCACGGAGTCAGAATATCGTGCCTTGGCTCATGCTTCCACTGAGATTATATGGCTTCAACAACTTCTTGGTGAATTAGGTGTTCAATCCTCAGCTCCACCAATTATATGGTGTGACAACTTAAGTGCCAGTGCCTTGGCTGCCAATCCTGTTTTTCACGCCAGGACCAAACATATCGAAATCGACGTACACTTTGTTCGTGATCAAGTACTCCGTGGTGCACTTGAGATACGTTACGTCCCTACTTCTGATCAAGTGGCCAACTGCTTGACCAAGCCATTGTCACACTCTCAGTTCTCTATGTTTCGATCCAAACTTGGGGTCACTTCGTTACCCTCTCGTTTCCGGGGGGTATTGAGGTTAAAAGCCCACCTTGTGAGACCAGTGACTCAACATGCAGACAAAGCCAACAAAGGATTGATCCACTCTCCAAGAGCCCTGTTATTGGTGAAAGGAGCTGGCCCAAAGGAATACATCTAGAATGTTCACACCAACAAATTTGTTACGAGGATTCCTCTCTCCTCTAAGTTTTTGCCATCTGTCCCTTGTTTGTTACTCTCAGAATTATTCTTTTATTCTTTTATTCTTTGGCTCTTCTGCCCTTATATAACCAGTGTATTATCATCGAGAAATGGTGAATAGAAATTGAGGCTTCCTCAAAATTTTCTGTGTTAATTCTCAAACTATCCTCGTCACAAATCCAAGCAAGCTTCTAAGGCTGTTTACTGATTTCAAAACTGATAAAGGTACACAACATAATCTTTGTAATGCCCATTAATTTCCTTGGCTTTGTTCTCTCTTCTACAATTCTACGTTACTTTTATCACAGAGGATGAACAGTTTGAGGCTGACAAAGCTCACGTTGTTAGAGAAATTGCTGCGCTTGAATCGAAAGGTCCATGAGAAAGAGGGAGATTCTTATATGAATATCTTGTATTTTTTTTCTTCCCTTCTCTATCCAATTTGGATGTCAATTATAATAGCTAAATCAGCTCTCCAAATGATGACTTTGATCAAATGTTTTACAATATTAATAATGGGCTTTATATATATTTATATATTCGCCTTTTTCTCTGTATTCTAACGGTTTCCACGTGTGTGTATATTATTATAAGGACACTGATTGTAC

mRNA sequence

AAAAGATTTTGGTGGCATTAGGAACGGAAGACTCGAGCGATGAGATGGTAATATGAAAGAGGGAAAAAACGAGAGCCCCCAAATCAAATCCCTAGCCTTTTGCAATTCGCTAGCCCTAGGCCCCAAATCGCTATGCCGTCATTGGTAATATGTATTTTTCAGGTCTTACAAGCCCTAGCTGCAGCTTCCGCCATCTCTTCCTCTTCTTCCACCAGCAACTCCAATCTTCCTCCACCTCCTAAAGATGCATGAAGTCTTGTGTACCAACGACTACTTCCTCGATGGAAATCTCTCTCTCACTCCCACCTGTCTCCAATTCGAATTTCGATATCAAAAGTTGATCAAGTGGATGCAACGCGGTTAGATATTGAAATGTCGGCCATGCTGAAAGAACAACTGGTTAAGGTTTTTGCTTTGATGAAGATAGGCAAGCCCACACCGAGAATTTCTCTCATTAATTTGCAGTATAGAGATGAGCGTGCAATAAAAATTCCACGAAAAGATGCAAGTTTCTTTGGAAATACTTTCAAGAGACGGAAAGACTAAATATGTTCAGAGTAGTAGAGTTCAAACAATCTTGCACCATGTTTTGGTACACTGATGAAACCACGATAGACAATACTAATTTCATACAATCCTAGGAGTCGAGCAAGAGCATACAGTTAGAAGCATTTCATGTTTTTAAGACAAAGCCCACAAAGGATTGATCCACTCTCCAAGAGCCCTGTTATTGGTGAAAGGAGCTGGCCCAAAGGAATACATCTAGAATGTTCACACCAACAAATTTGTTAGGAGGATTCCTCTTTCTCCTCTAAGTTTTTGCCATCTGTCCCTTGTTTGTTACTCTCAGAATTATTCTTTTATTATTTTATTCTTTGGCTCTTCTGCCCTTATATAACCGATGTATTATTATCGAGAAATGGTGAATAGAAATTGAGGCTTCCTCACAATTTTCTCTGTGTTAATTCTCAAACTTGGTATCAAAGCTTCAATGGCCAACGCCTTATTGAATGAATCCTCGTCGTTCTCTACTGGTGCACCCCATTTTAACAGTCCACCACTCAACCAACTCTTAAATCAGGTAACTACTATCAAATTGGAAAGAGGGAATTTCCTTCTATGGAAAAATCTAGCATTACCAATCCTTCGTAGCTACAAACTCGAGAGTCATCTCCTGGGAACCAAGCCTTGCCCTCCTATGTTTCTATCTCAAGCTAGAACCGAAGGGAATGTAACAGTCGAAGGCGCCTCCTTTAGCTCAGAATCAACTACATCAATCAATCCTCTCTATGAAGCATGGATGACAGTTGATCAGCTACTGATCAGTTGGCTTTATAATTCAATGATTTCAGAAGTCGCAACACAGGTTATGGGGTGCAACACGGCCAAAGACCTGTGGGATGTTATTCAGCTCTTATTTGGAGTTCAATCACGAGCAGAGGAAGATTACCTGTGTCAAACATTTCAACAATCACGCAAAGGTAATATGAAAATGTCTGAATATTTAAGGATTATGAAGTGTCATGCTGACAATCTAGGGCAAGCTAGAAGCCCTGTTTCCACTCGATCTTTGATATCACAGGTTTTACTTGGACTTGACGAAGAATACAACCCAGTTGTGGTTGGCATTCAAGGTAAATATGAGATATCATGGGCAGATGCTCAAAACGATCTTTTTATGTTTGAAAAACGACTGGAGTTTCAGAATACTCAACGAAACAGTGTGTCCTTCAGTCACAATGCCTCAGTCAATATGGCAATAAATAGAGGAACACAATCGCAGAGGCAACAACAACAAAATTATTCTCCAAACAATTACAATCGGTCGAACAACAACAACAGTCAACGAGGAGGAACCCCAAATTCCAGAGGACGTGGTCGAGGTAGAGGATATAACTCTAACAATCGACCAATATGTCAAGTTTGTGGGAAAATAGGACATACTGCACTCGTTTGTTACAATCGTTTCAATAAGGAATATTCTCCAAGCACAAATCAGAACAGACAAAGTCAACCTAATCAACAAGGTTTTGGACCGAATACCTCCTCCTCATCATTTCAGGCTCCAAATGTTTTTGTGGCCAGTCCTAACACTCCGAGTAACCCCTTCTTGGCCACTCCAGAAACTATTGGTGACCCCTCTTGGTATGCTAATAGTGGGGCTTCACATCATGTGACAACTGACTTTGGAAACCTAGCCAATCCAATTGAATATGGAGGTATGGACTCAATCATTGTAGGTAATGGTTCACAGTCTCCTATTACCTTCACTGGCAATTCATGTTTAACTTCTGGTAAATACAATCTGCGTTTGCAAAATGTGTTATGTGCACCTAATATGGCTAAGAATTTGATTAGCATTTCGAGACTTGCTCAAGATAATGATATTTATATTGAGTTTCATGACACCTATTGTGTTGTTAAGGACAAGGGCACGGGCAAACAACTTTTGAAAGGGGATCTTAAAGAAGGTCTATACTGCCTTGCGAATACCTTAGTCAAACCAGTGGACATCTCTCAGCCAATATTGAGTAGTAATGAGTCCAAAATGTACAAAAATAATAGTGTTGCTTTTTCTGTTTGTCACAAACCCAACAAGGTGACCAAGACTTTTTGGCACAGACATCTTGGTCATCCTTCAACTAAAATTTTAGACTCTGTCATTCGTTCTTGTAATCTTCCTGTTTTGGTTAATGAAGAACACTATTTTTGTAACTCTTGTCAGTATGGTAAATCACATGCTCTACCGTTTTCGATATCAGAGTCTCGCGCATCTAAGAAATTTGAATTGGTTTATTCTGATGTATGGGGACCTGCACCTGTTTTATCTACCTCTGGTTTCCGATATTATGTGCTATTCCTTGATGATTACAGTAGATTTGTGTGGGTTTATCCCTTAAAACAAAAATCAGACACTGGTAATGCGTTTCAACACTTCTTGGCTATGGTCCAGACTCAATTCAATGGTAACATTCAGTCATTTCAGTCTGATAATGGCACAGAATTCTTGAGAGTTCATCAACTCTGTAGTCAGCTGGGAATTAAGTCACGGTATTCGTGTCCTTACACTTCTCAACAGAATGGCAGGGCTGAACGGAAGCATAGACACTTGGTTGAGACTTGCTTAACTCTGTTAGCTCAGGCTTCGATGCCTCTTGTATTTTGGTGGTGGGAGCTTCTTGGTCGCGAATCGATCGATCAATGGCCTCCCCACACCTACATTGCAAGGTCAGTCACCACGTTTTCTTCTCACAGGTAAACACTTAGACTTTGCTAATCTAAGGGTGTTTGGGTGTGCTTGCTTTCCAAATTTACGACCCTATCAGAGGCACAAGTTTGATTTTCACTCTCAGCGGTGTGTTTATCTTGGTCCCAATCCAACTCATAAAGGTTTCAGTGCAAGAACTCTGCTGGCAGGATTTTTGTTACTCACCATGTGATATTTAATGAGGTTGATTTCCCTTTTACAGATGCTACTTGGGCCACACCTTCAGCTCCTCTCACGAGTACCAGTCCTCCCTTGTCTCCTTCCCCTGCTACCTGGTTTCCCTCCTACCCTGTGCCTCTCCCCAATACCTCCTCTCAGCATACCGTCAGTATACCTGCCTCATCGACCTCACAACCCTCTCCCAACCTTCCGCATTCCTCCCCGCTACTTGCCTCTCGGTCCTCCAGCAGTCCTCTGACACCTTGCTCTGCTCCCACCTGCCCCTCTCTATCTCCTAGTCCTCATCCCTCCTTGTCTCCAAATCATCCTATCCTCTCTGTTGATTCTTCCTTTGACAGTTCTGCTCCCCCTTCATCCTCCTTCTATCCTGACATTCCTTCTTCTCCTTTACCCTTGTCAGCTCCGTCTACTATGCCCCAGCCTTCACATCCTATGGTTACTCGTGGGAAAGCTGGCATTTTTTAAGCCTAAAGCTTGGTTATCCACAAAGCCTCAGGTTGACTGGTCCCTTACCGAACCCACGCGTGTTCAGGTTGACTGGTCCCCAATGGAAGGCTGCTATGGACCAGGAATACACTGCTCTAATGCAAAATCATACCTGGGAGTTGGTCCCACCTGATCCTATATATAATATCATTGGAAACAAGTGGATCTTTCGGATCAAACGGAATGCTGATGGATCCATTCAAAGGTACAAAGCCCGCCTTGTAGCCAAGGGCTTTCATCAGAATCCTGGGGTTGACTTCTTTGAGACGTTCAGTCCGGTTGTCAAATTCTCCACCATTCGAGTTGTTCTCAACTTGGCTGTCACAAATAATTGGAGGTTGCGGCAACTCGACTTCAACAATACATTTCTTAATGGTTCTCTCACTGAGGATGTCTATATGCAACAACCACCTGGCTATGTGGATCCCGTTCATCCCTCACATATTTGCAAGCTGACCAAGGCCATTTATGGTTTGAAGCAAGCTCCGAGAGCTTGGAATTCTACCTTGAAATCTGTCTTACTTGACTGGGGATTCGTGAATTCCAAGGCTGATACATCCCTCTTCATATATAGTTCTGGTCAGTCCATTCTACTTCTTTTAGTGTATGTTGATGATGTGGTTCTCACGGGCAATGATGCTGTTTTGATGGATAATCTGGTCACCTCTCTGGATCAACGTTTTGCTCTTAAAGATTTGGGTCCGTTGAGCTATTTCTTGGGCATTCAGGTACAATATGTTGAATCTGGCATTATTTTAACTCAATCTCAGTATGCCACTGATCTTCTCCTTCGCCTTGACTGCCCTGCATTGAAACCGGCGCCCTCCCCAAGTGTTGTGGGCAAATTTCTATCTGTCAATAGTGGAACTCCTGCCTAATCCGTTCATCTATCGGAGCACTATTGGTGCGCTTGATGCCTCACGAACACACGCCGACGATCATCCTACATTGTCAACTATCTCGAGCGGTTTCTTGGTCTCCAACCGATGAACACTGACAGGCTGTGAAGCGTGTCTTGCGCTACATCTCAGGCACAAAACATTATGGTCTACTGATTCAACCCAGCTCTGATCAATCCATCCATGCCTTTTCCGATGCCGACTAGGCCTCTAATCCTGACGACAGACGTTCAGTGGTTGCTTACTGTGTTTTTATTGGTAATAGTCTCATCTCTTGGTCCTCGAAGAAGCAATCCGTCGTGGCGAGGTCAAGCACGGAGTCAGAATATCGTGCCTTGGCTCATGCTTCCACTGAGATTATATGGCTTCAACAACTTCTTGGTGAATTAGGTGTTCAATCCTCAGCTCCACCAATTATATGGTGTGACAACTTAAGTGCCAGTGCCTTGGCTGCCAATCCTGTTTTTCACGCCAGGACCAAACATATCGAAATCGACGTACACTTTGTTCGTGATCAAGTACTCCGTGGTGCACTTGAGATACGTTACGTCCCTACTTCTGATCAAGTGGCCAACTGCTTGACCAAGCCATTGTCACACTCTCAGTTCTCTATGTTTCGATCCAAACTTGGGGTCACTTCGTTACCCTCTCGTTTCCGGGGGGTATTGAGGTTAAAAGCCCACCTTGTGAGACCAGTGACTCAACATGCAGACAAAGCCAACAAAGGATTGATCCACTCTCCAAGAGCCCTGTTATTGGTGAAAGGAGCTGGCCCAAAGGAATACATCTAGAATGTTCACACCAACAAATTTGTTACGAGGATTCCTCTCTCCTCTAAGTTTTTGCCATCTGTCCCTTGTTTGTTACTCTCAGAATTATTCTTTTATTCTTTTATTCTTTGGCTCTTCTGCCCTTATATAACCAGTGTATTATCATCGAGAAATGGTGAATAGAAATTGAGGCTTCCTCAAAATTTTCTGTGTTAATTCTCAAACTATCCTCGTCACAAATCCAAGCAAGCTTCTAAGGCTGTTTACTGATTTCAAAACTGATAAAGAGGATGAACAGTTTGAGGCTGACAAAGCTCACGTTGTTAGAGAAATTGCTGCGCTTGAATCGAAAGGTCCATGAGAAAGAGGGAGATTCTTATATGAATATCTTGTATTTTTTTTCTTCCCTTCTCTATCCAATTTGGATGTCAATTATAATAGCTAAATCAGCTCTCCAAATGATGACTTTGATCAAATGTTTTACAATATTAATAATGGGCTTTATATATATTTATATATTCGCCTTTTTCTCTGTATTCTAACGGTTTCCACGTGTGTGTATATTATTATAAGGACACTGATTGTAC

Coding sequence (CDS)

ATGGCCAACGCCTTATTGAATGAATCCTCGTCGTTCTCTACTGGTGCACCCCATTTTAACAGTCCACCACTCAACCAACTCTTAAATCAGGTAACTACTATCAAATTGGAAAGAGGGAATTTCCTTCTATGGAAAAATCTAGCATTACCAATCCTTCGTAGCTACAAACTCGAGAGTCATCTCCTGGGAACCAAGCCTTGCCCTCCTATGTTTCTATCTCAAGCTAGAACCGAAGGGAATGTAACAGTCGAAGGCGCCTCCTTTAGCTCAGAATCAACTACATCAATCAATCCTCTCTATGAAGCATGGATGACAGTTGATCAGCTACTGATCAGTTGGCTTTATAATTCAATGATTTCAGAAGTCGCAACACAGGTTATGGGGTGCAACACGGCCAAAGACCTGTGGGATGTTATTCAGCTCTTATTTGGAGTTCAATCACGAGCAGAGGAAGATTACCTGTGTCAAACATTTCAACAATCACGCAAAGGTAATATGAAAATGTCTGAATATTTAAGGATTATGAAGTGTCATGCTGACAATCTAGGGCAAGCTAGAAGCCCTGTTTCCACTCGATCTTTGATATCACAGGTTTTACTTGGACTTGACGAAGAATACAACCCAGTTGTGGTTGGCATTCAAGGTAAATATGAGATATCATGGGCAGATGCTCAAAACGATCTTTTTATGTTTGAAAAACGACTGGAGTTTCAGAATACTCAACGAAACAGTGTGTCCTTCAGTCACAATGCCTCAGTCAATATGGCAATAAATAGAGGAACACAATCGCAGAGGCAACAACAACAAAATTATTCTCCAAACAATTACAATCGGTCGAACAACAACAACAGTCAACGAGGAGGAACCCCAAATTCCAGAGGACGTGGTCGAGGTAGAGGATATAACTCTAACAATCGACCAATATGTCAAGTTTGTGGGAAAATAGGACATACTGCACTCGTTTGTTACAATCGTTTCAATAAGGAATATTCTCCAAGCACAAATCAGAACAGACAAAGTCAACCTAATCAACAAGGTTTTGGACCGAATACCTCCTCCTCATCATTTCAGGCTCCAAATGTTTTTGTGGCCAGTCCTAACACTCCGAGTAACCCCTTCTTGGCCACTCCAGAAACTATTGGTGACCCCTCTTGGTATGCTAATAGTGGGGCTTCACATCATGTGACAACTGACTTTGGAAACCTAGCCAATCCAATTGAATATGGAGGTATGGACTCAATCATTGTAGGTAATGGTTCACAGTCTCCTATTACCTTCACTGGCAATTCATGTTTAACTTCTGGTAAATACAATCTGCGTTTGCAAAATGTGTTATGTGCACCTAATATGGCTAAGAATTTGATTAGCATTTCGAGACTTGCTCAAGATAATGATATTTATATTGAGTTTCATGACACCTATTGTGTTGTTAAGGACAAGGGCACGGGCAAACAACTTTTGAAAGGGGATCTTAAAGAAGGTCTATACTGCCTTGCGAATACCTTAGTCAAACCAGTGGACATCTCTCAGCCAATATTGAGTAGTAATGAGTCCAAAATGTACAAAAATAATAGTGTTGCTTTTTCTGTTTGTCACAAACCCAACAAGGTGACCAAGACTTTTTGGCACAGACATCTTGGTCATCCTTCAACTAAAATTTTAGACTCTGTCATTCGTTCTTGTAATCTTCCTGTTTTGGTTAATGAAGAACACTATTTTTGTAACTCTTGTCAGTATGGTAAATCACATGCTCTACCGTTTTCGATATCAGAGTCTCGCGCATCTAAGAAATTTGAATTGGTTTATTCTGATGTATGGGGACCTGCACCTGTTTTATCTACCTCTGGTTTCCGATATTATGTGCTATTCCTTGATGATTACAGTAGATTTGTGTGGGTTTATCCCTTAAAACAAAAATCAGACACTGGTAATGCGTTTCAACACTTCTTGGCTATGGTCCAGACTCAATTCAATGGTAACATTCAGTCATTTCAGTCTGATAATGGCACAGAATTCTTGAGAGTTCATCAACTCTGTAGTCAGCTGGGAATTAAGTCACGGTATTCGTGTCCTTACACTTCTCAACAGAATGGCAGGGCTGAACGGAAGCATAGACACTTGGTTGAGACTTGCTTAACTCTGTTAGCTCAGGCTTCGATGCCTCTTGTATTTTGGTGGTGGGAGCTTCTTGGTCGCGAATCGATCGATCAATGGCCTCCCCACACCTACATTGCAAGGTCAGTCACCACGTTTTCTTCTCACAGGTAA

Protein sequence

MANALLNESSSFSTGAPHFNSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWWWELLGRESIDQWPPHTYIARSVTTFSSHR
Homology
BLAST of Tan0005870 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 317.8 bits (813), Expect = 3.4e-85
Identity = 233/716 (32.54%), Postives = 345/716 (48.18%), Query Frame = 0

Query: 20  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEG 79
           N+  LN  ++ VT  KL   N+L+W      +   Y+L   L G+   PP          
Sbjct: 12  NTSILNVNMSNVT--KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPP---------- 71

Query: 80  NVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVI 139
                 A+  +++   +NP Y  W   D+L+ S +  ++   V   V    TA  +W+ +
Sbjct: 72  ------ATIGTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETL 131

Query: 140 QLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVL 199
           + ++   S      L    +Q  KG   + +Y++ +    D L     P+     + +VL
Sbjct: 132 RKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVL 191

Query: 200 LGLDEEYNPVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSH--NASVNMAI 259
             L EEY PV+  I  K      D    L    +RL    ++  +VS +     + N   
Sbjct: 192 ENLPEEYKPVIDQIAAK------DTPPTLTEIHERLLNHESKILAVSSATVIPITANAVS 251

Query: 260 NRGTQSQRQQQQNYSPNNY-NRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPI---CQVCG 319
           +R T +          N Y NR+NNNNS+    P  +        N+ ++P    CQ+CG
Sbjct: 252 HRNTTTTNNNNNGNRNNRYDNRNNNNNSK----PWQQSSTNFHPNNNQSKPYLGKCQICG 311

Query: 320 KIGHTALVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNP- 379
             GH+A  C          S  Q+  S  N Q           Q P     SP TP  P 
Sbjct: 312 VQGHSAKRC----------SQLQHFLSSVNSQ-----------QPP-----SPFTPWQPR 371

Query: 380 -FLATPETIGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSC 439
             LA        +W  +SGA+HH+T+DF NL+    Y G D ++V +GS  PI+ TG++ 
Sbjct: 372 ANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTS 431

Query: 440 LTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDL 499
           L++    L L N+L  PN+ KNLIS+ RL   N + +EF      VKD  TG  LL+G  
Sbjct: 432 LSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKT 491

Query: 500 KEGLYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPST 559
           K+ LY              PI SS    ++ + S         +K T + WH  LGHP+ 
Sbjct: 492 KDELY------------EWPIASSQPVSLFASPS---------SKATHSSWHARLGHPAP 551

Query: 560 KILDSVIRSCNLPVLVNEEHYF--CNSCQYGKSHALPFSISESRASKKFELVYSDVWGPA 619
            IL+SVI + +L VL N  H F  C+ C   KS+ +PFS S   +++  E +YSDVW  +
Sbjct: 552 SILNSVISNYSLSVL-NPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-S 611

Query: 620 PVLSTSGFRYYVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNG 679
           P+LS   +RYYV+F+D ++R+ W+YPLKQKS     F  F  +++ +F   I +F SDNG
Sbjct: 612 PILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNG 650

Query: 680 TEFLRVHQLCSQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFW 726
            EF+ + +  SQ GI    S P+T + NG +ERKHRH+VET LTLL+ AS+P  +W
Sbjct: 672 GEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYW 650

BLAST of Tan0005870 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 312.4 bits (799), Expect = 1.4e-83
Identity = 227/705 (32.20%), Postives = 333/705 (47.23%), Query Frame = 0

Query: 28  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGAS 87
           +N     KL   N+L+W      +   Y+L   L G+ P PP                A+
Sbjct: 18  VNMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPP----------------AT 77

Query: 88  FSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQS 147
             +++   +NP Y  W   D+L+ S +  ++   V   V    TA  +W+ ++ ++    
Sbjct: 78  IGTDAVPRVNPDYTRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIY---- 137

Query: 148 RAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYN 207
            A   Y          G++    ++       D L     P+     + +VL  L ++Y 
Sbjct: 138 -ANPSY----------GHVTQLRFI----TRFDQLALLGKPMDHDEQVERVLENLPDDYK 197

Query: 208 PVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASV--NMAINRGTQSQR 267
           PV+  I  K      D    L    +RL  + ++  +++ +    +  N+  +R T + R
Sbjct: 198 PVIDQIAAK------DTPPSLTEIHERLINRESKLLALNSAEVVPITANVVTHRNTNTNR 257

Query: 268 QQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPI---CQVCGKIGHTALVC 327
            Q       NYN  NNNN      P+S G    R  N   +P    CQ+C   GH+A  C
Sbjct: 258 NQNNRGDNRNYN--NNNNRSNSWQPSSSG---SRSDNRQPKPYLGRCQICSVQGHSAKRC 317

Query: 328 YNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGD 387
                    P  +Q + +   QQ   P T        N+ V SP   +N           
Sbjct: 318 ---------PQLHQFQSTTNQQQSTSPFTPWQ--PRANLAVNSPYNANN----------- 377

Query: 388 PSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQ 447
             W  +SGA+HH+T+DF NL+    Y G D +++ +GS  PIT TG++ L +   +L L 
Sbjct: 378 --WLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLN 437

Query: 448 NVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTL 507
            VL  PN+ KNLIS+ RL   N + +EF      VKD  TG  LL+G  K+ LY      
Sbjct: 438 KVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELY------ 497

Query: 508 VKPVDISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCN 567
                   PI SS    M+       S C   +K T + WH  LGHPS  IL+SVI + +
Sbjct: 498 ------EWPIASSQAVSMFA------SPC---SKATHSSWHSRLGHPSLAILNSVISNHS 557

Query: 568 LPVLVNEEHYF--CNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYY 627
           LPVL N  H    C+ C   KSH +PFS S   +SK  E +YSDVW  +P+LS   +RYY
Sbjct: 558 LPVL-NPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRYY 617

Query: 628 VLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCS 687
           V+F+D ++R+ W+YPLKQKS   + F  F ++V+ +F   I +  SDNG EF+ +    S
Sbjct: 618 VIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLS 629

Query: 688 QLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFW 726
           Q GI    S P+T + NG +ERKHRH+VE  LTLL+ AS+P  +W
Sbjct: 678 QHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYW 629

BLAST of Tan0005870 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 146.0 bits (367), Expect = 1.8e-33
Identity = 128/459 (27.89%), Postives = 197/459 (42.92%), Query Frame = 0

Query: 275 NYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSPST 334
           +Y RS+NN  + G    S+ R + R  N      C  C + GH    C N    +   S 
Sbjct: 204 SYQRSSNNYGRSGARGKSKNRSKSRVRN------CYNCNQPGHFKRDCPNPRKGKGETSG 263

Query: 335 NQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGASHH 394
            +N            NT++      NV +          L+ PE+     W  ++ ASHH
Sbjct: 264 QKN----------DDNTAAMVQNNDNVVLFINEEEECMHLSGPES----EWVVDTAASHH 323

Query: 395 VTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCL-TSGKYNLRLQNVLCAPNMAKN 454
             T   +L      G   ++ +GN S S I   G+ C+ T+    L L++V   P++  N
Sbjct: 324 -ATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMN 383

Query: 455 LISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPIL 514
           L  IS +A D D Y  +                 K  L +G   +A  + +         
Sbjct: 384 L--ISGIALDRDGYESYFANQ-------------KWRLTKGSLVIAKGVAR--------- 443

Query: 515 SSNESKMYKNNSVAFSVCH-----KPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVN 574
                 +Y+ N+    +C        ++++   WH+ +GH S K L  + +   +     
Sbjct: 444 ----GTLYRTNA---EICQGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKG 503

Query: 575 EEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYS 634
                C+ C +GK H + F  S  R     +LVYSDV GP  + S  G +Y+V F+DD S
Sbjct: 504 TTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDAS 563

Query: 635 RFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFL--RVHQLCSQLGIKS 694
           R +WVY LK K      FQ F A+V+ +    ++  +SDNG E+      + CS  GI+ 
Sbjct: 564 RKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRH 610

Query: 695 RYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFW 726
             + P T Q NG AER +R +VE   ++L  A +P  FW
Sbjct: 624 EKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFW 610

BLAST of Tan0005870 vs. ExPASy Swiss-Prot
Match: Q03494 (Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-DR2 PE=3 SV=2)

HSP 1 Score: 109.4 bits (272), Expect = 1.8e-22
Identity = 89/330 (26.97%), Postives = 146/330 (44.24%), Query Frame = 0

Query: 413 SIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHD 472
           +I+       PI   GN               L  PN+A +L+S+S LA  N       +
Sbjct: 481 NIVDAQKQDIPINAIGNLHFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN 540

Query: 473 TYCVVKDKGTGKQLLKGDLKEG-LYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVC 532
           T  + +  GT   +L   +K G  Y L+   + P  IS+  +++    + K+ SV     
Sbjct: 541 T--LERSDGT---VLAPIVKHGDFYWLSKKYLIPSHISKLTINN----VNKSKSV----- 600

Query: 533 HKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVL-------VNEEHYFCNSCQYGKS-- 592
              NK      HR LGH + + +   ++   +  L        N   Y C  C  GKS  
Sbjct: 601 ---NKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTK 660

Query: 593 --HALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKS 652
             H     +    + + F+ +++D++GP   L  S   Y++ F D+ +RF WVYPL  + 
Sbjct: 661 HRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRR 720

Query: 653 DTG--NAFQHFLAMVQTQFNGNIQSFQSDNGTEFLR--VHQLCSQLGIKSRYSCPYTSQQ 712
           +    N F   LA ++ QFN  +   Q D G+E+    +H+  +  GI + Y+    S+ 
Sbjct: 721 EESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRA 780

Query: 713 NGRAERKHRHLVETCLTLLAQASMPLVFWW 727
           +G AER +R L+  C TLL  + +P   W+
Sbjct: 781 HGVAERLNRTLLNDCRTLLHCSGLPNHLWF 793

BLAST of Tan0005870 vs. ExPASy Swiss-Prot
Match: Q12337 (Transposon Ty2-GR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-GR1 PE=5 SV=2)

HSP 1 Score: 109.4 bits (272), Expect = 1.8e-22
Identity = 89/330 (26.97%), Postives = 146/330 (44.24%), Query Frame = 0

Query: 413 SIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAKNLISISRLAQDNDIYIEFHD 472
           +I+       PI   GN               L  PN+A +L+S+S LA  N       +
Sbjct: 481 NIVDAQKQDIPINAIGNLHFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN 540

Query: 473 TYCVVKDKGTGKQLLKGDLKEG-LYCLANTLVKPVDISQPILSSNESKMYKNNSVAFSVC 532
           T  + +  GT   +L   +K G  Y L+   + P  IS+  +++    + K+ SV     
Sbjct: 541 T--LERSDGT---VLAPIVKHGDFYWLSKKYLIPSHISKLTINN----VNKSKSV----- 600

Query: 533 HKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVL-------VNEEHYFCNSCQYGKS-- 592
              NK      HR LGH + + +   ++   +  L        N   Y C  C  GKS  
Sbjct: 601 ---NKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTK 660

Query: 593 --HALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVWVYPLKQKS 652
             H     +    + + F+ +++D++GP   L  S   Y++ F D+ +RF WVYPL  + 
Sbjct: 661 HRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRR 720

Query: 653 DTG--NAFQHFLAMVQTQFNGNIQSFQSDNGTEFLR--VHQLCSQLGIKSRYSCPYTSQQ 712
           +    N F   LA ++ QFN  +   Q D G+E+    +H+  +  GI + Y+    S+ 
Sbjct: 721 EESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRA 780

Query: 713 NGRAERKHRHLVETCLTLLAQASMPLVFWW 727
           +G AER +R L+  C TLL  + +P   W+
Sbjct: 781 HGVAERLNRTLLNDCRTLLHCSGLPNHLWF 793

BLAST of Tan0005870 vs. NCBI nr
Match: PNX76291.1 (gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense])

HSP 1 Score: 517.7 bits (1332), Expect = 1.7e-142
Identity = 303/707 (42.86%), Postives = 410/707 (57.99%), Query Frame = 0

Query: 20  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEG 79
           NS   N L + V ++KL+R N+ LW+++ LPI+R  +L+ ++LG K CP  F++ A    
Sbjct: 6   NSNHKNDLPSTV-SVKLDRDNYPLWQSMVLPIIRGARLDGYMLGKKKCPEEFITAA---- 65

Query: 80  NVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVI 139
                      +S+   NP +E W   DQ L+ WL NSM   +ATQ++ C T+  LWD  
Sbjct: 66  -----------DSSKKFNPEFEDWQAYDQQLLGWLRNSMTVGIATQLLHCETSMQLWDEA 125

Query: 140 QLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVL 199
           Q L G  +R++  YL   F  +RKG MKM +YL  MK  AD L  A +P+ST  LI Q L
Sbjct: 126 QSLAGAHTRSQITYLKSEFHSTRKGEMKMEDYLIKMKNLADKLKLAGNPISTSDLIIQTL 185

Query: 200 LGLDEEYNPVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINR 259
            GLD EYNPVVV +  +  +SW D Q  L  FE R+E    Q NS++   N ++N   N 
Sbjct: 186 NGLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFENRIE----QLNSLT---NLTLNATANV 245

Query: 260 GTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTA 319
             +S  +  +  S NN+  SNNN   RG   N RG   GRG   + +  CQVCG   H A
Sbjct: 246 AKKSDHRGNRFNSNNNWRGSNNN--WRGS--NFRGWRGGRGRGRSFKTTCQVCGLDNHIA 305

Query: 320 LVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPET 379
           + C+ RF+K YS S   N  +  ++QG                        N FLA+  +
Sbjct: 306 IDCFYRFDKTYSRS---NHSANNDKQG----------------------SHNAFLASQNS 365

Query: 380 IGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNL 439
           I D  WY +SGAS+HVT       N  E+ G +S+IVGNG +  I  TG+S L S    L
Sbjct: 366 IEDYDWYFDSGASNHVTHQTDKFQNLSEHHGKNSLIVGNGEKLEIVATGSSKLKS----L 425

Query: 440 RLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLA 499
            L ++L  P + KNL+S+S+LA DN+I +EF +  C VKDK TGK +L+G LK+GLY L+
Sbjct: 426 NLHDILYVPKITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKAILRGILKDGLYQLS 485

Query: 500 NTLVKPVDISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIR 559
                                 K++S   S+        K  WHR LGHP+ K+LD V++
Sbjct: 486 E---------------------KDSSAYVSI--------KESWHRKLGHPNNKVLDIVLK 545

Query: 560 SCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRY 619
           SCN+ +  +++  FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++S+SGF+Y
Sbjct: 546 SCNVKLSPSDQFSFCEACQYGKMHFLPFKTSFSHAKEILELVHTDVWGPAPIISSSGFKY 605

Query: 620 YVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLC 679
           YV F+DD++RF W+YPLKQKSDT +AF  F  MV+ QF+  I++ Q D G E+  V +  
Sbjct: 606 YVHFIDDFTRFTWIYPLKQKSDTAHAFIQFKNMVENQFSKKIKTIQCDGGGEYKPVQKHA 627

Query: 680 SQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
            + GI+ R SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Sbjct: 666 IEAGIQFRMSCPYTSQQNGRAERKHRHIAEFGLTLLAQAKMPLNYWW 627

BLAST of Tan0005870 vs. NCBI nr
Match: GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])

HSP 1 Score: 516.5 bits (1329), Expect = 3.8e-142
Identity = 298/694 (42.94%), Postives = 398/694 (57.35%), Query Frame = 0

Query: 33  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSES 92
           ++KL+R N+ LWK+L LP++R  KL+ ++LGT+ CP  F++               SS+S
Sbjct: 18  SVKLDRNNYPLWKSLVLPVIRGCKLDGYMLGTEGCPEEFIT---------------SSDS 77

Query: 93  TTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQSRAEED 152
           + + N  +  W   DQ L+ W+ NSM +E+ATQ++ C T+K LWD  Q L G  +R++  
Sbjct: 78  SKNKNSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQSLAGAHTRSQII 137

Query: 153 YLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVG 212
           YL   F   RKG MKM +YL  MK   D L  A +PVST  LI Q L GLD EYNPVVV 
Sbjct: 138 YLKSEFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLNGLDSEYNPVVVK 197

Query: 213 IQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYS 272
           +  +  +SW D Q  L  FE R+E  N   N  + + NA+ N+A        R   +  S
Sbjct: 198 LSDQTTLSWVDLQAQLLTFESRIEQLN---NLTNLTLNATANVA-------NRSDHRGKS 257

Query: 273 PNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP 332
            NN  R +N+   RG        GRGRG +  N   CQVCG   H A+ C++RF+K YS 
Sbjct: 258 SNNNWRGSNSRGWRG--------GRGRGKSGKNP--CQVCGLSNHIAIDCFHRFDKTYSR 317

Query: 333 STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGAS 392
           S   N  +  ++QG                        N FLA+  ++ D  WY +SGAS
Sbjct: 318 S---NHSAGHDKQG----------------------SHNAFLASQNSVEDYDWYFDSGAS 377

Query: 393 HHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAK 452
           +HVT       +  E+ G +S++VGNG +  I  TG+S L S    L L ++L  PN+ K
Sbjct: 378 NHVTHQTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKLKS----LNLHDILYVPNITK 437

Query: 453 NLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPI 512
           NL+S+S+LA DN+I +EF +  C VKDK TGK +LKG LK+GLY L+ T           
Sbjct: 438 NLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLSGT----------- 497

Query: 513 LSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHY 572
                    K N  AF          K  WHR LGHP+ K+LD V+ SC + V  ++   
Sbjct: 498 ---------KRNPSAF-------VSVKESWHRRLGHPNNKVLDKVLESCKVKVPPSDNFS 557

Query: 573 FCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW 632
           FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++++SGF+YYV F+DD+SRF W
Sbjct: 558 FCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTW 617

Query: 633 VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPY 692
           +YPLKQKS+T  AF  F  + + QFN  I+  Q D G E+  V +L  + GI+ R SCPY
Sbjct: 618 IYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPY 620

Query: 693 TSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
           TSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Sbjct: 678 TSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWW 620

BLAST of Tan0005870 vs. NCBI nr
Match: PNX94503.1 (putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense])

HSP 1 Score: 503.8 bits (1296), Expect = 2.5e-138
Identity = 290/699 (41.49%), Postives = 394/699 (56.37%), Query Frame = 0

Query: 28  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGAS 87
           L    ++KL+R NF LWK+L LP++R  K + ++LGTK CP  F++              
Sbjct: 12  LPSTVSVKLDRDNFPLWKSLVLPLIRGCKYDGYMLGTKKCPDQFVT-------------- 71

Query: 88  FSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQS 147
            S ++T  INP Y+ W   DQ L+ WL NSM  ++ATQV+ C T+K LWD  Q L G  +
Sbjct: 72  -SIDNTEKINPDYQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLWDEAQSLAGAHT 131

Query: 148 RAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYN 207
           R+   YL   F  + K  MKM +YL  MK  AD L  A SP+S+  L+ Q L GLD EYN
Sbjct: 132 RSRIIYLKSEFHNTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMIQTLNGLDSEYN 191

Query: 208 PVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQ 267
           PVVV +  +  ISW D Q  L  FE RL+  N   N    + NAS N A    +   +  
Sbjct: 192 PVVVKLSDQTNISWVDFQAQLLAFESRLDQLNNFNN---INLNASANFASKNESGGNK-- 251

Query: 268 QQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN 327
              +      R +N+   RG      GRGR R  +   RPICQ+CGK GHTA  CY RF+
Sbjct: 252 ---FGSRGGWRGSNSRGMRG------GRGRAR-MSKPPRPICQICGKFGHTAAQCYYRFD 311

Query: 328 KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYA 387
           K Y         ++ N    G  + S+                  F+A+P    D  WY 
Sbjct: 312 KSY---------TEKNHYAEGEGSHSA------------------FVASPYHGQDYEWYF 371

Query: 388 NSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCA 447
           +SGAS+HVT   G L +  E  G +S++VGNG +  I  +G++ L     ++ L+NVL  
Sbjct: 372 DSGASNHVTHQSGQLQDLNENNGKNSLLVGNGEKLKILASGSTKLN----DVNLRNVLYV 431

Query: 448 PNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVD 507
           P + KNL+S+S+L  DN+  +EF + YC VKDK TGK LLKG LK+GLY L+        
Sbjct: 432 PEITKNLLSVSKLTIDNNALVEFDENYCYVKDKLTGKALLKGRLKDGLYQLS-------- 491

Query: 508 ISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLV 567
                 ++ E    K+     S+        K  WHR LGHP+ K+L+ V++  N+ +  
Sbjct: 492 ------ANKEPPTNKDPCAYISL--------KEIWHRKLGHPNNKVLEKVLKDNNVKISP 551

Query: 568 NEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY 627
           +++  FC +CQ+GK H LPF  S S A +  +L+++DVWGPAP+LS S F+YYV FLDD+
Sbjct: 552 SDKFTFCEACQFGKLHLLPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNFKYYVHFLDDF 611

Query: 628 SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSR 687
           SRF W++PLKQKS+T +AF  F  +V+ QFN  I+  + D G E+  V +     GI+ +
Sbjct: 612 SRFTWIFPLKQKSETIHAFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQKCAIDSGIQFQ 627

Query: 688 YSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
            SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Sbjct: 672 MSCPYTSQQNGRAERKHRHVTELGLTLLAQAKMPLSYWW 627

BLAST of Tan0005870 vs. NCBI nr
Match: PNY01489.1 (copia-like polyprotein, partial [Trifolium pratense])

HSP 1 Score: 498.0 bits (1281), Expect = 1.4e-136
Identity = 292/699 (41.77%), Postives = 405/699 (57.94%), Query Frame = 0

Query: 28  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGAS 87
           L  + ++KL+R N+ LWK+L LP++R  K + ++LGTK CP  F++              
Sbjct: 13  LPSIISVKLDRDNYPLWKSLVLPLIRGCKFDGYILGTKECPEQFVT-------------- 72

Query: 88  FSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQS 147
            S++ +  +NP ++ WM  DQ L+ WL NSM  ++ATQ++ C T+K LWD  Q L G  +
Sbjct: 73  -SADKSKKVNPDFQDWMADDQALLGWLMNSMAIDIATQLLHCETSKQLWDEAQSLAGAHT 132

Query: 148 RAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYN 207
           ++   YL   F  +RKG MKM EYL  MK  +D L  + SP+S   L+ Q L GLD EYN
Sbjct: 133 KSRIIYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKLSGSPISNSDLMIQTLNGLDAEYN 192

Query: 208 PVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQ 267
           PVVV +  +  +SW D Q  L  FE RL+  N   N    + NAS N A     +++ + 
Sbjct: 193 PVVVKLSDQINLSWVDVQAQLLAFESRLDQLN---NFSGLTLNASANFA----NKTEFRG 252

Query: 268 QQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN 327
            + +S  N+ RS N    RG        GRG+G  SN +  CQVC   GHTA+ C  RF+
Sbjct: 253 NKFHSRGNWRRS-NFRGMRG--------GRGKGRMSNTK--CQVCSGTGHTAVDCSYRFD 312

Query: 328 KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYA 387
           + Y   T +N  ++ ++QG           + + FVASP               D  WY 
Sbjct: 313 RSY---TGRNYSTEADKQG-----------SHSAFVASPYHGQ-----------DYEWYF 372

Query: 388 NSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCA 447
           +SGAS+HVT          E+ G +S++VGNG +  I  +G++ L +    L L +VL  
Sbjct: 373 DSGASNHVTHQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLNT----LNLHDVLYV 432

Query: 448 PNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVD 507
           P + KNL+S+S+L  DN+I++EF    C VKDK TG+ LLKG LK+GLY L+       D
Sbjct: 433 PQITKNLLSVSKLTADNNIFVEFDANCCSVKDKLTGQTLLKGRLKDGLYQLS-------D 492

Query: 508 ISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLV 567
           +S     SN     K+  V  SV        K  WHR LGHP+ K+L+ V++ CN+ +  
Sbjct: 493 VSP---QSN-----KDPCVYMSV--------KESWHRKLGHPNNKVLEKVLKDCNVKISP 552

Query: 568 NEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY 627
           +++  FC +CQ+GK H LPF  S S   +   L++SDVWGPAP+LS SGF+YYV F+DD+
Sbjct: 553 SDQFSFCEACQFGKLHLLPFKSSSSHVQEPLGLIHSDVWGPAPILSPSGFKYYVHFIDDF 612

Query: 628 SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSR 687
           SRF W++PLKQKSDT +AF  F  + + QFN  I+  Q D G E+  V ++  + GI+ R
Sbjct: 613 SRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFR 626

Query: 688 YSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
            SCPYTSQQNGRAERKHRH+VE  LTLLAQA MPL +WW
Sbjct: 673 MSCPYTSQQNGRAERKHRHVVELGLTLLAQAKMPLRYWW 626

BLAST of Tan0005870 vs. NCBI nr
Match: PNX78574.1 (retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense])

HSP 1 Score: 498.0 bits (1281), Expect = 1.4e-136
Identity = 290/694 (41.79%), Postives = 388/694 (55.91%), Query Frame = 0

Query: 33  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSES 92
           ++ L+R NF LWK+L LPI+R  +L+ ++LGTK CP  F++ A   G             
Sbjct: 18  SVMLDRENFPLWKSLVLPIIRGCRLDGYILGTKECPEQFITSAEASGK------------ 77

Query: 93  TTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQSRAEED 152
              INP +  W   DQ ++ WL N+M +  A+Q++ C T+K LW+  Q L    +R+   
Sbjct: 78  --KINPDFGDWQAEDQRVLGWLLNTMTTGTASQLLHCETSKQLWEEAQSLASAHTRSRVI 137

Query: 153 YLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVG 212
           YL   F  +RKG  KM +YL  MK  AD L  A SP++   LI Q L GLD +YNP+VV 
Sbjct: 138 YLRSEFHNTRKGEKKMEDYLMKMKDLADKLKMAGSPITNVDLIIQTLNGLDSDYNPIVVK 197

Query: 213 IQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYS 272
           +  +  +SW D Q  L  FE RL+  N+  N    + NA+ N+A             N +
Sbjct: 198 LSDQINLSWVDLQAQLLAFESRLDQLNSFNN---LNRNATTNVA-------------NKA 257

Query: 273 PNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP 332
               N  N+  S RG +  +   GRG+G  SN+  ICQVC K GHTA+ C +R++K Y+ 
Sbjct: 258 QFRGNIYNHRGSWRGSSFRNTRGGRGKGRPSND--ICQVCNKHGHTAIECDHRYDKSYTG 317

Query: 333 STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGAS 392
           S+  N   +  +                          N FLA+     D  WY +SGAS
Sbjct: 318 SSYSNANVERQR------------------------THNAFLASRYNSQDYEWYFDSGAS 377

Query: 393 HHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAK 452
           +HVT          E  G +S+IVGNG++  I  +G+S L     NL L +VL  P + K
Sbjct: 378 NHVTHQADKFQELTENSGKNSLIVGNGAKLKIDASGSSKLK----NLNLHDVLYVPQITK 437

Query: 453 NLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPI 512
           NL+S+S+L  DN+I +EF +  C VKDK TGK LL+G LK+GLY L+N            
Sbjct: 438 NLLSVSKLTSDNNIIVEFDNDCCFVKDKLTGKVLLRGILKDGLYQLSN------------ 497

Query: 513 LSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHY 572
                S+  K+  V  SV        K  WHR LGHPS  +LD V++ CN+    +++  
Sbjct: 498 ---GSSQTNKDPCVYLSV--------KESWHRKLGHPSNNVLDKVLKICNVKTSPSDKFK 557

Query: 573 FCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW 632
           FC +CQ GKSH LPF  S S A +  EL+++DVWGPAP+ S SGF+YYV F+DD SRF W
Sbjct: 558 FCEACQLGKSHLLPFKSSSSHAQEVLELIHTDVWGPAPINSISGFKYYVHFIDDSSRFTW 617

Query: 633 VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPY 692
           +YPLKQKSDT +AF  F  MV+ QFN  I+  Q D G EF  V ++  + GIK R SCPY
Sbjct: 618 IYPLKQKSDTIHAFMQFKNMVENQFNKRIKIIQCDGGGEFKPVQKVALETGIKFRMSCPY 628

Query: 693 TSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
           TSQQNGRAERKHRH+ E  LTLLAQA+M L +WW
Sbjct: 678 TSQQNGRAERKHRHVAELGLTLLAQANMSLHYWW 628

BLAST of Tan0005870 vs. ExPASy TrEMBL
Match: A0A2K3LCM1 (Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g032236 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 8.2e-143
Identity = 303/707 (42.86%), Postives = 410/707 (57.99%), Query Frame = 0

Query: 20  NSPPLNQLLNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEG 79
           NS   N L + V ++KL+R N+ LW+++ LPI+R  +L+ ++LG K CP  F++ A    
Sbjct: 6   NSNHKNDLPSTV-SVKLDRDNYPLWQSMVLPIIRGARLDGYMLGKKKCPEEFITAA---- 65

Query: 80  NVTVEGASFSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVI 139
                      +S+   NP +E W   DQ L+ WL NSM   +ATQ++ C T+  LWD  
Sbjct: 66  -----------DSSKKFNPEFEDWQAYDQQLLGWLRNSMTVGIATQLLHCETSMQLWDEA 125

Query: 140 QLLFGVQSRAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVL 199
           Q L G  +R++  YL   F  +RKG MKM +YL  MK  AD L  A +P+ST  LI Q L
Sbjct: 126 QSLAGAHTRSQITYLKSEFHSTRKGEMKMEDYLIKMKNLADKLKLAGNPISTSDLIIQTL 185

Query: 200 LGLDEEYNPVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINR 259
            GLD EYNPVVV +  +  +SW D Q  L  FE R+E    Q NS++   N ++N   N 
Sbjct: 186 NGLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFENRIE----QLNSLT---NLTLNATANV 245

Query: 260 GTQSQRQQQQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTA 319
             +S  +  +  S NN+  SNNN   RG   N RG   GRG   + +  CQVCG   H A
Sbjct: 246 AKKSDHRGNRFNSNNNWRGSNNN--WRGS--NFRGWRGGRGRGRSFKTTCQVCGLDNHIA 305

Query: 320 LVCYNRFNKEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPET 379
           + C+ RF+K YS S   N  +  ++QG                        N FLA+  +
Sbjct: 306 IDCFYRFDKTYSRS---NHSANNDKQG----------------------SHNAFLASQNS 365

Query: 380 IGDPSWYANSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNL 439
           I D  WY +SGAS+HVT       N  E+ G +S+IVGNG +  I  TG+S L S    L
Sbjct: 366 IEDYDWYFDSGASNHVTHQTDKFQNLSEHHGKNSLIVGNGEKLEIVATGSSKLKS----L 425

Query: 440 RLQNVLCAPNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLA 499
            L ++L  P + KNL+S+S+LA DN+I +EF +  C VKDK TGK +L+G LK+GLY L+
Sbjct: 426 NLHDILYVPKITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKAILRGILKDGLYQLS 485

Query: 500 NTLVKPVDISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIR 559
                                 K++S   S+        K  WHR LGHP+ K+LD V++
Sbjct: 486 E---------------------KDSSAYVSI--------KESWHRKLGHPNNKVLDIVLK 545

Query: 560 SCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRY 619
           SCN+ +  +++  FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++S+SGF+Y
Sbjct: 546 SCNVKLSPSDQFSFCEACQYGKMHFLPFKTSFSHAKEILELVHTDVWGPAPIISSSGFKY 605

Query: 620 YVLFLDDYSRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLC 679
           YV F+DD++RF W+YPLKQKSDT +AF  F  MV+ QF+  I++ Q D G E+  V +  
Sbjct: 606 YVHFIDDFTRFTWIYPLKQKSDTAHAFIQFKNMVENQFSKKIKTIQCDGGGEYKPVQKHA 627

Query: 680 SQLGIKSRYSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
            + GI+ R SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Sbjct: 666 IEAGIQFRMSCPYTSQQNGRAERKHRHIAEFGLTLLAQAKMPLNYWW 627

BLAST of Tan0005870 vs. ExPASy TrEMBL
Match: A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)

HSP 1 Score: 516.5 bits (1329), Expect = 1.8e-142
Identity = 298/694 (42.94%), Postives = 398/694 (57.35%), Query Frame = 0

Query: 33  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSES 92
           ++KL+R N+ LWK+L LP++R  KL+ ++LGT+ CP  F++               SS+S
Sbjct: 18  SVKLDRNNYPLWKSLVLPVIRGCKLDGYMLGTEGCPEEFIT---------------SSDS 77

Query: 93  TTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQSRAEED 152
           + + N  +  W   DQ L+ W+ NSM +E+ATQ++ C T+K LWD  Q L G  +R++  
Sbjct: 78  SKNKNSAFVEWQANDQRLLGWMLNSMTTEIATQLLHCETSKQLWDEAQSLAGAHTRSQII 137

Query: 153 YLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVG 212
           YL   F   RKG MKM +YL  MK   D L  A +PVST  LI Q L GLD EYNPVVV 
Sbjct: 138 YLKSEFHSIRKGEMKMEDYLIKMKNLVDKLKLAGNPVSTSDLIIQTLNGLDSEYNPVVVK 197

Query: 213 IQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYS 272
           +  +  +SW D Q  L  FE R+E  N   N  + + NA+ N+A        R   +  S
Sbjct: 198 LSDQTTLSWVDLQAQLLTFESRIEQLN---NLTNLTLNATANVA-------NRSDHRGKS 257

Query: 273 PNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP 332
            NN  R +N+   RG        GRGRG +  N   CQVCG   H A+ C++RF+K YS 
Sbjct: 258 SNNNWRGSNSRGWRG--------GRGRGKSGKNP--CQVCGLSNHIAIDCFHRFDKTYSR 317

Query: 333 STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGAS 392
           S   N  +  ++QG                        N FLA+  ++ D  WY +SGAS
Sbjct: 318 S---NHSAGHDKQG----------------------SHNAFLASQNSVEDYDWYFDSGAS 377

Query: 393 HHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAK 452
           +HVT       +  E+ G +S++VGNG +  I  TG+S L S    L L ++L  PN+ K
Sbjct: 378 NHVTHQTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKLKS----LNLHDILYVPNITK 437

Query: 453 NLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPI 512
           NL+S+S+LA DN+I +EF +  C VKDK TGK +LKG LK+GLY L+ T           
Sbjct: 438 NLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLSGT----------- 497

Query: 513 LSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHY 572
                    K N  AF          K  WHR LGHP+ K+LD V+ SC + V  ++   
Sbjct: 498 ---------KRNPSAF-------VSVKESWHRRLGHPNNKVLDKVLESCKVKVPPSDNFS 557

Query: 573 FCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW 632
           FC +CQYGK H LPF  S S A +  ELV++DVWGPAP++++SGF+YYV F+DD+SRF W
Sbjct: 558 FCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTW 617

Query: 633 VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPY 692
           +YPLKQKS+T  AF  F  + + QFN  I+  Q D G E+  V +L  + GI+ R SCPY
Sbjct: 618 IYPLKQKSETVQAFIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPY 620

Query: 693 TSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
           TSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Sbjct: 678 TSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWW 620

BLAST of Tan0005870 vs. ExPASy TrEMBL
Match: A0A2K3MUJ9 (Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g017679 PE=4 SV=1)

HSP 1 Score: 503.8 bits (1296), Expect = 1.2e-138
Identity = 290/699 (41.49%), Postives = 394/699 (56.37%), Query Frame = 0

Query: 28  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGAS 87
           L    ++KL+R NF LWK+L LP++R  K + ++LGTK CP  F++              
Sbjct: 12  LPSTVSVKLDRDNFPLWKSLVLPLIRGCKYDGYMLGTKKCPDQFVT-------------- 71

Query: 88  FSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQS 147
            S ++T  INP Y+ W   DQ L+ WL NSM  ++ATQV+ C T+K LWD  Q L G  +
Sbjct: 72  -SIDNTEKINPDYQDWQADDQALLGWLMNSMTVDIATQVLHCETSKQLWDEAQSLAGAHT 131

Query: 148 RAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYN 207
           R+   YL   F  + K  MKM +YL  MK  AD L  A SP+S+  L+ Q L GLD EYN
Sbjct: 132 RSRIIYLKSEFHNTHKREMKMEQYLAKMKNLADKLKLAGSPISSSDLMIQTLNGLDSEYN 191

Query: 208 PVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQ 267
           PVVV +  +  ISW D Q  L  FE RL+  N   N    + NAS N A    +   +  
Sbjct: 192 PVVVKLSDQTNISWVDFQAQLLAFESRLDQLNNFNN---INLNASANFASKNESGGNK-- 251

Query: 268 QQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN 327
              +      R +N+   RG      GRGR R  +   RPICQ+CGK GHTA  CY RF+
Sbjct: 252 ---FGSRGGWRGSNSRGMRG------GRGRAR-MSKPPRPICQICGKFGHTAAQCYYRFD 311

Query: 328 KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYA 387
           K Y         ++ N    G  + S+                  F+A+P    D  WY 
Sbjct: 312 KSY---------TEKNHYAEGEGSHSA------------------FVASPYHGQDYEWYF 371

Query: 388 NSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCA 447
           +SGAS+HVT   G L +  E  G +S++VGNG +  I  +G++ L     ++ L+NVL  
Sbjct: 372 DSGASNHVTHQSGQLQDLNENNGKNSLLVGNGEKLKILASGSTKLN----DVNLRNVLYV 431

Query: 448 PNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVD 507
           P + KNL+S+S+L  DN+  +EF + YC VKDK TGK LLKG LK+GLY L+        
Sbjct: 432 PEITKNLLSVSKLTIDNNALVEFDENYCYVKDKLTGKALLKGRLKDGLYQLS-------- 491

Query: 508 ISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLV 567
                 ++ E    K+     S+        K  WHR LGHP+ K+L+ V++  N+ +  
Sbjct: 492 ------ANKEPPTNKDPCAYISL--------KEIWHRKLGHPNNKVLEKVLKDNNVKISP 551

Query: 568 NEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY 627
           +++  FC +CQ+GK H LPF  S S A +  +L+++DVWGPAP+LS S F+YYV FLDD+
Sbjct: 552 SDKFTFCEACQFGKLHLLPFKTSSSHAKEPLDLIHTDVWGPAPILSQSNFKYYVHFLDDF 611

Query: 628 SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSR 687
           SRF W++PLKQKS+T +AF  F  +V+ QFN  I+  + D G E+  V +     GI+ +
Sbjct: 612 SRFTWIFPLKQKSETIHAFNQFKNLVENQFNKKIKVIRCDGGGEYKPVQKCAIDSGIQFQ 627

Query: 688 YSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
            SCPYTSQQNGRAERKHRH+ E  LTLLAQA MPL +WW
Sbjct: 672 MSCPYTSQQNGRAERKHRHVTELGLTLLAQAKMPLSYWW 627

BLAST of Tan0005870 vs. ExPASy TrEMBL
Match: A0A2K3NEN7 (Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g024786 PE=4 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 6.7e-137
Identity = 292/699 (41.77%), Postives = 405/699 (57.94%), Query Frame = 0

Query: 28  LNQVTTIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGAS 87
           L  + ++KL+R N+ LWK+L LP++R  K + ++LGTK CP  F++              
Sbjct: 13  LPSIISVKLDRDNYPLWKSLVLPLIRGCKFDGYILGTKECPEQFVT-------------- 72

Query: 88  FSSESTTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQS 147
            S++ +  +NP ++ WM  DQ L+ WL NSM  ++ATQ++ C T+K LWD  Q L G  +
Sbjct: 73  -SADKSKKVNPDFQDWMADDQALLGWLMNSMAIDIATQLLHCETSKQLWDEAQSLAGAHT 132

Query: 148 RAEEDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYN 207
           ++   YL   F  +RKG MKM EYL  MK  +D L  + SP+S   L+ Q L GLD EYN
Sbjct: 133 KSRIIYLKSEFHNTRKGEMKMEEYLIKMKNLSDKLKLSGSPISNSDLMIQTLNGLDAEYN 192

Query: 208 PVVVGIQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQ 267
           PVVV +  +  +SW D Q  L  FE RL+  N   N    + NAS N A     +++ + 
Sbjct: 193 PVVVKLSDQINLSWVDVQAQLLAFESRLDQLN---NFSGLTLNASANFA----NKTEFRG 252

Query: 268 QQNYSPNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFN 327
            + +S  N+ RS N    RG        GRG+G  SN +  CQVC   GHTA+ C  RF+
Sbjct: 253 NKFHSRGNWRRS-NFRGMRG--------GRGKGRMSNTK--CQVCSGTGHTAVDCSYRFD 312

Query: 328 KEYSPSTNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYA 387
           + Y   T +N  ++ ++QG           + + FVASP               D  WY 
Sbjct: 313 RSY---TGRNYSTEADKQG-----------SHSAFVASPYHGQ-----------DYEWYF 372

Query: 388 NSGASHHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCA 447
           +SGAS+HVT          E+ G +S++VGNG +  I  +G++ L +    L L +VL  
Sbjct: 373 DSGASNHVTHQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLNT----LNLHDVLYV 432

Query: 448 PNMAKNLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVD 507
           P + KNL+S+S+L  DN+I++EF    C VKDK TG+ LLKG LK+GLY L+       D
Sbjct: 433 PQITKNLLSVSKLTADNNIFVEFDANCCSVKDKLTGQTLLKGRLKDGLYQLS-------D 492

Query: 508 ISQPILSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLV 567
           +S     SN     K+  V  SV        K  WHR LGHP+ K+L+ V++ CN+ +  
Sbjct: 493 VSP---QSN-----KDPCVYMSV--------KESWHRKLGHPNNKVLEKVLKDCNVKISP 552

Query: 568 NEEHYFCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDY 627
           +++  FC +CQ+GK H LPF  S S   +   L++SDVWGPAP+LS SGF+YYV F+DD+
Sbjct: 553 SDQFSFCEACQFGKLHLLPFKSSSSHVQEPLGLIHSDVWGPAPILSPSGFKYYVHFIDDF 612

Query: 628 SRFVWVYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSR 687
           SRF W++PLKQKSDT +AF  F  + + QFN  I+  Q D G E+  V ++  + GI+ R
Sbjct: 613 SRFTWIFPLKQKSDTIHAFIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFR 626

Query: 688 YSCPYTSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
            SCPYTSQQNGRAERKHRH+VE  LTLLAQA MPL +WW
Sbjct: 673 MSCPYTSQQNGRAERKHRHVVELGLTLLAQAKMPLRYWW 626

BLAST of Tan0005870 vs. ExPASy TrEMBL
Match: A0A2K3LJ49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratense OX=57577 GN=L195_g034552 PE=4 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 6.7e-137
Identity = 290/694 (41.79%), Postives = 388/694 (55.91%), Query Frame = 0

Query: 33  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSES 92
           ++ L+R NF LWK+L LPI+R  +L+ ++LGTK CP  F++ A   G             
Sbjct: 18  SVMLDRENFPLWKSLVLPIIRGCRLDGYILGTKECPEQFITSAEASGK------------ 77

Query: 93  TTSINPLYEAWMTVDQLLISWLYNSMISEVATQVMGCNTAKDLWDVIQLLFGVQSRAEED 152
              INP +  W   DQ ++ WL N+M +  A+Q++ C T+K LW+  Q L    +R+   
Sbjct: 78  --KINPDFGDWQAEDQRVLGWLLNTMTTGTASQLLHCETSKQLWEEAQSLASAHTRSRVI 137

Query: 153 YLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVVVG 212
           YL   F  +RKG  KM +YL  MK  AD L  A SP++   LI Q L GLD +YNP+VV 
Sbjct: 138 YLRSEFHNTRKGEKKMEDYLMKMKDLADKLKMAGSPITNVDLIIQTLNGLDSDYNPIVVK 197

Query: 213 IQGKYEISWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQNYS 272
           +  +  +SW D Q  L  FE RL+  N+  N    + NA+ N+A             N +
Sbjct: 198 LSDQINLSWVDLQAQLLAFESRLDQLNSFNN---LNRNATTNVA-------------NKA 257

Query: 273 PNNYNRSNNNNSQRGGTPNSRGRGRGRGYNSNNRPICQVCGKIGHTALVCYNRFNKEYSP 332
               N  N+  S RG +  +   GRG+G  SN+  ICQVC K GHTA+ C +R++K Y+ 
Sbjct: 258 QFRGNIYNHRGSWRGSSFRNTRGGRGKGRPSND--ICQVCNKHGHTAIECDHRYDKSYTG 317

Query: 333 STNQNRQSQPNQQGFGPNTSSSSFQAPNVFVASPNTPSNPFLATPETIGDPSWYANSGAS 392
           S+  N   +  +                          N FLA+     D  WY +SGAS
Sbjct: 318 SSYSNANVERQR------------------------THNAFLASRYNSQDYEWYFDSGAS 377

Query: 393 HHVTTDFGNLANPIEYGGMDSIIVGNGSQSPITFTGNSCLTSGKYNLRLQNVLCAPNMAK 452
           +HVT          E  G +S+IVGNG++  I  +G+S L     NL L +VL  P + K
Sbjct: 378 NHVTHQADKFQELTENSGKNSLIVGNGAKLKIDASGSSKLK----NLNLHDVLYVPQITK 437

Query: 453 NLISISRLAQDNDIYIEFHDTYCVVKDKGTGKQLLKGDLKEGLYCLANTLVKPVDISQPI 512
           NL+S+S+L  DN+I +EF +  C VKDK TGK LL+G LK+GLY L+N            
Sbjct: 438 NLLSVSKLTSDNNIIVEFDNDCCFVKDKLTGKVLLRGILKDGLYQLSN------------ 497

Query: 513 LSSNESKMYKNNSVAFSVCHKPNKVTKTFWHRHLGHPSTKILDSVIRSCNLPVLVNEEHY 572
                S+  K+  V  SV        K  WHR LGHPS  +LD V++ CN+    +++  
Sbjct: 498 ---GSSQTNKDPCVYLSV--------KESWHRKLGHPSNNVLDKVLKICNVKTSPSDKFK 557

Query: 573 FCNSCQYGKSHALPFSISESRASKKFELVYSDVWGPAPVLSTSGFRYYVLFLDDYSRFVW 632
           FC +CQ GKSH LPF  S S A +  EL+++DVWGPAP+ S SGF+YYV F+DD SRF W
Sbjct: 558 FCEACQLGKSHLLPFKSSSSHAQEVLELIHTDVWGPAPINSISGFKYYVHFIDDSSRFTW 617

Query: 633 VYPLKQKSDTGNAFQHFLAMVQTQFNGNIQSFQSDNGTEFLRVHQLCSQLGIKSRYSCPY 692
           +YPLKQKSDT +AF  F  MV+ QFN  I+  Q D G EF  V ++  + GIK R SCPY
Sbjct: 618 IYPLKQKSDTIHAFMQFKNMVENQFNKRIKIIQCDGGGEFKPVQKVALETGIKFRMSCPY 628

Query: 693 TSQQNGRAERKHRHLVETCLTLLAQASMPLVFWW 727
           TSQQNGRAERKHRH+ E  LTLLAQA+M L +WW
Sbjct: 678 TSQQNGRAERKHRHVAELGLTLLAQANMSLHYWW 628

BLAST of Tan0005870 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 62.8 bits (151), Expect = 1.4e-09
Identity = 72/278 (25.90%), Postives = 126/278 (45.32%), Query Frame = 0

Query: 33  TIKLERGNFLLWKNLALPILRSYKLESHLLGTKPCPPMFLSQARTEGNVTVEGASFSSES 92
           T+ L + N+ +W+ L   +  S+ +  H+ G+    PM      TE              
Sbjct: 25  TLDLNKLNYDVWRELFETLCLSFGVLGHIDGSSTPTPM------TE-------------- 84

Query: 93  TTSINPLYEAWMTVDQLLISWLYNSMISEVATQV--MGCNTAKDLWDVIQLLFGVQSRAE 152
                   + W   D L+  W+Y ++   +   +  +GC TA+DLW  ++ LF     A 
Sbjct: 85  --------KRWKERDGLVKMWIYGTITDSLLDTIIKVGC-TARDLWLSLENLFRDNKEAR 144

Query: 153 EDYLCQTFQQSRKGNMKMSEYLRIMKCHADNLGQARSPVSTRSLISQVLLGLDEEYNPVV 212
                   + +   ++ + EY + +K  +D L    SP+S R L+  +L GL E+Y+ ++
Sbjct: 145 ALQFENELRTTTIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYIL 204

Query: 213 VGIQGKYEI-SWADAQNDLFMFEKRLEFQNTQRNSVSFSHNASVNMAINRGTQSQRQQQQ 272
             I+ K    S+ +A++ L M E RL   N  ++S+S +++ S++  +    + Q +  Q
Sbjct: 205 NVIKHKSPFPSFTEARSMLLMEESRL--SNKSKSSLSHTNHPSLSNVLFTVPRQQERYPQ 264

Query: 273 NYSPNNYN--RSNNNNSQRGGTPNSRGRGRGRGYNSNN 306
            Y  NN N  R  +    RGG     G   GR YN+NN
Sbjct: 265 EYHNNNSNMGRGRSKKKNRGG-----GSSDGR-YNNNN 265

BLAST of Tan0005870 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 49.3 bits (116), Expect = 1.6e-05
Identity = 21/70 (30.00%), Postives = 33/70 (47.14%), Query Frame = 0

Query: 542 WHRHLGHPSTKILDSVIRSCNLPVLVNEEHYFCNSCQYGKSHALPFSISESRASKKFELV 601
           WH  L H S + ++ +++   L         FC  C YGK+H + FS  +       + V
Sbjct: 72  WHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNPLDYV 131

Query: 602 YSDVWGPAPV 612
           +SD+WG   V
Sbjct: 132 HSDLWGAPSV 141

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q94HW23.4e-8532.54Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.4e-8332.20Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109781.8e-3327.89Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q034941.8e-2226.97Transposon Ty2-DR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Q123371.8e-2226.97Transposon Ty2-GR1 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
PNX76291.11.7e-14242.86gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium praten... [more]
GAU19483.13.8e-14242.94hypothetical protein TSUD_77270 [Trifolium subterraneum][more]
PNX94503.12.5e-13841.49putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense... [more]
PNY01489.11.4e-13641.77copia-like polyprotein, partial [Trifolium pratense][more]
PNX78574.11.4e-13641.79retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense][more]
Match NameE-valueIdentityDescription
A0A2K3LCM18.2e-14342.86Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) OS=Trifolium prat... [more]
A0A2Z6MBG61.8e-14242.94Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ... [more]
A0A2K3MUJ91.2e-13841.49Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium prat... [more]
A0A2K3NEN76.7e-13741.77Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g024786... [more]
A0A2K3LJ496.7e-13741.79Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratens... [more]
Match NameE-valueIdentityDescription
AT5G48050.11.4e-0925.90CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... [more]
ATMG00300.11.6e-0530.00Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 597..693
e-value: 4.7E-12
score: 46.1
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 593..755
score: 18.293158
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 523..582
e-value: 5.2E-11
score: 42.3
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 589..730
e-value: 3.5E-30
score: 106.7
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 103..234
e-value: 6.0E-9
score: 35.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 331..363
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 261..304
NoneNo IPR availablePANTHERPTHR34222:SF48SUBFAMILY NOT NAMEDcoord: 44..432
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 44..432
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 593..741

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0005870.1Tan0005870.1mRNA
Tan0005870.2Tan0005870.2mRNA
Tan0005870.3Tan0005870.3mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding