Lag0030971 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0030971
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Integrase catalytic domain-containing protein
Location: chr11: 3380437 .. 3388386 (-)
RNA-Seq Expression: Lag0030971
Synteny: Lag0030971
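
The Location field packs chromosome, start, end and strand into one string. The sketch below shows one way to split it and compute the genomic span. It assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr11: 3380437 .. 3388386 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 7950 bp on the minus strand of chr11.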
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCTTCTTCATCTAGCGTCTCTGATAACTCTTCTGGCTGTAGTACTCAGCCAAATTCGTCAATTTTTCTTCTATCAAATATATGCAATCTCGTCCCTGTTCGCCTCGATTCCTCAAATTTCTTGTTCTGGAAGTTTCAAGTTCAGTCAATGCTCAAGGCTCATTCCCTGTTTGGAATCGTTGATGGATCTGTGCCGTGTCCGCCTGAGTTTTTGCTTGATGCTGACGGAAGGAAGACTACTGAAGTTAACTCTGCACATACTCAGTGGATTGCTCAAGATAGCGCTCTAATCACGTTGATTAACGCTACCCTATCGAAGACTGCTTATTCATATGTCATTGGATGTAAATCCTCCAAGGAGGTATGGCTTTCTCTTGAAAAACAATTTTCTTCTTTAACGAGGTCTCATGTTCATGAACTCAAGTCATCGCTTCATACAATCTCTAAATCTGCCACTGAATCTATTGATGAATACCTGCTTCGAGTGAAAGAATTAGTAGATAAGCTTGCCACTGTCTCAGTACCAATTGATGATGAAGATCTGTTGTTGTACACCTTAAATGGTTTGTCGTCGGAGTACAATTCATTTCGTACCTCTATTCGAACTCGAGAAGGAACTTTAAAATTACAAGAACTTCATGCCTTGTTGAAGTCAGAGGCCAAGATTCTCGAGCAGCAAAACAAGTCCATCACTACCACTCCTCTTGTTCCGACTGCGATGTTTTCGTCTGTTTCAGGCCAGTTCAATTCCAATCGAGGACGCGGTAGAGGCAGAAATTCAAATCCGAGTTACTGGTCAGGTCGCGGAGGTGGACGATCGAATCAGGGATACTATGGGAATTCAAATCAAGGTCGTGGTGGTTTCAATTCCTCTAATCCATCCCCTGGTTTTCCTCAGGGACATTCTGCTCCAAATCAGGGACGTGGTGCTAATTCTGGATCTAATTCGAGTTCTGGACGTGGAGTTATATGTCAAATTTGCAATCGGTCTGGCCATGGAGCCTTAGATTGTTACAATCGTCTGAATCTTTCGTTCCAGGGAAGACATCCTCCATCAAAGCTTGCTGCAATGGCCTCTATCTATGATCCTCAGTCGTCTAATAATAGTGGCAACAACAATTGTACTTGGTTGGCTGATAGTGGTTGTAACTCTCATGTTACACCTGATCTGTCTACTCTTGCTCTGAATTCAAACTATAATGGTGAGGATGCTATCACGGTTGCAAATGGGCAAGGTGTCCCTATTACTCAAACAGGTTCTGGCACACTCTCCACCTCCCACAGTGACCTTAATCTGTCGAAAATTCTTTGTGTTCCTGATCTCTCAGCCAATTTGTTATCTGTTTCACAATGTTGCCTTGATAACAACTGCATCTTTGTTTTCGATGCTGATTGGTTTTCCATACAGGACAAAACCTCGGGACGAATTTTATACAAGGGCAAGAGTAAGGATGGGTTATATCCCATATCTTCCATATCTTCGGCTGGTTCTTCTTTAAGAAGTAACTCTACAGTGGCTGGTTTGCATACGTCTGCACCTTTATCTCCTATATGTGCATCTGTTTTTGTTTCTCATGTTCCTAGTACTGTGTTGTGGCATTTACGTCTTGGACATCCCTCTTTTCCTGTTTTGCAAAAGTTGTTGTCTGCAAATTCTATTACTTGTGATACTAAGTATACTTGTCGAGACTGTGTGAGTTGTCTCAAAGGGAAAGCTTCTAAGCTTCCTTTTGCATCCTCTACTTCCATTACTACTAGGCCTCTTGCCCTGTTGCATAGTGATGTGTGGGGACCATCACCTTTAATCTCTGTTTCGGGCTTTCGATACTATGTCAATTTTGTTGATGACTTCAGCAAGTTTACTTGGATATTTCCTTTAGTACGAAAATCTGATGTTTCTTCTGTTATAAAGCAGTTTGTTCCTTTTATTGAAAATCAGTTATCTTGCTCTTTACAAGTATTTCGATCAGATGGAGGGGGTGAGTTCGTTAACCA
TTATGTGTATGAGTTCTTCTCATCCAAAGGTGTTCTACATCAGAGGAGTTGCCCTCATACACCTGAACAGAATGGGGCTGCTGAGCAAAAGCATCGATCTATTGTTGACACTGCTATTGCTCTTATGAATCATGCATCCGTTCCTTTAGAATTTTGGTATCATGCTTTTGCTACTGTTGTGTTTTTGTTGAATCGTCTGCCTTCTAGTGCTATTGGCTTTATGACTCCGTTTCAGAAGTTATATGGTTGTGTGCCTGATTTATCTCATTTGCGTGTGTTTGGTTGTGCCTGTTATCCACTGTTAAAGCCTTACAACACTCATAAGTTACAACCAAAAACTGCTCAGCATGTTTTCTTGGGCTATACCTTGGAATACAAAGGGTATCTATGTTATAATATGGAAACCAAGAAGATGTTGGTCTCTCGGCATGTTGTATTTCATGAAGATGTGTTCCCATTTGCGATTAGACCTGTCACACGGTCTCACAGTCACAGCCACACCCTCCCTTCACAGCCACAACCAAATCCTGCTGTTGTAACTTCGTTGTTGTCTTCACAATCAAGTCTTGTTGTTCGTGTACTACCAAATAATTCTAGTGTTGGTGATGTTTTATTATCACCAATTAACACACCATCTACAAGTGATGTTCAACCATCACCTGTCTTGCCTACTACTGATGTTTTACCATCTTCAGCTTCTCATTCTTCTGTTGTTGTGGCATCAGCAAACAGTAGTCCTAGTCAGTCTGCTGCTATATCAGCAAACAATCAGTCTGATCCACCTGCAGCTGTTACAATCAATAGTCACCCTATGATTACTCGCTCAAAAGCCGGTATTTCCAAACGGAAGGCTTTTAGTGCTGTTATTCCAAAGTCTATACCTGAACCAACTTCGTTCACAGCTGCTTCTAAAATCCCTGAGTGGAAACGTGCTATGTTGGATGAGTATACTGCCCTAACAAATCAGAATACTTGGACTCTTGTTCCATCTCAAGAAGACATGAATATTGTTGGTTGTAAGTGGGTCTTTCGCACTAAATTTAATCCAGATGGGTCAGTTGCTCGTTACAAAGCCCGTTTGGTTGCTAAAGGCTACAATCAACGTGAGGGTGTTGACTTCGATGAAACCTTTAGTCCTGTAGTCAAGAAAACTACTGTGCGAGTTATACTTGCTTTGGCAGCTCATTATGGGTGGTCTTTACATCAGTTAGATGTTAAGAACGCCTTTCTCCATGGCTATCTTAAGGAAGATGTATACATGGTTCAGCCCCCCGGCTTTAGGGATTCAAATTATCCTAATCACGTGTGTAAACTTCACAAATCATTATATGGTTTAAAACAAGCACCTCGTTACTCTTGTATGTTGATGATATAATCATCACTGGTTCTGATCCAAGTTATATCTCTGCTTTAAAGACAACTCTTGGTCGGGAGTTTCAACTTACTGATCTCGGTTCTCTTCGTTATTTTATTGGTCTCGAAATTACTCGATTTTCTGATGGTTTTCATGTAACTCAGTTGAAATATCTTACTGATTTACTAGAAAAGACTGGTATGGCAGACTCAAAGACATGTTCAACTCCCATGTCTACTACGTGTGAGCTTCATGGTGTTTCTCCATTTTTTTCTGATGCTTTGCTTTATAGACAGATTGTTGGTTCCTTACAATACTTGACCTTTACTAGACCCGACATTACCTATGTTGTAAGTAAAGTGAGTCAGTTTATGCACAAACCCACTGAAATACATTATTCTGCTGTGAAGCGTATTCTTCGCTATCTTAATGGCACTCGAGATTGTGGAATTCTGTTTTCCAAAGGGAAACTTGAACTCACTGCCTATAGTGATGCTGATTGGGCAGGAGATTCCATGGATCGACGGTCTACTTCTGGATATGTGGTCTTTTTTTGTGGTATTCCGGTTTCTTGGTCTGCTAAAAAGCAAAGCACGGTGTCTCGGTCTTCAACTGAGGCTGAATATCGCTCTCTTGCACAAACGGC
TGCTGAACTATATTGGATAAGACAATTACTGTGTGATCTTTGTGTTTTTCTTCCTACAGTTCCTTTATTGTTTTGTGATATTTTCTACGGGAGAAAGTTGTTCGAGGTGATTTACAAGTATGTTTTACTCCTTCAGCTTCTCAGTTGGCAGATTTATTTACGAAACCACTTGGAGGTGCTGTTTTTCAGCAATTCTGCAGCAAACTTCTAAAGGTTCCTTCATCAGTTTGAGGGGGAGTAATAGAGTTCAGTAATTATATTTTATTAATCCCTCTGTAATAAGTCTGTTAGAGTTTGTTAAGTAGTAGTTTTGGCTTAACCTCATGTATAAATACAGTGTAAATTCTCGTGTATATCAGTTCTTGTATTCAGTGAGTTTGCACACTACACATATCTCCACAATCGTATGATATTGTCCACTTTGGGCATAACCCTCATGGCTTTGCTTTTGGTTCACTCCAAAAAGCCTCATACTAGTGGAGATAGCTGTCATCCCTTATAAACCCATGGTCATCCCCTTATCTAGTCGATGTGGGACTTTGGTCGCACTCCCAAACACTTCTTTCTAACTCACTTGGTTTAATTTCATTGCTCAATGGGGTGACGAATGGCGACGTTGAAGTTCATGCTCTACTATGGGATGAGGAAATCAGGAGGAGGTTCCGAGATGTGAAATTTATACATACTCCTTGAGATTACAACAAAATTGCTCACAAATTAGCTCGTGTGGGGCTAAGTTCATGTTCAAAATTATGGTTTTGTGAATAACCAACTTGGATTCTAAATATGGTTCATAGTGAAAAATCATTGTTTTGTATCCTTGAGGGAACTAGAAGCTTTTATCTTATTTCCAAAAAAAAAAAAAAAAAAAAAAAAACATTTTAAGTTCCACAATTATATTATTGTCGCTATTATTAGGAACTTGAAGGGCTAATATCTATAATGAATTTAATAATTAAAATGGTTTTGTAATTGTTTAAATGATTTTTAAAACAATCCATTTTTAAATAGTTAAATGCTAACATGAGATAATAGTAATGGATAAATAGGGATTTATTATTAATTATGTAATAAATTTTTATCTAAATTTATATTGCCTTTTTTACTAATAAAATCATAGGAAAATTTTAACAATGGTTAAAAGAAAAATATAATTTAAATAAATTATATACCATCAAGTTTTATATATAATAAATTAGTAGAATAATAGTTTTTTTTAAATGAAGGTATATATAAAAATATAAAAGTTGGTAGGTGAAGAGCTTGAATATAAAAGTTTTATTCATTAAATTTCTATGAATTATTTTTTTTAGAAAAAAATTCTATGAATTAATTATAAAATTTATAACTTCAAATTTAATTAACAATCAATGGTGGGATTTTATAGGTTTTGGAATTGCAAAAGTTTTGTAACATTCTTCAACTCCACTATATAAAGAACTCTCTCCTAGCTACTTTCTTCTACAAAAACCGAAGATCAAAACAATCTTTTTTTTTTTTTTTACAATATGTGGGAGGGGGATTCGATGTGGTTATCAGTACAAGGTTATGCCAGTTGAACTAAGCTCTTGTTCGCTAAACGATCTTCACATATGTATCTATTTCTAACATATATATATTTTCATTCTAACATTTCAAAAAAAAAAAAAATCATTCTAACACTATTATTCATTGATTTCTTTATTCTAATCCTAATAACTAATGTAAATATTATCTTAATTTAAAATTTTAAGATAATTTCGATAGTTTTAAATGAAAGTTAGTTTATTTAGTATTAATATAAAAGTTGTATGATGTAATGGTATTTTTCATTTTTTCTACAACAATTAGCTTTCTCATTTAATTGTTTTCTCATCTATTAGTCTTCATATATTTTAGATTTCGAAGATTATTGCATGTATATTATAACACACTATAGATGTTATAGGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGAC
CAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCATGGCCGAGGCCGACCCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTATTTATATCCCTCTTCGCCACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTCTACTATTGACTCTCTACTTTCTGCTCTTGCTCTTACTTTTCCACGCCCTCCGTTCTGCTTTCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACCACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACAGTCTTCCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGTGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAATCTGCACATCGGTTATTCCATGAGTAAGGGTATGGAAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGAGGACAGCATCCGGGGGTCACCGAGACAAGCAGGCCGAGGCCGAGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAGATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCGCACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAGGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGTCGAGTCCGAGGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGGTGCCGTGCATTCTTTTTCACCCTGACAGGATCAGCTAGGCACTGGTTTGAGAGGCTGAAAAGGAGATCCATCAGCTGTTTCAAAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATATAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCTGA

mRNA sequence

ATGGCTTCTTCTTCATCTAGCGTCTCTGATAACTCTTCTGGCTGTAGTACTCAGCCAAATTCGTCAATTTTTCTTCTATCAAATATATGCAATCTCGTCCCTGTTCGCCTCGATTCCTCAAATTTCTTGTTCTGGAAGTTTCAAGTTCAGTCAATGCTCAAGGCTCATTCCCTGTTTGGAATCGTTGATGGATCTGTGCCGTGTCCGCCTGAGTTTTTGCTTGATGCTGACGGAAGGAAGACTACTGAAGTTAACTCTGCACATACTCAGTGGATTGCTCAAGATAGCGCTCTAATCACGTTGATTAACGCTACCCTATCGAAGACTGCTTATTCATATGTCATTGGATGTAAATCCTCCAAGGAGGTATGGCTTTCTCTTGAAAAACAATTTTCTTCTTTAACGAGGTCTCATGTTCATGAACTCAAGTCATCGCTTCATACAATCTCTAAATCTGCCACTGAATCTATTGATGAATACCTGCTTCGAGTGAAAGAATTAGTAGATAAGCTTGCCACTGTCTCAGTACCAATTGATGATGAAGATCTGTTGTTGTACACCTTAAATGGTTTGTCGTCGGAGTACAATTCATTTCGTACCTCTATTCGAACTCGAGAAGGAACTTTAAAATTACAAGAACTTCATGCCTTGTTGAAGTCAGAGGCCAAGATTCTCGAGCAGCAAAACAAGTCCATCACTACCACTCCTCTTGTTCCGACTGCGATGTTTTCGTCTGTTTCAGGCCAGTTCAATTCCAATCGAGGACGCGGTAGAGGCAGAAATTCAAATCCGAGTTACTGGTCAGGTCGCGGAGGTGGACGATCGAATCAGGGATACTATGGGAATTCAAATCAAGGTCGTGGTGGTTTCAATTCCTCTAATCCATCCCCTGGTTTTCCTCAGGGACATTCTGCTCCAAATCAGGGACGTGGTGCTAATTCTGGATCTAATTCGAGTTCTGGACGTGGAGTTATATGTCAAATTTGCAATCGGTCTGGCCATGGAGCCTTAGATTGTTACAATCGTCTGAATCTTTCGTTCCAGGGAAGACATCCTCCATCAAAGCTTGCTGCAATGGCCTCTATCTATGATCCTCAGTCGTCTAATAATAGTGGCAACAACAATTGTACTTGGTTGGCTGATAGTGGTTGTAACTCTCATGTTACACCTGATCTGTCTACTCTTGCTCTGAATTCAAACTATAATGGTGAGGATGCTATCACGGTTGCAAATGGGCAAGGTGTCCCTATTACTCAAACAGGTTCTGGCACACTCTCCACCTCCCACAGTGACCTTAATCTGTCGAAAATTCTTTGTGTTCCTGATCTCTCAGCCAATTTGTTATCTGTTTCACAATGTTGCCTTGATAACAACTGCATCTTTGTTTTCGATGCTGATTGGTTTTCCATACAGGACAAAACCTCGGGACGAATTTTATACAAGGGCAAGAGTAAGGATGGGTTATATCCCATATCTTCCATATCTTCGGCTGGTTCTTCTTTAAGAAGTAACTCTACAGTGGCTGGTTTGCATACGTCTGCACCTTTATCTCCTATATGTGCATCTGTTTTTGTTTCTCATGTTCCTAGTACTGTGTTGTGGCATTTACGTCTTGGACATCCCTCTTTTCCTGTTTTGCAAAAGTTGTTGTCTGCAAATTCTATTACTTGTGATACTAAGTATACTTGTCGAGACTGTGTGAGTTGTCTCAAAGGGAAAGCTTCTAAGCTTCCTTTTGCATCCTCTACTTCCATTACTACTAGGCCTCTTGCCCTGTTGCATAGTGATGTGTGGGGACCATCACCTTTAATCTCTGTTTCGGGCTTTCGATACTATGTCAATTTTGTTGATGACTTCAGCAAGTTTACTTGGATATTTCCTTTAGTACGAAAATCTGATGTTTCTTCTGTTATAAAGCAGTTTGTTCCTTTTATTGAAAATCAGTTATCTTGCTCTTTACAAGTATTTCGATCAGATGGAGGGGGTGAGTTCGTTAACCA
TTATGTGTATGAGTTCTTCTCATCCAAAGGTGTTCTACATCAGAGGAGTTGCCCTCATACACCTGAACAGAATGGGGCTGCTGAGCAAAAGCATCGATCTATTGTTGACACTGCTATTGCTCTTATGAATCATGCATCCGTTCCTTTAGAATTTTGGTATCATGCTTTTGCTACTGTTGTGTTTTTGTTGAATCGTCTGCCTTCTAGTGCTATTGGCTTTATGACTCCGTTTCAGAAGTTATATGGTTGTGTGCCTGATTTATCTCATTTGCGTGTGTTTGGTTGTGCCTGTTATCCACTGTTAAAGCCTTACAACACTCATAAGTTACAACCAAAAACTGCTCAGCATGTTTTCTTGGGCTATACCTTGGAATACAAAGGGTATCTATGTTATAATATGGAAACCAAGAAGATGTTGGTCTCTCGGCATGTTGTATTTCATGAAGATGTGTTCCCATTTGCGATTAGACCTGTCACACGGTCTCACAGTCACAGCCACACCCTCCCTTCACAGCCACAACCAAATCCTGCTGTTGTAACTTCGTTGTTGTCTTCACAATCAAGTCTTGTTGTTCGTGTACTACCAAATAATTCTAGTGTTGGTGATGTTTTATTATCACCAATTAACACACCATCTACAAGTGATGTTCAACCATCACCTGTCTTGCCTACTACTGATGTTTTACCATCTTCAGCTTCTCATTCTTCTGTTGTTGTGGCATCAGCAAACAGTAGTCCTAGTCAGTCTGCTGCTATATCAGCAAACAATCAGTCTGATCCACCTGCAGCTGTTACAATCAATAGTCACCCTATGATTACTCGCTCAAAAGCCGGTATTTCCAAACGGAAGGCTTTTAGTGCTGTTATTCCAAAGTCTATACCTGAACCAACTTCGTTCACAGCTGCTTCTAAAATCCCTGAGTGGAAACGTGCTATGTTGGATGAGTATACTGCCCTAACAAATCAGAATACTTGGACTCTTGTTCCATCTCAAGAAGACATGAATATTGTTGGTTGTAAGTGGGTCTTTCGCACTAAATTTAATCCAGATGGGTCAGTTGCTCGTTACAAAGCCCGTTTGGTTGCTAAAGGCTACAATCAACGTGAGGGTGTTGACTTCGATGAAACCTTTAGTCCTGTAGTCAAGAAAACTACTGTGCGAGTTATACTTGCTTTGGCAGCTCATTATGGGTGGTCTTTACATCAGTTAGATGTTAAGAACGCCTTTCTCCATGGCTATCTTAAGGAAGATGTATACATGGTTCAGCCCCCCGGCTTTAGGGATTCAAATTATCCTAATCACACAACTCTTGGTCGGGAGTTTCAACTTACTGATCTCGGTTCTCTTCGTTATTTTATTGGTCTCGAAATTACTCGATTTTCTGATGGTTTTCATGTAACTCAGTTGAAATATCTTACTGATTTACTAGAAAAGACTGGTATGGCAGACTCAAAGACATGTTCAACTCCCATGTCTACTACGTGTGAGCTTCATGGTGTTTCTCCATTTTTTTCTGATGCTTTGCTTTATAGACAGATTGTTGGTTCCTTACAATACTTGACCTTTACTAGACCCGACATTACCTATGTTGTAAGTAAAGTGAGTCAGTTTATGCACAAACCCACTGAAATACATTATTCTGCTGTGAAGCGTATTCTTCGCTATCTTAATGGCACTCGAGATTGTGGAATTCTGTTTTCCAAAGGGAAACTTGAACTCACTGCCTATAGTGATGCTGATTGGGCAGGAGATTCCATGGATCGACGGTCTACTTCTGGATATGTGGTCTTTTTTTGTGGTATTCCGGTTTCTTGGTCTGCTAAAAAGCAAAGCACGGTGTCTCGGTCTTCAACTGAGGCTGAATATCGCTCTCTTGCACAAACGGCTGCTGAACTATATTGGATAAGACAATTACTGTGTGATCTTTGTGTTTTTCTTCCTACAGTTCCTTTATTGTTTTGTGATATTTTCTACGGGAGAAAGTTGTTCGAGGCAATTT
TGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCATGGCCGAGGCCGACCCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTCAGGTGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAATCTGCACATCGGTTATTCCATGAGTAAGGGTATGGAAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGCCGAGGCCGAGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAGATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCGCACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAGGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGTCGAGTCCGAGGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATATAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCTGA

Coding sequence (CDS)

ATGGCTTCTTCTTCATCTAGCGTCTCTGATAACTCTTCTGGCTGTAGTACTCAGCCAAATTCGTCAATTTTTCTTCTATCAAATATATGCAATCTCGTCCCTGTTCGCCTCGATTCCTCAAATTTCTTGTTCTGGAAGTTTCAAGTTCAGTCAATGCTCAAGGCTCATTCCCTGTTTGGAATCGTTGATGGATCTGTGCCGTGTCCGCCTGAGTTTTTGCTTGATGCTGACGGAAGGAAGACTACTGAAGTTAACTCTGCACATACTCAGTGGATTGCTCAAGATAGCGCTCTAATCACGTTGATTAACGCTACCCTATCGAAGACTGCTTATTCATATGTCATTGGATGTAAATCCTCCAAGGAGGTATGGCTTTCTCTTGAAAAACAATTTTCTTCTTTAACGAGGTCTCATGTTCATGAACTCAAGTCATCGCTTCATACAATCTCTAAATCTGCCACTGAATCTATTGATGAATACCTGCTTCGAGTGAAAGAATTAGTAGATAAGCTTGCCACTGTCTCAGTACCAATTGATGATGAAGATCTGTTGTTGTACACCTTAAATGGTTTGTCGTCGGAGTACAATTCATTTCGTACCTCTATTCGAACTCGAGAAGGAACTTTAAAATTACAAGAACTTCATGCCTTGTTGAAGTCAGAGGCCAAGATTCTCGAGCAGCAAAACAAGTCCATCACTACCACTCCTCTTGTTCCGACTGCGATGTTTTCGTCTGTTTCAGGCCAGTTCAATTCCAATCGAGGACGCGGTAGAGGCAGAAATTCAAATCCGAGTTACTGGTCAGGTCGCGGAGGTGGACGATCGAATCAGGGATACTATGGGAATTCAAATCAAGGTCGTGGTGGTTTCAATTCCTCTAATCCATCCCCTGGTTTTCCTCAGGGACATTCTGCTCCAAATCAGGGACGTGGTGCTAATTCTGGATCTAATTCGAGTTCTGGACGTGGAGTTATATGTCAAATTTGCAATCGGTCTGGCCATGGAGCCTTAGATTGTTACAATCGTCTGAATCTTTCGTTCCAGGGAAGACATCCTCCATCAAAGCTTGCTGCAATGGCCTCTATCTATGATCCTCAGTCGTCTAATAATAGTGGCAACAACAATTGTACTTGGTTGGCTGATAGTGGTTGTAACTCTCATGTTACACCTGATCTGTCTACTCTTGCTCTGAATTCAAACTATAATGGTGAGGATGCTATCACGGTTGCAAATGGGCAAGGTGTCCCTATTACTCAAACAGGTTCTGGCACACTCTCCACCTCCCACAGTGACCTTAATCTGTCGAAAATTCTTTGTGTTCCTGATCTCTCAGCCAATTTGTTATCTGTTTCACAATGTTGCCTTGATAACAACTGCATCTTTGTTTTCGATGCTGATTGGTTTTCCATACAGGACAAAACCTCGGGACGAATTTTATACAAGGGCAAGAGTAAGGATGGGTTATATCCCATATCTTCCATATCTTCGGCTGGTTCTTCTTTAAGAAGTAACTCTACAGTGGCTGGTTTGCATACGTCTGCACCTTTATCTCCTATATGTGCATCTGTTTTTGTTTCTCATGTTCCTAGTACTGTGTTGTGGCATTTACGTCTTGGACATCCCTCTTTTCCTGTTTTGCAAAAGTTGTTGTCTGCAAATTCTATTACTTGTGATACTAAGTATACTTGTCGAGACTGTGTGAGTTGTCTCAAAGGGAAAGCTTCTAAGCTTCCTTTTGCATCCTCTACTTCCATTACTACTAGGCCTCTTGCCCTGTTGCATAGTGATGTGTGGGGACCATCACCTTTAATCTCTGTTTCGGGCTTTCGATACTATGTCAATTTTGTTGATGACTTCAGCAAGTTTACTTGGATATTTCCTTTAGTACGAAAATCTGATGTTTCTTCTGTTATAAAGCAGTTTGTTCCTTTTATTGAAAATCAGTTATCTTGCTCTTTACAAGTATTTCGATCAGATGGAGGGGGTGAGTTCGTTAACCA
TTATGTGTATGAGTTCTTCTCATCCAAAGGTGTTCTACATCAGAGGAGTTGCCCTCATACACCTGAACAGAATGGGGCTGCTGAGCAAAAGCATCGATCTATTGTTGACACTGCTATTGCTCTTATGAATCATGCATCCGTTCCTTTAGAATTTTGGTATCATGCTTTTGCTACTGTTGTGTTTTTGTTGAATCGTCTGCCTTCTAGTGCTATTGGCTTTATGACTCCGTTTCAGAAGTTATATGGTTGTGTGCCTGATTTATCTCATTTGCGTGTGTTTGGTTGTGCCTGTTATCCACTGTTAAAGCCTTACAACACTCATAAGTTACAACCAAAAACTGCTCAGCATGTTTTCTTGGGCTATACCTTGGAATACAAAGGGTATCTATGTTATAATATGGAAACCAAGAAGATGTTGGTCTCTCGGCATGTTGTATTTCATGAAGATGTGTTCCCATTTGCGATTAGACCTGTCACACGGTCTCACAGTCACAGCCACACCCTCCCTTCACAGCCACAACCAAATCCTGCTGTTGTAACTTCGTTGTTGTCTTCACAATCAAGTCTTGTTGTTCGTGTACTACCAAATAATTCTAGTGTTGGTGATGTTTTATTATCACCAATTAACACACCATCTACAAGTGATGTTCAACCATCACCTGTCTTGCCTACTACTGATGTTTTACCATCTTCAGCTTCTCATTCTTCTGTTGTTGTGGCATCAGCAAACAGTAGTCCTAGTCAGTCTGCTGCTATATCAGCAAACAATCAGTCTGATCCACCTGCAGCTGTTACAATCAATAGTCACCCTATGATTACTCGCTCAAAAGCCGGTATTTCCAAACGGAAGGCTTTTAGTGCTGTTATTCCAAAGTCTATACCTGAACCAACTTCGTTCACAGCTGCTTCTAAAATCCCTGAGTGGAAACGTGCTATGTTGGATGAGTATACTGCCCTAACAAATCAGAATACTTGGACTCTTGTTCCATCTCAAGAAGACATGAATATTGTTGGTTGTAAGTGGGTCTTTCGCACTAAATTTAATCCAGATGGGTCAGTTGCTCGTTACAAAGCCCGTTTGGTTGCTAAAGGCTACAATCAACGTGAGGGTGTTGACTTCGATGAAACCTTTAGTCCTGTAGTCAAGAAAACTACTGTGCGAGTTATACTTGCTTTGGCAGCTCATTATGGGTGGTCTTTACATCAGTTAGATGTTAAGAACGCCTTTCTCCATGGCTATCTTAAGGAAGATGTATACATGGTTCAGCCCCCCGGCTTTAGGGATTCAAATTATCCTAATCACACAACTCTTGGTCGGGAGTTTCAACTTACTGATCTCGGTTCTCTTCGTTATTTTATTGGTCTCGAAATTACTCGATTTTCTGATGGTTTTCATGTAACTCAGTTGAAATATCTTACTGATTTACTAGAAAAGACTGGTATGGCAGACTCAAAGACATGTTCAACTCCCATGTCTACTACGTGTGAGCTTCATGGTGTTTCTCCATTTTTTTCTGATGCTTTGCTTTATAGACAGATTGTTGGTTCCTTACAATACTTGACCTTTACTAGACCCGACATTACCTATGTTGTAAGTAAAGTGAGTCAGTTTATGCACAAACCCACTGAAATACATTATTCTGCTGTGAAGCGTATTCTTCGCTATCTTAATGGCACTCGAGATTGTGGAATTCTGTTTTCCAAAGGGAAACTTGAACTCACTGCCTATAGTGATGCTGATTGGGCAGGAGATTCCATGGATCGACGGTCTACTTCTGGATATGTGGTCTTTTTTTGTGGTATTCCGGTTTCTTGGTCTGCTAAAAAGCAAAGCACGGTGTCTCGGTCTTCAACTGAGGCTGAATATCGCTCTCTTGCACAAACGGCTGCTGAACTATATTGGATAAGACAATTACTGTGTGATCTTTGTGTTTTTCTTCCTACAGTTCCTTTATTGTTTTGTGATATTTTCTACGGGAGAAAGTTGTTCGAGGCAATTT
TGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACCAAGGAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCATGGCCGAGGCCGACCCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAACAGGTCAGGTGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAATCTGCACATCGGTTATTCCATGAGTAAGGGTATGGAAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGCCGAGGCCGAGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAGATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCGCACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAGGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGTCGAGTCCGAGGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATATAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCTGA

Protein sequence

MASSSSSVSDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHTTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLPTVPLLFCDIFYGRKLFEAILDHPDTQGADEDNRGEIGLKDGPRRQNRQMGRAKTEGVGFSARPPARPWPRPTLGPLVRAESVWSRLVPTASGCPGFAWFDLKRLRNPKKARRMNRSGEILWPKSTINSWRRLWGRKPVNLHIGYSMSKGMEKKDQDVNIENSDGDRHQRRPRPSRGRAEDADTKIAALEDEVKGMNRSLSKILQILDKPGPSTKVHEGSLIRDPRKGKEPMEHTAESGTRSRGKKTDSMTSKVRGLKPTDRTILRSPESSTLKGRHYTVSTPSFGHTKTDLRNLIVEKRRSAKTVESEAKAAEAEARAAEAEARAAEAEARLAEAEAKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRVSPSIPCTVHGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRTDEGGRGRADHGRG
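
As a quick consistency check between the coding sequence and the protein sequence above, the first codons of the CDS can be translated by hand. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used, and `translate` is an illustrative helper.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 42 nucleotides of the CDS listed above.
cds_prefix = "ATGGCTTCTTCTTCATCTAGCGTCTCTGATAACTCTTCTGGC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The translated prefix matches the first 14 residues of the protein sequence (MASSSSSVSDNSSG).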
Homology
BLAST of Lag0030971 vs. NCBI nr
Match: TQE01264.1 (hypothetical protein C1H46_013171 [Malus baccata])

HSP 1 Score: 966.5 bits (2497), Expect = 3.4e-277
Identity = 586/1358 (43.15%), Positives = 785/1358 (57.81%), Query Frame = 0

Query: 95   DSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSAT 154
            D A++ LI ATLS TA S VIGC SS ++W+SL+ +FS++T++ + +LK+ L  I K   
Sbjct: 3    DRAVMQLIIATLSPTAMSCVIGCLSSNDMWISLKDRFSTVTKASIFQLKTELQNI-KKGN 62

Query: 155  ESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQEL 214
            +S+  YL R+K++ D L+   V  +D+D+++  L GL SEYN+F+T IR RE  + L+E 
Sbjct: 63   DSVSTYLQRIKDVRDHLSAAGVIFEDDDIVILALKGLPSEYNTFKTVIRGRENVISLKEF 122

Query: 215  HALLKSEAKILEQQNKSITTTPLVPTAMFSSVSG-------QFNSNRGRGRGR----NSN 274
             + L +E   +E  N SI+ +  V   +  + SG       Q  S+ G  + +     S 
Sbjct: 123  RSQLLAEEATVE--NNSISES-FVTAMVAQNTSGKGKALMLQEGSSSGSSQSQVYTGGST 182

Query: 275  PSYWSG--------------RGGGRSNQGYYGNSNQGRGGFNSSNP--SPGFPQGHSAPN 334
            PS +SG              RG GR    ++ NS    G   SS     P  P   + P 
Sbjct: 183  PSGYSGPSSHYNGGYSFNGFRGRGRGRNNFHSNSKYNTGPSTSSAGILGPAKPHISTCPE 242

Query: 335  QG-----------RG-------ANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQG 394
             G           RG           S S+    V CQIC + GH A+ CY+R N ++QG
Sbjct: 243  HGYEVPTCQICNKRGHIAADCFQRHSSPSAPSSKVQCQICWKYGHIAVQCYHRSNFTYQG 302

Query: 395  RHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITV 454
            R PPS L+AM + + P +          W+AD+G  SH+T DLS L L + ++G D +T 
Sbjct: 303  RSPPSTLSAMNTTFSPSAPQEQ-----FWVADTGATSHMTSDLSNLNLAAPFSGTDTVTT 362

Query: 455  ANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFS 514
            A+G G+PI+  GS TL T      L  IL VP LS +LLS+ Q C DN C F+ D   F 
Sbjct: 363  ASGSGLPISHIGSTTLHTPQYAFELKNILHVPQLSQHLLSIYQLCKDNKCRFICDEFCFW 422

Query: 515  IQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVP 574
            IQDK +GRI+ +G  ++GLYPI         +++       H ++     C   ++    
Sbjct: 423  IQDKITGRIILQGLCREGLYPIPFHIPQHLLIQAQK-----HNASFKQQTC---YLGSQV 482

Query: 575  STVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRP 634
               LWH RLGHPS  V   +L  + I     +    C SCL+GK +KLPF    + T  P
Sbjct: 483  KINLWHQRLGHPSNIVTSAMLKQSHIPMSLDHVSSICTSCLEGKFAKLPFLFPANKTVHP 542

Query: 635  LALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQL 694
            L ++HSDVWGPS  +S+ G+++YV+FVD+ ++FTWIFPL+ KS+V  V   F  F+  Q 
Sbjct: 543  LEVIHSDVWGPSSTVSIEGYKFYVSFVDECTRFTWIFPLMNKSEVFDVFVHFHSFLITQF 602

Query: 695  SCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALM 754
            S +++VF+SDGGGE+ +H   ++   KG+LHQ+SCP+TP+QNG AE+KHR I++TAI L+
Sbjct: 603  SATVKVFQSDGGGEYSSHKFKQYLLQKGILHQKSCPYTPQQNGLAERKHRHILETAITLL 662

Query: 755  NHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLK 814
              AS+P + W+HA A  V+L+NR+    +   +PFQ L+G  P +SHL+VFGCAC+PLLK
Sbjct: 663  QTASLPHKLWFHACAISVYLINRMACQTLQMSSPFQCLFGTSPSISHLKVFGCACFPLLK 722

Query: 815  PYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSH 874
              N+ KLQPKT+Q +F+GY  +YKGYLC N  T K+ VSRHV+F E  FP++   +T + 
Sbjct: 723  QLNSSKLQPKTSQCIFIGYAGQYKGYLCLNPLTNKIYVSRHVLFDETTFPYS-SIITSNS 782

Query: 875  SHSHTLPSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQPSPVL 934
            + SH         P V+ SL++S ++ VV  +P  +S      SP   P+ S+   S  L
Sbjct: 783  ASSHISSPSLHVQPQVLPSLVNSHNT-VVSPIPLQASE----CSPSTPPNASECSLSTPL 842

Query: 935  PTTDVLPSSASHSSVVVASANSS---PSQSAAISANNQSDPPAAVTINSHPMITRSKAGI 994
            P +  + S  S  S     A+S+   P         + S       ++ HPM TRSK+GI
Sbjct: 843  PNSRFVDSPTSLPSPTALPADSTQSLPPDDPDFQPEDLSVVLPVPLVSWHPMQTRSKSGI 902

Query: 995  SKRKAFSAVIPKS--IPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIV 1054
            SK+K FSA +  S  + EP +F +A KIPEW  AM DE TAL +QNTW+LVP     N+V
Sbjct: 903  SKKKVFSAKLQSSVQVSEPATFKSAIKIPEWVAAMEDEITALKSQNTWSLVPLPSTKNLV 962

Query: 1055 GCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYG 1114
            GCKWV+R K NPDGSVARYKARLVAKGY+Q EGVD+ ETFSPVVK TTVR+ILALAA + 
Sbjct: 963  GCKWVYRIKTNPDGSVARYKARLVAKGYSQEEGVDYCETFSPVVKPTTVRLILALAAQFQ 1022

Query: 1115 WSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN------------------------ 1174
            WSL QLDVKNAFLHG L E+VYM QP GF  S +P+                        
Sbjct: 1023 WSLRQLDVKNAFLHGDLHEEVYMSQPQGFESSVHPSNYVCRLHKSLYGLKQAPRAWNEKF 1082

Query: 1175 ---------------------HTTLG-----------------------------REFQL 1234
                                 HT+LG                             +EF L
Sbjct: 1083 TSFLPGLGFKASLADPSLFVQHTSLGTVVLLLYVDDIILTGSSSQLIDGVIQALAKEFDL 1142

Query: 1235 TDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVS-- 1294
             DLG L YF+GL+IT    G  V+Q KY+ DLLEK  + DSK C+TP      L      
Sbjct: 1143 KDLGQLHYFLGLQITYQPQGLFVSQTKYIKDLLEKVDLQDSKPCNTPCLPYHRLSKTEGI 1202

Query: 1295 PFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRD 1327
            P+ S    YR IVG+LQYLTFTRPDI + V++  QFMH P + H  AVK ILRYL+GT  
Sbjct: 1203 PYHSPH-QYRSIVGALQYLTFTRPDIAFSVNQCCQFMHHPMDSHVVAVKHILRYLSGTLH 1262
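
The Identity and Positives percentages in the HSP header for this match are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using the figures reported above (`percent` is an illustrative helper, not a BLAST function):

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

aligned_length = 1358                       # alignment columns, gaps included
identity = percent(586, aligned_length)     # identical residues
positives = percent(785, aligned_length)    # identical or conservatively substituted
print(identity, positives)
```

Both values agree with the percentages printed in the header (43.15% and 57.81%).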

BLAST of Lag0030971 vs. NCBI nr
Match: TQD93593.1 (hypothetical protein C1H46_020801 [Malus baccata])

HSP 1 Score: 943.7 bits (2438), Expect = 2.4e-270
Identity = 590/1515 (38.94%), Positives = 815/1515 (53.80%), Query Frame = 0

Query: 25   LLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEV 84
            L+ ++ N V V+LD SN++ W FQ+  +L+ + +FG VDGS+ CP ++  D+D  + T  
Sbjct: 17   LIPSVGNTVTVKLDDSNYVTWNFQMGLLLEGNGIFGFVDGSISCPDKY-QDSDSEEETVT 76

Query: 85   NSAHTQ-----WIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHV 144
            NS H       W   D AL+TLI ATLS  A S VIGC+SS+++W +L+++FS++TR+ +
Sbjct: 77   NSHHITDDYKVWKIHDKALMTLITATLSTAALSCVIGCQSSQDMWNNLKERFSNMTRTSI 136

Query: 145  HELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFR 204
             ++K  L  I K  +ESID YL R+K+  D+LA V V I DED+++  L GL  E+N+ +
Sbjct: 137  VQMKIDLQNIRK-GSESIDLYLQRIKDCRDQLAAVGVFISDEDIVIVALKGLPHEFNTIK 196

Query: 205  TSIRTREGTLKLQELHALLKSEAKILEQQNKS---------------------------- 264
              IR RE  + L+EL + LK+E   L++  K                             
Sbjct: 197  AVIRGRENLVSLKELRSQLKAEEATLDEVIKQAPIMSAMYASNSVYDVGGSSGGAAHNAS 256

Query: 265  ----------ITTTPL-----VPTAMFSSVSG-QFNSNRGRGRGRNSNPSYWSGRGGGRS 324
                      I++TP+      P  MF   S   F S  G G   N   + +  +G G+ 
Sbjct: 257  QHSVNGSPFPISSTPVFQQMPFPNQMFQMNSPLAFVSQGGSGTYNNFRGNNFKPKGKGKK 316

Query: 325  NQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSN----------SSSGRGVI 384
               ++    Q    F   + S  F QG S  +      +                G G +
Sbjct: 317  FYNHFQQPQQTGASFPQQSSSTNFHQGSSFQSLHSPLPTEQGYQPLQICQICDRKGHGAL 376

Query: 385  ------CQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQ------------- 444
                  CQICNR GH A  C++R N  F    PP + ++ +  + PQ             
Sbjct: 377  HCFQKGCQICNRKGHTAATCFDR-NSGFSSMVPP-QFSSSSHGFSPQQFQPSPQATFPAP 436

Query: 445  --------------SSNNSGNNNCT-----------------WLADSGCNSHVTPDLSTL 504
                          +  NS N++                   WL D G   H+T DLS +
Sbjct: 437  FPHAHVVHPPQFHPAMKNSQNHSPVAMTAQTTSSPSAPQQEYWLLDLGATHHMTSDLSNI 496

Query: 505  ALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCL 564
             + + Y+  D +T ANG+G+ I   G   L      L L  +L VP LS +LLS+ Q C 
Sbjct: 497  HMAAPYSSSDTVTGANGEGLHIAHIGHSNLPLQSHTLCLKSVLHVPQLSQHLLSMHQLCK 556

Query: 565  DNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAP 624
            DNNC  + D     IQDK +  ILY+G S + +YP+  + S                   
Sbjct: 557  DNNCRCIVDESSVCIQDKVTQEILYRGLSNNAVYPLPVMKS------------------- 616

Query: 625  LSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI---TCDTKYTCRDCVSCLKG 684
             SP+  + ++    ++ LWH RLGHP+  V++  LS   I     D+ YTC+   +CL+G
Sbjct: 617  -SPVSPAAYIRQRINSALWHCRLGHPASSVVKAALSKADIPFKCLDSFYTCK---ACLQG 676

Query: 685  KASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKS 744
            K + LPF S  S +  P  ++H+DVWGPSP +S+ G+RYYV+F+D+ +++TWIFP++ K+
Sbjct: 677  KFANLPFPSLASKSVIPFEVIHTDVWGPSPSLSIEGYRYYVSFIDECTRYTWIFPIMNKA 736

Query: 745  DVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNG 804
             V  +  QF  F+ N  +  +++ +SDGGGE++  +   F   KG+LH +SCP+TP+QNG
Sbjct: 737  AVFGLFVQFQAFVHNFFNVHIRILQSDGGGEYIGIHFQNFLKDKGILHHKSCPYTPQQNG 796

Query: 805  AAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVP 864
              E+K+R I +TAI L+  A +P +FWYHA AT V+L+NR+P+  +   +PF+ LY   P
Sbjct: 797  LVERKNRHITETAITLLQQACLPPQFWYHACATAVYLINRMPTLVLSMKSPFEVLYHSSP 856

Query: 865  DLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVV 924
             L HL++FGCACYP LKPY +HKL PKT++ +FLGY  +YKG++C+N +  K++VSRHV+
Sbjct: 857  KLDHLKIFGCACYPSLKPYRSHKLAPKTSECIFLGYAAQYKGFICFNPKDNKLIVSRHVL 916

Query: 925  FHEDVFP---FAIRPVTRSHSHS----------HTLPSQPQPNPAVVTSLLSSQ-SSLVV 984
            F E  FP    A R V+ S   S          H  P    P P V +   SSQ S  V 
Sbjct: 917  FDERHFPAPLMARRFVSGSKVASMSSIVPSNTFHHPPLASIPVPQVFSRDSSSQHSPPVC 976

Query: 985  RVLPN--------NSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASAN 1044
               P+        +S  GD+ + P+++ + SD   S + P T        H S V     
Sbjct: 977  SSAPSEFVYSGTISSIPGDISMGPLSSAALSDQARSELHPNT--------HLSQV----- 1036

Query: 1045 SSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRK-AFSAVIPKSIPEPTSFTAA 1104
             S  Q  +++A           +NSHPM TRSK+GI K+K AFS  +     EP+S++AA
Sbjct: 1037 -SELQQVSVAA-----------VNSHPMQTRSKSGIFKKKQAFSVDVQAGAKEPSSYSAA 1096

Query: 1105 SKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVA 1164
             K+ EW+ AM DE  AL  Q TWTLVP   D N+VGCKW+++ K +PDG+VARYKARLVA
Sbjct: 1097 CKLSEWRGAMQDEMDALFQQKTWTLVPLPPDKNLVGCKWIYKIKKHPDGTVARYKARLVA 1156

Query: 1165 KGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQ 1224
            KG++Q  G+D+ ETFSPVVK TTVR++L+LAA  GW L+QLDVKNAFLHG+L E+VYM Q
Sbjct: 1157 KGFSQEAGLDYYETFSPVVKPTTVRLLLSLAASNGWQLNQLDVKNAFLHGFLDEEVYMAQ 1216

Query: 1225 PPGFRDSNYPNHT--------------------------TLG------------------ 1284
            P GF D  +P H                           TLG                  
Sbjct: 1217 PQGFVDPLHPTHVCKLQRSLYGLKQAPRAWNERFTKFLLTLGFKSSYADPSLFVKYENQS 1276

Query: 1285 -----------------------------REFQLTDLGSLRYFIGLEITRFSDGFHVTQL 1330
                                          EF + +LG L YF+GL+I   S G  V Q 
Sbjct: 1277 IVVLLLYVDDIILTGNSVAGVHSVIQQLTAEFDMKNLGLLHYFLGLQIEYRSSGLFVHQS 1336

BLAST of Lag0030971 vs. NCBI nr
Match: BBG97282.1 (hypothetical protein Prudu_006352 [Prunus dulcis])

HSP 1 Score: 918.7 bits (2373), Expect = 8.2e-263
Identity = 580/1457 (39.81%), Positives = 815/1457 (55.94%), Query Frame = 0

Query: 21   SSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEF-LLDADGR 80
            +S   L  +  ++ +RL   N+L W++Q++S+L+ H LFG  DGS+  PP++ +LD +G 
Sbjct: 3    TSTLKLDGLLGMLTIRLTDGNYLKWRYQIESVLEGHDLFGHFDGSIVAPPKYAILDEEG- 62

Query: 81   KTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHV 140
             T+ + +A+  W+  D AL++L+ ATLS  A  YVIG K++ E W++L  ++++++R+ V
Sbjct: 63   VTSVITAAYKDWLKVDKALLSLLIATLSDEAIEYVIGSKTASEAWMNLTDRYATVSRARV 122

Query: 141  HELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFR 200
            + LK+ L T  K A +SI+++LLR+K + D+LA   + + D+DL+L  LNGL SEY+  +
Sbjct: 123  NLLKTELQTAQKGA-DSIEKFLLRLKHVRDQLAVAGISVSDDDLMLAVLNGLPSEYDMIK 182

Query: 201  TSIRTREGTLKLQEL--HALLKSEAKILEQQNKSITTTPLVPTAMFSSVS-GQFNSNRG- 260
            T +  R+ +L  ++   H L   +A         +  +P+V     SS S G  +S    
Sbjct: 183  TVLLARDTSLSFKDFRNHLLAAEQA---ADSRVILPHSPMVGMLSHSSPSTGSTSSTPSP 242

Query: 261  --RGRGRNSNPSYWSGRGGGRSNQGYYGNSNQ---GRGGFNSSNP-SPGFPQGHSAPNQG 320
               G G    PS  S    G  +   +G S+     RG F+ S P   GFP     P + 
Sbjct: 243  NLSGAGILPTPSVTSFSPTGYMSSYTHGRSSSSFGSRGRFSGSRPFGRGFPNKFQGPPKS 302

Query: 321  -----------RGANSG------SNSSSGRGVI-CQICNRSGHGALDCYNRLNLSFQGRH 380
                       RG  +       S SS G  VI CQIC + GHGALDCY+R N ++QG  
Sbjct: 303  GIVPECQICSKRGHTAANCFFRDSTSSQGSSVIECQICGKKGHGALDCYHRSNYAYQGSP 362

Query: 381  PPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVAN 440
            PPS L AMA       +  S + +  W+ADSG + H+ P ++T+   +     + + V N
Sbjct: 363  PPSSLTAMA-------AQASFSPDAVWIADSGASHHMVPHMTTMHNVTPCTSAENVVVGN 422

Query: 441  GQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQ 500
            G+G+ I   G   + T  S L LSK+L VP L+ANLLSV Q C DNNC  +FD   F IQ
Sbjct: 423  GEGLHIAHIGKSHIPTVSSSLTLSKVLHVPQLTANLLSVYQLCHDNNCRMIFDTSGFLIQ 482

Query: 501  DKTSGRILYKGKSKDGLYPI-SSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPS 560
            DK + + L +GKS+ GLYP+ +S+SS G+   S+S    + TS+  S    S F+     
Sbjct: 483  DKVTNKKLLQGKSEHGLYPVPTSLSSIGTGSSSSS----VGTSSRASQSQTSAFLGQKVR 542

Query: 561  TVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPL 620
            + LWH RLGHP+  V+Q +L+A  I   +    + C  CL GK  +LPF+S  +  + P 
Sbjct: 543  SSLWHSRLGHPTNEVVQLMLTAAQIPVVSDSISKLCSFCLDGKMHRLPFSSHHNKASSPF 602

Query: 621  ALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLS 680
              LHSDVWGPSP  S+SG+RY V+F+D+++ F W++PL  KS+V ++  + V F+    +
Sbjct: 603  YRLHSDVWGPSPCKSISGYRYVVSFIDEYTGFLWLYPLYAKSEVFTMFTRLVAFLTTHFN 662

Query: 681  CSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMN 740
             S++  +SDGGGE+++    EF +S G++HQ SCP TP+QNG AE+K+R +++T+I L+ 
Sbjct: 663  ASIKFLQSDGGGEYMSTQFNEFLASHGIVHQVSCPSTPQQNGLAERKNRHLLETSITLLQ 722

Query: 741  HASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKP 800
             AS+P +FW+HA A   +L+NR+PS  +   +P+ +L    PD+ HLRVFG A YP L+ 
Sbjct: 723  EASMPDQFWFHAMAHSAYLINRMPSKVLSNQSPYYRLLHRHPDIRHLRVFGTAVYPCLRA 782

Query: 801  YNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHS 860
             NT KLQP+T   VF+GY L YKG LCYN  T K LVSRHV+  E +FPF          
Sbjct: 783  TNTTKLQPRTVMCVFMGYLLGYKGVLCYNCSTSKFLVSRHVIHDETIFPF---------K 842

Query: 861  HSHTLPSQP-------------QPNPAVVTSLLSSQSS----------------LVVRVL 920
            H  +LPS P              P  A   S+L+S SS                +V    
Sbjct: 843  HRFSLPSSPCSPSTVLSPIPMMLPTSATSPSVLASHSSSSAPSITAPRFVSLSCVVSARA 902

Query: 921  PNNSSVG--DVLLSP------INTPSTSDVQPSPVLPTTDVLPSSASHS---SVVVASAN 980
               SSVG  D+L SP         P  S+ Q   +LP+ D + SS+  S   SV V SA 
Sbjct: 903  LYTSSVGQQDLLASPGLSQSEPYIPVLSEQQLQVLLPSIDYITSSSPASVQPSVPVGSAP 962

Query: 981  SSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVI-------PKSIPEP 1040
              P                     SH M+TRSK G+ ++K FS  +         S+ EP
Sbjct: 963  PGP---------------------SHSMLTRSKTGVVQKKDFSDYMCYTSIHDTTSLDEP 1022

Query: 1041 TSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARY 1100
            +S+  AS   +W +AM +E +AL  Q TW LVP   + NIVG KW+++ K + DGS++RY
Sbjct: 1023 SSYHLASFSADWTKAMDEEISALQMQGTWVLVPPPANTNIVGSKWIYKLKRHSDGSISRY 1082

Query: 1101 KARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKE 1160
            KARLVA+G++Q  G D++ETFSPVV+  TVR+IL+LAA   WSL QLDVKNAFLHG L+E
Sbjct: 1083 KARLVAQGFSQEAGFDYEETFSPVVRHATVRIILSLAASNHWSLRQLDVKNAFLHGELEE 1142

Query: 1161 DVYMVQPPGFRDSNYPNHT----------------------------------------- 1220
            +VYM QP GF D ++P++                                          
Sbjct: 1143 EVYMKQPQGFEDPHHPDYVCKLQKSLYGLKQAPRAWNAKFTGFLPALGFKMSHSDPSLFV 1202

Query: 1221 --------------------------------TLGREFQLTDLGSLRYFIGLEITRFSDG 1280
                                             LG  F+L D+G L YF+GL+I+  S+G
Sbjct: 1203 KYSDSAIVVLLLYVDDIILTGSNPQVIQEVIIELGSVFELKDMGILTYFLGLQISCKSNG 1262

Query: 1281 -FHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCEL---HGVSPFFSDALLYRQIVGSLQY 1323
               V+Q KY TDLL K+GM+  K C TP+    ++    G+     D   YR IVG+LQY
Sbjct: 1263 DIFVSQQKYATDLLAKSGMSSCKPCPTPLKPHTQILLTDGIP--LKDPKQYRSIVGALQY 1322

BLAST of Lag0030971 vs. NCBI nr
Match: WP_081894301.1 (DDE-type integrase/transposase/recombinase [Acetobacter malorum] >KFL89552.1 hypothetical protein AmDm5_1575 [Acetobacter malorum])

HSP 1 Score: 903.3 bits (2333), Expect = 3.6e-258
Identity = 566/1395 (40.57%), Positives = 768/1395 (55.05%), Query Frame = 0

Query: 25   LLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEV 84
            L+ N+   V V+LD +N+L W +Q++ +L++H + G VDGS  CP  F+ + D       
Sbjct: 17   LVPNVSTSVTVKLDDTNYLVWHYQLRLLLESHGILGFVDGSKLCPSRFVDEPDKEGVETE 76

Query: 85   NSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKS 144
            N  +  W   D AL+ LI  TLS TA S +IGC S+ E+W++L  +FS++T++ + ++K 
Sbjct: 77   N--YQIWKLHDRALMQLIIDTLSPTAMSCIIGCTSAHEIWINLRDRFSTVTKASIFQMKL 136

Query: 145  SLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRT 204
             L  I K  +ESI +Y  R+K++ D L+   V  DD+D+++  L GL SEYN+FRT IR 
Sbjct: 137  ELQNIQK-GSESISKYFQRIKDVRDHLSAAGVSFDDDDIVILALKGLPSEYNTFRTVIRG 196

Query: 205  REGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNP 264
            RE  + L++  A L +E   +E    S + T    TAM +    Q N ++G+G       
Sbjct: 197  RENVISLKDFRAQLLAEEATIENNQFSGSFT----TAMLA----QGNESKGKGLMLEEGS 256

Query: 265  SYWSGRGGGRSNQGYYGNSNQG--RGGFNSSN---PSPGFPQGHSAPNQGRGAN------ 324
            S+  G     S   +  +SNQG   G +NS+    PS GF   H+   + RG N      
Sbjct: 257  SHSKGFSPPHSGPYHGSSSNQGASSGSYNSNGPPYPSGGFRGFHNNRGRARGRNNSSSNF 316

Query: 325  --SGSN---------------SSSGRGV-ICQICNRS----------------------- 384
              SG+N               S  G GV  CQICN+                        
Sbjct: 317  RFSGNNSPGILGPARPHISTCSDHGNGVPTCQICNKRGHVASDCFQRHSSTNRPSFSLQC 376

Query: 385  ------GHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNS 444
                  GH AL CY+R N S+QGR PPS L  M + Y P +  +       W+AD+G  S
Sbjct: 377  QICWKFGHSALQCYHRANFSYQGRSPPSTLTVMHANYQPSAPLDQ-----FWVADTGATS 436

Query: 445  HVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSAN 504
            H+T DL+ L   + + G D IT A+G G+PI+ TGS  L        L  IL VP +S +
Sbjct: 437  HMTSDLTNLTQATPFLGADTITTASGSGLPISHTGSSFLHVPQYAFQLKDILHVPQISQH 496

Query: 505  LLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNST 564
            LLS+ + C DNNC F+ D   F IQDK +G IL +G  +DGLYPI         +  +  
Sbjct: 497  LLSMYKLCKDNNCRFICDEFCFWIQDKITGTILLQGLCRDGLYPIP------FHIPQHIL 556

Query: 565  VAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDC 624
                HTS  L+    + F+ H  +T LWH RLGHPS  V+  +L+ + I+     +   C
Sbjct: 557  PKASHTSHSLTN-NQTCFLGHHINTSLWHNRLGHPSNAVVSTMLNQSQISFSVDPSKHVC 616

Query: 625  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIF 684
            +SCL+GK +KLPF+     + +P  +LHSDVWGPSP +SV G+++YV F+D+ ++FTWIF
Sbjct: 617  ISCLEGKCTKLPFSFPAHKSVKPFEVLHSDVWGPSPTMSVEGYKFYVLFIDECTRFTWIF 676

Query: 685  PLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPH 744
            PL  KS+V  V   F  FI  Q S S++ F+SDGGGE+ +    +F   KG++H +SCPH
Sbjct: 677  PLRNKSEVFQVFVHFHAFISTQFSTSVKTFQSDGGGEYCSTRFQQFLLDKGIIHHKSCPH 736

Query: 745  TPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQK 804
            TPEQNG AE+KH  IV+TA+ L++ A +P +FW+HA A  V+L+NR+P S +   +P+  
Sbjct: 737  TPEQNGLAERKHMHIVETALTLLSTAQLPPQFWFHACAISVYLINRMPCSTLSMKSPYTC 796

Query: 805  LYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKML 864
            L+     L+HL+VFG +CYPLLKPYNT+KLQPKT Q +FLGY  +YKGY+C+N  + +  
Sbjct: 797  LFAQPSALTHLKVFGYSCYPLLKPYNTNKLQPKTVQCIFLGYAGQYKGYICFNPLSGRFY 856

Query: 865  VSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQP---NPAVVTSLLSSQSSLV------ 924
            VSRHVVF+E  FP+          H    PSQ  P    P  +T L++ Q+ +V      
Sbjct: 857  VSRHVVFYETNFPY---------KHLLVKPSQCSPVFVTPPSITPLVTGQNVVVSPHTSA 916

Query: 925  -------------VRVLPNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSV 984
                            L   +S  + L SPI  P  S    SP LP   +  +S S +S+
Sbjct: 917  SHASLPLSQPTRASEFLHEPTSASEFLASPIRVPEFS--SSSPPLP-DPIYHASESLTSL 976

Query: 985  VVASANSSPSQSAAISANNQSDPPAAV------TINSHPMITRSKAGISKRKAFSAVIP- 1044
                  SSPS  + + A+    P          +++ HPM TRSK+GI K+KAF A I  
Sbjct: 977  SPTVPASSPSTQSPVPADPDFQPENLTIVLPVHSVSLHPMQTRSKSGIIKKKAFVASISS 1036

Query: 1045 --KSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFN 1104
              +S  EP++F AASKI EW+ AM DE  AL  Q+TW LVP     N+VGCKWV+R K N
Sbjct: 1037 VGQSDVEPSTFKAASKIVEWQSAMQDEIDALHAQHTWDLVPLPSGKNLVGCKWVYRVKKN 1096

Query: 1105 PDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNA 1164
            PDGS+ARYKARLVAKGYNQ EG+D+ ETFSPVVK TTVR+ILALAA + WSL QLDVKNA
Sbjct: 1097 PDGSIARYKARLVAKGYNQEEGIDYGETFSPVVKPTTVRLILALAAQFRWSLRQLDVKNA 1156

Query: 1165 FLHGYLKEDVYMVQPPGFRDSNYPN----------------------------------- 1224
            FLHG L E++YM QPPGF   ++P+                                   
Sbjct: 1157 FLHGDLHEEMYMSQPPGFGSPHHPSHFVYKLHKSLYGLKQAPRAWNDKFTSFLPGLGFQA 1216

Query: 1225 ----------HTTLG-----------------------------REFQLTDLGSLRYFIG 1255
                      HT+LG                             + F + DLG L YF+G
Sbjct: 1217 SLADPSLFVKHTSLGIVVLLLYVDDIIITGSSSSDIEIVILALTKAFDMKDLGQLHYFLG 1276

BLAST of Lag0030971 vs. NCBI nr
Match: CCH50966.1 (T4.5 [Malus x robusta])

HSP 1 Score: 889.4 bits (2297), Expect = 5.3e-254
Identity = 548/1428 (38.38%), Positives = 781/1428 (54.69%), Query Frame = 0

Query: 3    SSSSSVSDNSSGCSTQPNSSIFL-----------------LSNICNLVPVRLDSSNFLFW 62
            S+++S SD+ S     PNS   L                 ++N+  +VP +L+  N++ W
Sbjct: 227  SAAASSSDSPSPPPANPNSPANLPPSVLSNPPIMTLGTISITNVAGMVPTKLNRQNYITW 286

Query: 63   KFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINAT 122
            +     +LK   L G+V+G   CPP F+ D  G  T   N++   W  +D  L+  IN+T
Sbjct: 287  RSLFIPVLKRFKLIGLVNGEDLCPPPFVRDPSG--TCVPNASFETWCERDQILMIWINST 346

Query: 123  LSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVK 182
            LSK      IG + S+ +W SLE++FS  +R+HVH L+S + TI K    S+ ++L  +K
Sbjct: 347  LSKDLLPLTIGMEDSRSLWQSLERRFSGASRTHVHSLRSKIQTIHK-GDSSMTDFLNSIK 406

Query: 183  ELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKIL 242
            E+ +KLA    P+ + DL+ Y L+GL  EY SF  SI TR  ++   ELH LL S+   L
Sbjct: 407  EISNKLAAAGEPLSESDLVAYILSGLPDEYESFVDSIETRNESVTADELHGLLLSKEISL 466

Query: 243  EQQNKSITTTPLVP----TAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYG 302
            +++    +++   P     A  S+  G FN    RGR  N N    +   GG     ++ 
Sbjct: 467  QKRKTRASSSSNAPFHAYAAQSSTHVGHFNKGNSRGRFHNRNRYTQNRNFGGNKPHNWHA 526

Query: 303  NSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYN 362
            N++   GG   + PS                 +G +SSSG  V CQ+C + GH A  C N
Sbjct: 527  NNS---GGILGAGPS--------------RQPAGPSSSSGCSVQCQLCLQYGHWAPMC-N 586

Query: 363  RLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNY 422
            RL+  F     P+ ++AM S   P            WL DSG + HVTPD S L     Y
Sbjct: 587  RLS-QFAQSQSPTAMSAMTSSASPS----------YWLTDSGASHHVTPDPSALNSAIPY 646

Query: 423  NGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIF 482
            +G D + V +G+G+ I+ TGS  + T H+   L+ +L VP  S NLLSV +   DN C  
Sbjct: 647  SGNDQLFVGDGKGLCISHTGSALIRTKHATFRLNDVLLVPQASHNLLSVYKFVYDNWCYL 706

Query: 483  VFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICA 542
             FD   F ++D ++G++L++G S+ GLYP    +S G        V+G+     +SP   
Sbjct: 707  TFDPFGFYVKDLSTGKMLFQGPSEGGLYPFYWNASNG--------VSGI----AISPTAL 766

Query: 543  SVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSI-TCDTKYTCRDCVSCLKGKASKLPFA 602
             +  + + +   WH RLGHPS   L  ++  N +           C +C  GK+ +L F+
Sbjct: 767  MIAKADIHT---WHRRLGHPSGGTLHSVVHKNHLPVIGYVNNMSVCTACQLGKSYRLSFS 826

Query: 603  SSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQ 662
            +    ++RPL LLH+DVWGPSP  S +G+R+Y+  VDDF+K++W++PL  KSDV S +K 
Sbjct: 827  TLPCTSSRPLQLLHTDVWGPSPTSSCTGYRFYLIIVDDFTKYSWLYPLHFKSDVFSTLKT 886

Query: 663  FVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRS 722
            F+  ++  L   +Q  RSD GGEF+N  +  FF+ +G+ HQ SC HT EQNG AE+KHR 
Sbjct: 887  FILKLQTLLDLQVQSIRSDSGGEFLNKSLQSFFNEQGITHQLSCLHTSEQNGCAERKHRH 946

Query: 723  IVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVF 782
            +V+    L++ + +P +FW  AF TVV+L+NRLP  +   ++P++ L+   P    L+ F
Sbjct: 947  VVEMGRTLLSQSDLPTQFWVEAFQTVVYLINRLPPQS-SVISPWELLFHASPKYHTLKAF 1006

Query: 783  GCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPF 842
            GCACYP L+PY+  KL  K+ Q VFLGY+L + GY C++  + ++ +SRHVVF E +FP+
Sbjct: 1007 GCACYPWLQPYSRDKLDFKSKQCVFLGYSLNHSGYRCWDPISNRLYISRHVVFDESLFPY 1066

Query: 843  AIRPVTRSHSHSHTLPSQPQP----NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPIN 902
                   SH HS  + S   P    +  +  S L  QSS    +   N+S   +  +  +
Sbjct: 1067 KSLSSQASH-HSPCVSSPLHPPMSLHLPLPVSHLEQQSSPAAALEGRNASPPSIFSTAAH 1126

Query: 903  TPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSH 962
            T                 +PSSA   S+     +SSP++   +        P  + +N+H
Sbjct: 1127 T----------------TIPSSA-QESLHTPPVSSSPAEPPPL--------PPPIPVNTH 1186

Query: 963  PMITRSKAGISKRKAFSAV---IPKSI-------PEPTSFTAASKIPEWKRAMLDEYTAL 1022
             MITR+KAGI K K F+A    +P ++       P P++F  ASK   W  AM  E+ AL
Sbjct: 1187 TMITRAKAGIHKPKVFTATKHQLPSTVDSLTALPPTPSTFLQASKSSHWMEAMQFEFQAL 1246

Query: 1023 TNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSP 1082
             +  TW LVP+    NIVGCKWVF+ K  PDG++ RYKARLVAKG++Q+EG+DF ETFSP
Sbjct: 1247 QSTGTWELVPNHSTYNIVGCKWVFKVKHKPDGTIERYKARLVAKGFHQQEGLDFSETFSP 1306

Query: 1083 VVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNH----- 1142
            V K TT+R++L++A  Y W +HQLDV NAFLHG+LKEDVYMVQPPGF D + P+H     
Sbjct: 1307 VAKPTTIRILLSIAVSYYWFIHQLDVSNAFLHGHLKEDVYMVQPPGFVDPSKPHHVCKLR 1366

Query: 1143 ------------------------------------------------------------ 1202
                                                                        
Sbjct: 1367 KSLYGLKQAPRAWYEAFYTAILSLGFSSSHSDTSLFIKRDTSITFILVYVDDIIITGSSV 1426

Query: 1203 -------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTC 1262
                   + L   F + DLG + YF+G+E+ +   G  + Q KY  DLL+KT M  +K C
Sbjct: 1427 TECQSIISQLQTMFPVKDLGDINYFLGIEVHKSDQGLLLHQAKYALDLLKKTDMLGAKPC 1486

Query: 1263 STPMSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYS 1322
            +TP+ST+ +L       SD   YR  VG+LQYLT+TRPD+ + V++V Q+MH P  IH  
Sbjct: 1487 ATPVSTS-KLDHSGTLLSDPTSYRSTVGALQYLTWTRPDLAFAVNQVCQYMHSPQTIHLQ 1546

BLAST of Lag0030971 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 770.4 bits (1988), Expect = 4.7e-221
Identity = 523/1453 (35.99%), Positives = 742/1453 (51.07%), Query Frame = 0

Query: 20   NSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFL-LDADG 79
            N++  L  N+ N+   +L S+N+L W  QV ++   + L G +DGS   PP  +  DA  
Sbjct: 11   NNTSILNVNMSNV--TKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAP 70

Query: 80   RKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSH 139
            R    VN  +T+W  QD  + + +   +S +    V    ++ ++W +L K +++ +  H
Sbjct: 71   R----VNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGH 130

Query: 140  VHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSF 199
            V +L++ L   +K  T++ID+Y+  +    D+LA +  P+D ++ +   L  L  EY   
Sbjct: 131  VTQLRTQLKQWTK-GTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPV 190

Query: 200  RTSIRTREGTLKLQELH-ALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRG 259
               I  ++    L E+H  LL  E+KIL     S T  P+   A+         S+R   
Sbjct: 191  IDQIAAKDTPPTLTEIHERLLNHESKILAV--SSATVIPITANAV---------SHRNTT 250

Query: 260  RGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSN 319
               N+N        G R+N+  Y N N      N++N         S P Q    N   N
Sbjct: 251  TTNNNN-------NGNRNNR--YDNRN------NNNN---------SKPWQQSSTNFHPN 310

Query: 320  SSSGRGVI--CQICNRSGHGALDCYNRLNL--SFQGRHPPSKLA-----AMASIYDPQSS 379
            ++  +  +  CQIC   GH A  C    +   S   + PPS        A  ++  P SS
Sbjct: 311  NNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQPRANLALGSPYSS 370

Query: 380  NNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTS 439
            NN       WL DSG   H+T D + L+L+  Y G D + VA+G  +PI+ TGS +LST 
Sbjct: 371  NN-------WLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTK 430

Query: 440  HSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGL 499
               LNL  IL VP++  NL+SV + C  N     F    F ++D  +G  L +GK+KD L
Sbjct: 431  SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDEL 490

Query: 500  YPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQK 559
            Y     SS   SL ++ +    H+S                    WH RLGHP+  +L  
Sbjct: 491  YEWPIASSQPVSLFASPSSKATHSS--------------------WHARLGHPAPSILNS 550

Query: 560  LLSANSIT-CDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVS 619
            ++S  S++  +  +    C  CL  K++K+PF+ ST  +TRPL  ++SDVW  SP++S  
Sbjct: 551  VISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHD 610

Query: 620  GFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNH 679
             +RYYV FVD F+++TW++PL +KS V      F   +EN+    +  F SD GGEFV  
Sbjct: 611  NYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFV-- 670

Query: 680  YVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVV 739
             ++E+FS  G+ H  S PHTPE NG +E+KHR IV+T + L++HAS+P  +W +AFA  V
Sbjct: 671  ALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAV 730

Query: 740  FLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLG 799
            +L+NRLP+  +   +PFQKL+G  P+   LRVFGCACYP L+PYN HKL  K+ Q VFLG
Sbjct: 731  YLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLG 790

Query: 800  YTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFA-----IRPVTRSHSHS------HTL- 859
            Y+L    YLC +++T ++ +SRHV F E+ FPF+     + PV      S      HT  
Sbjct: 791  YSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTL 850

Query: 860  ------------------------PSQPQPNPAVVTSLLSSQSSLVVRVLPNNSSVGDVL 919
                                    PS P  N  V +S L S  S      P  ++     
Sbjct: 851  PTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNG 910

Query: 920  LSPINTPSTSDVQPSPVLPTTDVLPSSASHS------SVVVASANSSPSQSAAISANNQS 979
              P   P+ +  Q      T+   P++ S S      S    S++SSPS + + S+++ S
Sbjct: 911  PQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTS 970

Query: 980  DPPAAVTI------------------NSHPMITRSKAGISK--RKAFSAVIPKSIPEPTS 1039
              P ++ I                  N+H M TR+KAGI K   K   AV   +  EP +
Sbjct: 971  PTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRT 1030

Query: 1040 FTAASKIPEWKRAMLDEYTALTNQNTWTLV-PSQEDMNIVGCKWVFRTKFNPDGSVARYK 1099
               A K   W+ AM  E  A    +TW LV P    + IVGC+W+F  K+N DGS+ RYK
Sbjct: 1031 AIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYK 1090

Query: 1100 ARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKED 1159
            ARLVAKGYNQR G+D+ ETFSPV+K T++R++L +A    W + QLDV NAFL G L +D
Sbjct: 1091 ARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDD 1150

Query: 1160 VYMVQPPGFRDSNYPNHT------------------------------------------ 1219
            VYM QPPGF D + PN+                                           
Sbjct: 1151 VYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVL 1210

Query: 1220 -------------------------------TLGREFQLTDLGSLRYFIGLEITRFSDGF 1279
                                            L + F + D   L YF+G+E  R   G 
Sbjct: 1211 QRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGL 1270

Query: 1280 HVTQLKYLTDLLEKTGMADSKTCSTPMSTTCELHGVS-PFFSDALLYRQIVGSLQYLTFT 1323
            H++Q +Y+ DLL +T M  +K  +TPM+ + +L   S    +D   YR IVGSLQYL FT
Sbjct: 1271 HLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFT 1330

BLAST of Lag0030971 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 724.5 bits (1869), Expect = 3.0e-207
Identity = 503/1458 (34.50%), Positives = 740/1458 (50.75%), Query Frame = 0

Query: 16   STQPNSSIFLLSNICNL---VPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEF 75
            +T     + + +NI N+      +L S+N+L W  QV ++   + L G +DGS P PP  
Sbjct: 2    ATHAEEIVLVNTNILNVNMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPAT 61

Query: 76   L-LDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQF 135
            +  DA  R    VN  +T+W  QD  + + I   +S +    V    ++ ++W +L K +
Sbjct: 62   IGTDAVPR----VNPDYTRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIY 121

Query: 136  SSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGL 195
            ++ +  HV +L+                ++ R     D+LA +  P+D ++ +   L  L
Sbjct: 122  ANPSYGHVTQLR----------------FITR----FDQLALLGKPMDHDEQVERVLENL 181

Query: 196  SSEYNSFRTSIRTREGTLKLQELH-ALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQF 255
              +Y      I  ++    L E+H  L+  E+K+L     ++ +  +VP      ++   
Sbjct: 182  PDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLL-----ALNSAEVVP------ITANV 241

Query: 256  NSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGR 315
             ++R     RN N      RG    N+ Y  N+N+     NS  PS          +  R
Sbjct: 242  VTHRNTNTNRNQN-----NRG---DNRNYNNNNNRS----NSWQPS---------SSGSR 301

Query: 316  GANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQS--S 375
              N       GR   CQIC+  GH A  C  +L+  FQ      +  +  + + P++  +
Sbjct: 302  SDNRQPKPYLGR---CQICSVQGHSAKRC-PQLH-QFQSTTNQQQSTSPFTPWQPRANLA 361

Query: 376  NNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTS 435
             NS  N   WL DSG   H+T D + L+ +  Y G D + +A+G  +PIT TGS +L TS
Sbjct: 362  VNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTS 421

Query: 436  HSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGL 495
               L+L+K+L VP++  NL+SV + C  N     F    F ++D  +G  L +GK+KD L
Sbjct: 422  SRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDEL 481

Query: 496  YPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQK 555
            Y     SS   S+ ++      H+S                    WH RLGHPS  +L  
Sbjct: 482  YEWPIASSQAVSMFASPCSKATHSS--------------------WHSRLGHPSLAILNS 541

Query: 556  LLSANSI-TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVS 615
            ++S +S+   +  +    C  C   K+ K+PF++ST  +++PL  ++SDVW  SP++S+ 
Sbjct: 542  VISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSID 601

Query: 616  GFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNH 675
             +RYYV FVD F+++TW++PL +KS V      F   +EN+    +    SD GGEFV  
Sbjct: 602  NYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFV-- 661

Query: 676  YVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVV 735
             + ++ S  G+ H  S PHTPE NG +E+KHR IV+  + L++HASVP  +W +AF+  V
Sbjct: 662  VLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAV 721

Query: 736  FLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLG 795
            +L+NRLP+  +   +PFQKL+G  P+   L+VFGCACYP L+PYN HKL+ K+ Q  F+G
Sbjct: 722  YLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMG 781

Query: 796  YTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSH-----------SHT-- 855
            Y+L    YLC ++ T ++  SRHV F E  FPF+      S S            SHT  
Sbjct: 782  YSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTL 841

Query: 856  ------LPSQPQPNPAVVTSLL--SSQSSLVVRVLPNNSSVGDVLLSPINT----PSTSD 915
                  LP+ P   P + TS    SS S L    + +++     + SP ++    PS + 
Sbjct: 842  PTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNG 901

Query: 916  VQP-------------SPVLPT---TDVLPSSASHSSVVVASANSSP---------SQSA 975
             QP             SP+L         P+S + +S +  S  SSP         S+  
Sbjct: 902  PQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPN 961

Query: 976  AISANNQSDPP--------------AAVTINSHPMITRSKAGISK--RKAFSAVIPKSIP 1035
            + S+++ S PP              A   +N+H M TR+K GI K  +K   A    +  
Sbjct: 962  SPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANS 1021

Query: 1036 EPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLV-PSQEDMNIVGCKWVFRTKFNPDGSV 1095
            EP +   A K   W++AM  E  A    +TW LV P    + IVGC+W+F  KFN DGS+
Sbjct: 1022 EPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSL 1081

Query: 1096 ARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGY 1155
             RYKARLVAKGYNQR G+D+ ETFSPV+K T++R++L +A    W + QLDV NAFL G 
Sbjct: 1082 NRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGT 1141

Query: 1156 LKEDVYMVQPPGFRDSNYPN---------------------------------------- 1215
            L ++VYM QPPGF D + P+                                        
Sbjct: 1142 LTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTS 1201

Query: 1216 -------------------------------HT--TLGREFQLTDLGSLRYFIGLEITRF 1275
                                           HT   L + F + +   L YF+G+E  R 
Sbjct: 1202 LFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEAKRV 1261

Query: 1276 SDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTT--CELHGVSPFFSDALLYRQIVGSLQ 1323
              G H++Q +Y  DLL +T M  +K  +TPM+T+    LH  +    D   YR IVGSLQ
Sbjct: 1262 PQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTK-LPDPTEYRGIVGSLQ 1321

BLAST of Lag0030971 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 7.1e-108
Identity = 372/1371 (27.13%), Positives = 584/1371 (42.60%), Query Frame = 0

Query: 73   LLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFS 132
            +LD D +K   + +    W   D    + I   LS    + +I   +++ +W  LE  + 
Sbjct: 36   VLDVDSKKPDTMKA--EDWADLDERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYM 95

Query: 133  SLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLS 192
            S T ++   LK  L+ +  S   +   +L     L+ +LA + V I++ED  +  LN L 
Sbjct: 96   SKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLP 155

Query: 193  SEYNSFRTSIRTREGTLKLQEL-HALLKSE--AKILEQQNKSITTTPLVPTAMFSSVSGQ 252
            S Y++  T+I   + T++L+++  ALL +E   K  E Q +++ T               
Sbjct: 156  SSYDNLATTILHGKTTIELKDVTSALLLNEKMRKKPENQGQALIT--------------- 215

Query: 253  FNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQG 312
                RGR   R+SN    SG  G   N+    + ++ R  +N + P   F +    P +G
Sbjct: 216  --EGRGRSYQRSSNNYGRSGARGKSKNR----SKSRVRNCYNCNQPG-HFKRDCPNPRKG 275

Query: 313  RGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSN 372
            +G  SG  +      + Q             N  N+             +  I + +   
Sbjct: 276  KGETSGQKNDDNTAAMVQ-------------NNDNV-------------VLFINEEEECM 335

Query: 373  NSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSH 432
            +       W+ D+  + H TP      L   Y   D  TV  G        G G +    
Sbjct: 336  HLSGPESEWVVDTAASHHATP---VRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKT 395

Query: 433  S---DLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKD 492
            +    L L  +  VPDL  NL+S      D    +  +  W   +      ++ KG ++ 
Sbjct: 396  NVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQKW---RLTKGSLVIAKGVARG 455

Query: 493  GLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVL 552
             LY            R+N+ +     +A    I          S  LWH R+GH S   L
Sbjct: 456  TLY------------RTNAEICQGELNAAQDEI----------SVDLWHKRMGHMSEKGL 515

Query: 553  QKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISV 612
            Q L   + I+     T + C  CL GK  ++ F +S+      L L++SDV GP  + S+
Sbjct: 516  QILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESM 575

Query: 613  SGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVN 672
             G +Y+V F+DD S+  W++ L  K  V  V ++F   +E +    L+  RSD GGE+ +
Sbjct: 576  GGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTS 635

Query: 673  HYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATV 732
                E+ SS G+ H+++ P TP+ NG AE+ +R+IV+   +++  A +P  FW  A  T 
Sbjct: 636  REFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTA 695

Query: 733  VFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFL 792
             +L+NR PS  + F  P +         SHL+VFGC  +  +      KL  K+   +F+
Sbjct: 696  CYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFI 755

Query: 793  GYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVV 852
            GY  E  GY  ++   KK++ SR VVF E                            + V
Sbjct: 756  GYGDEEFGYRLWDPVKKKVIRSRDVVFRE----------------------------SEV 815

Query: 853  TSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVV 912
             +       +   ++PN           +  PSTS+  P+    TTD +         V+
Sbjct: 816  RTAADMSEKVKNGIIPNF----------VTIPSTSN-NPTSAESTTDEVSEQGEQPGEVI 875

Query: 913  ASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSA---VIPKSIPEP 972
                        +    Q +         H  + RS+    + + + +   V+     EP
Sbjct: 876  EQGEQLDEGVEEVEHPTQGE-------EQHQPLRRSERPRVESRRYPSTEYVLISDDREP 935

Query: 973  TSFTAASKIPE---WKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSV 1032
             S       PE     +AM +E  +L    T+ LV   +    + CKWVF+ K + D  +
Sbjct: 936  ESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKL 995

Query: 1033 ARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGY 1092
             RYKARLV KG+ Q++G+DFDE FSPVVK T++R IL+LAA     + QLDVK AFLHG 
Sbjct: 996  VRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGD 1055

Query: 1093 LKEDVYMVQPPGFR-----------------------------DSNYPNHT--------- 1152
            L+E++YM QP GF                              DS   + T         
Sbjct: 1056 LEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPC 1115

Query: 1153 ------------------------------------TLGREFQLTDLGSLRYFIGLEIT- 1212
                                                 L + F + DLG  +  +G++I  
Sbjct: 1116 VYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVR 1175

Query: 1213 -RFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMS-----------TTCELHGVSPFFSD 1272
             R S    ++Q KY+  +LE+  M ++K  STP++           TT E  G       
Sbjct: 1176 ERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMA---- 1235

Query: 1273 ALLYRQIVGSLQY-LTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGIL 1332
             + Y   VGSL Y +  TRPDI + V  VS+F+  P + H+ AVK ILRYL GT    + 
Sbjct: 1236 KVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLC 1277

Query: 1333 FSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSL 1344
            F      L  Y+DAD AGD  +R+S++GY+  F G  +SW +K Q  V+ S+TEAEY + 
Sbjct: 1296 FGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAA 1277

BLAST of Lag0030971 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 345.1 bits (884), Expect = 4.9e-93
Identity = 346/1416 (24.44%), Positives = 591/1416 (41.74%), Query Frame = 0

Query: 38   DSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSA 97
            D   +  WKF+++++L    +  +VDG +P                 N     W   +  
Sbjct: 12   DGEKYAIWKFRIRALLAEQDVLKVVDGLMP-----------------NEVDDSWKKAERC 71

Query: 98   LITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESI 157
              + I   LS +  ++     +++++  +L+  +   + +    L+  L ++  S+  S+
Sbjct: 72   AKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEMSL 131

Query: 158  DEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRT-REGTLKLQEL-H 217
              +     EL+ +L      I++ D + + L  L S Y+   T+I T  E  L L  + +
Sbjct: 132  LSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFVKN 191

Query: 218  ALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRS 277
             LL  E KI    N    T+  V  A+  + +  + +N  + R               + 
Sbjct: 192  RLLDQEIKIKNDHN---DTSKKVMNAIVHNNNNTYKNNLFKNRVT-------------KP 251

Query: 278  NQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHG 337
             + + GNS                                        V C  C R GH 
Sbjct: 252  KKIFKGNSKY-------------------------------------KVKCHHCGREGHI 311

Query: 338  ALDCYNRLNLSFQGRHPPSKLAAMASIYD-----PQSSNNSGNNNCTWLADSGCNSHVTP 397
              DC++   +         K    A+ +       + +N S  +NC ++ DSG + H+  
Sbjct: 312  KKDCFHYKRILNNKNKENEKQVQTATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLIN 371

Query: 398  DLSTLALNSNYNGEDAITVA-NGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLS 457
            D S    +        I VA  G+ +  T+ G   L   H ++ L  +L   + + NL+S
Sbjct: 372  DESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDH-EITLEDVLFCKEAAGNLMS 431

Query: 458  VSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAG 517
            V +                SI+   SG  +    SK+GL  + +              +G
Sbjct: 432  VKR----------LQEAGMSIEFDKSGVTI----SKNGLMVVKN--------------SG 491

Query: 518  LHTSAPLSPICA-SVFVSHVPSTVLWHLRLGHPSFPVL-----QKLLSANSITCDTKYTC 577
            +  + P+    A S+   H  +  LWH R GH S   L     + + S  S+  + + +C
Sbjct: 492  MLNNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSC 551

Query: 578  RDCVSCLKGKASKLPFASSTSIT--TRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSK 637
              C  CL GK ++LPF      T   RPL ++HSDV GP   +++    Y+V FVD F+ 
Sbjct: 552  EICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTH 611

Query: 638  FTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQ 697
            +   + +  KSDV S+ + FV   E   +  +     D G E++++ + +F   KG+ + 
Sbjct: 612  YCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYH 671

Query: 698  RSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAI--G 757
             + PHTP+ NG +E+  R+I + A  +++ A +   FW  A  T  +L+NR+PS A+   
Sbjct: 672  LTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDS 731

Query: 758  FMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYN 817
              TP++  +   P L HLRVFG   Y  +K     K   K+ + +F+GY  E  G+  ++
Sbjct: 732  SKTPYEMWHNKKPYLKHLRVFGATVYVHIK-NKQGKFDDKSFKSIFVGY--EPNGFKLWD 791

Query: 818  METKKMLVSRHVVFHED--VFPFAIRPVT-----RSHSHSHTLPSQPQ-------PNPAV 877
               +K +V+R VV  E   V   A++  T        S +   P+  +       PN + 
Sbjct: 792  AVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESK 851

Query: 878  ----VTSLLSSQSS-----------LVVRVLPNNSSVGDVL--LSPINTPSTSDVQPSPV 937
                +  L  S+ S           ++    PN S   D +  L      +   +  S  
Sbjct: 852  ECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKK 911

Query: 938  LPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISK 997
                D L  S    +   +  + +      I  +N +       IN      ++K  IS 
Sbjct: 912  RKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISY 971

Query: 998  RKAFSAV-------------IPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLV 1057
             +  +++             +P S  E            W+ A+  E  A    NTWT+ 
Sbjct: 972  NEEDNSLNKVVLNAHTIFNDVPNSFDE---IQYRDDKSSWEEAINTELNAHKINNTWTIT 1031

Query: 1058 PSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRV 1117
               E+ NIV  +WVF  K+N  G+  RYKARLVA+G+ Q+  +D++ETF+PV + ++ R 
Sbjct: 1032 KRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRF 1091

Query: 1118 ILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPG----------------------- 1177
            IL+L   Y   +HQ+DVK AFL+G LKE++YM  P G                       
Sbjct: 1092 ILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDNVCKLNKAIYGLKQAAR 1151

Query: 1178 ---------FRDSNYPNHTT---------------------------------------- 1237
                      ++  + N +                                         
Sbjct: 1152 CWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKR 1211

Query: 1238 -LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTC 1297
             L  +F++TDL  +++FIG+ I    D  +++Q  Y+  +L K  M +    STP+ +  
Sbjct: 1212 YLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKI 1271

Query: 1298 ELHGVSPFFSDALLYRQIVGSLQYLTF-TRPDITYVVSKVSQFMHKPTEIHYSAVKRILR 1314
                ++         R ++G L Y+   TRPD+T  V+ +S++  K     +  +KR+LR
Sbjct: 1272 NYELLNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLR 1322

BLAST of Lag0030971 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 174.1 bits (440), Expect = 1.5e-41
Identity = 94/200 (47.00%), Positives = 124/200 (62.00%), Query Frame = 0

Query: 1104 LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCE 1163
            L   F + DLG + YF+G++I     G  ++Q KY   +L   GM D K  STP+     
Sbjct: 27   LSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLN 86

Query: 1164 LHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYL 1223
                +  + D   +R IVG+LQYLT TRPDI+Y V+ V Q MH+PT   +  +KR+LRY+
Sbjct: 87   SSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYV 146

Query: 1224 NGTRDCGILFSK-GKLELTAYSDADWAGDSMDRRSTSGYVVFF-CGIPVSWSAKKQSTVS 1283
             GT   G+   K  KL + A+ D+DWAG +  RRST+G+  F  C I +SWSAK+Q TVS
Sbjct: 147  KGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNI-ISWSAKRQPTVS 206

Query: 1284 RSSTEAEYRSLAQTAAELYW 1302
            RSSTE EYR+LA TAAEL W
Sbjct: 207  RSSTETEYRALALTAAELTW 225

BLAST of Lag0030971 vs. ExPASy TrEMBL
Match: A0A2N9GRJ0 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS30097 PE=4 SV=1)

HSP 1 Score: 1070.1 bits (2766), Expect = 1.1e-308
Identity = 630/1436 (43.87%), Positives = 852/1436 (59.33%), Query Frame = 0

Query: 16   STQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLD 75
            +TQ ++ I LLSNI NLV V+LD++N++ WK+QV S+L+A+SL   +DGS PCP +FL D
Sbjct: 7    NTQISNPIILLSNISNLVSVKLDNTNYIVWKYQVTSILEAYSLLEFIDGSQPCPEKFLRD 66

Query: 76   ADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLT 135
              G  +  VNS +T+W+++D  L+T++NATLS +  S V+G KS++ VW +LEK+F+S+ 
Sbjct: 67   EVGSFSLTVNSEYTKWMSRDKTLLTMLNATLSPSTLSMVVGQKSARGVWDTLEKRFTSVN 126

Query: 136  RSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEY 195
            RS++  LK  LH + K+  + +D +L RVKE  DKL  V V I DE++L   L GL +E+
Sbjct: 127  RSNILNLKMDLHGLMKN-NDPVDVFLQRVKESRDKLEAVDVHISDEEILHVVLKGLPTEF 186

Query: 196  NSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVS-------- 255
            +S R++IRTR   +   EL  LL +E   L+    +    PL+  AM S+ +        
Sbjct: 187  HSIRSAIRTRNDPISFDELRVLLSAEESSLKTNVDAPKDPPLM--AMLSTGNRFPPNHNP 246

Query: 256  GQFNSNRGRGRGRNSNPSYWSGRG-GGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAP 315
             QFN++  RGRGRN+N     GRG GGR+N         GRGGF + N S          
Sbjct: 247  PQFNNSSNRGRGRNNN-----GRGRGGRNN---------GRGGFQNQNFSSN-------- 306

Query: 316  NQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMAS----- 375
                    G++ +  +   CQIC + GH ALDCY+R++ S+QGRHPP+KLAA+AS     
Sbjct: 307  ------TFGNSGNQSQRPYCQICGKVGHLALDCYHRMDYSYQGRHPPAKLAALASGNNLL 366

Query: 376  -----------------------------IYDPQS----SNNSGNNNCTWLADSGCNSHV 435
                                           +P S    S N   +  TW++D+G   H 
Sbjct: 367  NSAPNQNTSVSPWHNQNTPPWAHQNPPWAQQNPSSWQPLSTNQAPSTTTWVSDTGATDHF 426

Query: 436  TPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLL 495
            TPDL+ L    +Y G D +++ NG G+PIT  G   L  S    NL KIL VP +  NLL
Sbjct: 427  TPDLTNLNNPMDYPGSDQVSIGNGTGLPITHIGHSQLKASSHIFNLRKILRVPCMKTNLL 486

Query: 496  SVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVA 555
            SV++ C DN C F FDA+ FSIQD  SGR LYKG SKDGLYPI  +SS+    + +ST  
Sbjct: 487  SVNKFCCDNACSFYFDANKFSIQDIFSGRTLYKGSSKDGLYPILGLSSS----QRHSTPC 546

Query: 556  GLHTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANS-ITCDT-KYTCRDC 615
              H+S P +    S F+    +  +WH RLGHP   VL  +L+    ++ +T K++   C
Sbjct: 547  --HSSTPPN----SAFLGTKGTKSVWHSRLGHPQDCVLHSVLNKQPWLSVNTAKFSSDCC 606

Query: 616  VSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIF 675
              C++GK  + PF SS+   T PL L+HSDVWGP+P+ S++G R+YV+FVD F++FTW+F
Sbjct: 607  THCVQGKLHQFPFPSSSFTATAPLELVHSDVWGPAPVTSINGTRFYVSFVDHFTRFTWLF 666

Query: 676  PLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPH 735
            P+  KS V +  + F   +EN L+  ++V R+D GGE+ N     F S++G+LHQ SCPH
Sbjct: 667  PIKHKSQVLATFQHFTATMENILNTRIKVLRTDCGGEYTNSAFESFCSTRGILHQFSCPH 726

Query: 736  TPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQK 795
            TP+QNG AE+KHR IV+TA+ L++ +S+PL++W +AF+T ++L+NR+P+  + F +P+Q 
Sbjct: 727  TPQQNGVAERKHRHIVETALTLISESSLPLQYWPYAFSTAIYLINRMPTPNLKFTSPWQL 786

Query: 796  LYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKML 855
            L+   PD S L+ FGC C+PLL+PYN HKL+P+++  VFLGY L  KGYLC N++T K+L
Sbjct: 787  LFHTNPDYSFLKTFGCLCFPLLRPYNKHKLEPRSSPCVFLGYALNAKGYLCLNLQTHKLL 846

Query: 856  VSRHVVFHEDVFPFAIR--PVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSSL--VVRVLP 915
            +SRHV FHE+ FPF  +  P + S   +  L S    +P    S+L    SL  +    P
Sbjct: 847  ISRHVAFHENSFPFKSQTSPSSCSTPSNTWLSSVLYFHPCTAPSILGPPPSLPPLSGSTP 906

Query: 916  NNSSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISAN 975
             +SS+ DV        S     PSP+L TT                  SSP  S  + + 
Sbjct: 907  LSSSLPDV--------SAQTEPPSPLLHTT------------------SSPHISCPVPSC 966

Query: 976  NQSDPPAAVTINSHPMITRSKAGISKRKAF--SAVIPKSIPEPTSFTAASKIPEWKRAML 1035
            +    P    INSHPM TR K+GISKRK    +  +     EP S+  ASK PEW+ AML
Sbjct: 967  SVPSGPILPPINSHPMQTRGKSGISKRKLLLHTKTLNPLETEPPSYKVASKYPEWQSAML 1026

Query: 1036 DEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDF 1095
            DEYTAL  Q TW+LVP   + NIVGCKWV++ K  PDGSVARYKARLVAKGY+Q+ G+D+
Sbjct: 1027 DEYTALQRQQTWSLVPPPSNHNIVGCKWVYKIKRKPDGSVARYKARLVAKGYHQQAGLDY 1086

Query: 1096 DETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPN 1155
            DETFSPVVK  TVR+IL++AA + WSL QLDV NAFLHG LKEDVYMVQP GF DS+ P+
Sbjct: 1087 DETFSPVVKPATVRLILSIAAQFRWSLRQLDVSNAFLHGLLKEDVYMVQPQGFVDSSRPH 1146

Query: 1156 HT---------------------------------------------------------- 1215
            H                                                           
Sbjct: 1147 HVCKLQKSLYGLKQAPRAWFERFTSQLLVLGFTASTADPSLFIYRSSSTVIFLLVYVDDI 1206

Query: 1216 ---------------TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTG 1275
                            L   F+L DLG L YF+GLE+   + GF V Q KY +DLL+K  
Sbjct: 1207 IITGNSPSALSSLVQQLATSFELKDLGPLTYFLGLEVDYSATGFFVHQHKYASDLLQKYN 1266

Query: 1276 MADSKTCSTPMSTTCEL-HGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMH 1323
            M D K CSTP  T+ +L   +     DA  +R +VG+LQYLTFTRPD+ Y V+ + QFMH
Sbjct: 1267 MWDCKPCSTPCCTSVKLTKQIGTPLPDATTFRSLVGALQYLTFTRPDLAYTVNSLCQFMH 1326

BLAST of Lag0030971 vs. ExPASy TrEMBL
Match: A0A2N9HKM9 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40116 PE=4 SV=1)

HSP 1 Score: 1010.7 bits (2612), Expect = 7.7e-291
Identity = 599/1370 (43.72%), Positives = 815/1370 (59.49%), Query Frame = 0

Query: 31   NLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQ 90
            +LV V+LD++N++ WK+Q+ S+ + +SL  ++DG+VP P ++L D +G  +   N  + Q
Sbjct: 17   DLVSVKLDATNYVIWKYQILSIFETYSLVDMLDGTVPPPEQYLADENGDLSLHENILYKQ 76

Query: 91   WIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTIS 150
            W A+D AL TLINATLS +A + VIG  +++ VW  LE++++SL+R+H+  LK+ L  + 
Sbjct: 77   WKARDQALKTLINATLSPSAITLVIGQTTAQGVWQVLERRYTSLSRTHILSLKAELDRVK 136

Query: 151  KSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLK 210
            KS TE+I  YL RVKE+ DKL +V V +DDEDLL   L GL +EY+ F +++RTR+  + 
Sbjct: 137  KSTTETITVYLDRVKEIRDKLGSVGVIVDDEDLLHTVLKGLPAEYDPFCSAMRTRDRAIS 196

Query: 211  LQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGR 270
             +ELH LL SE +  +         P +  AM ++ S  F           S P  W+  
Sbjct: 197  CEELHVLLTSEEESKKNVKHGGNDQPHM--AMAATHSQFFTPTTNNPLPLLSAP--WNRG 256

Query: 271  GGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICN 330
             GGR N    G  N  RGGF+S++      QG ++   G   N  S+ +S R   CQIC 
Sbjct: 257  RGGRGNNRGRGGRNSNRGGFSSNS------QGFASNPLGFSPNYNSSGTSQRPQ-CQICG 316

Query: 331  RSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTP 390
            ++GH ALDC++R+N ++QGR PP+KLAA+AS     + N   +N  +W++D+G   H TP
Sbjct: 317  KTGHLALDCFHRMNFAYQGRQPPAKLAAIASTAMSSAINAPYSNQSSWISDTGATDHFTP 376

Query: 391  DLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSV 450
            D+S +    +Y G D +TV NGQ +PIT TG+  L  S     L KIL VP +S+NLLSV
Sbjct: 377  DISHIPDCHDYRGTDQVTVGNGQSLPITHTGNSQLYASSHLFKLRKILHVPSMSSNLLSV 436

Query: 451  SQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGL 510
             + C DNN  F FDA  F I+D +SGR+LY G S+ GLYPI       SS +        
Sbjct: 437  HRFCKDNNASFYFDASKFRIKDLSSGRLLYNGPSEHGLYPIHGAILPASSPKL------F 496

Query: 511  HTSAPLSPICASVFVSHVPSTVLWHLRLGHPSFPVLQKLLSANSITCDTKYTCRDCVSCL 570
            HTSA  S            S+ LWH RLGHP   V++ +L  N +      T   C+ CL
Sbjct: 497  HTSAVSS------------SSQLWHNRLGHPQQSVVKHVLQ-NKLRLPVSNTTSLCIHCL 556

Query: 571  KGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVR 630
            +GK  KLPF +S SIT+ PL ++HSDVWGP+P+ S +  RYYV FVDDF++FTW FPL  
Sbjct: 557  EGKMHKLPFPNSVSITSHPLEIVHSDVWGPAPITSNNETRYYVTFVDDFTRFTWFFPLQS 616

Query: 631  KSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQ 690
            KS V S    F   +EN LSC L++ R+D GGE+  H    F SS GV HQ +CPHT +Q
Sbjct: 617  KSQVLSSFMHFKSTMENLLSCKLKILRTDCGGEYTKHDFQSFCSSTGVFHQFTCPHTSQQ 676

Query: 691  NGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGC 750
            NG AE+KHR IVD  + LM+ AS+PL FW +AF+T VFL+NRLPS   G ++P++ L+G 
Sbjct: 677  NGVAERKHRHIVDMGLTLMSQASLPLTFWPYAFSTAVFLINRLPSPHRGLISPWESLFGS 736

Query: 751  VPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRH 810
             P  S  R FGCACYPLL+PY+ HKL P++ Q +FLGY    KG+LC++  + +  VSRH
Sbjct: 737  SPPYSIFRSFGCACYPLLRPYSKHKLLPRSVQCIFLGYPSNAKGFLCFDPVSSRFFVSRH 796

Query: 811  VVFHEDVFPFAIRPVTRSHSHSHTLPSQPQ-PNPAVVTSLLSSQSSLVVRVL-----PNN 870
            V F E VFPF     T S SH   LP+  Q  NPA +++LL   S  +  +L     P  
Sbjct: 797  VTFDESVFPFHKLSSTPSFSH---LPAHSQASNPAWLSALLYFHSCSLPSLLGAPPTPVL 856

Query: 871  SSVGDVLLSPINTPSTSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAA-ISANN 930
            +S  ++ + P+   S + +   PV+P++    S++S + V  +S    PS S A ++A++
Sbjct: 857  NST-NMPIPPLAPTSVTVLSSVPVVPSSTAPVSTSSTAPVPTSSTAPVPSSSTAPVTASS 916

Query: 931  QSD--PPAAVTINSHPMITRSKAGISKRKAFSAVIPKSIP-----EPTSFTAASKIPEWK 990
             +   P + +  N+HPM TR K+GI+K+K    ++ KS P     EP SF+ A  IP+W 
Sbjct: 917  LTPVAPSSPLVANAHPMQTRGKSGITKKK--QLLLTKSAPDYLHTEPPSFSVARTIPQWH 976

Query: 991  RAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQRE 1050
             AM  E+ ALT Q+TW+LVP   D +I+GC WVF+ K N DGSVARYKARLVAKG +Q  
Sbjct: 977  EAMASEFAALTRQSTWSLVPPSPDHHIIGCHWVFKLKRNSDGSVARYKARLVAKGNHQMP 1036

Query: 1051 GVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDS 1110
            G+DF ETFSPVVK  TVR+IL++AA   WSL QLDV NAFLHG LKE V+M QPPGF DS
Sbjct: 1037 GIDFAETFSPVVKPATVRLILSIAAQNQWSLRQLDVSNAFLHGSLKECVFMSQPPGFVDS 1096

Query: 1111 NYPNH------------------------------------------------------- 1170
              P+H                                                       
Sbjct: 1097 TAPSHVCLLHKSIYGLRQAPRAWFEKFSSHLLTVGFTASQADPSLFIYRHGSTVLYLLLY 1156

Query: 1171 ------------------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLL 1230
                              T L   F+L DLG L++F+GL+I   + GF V Q KY  D+L
Sbjct: 1157 VDDIIITGNHSTAVTELITNLASVFELKDLGPLKFFLGLQIDYKTSGFFVHQSKYALDVL 1216

Query: 1231 EKTGMADSKTCSTPMSTTCELHG-VSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVS 1290
             +  M   K C++P  +  +L   V  F  D   YR +VG+LQYLTFTRPD+++ V+ + 
Sbjct: 1217 SRHNMTTCKPCTSPFVSCSKLSSDVVEFLLDPTPYRSLVGALQYLTFTRPDLSFAVNSLC 1276

Query: 1291 QFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYV 1313
            Q M  PT  H  A KR+LRY+ GT   GILF  G + LT ++DADWAG+ +DRRST+G++
Sbjct: 1277 QHMQNPTSAHMVAAKRVLRYVRGTLSHGILFQPGPMHLTVFTDADWAGNPVDRRSTTGFL 1336

BLAST of Lag0030971 vs. ExPASy TrEMBL
Match: A0A2N9IE26 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50904 PE=4 SV=1)

HSP 1 Score: 1006.5 bits (2601), Expect = 1.4e-289
Identity = 602/1420 (42.39%), Positives = 819/1420 (57.68%), Query Frame = 0

Query: 9    SDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPC 68
            ++ SS  S    S I LLSNI NL+  +LDS+N+  WK+Q+ S+ +++SL   +DGS   
Sbjct: 16   TNTSSNISNITQSPILLLSNISNLISAKLDSTNYTLWKYQLLSIFESYSLLDHIDGSTHS 75

Query: 69   PPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLE 128
            P  +L D  G  TT+ +  + QW  +D AL TL+NATLS +A S V+   +++ VW  LE
Sbjct: 76   PERYLQDESGAFTTQESVQYKQWKIRDQALKTLLNATLSPSALSLVLRQSTARGVWEVLE 135

Query: 129  KQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTL 188
            ++++SL+R+H+  LK  L  I K   ES+  +L RVKEL DKL+ V V +DDE+LL   L
Sbjct: 136  RRYTSLSRTHILTLKGELDRIQKK-NESMSAFLDRVKELRDKLSAVGVEVDDEELLHVVL 195

Query: 189  NGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSG 248
             GL SEY++F +++RTR+ ++  +ELH LL SE +   ++N    ++ +   AM ++ S 
Sbjct: 196  KGLPSEYDAFCSAMRTRDRSISCEELHVLLTSEEE--SKKNSKHMSSDVPHMAMAANASS 255

Query: 249  Q--------FNS--NRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPG 308
                     F+S  NRGRG GR+ N   + GRG     +G YGNS   RGGF        
Sbjct: 256  PATNTPLPLFSSPWNRGRG-GRSQN---YRGRG-----RGNYGNS---RGGFQQ------ 315

Query: 309  FPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAA 368
            FPQ +  PN    + + S S + R   CQIC + GH ALDC++R+N ++QGRHPP+KLAA
Sbjct: 316  FPQ-NMQPNSQVFSQNSSTSQNSRPT-CQICGKPGHVALDCFHRMNFAYQGRHPPAKLAA 375

Query: 369  MASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPIT 428
            +AS     + +   +    W++D+G   H TPD++ +     Y G D +TV NGQ +PIT
Sbjct: 376  IASTNMSNAISAPTSTQSCWISDTGATDHFTPDITHIPDCHAYTGNDFVTVGNGQSLPIT 435

Query: 429  QTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRI 488
             T +  L  S    NL K+L VP +S++LLSV + C DN+  F FDA  F I+   SG++
Sbjct: 436  HTVNSQLRASSHLFNLRKVLHVPSMSSSLLSVYRFCKDNDASFYFDASKFHIKALRSGKL 495

Query: 489  LYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVLWHLRL 548
            LY G S+ GLYP+      G+ L ++S+              +S   S   S  LWH RL
Sbjct: 496  LYSGLSERGLYPV-----RGAILPTSSS--------------SSFAFSSTTSAQLWHTRL 555

Query: 549  GHPS----FPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLH 608
            GHP       VL K    NS++    +    C  C++GK  +LPF  S SITTRPL L+H
Sbjct: 556  GHPQSRVFSHVLNKFFPVNSVSNKVPF----CTHCVEGKHHQLPFNDSVSITTRPLELVH 615

Query: 609  SDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQ 668
            +DVWGP+P+ S +G RYYV+F+DDF++FTW FPL  KS V    K F   +EN L C ++
Sbjct: 616  TDVWGPAPVTSCNGTRYYVSFIDDFTRFTWFFPLKYKSQVLDSFKHFKSTMENILDCKIK 675

Query: 669  VFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASV 728
            + RSD GGE+       F SS G+LHQ SCPHT +QNG AE+KHR IVD A+ L++ +S+
Sbjct: 676  LLRSDCGGEYSKSEFQSFCSSAGILHQFSCPHTSQQNGVAERKHRHIVDMALTLISQSSL 735

Query: 729  PLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTH 788
            PL  W +AF+T VFL+NRLPS +  F +P++ L+G  PD    RVFGC CYPLL+ Y+ H
Sbjct: 736  PLNLWPYAFSTAVFLINRLPSVSRQFTSPWECLFGSTPDYKSFRVFGCTCYPLLRSYSRH 795

Query: 789  KLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHT 848
            KLQP++   VFLGY    KG+LCY+    +  VSRHV F E  FP+   P   S+S S +
Sbjct: 796  KLQPRSVPCVFLGYASNAKGFLCYDCSAHRFYVSRHVKFDETSFPYKNLPSPPSNSSSSS 855

Query: 849  LPSQPQP----------NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPSTSDVQ 908
              +              +P  V S+L    S     L ++S V  V L      S+S + 
Sbjct: 856  SVTSHTSSIWLSHLLFFHPCSVPSILGPPPSNSSIPLVSSSIVPSVSLDSYTHSSSSSIP 915

Query: 909  PSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMITRSKA 968
              P  P +D  P            A  +P   A +SA      P   + N HPM TR+K+
Sbjct: 916  DLPTAPISDTQP------------APLTPEVPALVSA------PLISSTNIHPMCTRAKS 975

Query: 969  GISKRKAFSAVIPKSIP--------EPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVP 1028
            GI+KRK        S+P        EPT++T A KIP+W  AM  E+ AL  Q TWTLVP
Sbjct: 976  GITKRKPGFLATHTSLPGSLDYLNTEPTTYTIACKIPQWHAAMASEFAALQRQATWTLVP 1035

Query: 1029 SQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVI 1088
            S    +++GC+WVF+ K N DGSVAR+KARLVAKG +Q+ G+DFDETFSPVVK  TVR++
Sbjct: 1036 SSSSQHVIGCRWVFKLKRNADGSVARFKARLVAKGNHQQAGLDFDETFSPVVKPATVRLV 1095

Query: 1089 LALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT-------------- 1148
            L+LAA YGWSL QLDV NAFLHG LKE V+M QPPGF D N+P+H               
Sbjct: 1096 LSLAAQYGWSLRQLDVSNAFLHGSLKEHVFMRQPPGFVDPNHPSHVCLLQKSIYGLRQAP 1155

Query: 1149 -----------------------------------------------------------T 1208
                                                                        
Sbjct: 1156 RAWFEKFSSHLLTIGFTASLADPSLFVYKNGSTVIYLLLYVDDIILTGSVPAAIQELIRD 1215

Query: 1209 LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTP-MSTTC 1268
            L + F+L DLG L+YF+GL++     G  V Q KY TDLL+K  M+  K CSTP +  + 
Sbjct: 1216 LAQAFELKDLGPLKYFLGLQVEYTPSGLLVHQTKYATDLLDKHNMSTCKPCSTPFVPPST 1275

Query: 1269 ELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRY 1323
             +   S   SD   YR +VG+LQYLTFTRPD+++ ++ + Q MH+PT  H  A KR+LRY
Sbjct: 1276 SVLTESSLLSDPFSYRSLVGALQYLTFTRPDLSFAINSLCQHMHQPTTSHLVAAKRVLRY 1335

BLAST of Lag0030971 vs. ExPASy TrEMBL
Match: A0A2N9HUP1 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS43647 PE=4 SV=1)

HSP 1 Score: 997.3 bits (2577), Expect = 8.8e-287
Identity = 603/1425 (42.32%), Positives = 812/1425 (56.98%), Query Frame = 0

Query: 1    MASSSSSV-------SDNSSGCSTQPNSSIFLLSNICNLVPVRLDSSNFLFWKFQVQSML 60
            MA +SSS        ++ SS  S    S I LLSNI NL+  +LDS+N+  WK+Q+ S+ 
Sbjct: 1    MAETSSSTNTHTHQNTNTSSNISNITQSPILLLSNISNLISAKLDSTNYTLWKYQLLSIF 60

Query: 61   KAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWIAQDSALITLINATLSKTAYSY 120
            +++SL   +DGS   P  +L D  G  TT+ +  + QW  +D AL TL+NATLS +A S+
Sbjct: 61   ESYSLLDHIDGSTHSPERYLQDESGAFTTQESVQYKQWKIRDQALKTLLNATLSPSALSF 120

Query: 121  VIGCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTISKSATESIDEYLLRVKELVDKLAT 180
            VI   +++ VW  LE++++SL+R+HV  LK  L  I K   ES+  +L RVKEL DKL+ 
Sbjct: 121  VIRQSTARGVWEVLERRYTSLSRTHVLTLKGELDQIQKK-NESMSAFLDRVKELRDKLSA 180

Query: 181  VSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLKLQELHALLKSEAKILEQQNKSIT 240
            V V +DDE+LL   L GL SEY++F +++RTR+ ++  +ELH LL SE    E+  K+  
Sbjct: 181  VGVEVDDEELLHVVLKGLPSEYDAFCSAMRTRDRSISCEELHVLLTSE----EESKKN-- 240

Query: 241  TTPLVPTAMFSSVSGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSS 300
                         S   +S+  RGRG  S     + RG GR N   YGNS   RGGF   
Sbjct: 241  -------------SKHMSSDWNRGRGGRSQ----NYRGRGRGN---YGNS---RGGFQQ- 300

Query: 301  NPSPGFPQGHSAPNQGRGANSGSNSSSGRGVICQICNRSGHGALDCYNRLNLSFQGRHPP 360
                 FPQ     +Q    NS ++ +S     CQIC + GH ALDC++R+N ++QGRHPP
Sbjct: 301  -----FPQIMQLNSQVFPQNSSTSQNS--RPTCQICGKPGHVALDCFHRMNFAYQGRHPP 360

Query: 361  SKLAAMASIYDPQSSNNSGNNNCTWLADSGCNSHVTPDLSTLALNSNYNGEDAITVANGQ 420
            +KLAA+AS     + +   +    W++D+G   H TPD++ +     Y G D++TV NGQ
Sbjct: 361  AKLAAIASTNMSNAISAPTSTQSCWISDTGATDHFTPDITHIPDCHAYTGNDSVTVGNGQ 420

Query: 421  GVPITQTGSGTLSTSHSDLNLSKILCVPDLSANLLSVSQCCLDNNCIFVFDADWFSIQDK 480
             +PIT TG+  L  S    NL K+L VP +S++LLSV + C DN+  F FDA  F I+D 
Sbjct: 421  SLPITHTGNSQLRASSHLFNLRKVLHVPSMSSSLLSVYRFCKDNDASFHFDASKFHIKDL 480

Query: 481  TSGRILYKGKSKDGLYPISSISSAGSSLRSNSTVAGLHTSAPLSPICASVFVSHVPSTVL 540
             SG++LY G S+ GLYP+      G+ L  +S+              +S   S   S  L
Sbjct: 481  HSGKLLYSGLSERGLYPV-----RGTILPPSSS--------------SSFAFSSTTSAQL 540

Query: 541  WHLRLGHPS----FPVLQKLLSANSITCDTKYTCRDCVSCLKGKASKLPFASSTSITTRP 600
            WH RLGHP       VL K L  NS++    +    C  C++GK  +LPF  S SITTRP
Sbjct: 541  WHTRLGHPQSRVFSHVLNKFLPVNSVSNKVPF----CTHCVEGKHHQLPFNDSVSITTRP 600

Query: 601  LALLHSDVWGPSPLISVSGFRYYVNFVDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQL 660
            L L+H+DVWGP+P+ S +G RYYV+F+DDF++FTW FPL  KS V    K F   +EN L
Sbjct: 601  LELVHTDVWGPAPVTSCNGTRYYVSFIDDFTRFTWFFPLKYKSQVLDSFKHFKSTMENIL 660

Query: 661  SCSLQVFRSDGGGEFVNHYVYEFFSSKGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALM 720
             C +++ RSD GGE+       F SS G+LHQ SCPHT +QNG AE+KHR IVD A+ L+
Sbjct: 661  DCKIKILRSDCGGEYSKSEFQSFCSSAGILHQFSCPHTSQQNGVAERKHRHIVDMALTLI 720

Query: 721  NHASVPLEFWYHAFATVVFLLNRLPSSAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLK 780
            + +S+PL  W +AF+T VFL+NRLPS +    +P++ L+G  PD    RVFGC CYPLL+
Sbjct: 721  SQSSLPLNLWPYAFSTAVFLINRLPSVSRQLASPWECLFGSTPDYKSFRVFGCTCYPLLR 780

Query: 781  PYNTHKLQPKTAQHVFLGYTLEYKGYLCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSH 840
             Y+ HKLQP++   VFLGY    KG+LCY+    +  VSRHV F E  FP+   P   S+
Sbjct: 781  SYSRHKLQPRSVPCVFLGYASNAKGFLCYDCSAHRFYVSRHVKFDETSFPYRNLPSQPSN 840

Query: 841  SHSHTLPSQPQP----------NPAVVTSLLSSQSSLVVRVLPNNSSVGDVLLSPINTPS 900
            S S +  +              +P  V S+L    S     L ++S V  V L      S
Sbjct: 841  SSSSSSVTSNTSSIWLSHLLFFHPCTVPSILGPPPSNSSVPLVSSSIVPSVSLDSYTHSS 900

Query: 901  TSDVQPSPVLPTTDVLPSSASHSSVVVASANSSPSQSAAISANNQSDPPAAVTINSHPMI 960
             S +   P  P  D  P            A  +P   A + A      P   + N HPM 
Sbjct: 901  LSSIPDLPTAPIFDTQP------------APLNPEVPALVPA------PLISSTNIHPMC 960

Query: 961  TRSKAGISKRKAFSAVIPKSIP--------EPTSFTAASKIPEWKRAMLDEYTALTNQNT 1020
            TR+K+GI+KRK        S+P        EPT++T A KIP+W   M  E+ AL  Q T
Sbjct: 961  TRAKSGITKRKPGFLATHTSLPGSLDYLNTEPTTYTIACKIPQWHADMASEFAALQRQAT 1020

Query: 1021 WTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKT 1080
            WTLVPS    +++GC+WVF+ K N DGSVAR+KARLVAKG +Q+ G+DFDETFSPVVK  
Sbjct: 1021 WTLVPSSSSQHVIGCRWVFKLKRNTDGSVARFKARLVAKGNHQKAGLDFDETFSPVVKLA 1080

Query: 1081 TVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFRDSNYPNHT--------- 1140
            TVR++L+LAA Y WSL QLDV NAFLHG LKE V+M QPPGF D N+P+H          
Sbjct: 1081 TVRLVLSLAAQYRWSLRQLDVSNAFLHGSLKEHVFMRQPPGFVDPNHPSHVCLLQKSIYG 1140

Query: 1141 ------------------------------------------------------------ 1200
                                                                        
Sbjct: 1141 LRQAPRAWFEKFSNHLLTVGFTASLADPSLFVYKNGSTVIYLLLYVDDIILTGSVPAAIQ 1200

Query: 1201 ----TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTP- 1260
                 L + F+L DLG L+YF+GL++   + G  V Q KY TDLL+K  M+  K CSTP 
Sbjct: 1201 ELIRDLAQAFELKDLGPLKYFLGLQVEYTTSGLLVHQTKYATDLLDKHNMSTCKPCSTPF 1260

Query: 1261 MSTTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVK 1320
            +  +  +   S   SD   YR +VG+LQYLTFTRPD+++ ++ + Q MH+PT  H  A K
Sbjct: 1261 VPPSTSVLTESSLLSDPFSYRSLVGALQYLTFTRPDLSFAINSLCQHMHQPTTSHLVAAK 1320

Query: 1321 RILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQ 1323
            R+LRY+ GT   GILF  G L LTA++D+DWAG+ +DRRST+G+++F     ++W++KKQ
Sbjct: 1321 RVLRYIRGTISHGILFQPGPLRLTAFTDSDWAGNPVDRRSTTGFLIFLGNNLLTWASKKQ 1346

BLAST of Lag0030971 vs. ExPASy TrEMBL
Match: A0A2N9IEP2 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50263 PE=4 SV=1)

HSP 1 Score: 989.2 bits (2556), Expect = 2.4e-284
Identity = 578/1411 (40.96%), Positives = 805/1411 (57.05%), Query Frame = 0

Query: 23   IFLLSNICNLVPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTT 82
            + LLSNI NLV V+LD SN++ WK+Q+ S+LKA+S+   VDG+  CPPE+L +++G  + 
Sbjct: 1    MILLSNISNLVSVKLDHSNYVLWKYQITSILKAYSVLSFVDGTQQCPPEYLQNSNG--SL 60

Query: 83   EVNSAHTQWIAQDSALITLINATLSKTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHVHEL 142
            + NS + QWI++D  L+TLIN+TLS TA S V+G  ++  VW  LEK+++S +RS++  L
Sbjct: 61   QENSLYQQWISRDQGLLTLINSTLSPTALSLVVGQTTAHGVWSILEKRYTSSSRSNILNL 120

Query: 143  KSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSI 202
            K  LH+I K +T+SI+ +L ++K+  D+L  V V ID+E++L   L GL  EY++F    
Sbjct: 121  KMELHSIKKESTDSINSFLQKIKDTRDRLGAVGVQIDNEEILHIVLKGLPHEYHAFGNRG 180

Query: 203  RTREGTLKLQELHALLKSEAKILEQQNKSITTTPLVPTAMFSSVSGQFNSNRGRGRGRNS 262
            R                                            G+ + NRGRGR  ++
Sbjct: 181  R--------------------------------------------GRNSFNRGRGRNFHN 240

Query: 263  NPSYWSGRGGGRSNQGYYGNSNQGR-GGFNSSNPSPGFPQGHSAPNQGRGANSGSNSSSG 322
            N    SGR GG +N G   + N G  GGFN+                    NS    S  
Sbjct: 241  N----SGR-GGYNNAGNNSSGNAGSLGGFNNH------------------YNSSPTQSYN 300

Query: 323  RGVICQICNRSGHGALDCYNRLNLSFQGRHPPSKLAAMASIYDPQSSNNSGNNNCTWLAD 382
            +   CQIC ++GH ALDCY+R++ S+QG+ PPSKLAAMA+     +SN+  ++   W++D
Sbjct: 301  QRPACQICGKNGHAALDCYHRMDYSYQGKQPPSKLAAMAA-----TSNSQHSDQSYWISD 360

Query: 383  SGCNSHVTPDLSTLALNSNYNGEDAITVANGQGVPITQTGSGTLSTSHSDLNLSKILCVP 442
            +G   H TPDLST+  +  Y G D  TV NGQ +PIT  G+  L  S    +L K+L VP
Sbjct: 361  TGATDHFTPDLSTIPDHQEYTGTDLATVGNGQAIPITHIGNSQLKASSHLFHLRKVLRVP 420

Query: 443  DLSANLLSVSQCCLDNNCIFVFDADWFSIQDKTSGRILYKGKSKDGLYPISSISSAGSSL 502
             +++NLLSV++ C DNNC F+FDA+ F I+D  +G++LY+G SK+GLYPI  +S      
Sbjct: 421  SMASNLLSVNKFCRDNNCCFLFDANQFKIKDMPTGKLLYRGPSKNGLYPIDGVS------ 480

Query: 503  RSNSTVAGLHTSAPLSPICASVFVSHVPST-----VLWHLRLGHPSFPVLQKLLSANSI- 562
                          L P C +   S + ST      +WH RLGHP+  V Q++ S + + 
Sbjct: 481  --------------LPPPCHTSNFSSIQSTKSVSSKVWHDRLGHPNSQVQQRIFSNSPVH 540

Query: 563  TCDTKYTCRDCVSCLKGKASKLPFASSTSITTRPLALLHSDVWGPSPLISVSGFRYYVNF 622
               +  T   C  C++GK + LPF  S S   +PL ++HSDVWGPSP+ S  G R+YV F
Sbjct: 541  NSSSNKTESACTHCIQGKMTHLPFHKSVSKACKPLEIIHSDVWGPSPITSDGGTRFYVIF 600

Query: 623  VDDFSKFTWIFPLVRKSDVSSVIKQFVPFIENQLSCSLQVFRSDGGGEFVNHYVYEFFSS 682
            VD+F++FTW +P+  KS V S    F   ++N L+  +++ R+D GGE+ ++  + F  S
Sbjct: 601  VDEFTRFTWFYPIRNKSQVLSCFVSFSNTMQNLLNHKIKILRTDCGGEYASNEFHSFCIS 660

Query: 683  KGVLHQRSCPHTPEQNGAAEQKHRSIVDTAIALMNHASVPLEFWYHAFATVVFLLNRLPS 742
             G+ HQ +CPHT +QNG AE+KHR IVD A+ L++ +S+PL FW +AF+T V+L+NR+P 
Sbjct: 661  HGITHQYTCPHTSQQNGLAERKHRHIVDIALTLISQSSLPLSFWPYAFSTAVYLINRVPP 720

Query: 743  SAIGFMTPFQKLYGCVPDLSHLRVFGCACYPLLKPYNTHKLQPKTAQHVFLGYTLEYKGY 802
            S     +P++ L+   P+ + LR FGC CYPL++PYN+HKLQP++ + VFLGY    KGY
Sbjct: 721  SNSKTSSPWELLFHRQPNYASLRTFGCLCYPLMRPYNSHKLQPRSVECVFLGYATNAKGY 780

Query: 803  LCYNMETKKMLVSRHVVFHEDVFPFAIRPVTRSHSHSHTLPSQPQPNPAVVTSLLSSQSS 862
            LCYN+ T+K   SRHV+F + VFPF          H  +    P P P  + + LS  + 
Sbjct: 781  LCYNIHTRKYYTSRHVIFTKSVFPF----------HKSSSVQTPIP-PTWLNTNLSFHTC 840

Query: 863  LVVRVLPNNSSVGDVLLSPINTPSTSDVQPS----PVL-PTTDVLPSSASHSSVVVASAN 922
             +  +L +    G V+ SP   PS     PS    P+L P +D+ P+ +S          
Sbjct: 841  PLTPILGS----GPVVSSP---PSILGPHPSLSSIPILDPPSDISPTLSS---------- 900

Query: 923  SSPSQSAAISANNQSDPPAAVTINSHPMITRSKAGISKRKAFSAVIPKSI--PEPTSFTA 982
            + P    A S  N    P       HPM TR+K+GI K K F   +       EP ++  
Sbjct: 901  NPPHLPLANSVTNPHSSP-------HPMQTRAKSGIFKPKQFHHTLVNDYLNTEPPTYKI 960

Query: 983  ASKIPEWKRAMLDEYTALTNQNTWTLVPSQEDMNIVGCKWVFRTKFNPDGSVARYKARLV 1042
            AS++P+W+ AM  E+ AL  QNTWTLVPS  + N+VGC+WV++ K N DGS+ARYKARLV
Sbjct: 961  ASQLPQWQDAMTSEFQALQRQNTWTLVPSSSNQNLVGCRWVYKLKRNSDGSIARYKARLV 1020

Query: 1043 AKGYNQREGVDFDETFSPVVKKTTVRVILALAAHYGWSLHQLDVKNAFLHGYLKEDVYMV 1102
            AKGY+Q++G+D+DETFSPVVK  TVR+IL++AA   WSL QLDV NAFLHG LKE+VYM 
Sbjct: 1021 AKGYHQQQGMDYDETFSPVVKPATVRLILSIAAQQNWSLKQLDVSNAFLHGLLKENVYMQ 1080

Query: 1103 QPPGFRDSNYPNH----------------------------------------------- 1162
            QPPGF D  YP H                                               
Sbjct: 1081 QPPGFIDPQYPKHVCQLQKALYGLKQAPRAWFERFTSHLLTMGFTPSLADPSLFLYRQGS 1140

Query: 1163 --------------------------TTLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQ 1222
                                      T L +EF + DLG L++F+GL+I   S GF V Q
Sbjct: 1141 TVVYLLLYVDDIIVTGNQPTAIQSLITKLAQEFDIKDLGQLKFFLGLQIDYRSSGFFVHQ 1200

Query: 1223 LKYLTDLLEKTGMADSKTCSTPMSTTCELHGVSPF-FSDALLYRQIVGSLQYLTFTRPDI 1282
             KY TDLL K  M+  K CSTP  +   +         D   +R +VG LQYLTFTRPD+
Sbjct: 1201 HKYATDLLAKFNMSTCKPCSTPFVSLSRIRKDDGIPLPDPTPFRSMVGGLQYLTFTRPDL 1260

Query: 1283 TYVVSKVSQFMHKPTEIHYSAVKRILRYLNGTRDCGILFSKGKLELTAYSDADWAGDSMD 1342
            +Y V+ + QFMH+PT+ H  A KRILRY+ GT   G+ F  G L LTA++D+DWAGD MD
Sbjct: 1261 SYAVNHICQFMHQPTDHHLVAAKRILRYVQGTLHHGLTFRPGPLSLTAFTDSDWAGDPMD 1282

Query: 1343 RRSTSGYVVFFCGIPVSWSAKKQSTVSRSSTEAEYRSLAQTAAELYWIRQLLCDLCVFLP 1346
            RRST+G +VF    P++W +KKQ TV+RSSTEAEYR+LA  AA+L W+R +L DL +FL 
Sbjct: 1321 RRSTTGLIVFLGHNPITWQSKKQPTVARSSTEAEYRALANCAADLAWVRMVLKDLGIFLH 1282

BLAST of Lag0030971 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 275.4 bits (703), Expect = 3.4e-73
Identity = 184/524 (35.11%), Positives = 265/524 (50.57%), Query Frame = 0

Query: 881  SDVQPSPVLPTTDVLPSSASHSSVVVASANSS--PSQSAAISANNQSDPPAAVTINSHPM 940
            SD   S    + D++PS+   + V   S ++S   ++  A   +      A++TI+    
Sbjct: 3    SDADASTSSSSIDIMPSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTIHDISQ 62

Query: 941  ITRSKAGISKRKAFSAVIPKSIPEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLVPSQ 1000
                +       +F   I K+  EP+++  A +   W  AM DE  A+   +TW +    
Sbjct: 63   FLSYEKVSPLYHSFLVCIAKA-KEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLP 122

Query: 1001 EDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRVILA 1060
             +   +GCKWV++ K+N DG++ RYKARLVAKGY Q+EG+DF ETFSPV K T+V++ILA
Sbjct: 123  PNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILA 182

Query: 1061 LAAHYGWSLHQLDVKNAFLHGYLKEDVYMVQPPGFR----DSNYPN-------------- 1120
            ++A Y ++LHQLD+ NAFL+G L E++YM  PPG+     DS  PN              
Sbjct: 183  ISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQ 242

Query: 1121 ------------------------HT---------------------------------- 1180
                                    HT                                  
Sbjct: 243  ASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELK 302

Query: 1181 -TLGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPM--S 1240
              L   F+L DLG L+YF+GLEI R + G ++ Q KY  DLL++TG+   K  S PM  S
Sbjct: 303  SQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPS 362

Query: 1241 TTCELHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRI 1300
             T   H    F  DA  YR+++G L YL  TR DI++ V+K+SQF   P   H  AV +I
Sbjct: 363  VTFSAHSGGDFV-DAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKI 422

Query: 1301 LRYLNGTRDCGILF-SKGKLELTAYSDADWAGDSMDRRSTSGYVVFFCGIPVSWSAKKQS 1323
            L Y+ GT   G+ + S+ +++L  +SDA +      RRST+GY +F     +SW +KKQ 
Sbjct: 423  LHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQ 482

BLAST of Lag0030971 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 174.1 bits (440), Expect = 1.1e-42
Identity = 94/200 (47.00%), Positives = 124/200 (62.00%), Query Frame = 0

Query: 1104 LGREFQLTDLGSLRYFIGLEITRFSDGFHVTQLKYLTDLLEKTGMADSKTCSTPMSTTCE 1163
            L   F + DLG + YF+G++I     G  ++Q KY   +L   GM D K  STP+     
Sbjct: 27   LSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLN 86

Query: 1164 LHGVSPFFSDALLYRQIVGSLQYLTFTRPDITYVVSKVSQFMHKPTEIHYSAVKRILRYL 1223
                +  + D   +R IVG+LQYLT TRPDI+Y V+ V Q MH+PT   +  +KR+LRY+
Sbjct: 87   SSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYV 146

Query: 1224 NGTRDCGILFSK-GKLELTAYSDADWAGDSMDRRSTSGYVVFF-CGIPVSWSAKKQSTVS 1283
             GT   G+   K  KL + A+ D+DWAG +  RRST+G+  F  C I +SWSAK+Q TVS
Sbjct: 147  KGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRSTTGFCTFLGCNI-ISWSAKRQPTVS 206

Query: 1284 RSSTEAEYRSLAQTAAELYW 1302
            RSSTE EYR+LA TAAEL W
Sbjct: 207  RSSTETEYRALALTAAELTW 225

BLAST of Lag0030971 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 122.9 bits (307), Expect = 2.8e-27
Identity = 65/125 (52.00%), Positives = 85/125 (68.00%), Query Frame = 0

Query: 938  MITRSKAGISK-RKAFSAVIPKSI-PEPTSFTAASKIPEWKRAMLDEYTALTNQNTWTLV 997
            M+TRSKAGI+K    +S  I  +I  EP S   A K P W +AM +E  AL+   TW LV
Sbjct: 1    MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 998  PSQEDMNIVGCKWVFRTKFNPDGSVARYKARLVAKGYNQREGVDFDETFSPVVKKTTVRV 1057
            P   + NI+GCKWVF+TK + DG++ R KARLVAKG++Q EG+ F ET+SPVV+  T+R 
Sbjct: 61   PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120

Query: 1058 ILALA 1061
            IL +A
Sbjct: 121  ILNVA 125

BLAST of Lag0030971 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 67.0 bits (162), Expect = 1.8e-10
Identity = 70/269 (26.02%), Positives = 119/269 (44.24%), Query Frame = 0

Query: 33  VPVRLDSSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRKTTEVNSAHTQWI 92
           V + L+  N+  W+   +++  +  + G +DGS               +T       +W 
Sbjct: 24  VTLDLNKLNYDVWRELFETLCLSFGVLGHIDGS---------------STPTPMTEKRWK 83

Query: 93  AQDSALITLINATLSKTAYSYVI--GCKSSKEVWLSLEKQFSSLTRSHVHELKSSLHTIS 152
            +D  +   I  T++ +    +I  GC +++++WLSLE  F     +   + ++ L T +
Sbjct: 84  ERDGLVKMWIYGTITDSLLDTIIKVGC-TARDLWLSLENLFRDNKEARALQFENELRTTT 143

Query: 153 KSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFRTSIRTREGTLK 212
                S+ EY  ++K L D L  V  PI D  L+++ LNGL+ +Y+     I+ +     
Sbjct: 144 IDDL-SVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFPS 203

Query: 213 LQELHALLKSEAKILEQQNKSI---TTTPLVPTAMFSSVSGQ---------FNSNRGRGR 272
             E  ++L  E   L  ++KS    T  P +   +F+    Q          NSN GRGR
Sbjct: 204 FTEARSMLLMEESRLSNKSKSSLSHTNHPSLSNVLFTVPRQQERYPQEYHNNNSNMGRGR 263

Query: 273 GRNSNPSYWSGRGGGRSNQGYYGNSNQGR 288
            +  N      RGGG S+ G Y N+N  R
Sbjct: 264 SKKKN------RGGG-SSDGRYNNNNNWR 268

BLAST of Lag0030971 vs. TAIR 10
Match: AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 62.8 bits (151), Expect = 3.5e-09
Identity = 80/318 (25.16%), Positives = 135/318 (42.45%), Query Frame = 0

Query: 23  IFLLSNICNLVPVRLD--SSNFLFWKFQVQSMLKAHSLFGIVDGSVPCPPEFLLDADGRK 82
           I+ +SNI + +PV LD   SN+  W+    +   +  + G +DG++              
Sbjct: 10  IYGVSNIKSHIPVMLDIEESNYDAWRELFLTHCLSFDVMGHIDGTL-------------- 69

Query: 83  TTEVNSAHTQWIAQDSALITLINATLS-KTAYSYVIGCKSSKEVWLSLEKQFSSLTRSHV 142
               N+    W  +D  +   +  TL+ K      +   +S+++WL ++ QF +   +  
Sbjct: 70  -LPTNANDVNWQKRDGIVKLSLYGTLTPKQFQGSFVTSSTSRDIWLRIKNQFRNNKDARA 129

Query: 143 HELKSSLHTISKSATESIDEYLLRVKELVDKLATVSVPIDDEDLLLYTLNGLSSEYNSFR 202
             L S L T        + +Y  ++K+L D L  V VP+ D +L++Y LNGL+ ++++  
Sbjct: 130 LRLDSELRT-KDIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNII 189

Query: 203 TSIRTREGTLKLQELHALLKSEAKILEQQNK--------SITTTPLV-----PTAMFSSV 262
             I+ R+      +   +L+ E   L++  K        S ++T L      P   F   
Sbjct: 190 NVIKHRQPFPSFDDAATMLQEEEDRLKRAIKPNPTHVDHSSSSTVLACSEAPPVTNFQRS 249

Query: 263 SGQFNSNRGRGRGRNSNPSYWSGRGGGRSNQGYYGNSNQGRGGFNSSNPSPGFPQGHSAP 322
            G     RGRGRG N     + GRGG  S   YY         FNS N  P +   +   
Sbjct: 250 GGNQMGYRGRGRGNN----IFRGRGGRFS---YYNMPT-----FNSWNRPPFYQNSYQMW 299

Query: 323 NQGRGANSGSNSSSGRGV 325
           N   G     N++ G G+
Sbjct: 310 NHPWGYPPYVNTNGGNGL 299
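
Each HSP summary line above reports identity and positives both as raw counts and as a percentage. The following is a minimal Python sketch, not part of the database output, that recomputes the percentages from the counts so the two can be cross-checked. The function name `recompute_percentages` and the regular expression are illustrative assumptions; the pattern accepts the spelling "Postives" as well as "Positives".

```python
import re

# Matches an HSP summary line such as
# "Identity = 184/524 (35.11%), Postives = 265/524 (50.57%), Query Frame = 0".
# "Posi?tives" accepts both spellings of the second field.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(summary_line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    match = HSP_SUMMARY.search(summary_line)
    if match is None:
        raise ValueError("not an HSP summary line: " + summary_line)
    id_hits, id_len, _, pos_hits, pos_len, _ = match.groups()
    identity = round(100 * int(id_hits) / int(id_len), 2)
    positives = round(100 * int(pos_hits) / int(pos_len), 2)
    return identity, positives


def reported_percentages(summary_line):
    """Return the (identity_pct, positives_pct) values printed on the line."""
    match = HSP_SUMMARY.search(summary_line)
    if match is None:
        raise ValueError("not an HSP summary line: " + summary_line)
    return float(match.group(3)), float(match.group(6))


line = "Identity = 184/524 (35.11%), Postives = 265/524 (50.57%), Query Frame = 0"
print(recompute_percentages(line), reported_percentages(line))
```

For the AT4G23160.1 hit, 184 identical positions over an alignment length of 524 gives the reported 35.11%; the other four TAIR 10 hits can be checked the same way.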

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
TQE01264.1 | 3.4e-277 | 43.15 | hypothetical protein C1H46_013171 [Malus baccata]
TQD93593.1 | 2.4e-270 | 38.94 | hypothetical protein C1H46_020801 [Malus baccata]
BBG97282.1 | 8.2e-263 | 39.81 | hypothetical protein Prudu_006352 [Prunus dulcis]
WP_081894301.1 | 3.6e-258 | 40.57 | DDE-type integrase/transposase/recombinase [Acetobacter malorum] >KFL89552.1 hyp...
CCH50966.1 | 5.3e-254 | 38.38 | T4.5 [Malus x robusta]
Match Name | E-value | Identity | Description
Q94HW2 | 4.7e-221 | 35.99 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 3.0e-207 | 34.50 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978 | 7.1e-108 | 27.13 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 4.9e-93 | 24.44 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519 | 1.5e-41 | 47.00 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...
Match Name | E-value | Identity | Description
A0A2N9GRJ0 | 1.1e-308 | 43.87 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS30097 PE=4 SV=1
A0A2N9HKM9 | 7.7e-291 | 43.72 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40116 PE=4 SV=1
A0A2N9IE26 | 1.4e-289 | 42.39 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50904 PE=4 SV=1
A0A2N9HUP1 | 8.8e-287 | 42.32 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS43647 PE=4 SV=1
A0A2N9IEP2 | 2.4e-284 | 40.96 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50263 PE=4 SV=1
Match Name | E-value | Identity | Description
AT4G23160.1 | 3.4e-73 | 35.11 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1 | 1.1e-42 | 47.00 | DNA/RNA polymerases superfamily protein
ATMG00820.1 | 2.8e-27 | 52.00 | Reverse transcriptase (RNA-dependent DNA polymerase)
AT5G48050.1 | 1.8e-10 | 26.02 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
AT1G34070.1 | 3.5e-09 | 25.16 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 1492..1512
None | No IPR available | COILS | Coil | Coil | coord: 1609..1652
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 91..222; e-value: 1.7E-19; score: 70.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1522..1600
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1461..1496
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1577..1600
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1341..1364
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 246..320
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1532..1565
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1341..1379
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1847..1883
None | No IPR available | PANTHER | PTHR45895 | FAMILY NOT NAMED | coord: 1102..1258; coord: 428..1096
None | No IPR available | CDD | cd09272 | RNase_HI_RT_Ty1 | coord: 1241..1322; e-value: 3.50745E-39; score: 141.066
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | | coord: 580..759; e-value: 7.8E-37; score: 128.5
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 990..1100; e-value: 5.4E-35; score: 121.1
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 521..574; e-value: 8.4E-10; score: 38.4
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 587..688; e-value: 6.6E-9; score: 36.0
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 585..751; score: 21.261814
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 991..1315
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 585..747
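
The `coord:` values in the InterPro annotations give inclusive start and end positions on the predicted polypeptide. The following is a minimal Python sketch, not part of the database output, that turns those ranges into domain lengths and sorts a few of the Pfam and CDD hits by start position. The names `parse_coord`, `span_length` and `DOMAIN_HITS` are illustrative assumptions; the coordinates are copied from the annotations above.

```python
import re

# Matches the "coord: start..end" field used in the Alignment column.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coord field in: " + text)
    return int(match.group(1)), int(match.group(2))


def span_length(text):
    """Length of an inclusive range, in residues."""
    start, end = parse_coord(text)
    return end - start + 1


# Pfam and CDD hits copied from the InterPro annotations for this gene.
DOMAIN_HITS = {
    "RVT_2": "coord: 990..1100",
    "rve": "coord: 587..688",
    "Retrotran_gag_2": "coord: 91..222",
    "RNase_HI_RT_Ty1": "coord: 1241..1322",
    "gag_pre-integrs": "coord: 521..574",
}

# Order the domains from N-terminus to C-terminus by start position.
ordered = sorted(DOMAIN_HITS, key=lambda name: parse_coord(DOMAIN_HITS[name])[0])

for name in ordered:
    print(name, DOMAIN_HITS[name], span_length(DOMAIN_HITS[name]))
```

Sorted this way, the hits run gag, gag-pre-integrase, integrase core (rve), reverse transcriptase (RVT_2), then RNase H, which is the domain order usually described for Ty1/copia-type retrotransposon polyproteins. That reading is an interpretation of the coordinates, not something the page states.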

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0030971.1 | Lag0030971.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding