Lag0024687 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0024687
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Integrase catalytic domain-containing protein
Location: chr10: 4959994 .. 4969979 (-)
RNA-Seq Expression: Lag0024687
Synteny: Lag0024687
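
The Location field above can be unpacked programmatically. The following is a minimal sketch, not part of the database record: it assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, but not stated on this page), and `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(text):
    """Split a location string such as 'chr10: 4959994 .. 4969979 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        "length": end - start + 1,
    }

location = parse_location("chr10: 4959994 .. 4969979 (-)")
print(location["seqid"], location["strand"], location["length"])
```

Under that assumption the gene spans 9986 bp on the minus strand of chr10.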
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGCCATGGTCGGCCTCGGCCTAAGGTCGAGGAGCTTTCCCCCTCTAGTTGGTTTCTGGTGTCCCTGATGAGCCCGGTTCTACTTGGTTCAACTCTGAATCATCTCCAAACGCCTAGAAACCCCAAAACAGGAAAATGTATATAAACCTTTCTTCATCATTGAAGAAGAGATCGCAAACTCTACTCTCAGAACTCTCTCTTCTTCCTCTAACTCTCTGGTCTTGCTAATCTTTTCCTTCTGCTTTCTGACTTAAGCATCGGAGGTGGTGTGGCAAGCACCACACCGGTGTGTCTTGCATGCCACGTCTTCCCCATCTCATACATTTGGTGGCACGTGAAGGTTAGGTCTCACACCCATATTAATGAAACACCATCCCTATTTGATTCAAGCATTTTTCTTCTCCAAGATTTATGTTTATTCATGCAGTTAAGCAAGTCGTATACATGAGAGAAAGTTATACACTAACGATTTTAAAACAAAATTAGTTTACAATTTTAATTAAAACAAATTATCAGTTTGAAAGATACCACCCCAAAGTTTAGAGATATAAATAGATTTTTTTTTTCCAAAATTTTTAATAACCACACTTAGGAAAGGAAATTAACATCCTATTCTTTCATGGGGCCGGATATGTGATAAAAGGACCTTTAATAATGTTGTCTGCCATTGGTTTGGAAGATATTTTGTGAGGCAGCCATCATATGGGTTTTGCTGCCAGAAGTTGAACTTAAGCCGCCGTCTATCTGTTTTGGTCTGATGATAATGAAGCGTTTAAGAAAGTCACAGAGCTTTCCTTCAATTCCAAATCCACACTACGAAAATGAAAATGGGAAAAATATATCTGCTGAACATGAACATGAACATGCTGTTGGAGACCAATTTCTTGCTGCTCTGTTGAACAGCACGACAGTGGAGAGCCAGCAGACAATGGATGTCGATGCACCCTCGGATTCCGAATTCCAAACTAGTGGTAAAAATTCCAACTCCTTTTTATCGAGTTCTAATTTCTCTAACTACTTTTCAGATTATCAAAATTAACATTCTGTAATTAGACGAACCAAAATTACACGGCCAGCCTTTTTTTTTATATAGAGTATAAAAAAAATTCAGACCGTTTAGGTGCTGCTTGATGATGATTTGGACTCTGGATTTTAGTTTTTGAAAATTAAGCTTGCATACACTATTTGTTTTGTTGTATACTTTTTAATATTTTCAAAATTCAATCGAAGTGATGAAAACTAAAAACAAAAAAAAATAAAAAATAAAAGTTTTCAAAAACTTGATAACTATCGAGTTTTTGGTCTTTAGTCTCTAACTTGTTGACCATCTAGGCTCTAGCTTTTTGTTTTTTGAAAATTAAGTGTATAAAAACTATGAGTTTTTAAGTTTTTTCTTATACTTTATACTAGTATTTTCAAAACACAAGCTAAGTTTTGAAAACTAAAAAAATTAGTTTTCACAAAAAGATTTGTCTTTGGTCTTTAGTTTGATAACCATTTAGTCTATGGTTTTTTGGTTTTGAAAATTAAATATACTATGAGTTTCTTTGTTTTGGTGTATACCCGTCTTAATTTTTGTTGGCATCCTTAGTTCTTTCTTAAGAAAATAGTACAAAGTGTTCAAGAGATTTTTTTTCTTTTAAAAAAAACTTATTTAAAATATTTTTCCCATTTTTGTTAAGTGGAATTAAATAATATTAGACAAACGATTTAAGGGGGAGAGGGAGAATTTGGAGGGGAGGGAGTGAGACTCGGAGAGAAGAACATTTTCAATGCAATATATGAGGTGGAGATTTGAATCTCTAACTATTTGAACGATGAACGATGGAATGATGGGGAGAACTAACGACAGATTTGAATATCAAACTCTATTAATGTATATGTAACCATAGTTAAGTTGTTGTAACTCTTATAGTTAAGTTGTTGTAACTCTCAGTTCTCTCTCTTGATATTATTTCTTTTATTCTGTTATTTTAGTTCTATATAAATATGGGTTGT
ATAGCAGCAGTTTAATAATATAAGAGAGAAACAGTTTATTTACTCTCATTCTTGATATGGTATCAGAGCGCCGATAAGCTTCTTGCTTCTTCTGTGCTTCCTCATCTCTCTTCTTCTTTCTGTGCTTTCGCATCTTTCTGTTTTTTTTCCATGGCGGGAACTGGAAGCGCGAATTCTTCTTCATCGAATGTCACTGCTACATCGATTGAAGCTCAGATCAATCCATATTTCCTACACCATTCTTTTGGATCATCCTCTGTTCTTGTTTCACAGCCATTGCTCGGTGCGGTCAATTACACTTCCTGGAGTCGCGCAATGAGGATGGTAATTTCAGGTAAGAACAAGTTAGGGTTCATCACTGGTAAAATCTCCAAACCTCAGGAAGAAGGTGCCTTGCTTGAGGCTTGGGAATGCAATAATGATATTATTGCTTCATGGATTTTGAATTCCGTCTCGAAAGAAATCGCGGCGAGTATTGTATACACCGGTTCCGTTAAAGCTGTTTGGGATGAGTTGCAAGAACGATTCAAGCAAGCAAGTGGACCAGGAATTTATCAACTTCGCAAGGATTTGGTCACATTACGCCAAGGATCGATGTCAGTTGAAGTCTATTATACAAAGCTCAAGACAATTTGGCAAGATCTCAGTGATCTCCGCCCAACAGCAAGTTGTACTTGTGGAGGATTGAAGCCATTTCTTGAGCATCTTGATTCGGAGTATGTGATGACCTTCTTGATGGGATTAAATGAGACGTATGCGGCAATAAGGGCACAAATTTTGTTGATGAAGCCGCTTCCGTCCATCACTGAGGCCTTTTCCCTGCTGATTCAAGAAGAACATCAACGTTCGGCTGGAATCCTTGGACCATCACCAGATCCTATTGCCTTGGCAGTTAATGATACTTCGAAAACCTCTGATCCTCCTCGAAGGAAAGAGAATTCAGGACAACGACCGGTTTGTTCTCACTGCGGCATTAAGGGACATGTGAAAGATCGATGCTATAAATTGCACGGATATCCTCCCGGATACAAATTTCGATCTTCGAATTCTCCTGATAATTCTGCATCTGCTAAATCTGTTGTTGCTGCAAATTCTGCCGCTGCGTCAAGTTGTCCTCCTGCTACACCTAATTTTTTCTCTAGTCTCAACAACGTTCAGTATGGTCAGTTGATGGAATTGTTCAATTCGCACCTTCAGGCCGCCAAGACTGATCCTATTACCGTAGCTTCTGCTGTTTCACATGCTACAGGTATTTGTCACTTGGCTTCTTCCCCCATTACTTCTCTGAATGATTGTTGGATTGTTGATTCTGGTGCGTCTAGACATATTTGTCACACTCGAGCAGCCTTCCGTAACTGGCGGCGCATTGACCCTATTTCTATTGTTCTTCCTACTGCATATAGAGTGTGTGTTGAATATGTTGGGGAAATTCATATCTCAGCTGCTCTTGTTCTGCGTGATGTGCTTTTTGTCCCTGATTTTGCTTATAACCTGATGTCAGTGAGTTGCTTACTTCAGTCTGGAGAATTCTCTGTAGCCTTTACTAATAATCATTGCCTCATACAGGACAAACAGCTATTGACAATGATTGGCAAGGCTGAATGTCACCATGGTCTTTACATTCTCTCTAACATTTCAGGTTCTTCGACTGATTCATTGGTTACTTCGACTGTTGTTTCTGATATACCTGCTGTTTTCTCTGTTTCTGCTTCTATATGGCATTCTCGTCTAGGCCACTTATCTCCCAGACGTTTGTCCATGCTTAGAGATACTTTACAGTTTAAAGATTCTTGTGATGCCTCATGTACCGTTTGTCCATTAGCTAAACAAAAACGGATGCCTTTTACTTCCAATAATCATGTAGCATCAAATGTTTTTGATCTTATACATGCTGACGTTTGGGGACCCCTTAATACTTCTACTTATGATGGCTATCGCTATTTTTTAACCCTAGTTGATGATGCTTCTCGATTCACTTGGGTTTATCTACTGCGACAAAA
ATCCGATGTTTTGATGATCATTCCTCGTTTTTTCAAGATGGTAGAAACCCAATTCTCTAAGACCATAAAATGTTTTCGTTCTGACAATGCACCAGAGCTTAAGTTCACTGATTTTTTTGCCTCCACTGGGACTATACATCAATTTTCTTGTGTGGAGAGACCTCAACAAAACTCAGTTGTAGAACGGAAACACCAACACCTGTTAAATGTTGCCCGTAGCCTGTATTTCCAGTCTCGTGTGCCTTTACGTTTTTGGGAGACTGTATTCTCACAGCCACATTTCTCATCAATAGGACTCCTATGCCCTTATTACAAAATGAATGTCCTTTTACTGTCTTATATGGTGATGCAGTTGACTATTCATTTTTACGTGTTTTTGGTTGTCTATGCTACGCGTCCACCCTTACTGCTGGACGTTCAAAATTTGACCCTCGTGCAAGACCTTGTGTTCTCCTTGGTTATCCGCCTGGAATGAAAGGATATCGTTTATATGATATCTCCAAGAAGCAAGTTTTTATATCTCGTGATGTAACCTTCATTGAGGAAAACTTTCCTTTCCATTCTATTATTAATAATGACCAGCTCGTGGATGTTTTTCCAGGCCTTGTTTTACCTTTACCGGTGTTTGACTTTGTTCCATCACCTAGTGTTCCAAATAACACTGCAGCAACACCTTCTGAAGTTGGTTTTCCTCCGGCTGGTCTTCCATCTGATGTGTCTGCTGAGAATGGTGATATGCTACTTCAAGCATCACCTGCTGCTAGTGATGCTGATAGCCCTGCTCAGTCTACTGGGGTTATTCAGCCACGACGTTCAACAAGGCAACGACATCCACCTGATTTTTTGAAGGATTATCACTGTCACCTTTTGAGACATGATGTTCCTTTGCTTAGTCATGATGTTCCTCACTCCCTTGACAAATATGTTTCATATAGGCGCTTCTCTAATGATCATCGACAGTTCATTTTGAATGTTTCAACAGATTTTGAACCCACCTACTATCATCAGACAGTAAAATTTGCTCCTTGGCGCAAGGCTATGGATGATGAAATTGCAGCCATGGAGCGCACTAACACCTGGACTCTGGTTCCCTTACCATCTGGACGTCGTGCAGTTGGATGCAAGTGGGTATACAAAGTCAAATATAAAGCCGATGGAACTGTAGATCGATACAAAGCGCGCCTCGTTGCCAAGGGTTACAATCAGCAAGAAGGTATTGACTTTCTTGAGACTTTTTCTCCTGTGGCCAAGATTGTAACTGTTAAAGTTCTCCTTTCTTTAACGGCTTCTTTTGGTTGGTCTCTTGTGCAATTGGATGTGAATAATGCTTTTCTAAATGGTGATTTGTTTGAAGAGGTTTATATGTCGCTACCTCTTGGTTATTATACCGATCGTAAGTCTTCTGGTTGTACTCCTATTGTTTGTAAGCTCAACAAGTCCATATATGGGCTTAAACAGGCTTCTAGACAATGGTTCTCCAAGTTTTCTAGTGTTCTTCTTGCTAATGGGTTTTCACAGTCCAAAACTGACTACTCTCTTTTCATAAGAGGCCATGGTATTTCCTTTGTTGCGTTGCTTGTATATGTAGACGATATCTTAATTACTGGACCGTCTACTACTGAAATCAGTGCTGTGAAAGACATCCTATATCGTCACTTCTTACTTAAAGATCTTGGTCATGCCAAGTATTTCCTCGGCTTGGAGCTCTCTCGGTCTGATAAAGGGATCTACTTATCCCAGCGTAAGTACTGTCTTCAACTTATCGAAGACTCTGGCTATCTAGCTGCTAAACCGGTTAATCACCCCATGATTCCTAACCTTCGGTTATCTGCTAATGGTTCTGGGGAGCTTTTGAATGCTGATGATGCTAGTTCCTATAGACGATTGGTTGGTCGATTGCTATATTTGCAGGTATCTCGACCAGACATATCATTCACGGTTCATAATCTCAGCCAGTTTATAGCCAAGCCGTGTGTTCGCCATTTAGATGTTGTTTTTCA
TTTGTTGAGGTATTTAAAGGGCACTGCTGGACAGGGGATTCTTTTGCATTCCTCTCGAGATTTTCATCTTAAAGCCTTTGCTGATTCGGATTGGGGTTCTTGTCCGGACTCGAGGAAATCCGTCACTGGTTTTTGTATTTTCTTGGGAAAATCTCTTGTGTCGTGGAAGTCTAAGAAACAAGCCACAGTTTCACGCTCTTCAGCTGAGGCCGAATATCGTGCCCTCGCTACTGTTTCTAGTGAACTAATTTGGCTTTCTCATTTGCTCAAAGACTTACAAGTTCCCCTTCGGTCTCCTTCTGTGGTTTATTGTGATAATCTTGCTGCAATTGCTATTGCCAACAATCCAACCTTCCACGAGCGTACGAAACACATTGAAATTGATTGCCATTTCGTACGAGATCTCATTCAACAAAGAATTCTGCGGCTTCTACCCATTCGATCCAACCTTCAGTTGGCAGACATGTTTACCAAGCCACTCAATGCCCCTGCCATTCGTAATTTCTTAGTCAAGATGGGCATTTTTGATCTCCATAGTCCATCTTGAGGGGGGATATTAATGTATATGTAACCATAGTTAAGTTGTTGTAACTCTTATAGTTAAGTTGTTGTAACTCTCAGTTCTCTCTCTTGATATTATTTCTTTTATTCTGTTATTTTAGTTCTATATAAATATGGGTTGTATAGCAGCAGTTTAATAATATAAGAGAGAAACAGTTTATTTACTCTCATTCTTGATAAACTCTTTCCATGCTTATAAGTGTTGCATATAATTGTATTTGGCATATGATATGTTGATCAGCTGCAACGAAAATTTTTCTGCATCAATATGCACTAAAGGGTGAGTGGGAATATGTGGAATTACTGATGGACGAGTGCCCACATTATGTTCGTTCCACAATAACAAGAAACAAAGAGACCATTCTTCATATTGCTGCGGGAGCCAAACAAACTGAATTTGTGGAGAAATTGCTGCACAGAATGACCTCTGCTGACATGACACTGCAAAACAAATATGGAAACACAGCCCTTTGTTTTGCTGCTGCTTCGGGAGTTGTAAGAATTGCTCAGCTAATGGTGCAAAAGAACAAACATCTTCCACTTATTCGTGGCTTCAACAATATTGTAACTCCACTTTTCATAGCTGTATCATACAAGTGCATAGAGATGGTTTCTTATCTCTTGTCTATCACTGATCTCGACCAACTAAACAACCAAGAACAAATCGAGCTTCTTATTGCCACCATACATGGCAACTTTTTTGGTAATCATCTGTTTAGTCTCTGAGGTTTGTATTTATTTGGTTTCTAAACTTTAAAAAGTATTTAATAGGTTCTTAAGCGTATATAATAGGTCATTCAACTTTCAATTGTGTTTAGTAATTAGTTCGTAAATTTTGAAAAATGTTAACTAGAACCTATTAGACACAAAAAATTGAAAGTTTAGGTTCCGTTTGATAAACATTTGGTTTTTGAAAATTATACTTGTTTTTTGTTAAATTCTCTACCATGGTTTTCATGTTTGTTAAGGATCCATTTGAATACCTATAGCCAAATTCCAAAAACAAAAACAAGTTTTAGAAAACTATTTTTTTCTTTTTTCAAAATTTGACTTGGTTTTTTAAAACAAATAAGGTAGATATACTAAAACAAAGAAAGTACTTATGAGTGAAAGTATGTGTGTATAAGCTTAATTTTCAAAATCAACAACCAAAAACCAAAAGGTTATCAAACGGGACTTTAGGGACTTATTTAGACACAAAATTGTAAGTTTAAAGGCCTACCGAACACTTTTTAGAGTTTAAAGATCTATTAGATACTTTTTAAAATTTAAAGACTTTCTAGACACAAACACGGATTAAACTAATAATTTAACGTTTTTTTATTTTATCTCACCTATGGTAAATTTACCTGGCCTTTAGACTCTGTCTTAAATCTTCTTCACACAGATATAAGTATATGGATTCTGCAAAGATATCCATATTTAGCTACTATGAAAGA
CATGAATGAAGAGACTGCATTGCATGTGATGGCCAGAAAACCATCTGCCATGGATGTCACAAAGCAGCAAAGCATTTGGGAAAAGTACATTAACTCATGTTAGTTTTTTCCAAACAACGTAACCCTTTAATTTTTCTTTTATTTTACAGAATTCATTTGAATAAAACTCTAGCTAATTAATTATTCCTTCATGTTTTGACTTGATGGGCTTCAAAGGGATCTATGGTAAAGCTATGACGAAGACACTGGCTCATGAATTAGTTGTTTTGCTATGGACCAATGTTCTACGGAATTTGCCAGAAAAGGAGATGCTCCAATTCATTAACCATCCCACGAGATTACTGAATGAGGCTGCATGTACAGGAAATGTTGAGTTTTTGATTGTGCTCATTCGTAAATACCCGGATATAATATGGGAAGATGATGATGATGGTAAAAGCATCTTTCATGTAGCCATTGAAAATCGACTTGAAAATGTGTTCAATCTAATCAATGAGATTGGAAGGCTCAATGAGTTCACAGCAAAATATAGAACTTTCAAGGGGCGAAATTACAACATCCTGCATTTGGCTGGAAATCTAGCTGCTCCAAACCATCTCAATAGGGTCTCAGGAGCTGCCCTTCAAATGCAACGTGAATTGCTATGGTTCAAGGTACTTGTTTAATTTTAAGGTTATTGTAATGAGTAGCAATTGAAAATGAACAATACTTGACTGCATACCAATATGTAGGGATTTCTATTCCATTAAAATCTTGCCAACTAAGAAAGATAAAGGTAGAAAATAAGAAAAAAAAATTATTTTTCCTAGTTGTCAACATTTTAATGATGGCATGTGTAGAGAGATCCCTACATACCAGTATGCAACCAAATTTTTTCCATTGAAGATTTATTATTTGCAAATATAACGACCTTAACTTTTAAAAATTTTGCAAATATGGAAAGTCTATCAATGATAAACATCTATCACTGATAGACTTCCATTATCACAATCCACATGGTAGTTAAAATCAGCAATAATTGAATTGTGATTAAAGTCTATCATTGATTGACTTCTATCATTTGATGGACTTAAATTTTGTTAAATTTACAAATGGTTTCGATAGATCTATTAGTGATAGACATCATTAATAGACTTCGATCACAATGTAGTATATTAATGATAGACTACTATCACTGATAGAATAGTCTTCCAGTAGTAATTATACTTATGATTGAAGTCTATAGTATATTTGCAAATAGTTTTAGATTTTTAATTATTTTAGTCTATCCCTTTGAATTTTTTCTTTTCTTCCCTCTCATGAACGTTTAAAACTTTGAACTTGAAGGAAGTAGAGAAGATAGTTTTGCCTTCTCAACTCGAGGCCAAATCCAATGTTTTGTCTTACCAACACAAGCCCAAATCCAATTATCCAAATGTACCAAAGTTGACGCCACGCCAATTATTCACCCAAGAGCACAAAGATCTTCGCAAAGATGGTGAGGAGTGGATGAAAAACACAGCAAACTCATGCATGCTGGTCGCAACTTTAATCTCTACTGTAGTTTTTGCTGCAGCCTTCACAGTTCCTGGTGGCAGTGATAACAATGCAGGCACTCCTATTTTTCAACACAAGTTTTGGTTCACTGTGTTTTTAATGTCTGATGCTGTCGCTTTGTTTGCATCCTCAACTTCTATTCTAATGTTCATGTCCATCCTAACTTCACGCTACGCAGAAGAAGATTTTGTACACTCATTGCCGTCCAGATTGCTCTTTGGACTTGCAGCACTCTTCATATCCATTGTGTGCATGGTGGTGGCCTTTAGCGCCACATTCTTCTTGCTCTACCATAAGGCTAATATATGCATTCCCACTGTAGTTGCTGCGATGGCGATTGTTCCAGTTAGTTGTTTTTGTGCACTTCAGTTTAAACTTTGGGTTGATATTTTTCACAACACTTACTCGTCTAGATTCCTTTTCAAACCTCGTCAACGTAAATTGTTTTGA

mRNA sequence

ATGGGCCATGGTCGGCCTCGGCCTAAGGTCGAGGAGCTTTCCCCCTCTAGTTGGTTTCTGGTGTCCCTGATGAGCCCGGTTCTACTTGGTTCAACTCTGAATCATCTCCAAACGCCTAGAAACCCCAAAACAGGAAAATCCATCATATGGGTTTTGCTGCCAGAAGTTGAACTTAAGCCGCCGTCTATCTGTTTTGGTCTGATGATAATGAAGCGTTTAAGAAAGTCACAGAGCTTTCCTTCAATTCCAAATCCACACTACGAAAATGAAAATGGGAAAAATATATCTGCTGAACATGAACATGAACATGCTGTTGGAGACCAATTTCTTGCTGCTCTGTTGAACAGCACGACAGTGGAGAGCCAGCAGACAATGGATGTCGATGCACCCTCGGATTCCGAATTCCAAACTAGTGGGGGAGAGGGAGAATTTGGAGGGGAGGGAGTGAGACTCGGAGAGAAGAACATTTTCAATGCAATATATGAGAGCGCCGATAAGCTTCTTGCTTCTTCTGTGCTTCCTCATCTCTCTTCTTCTTTCTGTGCTTTCGCATCTTTCTGTTTTTTTTCCATGGCGGGAACTGGAAGCGCGAATTCTTCTTCATCGAATGTCACTGCTACATCGATTGAAGCTCAGATCAATCCATATTTCCTACACCATTCTTTTGGATCATCCTCTGTTCTTGTTTCACAGCCATTGCTCGGTGCGGTCAATTACACTTCCTGGAGTCGCGCAATGAGGATGGTAATTTCAGGTAAGAACAAGTTAGGGTTCATCACTGGTAAAATCTCCAAACCTCAGGAAGAAGGTGCCTTGCTTGAGGCTTGGGAATGCAATAATGATATTATTGCTTCATGGATTTTGAATTCCGTCTCGAAAGAAATCGCGGCGAGTATTGTATACACCGGTTCCGTTAAAGCTGTTTGGGATGAGTTGCAAGAACGATTCAAGCAAGCAAGTGGACCAGGAATTTATCAACTTCGCAAGGATTTGGTCACATTACGCCAAGGATCGATGTCAGTTGAAGTCTATTATACAAAGCTCAAGACAATTTGGCAAGATCTCAGTGATCTCCGCCCAACAGCAAGTTGTACTTGTGGAGGATTGAAGCCATTTCTTGAGCATCTTGATTCGGAGTATGTGATGACCTTCTTGATGGGATTAAATGAGACGTATGCGGCAATAAGGGCACAAATTTTGTTGATGAAGCCGCTTCCGTCCATCACTGAGGCCTTTTCCCTGCTGATTCAAGAAGAACATCAACGTTCGGCTGGAATCCTTGGACCATCACCAGATCCTATTGCCTTGGCAGTTAATGATACTTCGAAAACCTCTGATCCTCCTCGAAGGAAAGAGAATTCAGGACAACGACCGGTTTGTTCTCACTGCGGCATTAAGGGACATGTGAAAGATCGATGCTATAAATTGCACGGATATCCTCCCGGATACAAATTTCGATCTTCGAATTCTCCTGATAATTCTGCATCTGCTAAATCTGTTGTTGCTGCAAATTCTGCCGCTGCGTCAAGTTGTCCTCCTGCTACACCTAATTTTTTCTCTAGTCTCAACAACGTTCAGTATGGTCAGTTGATGGAATTGTTCAATTCGCACCTTCAGGCCGCCAAGACTGATCCTATTACCGTAGCTTCTGCTGTTTCACATGCTACAGGTATTTGTCACTTGGCTTCTTCCCCCATTACTTCTCTGAATGATTGTTGGATTGTTGATTCTGGTGCGTCTAGACATATTTGTCACACTCGAGCAGCCTTCCGTAACTGGCGGCGCATTGACCCTATTTCTATTGTTCTTCCTACTGCATATAGAGTGTGTGTTGAATATGTTGGGGAAATTCATATCTCAGCTGCTCTTGTTCTGCGTGATGTGCTTTTTGTCCCTGATTTTGCTTATAACCTGATGTCAGTGAGTTGCTTACTTCAGTCTGGAGAATTCTCTGTAGCCTTTACTAATAATCATTGCCTCATACAGGACAAACAGCTATT
GACAATGATTGGCAAGGCTGAATGTCACCATGGTCTTTACATTCTCTCTAACATTTCAGGTTCTTCGACTGATTCATTGGTTACTTCGACTGTTGTTTCTGATATACCTGCTGTTTTCTCTGTTTCTGCTTCTATATGGCATTCTCGTCTAGGCCACTTATCTCCCAGACGTTTGTCCATGCTTAGAGATACTTTACAGTTTAAAGATTCTTGTGATGCCTCATGTACCGTTTGTCCATTAGCTAAACAAAAACGGATGCCTTTTACTTCCAATAATCATGTAGCATCAAATGTTTTTGATCTTATACATGCTGACGTTTGGGGACCCCTTAATACTTCTACTTATGATGGCTATCGCTATTTTTTAACCCTAGTTGATGATGCTTCTCGATTCACTTGGGTTTATCTACTGCGACAAAAATCCGATGTTTTGATGATCATTCCTCGTTTTTTCAAGATGGTAGAAACCCAATTCTCTAAGACCATAAAATGTTTTCGTTCTGACAATGCACCAGAGCTTAAGTTCACTGATTTTTTTGCCTCCACTGGGACTATACATCAATTTTCTTGTGTGGAGAGACCTCAACAAAACTCAGTTGTAGAACGGAAACACCAACACCTGTTAAATGTTGCCCGTAGCCTGTATTTCCAGTCTCTTGACTATTCATTTTTACGTGTTTTTGGTTGTCTATGCTACGCGTCCACCCTTACTGCTGGACGTTCAAAATTTGACCCTCGTGCAAGACCTTGTGTTCTCCTTGGTTATCCGCCTGGAATGAAAGGATATCGTTTATATGATATCTCCAAGAAGCAAGTTTTTATATCTCGTGATGTAACCTTCATTGAGGAAAACTTTCCTTTCCATTCTATTATTAATAATGACCAGCTCGTGGATGTTTTTCCAGGCCTTGTTTTACCTTTACCGGTGTTTGACTTTGTTCCATCACCTAGTGTTCCAAATAACACTGCAGCAACACCTTCTGAAGTTGGTTTTCCTCCGGCTGGTCTTCCATCTGATGTGTCTGCTGAGAATGGTGATATGCTACTTCAAGCATCACCTGCTGCTAGTGATGCTGATAGCCCTGCTCAGTCTACTGGGGTTATTCAGCCACGACGTTCAACAAGGCAACGACATCCACCTGATTTTTTGAAGGATTATCACTGTCACCTTTTGAGACATGATGTTCCTTTGCTTAGTCATGATGTTCCTCACTCCCTTGACAAATATGTTTCATATAGGCGCTTCTCTAATGATCATCGACAGTTCATTTTGAATGTTTCAACAGATTTTGAACCCACCTACTATCATCAGACAGTAAAATTTGCTCCTTGGCGCAAGGCTATGGATGATGAAATTGCAGCCATGGAGCGCACTAACACCTGGACTCTGGTTCCCTTACCATCTGGACGTCGTGCAGTTGGATGCAAGTGGGTATACAAAGTCAAATATAAAGCCGATGGAACTGTAGATCGATACAAAGCGCGCCTCGTTGCCAAGGGTTACAATCAGCAAGAAGGTATTGACTTTCTTGAGACTTTTTCTCCTGTGGCCAAGATTGTAACTGTTAAAGTTCTCCTTTCTTTAACGGCTTCTTTTGGTTGGTCTCTTGTGCAATTGGATGTGAATAATGCTTTTCTAAATGGTGATTTGTTTGAAGAGGTTTATATGTCGCTACCTCTTGGTTATTATACCGATCGTAAGTCTTCTGGTTGTACTCCTATTGTTTGTAAGCTCAACAAGTCCATATATGGGCTTAAACAGGCTTCTAGACAATGGTTCTCCAAGTTTTCTAGTGTTCTTCTTGCTAATGGGTTTTCACAGTCCAAAACTGACTACTCTCTTTTCATAAGAGGCCATGGTATTTCCTTTGTTGCGTTGCTTGTATATGTAGACGATATCTTAATTACTGGACCGTCTACTACTGAAATCAGTGCTGTGAAAGACATCCTATATCGTCACTTCTTACTTAAAGATCTTGGTCATGCCAAGTATTTCCTCG
GCTTGGAGCTCTCTCGGTCTGATAAAGGGATCTACTTATCCCAGCGTAAGTACTGTCTTCAACTTATCGAAGACTCTGGCTATCTAGCTGCTAAACCGGTTAATCACCCCATGATTCCTAACCTTCGGTTATCTGCTAATGGTTCTGGGGAGCTTTTGAATGCTGATGATGCTAGTTCCTATAGACGATTGGTTGGTCGATTGCTATATTTGCAGGTATCTCGACCAGACATATCATTCACGGTTCATAATCTCAGCCAGTTTATAGCCAAGCCGTGTGTTCGCCATTTAGATGTTGTTTTTCATTTGTTGAGGTATTTAAAGGGCACTGCTGGACAGGGGATTCTTTTGCATTCCTCTCGAGATTTTCATCTTAAAGCCTTTGCTGATTCGGATTGGGGTTCTTGTCCGGACTCGAGGAAATCCGTCACTGGTTTTTGTATTTTCTTGGGAAAATCTCTTGTGTCGTGGAAGTCTAAGAAACAAGCCACAGTTTCACGCTCTTCAGCTGAGGCCGAATATCGTGCCCTCGCTACTGTTTCTAGTGAACTAATTTGGCTTTCTCATTTGCTCAAAGACTTACAAGTTCCCCTTCGGTCTCCTTCTGTGGTTTATTGTGATAATCTTGCTGCAATTGCTATTGCCAACAATCCAACCTTCCACGAGCGTACGAAACACATTGAAATTGATTGCCATTTCGTACGAGATCTCATTCAACAAAGAATTCTGCGGCTTCTACCCATTCGATCCAACCTTCAGTTGGCAGACATGTTTACCAAGCCACTCAATGCCCCTGCCATTCCTGCAACGAAAATTTTTCTGCATCAATATGCACTAAAGGGTGAGTGGGAATATGTGGAATTACTGATGGACGAGTGCCCACATTATGTTCGTTCCACAATAACAAGAAACAAAGAGACCATTCTTCATATTGCTGCGGGAGCCAAACAAACTGAATTTGTGGAGAAATTGCTGCACAGAATGACCTCTGCTGACATGACACTGCAAAACAAATATGGAAACACAGCCCTTTGTTTTGCTGCTGCTTCGGGAGTTGTAAGAATTGCTCAGCTAATGGTGCAAAAGAACAAACATCTTCCACTTATTCGTGGCTTCAACAATATTGTAACTCCACTTTTCATAGCTGTATCATACAAGTGCATAGAGATGGTTTCTTATCTCTTGTCTATCACTGATCTCGACCAACTAAACAACCAAGAACAAATCGAGCTTCTTATTGCCACCATACATGGCAACTTTTTTGATATAAGTATATGGATTCTGCAAAGATATCCATATTTAGCTACTATGAAAGACATGAATGAAGAGACTGCATTGCATGTGATGGCCAGAAAACCATCTGCCATGGATGTCACAAAGCAGCAAAGCATTTGGGAAAAGTACATTAACTCATGGATCTATGGTAAAGCTATGACGAAGACACTGGCTCATGAATTAGTTGTTTTGCTATGGACCAATGTTCTACGGAATTTGCCAGAAAAGGAGATGCTCCAATTCATTAACCATCCCACGAGATTACTGAATGAGGCTGCATGTACAGGAAATGTTGAGTTTTTGATTGTGCTCATTCGTAAATACCCGGATATAATATGGGAAGATGATGATGATGGTAAAAGCATCTTTCATGTAGCCATTGAAAATCGACTTGAAAATGTGTTCAATCTAATCAATGAGATTGGAAGGCTCAATGAGTTCACAGCAAAATATAGAACTTTCAAGGGGCGAAATTACAACATCCTGCATTTGGCTGGAAATCTAGCTGCTCCAAACCATCTCAATAGGGTCTCAGGAGCTGCCCTTCAAATGCAACGTGAATTGCTATGGTTCAAGGAAGTAGAGAAGATAGTTTTGCCTTCTCAACTCGAGGCCAAATCCAATGTTTTGTCTTACCAACACAAGCCCAAATCCAATTATCCAAATGTACCAAAGTTGACGCCACGCCAATTATTCACCCAAGAGCACAAAGATCTTCGCAAAGAT
GGTGAGGAGTGGATGAAAAACACAGCAAACTCATGCATGCTGGTCGCAACTTTAATCTCTACTGTAGTTTTTGCTGCAGCCTTCACAGTTCCTGGTGGCAGTGATAACAATGCAGGCACTCCTATTTTTCAACACAAGTTTTGGTTCACTGTGTTTTTAATGTCTGATGCTGTCGCTTTGTTTGCATCCTCAACTTCTATTCTAATGTTCATGTCCATCCTAACTTCACGCTACGCAGAAGAAGATTTTGTACACTCATTGCCGTCCAGATTGCTCTTTGGACTTGCAGCACTCTTCATATCCATTGTGTGCATGGTGGTGGCCTTTAGCGCCACATTCTTCTTGCTCTACCATAAGGCTAATATATGCATTCCCACTGTAGTTGCTGCGATGGCGATTGTTCCAGTTAGTTGTTTTTGTGCACTTCAGTTTAAACTTTGGGTTGATATTTTTCACAACACTTACTCGTCTAGATTCCTTTTCAAACCTCGTCAACGTAAATTGTTTTGA

Coding sequence (CDS)

ATGGGCCATGGTCGGCCTCGGCCTAAGGTCGAGGAGCTTTCCCCCTCTAGTTGGTTTCTGGTGTCCCTGATGAGCCCGGTTCTACTTGGTTCAACTCTGAATCATCTCCAAACGCCTAGAAACCCCAAAACAGGAAAATCCATCATATGGGTTTTGCTGCCAGAAGTTGAACTTAAGCCGCCGTCTATCTGTTTTGGTCTGATGATAATGAAGCGTTTAAGAAAGTCACAGAGCTTTCCTTCAATTCCAAATCCACACTACGAAAATGAAAATGGGAAAAATATATCTGCTGAACATGAACATGAACATGCTGTTGGAGACCAATTTCTTGCTGCTCTGTTGAACAGCACGACAGTGGAGAGCCAGCAGACAATGGATGTCGATGCACCCTCGGATTCCGAATTCCAAACTAGTGGGGGAGAGGGAGAATTTGGAGGGGAGGGAGTGAGACTCGGAGAGAAGAACATTTTCAATGCAATATATGAGAGCGCCGATAAGCTTCTTGCTTCTTCTGTGCTTCCTCATCTCTCTTCTTCTTTCTGTGCTTTCGCATCTTTCTGTTTTTTTTCCATGGCGGGAACTGGAAGCGCGAATTCTTCTTCATCGAATGTCACTGCTACATCGATTGAAGCTCAGATCAATCCATATTTCCTACACCATTCTTTTGGATCATCCTCTGTTCTTGTTTCACAGCCATTGCTCGGTGCGGTCAATTACACTTCCTGGAGTCGCGCAATGAGGATGGTAATTTCAGGTAAGAACAAGTTAGGGTTCATCACTGGTAAAATCTCCAAACCTCAGGAAGAAGGTGCCTTGCTTGAGGCTTGGGAATGCAATAATGATATTATTGCTTCATGGATTTTGAATTCCGTCTCGAAAGAAATCGCGGCGAGTATTGTATACACCGGTTCCGTTAAAGCTGTTTGGGATGAGTTGCAAGAACGATTCAAGCAAGCAAGTGGACCAGGAATTTATCAACTTCGCAAGGATTTGGTCACATTACGCCAAGGATCGATGTCAGTTGAAGTCTATTATACAAAGCTCAAGACAATTTGGCAAGATCTCAGTGATCTCCGCCCAACAGCAAGTTGTACTTGTGGAGGATTGAAGCCATTTCTTGAGCATCTTGATTCGGAGTATGTGATGACCTTCTTGATGGGATTAAATGAGACGTATGCGGCAATAAGGGCACAAATTTTGTTGATGAAGCCGCTTCCGTCCATCACTGAGGCCTTTTCCCTGCTGATTCAAGAAGAACATCAACGTTCGGCTGGAATCCTTGGACCATCACCAGATCCTATTGCCTTGGCAGTTAATGATACTTCGAAAACCTCTGATCCTCCTCGAAGGAAAGAGAATTCAGGACAACGACCGGTTTGTTCTCACTGCGGCATTAAGGGACATGTGAAAGATCGATGCTATAAATTGCACGGATATCCTCCCGGATACAAATTTCGATCTTCGAATTCTCCTGATAATTCTGCATCTGCTAAATCTGTTGTTGCTGCAAATTCTGCCGCTGCGTCAAGTTGTCCTCCTGCTACACCTAATTTTTTCTCTAGTCTCAACAACGTTCAGTATGGTCAGTTGATGGAATTGTTCAATTCGCACCTTCAGGCCGCCAAGACTGATCCTATTACCGTAGCTTCTGCTGTTTCACATGCTACAGGTATTTGTCACTTGGCTTCTTCCCCCATTACTTCTCTGAATGATTGTTGGATTGTTGATTCTGGTGCGTCTAGACATATTTGTCACACTCGAGCAGCCTTCCGTAACTGGCGGCGCATTGACCCTATTTCTATTGTTCTTCCTACTGCATATAGAGTGTGTGTTGAATATGTTGGGGAAATTCATATCTCAGCTGCTCTTGTTCTGCGTGATGTGCTTTTTGTCCCTGATTTTGCTTATAACCTGATGTCAGTGAGTTGCTTACTTCAGTCTGGAGAATTCTCTGTAGCCTTTACTAATAATCATTGCCTCATACAGGACAAACAGCTATT
GACAATGATTGGCAAGGCTGAATGTCACCATGGTCTTTACATTCTCTCTAACATTTCAGGTTCTTCGACTGATTCATTGGTTACTTCGACTGTTGTTTCTGATATACCTGCTGTTTTCTCTGTTTCTGCTTCTATATGGCATTCTCGTCTAGGCCACTTATCTCCCAGACGTTTGTCCATGCTTAGAGATACTTTACAGTTTAAAGATTCTTGTGATGCCTCATGTACCGTTTGTCCATTAGCTAAACAAAAACGGATGCCTTTTACTTCCAATAATCATGTAGCATCAAATGTTTTTGATCTTATACATGCTGACGTTTGGGGACCCCTTAATACTTCTACTTATGATGGCTATCGCTATTTTTTAACCCTAGTTGATGATGCTTCTCGATTCACTTGGGTTTATCTACTGCGACAAAAATCCGATGTTTTGATGATCATTCCTCGTTTTTTCAAGATGGTAGAAACCCAATTCTCTAAGACCATAAAATGTTTTCGTTCTGACAATGCACCAGAGCTTAAGTTCACTGATTTTTTTGCCTCCACTGGGACTATACATCAATTTTCTTGTGTGGAGAGACCTCAACAAAACTCAGTTGTAGAACGGAAACACCAACACCTGTTAAATGTTGCCCGTAGCCTGTATTTCCAGTCTCTTGACTATTCATTTTTACGTGTTTTTGGTTGTCTATGCTACGCGTCCACCCTTACTGCTGGACGTTCAAAATTTGACCCTCGTGCAAGACCTTGTGTTCTCCTTGGTTATCCGCCTGGAATGAAAGGATATCGTTTATATGATATCTCCAAGAAGCAAGTTTTTATATCTCGTGATGTAACCTTCATTGAGGAAAACTTTCCTTTCCATTCTATTATTAATAATGACCAGCTCGTGGATGTTTTTCCAGGCCTTGTTTTACCTTTACCGGTGTTTGACTTTGTTCCATCACCTAGTGTTCCAAATAACACTGCAGCAACACCTTCTGAAGTTGGTTTTCCTCCGGCTGGTCTTCCATCTGATGTGTCTGCTGAGAATGGTGATATGCTACTTCAAGCATCACCTGCTGCTAGTGATGCTGATAGCCCTGCTCAGTCTACTGGGGTTATTCAGCCACGACGTTCAACAAGGCAACGACATCCACCTGATTTTTTGAAGGATTATCACTGTCACCTTTTGAGACATGATGTTCCTTTGCTTAGTCATGATGTTCCTCACTCCCTTGACAAATATGTTTCATATAGGCGCTTCTCTAATGATCATCGACAGTTCATTTTGAATGTTTCAACAGATTTTGAACCCACCTACTATCATCAGACAGTAAAATTTGCTCCTTGGCGCAAGGCTATGGATGATGAAATTGCAGCCATGGAGCGCACTAACACCTGGACTCTGGTTCCCTTACCATCTGGACGTCGTGCAGTTGGATGCAAGTGGGTATACAAAGTCAAATATAAAGCCGATGGAACTGTAGATCGATACAAAGCGCGCCTCGTTGCCAAGGGTTACAATCAGCAAGAAGGTATTGACTTTCTTGAGACTTTTTCTCCTGTGGCCAAGATTGTAACTGTTAAAGTTCTCCTTTCTTTAACGGCTTCTTTTGGTTGGTCTCTTGTGCAATTGGATGTGAATAATGCTTTTCTAAATGGTGATTTGTTTGAAGAGGTTTATATGTCGCTACCTCTTGGTTATTATACCGATCGTAAGTCTTCTGGTTGTACTCCTATTGTTTGTAAGCTCAACAAGTCCATATATGGGCTTAAACAGGCTTCTAGACAATGGTTCTCCAAGTTTTCTAGTGTTCTTCTTGCTAATGGGTTTTCACAGTCCAAAACTGACTACTCTCTTTTCATAAGAGGCCATGGTATTTCCTTTGTTGCGTTGCTTGTATATGTAGACGATATCTTAATTACTGGACCGTCTACTACTGAAATCAGTGCTGTGAAAGACATCCTATATCGTCACTTCTTACTTAAAGATCTTGGTCATGCCAAGTATTTCCTCG
GCTTGGAGCTCTCTCGGTCTGATAAAGGGATCTACTTATCCCAGCGTAAGTACTGTCTTCAACTTATCGAAGACTCTGGCTATCTAGCTGCTAAACCGGTTAATCACCCCATGATTCCTAACCTTCGGTTATCTGCTAATGGTTCTGGGGAGCTTTTGAATGCTGATGATGCTAGTTCCTATAGACGATTGGTTGGTCGATTGCTATATTTGCAGGTATCTCGACCAGACATATCATTCACGGTTCATAATCTCAGCCAGTTTATAGCCAAGCCGTGTGTTCGCCATTTAGATGTTGTTTTTCATTTGTTGAGGTATTTAAAGGGCACTGCTGGACAGGGGATTCTTTTGCATTCCTCTCGAGATTTTCATCTTAAAGCCTTTGCTGATTCGGATTGGGGTTCTTGTCCGGACTCGAGGAAATCCGTCACTGGTTTTTGTATTTTCTTGGGAAAATCTCTTGTGTCGTGGAAGTCTAAGAAACAAGCCACAGTTTCACGCTCTTCAGCTGAGGCCGAATATCGTGCCCTCGCTACTGTTTCTAGTGAACTAATTTGGCTTTCTCATTTGCTCAAAGACTTACAAGTTCCCCTTCGGTCTCCTTCTGTGGTTTATTGTGATAATCTTGCTGCAATTGCTATTGCCAACAATCCAACCTTCCACGAGCGTACGAAACACATTGAAATTGATTGCCATTTCGTACGAGATCTCATTCAACAAAGAATTCTGCGGCTTCTACCCATTCGATCCAACCTTCAGTTGGCAGACATGTTTACCAAGCCACTCAATGCCCCTGCCATTCCTGCAACGAAAATTTTTCTGCATCAATATGCACTAAAGGGTGAGTGGGAATATGTGGAATTACTGATGGACGAGTGCCCACATTATGTTCGTTCCACAATAACAAGAAACAAAGAGACCATTCTTCATATTGCTGCGGGAGCCAAACAAACTGAATTTGTGGAGAAATTGCTGCACAGAATGACCTCTGCTGACATGACACTGCAAAACAAATATGGAAACACAGCCCTTTGTTTTGCTGCTGCTTCGGGAGTTGTAAGAATTGCTCAGCTAATGGTGCAAAAGAACAAACATCTTCCACTTATTCGTGGCTTCAACAATATTGTAACTCCACTTTTCATAGCTGTATCATACAAGTGCATAGAGATGGTTTCTTATCTCTTGTCTATCACTGATCTCGACCAACTAAACAACCAAGAACAAATCGAGCTTCTTATTGCCACCATACATGGCAACTTTTTTGATATAAGTATATGGATTCTGCAAAGATATCCATATTTAGCTACTATGAAAGACATGAATGAAGAGACTGCATTGCATGTGATGGCCAGAAAACCATCTGCCATGGATGTCACAAAGCAGCAAAGCATTTGGGAAAAGTACATTAACTCATGGATCTATGGTAAAGCTATGACGAAGACACTGGCTCATGAATTAGTTGTTTTGCTATGGACCAATGTTCTACGGAATTTGCCAGAAAAGGAGATGCTCCAATTCATTAACCATCCCACGAGATTACTGAATGAGGCTGCATGTACAGGAAATGTTGAGTTTTTGATTGTGCTCATTCGTAAATACCCGGATATAATATGGGAAGATGATGATGATGGTAAAAGCATCTTTCATGTAGCCATTGAAAATCGACTTGAAAATGTGTTCAATCTAATCAATGAGATTGGAAGGCTCAATGAGTTCACAGCAAAATATAGAACTTTCAAGGGGCGAAATTACAACATCCTGCATTTGGCTGGAAATCTAGCTGCTCCAAACCATCTCAATAGGGTCTCAGGAGCTGCCCTTCAAATGCAACGTGAATTGCTATGGTTCAAGGAAGTAGAGAAGATAGTTTTGCCTTCTCAACTCGAGGCCAAATCCAATGTTTTGTCTTACCAACACAAGCCCAAATCCAATTATCCAAATGTACCAAAGTTGACGCCACGCCAATTATTCACCCAAGAGCACAAAGATCTTCGCAAAGAT
GGTGAGGAGTGGATGAAAAACACAGCAAACTCATGCATGCTGGTCGCAACTTTAATCTCTACTGTAGTTTTTGCTGCAGCCTTCACAGTTCCTGGTGGCAGTGATAACAATGCAGGCACTCCTATTTTTCAACACAAGTTTTGGTTCACTGTGTTTTTAATGTCTGATGCTGTCGCTTTGTTTGCATCCTCAACTTCTATTCTAATGTTCATGTCCATCCTAACTTCACGCTACGCAGAAGAAGATTTTGTACACTCATTGCCGTCCAGATTGCTCTTTGGACTTGCAGCACTCTTCATATCCATTGTGTGCATGGTGGTGGCCTTTAGCGCCACATTCTTCTTGCTCTACCATAAGGCTAATATATGCATTCCCACTGTAGTTGCTGCGATGGCGATTGTTCCAGTTAGTTGTTTTTGTGCACTTCAGTTTAAACTTTGGGTTGATATTTTTCACAACACTTACTCGTCTAGATTCCTTTTCAAACCTCGTCAACGTAAATTGTTTTGA

Protein sequence

MGHGRPRPKVEELSPSSWFLVSLMSPVLLGSTLNHLQTPRNPKTGKSIIWVLLPEVELKPPSICFGLMIMKRLRKSQSFPSIPNPHYENENGKNISAEHEHEHAVGDQFLAALLNSTTVESQQTMDVDAPSDSEFQTSGGEGEFGGEGVRLGEKNIFNAIYESADKLLASSVLPHLSSSFCAFASFCFFSMAGTGSANSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMVISGKNKLGFITGKISKPQEEGALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDELQERFKQASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCGGLKPFLEHLDSEYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQRSAGILGPSPDPIALAVNDTSKTSDPPRRKENSGQRPVCSHCGIKGHVKDRCYKLHGYPPGYKFRSSNSPDNSASAKSVVAANSAAASSCPPATPNFFSSLNNVQYGQLMELFNSHLQAAKTDPITVASAVSHATGICHLASSPITSLNDCWIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRVCVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIGKAECHHGLYILSNISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGHLSPRRLSMLRDTLQFKDSCDASCTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTYDGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPELKFTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQSLDYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGYPPGMKGYRLYDISKKQVFISRDVTFIEENFPFHSIINNDQLVDVFPGLVLPLPVFDFVPSPSVPNNTAATPSEVGFPPAGLPSDVSAENGDMLLQASPAASDADSPAQSTGVIQPRRSTRQRHPPDFLKDYHCHLLRHDVPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVKFAPWRKAMDDEIAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHGISFVALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVSRPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPDSRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDLQVPLRSPSVVYCDNLAAIAIANNPTFHERTKHIEIDCHFVRDLIQQRILRLLPIRSNLQLADMFTKPLNAPAIPATKIFLHQYALKGEWEYVELLMDECPHYVRSTITRNKETILHIAAGAKQTEFVEKLLHRMTSADMTLQNKYGNTALCFAAASGVVRIAQLMVQKNKHLPLIRGFNNIVTPLFIAVSYKCIEMVSYLLSITDLDQLNNQEQIELLIATIHGNFFDISIWILQRYPYLATMKDMNEETALHVMARKPSAMDVTKQQSIWEKYINSWIYGKAMTKTLAHELVVLLWTNVLRNLPEKEMLQFINHPTRLLNEAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAIENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGNLAAPNHLNRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVLSYQHKPKSNYPNVPKLTPRQLFTQEHKDLRKD
GEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSDNNAGTPIFQHKFWFTVFLMSDAVALFASSTSILMFMSILTSRYAEEDFVHSLPSRLLFGLAALFISIVCMVVAFSATFFLLYHKANICIPTVVAAMAIVPVSCFCALQFKLWVDIFHNTYSSRFLFKPRQRKLF
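
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only: it assumes the standard nuclear genetic code (NCBI translation table 1) applies, and `translate` is a hypothetical helper, not a tool provided by the database.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nt of the CDS listed above.
print(translate("ATGGGCCATGGTCGGCCTCGGCCTAAGGTC"))
```

The first ten codons of the CDS give MGHGRPRPKV, which matches the start of the protein sequence shown above.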
Homology
BLAST of Lag0024687 vs. NCBI nr
Match: KZV25004.1 (Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum])

HSP 1 Score: 1215.7 bits (3144), Expect = 0.0e+00
Identity = 653/1456 (44.85%), Positives = 910/1456 (62.50%), Query Frame = 0

Query: 193  GTGSANSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMVISG 252
            G G   ++   +  T++E   +PY+LH+       LVS PL+G+ NY +W RAM + ++ 
Sbjct: 4    GGGGQVANQLPIVRTTLEDSSSPYYLHNGDHPGLTLVSNPLIGS-NYNTWRRAMIVALTA 63

Query: 253  KNKLGFITGKISKPQEEGALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDEL 312
            KNKLGFI   I +P+ E  L  +W   N ++ SWILNSV++ IA S++Y  + + +W +L
Sbjct: 64   KNKLGFIDRSIDRPRSEDLLYGSWIRCNSMVISWILNSVARNIADSLMYMQTAEEIWTDL 123

Query: 313  QERFKQASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCGGLKPF 372
             ERF +++ P IYQ++K L  L+QGSM V  YYTKL+T+W +L D +PT++CTCG ++ +
Sbjct: 124  YERFHESNAPRIYQIKKLLSGLQQGSMDVSSYYTKLRTLWDELRDYQPTSACTCGSMREW 183

Query: 373  LEHLDSEYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQRS----AGILG 432
              + + E VM FLMGLN++YA +RAQ+L+++PLP+I + F+L+IQEE QRS        G
Sbjct: 184  FNYQNQECVMHFLMGLNDSYAQVRAQVLMIEPLPTIAKVFALVIQEERQRSIHYDVSKAG 243

Query: 433  PSPDPIALAVNDTSKTSDPPRRKENS----GQRPVCSHCGIKGHVKDRCYKLHGYPPGYK 492
                 I   VN ++ T+   R  +NS    G R +CSHC  + H  D+CYKLHGYPPG+ 
Sbjct: 244  VDHSGILSNVNSSANTATSLRTSQNSKGGRGDRIICSHCHFRNHTVDKCYKLHGYPPGHP 303

Query: 493  FRSSNSPDNSASAKSVVAANSAAASSCPPATPNFFSSLNNVQYGQLMELFNSHLQAAKT- 552
               S     SA A     A+S++ +       +   SL   Q  QL+E  +S LQ  +  
Sbjct: 304  KFKSQISQGSAHAHQ---ASSSSETHQETQQIDHSDSLTQSQCKQLIEFLSSKLQTRQNL 363

Query: 553  ----DPITVASAVSHATGICHLASSPITSLNDCWIVDSGASRHICHTRAAFRNWRRIDPI 612
                 P T  S +   TGIC   S         WI+D+GA+ HIC + + F++ R I   
Sbjct: 364  LMEHQPETTVSCL---TGICSATSHIPAITRKDWIMDTGATHHICCSLSMFKSSRAIQS- 423

Query: 613  SIVLPTAYRVCVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHC 672
             +VLP    + V   G + +++ LVL++VL+VP F +NL+SVS L  +   SV+F ++ C
Sbjct: 424  KVVLPNTLTIPVTIAGTVAVTSNLVLQNVLYVPVFQFNLLSVSSLTDNHNCSVSFMSDSC 483

Query: 673  LIQDKQLLTMIGKAECHHGLYILSNISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGH 732
             IQD   + MIG  +    LY+L        D  + S + +     F  ++ +WH R+GH
Sbjct: 484  KIQDISQIRMIGMGKRIGNLYVL-----QQPDRFLPSYICN----TFVSNSELWHRRMGH 543

Query: 733  LSPRRLSMLRDTLQFKDSCDAS-CTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLN 792
             S  +LS L++ L  +++   + C  C L+KQ+R+P  S N++++ +F+L+H D WGP +
Sbjct: 544  PSFNKLSSLKNVLNIENTDIVNICHSCHLSKQRRLPLASRNNISARIFELLHIDTWGPFS 603

Query: 793  TSTYDGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAP 852
             ++ DG+R+F T+VDD SR+TWVY+L+ KSDVL I P F +MV TQF  T+K  RSDNAP
Sbjct: 604  QTSVDGFRFFFTIVDDHSRYTWVYMLKSKSDVLSIFPDFCRMVSTQFGVTVKSVRSDNAP 663

Query: 853  ELKFTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQS---LD-------- 912
            EL F DFFA  G  H  SCVERPQQNSVVERKHQH+LNVAR+L FQS   LD        
Sbjct: 664  ELGFADFFAKAGITHYHSCVERPQQNSVVERKHQHILNVARALLFQSHIPLDYWCDCINT 723

Query: 913  ----------------------------YSFLRVFGCLCYASTLTAGRSKFDPRARPCVL 972
                                        YS L+VFGCLCYASTL + R KF PRA  CV 
Sbjct: 724  SVYLINRTPSPILAHKTPFELLHGKLPSYSHLKVFGCLCYASTLLSSRHKFSPRAIRCVF 783

Query: 973  LGYPPGMKGYRLYDISKKQVFISRDVTFIEENFPFHSIINNDQLVDVFPGLVLPLPVFDF 1032
            +GYPPG KGY+L ++   ++FISRDV F E  FP+                         
Sbjct: 784  IGYPPGYKGYKLLNLETNEIFISRDVIFHENTFPY------------------------- 843

Query: 1033 VPSPSVPNNTAATPSEVGFPPAGLPSDVSAENGDMLLQASPAASDADSPAQSTGVIQPRR 1092
                    NT+             P  +S    DM  + SP  S   +P+      Q  R
Sbjct: 844  -------QNTS-------------PMSLS----DMTFEVSP--SSQITPSIPADAQQHSR 903

Query: 1093 STRQRHPPDFLKDYHCHLLRHDVPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEP 1152
            ++R  + P  L+DYHC+ +    P  S    H +   V+Y + S+ HR F+ N+S+  EP
Sbjct: 904  TSRPHNTPSHLRDYHCYSI--STP-CSTSTAHPIHPLVNYSKLSSSHRAFVQNISSILEP 963

Query: 1153 TYYHQTVKFAPWRKAMDDEIAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRY 1212
            T + Q V    WR+AMD+E+ A+E  +TW++V LP G+ AVGC+WVYK K+ ADG++ RY
Sbjct: 964  TTFSQAVSLPEWRQAMDEELKALELNHTWSIVSLPQGKSAVGCRWVYKAKFAADGSLQRY 1023

Query: 1213 KARLVAKGYNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFE 1272
            KARLVAKGY QQEG+D+LETFSPVAK+VTV+ LL+L A  GW L+QLDVNNAFL+GDL E
Sbjct: 1024 KARLVAKGYTQQEGLDYLETFSPVAKLVTVRTLLALAAVRGWFLIQLDVNNAFLHGDLTE 1083

Query: 1273 EVYMSLPLGYYTDRKSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDY 1332
            EVYM+LP G+ ++ +    +  VCKL+KSIYGLKQASRQWF+KFSS LL+ GF QS  D 
Sbjct: 1084 EVYMTLPPGFCSEGELP--SRAVCKLHKSIYGLKQASRQWFAKFSSTLLSIGFIQSHADN 1143

Query: 1333 SLFIRGHGISFVALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSR 1392
            SLFIR     F+AL+VYVDDI+I        S +KD L   F LKDLG+ KYFLG+E++R
Sbjct: 1144 SLFIRSDKNIFLALVVYVDDIVIATNDQNAASELKDFLNSKFKLKDLGNLKYFLGIEVAR 1203

Query: 1393 SDKGIYLSQRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVG 1452
            S +G+ + QR Y + L+ ++G L  KP   PM  N +L A  SGE+L+  D +SYRRL+G
Sbjct: 1204 STRGVSICQRNYAMTLLTEAGLLGCKPRTTPMEANTKL-AQDSGEMLS--DPASYRRLIG 1263

Query: 1453 RLLYLQVSRPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLK 1512
            RLLYL ++RPD+ F V+ LSQ+++ P + H++   ++L+Y+KGT GQG+   SS D  L+
Sbjct: 1264 RLLYLTITRPDLVFAVNKLSQYVSMPRIPHMEAALNILKYVKGTVGQGLFYSSSSDLKLR 1323

Query: 1513 AFADSDWGSCPDSRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIW 1572
            AF+D+DWG+C D+R+SVTG+C+FLG+SL+SW++KKQ TVSRSSAEAEYR+LA  + E++W
Sbjct: 1324 AFSDADWGACLDTRRSVTGYCVFLGESLISWRAKKQQTVSRSSAEAEYRSLAASTCEILW 1383

Query: 1573 LSHLLKDLQVPLRSPSVVYCDNLAAIAIANNPTFHERTKHIEIDCHFVRDLIQQRILRLL 1596
            +  LL DL V    P+V++CD+ AA+ IA+NP FHERTKHI+IDCH VR+ +QQ+I++L+
Sbjct: 1384 IHQLLADLGVTYNEPTVLFCDSQAAVHIASNPVFHERTKHIDIDCHIVREKVQQKIVKLM 1383
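
The percentages in the HSP summary for this match follow directly from the reported counts. A minimal sketch, assuming the denominator is the full alignment length including gap columns (which is how BLAST normally reports it); `hsp_percent` is a hypothetical helper name.

```python
def hsp_percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage string with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100.0 * matches / alignment_length:.2f}"

# Counts taken from the HSP summary of the KZV25004.1 match.
print("identity :", hsp_percent(653, 1456))
print("positives:", hsp_percent(910, 1456))
```

These reproduce the 44.85% identity and 62.50% similarity values given in the summary line.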

BLAST of Lag0024687 vs. NCBI nr
Match: RVW82526.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1174.1 bits (3036), Expect = 0.0e+00
Identity = 644/1448 (44.48%), Positives = 877/1448 (60.57%), Query Frame = 0

Query: 209  IEAQINPYFLHHSFGSSSVLVSQPLLGA-VNYTSWSRAMRMVISGKNKLGFITGKISKPQ 268
            +E   +PYFLH+    S  LVS  L G+  NY SW R+M   ++ KNKLGFI G IS+P 
Sbjct: 15   MEDHSSPYFLHNGDHPSLSLVSLSLAGSGSNYHSWRRSMVTALNAKNKLGFIDGTISRPA 74

Query: 269  EEGALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDELQERFKQASGPGIYQL 328
                L   W   N ++ SW+ NSV KEIA SI+Y  +   +W++L ERF Q SGP I++L
Sbjct: 75   ATDLLASPWSRCNSMVISWLSNSVCKEIAESILYHETAIEIWNDLYERFHQGSGPRIFEL 134

Query: 329  RKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCGGLKPFLEHLDSEYVMTFLMG 388
            ++ ++   QGS  V  YYT+LK++W +L + +    C CGG++ ++E    E VM FL+G
Sbjct: 135  KQKILAHTQGSADVNTYYTRLKSLWDELREFKAIPICNCGGMRVYMEDQQRETVMQFLLG 194

Query: 389  LNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQRSAGILGPSP---DPIALAVNDTSKT 448
            LNE++A I+AQILLM+P P + + FSL++QEE QRS      SP    P++      S+ 
Sbjct: 195  LNESFAPIQAQILLMEPTPPLNKVFSLVVQEEWQRSL-TTSNSPAFTTPVSSRFQAASRA 254

Query: 449  SDPPRRKENSGQRPVCSHCGIKGHVKDRCYKLHGYPPGYKFRSSNSPD---------NSA 508
            S P     +   RP+C+HC I GH  DRCYK+HGY PG++ R +  P+         NS 
Sbjct: 255  SSPTNSSRSRKDRPLCTHCNILGHTVDRCYKIHGYTPGFRNRPNFRPNGSRPNQMLPNSL 314

Query: 509  SAKSVVAANSAAASSCPPATPNFFSSLNNVQYGQLMELFNSHLQAAKT----DPITVASA 568
                +   + + AS+ PP        L + Q+ QL+ L + H  +  +    D   +  +
Sbjct: 315  HTNQLTLTDGSIASASPP-------PLTHDQHNQLLALLSLHSSSGSSASFGDSNPLQQS 374

Query: 569  VSHATGICHLASSPITSLNDCWIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRVCV 628
            +S+ TGI  L+ S  T     WI+DSGA+ H+C   + F +       ++ LPT  ++ +
Sbjct: 375  ISNFTGILSLSPSSSTLNPSIWILDSGATHHVCTNSSMFHSIHSFSSNTVTLPTGTKIPI 434

Query: 629  EYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIG 688
              +G IH+S  LVL  VL++P F +NL+S+S L Q+  FS  FT + C IQD     +IG
Sbjct: 435  TGIGTIHLSPHLVLEHVLYIPTFQFNLISISALTQTNCFSFDFTAHFCFIQDHSQGKLIG 494

Query: 689  KAECHHGLYILSNISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGHLSPRRLSMLRDT 748
                   LY+L     SS    ++S  V D      V+  +WH RL H S  +LS+L+  
Sbjct: 495  MGRRQGNLYLLD----SSVFRSISSVFVVDNNTSAHVN-KLWHFRLSHPSNVKLSVLKPH 554

Query: 749  LQFKD--SCDASCTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTYDGYRYFL 808
            LQ +   + + SC++CPLAKQKR+PF  +N+++S+ FDLIH D+WGP +  T+DG+RYFL
Sbjct: 555  LQLQSNGNTNLSCSICPLAKQKRLPFDCHNNLSSSPFDLIHCDIWGPFHIPTHDGFRYFL 614

Query: 809  TLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPELKFTDFFAST 868
            T+VDD +R TWV+LLR KSDV  I P+FF MV+T+F  TIK  RSDNAPEL  ++ F   
Sbjct: 615  TIVDDCTRNTWVHLLRAKSDVKTIFPQFFSMVKTKFGLTIKAVRSDNAPELNLSNLFTQL 674

Query: 869  GTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQ------------------------- 928
              +H FSCVE PQQNSVVERKHQH+LNVAR+LYFQ                         
Sbjct: 675  DVLHFFSCVETPQQNSVVERKHQHILNVARALYFQSNIPIGYWGDCVLTSVYLINRIPSP 734

Query: 929  --------------SLDYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGYPPGMKGYR 988
                          S  YS L+ FGCLCY+STL + R KF PRA PCV LGYP G KGY+
Sbjct: 735  LLNNKTPFELLHHKSPSYSHLKSFGCLCYSSTLPSTRHKFSPRALPCVFLGYPFGYKGYK 794

Query: 989  LYDISKKQVFISRDVTFIEENFPFHSIINNDQLV-DVFPGLVLP-LPVFDFVPSPSVPNN 1048
            + D+   ++ +SR+VTF E  FPF    NN+ +  D F   VLP +PV    PSPS  N+
Sbjct: 795  ILDLETNRISVSRNVTFQESVFPFKLSQNNNSVASDFFSKKVLPVVPV--STPSPSFDNS 854

Query: 1049 TAATPSEVGFPPAGLPSDVSAENGDMLLQASPAASDADSPAQSTGVIQPRRSTRQRHPPD 1108
            T+                            +P +S  D+   +T      RS+R   PP 
Sbjct: 855  TSH-------------------------PNNPDSSFNDTSPHTTS-HTTTRSSRVSQPPK 914

Query: 1109 FLKDYHCHLLRHDVPL-LSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVK 1168
            +L DYHCHL        +S+  P+ L   +SY + S   R F +++ST  EPT Y + V 
Sbjct: 915  YLSDYHCHLASSTPHFDISNSTPYPLSDVISYNKLSPSFRAFSISISTITEPTTYAEAVV 974

Query: 1169 FAPWRKAMDDEIAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKG 1228
               W+ AM  E+ A+E  NTW+L  LP G+ AVGCKW+Y+VKY  DG+++RYKARLVAKG
Sbjct: 975  VPEWQHAMRAELQALESNNTWSLCTLPPGKTAVGCKWLYRVKYHVDGSIERYKARLVAKG 1034

Query: 1229 YNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPL 1288
            + QQEG+DF   F    +                            +G   +EV+M LP 
Sbjct: 1035 FTQQEGVDFFLYFFTCCQ----------------------------DGHC-QEVFMHLPP 1094

Query: 1289 GYYTDRKSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHG 1348
            GY+ +R+    + IVCKL+KSIYGL+QASRQWF+KFS VL++ GF QS +DYSLFI+  G
Sbjct: 1095 GYHREREPLLPSNIVCKLHKSIYGLRQASRQWFAKFSGVLISEGFQQSHSDYSLFIKTAG 1154

Query: 1349 ISFVALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLS 1408
              F+ALLVYVDDI++   +      +K+ L + F LKDLG+ KYFLGLE++RS KGI ++
Sbjct: 1155 NDFIALLVYVDDIIVASNNKIAADNLKNSLNKSFKLKDLGNLKYFLGLEVARSAKGILIN 1214

Query: 1409 QRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVS 1468
            QRKY L+L+ ++GYL  KP   PM PN++LS +  GELL   D + YRRL+G+L+YL ++
Sbjct: 1215 QRKYALELLSETGYLGCKPAKTPMQPNMQLSQD-DGELLT--DPNMYRRLIGKLIYLTIT 1274

Query: 1469 RPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWG 1528
            RPD++++V+ LSQF+++P   HL  V+ +L+Y+KG+ G+GI   +S    LKAF+DSDW 
Sbjct: 1275 RPDLTYSVNKLSQFLSQPRRPHLQAVYRILQYIKGSPGKGIFFSASSSLQLKAFSDSDWA 1334

Query: 1529 SCPDSRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDL 1588
            +CPDSRKSVTGFCIFL  SL+SWKSKKQ TVSRSSAEAEYRA+A V+ EL WL  LLKDL
Sbjct: 1335 ACPDSRKSVTGFCIFLRDSLISWKSKKQRTVSRSSAEAEYRAMAHVTCELTWLIALLKDL 1389

Query: 1589 QVPLRSPSVVYCDNLAAIAIANNPTFHERTKHIEIDCHFVRDLIQQRILRLLPIRSNLQL 1596
             +P   P+++YCDN AA+ IA NP FHERTKHIEIDCH VR+ IQ  +L+ L + S  QL
Sbjct: 1395 GIPHTQPALLYCDNQAALHIAANPVFHERTKHIEIDCHIVREKIQTSMLKTLHVASQHQL 1389
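
Note on the HSP summary line: the percentages are the identity and positive counts divided by the alignment length. The sketch below, which is not part of the original report, parses the summary line of the RVW82526.1 hit and recomputes both percentages. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the pattern accepts the second label spelled either "Positives" or "Postives".

```python
import re

# Matches e.g. "Identity = 644/1448 (44.48%), Positives = 877/1448 (60.57%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    if length != pos_length:
        raise ValueError("identity and positive counts use different lengths")
    n = int(length)
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": n,
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        # Recomputed values, rounded to two decimals like the report.
        "identity_pct": round(100.0 * int(ident) / n, 2),
        "positive_pct": round(100.0 * int(pos) / n, 2),
    }


line = "Identity = 644/1448 (44.48%), Positives = 877/1448 (60.57%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported percentages is a quick consistency check when the summary lines are copied into other tables.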

BLAST of Lag0024687 vs. NCBI nr
Match: KAG7578768.1 (GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa])

HSP 1 Score: 1129.0 bits (2919), Expect = 0.0e+00
Identity = 640/1527 (41.91%), Positives = 898/1527 (58.81%), Query Frame = 0

Query: 204  VTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMVISGKNKLGFITGKI 263
            + A + ++  NPY+L ++  S  VLVS  L GA ++ SW ++M M ++G+NKLGF+ G +
Sbjct: 1    MAAPTRDSYDNPYYLTNNDHSGLVLVSDRLTGAGDFGSWHQSMLMALNGRNKLGFVDGSL 60

Query: 264  SKPQEEGALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDELQERFKQASGPG 323
             KP +       W   ND++ SW++NSVSK I  S++Y  +   +W +L +RFKQ + P 
Sbjct: 61   PKPDDGHRDAATWSRVNDVVRSWLINSVSKTIGQSVLYVKTAHGIWQKLLQRFKQNNVPR 120

Query: 324  IYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCGGL-----KPFLEHLDS 383
            +Y++ + L  LRQGS+ V  +YTKL TIW+++   +    C CGG      + +++  + 
Sbjct: 121  LYRIEQKLAGLRQGSLDVNTFYTKLVTIWEEVKSAQDFPVCQCGGCDCEVNRKWMDLFER 180

Query: 384  EYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEH-------QRSAGIL---- 443
             +V+ FL GLN++Y  +R  I+++ PLP + +  +++IQ+EH        +S  ++    
Sbjct: 181  NFVIKFLFGLNDSYENVRESIIMLDPLPDLEKTLNMVIQKEHTQEIKQVPQSGSVVFQMS 240

Query: 444  ---GPSPDPIALAVNDTSKTSDPPRRKENSG---------QRPVCSHCGIKGHVKDRCYK 503
                PSP       + +S T D   + +  G         QRPVC++CG++GH+  +CYK
Sbjct: 241  SQHAPSPQ-FDQNFSSSSITDDYSGQSDFVGAVSGGYKPRQRPVCTYCGLQGHLVTKCYK 300

Query: 504  LHGYPPGYKF------RSSNSP--------------------------DNSASAKSV--- 563
            LHGYP GYK        + N+P                          +NS+  +     
Sbjct: 301  LHGYPLGYKSSNPSYGNTQNTPSTQPFAPKQFSPRPPMSSQQQYNPQFNNSSRMQGQGQR 360

Query: 564  ---------VAANSAAASSCPPATPNFFSSLNNVQYGQLMELFNSHLQAAKTDPITVASA 623
                     V  NS A         N  + L+  Q  QL    NS     +T  I  A  
Sbjct: 361  GQRDNVVGNVITNSPAVHDHFHQVSNALAQLSPDQIEQLASQLNSK-ATCQTPSINEAHG 420

Query: 624  VSHA-TGICHLASSPI--TSLN-DCWIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAY 683
            V++A T   +   S I  + LN   WI+DSGA+ H+C   + F +   I   ++ LP + 
Sbjct: 421  VNYASTSAGYFVCSTILESCLNFTAWIIDSGATTHVCCNLSLFDDINSISETTVKLPNST 480

Query: 684  RVCVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLL 743
            ++ +   G + +S  L+LR+VLF+P F  NL+SVS LLQ   +SV F  + C IQ+    
Sbjct: 481  QIAINQSGTVKLSDKLLLRNVLFIPSFHMNLISVSSLLQDCAYSVNFFPSFCTIQEFTRG 540

Query: 744  TMIGKAECHHGLYILSNISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGHLSPRRLSM 803
             MIGK    + LY L   S SS     TS V +      S S+S+WHSRLGH S  +L  
Sbjct: 541  LMIGKGRLENKLYFLDLESPSSQSPSSTSLVCN---LNVSESSSLWHSRLGHPSFPKLQA 600

Query: 804  LRDTLQFKDS---CDASCTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTYDG 863
            L + L    S     + C  C LAKQKR+ F S N+++   FDL+H DVWGP +  +++ 
Sbjct: 601  LSEDLSISKSKLKDWSHCKTCHLAKQKRLSFPSLNNISKQPFDLVHMDVWGPFSVVSHEV 660

Query: 864  YRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPELKFTD 923
            ++YFLTLVDD +R TW+YLL+ KSDV  I P F   VETQ++  +K  RSDNAPEL FT 
Sbjct: 661  FKYFLTLVDDCTRVTWIYLLKAKSDVHQIFPAFLNSVETQYNNKVKAIRSDNAPELSFTS 720

Query: 924  FFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQS------------------- 983
               S G  H FSCV+ PQQNSVVERKHQH+LNVAR+L FQS                   
Sbjct: 721  LLQSKGIFHFFSCVDTPQQNSVVERKHQHILNVARALLFQSNIPIQYWSDCIRTSVYLIN 780

Query: 984  --------------------LDYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGYPPG 1043
                                  YS L+ FGCLCY ST    R KF PRA   V LGYP G
Sbjct: 781  RTPSPLLNNKTPFELLMNKKPKYSHLKSFGCLCYVSTYPKDRHKFTPRAEASVFLGYPSG 840

Query: 1044 MKGYRLYDISKKQVFISRDVTFIEENFPFHSIINNDQLVDVFPGLVLPLPVFDFVPSPSV 1103
             KGY++ ++    + ISR+V F E+ FPFHS       +D+F   +LPLP+ D       
Sbjct: 841  YKGYKVLNLETHSISISRNVIFHEDIFPFHSSDLAPSTLDLFNSNILPLPLPD------- 900

Query: 1104 PNNTAATPSEVGFPPAGLPSDVSAENGDMLLQASPAASDADSPAQSTGVIQPRRSTRQRH 1163
               +++ P +  FP           N D+L   S ++ D+D+    T   +P+R+ R   
Sbjct: 901  -TTSSSIPVQHPFP----------TNNDVLSDNSGSSVDSDNTIPVT-TNRPKRNIR--- 960

Query: 1164 PPDFLKDYHCHLLRHDVPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQT 1223
             P +L DYHC+L+ HD+P +S +  H L   + Y + +  ++QFILN+S + EP  + + 
Sbjct: 961  APSYLADYHCNLV-HDLPTVSGNTAHPLSSVLDYTKLNPHYQQFILNISAESEPKTFLEA 1020

Query: 1224 VKFAPWRKAMDDEIAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVA 1283
            V+   W   M++E+     T T+++V LP+G++ +GC+WVYK+K+ ADGT+DRY+ARLVA
Sbjct: 1021 VRSEKWHGPMNEELQTCVDTGTFSVVSLPAGKQPIGCRWVYKIKHNADGTIDRYRARLVA 1080

Query: 1284 KGYNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSL 1343
            KGY QQEG+D+++TFSPVAK+VTVK+LL L+A  GWSL Q+DV NAFL+GDL EE+YM L
Sbjct: 1081 KGYTQQEGVDYIDTFSPVAKLVTVKLLLDLSAKQGWSLTQMDVTNAFLHGDLEEEIYMDL 1140

Query: 1344 PLGYYTDRKSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRG 1403
            P GY      +     V +L+KS+YGLKQASRQW  KFS VLLA GF+QS++D++LF++ 
Sbjct: 1141 PPGYTPPPGETLPPNAVWRLHKSLYGLKQASRQWNKKFSDVLLAAGFTQSESDHTLFVKH 1200

Query: 1404 HGISFVALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIY 1463
                F+ALLVYVDDILI   S   +S +K +L   F LKDLG AKYFLGLE++R+  GI 
Sbjct: 1201 VNNIFIALLVYVDDILIASNSDAAVSDLKSVLAASFKLKDLGQAKYFLGLEIARNKSGIS 1260

Query: 1464 LSQRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQ 1523
            +SQRKY L L+E  G L  KPV+ PM   ++L+   SG+LL   DA+ YR L+G+LLYL 
Sbjct: 1261 VSQRKYALDLLESVGLLGCKPVSTPMDSTVQLTTE-SGDLL--PDATVYRALIGKLLYLT 1320

Query: 1524 VSRPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSD 1583
            ++R DI+F VH LSQF+++P   HL+    ++RYLKG  G+G+   +  D  L+AF+D+D
Sbjct: 1321 ITRADITFAVHKLSQFLSQPRTLHLEAAHRIIRYLKGDPGRGLFYSAQSDLRLQAFSDAD 1380

Query: 1584 WGSCPDSRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLK 1612
            WG+C D+R+S TGFC+FLG SL+SWKSKKQ T SRSSAE+EYRALA  + EL+WLS LLK
Sbjct: 1381 WGTCQDTRRSTTGFCVFLGTSLLSWKSKKQPTASRSSAESEYRALADTTCELLWLSKLLK 1440

BLAST of Lag0024687 vs. NCBI nr
Match: KAG7588551.1 (Ribonuclease H domain [Arabidopsis suecica])

HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 647/1573 (41.13%), Positives = 891/1573 (56.64%), Query Frame = 0

Query: 198  NSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMVISGKNKLG 257
            ++S+S  +  S++   NPYFLH S  +  VLVS  L    ++ SW R++RM ++ +NKLG
Sbjct: 2    STSNSEGSRQSLDQYDNPYFLHKSDHAGLVLVSDRLSTGADFHSWKRSIRMALNVRNKLG 61

Query: 258  FITGKISKPQEEGALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDELQERFK 317
            FI G +++PQ +     +W   ND++A+W++NSVSK+I  S+++  + + +W  L  RFK
Sbjct: 62   FIDGTVTQPQSDHRDYGSWSRCNDMVATWLMNSVSKKIGQSLLFISTAEGIWKNLMSRFK 121

Query: 318  QASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCG----GLKPFL 377
            Q   P +Y++ + L +++QGSM V  YYT+L T+W++  +      CTCG          
Sbjct: 122  QDDAPRVYEIEQRLSSIQQGSMDVSAYYTELVTMWEEYKNYVEIPVCTCGKCECNAAILW 181

Query: 378  EHLDSEYVMT-FLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQRS------AGI 437
            E L     +T FLMGLNE+Y A R  IL++KP+PSI + F+++ Q+E Q+S       GI
Sbjct: 182  EKLQQRSRVTKFLMGLNESYEATRRHILMLKPIPSIEDVFNMVTQDERQKSIKPSKPEGI 241

Query: 438  LGPSPDPIALAVNDTSKTSDPPRR--KENSG-----------QRPVCSHCGIKGHVKDRC 497
            +  +  P A A  D S    P  +  ++N+             RP+C+HCG  GHV  +C
Sbjct: 242  IFQATGPSAQANPDMSTYQGPTYQGPQDNAAYAMQNGYRPRQPRPLCTHCGQSGHVIQKC 301

Query: 498  YKLHGYPPGY-----------------------KFRSSNSPDNSASAKSVVAANSAAASS 557
            +KLHGYPPGY                        F+      N+  A SV  AN      
Sbjct: 302  FKLHGYPPGYIPGFKSISSGYHSQRMPTPTPQPVFQPRGQMQNNQRAHSV--ANVMQTPY 361

Query: 558  CP-PATPNF---FSSLNNVQYGQLMELFNSHLQAAKT---DPIT-------VASAVSHAT 617
             P PAT      FS ++  Q   L+    +H+Q  +T    P+        V +A S + 
Sbjct: 362  IPSPATNAISLDFSKISPDQMQSLLHQLAAHVQLPETTVPSPMVSCITENGVMAAESSSG 421

Query: 618  GICHLASSPITSLND-----------------------------CWIVDSGASRHICHTR 677
             I  L+SS  T+++                               WI+DSGA+ H+C   
Sbjct: 422  NIHSLSSSSHTNIHSLSNSIRYENNQLTFQHQCLSSLYTNLPHGSWIIDSGATSHVCSDL 481

Query: 678  AAFRNWRRIDPISIVLPTAYRVCVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQS 737
            + F     +  +++ LP   RV + + G I IS +L+L DVL VP F +NL+SVS LL+S
Sbjct: 482  SLFSETIPVSGVTVSLPNDTRVAITHTGTIPISHSLILHDVLHVPSFKFNLISVSSLLKS 541

Query: 738  GEFSVAFTNNHCLIQDKQLLTMIGKAECHHGLYILSNISGSSTDSLVTSTVVSDIPAVFS 797
             + S  F    C IQ+      IGK    H LYIL      S  SL   +  +       
Sbjct: 542  SKCSAHFYTTSCFIQESTQGLTIGKGILLHNLYILQI---ESPLSLTAPSHTTHFSGSLV 601

Query: 798  VSASIWHSRLGHLSPRRLSMLRDTLQFKDSC---DASCTVCPLAKQKRMPFTSNNHVASN 857
            V   +WH RLGH S  +L  L  TL    S       C VC LAKQKR+ F S+NH++S+
Sbjct: 602  VDGDLWHRRLGHPSSDKLQALSSTLSLPKSSLQNKCPCHVCSLAKQKRLSFESHNHLSSS 661

Query: 858  VFDLIHADVWGPLNTSTYDGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQ 917
             FDLIH DVWGP +T + +GYRYFLT+VDD +R TW+YL+R KS+V      F ++V TQ
Sbjct: 662  PFDLIHLDVWGPFSTESVEGYRYFLTIVDDCTRVTWIYLMRNKSEVSKHFTTFIQLVLTQ 721

Query: 918  FSKTIKCFRSDNAPELKFTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQ 977
            +   IK  R+DNAPEL FT+     G +HQFSC   PQQNSVVERKHQHLLNVAR+L FQ
Sbjct: 722  YKAVIKKIRTDNAPELAFTEIINKNGIMHQFSCAYTPQQNSVVERKHQHLLNVARALLFQ 781

Query: 978  S---------------------------------------LDYSFLRVFGCLCYASTLTA 1037
            S                                        DYS LR FGCLCYASTL  
Sbjct: 782  SNVPLAYWSDCISTAVFLINRMPSVLLKNISPYELLLKKPPDYSLLRCFGCLCYASTLLK 841

Query: 1038 GRSKFDPRARPCVLLGYPPGMKGYRLYDISKKQVFISRDVTFIEENFPFHSIINNDQLVD 1097
             R KF PRA  CV +GY  G KGY+L  +    V +SR+V F E  FPFH I     L  
Sbjct: 842  DRHKFSPRADKCVFIGYSSGYKGYKLLHLDTNIVSVSRNVVFYEHIFPFHDITVASPL-- 901

Query: 1098 VFPGLVLPLPVFDFVPSPSVPNNTAATPSEVGFPP-----AGLPSDVSAENGDMLLQASP 1157
            +F   +LPLP+       SV  +     S+   PP     A   S  S     M   + P
Sbjct: 902  IFSNNILPLPI-------SVALDIVEHSSQQNVPPPNQSYASSSSHTSHTRSSM--HSVP 961

Query: 1158 AASDADSPAQSTGVIQPRRSTRQRHPPDFLKDYHCHLL-RHDVPLLSHDV------PHSL 1217
                A++   S   ++P+RS +    P +L DYHC L+ +   PLLS  +      P+ L
Sbjct: 962  ETVPAETSIVSLPNVRPKRSAK---TPSYLTDYHCSLIQKSSSPLLSDPLPKKLTTPYPL 1021

Query: 1218 DKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVKFAPWRKAMDDEIAAMERTNTWTLVPL 1277
               +SY +  N ++  +L+ S + EP+ + Q +    W KAMD E+ AME  +TW++V L
Sbjct: 1022 SSVLSYSQLKNPYQSIVLSYSIESEPSNFKQAIASIQWTKAMDVELQAMEDNHTWSIVEL 1081

Query: 1278 PSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLETFSPVAKIVTVKVLL 1337
            P G+  VG KWVY +KY ADGT++RYKARLVAKG+ QQEG+DF +TFSPVAK+ +VK++L
Sbjct: 1082 PPGKNIVGSKWVYTIKYNADGTIERYKARLVAKGFTQQEGVDFFDTFSPVAKLASVKLIL 1141

Query: 1338 SLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGCTPIVCKLNKSIYGLK 1397
             L A+  W+L Q+DV+NAFL+ +L EE+YMSLP GY      +     VC+L+KSIYGLK
Sbjct: 1142 GLAAAKDWNLTQMDVSNAFLHSELEEEIYMSLPQGYTPAPGQTLPPNPVCRLHKSIYGLK 1201

Query: 1398 QASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHGISFVALLVYVDDILITGPSTTEISAV 1457
            QASRQW+  FS VLL N F QS +D +LF++  G SF+ LLVYVDDI+I   S   +S++
Sbjct: 1202 QASRQWYRCFSKVLLGNNFLQSASDNTLFVKISGNSFIVLLVYVDDIMIASNSVEAVSSL 1261

Query: 1458 KDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYCLQLIEDSGYLAAKPVNHPMIP 1517
            K IL + F +KDLG  ++FLGLE++R+ +GI +SQRKYCL L++D+G+L  KP   PM P
Sbjct: 1262 KAILAQEFKIKDLGPVRFFLGLEVARNKEGISVSQRKYCLDLLKDAGFLGCKPRTVPMDP 1321

Query: 1518 NLRLSANGSGELLNADDASSYRRLVGRLLYLQVSRPDISFTVHNLSQFIAKPCVRHLDVV 1577
             + L  + +G LL   D   YR L+GRLLYL ++RPDI+F V+ LSQF++ P   HL   
Sbjct: 1322 KVPLFKD-TGTLLT--DGKPYRELIGRLLYLTITRPDITFAVNRLSQFLSCPTDVHLQAA 1381

Query: 1578 FHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPDSRKSVTGFCIFLGKSLVSWKSK 1620
            +H+L+YLK   GQG+    + D  L  F+D+DWG+C D+R+S TG C+FLG SL++ KSK
Sbjct: 1382 YHILKYLKANPGQGLFYSVNTDLCLNGFSDADWGNCKDTRRSTTGMCVFLGTSLITHKSK 1441

BLAST of Lag0024687 vs. NCBI nr
Match: KAG7574150.1 (Integrase catalytic core [Arabidopsis suecica])

HSP 1 Score: 1097.0 bits (2836), Expect = 0.0e+00
Identity = 599/1445 (41.45%), Positives = 856/1445 (59.24%), Query Frame = 0

Query: 214  NPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMVISGKNKLGFITGKISKPQEEGALL 273
            +PY+L HS      ++S+ +    NY  W  A+++ +  KNKL FI G + +P E   L 
Sbjct: 53   SPYYLTHSDNPGVSIISE-VFDGTNYDDWQIAIKIALDAKNKLVFIDGSVPRPPESDRLF 112

Query: 274  EAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDELQERFKQASGPGIYQLRKDLVT 333
              W   N ++ SW+LNSVSK I  SI+       +W++L  R++  + P  Y L + + +
Sbjct: 113  RIWSRCNSLVKSWLLNSVSKPIYKSILRFDDASEIWNDLLTRYRITNLPRSYHLSQQIWS 172

Query: 334  LRQGSMSVEVYYTKLKTIWQDLSDLRPTASCT-CGGLKPFLEHLDSEYVMTFLMGLNETY 393
            L+QG+M +  YYT L+T+W +L        C  C   K   +  D   V+ FL GLNE+Y
Sbjct: 173  LQQGTMDLATYYTTLRTLWNELDGSDCVTLCKHCDCCKAVDKKADHARVIKFLAGLNESY 232

Query: 394  AAIRAQILLMKPLPSITEAFSLLIQEEHQRSAGILGPSPDPIALAVNDTSKTSDPPRRKE 453
            + IR QI++ K +PS+ E ++LL Q+  QRS   +  +     ++ +D+ + S       
Sbjct: 233  SVIRTQIIMKKHVPSLAEVYNLLDQDHSQRSFTPVPTNAVAFHVSASDSIQPSVNATYNN 292

Query: 454  NSGQRPVCSHCGIKGHVKDRCYKLHGYPPGYKFRSSNSPDNSASAKSVVAANS---AAAS 513
               Q+ +C+HCG  GH  DRCYK+HGYP G+K +  N  +  +S++  V A     A  +
Sbjct: 293  AKPQKIICTHCGYTGHTIDRCYKIHGYPLGFKHKHKNQSEKGSSSEKPVTAIKPVVAQLA 352

Query: 514  SCPPATPNFFSSLNNV----QYGQLMELFNSHLQ---AAKTDPITVAS----AVSHAT-- 573
                 T +  + L  V    Q   ++  FNS +Q    A T   T+ +    A S +T  
Sbjct: 353  MTETTTNDLINGLTKVLTKDQINGVVAYFNSQIQTPSVASTSGATITALPGIAFSSSTIG 412

Query: 574  --GICHLASSPITSLNDCWIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRVCVEYV 633
              G+     + ++S  + WI+DSGA+ H+CH ++ F N       S+ LPT + V +  +
Sbjct: 413  FVGVLRATGNVLSS--ESWIIDSGATHHVCHDKSLFLNLSETMNNSVTLPTGFGVKITGI 472

Query: 634  GEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIGKAE 693
            G + ++  L+L +VL++PDF  NL+S+S L +   + V F  + C+IQD     MIGK E
Sbjct: 473  GTVQLNEFLILNNVLYIPDFRLNLLSISQLTKDLGYRVTFDEDSCIIQDHIKGLMIGKGE 532

Query: 694  CHHGLYILS--NISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGHLSPRRLSMLRDTL 753
                LY+L   +I  S       ST +        V +S+WHSRLGH S    +++ D L
Sbjct: 533  QISNLYVLDVHSIKDSKDQKRTFSTNI-------VVDSSLWHSRLGHPSVATSNIVTDVL 592

Query: 754  QFKDSCDAS--CTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTYDGYRYFLT 813
             F      S  CTVCPLAKQK +PF S N+V  + FDL+H D+WGP N  T DGYRYFLT
Sbjct: 593  GFNQRNKRSFHCTVCPLAKQKHLPFPSKNNVCDSAFDLVHIDIWGPFNVPTPDGYRYFLT 652

Query: 814  LVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPELKFTDFFASTG 873
            +VDD +R TW+YLL+ KS+VL I P F KMVETQ+   +K  RSDNAPELKF + F + G
Sbjct: 653  IVDDHTRVTWLYLLKNKSEVLTIFPDFLKMVETQYKTQVKGVRSDNAPELKFVELFKAKG 712

Query: 874  TIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQSL------------------------ 933
             +H FSC E P+QNSVVERKHQH+LNVARSL FQ+                         
Sbjct: 713  ILHYFSCPETPEQNSVVERKHQHILNVARSLMFQAQVPIEYWGECVLTAVFLINRLPTPL 772

Query: 934  ---------------DYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGYPPGMKGYRL 993
                           D+S LRVFGCLCY+ST T  R+KF PRA+ CV LGYPPG+KGYRL
Sbjct: 773  LKDKSPFEVLTHKMPDFSGLRVFGCLCYSSTSTKNRNKFQPRAKACVFLGYPPGVKGYRL 832

Query: 994  YDISKKQVFISRDVTFIEENFPFHSIINNDQLVDVFPGLVLPLPVFDFVPSPSVPNNTAA 1053
             D+    +++SR+V F E  FPF +   +  + D+F          D   S   P   + 
Sbjct: 833  LDLETNVIYVSRNVVFHENIFPF-AKGESTVMPDIFSPSEESFVEDDAAVSIDSPVVVSE 892

Query: 1054 TPSEVGFPPAGLPSDVSAENGDMLLQASPAASDADSPAQSTGVIQPRRSTRQRHPPDFLK 1113
            +P+ V            A N ++ +  + ++  +D  A  T              P +L+
Sbjct: 893  SPTVV----------TDAANTNVPINNNSSSQKSDRRASKT--------------PAYLQ 952

Query: 1114 DYHCHLLRHDVPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVKFAPW 1173
            DY+C+       L ++ V H +  ++SY   S  +R +I +V+   EPT + Q  K   W
Sbjct: 953  DYYCN-------LSTNGVEHPISDFLSYDGLSTPYRAYICSVTKYAEPTSFTQARKSDDW 1012

Query: 1174 RKAMDDEIAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQ 1233
             +AM++E+ A+E T TW +  LPSG+ ++GC+WVYKVK  ADG+++RYKARLVAKGY QQ
Sbjct: 1013 LQAMNEELKALEGTATWEICSLPSGKHSIGCRWVYKVKLNADGSLERYKARLVAKGYTQQ 1072

Query: 1234 EGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYT 1293
            EG+DF++TFSPVAK+ TVK LL++ A+  WSL QLD++NAFLNGDL EE+YM+LP G YT
Sbjct: 1073 EGVDFVDTFSPVAKMTTVKTLLAVAAAKKWSLHQLDISNAFLNGDLEEEIYMTLPPG-YT 1132

Query: 1294 DRKSSGCTP-IVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHGISF 1353
             +      P  VCKL KS+YGLKQASRQWF KFS+ L++ GF +S+ D++LF+R     +
Sbjct: 1133 AKDGENLPPNAVCKLKKSLYGLKQASRQWFLKFSTTLMSLGFQKSQADHTLFVRNQNGKY 1192

Query: 1354 VALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRK 1413
            +A+LVYVDDI+I      E+  +K+ L + F L+DLG  KYFLGLE++R+  GI + QRK
Sbjct: 1193 LAVLVYVDDIIIASNDDEEVIQLKEDLQKAFKLRDLGSVKYFLGLEIARNASGISVCQRK 1252

Query: 1414 YCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVSRPD 1473
            Y L L++++G LA KP + PM P+L+L ++G    +   D ++YRRLVG+++YL ++RPD
Sbjct: 1253 YALGLLDETGLLACKPSSIPMEPSLKLISDGDEPPMK--DPAAYRRLVGKMMYLTITRPD 1312

Query: 1474 ISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCP 1533
            I+F V+ L QF A P   H+     +L Y+KGT G G+   +  D  L+A+ D+DW SC 
Sbjct: 1313 ITFAVNKLCQFTAAPKESHMKAACKVLHYVKGTIGTGLFYSADCDLTLQAYTDADWASCR 1372

Query: 1534 DSRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDLQVP 1593
            DSR+S +GFC+FLG SL+SWKSKKQ T S SSAE+EYRA+     E+ WL +LL++ Q P
Sbjct: 1373 DSRRSTSGFCMFLGTSLISWKSKKQPTASHSSAESEYRAMEFAVREVAWLVNLLREFQAP 1432

Query: 1594 LRSPSVVYCDNLAAIAIANNPTFHERTKHIEIDCHFVRDLIQQRILRLLPIRSNLQLADM 1596
                   +CD+ AAI IANN  FHERTKH+E+DCH VRD +   +++ L ++++ Q+AD+
Sbjct: 1433 QTKSVPFFCDSTAAIHIANNAVFHERTKHVELDCHIVRDKVISGLIKTLHVKTDQQIADV 1452

BLAST of Lag0024687 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 593.6 bits (1529), Expect = 9.1e-168
Identity = 457/1506 (30.35%), Positives = 683/1506 (45.35%), Query Frame = 0

Query: 234  LGAVNYTSWSRAMRMVISGKNKLGFITGKISKPQEEGALLEA---------WECNNDIIA 293
            L + NY  WSR +  +  G    GF+ G  + P        A         W+  + +I 
Sbjct: 26   LTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVNPDYTRWKRQDKLIY 85

Query: 294  SWILNSVSKEIAASIVYTGSVKAVWDELQERFKQASGPGIYQLRKDLVTLRQGSMSVEVY 353
            S +L ++S  +  ++    +   +W+ L++ +   S   + QLR  L    +G+ +++ Y
Sbjct: 86   SAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDY 145

Query: 354  YTKLKTIWQDLSDLRPTASCTCGGLKPFLEHLDSEYVMTFLMGLNETYAAIRAQILLMKP 413
               L T +  L+ L           KP ++H   E V   L  L E Y  +  QI     
Sbjct: 146  MQGLVTRFDQLALLG----------KP-MDH--DEQVERVLENLPEEYKPVIDQIAAKDT 205

Query: 414  LPSITEAFSLLIQEEHQRSAGILGPSPDPIALAVNDTSKTSDPPRRKENSGQRPVCSHCG 473
             P++TE    L+  E   S  +   S   I +  N  S  +       N+G R       
Sbjct: 206  PPTLTEIHERLLNHE---SKILAVSSATVIPITANAVSHRNTTTTNNNNNGNR------- 265

Query: 474  IKGHVKDRCYKLHGYPPGYKFRSSNSPDNSASAK-----SVVAANSAAASSCPPATPNFF 533
               +  D     +   P  +  ++  P+N+ S        +      +A  C     +F 
Sbjct: 266  --NNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRC-SQLQHFL 325

Query: 534  SSLNNVQYGQLMELFNSHLQAAKTDPITVASAVSHATGICHLASSPITSLNDCWIVDSGA 593
            SS+N+ Q                  P T     ++         SP +S N  W++DSGA
Sbjct: 326  SSVNSQQ---------------PPSPFTPWQPRANLA-----LGSPYSSNN--WLLDSGA 385

Query: 594  SRHICHTRAAFRNWRRIDPIS----IVLPTAYRVCVEYVGEIHISA---ALVLRDVLFVP 653
            + HI    + F N     P +    +++     + + + G   +S     L L ++L+VP
Sbjct: 386  THHI---TSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVP 445

Query: 654  DFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIGKAECHHGLYILSNISGSSTDS 713
            +   NL+SV  L  +   SV F      ++D            + G+ +L    G + D 
Sbjct: 446  NIHKNLISVYRLCNANGVSVEFFPASFQVKD-----------LNTGVPLL---QGKTKDE 505

Query: 714  LVTSTVVSDIPAVFSVSA------SIWHSRLGHLSPR---------RLSMLRDTLQFKDS 773
            L    + S  P     S       S WH+RLGH +P           LS+L  + +F   
Sbjct: 506  LYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVISNYSLSVLNPSHKF--- 565

Query: 774  CDASCTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTYDGYRYFLTLVDDASR 833
               SC+ C + K  ++PF+ +   ++   + I++DVW     S +D YRY++  VD  +R
Sbjct: 566  --LSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSSPILS-HDNYRYYVIFVDHFTR 625

Query: 834  FTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPE-LKFTDFFASTGTIHQFS 893
            +TW+Y L+QKS V      F  ++E +F   I  F SDN  E +   ++F+  G  H  S
Sbjct: 626  YTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTS 685

Query: 894  CVERPQQNSVVERKHQHLLNVARSLY---------------------------------- 953
                P+ N + ERKH+H++    +L                                   
Sbjct: 686  PPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESP 745

Query: 954  FQSL-----DYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGYPPGMKGYRLYDISKK 1013
            FQ L     +Y  LRVFGC CY       + K D ++R CV LGY      Y    +   
Sbjct: 746  FQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTS 805

Query: 1014 QVFISRDVTFIEENFPFHSII---------NNDQLVDVFPGLVLP--LPVFDFVPSPSVP 1073
            +++ISR V F E  FPF + +           +      P   LP   PV    PS S P
Sbjct: 806  RLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVLP-APSCSDP 865

Query: 1074 NNTAATPSEVGFPPAGLPSDVSAENGDMLLQAS-PAASDADSPAQS--TGVIQPRRSTRQ 1133
            ++ A  PS    P     S VS+ N D    +S P++ +  +P Q+      QP ++  Q
Sbjct: 866  HHAATPPSSPSAPFRN--SQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQ 925

Query: 1134 RH--------------PPDFLKDYHC-------------------------HLLRHDVPL 1193
             H              P    +                              +L H  P 
Sbjct: 926  THSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPP 985

Query: 1194 LSHDVPHSLDKYVSYRRFS----------NDHRQFILNVSTDFEPTYYHQTVKFAPWRKA 1253
            L+  V ++    ++               N      ++++ + EP    Q +K   WR A
Sbjct: 986  LAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRTAIQALKDERWRNA 1045

Query: 1254 MDDEIAAMERTNTWTLV-PLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEG 1313
            M  EI A    +TW LV P PS    VGC+W++  KY +DG+++RYKARLVAKGYNQ+ G
Sbjct: 1046 MGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPG 1105

Query: 1314 IDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDR 1373
            +D+ ETFSPV K  +++++L +     W + QLDVNNAFL G L ++VYMS P G+    
Sbjct: 1106 LDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKD 1165

Query: 1374 KSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHGISFVAL 1433
            + +     VCKL K++YGLKQA R W+ +  + LL  GF  S +D SLF+   G S V +
Sbjct: 1166 RPN----YVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYM 1225

Query: 1434 LVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYCL 1493
            LVYVDDILITG   T +    D L + F +KD     YFLG+E  R   G++LSQR+Y L
Sbjct: 1226 LVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYIL 1285

Query: 1494 QLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVSRPDISF 1553
             L+  +  + AKPV  PM P+ +LS     +L    D + YR +VG L YL  +RPDIS+
Sbjct: 1286 DLLARTNMITAKPVTTPMAPSPKLSLYSGTKL---TDPTEYRGIVGSLQYLAFTRPDISY 1345

Query: 1554 TVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPDSR 1600
             V+ LSQF+  P   HL  +  +LRYL GT   GI L       L A++D+DW    D  
Sbjct: 1346 AVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDY 1405

BLAST of Lag0024687 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 557.0 bits (1434), Expect = 9.5e-157
Identity = 447/1525 (29.31%), Positives = 665/1525 (43.61%), Query Frame = 0

Query: 234  LGAVNYTSWSRAMRMVISGKNKLGFITGKISKP---------QEEGALLEAWECNNDIIA 293
            L + NY  WSR +  +  G    GF+ G    P                  W   + +I 
Sbjct: 26   LTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDAVPRVNPDYTRWRRQDKLIY 85

Query: 294  SWILNSVSKEIAASIVYTGSVKAVWDELQERFKQASGPGIYQLRKDLVTLRQGSMSVEVY 353
            S IL ++S  +  ++    +   +W+ L++ +   S   + QLR                
Sbjct: 86   SAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLR---------------- 145

Query: 354  YTKLKTIWQDLSDLRPTASCTCGGLKPFLEHLDSEYVMTFLMGLNETYAAIRAQILLMKP 413
                 T +  L+ L           KP ++H   E V   L  L + Y  +  QI     
Sbjct: 146  ---FITRFDQLALLG----------KP-MDH--DEQVERVLENLPDDYKPVIDQIAAKDT 205

Query: 414  LPSITEAFSLLIQEEHQ----RSAGILGPSPDPIALAVNDTSKT---------------- 473
             PS+TE    LI  E +     SA ++  + + +     +T++                 
Sbjct: 206  PPSLTEIHERLINRESKLLALNSAEVVPITANVVTHRNTNTNRNQNNRGDNRNYNNNNNR 265

Query: 474  ------SDPPRRKENSGQRPV---CSHCGIKGHVKDRCYKLHGYPPGYKFRSSNSPDNSA 533
                  S    R +N   +P    C  C ++GH   RC +LH +      + S SP    
Sbjct: 266  SNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPW 325

Query: 534  SAKSVVAANSAAASS----CPPATPNFFSSLNNVQYGQLMELFNSHLQAAKTDPITVASA 593
              ++ +A NS   ++       AT +  S  NN+ + Q                      
Sbjct: 326  QPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQ---------------------- 385

Query: 594  VSHATGICHLASSPITSLNDCWIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRVCV 653
                         P T  +D  I D G++  I HT +A             LPT+     
Sbjct: 386  -------------PYTGGDDVMIAD-GSTIPITHTGSA------------SLPTS----- 445

Query: 654  EYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIG 713
                    S +L L  VL+VP+   NL+SV  L  +   SV F      ++D      + 
Sbjct: 446  --------SRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLL 505

Query: 714  KAECHHGLYILSNISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGH---------LSP 773
            + +    LY    I+ S   S+  S            + S WHSRLGH         +S 
Sbjct: 506  QGKTKDELYEWP-IASSQAVSMFAS-------PCSKATHSSWHSRLGHPSLAILNSVISN 565

Query: 774  RRLSMLRDTLQFKDSCDASCTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTY 833
              L +L  + +       SC+ C + K  ++PF+++   +S   + I++DVW     S  
Sbjct: 566  HSLPVLNPSHKL-----LSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPILS-I 625

Query: 834  DGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPE-LK 893
            D YRY++  VD  +R+TW+Y L+QKS V      F  +VE +F   I    SDN  E + 
Sbjct: 626  DNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVV 685

Query: 894  FTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVA----------------------- 953
              D+ +  G  H  S    P+ N + ERKH+H++ +                        
Sbjct: 686  LRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVY 745

Query: 954  ----------------RSLYFQSLDYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGY 1013
                            + L+ Q  +Y  L+VFGC CY       R K + +++ C  +GY
Sbjct: 746  LINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGY 805

Query: 1014 PPGMKGYRLYDISKKQVFISRDVTFIEENFPFHSI-----INNDQLVDVFPG-------- 1073
                  Y    I   +++ SR V F E  FPF +       + +Q  D  P         
Sbjct: 806  SLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLP 865

Query: 1074 ---LVLPLPV-----FDFVPSP----------------------SVPNNTAAT-PSEVGF 1133
               LVLP P       D  P P                      S P+++  T PS  G 
Sbjct: 866  TTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGP 925

Query: 1134 PPAGLPSDV--SAENGDMLLQASPAASDADSPAQ-----------------STGVIQPR- 1193
             P   P     S  N  +L   +P +   +SP Q                 ST + +P  
Sbjct: 926  QPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNS 985

Query: 1194 ---RSTRQRHPPDFLKDYHCHLLRHDVPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVST 1253
                ST     P  L       +    P+ +H +       +   R  N    +  +++ 
Sbjct: 986  PSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGI---RKPNQKYSYATSLAA 1045

Query: 1254 DFEPTYYHQTVKFAPWRKAMDDEIAAMERTNTWTLV-PLPSGRRAVGCKWVYKVKYKADG 1313
            + EP    Q +K   WR+AM  EI A    +TW LV P P     VGC+W++  K+ +DG
Sbjct: 1046 NSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDG 1105

Query: 1314 TVDRYKARLVAKGYNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLN 1373
            +++RYKARLVAKGYNQ+ G+D+ ETFSPV K  +++++L +     W + QLDVNNAFL 
Sbjct: 1106 SLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQ 1165

Query: 1374 GDLFEEVYMSLPLGYYTDRKSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQ 1433
            G L +EVYMS P G+    +       VC+L K+IYGLKQA R W+ +  + LL  GF  
Sbjct: 1166 GTLTDEVYMSQPPGFVDKDRPD----YVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVN 1225

Query: 1434 SKTDYSLFIRGHGISFVALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLG 1493
            S +D SLF+   G S + +LVYVDDILITG  T  +    D L + F +K+     YFLG
Sbjct: 1226 SISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLG 1285

Query: 1494 LELSRSDKGIYLSQRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSY 1553
            +E  R  +G++LSQR+Y L L+  +  L AKPV  PM  + +L+ +   +L    D + Y
Sbjct: 1286 IEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKL---PDPTEY 1345

Query: 1554 RRLVGRLLYLQVSRPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSR 1600
            R +VG L YL  +RPD+S+ V+ LSQ++  P   H + +  +LRYL GT   GI L    
Sbjct: 1346 RGIVGSLQYLAFTRPDLSYAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGN 1405

BLAST of Lag0024687 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 500.4 bits (1287), Expect = 1.1e-139
Identity = 347/1082 (32.07%), Positives = 538/1082 (49.72%), Query Frame = 0

Query: 573  WIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRVCVEYVGEI----HISAALVLRDV 632
            W+VD+ AS H    R  F  +   D  ++ +       +  +G+I    ++   LVL+DV
Sbjct: 294  WVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDV 353

Query: 633  LFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIGKAECHHGLYILSNISGS 692
              VPD   NL+S   L + G +   F N    +    L  +I K      LY        
Sbjct: 354  RHVPDLRMNLISGIALDRDG-YESYFANQKWRLTKGSL--VIAKGVARGTLY-------- 413

Query: 693  STDSLVTSTVVSDIPAVFSVSASIWHSRLGHLSPRRLSML--RDTLQF-KDSCDASCTVC 752
             T++ +    ++   A   +S  +WH R+GH+S + L +L  +  + + K +    C  C
Sbjct: 414  RTNAEICQGELN--AAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYC 473

Query: 753  PLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTYDGYRYFLTLVDDASRFTWVYLLR 812
               KQ R+ F +++    N+ DL+++DV GP+   +  G +YF+T +DDASR  WVY+L+
Sbjct: 474  LFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILK 533

Query: 813  QKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPEL---KFTDFFASTGTIHQFSCVERPQ 872
             K  V  +  +F  +VE +  + +K  RSDN  E    +F ++ +S G  H+ +    PQ
Sbjct: 534  TKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQ 593

Query: 873  QNSVVERKHQHLLNVARSLY---------------------------------------F 932
             N V ER ++ ++   RS+                                         
Sbjct: 594  HNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTN 653

Query: 933  QSLDYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGYPPGMKGYRLYDISKKQVFISR 992
            + + YS L+VFGC  +A      R+K D ++ PC+ +GY     GYRL+D  KK+V  SR
Sbjct: 654  KEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSR 713

Query: 993  DVTFIEENFPFHSIINNDQLVDVFPGLVLPLPVFDFVPSPSVPNN-TAATPSEVGFPPAG 1052
            DV F E      + ++      + P         +FV  PS  NN T+A  +       G
Sbjct: 714  DVVFRESEVRTAADMSEKVKNGIIP---------NFVTIPSTSNNPTSAESTTDEVSEQG 773

Query: 1053 LPSDVSAENGDMLLQASPAASDADSPAQSTGVIQP-RRSTRQRHPPDFLKDYHCHLLRHD 1112
                   E G+   Q      + + P Q     QP RRS R R                 
Sbjct: 774  EQPGEVIEQGE---QLDEGVEEVEHPTQGEEQHQPLRRSERPR----------------- 833

Query: 1113 VPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVKFA---PWRKAMDDE 1172
                           V  RR+ +   +++L +S D EP    + +         KAM +E
Sbjct: 834  ---------------VESRRYPS--TEYVL-ISDDREPESLKEVLSHPEKNQLMKAMQEE 893

Query: 1173 IAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLE 1232
            + ++++  T+ LV LP G+R + CKWV+K+K   D  + RYKARLV KG+ Q++GIDF E
Sbjct: 894  MESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDE 953

Query: 1233 TFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGC 1292
             FSPV K+ +++ +LSL AS    + QLDV  AFL+GDL EE+YM  P G+    + +G 
Sbjct: 954  IFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGF----EVAGK 1013

Query: 1293 TPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSL-FIRGHGISFVALLVYV 1352
              +VCKLNKS+YGLKQA RQW+ KF S + +  + ++ +D  + F R    +F+ LL+YV
Sbjct: 1014 KHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYV 1073

Query: 1353 DDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLEL--SRSDKGIYLSQRKYCLQL 1412
            DD+LI G     I+ +K  L + F +KDLG A+  LG+++   R+ + ++LSQ KY  ++
Sbjct: 1074 DDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERV 1133

Query: 1413 IEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASS---YRRLVGRLLYLQV-SRPDI 1472
            +E      AKPV+ P+  +L+LS       +      +   Y   VG L+Y  V +RPDI
Sbjct: 1134 LERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDI 1193

Query: 1473 SFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPD 1532
            +  V  +S+F+  P   H + V  +LRYL+GT G   L     D  LK + D+D     D
Sbjct: 1194 AHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGD-CLCFGGSDPILKGYTDADMAGDID 1253

Query: 1533 SRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDLQVPL 1592
            +RKS TG+        +SW+SK Q  V+ S+ EAEY A      E+IWL   L++L +  
Sbjct: 1254 NRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLH- 1309

Query: 1593 RSPSVVYCDNLAAIAIANNPTFHERTKHIEIDCHFVRDLIQQRILRLLPIRSNLQLADMF 1594
            +   VVYCD+ +AI ++ N  +H RTKHI++  H++R+++    L++L I +N   ADM 
Sbjct: 1314 QKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADML 1309

BLAST of Lag0024687 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 411.0 bits (1055), Expect = 8.4e-113
Identity = 353/1265 (27.91%), Positives = 553/1265 (43.72%), Query Frame = 0

Query: 451  KENSGQRPVCSHCGIKGHVKDRCYKLHGYPPGYKFRSSNSPDNSASAKSVVAANSAAASS 510
            K NS  +  C HCG +GH+K  C+                                    
Sbjct: 223  KGNSKYKVKCHHCGREGHIKKDCF------------------------------------ 282

Query: 511  CPPATPNFFSSLNNVQYGQLMELFNSHLQAAKTDPITVASAVSHATGICHLASSPITSLN 570
                  ++   LNN                 K +   V +A SH         +  + ++
Sbjct: 283  ------HYKRILNNKN---------------KENEKQVQTATSHGIAFMVKEVNNTSVMD 342

Query: 571  DC-WIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRVCVEYVGE-----------IH 630
            +C +++DSGAS H+ +  + +      D + +V P   ++ V   GE           + 
Sbjct: 343  NCGFVLDSGASDHLINDESLY-----TDSVEVVPP--LKIAVAKQGEFIYATKRGIVRLR 402

Query: 631  ISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIGKAECHHG 690
                + L DVLF  + A NLMSV  L ++G  S+ F        DK  +T+       +G
Sbjct: 403  NDHEITLEDVLFCKEAAGNLMSVKRLQEAG-MSIEF--------DKSGVTI-----SKNG 462

Query: 691  LYILSNISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGHLSPRRLSMLRDTLQFKD-- 750
            L ++ N SG   +  V +     I A    +  +WH R GH+S  +L  ++    F D  
Sbjct: 463  LMVVKN-SGMLNNVPVINFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQS 522

Query: 751  -------SCDASCTVCPLAKQKRMPF---TSNNHVASNVFDLIHADVWGPLNTSTYDGYR 810
                   SC+  C  C   KQ R+PF       H+   +F ++H+DV GP+   T D   
Sbjct: 523  LLNNLELSCEI-CEPCLNGKQARLPFKQLKDKTHIKRPLF-VVHSDVCGPITPVTLDDKN 582

Query: 811  YFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPEL---KFT 870
            YF+  VD  + +   YL++ KSDV  +   F    E  F+  +     DN  E    +  
Sbjct: 583  YFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMR 642

Query: 871  DFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYF-QSLDYSF------------ 930
             F    G  +  +    PQ N V ER  + +   AR++     LD SF            
Sbjct: 643  QFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLI 702

Query: 931  ----------------------------LRVFGCLCYASTLTAGRSKFDPRARPCVLLGY 990
                                        LRVFG   Y   +   + KFD ++   + +GY
Sbjct: 703  NRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVH-IKNKQGKFDDKSFKSIFVGY 762

Query: 991  PPGMKGYRLYDISKKQVFISRDVTFIEENF------PFHSIINNDQ-------------- 1050
             P   G++L+D   ++  ++RDV   E N        F ++   D               
Sbjct: 763  EP--NGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRK 822

Query: 1051 -LVDVFPGLVLPLPVFDFV------PSPSVPNNTAATPSEVGFPPAGLPSD------VSA 1110
             +   FP          F+       + + PN++     +  FP      D       S 
Sbjct: 823  IIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKI-IQTEFPNESKECDNIQFLKDSK 882

Query: 1111 ENGDMLLQASPAASDADSPAQSTGVIQPRRSTRQRHPPDFLKDYHC-HLLRHD-VPLLSH 1170
            E+    L  S      D   +S G   P  S R+    + LK+    +  ++D + +++ 
Sbjct: 883  ESNKYFLNESKKRKRDDHLNESKGSGNPNES-RESETAEHLKEIGIDNPTKNDGIEIINR 942

Query: 1171 DVPHSLDK-YVSYRRFSNDHRQFILNVSTDFE--PTYYHQTVKF----APWRKAMDDEIA 1230
                   K  +SY    N   + +LN  T F   P  + + +++    + W +A++ E+ 
Sbjct: 943  RSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDE-IQYRDDKSSWEEAINTELN 1002

Query: 1231 AMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLETF 1290
            A +  NTWT+   P  +  V  +WV+ VKY   G   RYKARLVA+G+ Q+  ID+ ETF
Sbjct: 1003 AHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETF 1062

Query: 1291 SPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGCTP 1350
            +PVA+I + + +LSL   +   + Q+DV  AFLNG L EE+YM LP G   +  +     
Sbjct: 1063 APVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDN----- 1122

Query: 1351 IVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFI--RGHGISFVALLVYVD 1410
             VCKLNK+IYGLKQA+R WF  F   L    F  S  D  ++I  +G+    + +L+YVD
Sbjct: 1123 -VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVD 1182

Query: 1411 DILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYCLQLIED 1470
            D++I     T ++  K  L   F + DL   K+F+G+ +   +  IYLSQ  Y  +++  
Sbjct: 1183 DVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSK 1242

Query: 1471 SGYLAAKPVNHPMIPNLRLSANGSGELLNADD--ASSYRRLVGRLLYLQV-SRPDISFTV 1530
                    V+ P+   +        ELLN+D+   +  R L+G L+Y+ + +RPD++  V
Sbjct: 1243 FNMENCNAVSTPLPSKINY------ELLNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAV 1302

Query: 1531 HNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLK--AFADSDWGSCPDSR 1590
            + LS++ +K        +  +LRYLKGT    ++   +  F  K   + DSDW      R
Sbjct: 1303 NILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDR 1362

Query: 1591 KSVTGFCI-FLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDLQVPLR 1598
            KS TG+       +L+ W +K+Q +V+ SS EAEY AL     E +WL  LL  + + L 
Sbjct: 1363 KSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLE 1388

BLAST of Lag0024687 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 200.3 bits (508), Expect = 2.3e-49
Identity = 100/229 (43.67%), Positives = 146/229 (63.76%), Query Frame = 0

Query: 1293 LLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYC 1352
            LL+YVDDIL+TG S T ++ +   L   F +KDLG   YFLG+++     G++LSQ KY 
Sbjct: 3    LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 1353 LQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVSRPDIS 1412
             Q++ ++G L  KP++ P    L L  N S       D S +R +VG L YL ++RPDIS
Sbjct: 63   EQILNNAGMLDCKPMSTP----LPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDIS 122

Query: 1413 FTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPDS 1472
            + V+ + Q + +P +   D++  +LRY+KGT   G+ +H +   +++AF DSDW  C  +
Sbjct: 123  YAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTST 182

Query: 1473 RKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLS 1522
            R+S TGFC FLG +++SW +K+Q TVSRSS E EYRALA  ++EL W S
Sbjct: 183  RRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTWSS 227
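
Note on the HSP summary lines: the Identity and Positives percentages are the match counts divided by the alignment length. The sketch below is a minimal, hypothetical helper (the name `parse_hsp_summary` and the regular expression are assumptions for illustration, not part of any BLAST tool) that parses one summary line as printed on this page and recomputes both percentages.

```python
import re

# Hypothetical pattern for the summary line as printed on this page.
# "Posi?tives" accepts both the spelling "Positives" and the variant "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positive, pos_length, _ = m.groups()
    identical, length = int(identical), int(length)
    positive, pos_length = int(positive), int(pos_length)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100.0 * identical / length, 2),
        "positives_pct": round(100.0 * positive / pos_length, 2),
    }


summary = parse_hsp_summary(
    "Identity = 100/229 (43.67%), Positives = 146/229 (63.76%), Query Frame = 0"
)
print(summary["identity_pct"], summary["positives_pct"])
```

For the P92519 match above, the recomputed values agree with the reported 43.67% identity and 63.76% positives over an alignment length of 229.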

BLAST of Lag0024687 vs. ExPASy TrEMBL
Match: A0A2Z7AT15 (Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Dorcoceras hygrometricum OX=472368 GN=F511_01974 PE=4 SV=1)

HSP 1 Score: 1215.7 bits (3144), Expect = 0.0e+00
Identity = 653/1456 (44.85%), Positives = 910/1456 (62.50%), Query Frame = 0

Query: 193  GTGSANSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMVISG 252
            G G   ++   +  T++E   +PY+LH+       LVS PL+G+ NY +W RAM + ++ 
Sbjct: 4    GGGGQVANQLPIVRTTLEDSSSPYYLHNGDHPGLTLVSNPLIGS-NYNTWRRAMIVALTA 63

Query: 253  KNKLGFITGKISKPQEEGALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDEL 312
            KNKLGFI   I +P+ E  L  +W   N ++ SWILNSV++ IA S++Y  + + +W +L
Sbjct: 64   KNKLGFIDRSIDRPRSEDLLYGSWIRCNSMVISWILNSVARNIADSLMYMQTAEEIWTDL 123

Query: 313  QERFKQASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCGGLKPF 372
             ERF +++ P IYQ++K L  L+QGSM V  YYTKL+T+W +L D +PT++CTCG ++ +
Sbjct: 124  YERFHESNAPRIYQIKKLLSGLQQGSMDVSSYYTKLRTLWDELRDYQPTSACTCGSMREW 183

Query: 373  LEHLDSEYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQRS----AGILG 432
              + + E VM FLMGLN++YA +RAQ+L+++PLP+I + F+L+IQEE QRS        G
Sbjct: 184  FNYQNQECVMHFLMGLNDSYAQVRAQVLMIEPLPTIAKVFALVIQEERQRSIHYDVSKAG 243

Query: 433  PSPDPIALAVNDTSKTSDPPRRKENS----GQRPVCSHCGIKGHVKDRCYKLHGYPPGYK 492
                 I   VN ++ T+   R  +NS    G R +CSHC  + H  D+CYKLHGYPPG+ 
Sbjct: 244  VDHSGILSNVNSSANTATSLRTSQNSKGGRGDRIICSHCHFRNHTVDKCYKLHGYPPGHP 303

Query: 493  FRSSNSPDNSASAKSVVAANSAAASSCPPATPNFFSSLNNVQYGQLMELFNSHLQAAKT- 552
               S     SA A     A+S++ +       +   SL   Q  QL+E  +S LQ  +  
Sbjct: 304  KFKSQISQGSAHAHQ---ASSSSETHQETQQIDHSDSLTQSQCKQLIEFLSSKLQTRQNL 363

Query: 553  ----DPITVASAVSHATGICHLASSPITSLNDCWIVDSGASRHICHTRAAFRNWRRIDPI 612
                 P T  S +   TGIC   S         WI+D+GA+ HIC + + F++ R I   
Sbjct: 364  LMEHQPETTVSCL---TGICSATSHIPAITRKDWIMDTGATHHICCSLSMFKSSRAIQS- 423

Query: 613  SIVLPTAYRVCVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHC 672
             +VLP    + V   G + +++ LVL++VL+VP F +NL+SVS L  +   SV+F ++ C
Sbjct: 424  KVVLPNTLTIPVTIAGTVAVTSNLVLQNVLYVPVFQFNLLSVSSLTDNHNCSVSFMSDSC 483

Query: 673  LIQDKQLLTMIGKAECHHGLYILSNISGSSTDSLVTSTVVSDIPAVFSVSASIWHSRLGH 732
             IQD   + MIG  +    LY+L        D  + S + +     F  ++ +WH R+GH
Sbjct: 484  KIQDISQIRMIGMGKRIGNLYVL-----QQPDRFLPSYICN----TFVSNSELWHRRMGH 543

Query: 733  LSPRRLSMLRDTLQFKDSCDAS-CTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLN 792
             S  +LS L++ L  +++   + C  C L+KQ+R+P  S N++++ +F+L+H D WGP +
Sbjct: 544  PSFNKLSSLKNVLNIENTDIVNICHSCHLSKQRRLPLASRNNISARIFELLHIDTWGPFS 603

Query: 793  TSTYDGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAP 852
             ++ DG+R+F T+VDD SR+TWVY+L+ KSDVL I P F +MV TQF  T+K  RSDNAP
Sbjct: 604  QTSVDGFRFFFTIVDDHSRYTWVYMLKSKSDVLSIFPDFCRMVSTQFGVTVKSVRSDNAP 663

Query: 853  ELKFTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQS---LD-------- 912
            EL F DFFA  G  H  SCVERPQQNSVVERKHQH+LNVAR+L FQS   LD        
Sbjct: 664  ELGFADFFAKAGITHYHSCVERPQQNSVVERKHQHILNVARALLFQSHIPLDYWCDCINT 723

Query: 913  ----------------------------YSFLRVFGCLCYASTLTAGRSKFDPRARPCVL 972
                                        YS L+VFGCLCYASTL + R KF PRA  CV 
Sbjct: 724  SVYLINRTPSPILAHKTPFELLHGKLPSYSHLKVFGCLCYASTLLSSRHKFSPRAIRCVF 783

Query: 973  LGYPPGMKGYRLYDISKKQVFISRDVTFIEENFPFHSIINNDQLVDVFPGLVLPLPVFDF 1032
            +GYPPG KGY+L ++   ++FISRDV F E  FP+                         
Sbjct: 784  IGYPPGYKGYKLLNLETNEIFISRDVIFHENTFPY------------------------- 843

Query: 1033 VPSPSVPNNTAATPSEVGFPPAGLPSDVSAENGDMLLQASPAASDADSPAQSTGVIQPRR 1092
                    NT+             P  +S    DM  + SP  S   +P+      Q  R
Sbjct: 844  -------QNTS-------------PMSLS----DMTFEVSP--SSQITPSIPADAQQHSR 903

Query: 1093 STRQRHPPDFLKDYHCHLLRHDVPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEP 1152
            ++R  + P  L+DYHC+ +    P  S    H +   V+Y + S+ HR F+ N+S+  EP
Sbjct: 904  TSRPHNTPSHLRDYHCYSI--STP-CSTSTAHPIHPLVNYSKLSSSHRAFVQNISSILEP 963

Query: 1153 TYYHQTVKFAPWRKAMDDEIAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRY 1212
            T + Q V    WR+AMD+E+ A+E  +TW++V LP G+ AVGC+WVYK K+ ADG++ RY
Sbjct: 964  TTFSQAVSLPEWRQAMDEELKALELNHTWSIVSLPQGKSAVGCRWVYKAKFAADGSLQRY 1023

Query: 1213 KARLVAKGYNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFE 1272
            KARLVAKGY QQEG+D+LETFSPVAK+VTV+ LL+L A  GW L+QLDVNNAFL+GDL E
Sbjct: 1024 KARLVAKGYTQQEGLDYLETFSPVAKLVTVRTLLALAAVRGWFLIQLDVNNAFLHGDLTE 1083

Query: 1273 EVYMSLPLGYYTDRKSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDY 1332
            EVYM+LP G+ ++ +    +  VCKL+KSIYGLKQASRQWF+KFSS LL+ GF QS  D 
Sbjct: 1084 EVYMTLPPGFCSEGELP--SRAVCKLHKSIYGLKQASRQWFAKFSSTLLSIGFIQSHADN 1143

Query: 1333 SLFIRGHGISFVALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSR 1392
            SLFIR     F+AL+VYVDDI+I        S +KD L   F LKDLG+ KYFLG+E++R
Sbjct: 1144 SLFIRSDKNIFLALVVYVDDIVIATNDQNAASELKDFLNSKFKLKDLGNLKYFLGIEVAR 1203

Query: 1393 SDKGIYLSQRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVG 1452
            S +G+ + QR Y + L+ ++G L  KP   PM  N +L A  SGE+L+  D +SYRRL+G
Sbjct: 1204 STRGVSICQRNYAMTLLTEAGLLGCKPRTTPMEANTKL-AQDSGEMLS--DPASYRRLIG 1263

Query: 1453 RLLYLQVSRPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLK 1512
            RLLYL ++RPD+ F V+ LSQ+++ P + H++   ++L+Y+KGT GQG+   SS D  L+
Sbjct: 1264 RLLYLTITRPDLVFAVNKLSQYVSMPRIPHMEAALNILKYVKGTVGQGLFYSSSSDLKLR 1323

Query: 1513 AFADSDWGSCPDSRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIW 1572
            AF+D+DWG+C D+R+SVTG+C+FLG+SL+SW++KKQ TVSRSSAEAEYR+LA  + E++W
Sbjct: 1324 AFSDADWGACLDTRRSVTGYCVFLGESLISWRAKKQQTVSRSSAEAEYRSLAASTCEILW 1383

Query: 1573 LSHLLKDLQVPLRSPSVVYCDNLAAIAIANNPTFHERTKHIEIDCHFVRDLIQQRILRLL 1596
            +  LL DL V    P+V++CD+ AA+ IA+NP FHERTKHI+IDCH VR+ +QQ+I++L+
Sbjct: 1384 IHQLLADLGVTYNEPTVLFCDSQAAVHIASNPVFHERTKHIDIDCHIVREKVQQKIVKLM 1383

BLAST of Lag0024687 vs. ExPASy TrEMBL
Match: A0A2N9EHN7 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS2137 PE=4 SV=1)

HSP 1 Score: 1201.8 bits (3108), Expect = 0.0e+00
Identity = 676/1540 (43.90%), Positives = 936/1540 (60.78%), Query Frame = 0

Query: 190  SMAGTGSANSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMV 249
            S + + ++N +SSN     IE   +PY+L++       +V  PL G  NY +W R+M   
Sbjct: 10   SSSSSSTSNQTSSNSMPIPIENSRSPYYLNNGDNPGIRIVPDPLTGD-NYQAWRRSMTTA 69

Query: 250  ISGKNKLGFITGKISKPQEEGALL-EAWECNNDIIASWILNSVSKEIAASIVYTGSVKAV 309
            +S KNKLGF+ G I +P +E  LL   W+  ND++ SWI N +SK+I A+++Y  + K V
Sbjct: 70   LSAKNKLGFVNGSILQPNDESDLLFSDWQRCNDLVLSWITNCLSKQIHATVLYVYTAKEV 129

Query: 310  WDELQERFKQASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCG- 369
            WD+LQ+R+ Q++G  ++ L++ + +L+Q +M V  Y+T+LK +W +  + RP   CTCG 
Sbjct: 130  WDDLQQRYSQSNGTRVHHLKQAIASLKQDNMPVSDYFTQLKGLWDEFLNYRPIPGCTCGA 189

Query: 370  ----GL-KPFLEHLDSEYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQR 429
                GL +  +++   +YV +FLMGLN+++A +R QILLM+PLP+I + FSL+  +E QR
Sbjct: 190  KCMCGLSRTLMDYQHYDYVHSFLMGLNDSFAPVRGQILLMEPLPNINKVFSLIQNDEKQR 249

Query: 430  SAGILG-PSPDPIALAVNDTSKTSDPPR---------------RKENSGQ--------RP 489
             AG+L  P+  P   +    S+  + P                R +NS Q        +P
Sbjct: 250  GAGLLPLPTGFPTVGSTALLSRLENGPNTALSYPNTGPNAFFTRTDNSKQYYQYPRKDKP 309

Query: 490  --VCSHCGIKGHVKDRCYKLHGYPPGYKFRSSNSPDNSASAKSVVAANSAAASSCPPATP 549
              +CSHCG KGH  D+CYKLHGYPPG++ +  N    S  + S V  + +A +    + P
Sbjct: 310  PCICSHCGYKGHTADKCYKLHGYPPGFRSKGRNIAVASQVSSSAVPHSESANN--VQSIP 369

Query: 550  NFFSSLNNVQYGQLMELF-------------NSHLQAAKTDPITVASAVSHATG------ 609
            N   +  +VQ  QL+ +              ++H  AA    I+V    S+  G      
Sbjct: 370  NL--AAMSVQCQQLLNMLTTQAQQTNSVSDSHNHQAAASISSISVTQPHSNMAGKPTCLS 429

Query: 610  ------ICHLASSPITSLND-----CWIVDSGASRHICHTRAAFRNWRRIDPISIVLPTA 669
                  + H   S   ++        W++D+GA+ H+  T   +     +D IS+ LP  
Sbjct: 430  TFSKPNMDHSVFSAKFTVKPHFSPAQWVIDTGATDHMVITTQFYTTMHCVDNISVNLPNG 489

Query: 670  YRVCVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQL 729
              V V ++G + I+  L+L DVL VP F +NL+SVS L  S    + F + +C IQD   
Sbjct: 490  QSVLVTHIGSVQITPTLLLTDVLCVPSFDFNLISVSKLTSSLHCCIFFLSTYCFIQDLMH 549

Query: 730  LTMIGKAECHHGLYILSNISGSSTDSLVTSTVVSDI-PAVFSVSA--------SIWHSRL 789
              MIG  + H+GLY+L   S S+  +    +  SD+   ++S+S+         +WH R 
Sbjct: 550  WRMIGMGKQHNGLYLLDFSSDSTNTAAAALSSDSDLHKHLYSLSSIKNSNKDIHVWHCRF 609

Query: 790  GHLSPRR---LSMLRDTLQFKDSCDASCTVCPLAKQKRMPFTSNNHVASNVFDLIHADVW 849
            GH S  R   LS +   +       ++CTVCPLAKQKR+PF + NH++ N FDL+H D+W
Sbjct: 610  GHPSLSRMHFLSSIVPNMSLSSEDASTCTVCPLAKQKRLPFPNKNHLSLNSFDLLHIDIW 669

Query: 850  GPLNTSTYDGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRS 909
            GP +  T +GYRYFLTLVDD +R TW+YL+R KSD   ++  F  M++TQF   IK  RS
Sbjct: 670  GPYHVPTVEGYRYFLTLVDDCTRTTWIYLMRSKSDTRPLLTSFITMIQTQFHTMIKQIRS 729

Query: 910  DNAPELKFTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQSL-------- 969
            DN  E    +F+AS G IHQ SCVE PQQNSVVERKHQH+LNVARSL FQS         
Sbjct: 730  DNGQEFHMPEFYASKGIIHQHSCVETPQQNSVVERKHQHILNVARSLCFQSYLPLQYWGH 789

Query: 970  -------------------------------DYSFLRVFGCLCYASTLTAGRSKFDPRAR 1029
                                            Y+ L+VFGCLC+ASTL++ R+KFDPRA+
Sbjct: 790  CIQTAVYLINRLPCPILSNKSPFEALLHKTPSYTHLKVFGCLCFASTLSSHRTKFDPRAQ 849

Query: 1030 PCVLLGYPPGMKGYRLYDISKKQVFISRDVTFIEENFPFHS----------IINNDQLVD 1089
             CV LGYP G+KGY+L D++  +VFISRDV F E  FPF +          + +  + + 
Sbjct: 850  SCVFLGYPSGVKGYKLLDLTTHKVFISRDVVFHETIFPFQTQTPPPDFTTFLNSTPEPIS 909

Query: 1090 VFPGLVLPLPVF--DFVPSPSVPNNTAATPSEVGFPPAGLPSDVSAENGDMLLQASPAAS 1149
              P  +    +   D +P   +P  +A  PS    P   LP    + + D  L +SP+  
Sbjct: 910  TTPHFIPSCSIIADDILPCSPIP-PSAPVPSISTSP---LPFSDISPHLDHTLSSSPSLD 969

Query: 1150 --DADSPAQSTGVIQP-RRSTRQRHPPDFLKDYHCHLLR-----HDVPLLSHDVPHSLDK 1209
              + +SP QS  V  P RRSTR   PP +L+DYHC L          P+ S   P+ L  
Sbjct: 970  HIELNSPGQS--VSSPLRRSTRVHKPPTYLQDYHCQLAHCVGSTSSPPIASSGTPYPLST 1029

Query: 1210 YVSYRRFSNDHRQFILNVSTDFEPTYYHQTVKFAPWRKAMDDEIAAMERTNTWTLVPLPS 1269
             +SY   S  HR F L+V+   EP+ +HQ  +   W++AM  E+AA+E  NTWTL PLP 
Sbjct: 1030 SLSYDHLSPTHRNFALSVTAISEPSSFHQANQNPHWQEAMFAELAALEANNTWTLTPLPP 1089

Query: 1270 GRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLETFSPVAKIVTVKVLLSL 1329
            G+  +GCKWVYKVK K+DG+++RYKARLVAKGY QQEG+D+ ETFSPVAK  TV+ LL++
Sbjct: 1090 GKHPIGCKWVYKVKLKSDGSLERYKARLVAKGYTQQEGLDYSETFSPVAKFSTVRTLLAV 1149

Query: 1330 TASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGCTPIVCKLNKSIYGLKQA 1389
             ++  WSL QLDVNNAFL+GDL EEVYM+LPLG+ +  ++S    +VCKLNKS+YGLKQA
Sbjct: 1150 ASAKNWSLTQLDVNNAFLHGDLAEEVYMALPLGFPSKGETSN---LVCKLNKSLYGLKQA 1209

Query: 1390 SRQWFSKFSSVLLANGFSQSKTDYSLFIRGHGISFVALLVYVDDILITGPSTTEISAVKD 1449
            SRQWF+KFSS ++  GF QS +DYSLF R  GI F+ALLVYVDDILI       ++A+KD
Sbjct: 1210 SRQWFAKFSSTIIKQGFVQSHSDYSLFTRTQGIVFIALLVYVDDILIASNDMPSVNALKD 1269

Query: 1450 ILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYCLQLIEDSGYLAAKPVNHPMIPNL 1509
             L+  F LKDLG+ K+FLGLE++RS KGI L QRKY L ++ DSG L +KP+  PM  NL
Sbjct: 1270 SLHAEFKLKDLGNLKFFLGLEVARSTKGISLCQRKYALDILSDSGMLGSKPMATPMEQNL 1329

Query: 1510 RLSANGSGELLNADDASSYRRLVGRLLYLQVSRPDISFTVHNLSQFIAKPCVRHLDVVFH 1569
            ++S   +GE+L   D S YRRL+GRLLYL V+RPDIS++V  LSQF++KP   HL   + 
Sbjct: 1330 KIS-QSTGEIL--ADPSPYRRLIGRLLYLTVTRPDISYSVQRLSQFMSKPTDIHLTAAYR 1389

Query: 1570 LLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPDSRKSVTGFCIFLGKSLVSWKSKKQ 1596
            +LRY+KGT+GQG+   S  D  LKAF+DSDW  CPD+R+S+TG+C+++G SL+SWKSKKQ
Sbjct: 1390 VLRYIKGTSGQGLFFPSHSDLQLKAFSDSDWAGCPDTRRSITGYCVYIGGSLISWKSKKQ 1449

BLAST of Lag0024687 vs. ExPASy TrEMBL
Match: A0A2N9GZW3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33057 PE=4 SV=1)

HSP 1 Score: 1200.3 bits (3104), Expect = 0.0e+00
Identity = 670/1531 (43.76%), Positives = 926/1531 (60.48%), Query Frame = 0

Query: 190  SMAGTGSANSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMV 249
            SMA + S  S+ ++      +   + Y+LHH     ++LVSQ L+G  NY +WSR+M M 
Sbjct: 419  SMANSDSFPSAPTDTPEYLGDVPTSKYYLHHGDSPGAILVSQSLVGD-NYHTWSRSMVMA 478

Query: 250  ISGKNKLGFITGKISKPQEE-GALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAV 309
            ++ KNK+GF+ G I +PQ+E      AW   N ++ SW+LNS+SKEIA+S++Y  + K +
Sbjct: 479  LTAKNKIGFVNGVIEQPQDEFSPAYNAWVRCNTMVISWLLNSLSKEIASSVIYANTAKEI 538

Query: 310  WDELQERFKQASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCGG 369
            W++L+ERF Q +GP I++++K +  L Q + SV  YYT+LK++W +LS+ RP   C+CG 
Sbjct: 539  WEDLRERFAQGNGPRIFEIQKSISVLSQDNSSVSSYYTRLKSLWDELSNFRPIPDCSCGA 598

Query: 370  LKPFLEHLDSEYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQRSAGI-- 429
            +K  L++   EYVM FLMGLN++++ +RAQIL+  PLPSIT+AF+L+IQEE QR+  I  
Sbjct: 599  MKVLLDNKQHEYVMQFLMGLNDSFSHVRAQILMTDPLPSITKAFALVIQEERQRNINIPS 658

Query: 430  LGPSPDPIALAVNDTSKTSDPPRRKENSGQRPVCSHCGIKGHVKDRCYKLHGYPPGYKFR 489
            L P+ D +AL     +   +  + +     RP+CSHCGI GH  D+CYKLHGYPPGYKF+
Sbjct: 659  LAPAADSVALFTRGEATRHNYGKNQSYKKDRPICSHCGITGHTVDKCYKLHGYPPGYKFK 718

Query: 490  SSNSPDNSASAKSVVAANSAAASSCPP-----ATPNFFSSLNNVQYGQLMELFNSHLQAA 549
            +     + +SA           + C       ++    +SL + Q+    ++ +      
Sbjct: 719  AKMHSAHQSSAVVEDPHLPFTQAQCQQLLSMLSSQASLASLQSSQHPVNNQVVSQESAGT 778

Query: 550  KTDPITVASAVSH-ATGICHLASS-PITSL------------NDCWIVDSGASRHICHTR 609
             + P   ASA+SH  +GI   + + P  S+            +  WI+D+GA+ H+ H+ 
Sbjct: 779  SSTPHQAASAISHFMSGISSFSHTVPKHSIFSVQHVNKTRFSHSTWILDTGATDHMVHSL 838

Query: 610  AAFRNWRRIDPISIVLPTAYRVCVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQS 669
              F +        I LP   +V   ++G + ++ +L+L DVL VP F++NL+S+S L  +
Sbjct: 839  RKFTSITSSINTYIHLPNGEKVLATHIGTVQVTTSLLLTDVLCVPSFSFNLISISKLTNT 898

Query: 670  GEFSVAFTNNHCLIQDKQLLTMIGKAECHHGLYILSN----ISGSSTDSLVTSTVVSDIP 729
                V F ++ C IQD      IG     +GLY L +    +  SS   +   T V++ P
Sbjct: 899  PSCCVFFLSHFCFIQDLVTWKRIGLGRKKNGLYFLQDSTDAVPSSSFPLVAAHTAVNNTP 958

Query: 730  AVFSVSASIWHSRLGHLSPRRLSMLRDTLQ--FKDSCDASCTVCPLAKQKRMPFTSNNHV 789
             VF V    WH RLGH S  RLS+L++ +      S +  C VC ++KQKR+PF +  H 
Sbjct: 959  -VFDV----WHHRLGHPSLSRLSLLKNVISDLVMPSANEHCKVCHISKQKRLPFHTAVHF 1018

Query: 790  ASNVFDLIHADVWGPLNTSTYDGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMV 849
            A   FDLIH D+WGP +  T D  RYFLT+VDD +R TWV+L++QKS+   +I  FF ++
Sbjct: 1019 ADLPFDLIHCDIWGPYHVPTIDQQRYFLTIVDDCTRCTWVFLMKQKSETSPLIQSFFALI 1078

Query: 850  ETQFSKTIKCFRSDNAPELKFTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSL 909
            +TQFS +IK  RSDN PE K   F+A  GT+HQ SCV  PQQN+ VERKHQHLL VAR+L
Sbjct: 1079 KTQFSASIKMVRSDNGPEFKMPSFYAQHGTLHQKSCVGTPQQNATVERKHQHLLMVARAL 1138

Query: 910  YFQS---------------------------------------LDYSFLRVFGCLCYAST 969
             FQ+                                        +YS LRVFGCLCYA+T
Sbjct: 1139 RFQANLPLPFWGYCVLTATHLINRIPTPLLGNKSPFELLFKKLPNYSCLRVFGCLCYAAT 1198

Query: 970  LTAGRSKFDPRARPCVLLGYPPGMKGYRLYDISKKQVFISRDVTFIEENFPFHSIINNDQ 1029
            L+  R KF PR++ CV+LGYP G+KGYRL D+  KQVF+SRDV F E +FPFH++    Q
Sbjct: 1199 LSHNRHKFAPRSKQCVMLGYPQGIKGYRLLDLDTKQVFVSRDVLFYENSFPFHTL----Q 1258

Query: 1030 LVDVFPGLVLPLPVFDFVP--SPSVPNNTAATPSEVGFPPAGLPSDVSAEN-------GD 1089
                   +VLP P+ D     SP   +   +T S +   P   P   S  +         
Sbjct: 1259 PSTPTASMVLPSPITDLPMSLSPITFDTNTSTSSSLFNSPLHSPLSPSHSHTSSPLPVNS 1318

Query: 1090 MLLQASPAASDADSPAQSTGVIQPRRSTRQRHPPDFLKDYHCHLLRHDVPLLSHDVPHS- 1149
             LLQ     SD  +P  +      R+STR   PP +L+ +HC+      P  S   P + 
Sbjct: 1319 TLLQPPDIVSDQTAPPFNPPSTTLRKSTRIHKPPSYLQAFHCNTASSG-PAHSPSSPATN 1378

Query: 1150 ---------LDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVKFAPWRKAMDDEIAAME 1209
                     L  Y+SY + +  +  F+L+ S   EPT +H+  K   W +AM  E+AA+E
Sbjct: 1379 QGTAPTVFPLSNYISYSQLAPCYHSFVLSASAIREPTSFHEASKDPNWCQAMQTELAALE 1438

Query: 1210 RTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLETFSPV 1269
              +TW+L PLP G+  +G KWV+KVK ++DG+++RYKARLVAKGYNQQEG D+ ETFSPV
Sbjct: 1439 ANHTWSLQPLPPGKVPIGSKWVFKVKLRSDGSLERYKARLVAKGYNQQEGFDYFETFSPV 1498

Query: 1270 AKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGCTPIVC 1329
            AK VTV+ LL++ A  GW+L QLDVNNAFL+G+L EEVYM+LP G ++  + S    IVC
Sbjct: 1499 AKFVTVRSLLAIAAVKGWALYQLDVNNAFLHGELDEEVYMTLPQGLHSKGEPSN---IVC 1558

Query: 1330 KLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHGISFVALLVYVDDILIT 1389
            KL KS+YGLKQASRQWFSKFS+ LL +GF QSK DYSLF R  G SF+ALLVYVDDILI 
Sbjct: 1559 KLTKSLYGLKQASRQWFSKFSATLLNHGFIQSKADYSLFTRQDGSSFIALLVYVDDILIA 1618

Query: 1390 GPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYCLQLIEDSGYLA 1449
                  +  +K  L   F LKDLG  +YFLGLE++RS +GI +SQRKY L+++ED+G L 
Sbjct: 1619 SSDAVAVQRLKLFLDAQFKLKDLGPVRYFLGLEIARSSQGISVSQRKYALEILEDAGLLG 1678

Query: 1450 AKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVSRPDISFTVHNLSQFIA 1509
             KPV  PM  NL+LS    G LL   D + YRRL+GRL+YL ++RPDI F VH LSQF+ 
Sbjct: 1679 CKPVKCPMDQNLKLS-KLEGPLL--PDPTVYRRLIGRLMYLTLTRPDIVFAVHKLSQFME 1738

Query: 1510 KPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPDSRKSVTGFCIFL 1569
             P   H     H+L+Y+KG   QG+   S+ + H+KAF+DSDW  CPD+R+S TG+CIFL
Sbjct: 1739 HPREPHYKAAQHILQYIKGAPSQGMFYPSNSELHIKAFSDSDWAGCPDTRRSTTGYCIFL 1798

Query: 1570 GKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDLQVPLRSPSVVYCDNLA 1629
            G SLVSW+SKKQ TVSRSSAEAEYRA+A+   E+IWL  LL DLQ+   + ++++ D+ A
Sbjct: 1799 GHSLVSWRSKKQNTVSRSSAEAEYRAMASAVCEVIWLRSLLHDLQISHPNAALLFSDSQA 1858

Query: 1630 AIAIANNPTFHERTKHIEIDCHFVRDLIQQRILRLLPIRSNLQLADMFTKPLNAPAIPAT 1635
            AI IA NP FHERTKHIEIDCH VRD IQ+ ++R + + S  Q+AD+ TK L      + 
Sbjct: 1859 AIHIAANPVFHERTKHIEIDCHLVRDKIQEGVIRNIHVPSKHQVADIMTKALGFTLFSSL 1918

BLAST of Lag0024687 vs. ExPASy TrEMBL
Match: A0A2N9IZK3 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57667 PE=4 SV=1)

HSP 1 Score: 1197.6 bits (3097), Expect = 0.0e+00
Identity = 680/1510 (45.03%), Positives = 908/1510 (60.13%), Query Frame = 0

Query: 196  SANSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMVISGKNK 255
            +++S S+  T   IE   +PY+L++       +V  PL G  NY SW  +M   +S KNK
Sbjct: 9    TSSSVSNQGTLVPIENSRSPYYLNNGDHPGIRIVPDPLTGD-NYQSWRTSMTRALSVKNK 68

Query: 256  LGFITGKISKPQEEG-ALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAVWDELQE 315
            LGF+ G I +P ++   +   W+  ND++ SWI N +S++I A+++Y  + K VWD+LQ+
Sbjct: 69   LGFVNGTILQPNDQSDPVFSDWQRCNDLVLSWITNCLSRQIYATVLYAHTAKEVWDDLQQ 128

Query: 316  RFKQASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCG-----GL 375
            R+ Q++G  ++ L++ + +L+Q  +SV  Y+T LK +W +  + RP  SCTCG     GL
Sbjct: 129  RYSQSNGTRVHHLKQAIASLKQEGLSVSDYFTHLKGLWDEFLNYRPIPSCTCGAKCMCGL 188

Query: 376  -KPFLEHLDSEYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQRSAGILG 435
             K  +E+   +YV +FLMGLNET+AA+R QILLM+PLP I + FSL+   E Q+ AGIL 
Sbjct: 189  SKTLIEYQHYDYVHSFLMGLNETFAAVRGQILLMEPLPGINKVFSLIQNHEKQKGAGIL- 248

Query: 436  PSPDPIALAVNDTSKTSDPPRRKENSGQRPVCSHCGIKGHVKDRCYKLHGYPPGYKFRSS 495
              P P+  +  D++  +    RK+    +P+CSHCG KGHV ++CYKLHGYPPG++ +  
Sbjct: 249  --PLPVGFSSVDSTALAS---RKD----KPICSHCGYKGHVAEKCYKLHGYPPGFQRKPR 308

Query: 496  NSP-DNSASAKSVVAANSAAASSCPPATPNFFSSLNNVQYGQLM------ELFNSHLQAA 555
            N+P  N  S    +A+N    S   P+         N+   Q        +   S  QAA
Sbjct: 309  NAPAANQVSCPMTMASNGHDNSQNVPSLAMQCQQFLNMLTAQAQKGPSSSDSHTSPHQAA 368

Query: 556  KTDPITVASA----------VSHATGICHLASS-----------------PITSLNDCWI 615
                +T  SA           S+  GI    S+                  ++     W+
Sbjct: 369  TLITVTQPSAQPSIQAPIQPPSNMAGIPMCLSTFSKPNMAYSVFSNDHFDKVSVSASEWV 428

Query: 616  VDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRVCVEYVGEIHISAALVLRDVLFVPDF 675
            +D+GA+ H+  T   F   + +  +++ LP    V V ++G I ++A+L+L DVL VP F
Sbjct: 429  IDTGATDHMVTTTHYFTTMKLVHNVTVNLPNGQSVNVTHIGSIQLTASLLLTDVLCVPSF 488

Query: 676  AYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTMIGKAECHHGLYILSNISGSSTDSLV 735
             +NL+SVS L  S +  + F + +C IQD     MIG     +GLY+L      S+ S +
Sbjct: 489  DFNLISVSKLTSSLQCCIFFLSTYCFIQDLMQWRMIGMGRQQNGLYMLD----LSSHSKL 548

Query: 736  TSTVVSDIPAVF-------------SVSASIWHSRLGHLSPRRLSMLRDTL-QFKDSCDA 795
            T+ V  ++P  F             S S   WH RLGH S  R++ L   +     SC  
Sbjct: 549  TAAV--NVPDSFHKLLYSFSTIKHSSNSFHTWHCRLGHPSSSRMNFLSTVMPDISHSCKD 608

Query: 796  S--CTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPLNTSTYDGYRYFLTLVDDASRF 855
            +  CTVCPLAKQKR+PF +NNHV+S  FD++H D+WGP +  T +GY+YFLTLVDD +R 
Sbjct: 609  THVCTVCPLAKQKRLPFPNNNHVSSIAFDILHVDIWGPYHVPTVEGYKYFLTLVDDCTRT 668

Query: 856  TWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNAPELKFTDFFASTGTIHQFSCV 915
            TWVYL++ KS+   ++  F  M++TQF   +K  RSDN  E    DF+A+ G IHQ SCV
Sbjct: 669  TWVYLMKSKSETRPLLISFITMIQTQFGSHVKHVRSDNGQEFSMPDFYATQGIIHQHSCV 728

Query: 916  ERPQQNSVVERKHQHLLNVARSLYFQS--------------------------------- 975
            E PQQNSVVERKHQH+LNVARSL FQS                                 
Sbjct: 729  ETPQQNSVVERKHQHILNVARSLCFQSNLPLKFWGHSVLTAVYLINRLPSPILSHKSPYE 788

Query: 976  ------LDYSFLRVFGCLCYASTLTAGRSKFDPRARPCVLLGYPPGMKGYRLYDISKKQV 1035
                    YS LRVFGCLC+ASTL+  R+KFDPRA+PCV LGYP G+KGY+L D++   V
Sbjct: 789  KLLHKAPSYSHLRVFGCLCFASTLSNHRTKFDPRAKPCVFLGYPSGVKGYKLLDLTNHNV 848

Query: 1036 FISRDVTFIEENFPFHSIINNDQLVDVFP---GLVLPLPVFDFVP---SPSVPNNTAATP 1095
             ISRDV F E  FPF     N    D  P    L    P F  +P   + S P N   + 
Sbjct: 849  IISRDVIFHEHVFPF----ANTPSADFSPFDNNLPTSQPNFSDIPLDSTISCPMNQGLSS 908

Query: 1096 SEVGFPPAGLPSDVSAENGDMLLQASPAASDADSPAQSTGVIQP-RRSTRQRHPPDFLKD 1155
             E       + +  SAE        SP     D P  S  V  P RRSTR   PP +L+D
Sbjct: 909  EEPCSVSTPILTSPSAE--------SPTIPHLDVPPCSESVSSPLRRSTRVSKPPTYLQD 968

Query: 1156 YHCHLLRHDVPLLSHDVP-----HSLDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVK 1215
            YHC + +      S         + L   +SY   S  HR F L+V+   EPT + Q  +
Sbjct: 969  YHCKIAQSAPSTSSSSTASTGTLYPLSSSLSYDHLSPSHRTFALSVTAISEPTSFTQANQ 1028

Query: 1216 FAPWRKAMDDEIAAMERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKG 1275
             + WR+AM DE+ A+E  NTW+L  LP G+  +GCKWVYKVK KADG+++RYKARLVAKG
Sbjct: 1029 HSHWRQAMTDELKALEANNTWSLTHLPPGKHPIGCKWVYKVKLKADGSLERYKARLVAKG 1088

Query: 1276 YNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPL 1335
            Y QQEG+D+ ETFSPVAK  TV+ LL++ ++  WSL QLDVNNAFL+GDL EEVYM LP 
Sbjct: 1089 YTQQEGLDYSETFSPVAKFSTVRTLLAIASAQHWSLTQLDVNNAFLHGDLNEEVYMVLPP 1148

Query: 1336 GYYTDRKSSGCTPIVCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHG 1395
            G+     S G T +VCKL KS+YGLKQASRQWF+KFSS L+  GF QSK+DYSLF R  G
Sbjct: 1149 GF----PSKGETNLVCKLQKSLYGLKQASRQWFAKFSSTLIKQGFLQSKSDYSLFTRTQG 1208

Query: 1396 ISFVALLVYVDDILITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLS 1455
             +F+ LLVYVDDILI   + T +  +KD L+  F LKDLG+ KYFLGLE++RS KGI L 
Sbjct: 1209 TTFIGLLVYVDDILIASNNVTAVHTLKDSLHAEFKLKDLGNLKYFLGLEVARSSKGISLC 1268

Query: 1456 QRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVS 1515
            QRKY L ++ DSG L +KPV  PM   L+LS +    L    D S YRRLVGRLLYL V+
Sbjct: 1269 QRKYALDVLSDSGMLGSKPVVTPMEQKLKLSQSDGDAL---SDPSQYRRLVGRLLYLTVT 1328

Query: 1516 RPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWG 1575
            RPDIS++V  LSQF+AKP   HL   + +L+Y+KGT+GQG+   S+ D HLK+F+DSDW 
Sbjct: 1329 RPDISYSVQRLSQFMAKPTTTHLAAAYRVLKYIKGTSGQGLFFPSNTDLHLKSFSDSDWA 1388

Query: 1576 SCPDSRKSVTGFCIFLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDL 1598
            SCPD+R+SVTG+C+FLG SL+SWKSKKQ T+SRSSAEAEYRA+A+   EL+WL  LLK+L
Sbjct: 1389 SCPDTRRSVTGYCVFLGNSLISWKSKKQHTISRSSAEAEYRAMASAVCELMWLLPLLKEL 1448

BLAST of Lag0024687 vs. ExPASy TrEMBL
Match: A0A2N9H2Y3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS34107 PE=4 SV=1)

HSP 1 Score: 1196.8 bits (3095), Expect = 0.0e+00
Identity = 680/1536 (44.27%), Positives = 942/1536 (61.33%), Query Frame = 0

Query: 190  SMAGTGSANSSSSNVTATSIEAQINPYFLHHSFGSSSVLVSQPLLGAVNYTSWSRAMRMV 249
            S + T S N ++SN     IE   +P++L++       +V  PL G  NY +W R+M   
Sbjct: 827  SSSSTSSTNQTTSNSIPIPIENSRSPFYLNNGDNPGIRIVPDPLTGD-NYQAWRRSMTTA 886

Query: 250  ISGKNKLGFITGKISKPQEEG-ALLEAWECNNDIIASWILNSVSKEIAASIVYTGSVKAV 309
            +S KNKLGF+ G I +P +E   L   W+  ND++ SWI N +S++I A+++Y  + K V
Sbjct: 887  LSAKNKLGFVNGAILQPNDESDPLFSDWQRCNDLVLSWITNCLSRQIHATVLYVYTAKEV 946

Query: 310  WDELQERFKQASGPGIYQLRKDLVTLRQGSMSVEVYYTKLKTIWQDLSDLRPTASCTCG- 369
            WD+LQ+R+ Q++G  ++ L++ + +L+Q +M V  Y+T+LK +W +  + RP   CTCG 
Sbjct: 947  WDDLQQRYCQSNGTRVHHLKQAIASLKQDNMPVSDYFTQLKGLWDEFLNYRPIPGCTCGA 1006

Query: 370  ----GL-KPFLEHLDSEYVMTFLMGLNETYAAIRAQILLMKPLPSITEAFSLLIQEEHQR 429
                GL +  +++   +YV +FLMGLN+++A +R QILLM+PLP+I + FSL+  +E QR
Sbjct: 1007 KCICGLSRTLMDYQHYDYVHSFLMGLNDSFAPVRGQILLMEPLPNINKVFSLIQNDEKQR 1066

Query: 430  SAGILG-PSPDPIAL--AVNDTSKTSDPP---------RRKENSGQ--------RP--VC 489
             AG+L  P+ D  AL   + +   T+ P           R +N  Q        +P  +C
Sbjct: 1067 GAGLLPLPTVDSTALLSRLENGPNTAFPYPNTGSNAFFTRTDNQKQHYQYPRKDKPPCIC 1126

Query: 490  SHCGIKGHVKDRCYKLHGYPPGYKFRSSN-SPDNSASAKSVVAANSA-AASSCPPATPNF 549
            SHCG KGH  D+CYKLHGYPPG++ +  N +  N  S+ +V  + SA  A S P  T   
Sbjct: 1127 SHCGYKGHTADKCYKLHGYPPGFRSKGRNVAVANQVSSSAVPHSESADNAQSIPNLT--- 1186

Query: 550  FSSLNNVQYGQLMELFNSHLQAAKTDPIT------VASAVSHATGICHLASSPI------ 609
                 +VQ  QL+ +  +  QA + +P++       A+++S      ++A  P       
Sbjct: 1187 ---AMSVQCQQLLNMLTA--QAQQANPVSDSQNHQAATSISVTQSHSNMAGKPTCLSTFS 1246

Query: 610  -----------------TSLNDCWIVDSGASRHICHTRAAFRNWRRIDPISIVLPTAYRV 669
                             T  +  W++D+GA  H+  T   +     +D IS+ LP    V
Sbjct: 1247 NPNMDHSVFSDKFTVKPTFSSTQWVIDTGAKDHMVITTQFYTTKHIVDNISVNLPNGQSV 1306

Query: 670  CVEYVGEIHISAALVLRDVLFVPDFAYNLMSVSCLLQSGEFSVAFTNNHCLIQDKQLLTM 729
             V ++G + ++  L+L +VL VP F +NL+SVS L  S    + F + +C IQD     M
Sbjct: 1307 MVTHIGSVQLTPTLLLTNVLCVPSFDFNLISVSKLTSSLHCCIFFLSTYCFIQDLMHWRM 1366

Query: 730  IGKAECHHGLYILSNISGSSTDSLVTSTVVSDIPA-VFSVSA--------SIWHSRLGHL 789
            IG    H+GLY+L + S  ST +  T T  S +P  ++S+S+         +WH RLGH 
Sbjct: 1367 IGMGRQHNGLYLLDS-SSDSTTTAATITSDSSLPKHLYSLSSIKNPNKDIHVWHCRLGHP 1426

Query: 790  SPRR---LSMLRDTLQFKDSCDASCTVCPLAKQKRMPFTSNNHVASNVFDLIHADVWGPL 849
            S  R   LS +     +  +  ++CTVCPLAKQ+++PF +NNH++   FDL+H D+WGP 
Sbjct: 1427 SLSRMHFLSSIVPNASYSSNDASTCTVCPLAKQRKLPFPNNNHLSLKSFDLLHIDIWGPY 1486

Query: 850  NTSTYDGYRYFLTLVDDASRFTWVYLLRQKSDVLMIIPRFFKMVETQFSKTIKCFRSDNA 909
            +  T +GYRYFLTLVDD +R TW+YL+R KSD   ++  F  M+ TQF   IK  RSDN 
Sbjct: 1487 HIPTVEGYRYFLTLVDDCTRTTWIYLMRSKSDTSTLLTSFITMIHTQFHTVIKQLRSDNG 1546

Query: 910  PELKFTDFFASTGTIHQFSCVERPQQNSVVERKHQHLLNVARSLYFQS------------ 969
             E    DF+AS G IHQ SCVE PQQNSVVERKHQH+LNVAR+L FQS            
Sbjct: 1547 QEFHMPDFYASKGIIHQHSCVETPQQNSVVERKHQHILNVARALCFQSHLPLKYWGHCIQ 1606

Query: 970  ---------------------------LDYSFLRVFGCLCYASTLTAGRSKFDPRARPCV 1029
                                         Y+ L+VFGCLC+ASTL+  R+KFDPRA+ C 
Sbjct: 1607 TAVYLINRLPCPILSNKSPFEALLHKTPSYTHLKVFGCLCFASTLSGHRTKFDPRAKACA 1666

Query: 1030 LLGYPPGMKGYRLYDISKKQVFISRDVTFIEENFPFHS---IINNDQLVDVFPGLVLPLP 1089
             LGYP G+KGY+L +++  +V ISRDV F E  FPF +   + +    +   P  + P P
Sbjct: 1667 FLGYPSGVKGYKLLELNTHKVLISRDVVFHETIFPFQNQTPLPDFSTFLSCSPEPLSPTP 1726

Query: 1090 VFDFVPSPSVPNNTAATPSEVGFPPA-GLPSDVSAENGDMLLQASPAAS------DADSP 1149
             F   PS  + +  +AT +    PPA  + + +S  +   LL  + ++S      + DSP
Sbjct: 1727 HF-IPPSHLIADMPSATSAPA--PPAPPVSASLSPLDTSSLLDHNSSSSPSLDHIETDSP 1786

Query: 1150 AQSTGVIQP-RRSTRQRHPPDFLKDYHCHLLR-----HDVPLLSHDVPHSLDKYVSYRRF 1209
             QS  V  P RRSTR   PP +L+DYHC L          PL S   P+ L   +SY   
Sbjct: 1787 GQS--VSSPLRRSTRVHKPPTYLQDYHCQLAHCVGSTSSPPLASSGKPYPLSTSLSYDHL 1846

Query: 1210 SNDHRQFILNVSTDFEPTYYHQTVKFAPWRKAMDDEIAAMERTNTWTLVPLPSGRRAVGC 1269
            S  HR F L+V+   EP+++HQ  +   W++AM  E+AA+E  NTWTL PLP G+  +GC
Sbjct: 1847 SPTHRNFALSVTAILEPSFFHQANQSPHWQEAMFAELAALEANNTWTLTPLPLGKHPIGC 1906

Query: 1270 KWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLETFSPVAKIVTVKVLLSLTASFGWS 1329
            KWVYKVK K+DG+++RYKARLVAKGY QQEG+D+ ETFSPVAK  TV+ LL++ +   WS
Sbjct: 1907 KWVYKVKLKSDGSLERYKARLVAKGYTQQEGLDYSETFSPVAKFSTVRTLLAVASVKHWS 1966

Query: 1330 LVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGCTP-IVCKLNKSIYGLKQASRQWFS 1389
            L QLDVNNAFL+GDL EEVYM+LP G+     S G TP +VCKLNKS+YGLKQASRQWF+
Sbjct: 1967 LTQLDVNNAFLHGDLAEEVYMALPPGF----PSKGETPNLVCKLNKSLYGLKQASRQWFA 2026

Query: 1390 KFSSVLLANGFSQSKTDYSLFIRGHGISFVALLVYVDDILITGPSTTEISAVKDILYRHF 1449
            KFSS ++  GF QSK+DYSLF R  G +F+ALLVYVDDILI   ++ ++ ++KD L+  F
Sbjct: 2027 KFSSTIIKQGFVQSKSDYSLFTRTQGTAFIALLVYVDDILI---ASNDMPSLKDSLHAEF 2086

Query: 1450 LLKDLGHAKYFLGLELSRSDKGIYLSQRKYCLQLIEDSGYLAAKPVNHPMIPNLRLSANG 1509
             LKDLG+ KYFLGLE++RS KGI L QRKY L ++ DSG L +KPV  PM  NL++S   
Sbjct: 2087 KLKDLGNLKYFLGLEVARSTKGISLCQRKYALDILSDSGMLGSKPVTTPMEQNLKIS-QS 2146

Query: 1510 SGELLNADDASSYRRLVGRLLYLQVSRPDISFTVHNLSQFIAKPCVRHLDVVFHLLRYLK 1569
            +GE+L  DD S YRRLVGRLLYL V+RPDIS++V  LSQF++KP   HL   + +LRY+K
Sbjct: 2147 TGEIL--DDPSPYRRLVGRLLYLTVTRPDISYSVQKLSQFMSKPTSMHLSAAYRVLRYIK 2206

Query: 1570 GTAGQGILLHSSRDFHLKAFADSDWGSCPDSRKSVTGFCIFLGKSLVSWKSKKQATVSRS 1598
            GT+GQG+   S     LKAF+DSDW  C D+R+S+TG+C+++G SL+SWKSKKQ TVSRS
Sbjct: 2207 GTSGQGLFFPSHSYLQLKAFSDSDWAGCLDTRRSITGYCVYIGDSLISWKSKKQHTVSRS 2266

BLAST of Lag0024687 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 543.9 bits (1400), Expect = 5.9e-154
Identity = 273/568 (48.06%), Positives = 380/568 (66.90%), Query Frame = 0

Query: 1005 SDVSAENGDMLLQASPAA---SDADSPAQSTGVIQPRRSTRQRHPPDFLKDYHCHLLRHD 1064
            SD  A      +   P+A   +D   P+  T       S R+   P +L+DY+CH +   
Sbjct: 3    SDADASTSSSSIDIMPSANIQNDVPEPSVHT-------SHRRTRKPAYLQDYYCHSV--- 62

Query: 1065 VPLLSHDVPHSLDKYVSYRRFSNDHRQFILNVSTDFEPTYYHQTVKFAPWRKAMDDEIAA 1124
              L  HD+     +++SY + S  +  F++ ++   EP+ Y++  +F  W  AMDDEI A
Sbjct: 63   ASLTIHDI----SQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGA 122

Query: 1125 MERTNTWTLVPLPSGRRAVGCKWVYKVKYKADGTVDRYKARLVAKGYNQQEGIDFLETFS 1184
            ME T+TW +  LP  ++ +GCKWVYK+KY +DGT++RYKARLVAKGY QQEGIDF+ETFS
Sbjct: 123  METTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFS 182

Query: 1185 PVAKIVTVKVLLSLTASFGWSLVQLDVNNAFLNGDLFEEVYMSLPLGYYTDRKSSGCTPI 1244
            PV K+ +VK++L+++A + ++L QLD++NAFLNGDL EE+YM LP GY   +  S     
Sbjct: 183  PVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNA 242

Query: 1245 VCKLNKSIYGLKQASRQWFSKFSSVLLANGFSQSKTDYSLFIRGHGISFVALLVYVDDIL 1304
            VC L KSIYGLKQASRQWF KFS  L+  GF QS +D++ F++     F+ +LVYVDDI+
Sbjct: 243  VCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDII 302

Query: 1305 ITGPSTTEISAVKDILYRHFLLKDLGHAKYFLGLELSRSDKGIYLSQRKYCLQLIEDSGY 1364
            I   +   +  +K  L   F L+DLG  KYFLGLE++RS  GI + QRKY L L++++G 
Sbjct: 303  ICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGL 362

Query: 1365 LAAKPVNHPMIPNLRLSANGSGELLNADDASSYRRLVGRLLYLQVSRPDISFTVHNLSQF 1424
            L  KP + PM P++  SA+  G+ +   DA +YRRL+GRL+YLQ++R DISF V+ LSQF
Sbjct: 363  LGCKPSSVPMDPSVTFSAHSGGDFV---DAKAYRRLIGRLMYLQITRLDISFAVNKLSQF 422

Query: 1425 IAKPCVRHLDVVFHLLRYLKGTAGQGILLHSSRDFHLKAFADSDWGSCPDSRKSVTGFCI 1484
               P + H   V  +L Y+KGT GQG+   S  +  L+ F+D+ + SC D+R+S  G+C+
Sbjct: 423  SEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCM 482

Query: 1485 FLGKSLVSWKSKKQATVSRSSAEAEYRALATVSSELIWLSHLLKDLQVPLRSPSVVYCDN 1544
            FLG SL+SWKSKKQ  VS+SSAEAEYRAL+  + E++WL+   ++LQ+PL  P++++CDN
Sbjct: 483  FLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDN 542

Query: 1545 LAAIAIANNPTFHERTKHIEIDCHFVRD 1570
             AAI IA N  FHERTKHIE DCH VR+
Sbjct: 543  TAAIHIATNAVFHERTKHIESDCHSVRE 553

BLAST of Lag0024687 vs. TAIR 10
Match: AT3G54070.1 (Ankyrin repeat family protein )

HSP 1 Score: 353.2 bits (905), Expect = 1.5e-96
Identity = 218/579 (37.65%), Positives = 325/579 (56.13%), Query Frame = 0

Query: 1601 PATKIFLHQYALKGEWEYVELLMDECPHYVRSTITRNKETILHIAAGAKQTEFVEKLLHR 1660
            P ++  +++  L G+W+    L+      V   IT N E  LHIA  AK  +FV  LL  
Sbjct: 48   PHSRNLMYKAVLTGDWKTASTLISRKECNVVEQITGNSEIALHIAVAAKHKDFVRNLLRE 107

Query: 1661 MTSADMTLQNKYGNTALCFAAASGVVRIAQLMVQKNKHLPLIRGFNNIVTPLFIAVSYKC 1720
            M   D++L+NK GNT L FAAA G +  A++++   + LP I      +TP+ IA  Y  
Sbjct: 108  MDPPDLSLKNKDGNTPLSFAAALGDIETAEMLINMIRDLPDISN-EKTMTPIHIAALYGH 167

Query: 1721 IEMVSYLLSITDLDQLNNQEQIEL----LIATIHGNFFDISIWILQRYPYLATMKDM--N 1780
             EMV YL S T +  LN+Q+ + L    + A I+G F D+ +W+L+R         +  N
Sbjct: 168  GEMVQYLFSKTSIKDLNDQQYLNLFHTMISADIYGVFADVPLWMLERVDLYRKELALYPN 227

Query: 1781 EETALHVMARKPSAMDVTKQQSIWEKYINSWIYGKAMTKTLAHELVVLLWTNVLRNLPEK 1840
               ALH++ARK SA+    Q +++++  +SW                             
Sbjct: 228  SNKALHLLARKTSAISHKSQLNLFQQVASSW----------------------------- 287

Query: 1841 EMLQFINHPTRLLNEAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAIENRLENVFN 1900
                       LL +AA  GNVE L++LIR + D++W  D++ +++FHVA   R EN+F+
Sbjct: 288  -----------LLFDAAELGNVEILVILIRSHLDLLWIVDNNNRTLFHVAALYRHENIFS 347

Query: 1901 LINEIGRLNEFTAKYRTFKGRNYNILHLAGNLAAPNHLNRVSGAALQMQRELLWFKEVEK 1960
            LI E+G + +  A Y+  + ++  +LHL   L   N     SGAAL MQ+ELLWFK V++
Sbjct: 348  LIYELGGIKDLIASYKEKQSKD-TLLHLVARLPPMNRQQVGSGAALHMQKELLWFKAVKE 407

Query: 1961 IVLPSQLEAKSNVLSYQHKPKSNYPNVPKLTPRQLFTQEHKDLRKDGEEWMKNTANSCML 2020
            IV  S +E K+      H                +FT++H++LRK+GE WMK TA +CML
Sbjct: 408  IVPRSYIETKNTKGELAH---------------DIFTEQHENLRKEGERWMKETATACML 467

Query: 2021 VATLISTVVFAAAFTVPGGSD------NNAGTPIFQHKFWFTVFLMSDAVALFASSTSIL 2080
             ATLI+TVVFAAA T+PGG+D      N  G P F+ +  F +F +SD+VALF+S  SI+
Sbjct: 468  GATLIATVVFAAAITIPGGNDDSGDKANTLGFPNFRKRLLFDIFTLSDSVALFSSMMSIV 527

Query: 2081 MFMSILTSRYAEEDFVHSLPSRLLFGLAALFISIVCMVVAFSATFFLL-YHKANICIPTV 2140
            +F+SI TSRYAEEDF + LP++L+FGL+ALFISI+ M++AF+ +  L+   KA++ +  +
Sbjct: 528  IFLSIFTSRYAEEDFRYDLPTKLMFGLSALFISIISMILAFTFSMILIRVEKASLSL-VL 568

Query: 2141 VAAMAIVPVSCFCALQFKLWVDIFHNTYSSRFLFKPRQR 2167
            ++ +A +    F  L F LW +   + Y S FLF  R+R
Sbjct: 588  ISCLASLTALTFAYLYFHLWFNTLRSVYISMFLFLGRKR 568

BLAST of Lag0024687 vs. TAIR 10
Match: AT3G18670.1 (Ankyrin repeat family protein )

HSP 1 Score: 302.0 bits (772), Expect = 3.9e-81
Identity = 200/577 (34.66%), Positives = 313/577 (54.25%), Query Frame = 0

Query: 1602 ATKIFLHQYALKGEWEYVELLMDECPHYVRSTITRNKETILHIAAGAKQTEFVEKLLHRM 1661
            +T + L +    GE E  +  +D  P  + + +T N +T +H A  +   + VE+++ R+
Sbjct: 48   STYLVLFKNIDSGELEATKDFLDRNPEALTAILTSNGDTPIHKAVLSGHIKIVEEIIRRI 107

Query: 1662 TSADMTL--QNKYGNTALCFAAASGVVRIAQLMVQKNKHLPLIRGFNNIVTPLFIAVSYK 1721
               +  L  +N  G TAL +AA  G+VRIA+ +V K   L  +R     + P+ +A  Y 
Sbjct: 108  HDPEQVLKIKNDNGYTALTYAATGGIVRIAECLVNKCPGLVSVRNAKEHI-PIVVASLYG 167

Query: 1722 CIEMVSYLLS---ITDLDQLNNQEQIE------LLIATIHGNFFDISIWILQRYPYLATM 1781
               +V YL S   ++DLD  ++ ++ +      L+   I    + I++ ++QRYP LA  
Sbjct: 168  HKHLVQYLYSHTPLSDLDPCDDSDEHKGKNGAMLVTNCIVDGLYCIALDLIQRYPKLAYT 227

Query: 1782 KDMNEETALHVMARKPSAMDVTKQQSIWEKYINSWIYGKAMTKTLAHELVVLLWTNVLRN 1841
            +D + +TA+  +A+ P A     +           I  +     L H     +   + + 
Sbjct: 228  RDSDNDTAIMALAQTPYAFPSVPR-----------IIRRVYKLKLGHAQAKEILDCICQE 287

Query: 1842 LPEKEMLQFINHP-TRLLNEAACTGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAIENRL 1901
            +P+ +  Q  N    + L +A   G VE++  ++R YPDI+W  +  G +IF  A+  R 
Sbjct: 288  IPKFDAAQQKNAGLNQALFKAVENGIVEYIEEMMRHYPDIVWSKNSSGLNIFFYAVSQRQ 347

Query: 1902 ENVFNLINEIG-RLNEFTAKYRTFKGRNYNILHLAGNLAAPNHLNRVSGAALQMQRELLW 1961
            E +F+LI  IG + N     +  F   + N+LH A   A  + LN + GAALQMQREL W
Sbjct: 348  EKIFSLIYNIGAKKNILATNWDIF---HNNMLHHAAYRAPASRLNLIPGAALQMQRELQW 407

Query: 1962 FKEVEKIVLPSQLEAKSNVLSYQHKPKSNYPNVPKLTPRQLFTQEHKDLRKDGEEWMKNT 2021
            FKEVEK+V P            +H+   N     K TP+ LFT +HKDL + GE+WMK T
Sbjct: 408  FKEVEKLVQP------------KHRKMVNLKQ--KKTPKALFTDQHKDLVEQGEKWMKET 467

Query: 2022 ANSCMLVATLISTVVFAAAFTVPGGSDNNAGTPIFQHKFWFTVFLMSDAVALFASSTSIL 2081
            A SC +VA LI+T++F++AFTVPGG  ++ G P++ H+  F +FL+SDA++LF S  S+L
Sbjct: 468  ATSCTVVAALITTMMFSSAFTVPGGYRSD-GMPLYIHQHRFKIFLISDAISLFTSCMSLL 527

Query: 2082 MFMSILTSRYAEEDFVHSLPSRLLFGLAALFISIVCMVVAFSATFFLLYHKANICIPTVV 2141
            MF+ IL SRY EEDF+ SLP++L+ GL ALF+S+  M+V F  T   L  +    +    
Sbjct: 528  MFLGILKSRYREEDFLRSLPTKLIVGLLALFLSMATMIVTFVVTLMTLVGEKISWVSAQF 587

Query: 2142 AAMAIVPVSCFCALQFKLWVDIFHNTYSSRFLFKPRQ 2166
              +A++P+  F  LQF + ++IF  TY      KPR+
Sbjct: 588  MFLAVIPLGMFVVLQFPVLLEIFRATYCPNVFDKPRR 594

BLAST of Lag0024687 vs. TAIR 10
Match: AT5G35810.1 (Ankyrin repeat family protein )

HSP 1 Score: 297.7 bits (761), Expect = 7.4e-80
Identity = 166/362 (45.86%), Positives = 234/362 (64.64%), Query Frame = 0

Query: 1813 KTLAHELVVLLWTNVLRNLPEKEMLQFINHPTRLLNEAACTGNVEFLIVLIRKYPDIIWE 1872
            +TLAH +V  LW+ V++ LP +E+ QF+     LL +AA +GN+E L++LIR YPD+IW 
Sbjct: 2    RTLAHMVVEELWSFVIK-LPVEEISQFVGSSPMLLFDAAQSGNLELLLILIRSYPDLIWT 61

Query: 1873 DDDDGKSIFHVAIENRLENVFNLINEIGRLNEFTAKYRTFKGRNYNILHLAGNLAAPNHL 1932
             D   +S+FH+A  NR E +FN I E+G + +  A Y+  K  N N+LHL   L  PN L
Sbjct: 62   VDHKNQSLFHIAAINRHEKIFNRIYELGAIKDLIAMYKE-KESNDNLLHLVARLPPPNRL 121

Query: 1933 NRVSGAALQMQRELLWFKEVEKIVLPSQLEAKSNVLSYQHKPKSNYPNVPKLTPRQLFTQ 1992
              VSGAALQMQRE+LW+K V++IV    ++ K+      H                LFT+
Sbjct: 122  QVVSGAALQMQREILWYKAVKEIVPRVYIKTKNKKEEVAH---------------DLFTK 181

Query: 1993 EHKDLRKDGEEWMKNTANSCMLVATLISTVVFAAAFTVPGGSDNNA-----GTPIFQHKF 2052
            EH +LRK+GE+WMK TA +C+LV+TLI+TVVFAAAFT+PGG+D +      G P F+ +F
Sbjct: 182  EHDNLRKEGEKWMKETATACILVSTLIATVVFAAAFTLPGGNDTSGDIKTLGFPTFRKEF 241

Query: 2053 WFTVFLMSDAVALFASSTSILMFMSILTSRYAEEDFVHSLPSRLLFGLAALFISIVCMVV 2112
            WF VF++SD+VAL +S TSI++F+SILTSRYAE  F  +LP++L+ GL ALF+SI+ MV+
Sbjct: 242  WFEVFIISDSVALLSSVTSIMIFLSILTSRYAEASFQTTLPTKLMLGLLALFVSIISMVL 301

Query: 2113 AFSATFFLLYHKANICIPTVVAAMAIVPVSCFCALQFKLWVDIFHNTYSSRFLFKPRQRK 2170
            AF+AT  L+  +       ++  +A      F  L F+LW D   + Y S+FLF  R+  
Sbjct: 302  AFTATLILIRDQEPKWSLILLVYVASATALSFVVLHFQLWFDTLRSAYLSKFLFHGRKSG 346

BLAST of Lag0024687 vs. TAIR 10
Match: AT5G04690.1 (Ankyrin repeat family protein )

HSP 1 Score: 248.1 bits (632), Expect = 6.7e-65
Identity = 191/577 (33.10%), Positives = 299/577 (51.82%), Query Frame = 0

Query: 1605 IFLHQYALKGEWEYVELLMDECPHYVRSTITRNKETILHIAAGAKQTEFVEKLLHRMTSA 1664
            I L+Q   +G  E V+  ++  P  V   I    ET L  A      E V+ LL RMT  
Sbjct: 77   IQLNQGISQGRVEAVKDFLNRRPDAVDKYI-NPYETPLLKACAYGNPEIVKLLLRRMTPE 136

Query: 1665 DM---TLQNKYGNTALCFAAASGVVRIAQLMVQKNKHLPLIRGFNNIVTPLFIAVSYKCI 1724
             M     QN + NT L   A SG + IA+ +V KN  L  I G NN   P+ +AV    +
Sbjct: 137  QMLPKMSQNNFYNTPLTVVAVSGNMEIAEALVAKNPKLLEIPG-NNGEIPVVVAVENTQM 196

Query: 1725 EMVSYLLSITDLDQLNNQE---QIELLIATIHGNFFDISIWILQRYPYLATMKDMN-EET 1784
            EM  YL + T +  L  ++    I L +  I+    D+++ +  +   LA  K +  E  
Sbjct: 197  EMARYLYNRTPVQVLLEKDGFHGILLFLNAIYYKKLDMALDLFNKSRRLAVTKHLRIESV 256

Query: 1785 ALHVMARKPSAMDVTKQQSIWEKYINSWIYGKAMTKTLAHELVVLLWTNVLRNLPEKEML 1844
             + V+A KP     T    +  K ++  I    + +    +++ L    +L+ + E+ + 
Sbjct: 257  PIIVLASKPDLFPDTLMGKVL-KCLSKCI---GIDEVYRLKVMHLQAKKLLKGISEETLA 316

Query: 1845 QFINHPTRLLNEAAC----TGNVEFLIVLIRKYPDIIWEDDDDGKSIFHVAIENRLENVF 1904
              +   +  ++EA       GNV+FL+ +I+   +++W       ++F+ A++ R E VF
Sbjct: 317  LGLKERSESVDEALLFAVRYGNVDFLVEMIKNNSELLWSTGT--STLFNTAVQVRQEKVF 376

Query: 1905 NLINEIGRLNEFTAKYRTFKGRNYNILHLAGNLAAPNHLNRVSGAALQMQRELLWFKEVE 1964
            +L+  +G         +   G   ++LHLAG       L  V  A LQMQREL WFKE+E
Sbjct: 377  SLLYGLGDRKYLFLADKDSDGN--SVLHLAGYPPPNYKLATVVSATLQMQRELQWFKEME 436

Query: 1965 KIVLPSQLEAKSNVLSYQHKPKSNYPNVPKLTPRQLFTQEHKDLRKDGEEWMKNTANSCM 2024
            +IV   + E                 N   LTP ++F +EH+ +R + E+WMK+TA SC 
Sbjct: 437  RIVPAIENER---------------VNTENLTPIEIFRKEHEAMRLEAEKWMKDTAMSCS 496

Query: 2025 LVATLISTVVFAAAFTVPGGSDNNA-GTPIFQHKFWFTVFLMSDAVALFASSTSILMFMS 2084
            LVA LI TV FAA FTVPGG+D+N+ G P  +H+  F +F++SD ++ FA+ TS+L+F+ 
Sbjct: 497  LVAALIVTVTFAAIFTVPGGTDDNSGGRPFHRHERIFVIFIVSDLISCFAACTSVLIFLG 556

Query: 2085 ILTSRYAEEDFVHSLPSRLLFGLAALFISIVCMVVAFSATFFLLYHKANICIPTVVAAMA 2144
            ILT+RYA +DF+ SLP+ ++ GL+ LF+SI  M+VAFS+  F +++   I  PT+    A
Sbjct: 557  ILTARYAFDDFLFSLPANMIAGLSTLFVSIAAMLVAFSSALFTIFNDPWIVAPTIF--FA 616

Query: 2145 IVPVSCFCALQFKLWVDIFHNTYSSRFLFKPRQRKLF 2170
              P   F  +Q+ L  ++  +TY  R +F    + LF
Sbjct: 617  CFPALLFVMIQYPLLKELIFSTYGKR-IFDRNMKSLF 625

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
KZV25004.1	0.0e+00	44.85	Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum]
RVW82526.1	0.0e+00	44.48	Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
KAG7578768.1	0.0e+00	41.91	GAG-pre-integrase domain [Arabidopsis thaliana x Arabidopsis arenosa]
KAG7588551.1	0.0e+00	41.13	Ribonuclease H domain [Arabidopsis suecica]
KAG7574150.1	0.0e+00	41.45	Integrase catalytic core [Arabidopsis suecica]

Match Name	E-value	Identity (%)	Description
Q94HW2	9.1e-168	30.35	Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94	9.5e-157	29.31	Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978	1.1e-139	32.07	Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146	8.4e-113	27.91	Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519	2.3e-49	43.67	Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...

Match Name	E-value	Identity (%)	Description
A0A2Z7AT15	0.0e+00	44.85	Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Dorcoceras hygrometricum O...
A0A2N9EHN7	0.0e+00	43.90	Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...
A0A2N9GZW3	0.0e+00	43.76	Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...
A0A2N9IZK3	0.0e+00	45.03	Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57667 PE=4 SV=1
A0A2N9H2Y3	0.0e+00	44.27	Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB...

Match Name	E-value	Identity (%)	Description
AT4G23160.1	5.9e-154	48.06	cysteine-rich RLK (RECEPTOR-like protein kinase) 8
AT3G54070.1	1.5e-96	37.65	Ankyrin repeat family protein
AT3G18670.1	3.9e-81	34.66	Ankyrin repeat family protein
AT5G35810.1	7.4e-80	45.86	Ankyrin repeat family protein
AT5G04690.1	6.7e-65	33.10	Ankyrin repeat family protein
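The Identity percentages reported for each match follow directly from the counts in the corresponding HSP statistics line: identical positions divided by alignment length. The snippet below is a minimal sketch of that arithmetic, not part of the database's own tooling; the function name `hsp_percentages` and the regular expression are illustrative assumptions, and the pattern accepts both the "Positives" spelling and the "Postives" spelling that can appear in exported output.

```python
import re

# Hypothetical helper: matches "Identity = 273/568" and "Positives = 380/568"
# (also tolerates the misspelling "Postives").
HSP_STATS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Recompute percent identity / positives from an HSP statistics line."""
    result = {}
    for label, matches, aligned in HSP_STATS.findall(line):
        key = "identity" if label == "Identity" else "positives"
        # Percentage of aligned positions that match, rounded to 2 decimals.
        result[key] = round(100.0 * int(matches) / int(aligned), 2)
    return result


if __name__ == "__main__":
    line = "Identity = 273/568 (48.06%), Positives = 380/568 (66.90%), Query Frame = 0"
    print(hsp_percentages(line))  # {'identity': 48.06, 'positives': 66.9}
```

Applied to the AT4G23160.1 hit, 273 identical residues over 568 aligned positions gives 48.06%, which agrees with the Identity value listed for that match.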
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 2166..2169
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1020..1046
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 426..459
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1023..1039
None	No IPR available	PANTHER	PTHR24177	CASKIN	coord: 1622..2163
None	No IPR available	PANTHER	PTHR24177:SF303	PROTEIN ACCELERATED CELL DEATH 6-LIKE	coord: 1622..2163
None	No IPR available	CDD	cd09272	RNase_HI_RT_Ty1	coord: 1459..1598; e-value: 2.16274E-75; score: 244.685
IPR002110	Ankyrin repeat	SMART	SM00248	ANK_2a	coord: 1637..1666; e-value: 7.7; score: 15.5
IPR002110	Ankyrin repeat	SMART	SM00248	ANK_2a	coord: 1672..1701; e-value: 4.2; score: 16.4
IPR002110	Ankyrin repeat	SMART	SM00248	ANK_2a	coord: 1876..1907; e-value: 340.0; score: 8.5
IPR002110	Ankyrin repeat	SMART	SM00248	ANK_2a	coord: 1707..1735; e-value: 36.0; score: 13.2
IPR013103	Reverse transcriptase, RNA-dependent DNA polymerase	PFAM	PF07727	RVT_2	coord: 1126..1372; e-value: 5.1E-68; score: 229.3
IPR036770	Ankyrin repeat-containing domain superfamily	GENE3D	1.25.40.20	-	coord: 1603..1917; e-value: 3.6E-25; score: 90.6
IPR036770	Ankyrin repeat-containing domain superfamily	SUPERFAMILY	48403	Ankyrin repeat	coord: 1607..1900
IPR005162	Retrotransposon gag domain	PFAM	PF03732	Retrotrans_gag	coord: 281..390; e-value: 6.0E-12; score: 45.7
IPR025724	GAG-pre-integrase domain	PFAM	PF13976	gag_pre-integrs	coord: 678..750; e-value: 3.4E-8; score: 33.2
IPR020683	Ankyrin repeat-containing domain	PFAM	PF12796	Ank_2	coord: 1607..1696; e-value: 8.6E-8; score: 32.7
IPR029472	Retrotransposon Copia-like, N-terminal	PFAM	PF14244	Retrotran_gag_3	coord: 219..266; e-value: 4.2E-14; score: 52.1
IPR026961	PGG domain	PFAM	PF13962	PGG	coord: 2002..2114; e-value: 2.7E-28; score: 98.1
IPR036397	Ribonuclease H superfamily	GENE3D	3.30.420.10	-	coord: 757..901; e-value: 1.0E-26; score: 95.4
IPR001584	Integrase, catalytic core	PROSITE	PS50994	INTEGRASE	coord: 750..932; score: 12.548403
IPR043502	DNA/RNA polymerase superfamily	SUPERFAMILY	56672	DNA/RNA polymerases	coord: 1126..1566
IPR012337	Ribonuclease H-like superfamily	SUPERFAMILY	53098	Ribonuclease H-like	coord: 760..887

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Lag0024687.1	Lag0024687.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0015074	DNA integration
molecular_function	GO:0003676	nucleic acid binding
molecular_function	GO:0005515	protein binding