Lag0011650 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0011650
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionIntegrase catalytic domain-containing protein
Locationchr1: 29971063 .. 29976592 (-)
RNA-Seq ExpressionLag0011650
SyntenyLag0011650
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGTTTTGGAGTCATCTCGGTGTCTTGGACGGCGAGGAGGTGAAAAGACCCAATTGTGCTGAAGTGAACCTGAAGAATGACACCACACGCCGAAGATTGAAGCCCCGTACCGAGAAGAAACGAGACAAGACCGAGAAAATAAATAGTAAGATAAGGATCAGGGACGACGTCTCGACGTCGACCCGATTTTCATCTGGAATGTAAAGTTCGTTGCGTCAAGATGCTGCCCTTGGCTTCTCGACGCCAACAAGTTTTCTATAAATACGTAGCCTCGGTTTAGTCCGCCATTAGGTTAGAAATTTGGTCATTTTACAATTTATTTTGTCTCTTTTACGTTTCAGTACTATTGTAGACCCTTTTTAGATCGTATTTCATGTCTATTTATGTTTTTACTTCATCTATGAATAGCTAATCGTATTTAGATCGTTGGATTGAGATTATGCTTTGATTTCTAATAAGATTGGCTGAGTTTTCTGGAGTTTACTTATTGTTCTTGCTGTCTTTTCATTCAGTTTTGCATGTTATTCCGTTTAAACCAATACATGAGGCTAGTCTATTAGGATTGACGACCCTATGTGATTACTGCTTGTGTATTCAATCAGTTATTTTTAGATAGCCACTAGGTTGAGGCAAGCCACTCTCTACGCGGGGATGTCAATTTGGATTGTCCATAGAGGATCCAAATTCTCACATTAAATCCTTTCTAGTTATTTGTAGGACGGTAAAAATTAATGGTGTTACTGAGGATGTCATTCGCTTACGCTTATTTCCTTTTTCATTGCAGGATAAAGCTCGTGATTGGTTGCAGTCTATTCCTCCTGGAAGCATCACCACCTGGGATGCTTTGGTCCAGGAATTCTTGAAGAAATTCTTCCCTCCTGCAAAGACAGTCAAGCTGAAAACCGAGATTTGAACATTCCAGCAACAGTTTGATGAACAATTGTTTGAGGCTTGGGAACGATACAAAGAGCTACTGAGGAAGTGCCTCCATCATGGATATCCTGATTGGTTGCAAATTCAACTTTTGTATAATGGTTTAAATCCAAGTACTAAGACTATTGTTGATGCGGCTGCAGATGGGACTCTGTTGTCCAAGGCTATGGAGATGCACGTACATTACTGGAGGATATGGCCACAAACAGCTATCAGTGGCCATCTGAGCAGTCTACACCAAAGAAGGTTGCAGCTGGGGTGTTCGAGATCGATAATGTAAGTGCCCTCCAAGCTCAAATGTCATCCCTTGCTAATGCTTTATTGAAGTTTTCAGGTACAAGGAGTGCACAATTTATCGAGTCAGCAGTTGCCATTGCATCTCACTCTCAAGAAGAGACCATAGAACATGTTCAATATGTGTCAAATTCCAATTTTAGGGGATATAATAACTCTACACCTACACACTATCACCCTAACAATAGGAACCATGAAAATTTTTCTTATGTAATAATAAGAATGTGTTAAATCTCCTAGTTTTGCCCCTCAAAAGCAAGAAAATAAACAATCTCTTGAGGATCTTGTTGGAGTTTTTATTGCAGAGTCAAGTAATAGGACCAATAAGCTTGAGGAGGCAATGATCGCCATAACACCACAGTGTTTGGCCACAGTACAACGATAAAAAATATTGAAACTCGCTGGGGCAACTTGTAAATGTTGTCAACACTATGCAGAAAGGTAAGGCCTCAGCTGAGCAGGAGAGACCCCAAATGGAGTACTGTAAGGCCATCACTGTGCACTAGGAGGAGGAGATTGAAAAAGCTTAGGAACGTGAGACTGATGAGTATGATACTCCCACTAGAGAAGCTGAGGAGGGCATATCCTCAGACGAAGCTGCAAAGCTTGACCCAAGCCCCTATCCCTTCTCCTACTATTTTGGTTCCTAAAAGGAAGAAAAAGAAGAAGAAAACAATCAGGTTTAGTTTGAAAAGTTTATGAATGCTTTTATGAATTTAAATATTAACATACCTTTTGCAGAAGCATAAGAGATGCCTTAGTACGACAAATTCATGAAGGAATGACTTTCAAAGAAGAAGAAGGAAAAGCAGGTAAATACGGTGTACCTCGCCTCGACATGCAGTGCCCGCGTACAACAGAAAGTACCTGAGAGATTAGTAGATCCAGGGAGTTTTTCTGTTCCTTGTAGCTTTGGTACTTATTCCTTTCGTGCACTATGTGATTTAGGTGCTAGTATCAATATTATTCCTTTATCTTTATGTAAGAAGTTGAATATAGGAGATATTAAATCTACCCCTGTTAAACTGCAATTAGTTGATCAATCTGTGGTTAGTCCACATATGGAGTTATTGAAAATGTCTTGATTAAAGTAGCTAGATTTTTTTATTCATGTTGACTTCTTTGTTATGGATATGATGGAAAATCCTTCAGTACCTGTTATTTTAGAGAGACCATTCCTCACCACCGGGCGAGTTATTATTAGCATTGAGCGCAAGGAGCTCACTGTTAGAGTCTAGCACAAAAAGGAAGTATTCAAAGCTTTTGTGGACTCTAAAGATCAGTCTGAGGCGCTCGTGATGGGTTACANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCAGGTGCAATAGTAATCTGGTTATACCCAGAATAACCATCCAAGAAACAGTAATAGGCCTGACCAGCCAATCGATCCAACATCTGGTCGATAAATGGTAGAGGGAAATGATCCTCACATTTATCAACTTCGCAAAGTAATGGTCCTCACATTTATCAACTTCGCAAAGACCTAGTCACCACAGTTCAAGGTACTATGTCAATTGAGGCCTATTATACCAAATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGAAGACAATTTGGCAAGATTTGGTTGATTATCGTCCTACATACGATTGTTCCTGTGGAGGAATTAAGCCAATCATACAACACATGGAGTCTGAGTTCGTGATGATCTTCTTGATGGGACTCAACGATTCATACTCCTCCGTTCGCGCCCAGATTCTTTTAATGAATCCTATTCCTGACATTACTAAGGTGTTTTCACTGGTTATACAAGAAGAGCGTCAGAGAATTGCTGGTAATCTTGTGCCATCTACTTCCTCTGATCAGATTACTCTTCTTGCTGCTGAAGCCTCCAAGAAACAAAATAATAATCGCTTTAGGAGAAATGATAATCAAAGGCCTGTTTGTTCTCATTGCAATGTCAAAGGTCATACAGTGGATAAATGCTACAAAATACATGGTTATCCACCTGGCTACCGGTCTCGGAATACTAAAGCTTCGTCTACTAAGGCTGTTGAAGCAAACGCTGTTACTCAGCCTCAGTCAAATTTTTTCTCAAGCCTCAACCAAACTCAATACAGTCAGCTGATCGAAATGCTTAACACTCATCTTCAGCAGCCAAAACCGAGCCTATAAACGCGATCTCTTCAACTGCACACATTGCAGGTATTTGCTCCTCTGCTGTGCATAATTCTAGTGCTTGGATTTTAGATTCTGGTGCAGCTCGACACATATGTCATCAGTTTTCTTTGTTTCAAAATTGGCGTCGGGTTTATGGAATTACTGTTGTTCTTCCTACTACCTATCGTATGAGTGTTGAGTTTATGGGAGATATTCAGGTTTTGAGTTCTTGTTGCTGCGGGATGTTCTGTTTGTTCCCCAATTTAGCTATAACCTAATATCCGTAAGTTGTTTGCTAAAGGAAAGAGGCTTGGATATTAACTTTTCTGGATCTTCTTGTACCATACAGGACAAGAATCGCTTGATGATGATTGGCAGGGCTGAGTCTTCTAATGGCCTTTATATTTTGCTTCCTCCAGATAAGCCTTGTTTACTTTCTGAAACTATATGTTCTGTTAGTATGGTACTTGGCATGATCGTCTTGGACACATCTCACCTCAGAGGTTGTCTTTGCTGAAAAATACTTTACATTTTTCTCATTTGTCTACTGATTGTAATTCATGCAATATTTGCCCTTTAGCTAAGCAACGGAGGCTTGCTTTCCCTTTTAACAATCATGTCGCATCTGATATTTTTGATGTTGTCCATTGTGATGTTTGGGGACCCTTTAGAACCCCCACTTATGCTGGTTACAAATATTTTTTGACACTAGTCGATGACTGTTCGAGGTATACATGGACTTTCTTAATGCATTCCAAATCTGATGCTATCCATATTATACCTCGTTTCTTCCAGCTTGTCCTTACCCAATTTAATAAAACCATTAAGGTTTTTCGTTCAGACAATGCCCCTAAGCTTCAGTTTAAGGAATTCTTTGCTACAAAGGGAACGGTTCATCAATTCTCTTGCATTGAAACTCCCCAACAAAATTCTGTGGCTGAAAGGAAACACCAGCACCTCCTTAACGTTGCCAGAGCTTTACTCTTTCAGTCCAAAGTTCCCCTCAGATTCTGGGGAGATTGTGTGTTAACAGCCACATACCTCATCAATCGGATTCCAGCTCCCTTGTTAAAGCATAAGACTCCTTTTGAACTCTTGCACAAACGATCTGTTGATTACTCTGGACTCCGAGTCTTTGGTTGTCTATGTTATGCTTCTACGCTTGCTAACAACCGTTCAAAGTTTGATCCTCGTGCCAAACCTTGTGTGTTTCTTGGCTATTCGCCTGGTGTTAAAGGGTATCGATTGTCTGATATCGTTAGGAGACAACTTATCATATCTCGGGACGTTGTTTTCTTCGAAAATAAGTTTCCTTTTCATTCAACTGATGTCTCCGCTGAGGCCATAGATACTTTATTCTCTGATCATGTTTTGCCATGCTCAATCGTGGATCCAGTTGCTTTACATGAAGCAAATAATCTTTTGGATTCTCAAGAACATCAGGAGCCTTTGATATTTCCAGGTTCATCTACTGATTTTGTTGCAGCACAACCTGATGCTCAAACTGATGTTCCTAATCCTGCACAGGATTCTGTTGTACAACCTGATTTGGTTGATTTGGTTGATCCTGAGGTTGTTAATCAGCCATCTACTCATGTTTCTTTGAGGCGGTCCACTCGTCCCCATGTCAAGCCAAGCTTTCTCAATCAGTACCATTGCAACTCAGCTTGCTTATATCCTATTGATGATTATTTGTCCTATGATCATTTTTCTACAACACATAAGCATTTCATTCTGAATGTGTCTGCTGCTTATGAGCCATCTTACTTCCATCAAGCTATTAAATTTGATCATTGGAAGGAGGCTATGGACTCTGAAATTCGTGCAATGGAACGTACTTCTACGTGGACTATTGTTCCTTTACCTCCTGGCAAGCACATTGTTGGATGTAAATGGGTCTATCGGAATAAATATAAAACTGATGGTACCGTAGACCGTTATAAGGCCCGGCTCGTTGCCAAGGGTTACAGTCAACAAGAGGGCATTGATTTTTTTATACTTTTTCCCCTGTAG

mRNA sequence

ATGGAGTTTTGGAGTCATCTCGGTGTCTTGGACGGCGAGGAGGTGAAAAGACCCAATTGTGCTGAAGTGAACCTGAAGAATGACACCACACGCCGAAGATTGAAGCCCCGTACCGAGAAGAAACGAGACAAGACCGAGAAAATAAATAGTAAGATAAGGATCAGGGACGACGTCTCGACGTCGACCCGATTTTCATCTGGAATATGGGACTCTGTTGTCCAAGGCTATGGAGATGCACGTACATTACTGGAGGATATGGCCACAAACAGCTATCAGTGGCCATCTGAGCAGTCTACACCAAAGAAGGTTGCAGCTGGGGTGTTCGAGATCGATAATGTAAGTGCCCTCCAAGCTCAAATGTCATCCCTTGCTAATGCTTTATTGAAGTTTTCAGAGTCAAGTAATAGGACCAATAAGCTTGAGGAGGCAATGATCGCCATAACACCACAGTGTTTGGCCACAGTAAGGCCTCAGCTGAGCAGGAGAGACCCCAAATGGAGTACTGTAAGGCCATCACTGTGCACTAGGAGGAGGAGATTGAAAAAGCTTAGGAACGTGAGACTGATGAGTATGATACTCCCACTAGAGAAGCTGAGGAGGGCATATCCTCAGACGAAGCTGCAAAGCTTGACCCAAGCCCCTATCCCTTCTCCTACTATTTTGGTTCCTAAAAGGAAGAAAAAGAAGAAGAAAACAATCAGAAGCATAAGAGATGCCTTAGTACGACAAATTCATGAAGGAATGACTTTCAAAGAAGAAGAAGGAAAAGCAGGTGCTAGTATCAATATTATTCCTTTATCTTTATGTAAGAAGTTGAATATAGGAGATATTAAATCTACCCCTGTTAAACTGCAATTAGTTGATCAATCTGTGACAATTTGGCAAGATTTGGTTGATTATCGTCCTACATACGATTGTTCCTGTGGAGGAATTAAGCCAATCATACAACACATGGAGTCTGAGTTCGTGATGATCTTCTTGATGGGACTCAACGATTCATACTCCTCCGTTCGCGCCCAGATTCTTTTAATGAATCCTATTCCTGACATTACTAAGGTGTTTTCACTGGTTATACAAGAAGAGCGTCAGAGAATTGCTGGTAATCTTGTGCCATCTACTTCCTCTGATCAGATTACTCTTCTTGCTGCTGAAGCCTCCAAGAAACAAAATAATAATCGCTTTAGGAGAAATGATAATCAAAGGCCTGTTTGTTCTCATTGCAATGTCAAAGGTCATACAGTGGATAAATGCTACAAAATACATGGTTATCCACCTGGCTACCGGTCTCGGAATACTAAAGCTTCGTCTACTAAGGCTGTTGAAGCAAACGCTGTTACTCAGCCTCAGTCAAATTTTTTCTCAAGCCTCAACCAAACTCAATACAGTATTTGCTCCTCTGCTGTGCATAATTCTAGTGCTTGGATTTTAGATTCTGGTGCAGCTCGACACATATGTCATCAGTTTTCTTTGTTTCAAAATTGGCGTCGGGTTTATGGAATTACTGTTGTTCTTCCTACTACCTATCGTATGAGTGTTGAGTTTATGGGAGATATTCAGGACAAGAATCGCTTGATGATGATTGGCAGGGCTGAGTCTTCTAATGGCCTTTATATTTTGCTTCCTCCAGATAAGCCTTGTTTACTTTCTGAAACTATATGTTCTGTTAGTATGGTACTTGGCATGATCGTCTTGGACACATCTCACCTCAGAGCTAAGCAACGGAGGCTTGCTTTCCCTTTTAACAATCATGTCGCATCTGATATTTTTGATGTTGTCCATTGTGATGTTTGGGGACCCTTTAGAACCCCCACTTATGCTGGTTACAAATATTTTTTGACACTAGTCGATGACTGTTCGAGGTATACATGGACTTTCTTAATGCATTCCAAATCTGATGCTATCCATATTATACCTCGTTTCTTCCAGCTTGTCCTTACCCAATTTAATAAAACCATTAAGGTTTTTCGTTCAGACAATGCCCCTAAGCTTCAGTTTAAGGAATTCTTTGCTACAAAGGGAACGGTTCATCAATTCTCTTGCATTGAAACTCCCCAACAAAATTCTGTGGCTGAAAGGAAACACCAGCACCTCCTTAACGTTGCCAGAGCTTTACTCTTTCAGTCCAAAGTTCCCCTCAGATTCTGGGGAGATTGTGTGTTAACAGCCACATACCTCATCAATCGGATTCCAGCTCCCTTGTTAAAGCATAAGACTCCTTTTGAACTCTTGCACAAACGATCTGTTGATTACTCTGGACTCCGAGTCTTTGGTTGTCTATGTTATGCTTCTACGCTTGCTAACAACCGTTCAAAGTTTGATCCTCGTGCCAAACCTTGTGTGTTTCTTGGCTATTCGCCTGGTGTTAAAGGGTATCGATTGTCTGATATCGTTAGGAGACAACTTATCATATCTCGGGACGTTGTTTTCTTCGAAAATAAGTTTCCTTTTCATTCAACTGATGTCTCCGCTGAGGCCATAGATACTTTATTCTCTGATCATGTTTTGCCATGCTCAATCGTGGATCCAGTTGCTTTACATGAAGCAAATAATCTTTTGGATTCTCAAGAACATCAGGAGCCTTTGATATTTCCAGGTTCATCTACTGATTTTGTTGCAGCACAACCTGATGCTCAAACTGATGTTCCTAATCCTGCACAGGATTCTGTTGTACAACCTGATTTGGTTGATTTGGTTGATCCTGAGGTTGTTAATCAGCCATCTACTCATGTTTCTTTGAGGCGGTCCACTCGTCCCCATGTCAAGCCAAGCTTTCTCAATCAGTACCATTGCAACTCAGCTTGCTTATATCCTATTGATGATTATTTGTCCTATGATCATTTTTCTACAACACATAAGCATTTCATTCTGAATGTGTCTGCTGCTTATGAGCCATCTTACTTCCATCAAGCTATTAAATTTGATCATTGGAAGGAGGCTATGGACTCTGAAATTCGTGCAATGGAACGTACTTCTACGTGGACTATTGTTCCTTTACCTCCTGGCAAGCACATTGTTGGATGTAAATGGGTCTATCGGAATAAATATAAAACTGATGGTACCGTAGACCGTTATAAGGCCCGGCTCGTTGCCAAGGGTTACAGTCAACAAGAGGGCATTGATTTTTTTATACTTTTTCCCCTGTAG

Coding sequence (CDS)

ATGGAGTTTTGGAGTCATCTCGGTGTCTTGGACGGCGAGGAGGTGAAAAGACCCAATTGTGCTGAAGTGAACCTGAAGAATGACACCACACGCCGAAGATTGAAGCCCCGTACCGAGAAGAAACGAGACAAGACCGAGAAAATAAATAGTAAGATAAGGATCAGGGACGACGTCTCGACGTCGACCCGATTTTCATCTGGAATATGGGACTCTGTTGTCCAAGGCTATGGAGATGCACGTACATTACTGGAGGATATGGCCACAAACAGCTATCAGTGGCCATCTGAGCAGTCTACACCAAAGAAGGTTGCAGCTGGGGTGTTCGAGATCGATAATGTAAGTGCCCTCCAAGCTCAAATGTCATCCCTTGCTAATGCTTTATTGAAGTTTTCAGAGTCAAGTAATAGGACCAATAAGCTTGAGGAGGCAATGATCGCCATAACACCACAGTGTTTGGCCACAGTAAGGCCTCAGCTGAGCAGGAGAGACCCCAAATGGAGTACTGTAAGGCCATCACTGTGCACTAGGAGGAGGAGATTGAAAAAGCTTAGGAACGTGAGACTGATGAGTATGATACTCCCACTAGAGAAGCTGAGGAGGGCATATCCTCAGACGAAGCTGCAAAGCTTGACCCAAGCCCCTATCCCTTCTCCTACTATTTTGGTTCCTAAAAGGAAGAAAAAGAAGAAGAAAACAATCAGAAGCATAAGAGATGCCTTAGTACGACAAATTCATGAAGGAATGACTTTCAAAGAAGAAGAAGGAAAAGCAGGTGCTAGTATCAATATTATTCCTTTATCTTTATGTAAGAAGTTGAATATAGGAGATATTAAATCTACCCCTGTTAAACTGCAATTAGTTGATCAATCTGTGACAATTTGGCAAGATTTGGTTGATTATCGTCCTACATACGATTGTTCCTGTGGAGGAATTAAGCCAATCATACAACACATGGAGTCTGAGTTCGTGATGATCTTCTTGATGGGACTCAACGATTCATACTCCTCCGTTCGCGCCCAGATTCTTTTAATGAATCCTATTCCTGACATTACTAAGGTGTTTTCACTGGTTATACAAGAAGAGCGTCAGAGAATTGCTGGTAATCTTGTGCCATCTACTTCCTCTGATCAGATTACTCTTCTTGCTGCTGAAGCCTCCAAGAAACAAAATAATAATCGCTTTAGGAGAAATGATAATCAAAGGCCTGTTTGTTCTCATTGCAATGTCAAAGGTCATACAGTGGATAAATGCTACAAAATACATGGTTATCCACCTGGCTACCGGTCTCGGAATACTAAAGCTTCGTCTACTAAGGCTGTTGAAGCAAACGCTGTTACTCAGCCTCAGTCAAATTTTTTCTCAAGCCTCAACCAAACTCAATACAGTATTTGCTCCTCTGCTGTGCATAATTCTAGTGCTTGGATTTTAGATTCTGGTGCAGCTCGACACATATGTCATCAGTTTTCTTTGTTTCAAAATTGGCGTCGGGTTTATGGAATTACTGTTGTTCTTCCTACTACCTATCGTATGAGTGTTGAGTTTATGGGAGATATTCAGGACAAGAATCGCTTGATGATGATTGGCAGGGCTGAGTCTTCTAATGGCCTTTATATTTTGCTTCCTCCAGATAAGCCTTGTTTACTTTCTGAAACTATATGTTCTGTTAGTATGGTACTTGGCATGATCGTCTTGGACACATCTCACCTCAGAGCTAAGCAACGGAGGCTTGCTTTCCCTTTTAACAATCATGTCGCATCTGATATTTTTGATGTTGTCCATTGTGATGTTTGGGGACCCTTTAGAACCCCCACTTATGCTGGTTACAAATATTTTTTGACACTAGTCGATGACTGTTCGAGGTATACATGGACTTTCTTAATGCATTCCAAATCTGATGCTATCCATATTATACCTCGTTTCTTCCAGCTTGTCCTTACCCAATTTAATAAAACCATTAAGGTTTTTCGTTCAGACAATGCCCCTAAGCTTCAGTTTAAGGAATTCTTTGCTACAAAGGGAACGGTTCATCAATTCTCTTGCATTGAAACTCCCCAACAAAATTCTGTGGCTGAAAGGAAACACCAGCACCTCCTTAACGTTGCCAGAGCTTTACTCTTTCAGTCCAAAGTTCCCCTCAGATTCTGGGGAGATTGTGTGTTAACAGCCACATACCTCATCAATCGGATTCCAGCTCCCTTGTTAAAGCATAAGACTCCTTTTGAACTCTTGCACAAACGATCTGTTGATTACTCTGGACTCCGAGTCTTTGGTTGTCTATGTTATGCTTCTACGCTTGCTAACAACCGTTCAAAGTTTGATCCTCGTGCCAAACCTTGTGTGTTTCTTGGCTATTCGCCTGGTGTTAAAGGGTATCGATTGTCTGATATCGTTAGGAGACAACTTATCATATCTCGGGACGTTGTTTTCTTCGAAAATAAGTTTCCTTTTCATTCAACTGATGTCTCCGCTGAGGCCATAGATACTTTATTCTCTGATCATGTTTTGCCATGCTCAATCGTGGATCCAGTTGCTTTACATGAAGCAAATAATCTTTTGGATTCTCAAGAACATCAGGAGCCTTTGATATTTCCAGGTTCATCTACTGATTTTGTTGCAGCACAACCTGATGCTCAAACTGATGTTCCTAATCCTGCACAGGATTCTGTTGTACAACCTGATTTGGTTGATTTGGTTGATCCTGAGGTTGTTAATCAGCCATCTACTCATGTTTCTTTGAGGCGGTCCACTCGTCCCCATGTCAAGCCAAGCTTTCTCAATCAGTACCATTGCAACTCAGCTTGCTTATATCCTATTGATGATTATTTGTCCTATGATCATTTTTCTACAACACATAAGCATTTCATTCTGAATGTGTCTGCTGCTTATGAGCCATCTTACTTCCATCAAGCTATTAAATTTGATCATTGGAAGGAGGCTATGGACTCTGAAATTCGTGCAATGGAACGTACTTCTACGTGGACTATTGTTCCTTTACCTCCTGGCAAGCACATTGTTGGATGTAAATGGGTCTATCGGAATAAATATAAAACTGATGGTACCGTAGACCGTTATAAGGCCCGGCTCGTTGCCAAGGGTTACAGTCAACAAGAGGGCATTGATTTTTTTATACTTTTTCCCCTGTAG

Protein sequence

MEFWSHLGVLDGEEVKRPNCAEVNLKNDTTRRRLKPRTEKKRDKTEKINSKIRIRDDVSTSTRFSSGIWDSVVQGYGDARTLLEDMATNSYQWPSEQSTPKKVAAGVFEIDNVSALQAQMSSLANALLKFSESSNRTNKLEEAMIAITPQCLATVRPQLSRRDPKWSTVRPSLCTRRRRLKKLRNVRLMSMILPLEKLRRAYPQTKLQSLTQAPIPSPTILVPKRKKKKKKTIRSIRDALVRQIHEGMTFKEEEGKAGASINIIPLSLCKKLNIGDIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNNNRFRRNDNQRPVCSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMMIGRAESSNGLYILLPPDKPCLLSETICSVSMVLGMIVLDTSHLRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSACLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDFFILFPL
Homology
BLAST of Lag0011650 vs. NCBI nr
Match: KZV25004.1 (Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum])

HSP 1 Score: 582.4 bits (1500), Expect = 7.8e-162
Identity = 340/896 (37.95%), Postives = 469/896 (52.34%), Query Frame = 0

Query: 276  DIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYS 335
            D+ S   KL+      T+W +L DY+PT  C+CG ++    +   E VM FLMGLNDSY+
Sbjct: 150  DVSSYYTKLR------TLWDELRDYQPTSACTCGSMREWFNYQNQECVMHFLMGLNDSYA 209

Query: 336  SVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNNNRFR 395
             VRAQ+L++ P+P I KVF+LVIQEERQR     V     D   +L+   S        R
Sbjct: 210  QVRAQVLMIEPLPTIAKVFALVIQEERQRSIHYDVSKAGVDHSGILSNVNSSANTATSLR 269

Query: 396  RNDN------QRPVCSHCNVKGHTVDKCYKIHGYPPGY------------------RSRN 455
             + N       R +CSHC+ + HTVDKCYK+HGYPPG+                   S  
Sbjct: 270  TSQNSKGGRGDRIICSHCHFRNHTVDKCYKLHGYPPGHPKFKSQISQGSAHAHQASSSSE 329

Query: 456  TKASSTKAVEANAVTQPQS----NFFSSLNQTQYS----------------ICSSAVH-- 515
            T   + +   ++++TQ Q      F SS  QT+ +                ICS+  H  
Sbjct: 330  THQETQQIDHSDSLTQSQCKQLIEFLSSKLQTRQNLLMEHQPETTVSCLTGICSATSHIP 389

Query: 516  --NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTT-------------------- 575
                  WI+D+GA  HIC   S+F++ R +    VVLP T                    
Sbjct: 390  AITRKDWIMDTGATHHICCSLSMFKSSRAIQS-KVVLPNTLTIPVTIAGTVAVTSNLVLQ 449

Query: 576  ---------------------YRMSVEFMGD---IQDKNRLMMIGRAESSNGLYILLPPD 635
                                 +  SV FM D   IQD +++ MIG  +    LY+L  PD
Sbjct: 450  NVLYVPVFQFNLLSVSSLTDNHNCSVSFMSDSCKIQDISQIRMIGMGKRIGNLYVLQQPD 509

Query: 636  K--PCLLSET-------------------ICSVSMVLGMIVLD------TSHLRAKQRRL 695
            +  P  +  T                   + S+  VL +   D      + HL +KQRRL
Sbjct: 510  RFLPSYICNTFVSNSELWHRRMGHPSFNKLSSLKNVLNIENTDIVNICHSCHL-SKQRRL 569

Query: 696  AFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHI 755
                 N++++ IF+++H D WGPF   +  G+++F T+VDD SRYTW +++ SKSD + I
Sbjct: 570  PLASRNNISARIFELLHIDTWGPFSQTSVDGFRFFFTIVDDHSRYTWVYMLKSKSDVLSI 629

Query: 756  IPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQH 815
             P F ++V TQF  T+K  RSDNAP+L F +FFA  G  H  SC+E PQQNSV ERKHQH
Sbjct: 630  FPDFCRMVSTQFGVTVKSVRSDNAPELGFADFFAKAGITHYHSCVERPQQNSVVERKHQH 689

Query: 816  LLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVF 875
            +LNVARALLFQS +PL +W DC+ T+ YLINR P+P+L HKTPFELLH +   YS L+VF
Sbjct: 690  ILNVARALLFQSHIPLDYWCDCINTSVYLINRTPSPILAHKTPFELLHGKLPSYSHLKVF 749

Query: 876  GCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPF 935
            GCLCYASTL ++R KF PRA  CVF+GY PG KGY+L ++   ++ ISRDV+F EN FP+
Sbjct: 750  GCLCYASTLLSSRHKFSPRAIRCVFIGYPPGYKGYKLLNLETNEIFISRDVIFHENTFPY 809

Query: 936  HSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPD 995
             +T   + + D  F   V P S + P      +   D+Q+H                   
Sbjct: 810  QNTSPMSLS-DMTF--EVSPSSQITP------SIPADAQQHS------------------ 869

Query: 996  AQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH------- 1046
                                                 R++RPH  PS L  YH       
Sbjct: 870  -------------------------------------RTSRPHNTPSHLRDYHCYSISTP 929

BLAST of Lag0011650 vs. NCBI nr
Match: RVW82526.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 568.2 bits (1463), Expect = 1.5e-157
Identity = 332/905 (36.69%), Postives = 469/905 (51.82%), Query Frame = 0

Query: 292  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDIT 351
            ++W +L +++    C+CGG++  ++  + E VM FL+GLN+S++ ++AQILLM P P + 
Sbjct: 157  SLWDELREFKAIPICNCGGMRVYMEDQQRETVMQFLLGLNESFAPIQAQILLMEPTPPLN 216

Query: 352  KVFSLVIQEERQRIAGNLVPSTSSDQITLL------AAEASKKQNNNRFRRNDNQRPVCS 411
            KVFSLV+QEE QR   +L  S S    T +      A+ AS   N++R R++   RP+C+
Sbjct: 217  KVFSLVVQEEWQR---SLTTSNSPAFTTPVSSRFQAASRASSPTNSSRSRKD---RPLCT 276

Query: 412  HCNVKGHTVDKCYKIHGYPPGYRSR----------------------------------- 471
            HCN+ GHTVD+CYKIHGY PG+R+R                                   
Sbjct: 277  HCNILGHTVDRCYKIHGYTPGFRNRPNFRPNGSRPNQMLPNSLHTNQLTLTDGSIASASP 336

Query: 472  ------------------NTKASSTKAVEANAVTQPQSNFFSSLNQTQYSICSSAVHNSS 531
                              ++  SS    ++N + Q  SNF   L+ +     SS+  N S
Sbjct: 337  PPLTHDQHNQLLALLSLHSSSGSSASFGDSNPLQQSISNFTGILSLSP----SSSTLNPS 396

Query: 532  AWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGD--------------- 591
             WILDSGA  H+C   S+F +       TV LPT  ++ +  +G                
Sbjct: 397  IWILDSGATHHVCTNSSMFHSIHSFSSNTVTLPTGTKIPITGIGTIHLSPHLVLEHVLYI 456

Query: 592  -----------------------------IQDKNRLMMIGRAESSNGLYIL--------- 651
                                         IQD ++  +IG       LY+L         
Sbjct: 457  PTFQFNLISISALTQTNCFSFDFTAHFCFIQDHSQGKLIGMGRRQGNLYLLDSSVFRSIS 516

Query: 652  ----------------------------LPPDKPCLLSETICSVSMVLGMIVLDTSHLRA 711
                                        L   KP L  ++  + ++   +  L      A
Sbjct: 517  SVFVVDNNTSAHVNKLWHFRLSHPSNVKLSVLKPHLQLQSNGNTNLSCSICPL------A 576

Query: 712  KQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKS 771
            KQ+RL F  +N+++S  FD++HCD+WGPF  PT+ G++YFLT+VDDC+R TW  L+ +KS
Sbjct: 577  KQKRLPFDCHNNLSSSPFDLIHCDIWGPFHIPTHDGFRYFLTIVDDCTRNTWVHLLRAKS 636

Query: 772  DAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAE 831
            D   I P+FF +V T+F  TIK  RSDNAP+L     F     +H FSC+ETPQQNSV E
Sbjct: 637  DVKTIFPQFFSMVKTKFGLTIKAVRSDNAPELNLSNLFTQLDVLHFFSCVETPQQNSVVE 696

Query: 832  RKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYS 891
            RKHQH+LNVARAL FQS +P+ +WGDCVLT+ YLINRIP+PLL +KTPFELLH +S  YS
Sbjct: 697  RKHQHILNVARALYFQSNIPIGYWGDCVLTSVYLINRIPSPLLNNKTPFELLHHKSPSYS 756

Query: 892  GLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFE 951
             L+ FGCLCY+STL + R KF PRA PCVFLGY  G KGY++ D+   ++ +SR+V F E
Sbjct: 757  HLKSFGCLCYSSTLPSTRHKFSPRALPCVFLGYPFGYKGYKILDLETNRISVSRNVTFQE 816

Query: 952  NKFPFHSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFV 1011
            + FPF  +  +       FS  VLP                       P+  P  S D  
Sbjct: 817  SVFPFKLSQNNNSVASDFFSKKVLPV---------------------VPVSTPSPSFDNS 876

Query: 1012 AAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCN 1046
             + P+      NP  DS           P   +  +T     RS+R    P +L+ YHC+
Sbjct: 877  TSHPN------NP--DSSFND-----TSPHTTSHTTT-----RSSRVSQPPKYLSDYHCH 936

BLAST of Lag0011650 vs. NCBI nr
Match: KZV39348.1 (hypothetical protein F511_17540 [Dorcoceras hygrometricum])

HSP 1 Score: 562.8 bits (1449), Expect = 6.4e-156
Identity = 329/878 (37.47%), Postives = 452/878 (51.48%), Query Frame = 0

Query: 292  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDIT 351
            T+W +L D++P   C CG +K  + +   E  M FLMGLN+SY+ +RAQILLM+P+P I+
Sbjct: 66   TLWDELKDFQPISVCRCGSMKEWMDYQNQECAMQFLMGLNESYAQIRAQILLMDPLPVIS 125

Query: 352  KVFSLVIQEERQRIAGNLVPSTSSDQITLL--AAEASKKQNNNRFRRNDNQRPVCSHCNV 411
            K+FSLV+QEERQR     V     DQ  ++   A  +  +     +   + +  C+HC++
Sbjct: 126  KIFSLVVQEERQRSIHQGVGGKLLDQPLVMNYGANVAAVKGTYNPKGIKSDKVTCTHCHL 185

Query: 412  KGHTVDKCYKIHGYPPGYRSRNTKASSTKA------------------------------ 471
              HTVDKCYK+HGYPPG+     K S  K+                              
Sbjct: 186  PNHTVDKCYKLHGYPPGHPRYKLKQSDKKSHMIQSQPQADGTASVVGDILKPEHCRQLIA 245

Query: 472  -------------VEANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHIC- 531
                         +      QP  +  S  N T     S     + +WI+D+GA  HIC 
Sbjct: 246  FLSSQLQLGNGTTMALQQPQQPPESSTSCFNDTYSLSTSHTAFPTFSWIIDTGATHHICC 305

Query: 532  --HQFSLFQ-----------------------------------------NWRRVYGITV 591
              H F  F+                                         N   +  +T 
Sbjct: 306  SLHHFVSFKPFNSNVTLPNSLNIPVTHIGSVMLLPEIILQNVLFVPQFKFNLLSISSLTK 365

Query: 592  VLPTTYRMSVEFMGDIQDKNRLMMIGRAESSNGLYILL-PPDKPCLLSETICSVSM---- 651
             +P +   S E +  IQ  N+   IG       LY+L  PP     +  T+ S +     
Sbjct: 366  QIPCSVSFSSE-LCQIQVLNQARTIGTGRRIGDLYLLTSPPSSRMEVCATVHSKTQLWHY 425

Query: 652  -----------VLGMIVLDT------------SHLRAKQRRLAFPFNNHVASDIFDVVHC 711
                       +LG ++ ++             HL +KQ+RL F  NN V    FD+VH 
Sbjct: 426  RLGHISLPRLSILGDVIQESFSNNEALSACEICHL-SKQKRLPFISNNTVVDSCFDLVHI 485

Query: 712  DVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKV 771
            D+WGPF      G+KYFLT+VDD SRYTW  L+ SKSD   I P F +++ TQF K+IK 
Sbjct: 486  DIWGPFNPMNVDGFKYFLTIVDDHSRYTWVQLLKSKSDVTIIFPAFCRMIRTQFGKSIKA 545

Query: 772  FRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRF 831
             RSDNAP+LQF EFF  +G V   SC+E PQQNS+ ERKHQH+LNVARALLFQS +PL +
Sbjct: 546  VRSDNAPELQFSEFFKAEGIVSYHSCVERPQQNSIVERKHQHILNVARALLFQSNIPLVY 605

Query: 832  WGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDP 891
            W DC+LT+ YLINR+PAP+L +KTPFE++H +  ++S LRVFGCLCY STL ++R+KF P
Sbjct: 606  WSDCILTSVYLINRVPAPILSNKTPFEVMHTKIPNFSHLRVFGCLCYGSTLLSHRTKFSP 665

Query: 892  RAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDHV 951
            RA   +FLGY PG KGY+L ++   ++ ISRDV F E  FPF +  +SA           
Sbjct: 666  RAIRSIFLGYPPGYKGYKLLNLDTNEIYISRDVTFHETVFPFRNKPLSAAE--------- 725

Query: 952  LPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDL 1011
             PC       L+E + L  +Q  Q P                     PN           
Sbjct: 726  -PC-------LNEFDTLSTTQNVQNP---------------------PN----------- 785

Query: 1012 VDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC-------LYPIDDYLSYDHF 1046
               VD  VVN P        + R   +PS L+ YHC + C        +P+   LS    
Sbjct: 786  ---VDAPVVNDP-------LNGRHRKRPSHLSDYHCYAVCDPSCTSTAHPLSKVLSTHKL 845

BLAST of Lag0011650 vs. NCBI nr
Match: KZV17946.1 (hypothetical protein F511_10775 [Dorcoceras hygrometricum])

HSP 1 Score: 550.4 bits (1417), Expect = 3.3e-152
Identity = 332/880 (37.73%), Postives = 454/880 (51.59%), Query Frame = 0

Query: 292  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDIT 351
            T+W +L D++P   C CG +K  + +   E  M FLMGLN+SY+ +RAQILLM+P+P I+
Sbjct: 105  TLWDELKDFQPVSVCRCGSMKEWMDYRNQECAMQFLMGLNESYAQIRAQILLMDPLPTIS 164

Query: 352  KVFSLVIQEERQRIAGNLVPSTSSDQITLLA----AEASKKQNNNRFRRNDNQRPVCSHC 411
            K+FSLV+QEERQR     V     +Q  +++      A K   N++  + D  +  CSHC
Sbjct: 165  KIFSLVVQEERQRSINQGVEGRILEQPLIMSHGANVAAVKGSYNSKGTKTD--KVTCSHC 224

Query: 412  NVKGHTVDKCYKIHGYPPGYRSRNTKASSTK-------------AVEANAVTQPQS---- 471
            ++  HTVDKCYK+HGYPPG+     K S  K             A   N   +P+     
Sbjct: 225  HLPNHTVDKCYKLHGYPPGHPKYKVKQSDKKSHMTQSHSIADGVASTVNDFLKPEHCRQL 284

Query: 472  -NFFSS-----------LNQT----------QYSICSS-AVHNSSAWILDSGAARHIC-- 531
              F SS           L QT           YS+ +S  +   S+WI+D+GA  HIC  
Sbjct: 285  IAFLSSQLQIGNGTTMTLQQTPESSASCFNGTYSLATSHTILPPSSWIVDTGATHHICCS 344

Query: 532  -HQFSLFQNWRRVYGITVVLPTTYRMSVEFMG---------------------------- 591
             H F  F+     +   V LP    + V  +G                            
Sbjct: 345  PHHFVSFE----PFNSNVTLPNNLNIPVTHIGSVILSSEITLHNVLFVPQFKFNLLSISS 404

Query: 592  ----------------DIQDKNRLMMIGRAESSNGLYIL--------------------- 651
                             IQ  N+   IG       LYIL                     
Sbjct: 405  LTKQIPCLVSFSSESCQIQVLNQAKTIGTGRRVGDLYILTGSSPKIEVCTAAQSKTQLWH 464

Query: 652  -----LPPDKPCLLSETICSVSMVLG--MIVLDTSHLRAKQRRLAFPFNNHVASDIFDVV 711
                 +P  K  +L +T+ + S +    +   +  HL +KQ+RL F  NN +    FD+V
Sbjct: 465  FRLGHIPLPKLSILGDTLQN-SFINNDELSTCEICHL-SKQKRLPFISNNSIVDCCFDLV 524

Query: 712  HCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTI 771
            H D+WGPF      G+KYFLT+VDD SRYTW  L+ SKS+ I I P F +++  QF K+I
Sbjct: 525  HIDIWGPFNPMNVDGFKYFLTIVDDHSRYTWVQLLKSKSEVIDIFPTFCRMIHKQFGKSI 584

Query: 772  KVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPL 831
            K  RSDNAP+L+F EFF  +G V   SC+E PQQNSV ERKHQH+LNVARALLFQS +PL
Sbjct: 585  KSVRSDNAPELKFSEFFKAEGIVAFHSCVERPQQNSVVERKHQHILNVARALLFQSGIPL 644

Query: 832  RFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKF 891
             +W +C+LTA YLINR PAPLL +KTPFEL+H +   YS LRVFGCLCY STL N R+KF
Sbjct: 645  VYWSECILTAVYLINRTPAPLLSNKTPFELMHNKPPTYSHLRVFGCLCYGSTLLNQRTKF 704

Query: 892  DPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSD 951
             PRA   +FLGY PG KGY+L ++   ++ ISRDV+F E  FPF +   S+         
Sbjct: 705  SPRATRSIFLGYPPGYKGYKLLNLDTNEVYISRDVIFHETVFPFKNKSTSSPE------- 764

Query: 952  HVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQP 1011
                         H  +N+++   +Q P               +  T++P          
Sbjct: 765  -------------HCLDNIINDGSNQLP-------------TQNFATEIP---------- 824

Query: 1012 DLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC-------LYPIDDYLSYD 1046
                      VN   T +S     R   KPS LN YHC + C        +PI + LS  
Sbjct: 825  ---------TVNPDETLIS-----RHKRKPSHLNDYHCYAVCNPTGSSTAHPISNVLSTH 884

BLAST of Lag0011650 vs. NCBI nr
Match: KAG7588381.1 (Integrase catalytic core [Arabidopsis suecica])

HSP 1 Score: 548.1 bits (1411), Expect = 1.6e-151
Identity = 328/902 (36.36%), Postives = 466/902 (51.66%), Query Frame = 0

Query: 292  TIWQDLVDYRPTYDC-----SCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNP 351
            T+W +L       DC     SC   K + +  ++  V+ FL GLN+SYS +R+QI++   
Sbjct: 188  TLWNEL----DASDCVKLCKSCDCCKAMDKRGDNARVIKFLAGLNESYSVIRSQIIMKKH 247

Query: 352  IPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNNNRFRRNDNQRPVCSH 411
            +P + ++++L+ Q+  QR   +  P+ S+     ++A  S + + N       Q+ +CSH
Sbjct: 248  VPSLAEIYNLLDQDHSQR---SFTPTPSNAVAFHVSAPESVQASVNATYVKP-QKVICSH 307

Query: 412  CNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEAN------------AVTQPQSN--- 471
            C   GHTVDKCYKIHGYP G++ +N K  + K V  N            A+T+  +N   
Sbjct: 308  CGYTGHTVDKCYKIHGYPMGFKHKN-KNQADKTVNVNHTAPVKPVVAQLAMTETATNDLL 367

Query: 472  ------------------FFSSLNQTQYSICSSA-------------------------- 531
                              F S L  T  +  SS+                          
Sbjct: 368  TGLTRVLTKDQINGVVAYFNSQLQNTPPATASSSGAMITALPGISFSSSTIGFVGVLRAT 427

Query: 532  --VHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGD-------- 591
              V +S +WI+DSGA  H+CH  +LF +       +V LPT + + +  +G         
Sbjct: 428  GNVLSSESWIIDSGATHHVCHDKALFLSLSETLNNSVTLPTGFGVKITGIGTFKLNEFLI 487

Query: 592  ------------------------------------IQDKNRLMMIGRAESSNGLYIL-- 651
                                                IQD  + +MIGR E  + LY+L  
Sbjct: 488  LNNVLYIPDFRLNLLSISQLTKDLGYRVTFDEASCIIQDPIKGLMIGRGEQISNLYVLDV 547

Query: 652  ----LPPDKPCLLSETICSVSM---VLGMIVLDTSHL---------------------RA 711
                 P D+    +  +   S+    LG   ++ S +                      A
Sbjct: 548  QSVVNPADQITFSTNIVVDSSLWHSRLGHPSMEKSDIITDVLGFKQRNKRSFHCTICPLA 607

Query: 712  KQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKS 771
            KQ+ L F   N+V    FD+VH DVWGPF  PT+ GY+YFLT+VDD +R TW +L+ +KS
Sbjct: 608  KQKHLPFKSKNNVCDSAFDLVHIDVWGPFAVPTHDGYRYFLTIVDDHTRITWLYLLRNKS 667

Query: 772  DAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAE 831
            + + I P F ++V TQ+  T+K  RSDNAP+L+F E F  KG +H FSC ETP+QNSV E
Sbjct: 668  EVLTIFPDFLKMVETQYKTTVKGVRSDNAPELKFVELFKAKGIIHYFSCPETPEQNSVVE 727

Query: 832  RKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYS 891
            RKHQH+LNVAR+L+FQ+ VPL +WG+CVLTA +LINR+P PLLK K+P E+L  +  DY 
Sbjct: 728  RKHQHILNVARSLMFQAHVPLEYWGECVLTAVFLINRLPTPLLKDKSPIEVLTSKVPDYG 787

Query: 892  GLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFE 951
            G RVFGCLCY+ST + NR+KF PRAKPC+FLGY PGVKGY+L D+    + +SR+VVF E
Sbjct: 788  GFRVFGCLCYSSTSSKNRNKFQPRAKPCIFLGYPPGVKGYKLLDLETNAIHVSRNVVFHE 847

Query: 952  NKFPFHSTDVSA-----EAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGS 1011
            + FPF+  + S      ++ID    D  +  + + PV + E+                  
Sbjct: 848  DIFPFNKGNTSTLPDFFKSIDVAVEDTTVGPNNIVPVVVSES------------------ 907

Query: 1012 STDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLN 1046
                    P    DVPN    +V  P       P+ VN+           R    P +LN
Sbjct: 908  --------PTVVNDVPNSVSPAVTAP-------PKEVNR-----------RVSKAPEYLN 967

BLAST of Lag0011650 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 240.7 bits (613), Expect = 7.3e-62
Identity = 148/484 (30.58%), Postives = 241/484 (49.79%), Query Frame = 0

Query: 569  LRAKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMH 628
            L  KQ R++F  ++    +I D+V+ DV GP    +  G KYF+T +DD SR  W +++ 
Sbjct: 461  LFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILK 520

Query: 629  SKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKL---QFKEFFATKGTVHQFSCIETPQ 688
            +K     +  +F  LV  +  + +K  RSDN  +    +F+E+ ++ G  H+ +   TPQ
Sbjct: 521  TKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQ 580

Query: 689  QNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHK 748
             N VAER ++ ++   R++L  +K+P  FWG+ V TA YLINR P+  L  + P  +   
Sbjct: 581  HNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTN 640

Query: 749  RSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISR 808
            + V YS L+VFGC  +A      R+K D ++ PC+F+GY     GYRL D V++++I SR
Sbjct: 641  KEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSR 700

Query: 809  DVVFFENKFPFHSTDVSAEAIDTLFSDHV-LPCSIVDPVALHEANNLLDSQEHQEPLIFP 868
            DVVF E++    + D+S +  + +  + V +P +  +P +     + +  Q  Q     P
Sbjct: 701  DVVFRESEVR-TAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQ-----P 760

Query: 869  GSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSF 928
            G   +      +   +V +P Q                      H  LRRS RP V+   
Sbjct: 761  GEVIEQGEQLDEGVEEVEHPTQGE------------------EQHQPLRRSERPRVE--- 820

Query: 929  LNQYHCNSACLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKF---DHWKEAMD 988
                    +  YP  +Y+               +S   EP    + +     +   +AM 
Sbjct: 821  --------SRRYPSTEYVL--------------ISDDREPESLKEVLSHPEKNQLMKAMQ 880

Query: 989  SEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVDRYKARLVAKGYSQQEGIDF 1046
             E+ ++++  T+ +V LP GK  + CKWV++ K   D  + RYKARLV KG+ Q++GIDF
Sbjct: 881  EEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDF 895

BLAST of Lag0011650 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 2.6e-59
Identity = 225/890 (25.28%), Postives = 370/890 (41.57%), Query Frame = 0

Query: 327  LMGLNDSYSSVRAQILLMNPIPDITKVFSLVIQEERQRIAGN---LVPSTSSDQITLLAA 386
            L  L D Y  V  QI   +  P +T++   +I  E + +A N   +VP T ++ +T    
Sbjct: 154  LENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLALNSAEVVPIT-ANVVTHRNT 213

Query: 387  EASKKQNNNRFRRN--------------------DNQRPV-----CSHCNVKGHTVDKCY 446
              ++ QNN    RN                    DN++P      C  C+V+GH+  +C 
Sbjct: 214  NTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCP 273

Query: 447  KIHGYPPGYRSRNTKASSTKAVEANAVTQPQSNFFSSLNQTQYSICSSAVHNSSAWILDS 506
            ++H     ++S   +  ST         QP++N           +  ++ +N++ W+LDS
Sbjct: 274  QLH----QFQSTTNQQQSTSPF---TPWQPRAN-----------LAVNSPYNANNWLLDS 333

Query: 507  GAARHICHQFSLFQNWRRVY----------GITV--------VLPT-------------- 566
            GA  HI   F+   ++ + Y          G T+         LPT              
Sbjct: 334  GATHHITSDFNNL-SFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVP 393

Query: 567  --------------TYRMSVEFMG---DIQDKNRLMMIGRAESSNGLY--------ILLP 626
                          T R+SVEF      ++D N  + + + ++ + LY         +  
Sbjct: 394  NIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSM 453

Query: 627  PDKPC-----------LLSETICSVSMVL---GMIVLDTSH--------LRAKQRRLAFP 686
               PC           L   ++  ++ V+    + VL+ SH           K  ++ F 
Sbjct: 454  FASPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFS 513

Query: 687  FNNHVASDIFDVVHCDVWGPFRTPTYA--GYKYFLTLVDDCSRYTWTFLMHSKSDAIHII 746
             +   +S   + ++ DVW    +P  +   Y+Y++  VD  +RYTW + +  KS      
Sbjct: 514  NSTITSSKPLEYIYSDVWS---SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTF 573

Query: 747  PRFFQLVLTQFNKTIKVFRSDNAPK-LQFKEFFATKGTVHQFSCIETPQQNSVAERKHQH 806
              F  LV  +F   I    SDN  + +  +++ +  G  H  S   TP+ N ++ERKH+H
Sbjct: 574  IIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRH 633

Query: 807  LLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVF 866
            ++ +   LL  + VP  +W      A YLINR+P PLL+ ++PF+ L  +  +Y  L+VF
Sbjct: 634  IVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVF 693

Query: 867  GCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPF 926
            GC CY      NR K + ++K C F+GYS     Y    I   +L  SR V F E  FPF
Sbjct: 694  GCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPF 753

Query: 927  HSTDVSAEAIDTLFSDH---------------VLPC---------------SIVDPVALH 986
             +T+          SD                VLP                S   P+   
Sbjct: 754  STTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTT 813

Query: 987  E-ANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPE--VV 1042
            + +++ L S     P     ++      QP AQ   P+  Q+S     +++  +P     
Sbjct: 814  QVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQ---PHQTQNSNSNSPILNNPNPNSPSP 873

BLAST of Lag0011650 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 3.9e-55
Identity = 167/568 (29.40%), Postives = 256/568 (45.07%), Query Frame = 0

Query: 545  PCLLSETICSVSMVLGMIVLDTSH--------LRAKQRRLAFPFNNHVASDIFDVVHCDV 604
            P +L+  I + S    + VL+ SH        L  K  ++ F  +   ++   + ++ DV
Sbjct: 476  PSILNSVISNYS----LSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDV 535

Query: 605  WGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFR 664
            W      ++  Y+Y++  VD  +RYTW + +  KS        F  L+  +F   I  F 
Sbjct: 536  WSS-PILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFY 595

Query: 665  SDNAPK-LQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFW 724
            SDN  + +   E+F+  G  H  S   TP+ N ++ERKH+H++     LL  + +P  +W
Sbjct: 596  SDNGGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYW 655

Query: 725  GDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPR 784
                  A YLINR+P PLL+ ++PF+ L   S +Y  LRVFGC CY      N+ K D +
Sbjct: 656  PYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDK 715

Query: 785  AKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAI-------DT 844
            ++ CVFLGYS     Y    +   +L ISR V F EN FPF +   +   +         
Sbjct: 716  SRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSC 775

Query: 845  LFSDH--------VLPC-SIVDPVALHEA-----------------NNLLDS-------- 904
            ++S H        VLP  S  DP   H A                 ++ LDS        
Sbjct: 776  VWSPHTTLPTRTPVLPAPSCSDP---HHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPS 835

Query: 905  --------QEHQEPLIFP-GSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQ 964
                    Q   +P   P  + T   ++Q  +Q +  N +   + Q              
Sbjct: 836  SPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPS 895

Query: 965  PSTHVSLRRSTRP-------HVKPSFLNQYHCNSAC---LYPIDDYLSYDHFSTTHKHFI 1024
            P+T  S   ST P       H  P      + N+      + +             K+ +
Sbjct: 896  PTTSAS-SSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSL 955

Query: 1025 -LNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGK-HIVGCKWVYRN 1042
             ++++A  EP    QA+K + W+ AM SEI A     TW +VP PP    IVGC+W++  
Sbjct: 956  AVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTK 1015

BLAST of Lag0011650 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 187.2 bits (474), Expect = 9.5e-46
Identity = 153/526 (29.09%), Postives = 232/526 (44.11%), Query Frame = 0

Query: 569  LRAKQRRLAF---PFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTF 628
            L  KQ RL F       H+   +F VVH DV GP    T     YF+  VD  + Y  T+
Sbjct: 459  LNGKQARLPFKQLKDKTHIKRPLF-VVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTY 518

Query: 629  LMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKL---QFKEFFATKGTVHQFSCIE 688
            L+  KSD   +   F       FN  +     DN  +    + ++F   KG  +  +   
Sbjct: 519  LIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPH 578

Query: 689  TPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLL--KHKTPF 748
            TPQ N V+ER  + +   AR ++  +K+   FWG+ VLTATYLINRIP+  L    KTP+
Sbjct: 579  TPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPY 638

Query: 749  ELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQ 808
            E+ H +      LRVFG   Y   + N + KFD ++   +F+GY P   G++L D V  +
Sbjct: 639  EMWHNKKPYLKHLRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYEP--NGFKLWDAVNEK 698

Query: 809  LIISRDVVFFEN--------KFPFHSTDVSAEAIDTLF-SDHVLPCSIVDPVALHEANNL 868
             I++RDVV  E         KF       S E+ +  F +D         P    E +N+
Sbjct: 699  FIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNI 758

Query: 869  --LDSQEHQEPLIFPGSSTDFVAAQPDAQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTH 928
              L   +  E   FP  S   +      QT+ PN +++     ++  L D +  N+   +
Sbjct: 759  QFLKDSKESENKNFPNDSRKII------QTEFPNESKEC---DNIQFLKDSKESNKYFLN 818

Query: 929  VSLRR---------------------STRPHVKPSFLNQYHCNSAC--------LYPIDD 988
             S +R                      T  H+K   ++    N                 
Sbjct: 819  ESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKP 878

Query: 989  YLSYDHFSTTHKHFILNVSAAYE--PSYFHQAIKFD---HWKEAMDSEIRAMERTSTWTI 1042
             +SY+    +    +LN    +   P+ F +    D    W+EA+++E+ A +  +TWTI
Sbjct: 879  QISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTI 938

BLAST of Lag0011650 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 90.1 bits (222), Expect = 1.6e-16
Identity = 40/79 (50.63%), Postives = 54/79 (68.35%), Query Frame = 0

Query: 963  EPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVD 1022
            EP     A+K   W +AM  E+ A+ R  TW +VP P  ++I+GCKWV++ K  +DGT+D
Sbjct: 27   EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 1023 RYKARLVAKGYSQQEGIDF 1042
            R KARLVAKG+ Q+EGI F
Sbjct: 87   RLKARLVAKGFHQEEGIYF 105

BLAST of Lag0011650 vs. ExPASy TrEMBL
Match: A0A2N9HKE6 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40300 PE=3 SV=1)

HSP 1 Score: 597.4 bits (1539), Expect = 1.1e-166
Identity = 341/851 (40.07%), Postives = 475/851 (55.82%), Query Frame = 0

Query: 292  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDIT 351
            ++W +L ++RP  DCSCG +K ++ + + E+VM FLMGLNDS+S VRAQIL+ +P+P IT
Sbjct: 579  SLWDELSNFRPIPDCSCGAMKVLLDNKQHEYVMQFLMGLNDSFSHVRAQILMTDPLPSIT 638

Query: 352  KVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNNNRFRRNDNQRPVCSHCNVKG 411
            K F+LVIQEERQR       + ++D + L     + + N  + +     RP+CSHC + G
Sbjct: 639  KAFALVIQEERQRNINIPSLAPAADSVALFTRGEATRHNYGKNQSYKKDRPICSHCGITG 698

Query: 412  HTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAV--------TQPQSNFFSSLNQTQYSI 471
            HTVDKCYK+HGYPPGY+    KA    A +++AV        TQ Q     S+  +Q S+
Sbjct: 699  HTVDKCYKLHGYPPGYK---FKAKMHSAHQSSAVVEDPHLPFTQAQCQQLLSMLSSQASL 758

Query: 472  CS--------------------SAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVV 531
             S                    S+  + +A  +    + H+ H    F +        + 
Sbjct: 759  ASLQSSQHPVNNQVVSQESAGTSSTPHQAASAISHFMSDHMVHSLRKFTSITSSINTYIH 818

Query: 532  LPTTYRMSVEFMGDIQDKNRLMM---------IGRAESSNGLYILLPPDK--PCLLSETI 591
            LP   ++    +G +Q    L++         IG     NGLY L       P     ++
Sbjct: 819  LPNGEKVLATHIGTVQVTTSLLLTDDLVTWKRIGLGRKKNGLYFLQDSTDAVPSSSFPSV 878

Query: 592  CSVSMVLGMIVLDTSHLR---------------------------------AKQRRLAFP 651
             + + V    V D  H R                                 +KQ+RL F 
Sbjct: 879  AAHTAVNNTPVFDVWHHRLGHPSLSRLSLLKNVISDLVMPSANEHCKVCHISKQKRLPFH 938

Query: 652  FNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPR 711
               H A   FD++HCD+WGP+  PT    +YFLT+VDDC+R TW FLM  KS+   +I  
Sbjct: 939  TAVHFADLPFDLIHCDIWGPYHVPTIDQQRYFLTIVDDCTRCTWVFLMKQKSETSPLIQS 998

Query: 712  FFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLN 771
            FF L+ TQF+ +IK+ RSDN P+ +   F+A  GT+HQ SC+ TPQQN+  ERKHQHLL 
Sbjct: 999  FFALIKTQFSASIKMVRSDNGPEFKMPSFYAQHGTLHQKSCVGTPQQNATVERKHQHLLM 1058

Query: 772  VARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCL 831
            VARAL FQ+ +PL FWG CVLTAT+LINRIP PLL +K  FELL K+  +YS LRVFGCL
Sbjct: 1059 VARALRFQANLPLPFWGYCVLTATHLINRIPTPLLGNKF-FELLFKKLPNYSCLRVFGCL 1118

Query: 832  CYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHST 891
            CYA+TL++NR KF PR+K CV LGY  G+KGYRL D+  +Q+ +SRDV+F+EN FPFH+ 
Sbjct: 1119 CYAATLSHNRHKFAPRSKQCVMLGYPQGIKGYRLLDLDTKQVFVSRDVLFYENSFPFHTL 1178

Query: 892  DVSAEAIDTLFSDHVLPCSIVD-PVAL----HEANNLLDSQEHQEPLIFPGSSTDFVAAQ 951
              S     T  +  VLP  I D P++L     + N    S     PL  P S +      
Sbjct: 1179 QPS-----TPTACMVLPSPITDLPMSLSPITFDTNTSTSSSLFNSPLHSPLSPS------ 1238

Query: 952  PDAQTDVPNPAQDSVVQ-PDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSA 1011
              + T  P P   +++Q PD+V        N PST  +LR+STR H  PS+L  +HCN+A
Sbjct: 1239 -HSHTSSPLPVNSTLLQPPDIVSDQTAPPFNPPST--TLRKSTRIHKPPSYLQAFHCNTA 1298

Query: 1012 -------------------CLYPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFD 1046
                                ++P+ +Y+SY   +  +  F+L+ SA  EP+ FH+A K  
Sbjct: 1299 SSGPAHSPSSPATNQGTAPTVFPLSNYISYSQLAPCYHSFVLSASAIREPTSFHEASKDP 1358

BLAST of Lag0011650 vs. ExPASy TrEMBL
Match: A0A2N9HKX8 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40206 PE=4 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 2.5e-166
Identity = 325/831 (39.11%), Postives = 466/831 (56.08%), Query Frame = 0

Query: 292  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDIT 351
            ++W +L ++R   DCSCG +K ++ + + E+VM FLMGLNDS++ VRAQIL+ +P+P IT
Sbjct: 78   SLWDELNNFRSIPDCSCGALKVLLDNKQHEYVMQFLMGLNDSFTHVRAQILMTDPLPTIT 137

Query: 352  KVFSLVIQEERQRIAGNLVPSTSSDQITLLA-AEASKKQNNNRFRRNDNQRPVCSHCNVK 411
            K F+LV+QEERQR       + + D + L    EA +     + +    +RP+CSHC + 
Sbjct: 138  KAFALVVQEERQRNINIPSLAPAGDSVALFTRGEAPRNHYGGKGQFIKKERPLCSHCGIT 197

Query: 412  GHTVDKCYKIHGYPPGYRSRNT--KASSTKA--------------------VEANAVTQP 471
            GHTVDKCYK+HGYPPGY+ +N    A+ T A                    + + A   P
Sbjct: 198  GHTVDKCYKLHGYPPGYKFKNKMHSANQTSATGEEIHLPFTQVQCQQLLAMLSSQASLNP 257

Query: 472  QSNFFSSLNQTQYSICSSAVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTY 531
                 S+    Q    SS+  + +A  +    A H+ H  S F +        + LP   
Sbjct: 258  SQPQMSNQITCQNQDASSSTPHQAASAISQFMADHMVHSLSKFSSVISTINTYIHLPNGE 317

Query: 532  RMSVEFMGDIQDKNRLMMIGRAESSNGLYIL-------LPPDKPCLL------------- 591
            +     +G +QD      IG     NGLY L        P   P +              
Sbjct: 318  KALATHIGTVQDLVAWKRIGLGRKRNGLYFLQVSTTATKPHSFPSVAVHTAVNNTPTFDV 377

Query: 592  ---------SETICSVSMVLGMIVLDTS--HLR----AKQRRLAFPFNNHVASDIFDVVH 651
                     S  +  +  V+  +V+ ++  H +    +KQ+RL F  + HV +  F+++H
Sbjct: 378  WHHRLGHPSSSRLSLLKHVINDLVIPSANEHCKVCHISKQKRLPFTNSVHVTAVPFELIH 437

Query: 652  CDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIK 711
            CD+WGP+  PT    KYFLT+VDD +R TW FLM  KS+ + +I  FF L+ TQF+ TIK
Sbjct: 438  CDIWGPYHVPTLDNQKYFLTIVDDFTRCTWVFLMKQKSETVSLIQSFFTLIKTQFSATIK 497

Query: 712  VFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLR 771
              RSDN  +     F+A  GT+HQ SC+ TPQQN+  ERKHQHLL VARAL FQ+ +PL 
Sbjct: 498  KIRSDNGLEFHMPSFYAQHGTLHQKSCVGTPQQNATVERKHQHLLAVARALRFQANLPLP 557

Query: 772  FWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFD 831
            FWG CVLTAT+LINR P PLL +K+PFE+L  +S +YS LRVFGCLCYA+TL++NR KF 
Sbjct: 558  FWGYCVLTATHLINRTPTPLLANKSPFEVLFNKSPNYSYLRVFGCLCYATTLSHNRHKFA 617

Query: 832  PRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHSTDVSAEAIDTLFSDH 891
            PR+  C+ LGY  G+KGYRL ++  RQ+ +SRDV+F+EN FPFH++     A   +    
Sbjct: 618  PRSIQCIMLGYPQGIKGYRLLNLSTRQIFVSRDVIFYENSFPFHTSYNLPTATSMVLPHP 677

Query: 892  VLPC-SIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAA---QPDAQTDVPNPAQDSV 951
            V    + + P++    N+ L   +H  P +    S    ++    P   T  P  A    
Sbjct: 678  VTDTPTAISPISFDSDNSALSLSDHNSPTLSDQVSISLHSSPSHSPLHNTSQPASAVSVP 737

Query: 952  VQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYHCNSACL-------------- 1011
            +    V + +  V + PS   ++R+STRPH  PS+L ++HCNSA L              
Sbjct: 738  LSNPTVPIPENMVPSHPSIVPAIRKSTRPHKAPSYLQEFHCNSASLSTASHSSSTTAQGT 797

Query: 1012 ----YPIDDYLSYDHFSTTHKHFILNVSAAYEPSYFHQAIKFDHWKEAMDSEIRAMERTS 1043
                +P+ ++LSY + +  +  F+LN S   EP+ F +A +   W EAM +E+ A+E  +
Sbjct: 798  ISTNFPLSNFLSYSNLAPCYHSFVLNASTIREPTSFQEASQDPKWCEAMQAELAALEANN 857

BLAST of Lag0011650 vs. ExPASy TrEMBL
Match: A0A2N9GZW3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33057 PE=4 SV=1)

HSP 1 Score: 592.8 bits (1527), Expect = 2.8e-165
Identity = 349/923 (37.81%), Postives = 481/923 (52.11%), Query Frame = 0

Query: 292  TIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNPIPDIT 351
            ++W +L ++RP  DCSCG +K ++ + + E+VM FLMGLNDS+S VRAQIL+ +P+P IT
Sbjct: 579  SLWDELSNFRPIPDCSCGAMKVLLDNKQHEYVMQFLMGLNDSFSHVRAQILMTDPLPSIT 638

Query: 352  KVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNNNRFRRNDNQRPVCSHCNVKG 411
            K F+LVIQEERQR       + ++D + L     + + N  + +     RP+CSHC + G
Sbjct: 639  KAFALVIQEERQRNINIPSLAPAADSVALFTRGEATRHNYGKNQSYKKDRPICSHCGITG 698

Query: 412  HTVDKCYKIHGYPPGYRSRNTKASSTKAVEANAV--------TQPQSNFFSSLNQTQYSI 471
            HTVDKCYK+HGYPPGY+    KA    A +++AV        TQ Q     S+  +Q S+
Sbjct: 699  HTVDKCYKLHGYPPGYK---FKAKMHSAHQSSAVVEDPHLPFTQAQCQQLLSMLSSQASL 758

Query: 472  CS--SAVH---------------------------------------------------- 531
             S  S+ H                                                    
Sbjct: 759  ASLQSSQHPVNNQVVSQESAGTSSTPHQAASAISHFMSGISSFSHTVPKHSIFSVQHVNK 818

Query: 532  ---NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTTYRMSVEFMGDIQDKNRLMM 591
               + S WILD+GA  H+ H    F +        + LP   ++    +G +Q    L++
Sbjct: 819  TRFSHSTWILDTGATDHMVHSLRKFTSITSSINTYIHLPNGEKVLATHIGTVQVTTSLLL 878

Query: 592  --------------------------------------------IGRAESSNGLYILLPP 651
                                                        IG     NGLY L   
Sbjct: 879  TDVLCVPSFSFNLISISKLTNTPSCCVFFLSHFCFIQDLVTWKRIGLGRKKNGLYFLQDS 938

Query: 652  DK--PCLLSETICSVSMVLGMIVLDTSHLR------------------------------ 711
                P      + + + V    V D  H R                              
Sbjct: 939  TDAVPSSSFPLVAAHTAVNNTPVFDVWHHRLGHPSLSRLSLLKNVISDLVMPSANEHCKV 998

Query: 712  ---AKQRRLAFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLM 771
               +KQ+RL F    H A   FD++HCD+WGP+  PT    +YFLT+VDDC+R TW FLM
Sbjct: 999  CHISKQKRLPFHTAVHFADLPFDLIHCDIWGPYHVPTIDQQRYFLTIVDDCTRCTWVFLM 1058

Query: 772  HSKSDAIHIIPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQN 831
              KS+   +I  FF L+ TQF+ +IK+ RSDN P+ +   F+A  GT+HQ SC+ TPQQN
Sbjct: 1059 KQKSETSPLIQSFFALIKTQFSASIKMVRSDNGPEFKMPSFYAQHGTLHQKSCVGTPQQN 1118

Query: 832  SVAERKHQHLLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRS 891
            +  ERKHQHLL VARAL FQ+ +PL FWG CVLTAT+LINRIP PLL +K+PFELL K+ 
Sbjct: 1119 ATVERKHQHLLMVARALRFQANLPLPFWGYCVLTATHLINRIPTPLLGNKSPFELLFKKL 1178

Query: 892  VDYSGLRVFGCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDV 951
             +YS LRVFGCLCYA+TL++NR KF PR+K CV LGY  G+KGYRL D+  +Q+ +SRDV
Sbjct: 1179 PNYSCLRVFGCLCYAATLSHNRHKFAPRSKQCVMLGYPQGIKGYRLLDLDTKQVFVSRDV 1238

Query: 952  VFFENKFPFHSTDVSAEAIDTLFSDHVLPCSIVD-PVAL----HEANNLLDSQEHQEPLI 1011
            +F+EN FPFH+   S     T  +  VLP  I D P++L     + N    S     PL 
Sbjct: 1239 LFYENSFPFHTLQPS-----TPTASMVLPSPITDLPMSLSPITFDTNTSTSSSLFNSPLH 1298

Query: 1012 FPGSSTDFVAAQPDAQTDVPNPAQDSVVQ-PDLVDLVDPEVVNQPSTHVSLRRSTRPHVK 1046
             P S +        + T  P P   +++Q PD+V        N PST  +LR+STR H  
Sbjct: 1299 SPLSPS-------HSHTSSPLPVNSTLLQPPDIVSDQTAPPFNPPST--TLRKSTRIHKP 1358

BLAST of Lag0011650 vs. ExPASy TrEMBL
Match: A0A2Z7AT15 (Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Dorcoceras hygrometricum OX=472368 GN=F511_01974 PE=4 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 3.8e-162
Identity = 340/896 (37.95%), Postives = 469/896 (52.34%), Query Frame = 0

Query: 276  DIKSTPVKLQLVDQSVTIWQDLVDYRPTYDCSCGGIKPIIQHMESEFVMIFLMGLNDSYS 335
            D+ S   KL+      T+W +L DY+PT  C+CG ++    +   E VM FLMGLNDSY+
Sbjct: 150  DVSSYYTKLR------TLWDELRDYQPTSACTCGSMREWFNYQNQECVMHFLMGLNDSYA 209

Query: 336  SVRAQILLMNPIPDITKVFSLVIQEERQRIAGNLVPSTSSDQITLLAAEASKKQNNNRFR 395
             VRAQ+L++ P+P I KVF+LVIQEERQR     V     D   +L+   S        R
Sbjct: 210  QVRAQVLMIEPLPTIAKVFALVIQEERQRSIHYDVSKAGVDHSGILSNVNSSANTATSLR 269

Query: 396  RNDN------QRPVCSHCNVKGHTVDKCYKIHGYPPGY------------------RSRN 455
             + N       R +CSHC+ + HTVDKCYK+HGYPPG+                   S  
Sbjct: 270  TSQNSKGGRGDRIICSHCHFRNHTVDKCYKLHGYPPGHPKFKSQISQGSAHAHQASSSSE 329

Query: 456  TKASSTKAVEANAVTQPQS----NFFSSLNQTQYS----------------ICSSAVH-- 515
            T   + +   ++++TQ Q      F SS  QT+ +                ICS+  H  
Sbjct: 330  THQETQQIDHSDSLTQSQCKQLIEFLSSKLQTRQNLLMEHQPETTVSCLTGICSATSHIP 389

Query: 516  --NSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLPTT-------------------- 575
                  WI+D+GA  HIC   S+F++ R +    VVLP T                    
Sbjct: 390  AITRKDWIMDTGATHHICCSLSMFKSSRAIQS-KVVLPNTLTIPVTIAGTVAVTSNLVLQ 449

Query: 576  ---------------------YRMSVEFMGD---IQDKNRLMMIGRAESSNGLYILLPPD 635
                                 +  SV FM D   IQD +++ MIG  +    LY+L  PD
Sbjct: 450  NVLYVPVFQFNLLSVSSLTDNHNCSVSFMSDSCKIQDISQIRMIGMGKRIGNLYVLQQPD 509

Query: 636  K--PCLLSET-------------------ICSVSMVLGMIVLD------TSHLRAKQRRL 695
            +  P  +  T                   + S+  VL +   D      + HL +KQRRL
Sbjct: 510  RFLPSYICNTFVSNSELWHRRMGHPSFNKLSSLKNVLNIENTDIVNICHSCHL-SKQRRL 569

Query: 696  AFPFNNHVASDIFDVVHCDVWGPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHI 755
                 N++++ IF+++H D WGPF   +  G+++F T+VDD SRYTW +++ SKSD + I
Sbjct: 570  PLASRNNISARIFELLHIDTWGPFSQTSVDGFRFFFTIVDDHSRYTWVYMLKSKSDVLSI 629

Query: 756  IPRFFQLVLTQFNKTIKVFRSDNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQH 815
             P F ++V TQF  T+K  RSDNAP+L F +FFA  G  H  SC+E PQQNSV ERKHQH
Sbjct: 630  FPDFCRMVSTQFGVTVKSVRSDNAPELGFADFFAKAGITHYHSCVERPQQNSVVERKHQH 689

Query: 816  LLNVARALLFQSKVPLRFWGDCVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVF 875
            +LNVARALLFQS +PL +W DC+ T+ YLINR P+P+L HKTPFELLH +   YS L+VF
Sbjct: 690  ILNVARALLFQSHIPLDYWCDCINTSVYLINRTPSPILAHKTPFELLHGKLPSYSHLKVF 749

Query: 876  GCLCYASTLANNRSKFDPRAKPCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPF 935
            GCLCYASTL ++R KF PRA  CVF+GY PG KGY+L ++   ++ ISRDV+F EN FP+
Sbjct: 750  GCLCYASTLLSSRHKFSPRAIRCVFIGYPPGYKGYKLLNLETNEIFISRDVIFHENTFPY 809

Query: 936  HSTDVSAEAIDTLFSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPD 995
             +T   + + D  F   V P S + P      +   D+Q+H                   
Sbjct: 810  QNTSPMSLS-DMTF--EVSPSSQITP------SIPADAQQHS------------------ 869

Query: 996  AQTDVPNPAQDSVVQPDLVDLVDPEVVNQPSTHVSLRRSTRPHVKPSFLNQYH------- 1046
                                                 R++RPH  PS L  YH       
Sbjct: 870  -------------------------------------RTSRPHNTPSHLRDYHCYSISTP 929

BLAST of Lag0011650 vs. ExPASy TrEMBL
Match: A0A2N9IZK3 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57667 PE=4 SV=1)

HSP 1 Score: 579.3 bits (1492), Expect = 3.2e-161
Identity = 353/953 (37.04%), Postives = 477/953 (50.05%), Query Frame = 0

Query: 293  IWQDLVDYRPTYDCSCGG------IKPIIQHMESEFVMIFLMGLNDSYSSVRAQILLMNP 352
            +W + ++YRP   C+CG        K +I++   ++V  FLMGLN+++++VR QILLM P
Sbjct: 164  LWDEFLNYRPIPSCTCGAKCMCGLSKTLIEYQHYDYVHSFLMGLNETFAAVRGQILLMEP 223

Query: 353  IPDITKVFSLVIQEERQRIAGNL---VPSTSSDQITLLAAEASKKQNNNRFRRNDNQRPV 412
            +P I KVFSL+   E+Q+ AG L   V  +S D   L    AS+K            +P+
Sbjct: 224  LPGINKVFSLIQNHEKQKGAGILPLPVGFSSVDSTAL----ASRK-----------DKPI 283

Query: 413  CSHCNVKGHTVDKCYKIHGYPPGYRSRNTKASSTKAVEA--------------------- 472
            CSHC  KGH  +KCYK+HGYPPG++ +   A +   V                       
Sbjct: 284  CSHCGYKGHVAEKCYKLHGYPPGFQRKPRNAPAANQVSCPMTMASNGHDNSQNVPSLAMQ 343

Query: 473  -------------------------------------------NAVTQPQSNF------- 532
                                                        A  QP SN        
Sbjct: 344  CQQFLNMLTAQAQKGPSSSDSHTSPHQAATLITVTQPSAQPSIQAPIQPPSNMAGIPMCL 403

Query: 533  --FSSLNQTQYSICSS-----AVHNSSAWILDSGAARHICHQFSLFQNWRRVYGITVVLP 592
              FS  N   YS+ S+        ++S W++D+GA  H+      F   + V+ +TV LP
Sbjct: 404  STFSKPNMA-YSVFSNDHFDKVSVSASEWVIDTGATDHMVTTTHYFTTMKLVHNVTVNLP 463

Query: 593  TTYRMSVEFMGD--------------------------------------------IQDK 652
                ++V  +G                                             IQD 
Sbjct: 464  NGQSVNVTHIGSIQLTASLLLTDVLCVPSFDFNLISVSKLTSSLQCCIFFLSTYCFIQDL 523

Query: 653  NRLMMIGRAESSNGLYIL------------LPPDK---------------------PCLL 712
             +  MIG     NGLY+L              PD                       C L
Sbjct: 524  MQWRMIGMGRQQNGLYMLDLSSHSKLTAAVNVPDSFHKLLYSFSTIKHSSNSFHTWHCRL 583

Query: 713  SETICSVSMVLGMIVLDTSHL-----------RAKQRRLAFPFNNHVASDIFDVVHCDVW 772
                 S    L  ++ D SH             AKQ+RL FP NNHV+S  FD++H D+W
Sbjct: 584  GHPSSSRMNFLSTVMPDISHSCKDTHVCTVCPLAKQKRLPFPNNNHVSSIAFDILHVDIW 643

Query: 773  GPFRTPTYAGYKYFLTLVDDCSRYTWTFLMHSKSDAIHIIPRFFQLVLTQFNKTIKVFRS 832
            GP+  PT  GYKYFLTLVDDC+R TW +LM SKS+   ++  F  ++ TQF   +K  RS
Sbjct: 644  GPYHVPTVEGYKYFLTLVDDCTRTTWVYLMKSKSETRPLLISFITMIQTQFGSHVKHVRS 703

Query: 833  DNAPKLQFKEFFATKGTVHQFSCIETPQQNSVAERKHQHLLNVARALLFQSKVPLRFWGD 892
            DN  +    +F+AT+G +HQ SC+ETPQQNSV ERKHQH+LNVAR+L FQS +PL+FWG 
Sbjct: 704  DNGQEFSMPDFYATQGIIHQHSCVETPQQNSVVERKHQHILNVARSLCFQSNLPLKFWGH 763

Query: 893  CVLTATYLINRIPAPLLKHKTPFELLHKRSVDYSGLRVFGCLCYASTLANNRSKFDPRAK 952
             VLTA YLINR+P+P+L HK+P+E L  ++  YS LRVFGCLC+ASTL+N+R+KFDPRAK
Sbjct: 764  SVLTAVYLINRLPSPILSHKSPYEKLLHKAPSYSHLRVFGCLCFASTLSNHRTKFDPRAK 823

Query: 953  PCVFLGYSPGVKGYRLSDIVRRQLIISRDVVFFENKFPFHST--------DVSAEAIDTL 1012
            PCVFLGY  GVKGY+L D+    +IISRDV+F E+ FPF +T        D +       
Sbjct: 824  PCVFLGYPSGVKGYKLLDLTNHNVIISRDVIFHEHVFPFANTPSADFSPFDNNLPTSQPN 883

Query: 1013 FSDHVLPCSIVDPVALHEANNLLDSQEHQEPLIFPGSSTDFVAAQPDAQT------DVPN 1042
            FSD  L  +I  P+     N  L S+E       P S +  +   P A++      DVP 
Sbjct: 884  FSDIPLDSTISCPM-----NQGLSSEE-------PCSVSTPILTSPSAESPTIPHLDVP- 943

BLAST of Lag0011650 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 144.8 bits (364), Expect = 3.8e-34
Identity = 72/146 (49.32%), Postives = 97/146 (66.44%), Query Frame = 0

Query: 903  VNQPSTHVSLRRSTRPHVKPSFLNQYHCNSAC---LYPIDDYLSYDHFSTTHKHFILNVS 962
            V +PS H S RR+     KP++L  Y+C+S     ++ I  +LSY+  S  +  F++ ++
Sbjct: 26   VPEPSVHTSHRRTR----KPAYLQDYYCHSVASLTIHDISQFLSYEKVSPLYHSFLVCIA 85

Query: 963  AAYEPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDG 1022
             A EPS +++A +F  W  AMD EI AME T TW I  LPP K  +GCKWVY+ KY +DG
Sbjct: 86   KAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDG 145

Query: 1023 TVDRYKARLVAKGYSQQEGIDFFILF 1046
            T++RYKARLVAKGY+QQEGIDF   F
Sbjct: 146  TIERYKARLVAKGYTQQEGIDFIETF 167

BLAST of Lag0011650 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 90.1 bits (222), Expect = 1.1e-17
Identity = 40/79 (50.63%), Postives = 54/79 (68.35%), Query Frame = 0

Query: 963  EPSYFHQAIKFDHWKEAMDSEIRAMERTSTWTIVPLPPGKHIVGCKWVYRNKYKTDGTVD 1022
            EP     A+K   W +AM  E+ A+ R  TW +VP P  ++I+GCKWV++ K  +DGT+D
Sbjct: 27   EPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLD 86

Query: 1023 RYKARLVAKGYSQQEGIDF 1042
            R KARLVAKG+ Q+EGI F
Sbjct: 87   RLKARLVAKGFHQEEGIYF 105

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KZV25004.17.8e-16237.95Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum][more]
RVW82526.11.5e-15736.69Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
KZV39348.16.4e-15637.47hypothetical protein F511_17540 [Dorcoceras hygrometricum][more]
KZV17946.13.3e-15237.73hypothetical protein F511_10775 [Dorcoceras hygrometricum][more]
KAG7588381.11.6e-15136.36Integrase catalytic core [Arabidopsis suecica][more]
Match NameE-valueIdentityDescription
P109787.3e-6230.58Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q9ZT942.6e-5925.28Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW23.9e-5529.40Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P041469.5e-4629.09Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P925201.6e-1650.63Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A2N9HKE61.1e-16640.07Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40300 PE=3 SV=1[more]
A0A2N9HKX82.5e-16639.11Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
A0A2N9GZW32.8e-16537.81Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
A0A2Z7AT153.8e-16237.95Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Dorcoceras hygrometricum O... [more]
A0A2N9IZK33.2e-16137.04Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS57667 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.13.8e-3449.32cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.11.1e-1750.63Reverse transcriptase (RNA-dependent DNA polymerase) [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 991..1042
e-value: 3.4E-10
score: 39.9
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 581..754
e-value: 5.5E-35
score: 122.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 28..48
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 948..1041
coord: 585..847
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 575..747
score: 15.484966
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 584..743

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0011650.1Lag0011650.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding