Lag0031657 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0031657
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionIntegrase catalytic domain-containing protein
Locationchr11: 11506735 .. 11507931 (+)
RNA-Seq ExpressionLag0031657
SyntenyLag0031657
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTGCTTCTACTTCGCACTCTTCTCCAGGCACTTTACTTAATACTTCTCCATCTGATTCTAATGTGTGGTTGTCTGATATAGGGTGTAATGCACATCTTACTAGTGACCTTGCAAACTTGGGCATTTCTACTGCTTATAATGGGGAAGAGAACATAACAGTTGGTAATGGTCAGTCACTACCCATTTCTCATTTTGGTCCTGGTCAGCTTTCCCTTCCCAATGCCTCTTTTACTTTATCTAATCTTTTTCGTGTTCCTGATATATCAACAAATCTCCTTTCTGTTCATCAATTATGTATAGACAATAATTGTTGTTTCATCTTTGATTCATCCTCTTTTACCATTCAGGACAAATCAACGGGCAAAGTTCTCTTCCACGGACCTAGTGTCAACGGTCTTTATCCACTGGTTGCAAAATCTCCTTCTCCAGCACAAGTAACCCTTACGGCCCAAGTTGGTATCAAGGCTTCCACTATTGTGTGGCATGATTGGTTAGGTCACCCTTGTCTTTCGATTCTAAATTCTGTTTTGAATTCCTCTTCTATTCCAGTTAGTCGGTCTGATATTGGTGTTTGTAAACATTGTCTTGATGGCAAGTTGTCTAAACAACCTTTTCTCCTATCATCCTCTCTTTCTTGTTCTCCTTTAGAGTTACTGCATAGTGATGCATGGGGCCCTGCTCCTGATAAATCAATAAATGGTCATCGCTATTATGTTTCTTTTGTTGATGATTTTTCACGCTATACTTGGATCTTTCCCATGTGTTACAAATCTGATGTGTTTTCTATTTTTAGTCAGTTTCTGCCATTTGCTAAAAACCTACTTTCTTCCCGCCTTAACGTTTTTCGTAGTGATGGGGGTGGTGAATATCTTAGCAATGACCTTAAAAATTTATTTGCTAATCAGGGTATACTTCACCAAAAATCTTGTCCTTACACCCCTGAGCAAAATGGTATCGATGAACGTAAACATCGACATATTGTCAATATGGCATTGTCATTACTATCTAAATCATCTATTCCTATGCGGTTCTGGTTCTTCGCCTTTGCAACTGCCGAATATCTCATAAATCGCCTACCGTCTCCAAACTTAGCTCACAAATCTCCTTTTGAACTTCTCTTTAAAAAACCTCCAGATTATACTTCTCTTCATGTTTTTGGGTGTGCCTGTTATCATTTATTACGTCCTTATTGA

mRNA sequence

ATGGCTGCTTCTACTTCGCACTCTTCTCCAGGCACTTTACTTAATACTTCTCCATCTGATTCTAATGTGTGGTTGTCTGATATAGGGTGTAATGCACATCTTACTAGTGACCTTGCAAACTTGGGCATTTCTACTGCTTATAATGGGGAAGAGAACATAACAGTTGGTAATGGTCAGTCACTACCCATTTCTCATTTTGGTCCTGGTCAGCTTTCCCTTCCCAATGCCTCTTTTACTTTATCTAATCTTTTTCGTGTTCCTGATATATCAACAAATCTCCTTTCTGTTCATCAATTATGTATAGACAATAATTGTTGTTTCATCTTTGATTCATCCTCTTTTACCATTCAGGACAAATCAACGGGCAAAGTTCTCTTCCACGGACCTAGTGTCAACGGTCTTTATCCACTGGTTGCAAAATCTCCTTCTCCAGCACAAGTAACCCTTACGGCCCAAGTTGGTATCAAGGCTTCCACTATTGTGTGGCATGATTGGTTAGGTCACCCTTGTCTTTCGATTCTAAATTCTGTTTTGAATTCCTCTTCTATTCCAGTTAGTCGGTCTGATATTGGTGTTTGTAAACATTGTCTTGATGGCAAGTTGTCTAAACAACCTTTTCTCCTATCATCCTCTCTTTCTTGTTCTCCTTTAGAGTTACTGCATAGTGATGCATGGGGCCCTGCTCCTGATAAATCAATAAATGGTCATCGCTATTATGTTTCTTTTGTTGATGATTTTTCACGCTATACTTGGATCTTTCCCATGTGTTACAAATCTGATGTGTTTTCTATTTTTAGTCAGTTTCTGCCATTTGCTAAAAACCTACTTTCTTCCCGCCTTAACGTTTTTCGTAGTGATGGGGGTGGTGAATATCTTAGCAATGACCTTAAAAATTTATTTGCTAATCAGGGTATACTTCACCAAAAATCTTGTCCTTACACCCCTGAGCAAAATGGTATCGATGAACGTAAACATCGACATATTGTCAATATGGCATTGTCATTACTATCTAAATCATCTATTCCTATGCGGTTCTGGTTCTTCGCCTTTGCAACTGCCGAATATCTCATAAATCGCCTACCGTCTCCAAACTTAGCTCACAAATCTCCTTTTGAACTTCTCTTTAAAAAACCTCCAGATTATACTTCTCTTCATGTTTTTGGGTGTGCCTGTTATCATTTATTACGTCCTTATTGA

Coding sequence (CDS)

ATGGCTGCTTCTACTTCGCACTCTTCTCCAGGCACTTTACTTAATACTTCTCCATCTGATTCTAATGTGTGGTTGTCTGATATAGGGTGTAATGCACATCTTACTAGTGACCTTGCAAACTTGGGCATTTCTACTGCTTATAATGGGGAAGAGAACATAACAGTTGGTAATGGTCAGTCACTACCCATTTCTCATTTTGGTCCTGGTCAGCTTTCCCTTCCCAATGCCTCTTTTACTTTATCTAATCTTTTTCGTGTTCCTGATATATCAACAAATCTCCTTTCTGTTCATCAATTATGTATAGACAATAATTGTTGTTTCATCTTTGATTCATCCTCTTTTACCATTCAGGACAAATCAACGGGCAAAGTTCTCTTCCACGGACCTAGTGTCAACGGTCTTTATCCACTGGTTGCAAAATCTCCTTCTCCAGCACAAGTAACCCTTACGGCCCAAGTTGGTATCAAGGCTTCCACTATTGTGTGGCATGATTGGTTAGGTCACCCTTGTCTTTCGATTCTAAATTCTGTTTTGAATTCCTCTTCTATTCCAGTTAGTCGGTCTGATATTGGTGTTTGTAAACATTGTCTTGATGGCAAGTTGTCTAAACAACCTTTTCTCCTATCATCCTCTCTTTCTTGTTCTCCTTTAGAGTTACTGCATAGTGATGCATGGGGCCCTGCTCCTGATAAATCAATAAATGGTCATCGCTATTATGTTTCTTTTGTTGATGATTTTTCACGCTATACTTGGATCTTTCCCATGTGTTACAAATCTGATGTGTTTTCTATTTTTAGTCAGTTTCTGCCATTTGCTAAAAACCTACTTTCTTCCCGCCTTAACGTTTTTCGTAGTGATGGGGGTGGTGAATATCTTAGCAATGACCTTAAAAATTTATTTGCTAATCAGGGTATACTTCACCAAAAATCTTGTCCTTACACCCCTGAGCAAAATGGTATCGATGAACGTAAACATCGACATATTGTCAATATGGCATTGTCATTACTATCTAAATCATCTATTCCTATGCGGTTCTGGTTCTTCGCCTTTGCAACTGCCGAATATCTCATAAATCGCCTACCGTCTCCAAACTTAGCTCACAAATCTCCTTTTGAACTTCTCTTTAAAAAACCTCCAGATTATACTTCTCTTCATGTTTTTGGGTGTGCCTGTTATCATTTATTACGTCCTTATTGA

Protein sequence

MAASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY
Homology
BLAST of Lag0031657 vs. NCBI nr
Match: RWR76373.1 (putative polyprotein [Cinnamomum micranthum f. kanehirae])

HSP 1 Score: 375.9 bits (964), Expect = 4.2e-100
Identity = 176/379 (46.44%), Postives = 252/379 (66.49%), Query Frame = 0

Query: 24  WLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNASFTLSNL 83
           W +D G   H+TS++ NL + + Y+  + ++VGNG  L ISH G   +S P+++F L+N+
Sbjct: 458 WYTDTGATDHITSNIGNLSLRSDYHRPDKVSVGNGAGLHISHIGSNSISTPSSNFRLNNM 517

Query: 84  FRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYPL-VAKSP 143
             VP ISTNL+SVH+   DNNC FIFDSS F I+DK++GK LF G S NGLYP  + + P
Sbjct: 518 LCVPHISTNLISVHRFANDNNCFFIFDSSGFCIKDKASGKTLFRGQSKNGLYPFPIRRLP 577

Query: 144 SPAQVT-LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCKHCLDG 203
           + +      A VG + +  +WH  LGHP  ++   + ++  +PV  S     +C  C  G
Sbjct: 578 THSNNDGHAAFVGERVTASIWHSRLGHPASAVFQHLASAFQLPVDGSSKLSSICTPCQMG 637

Query: 204 KLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKS 263
           K  K PF +SSS+S +PL+L+H D WG +P+ SI+G+ YYVSF+DD ++Y W +P+  KS
Sbjct: 638 KSKKLPFSISSSISSNPLDLIHCDLWGSSPELSISGYSYYVSFIDDCTKYVWFYPLATKS 697

Query: 264 DVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNG 323
             F  F +F  + +N+LS+ +  F+SDGGGE++SN  +N   + GI H+ SCP+TPEQNG
Sbjct: 698 QTFVTFLKFKAYVENMLSTTIKAFQSDGGGEFMSNRFQNFLTSHGIAHRVSCPHTPEQNG 757

Query: 324 IDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLFKKPP 383
           + ERKH HIV M L+LL+ S +P+++W  AF TA +LINRLP+  L +KSP+E LF + P
Sbjct: 758 VAERKHCHIVEMGLTLLATSHMPLQYWVEAFNTAGFLINRLPTKVLNNKSPWECLFNRSP 817

Query: 384 DYTSLHVFGCACYHLLRPY 399
           +Y  LH FGC C+  LRPY
Sbjct: 818 NYCFLHTFGCLCFPWLRPY 836

BLAST of Lag0031657 vs. NCBI nr
Match: TQE09310.1 (hypothetical protein C1H46_005046 [Malus baccata])

HSP 1 Score: 370.9 bits (951), Expect = 1.3e-98
Identity = 186/397 (46.85%), Postives = 250/397 (62.97%), Query Frame = 0

Query: 3   ASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLP 62
           A T+ S+P    ++S   S  WL D G   H+TSDL+NL ++  Y+  + IT  NG  L 
Sbjct: 11  AMTARSTP----SSSAPQSEYWLLDSGATHHMTSDLSNLHVAVPYSSSDTITGANGAGLQ 70

Query: 63  ISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTG 122
           I+H G   LSLP  +  L+++  VP +S +LLS+HQLC DNNC  I D  S  IQDK T 
Sbjct: 71  IAHIGQSTLSLPTNNLCLTSVLHVPQLSQHLLSMHQLCKDNNCRCIVDEFSVCIQDKVTQ 130

Query: 123 KVLFHGPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSS 182
           KVL+ G S N +YPL     SP  V+  A +G + S+ +WH  LGHP   +L + L+ + 
Sbjct: 131 KVLYQGLSNNAVYPLPVLKSSP--VSPAAYIGQRISSALWHCRLGHPANPVLKAALSKAD 190

Query: 183 IPVSRSDIG-VCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVS 242
           I  S +D    CK CL GK +  PF   +S S  P E++H+D WGP+P  SI  +RYYVS
Sbjct: 191 ISFSCTDSSTTCKACLQGKFTGLPFPSLASKSVIPFEVIHTDVWGPSPSVSIENYRYYVS 250

Query: 243 FVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFA 302
           F+D+ +RYTWIFP+  K+ VF +F QF  F  N     + + +SDGGGEY+    +N   
Sbjct: 251 FIDECTRYTWIFPIMNKAAVFGLFVQFQAFVHNYFKVSIRILQSDGGGEYVGLQFQNFLK 310

Query: 303 NQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLP 362
            +GILH KSCPYTP+QNG+ ERK+RHI   A++LL ++ +P +FW+ A ATA YLINR+P
Sbjct: 311 TKGILHHKSCPYTPQQNGLAERKNRHITETAVTLLQQARLPPKFWYHACATAVYLINRMP 370

Query: 363 SPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           +P LA +SPFE L+  PP    L +FGCACY  LRPY
Sbjct: 371 TPVLAMQSPFEKLYHSPPKLDHLKIFGCACYPSLRPY 401

BLAST of Lag0031657 vs. NCBI nr
Match: KAB2610253.1 (hypothetical protein D8674_018285 [Pyrus ussuriensis x Pyrus communis])

HSP 1 Score: 365.9 bits (938), Expect = 4.3e-97
Identity = 172/390 (44.10%), Postives = 256/390 (65.64%), Query Frame = 0

Query: 15  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLP 74
           ++S   + +WL+D G   H+T+DL+NL +++ Y   + +   NG+ L +SH G   +  P
Sbjct: 292 SSSQPSTQLWLADSGATNHMTTDLSNLTLASPYPTNKTVQTANGEGLRVSHVGSTVIHTP 351

Query: 75  NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGL 134
                L+++  VP +S NLLSVH++C+DNNC  IFD+  F IQDK TG++L+ G   NGL
Sbjct: 352 VHPIQLNSVLYVPKLSQNLLSVHRMCLDNNCWLIFDAFCFWIQDKDTGRILYKGLCSNGL 411

Query: 135 YPL--VAKSP---SPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD 194
           YP+  +AK P   SP  +  +A +G   S+ +WH  LGHP  +I++++L+ ++I  S+ D
Sbjct: 412 YPIPSLAKHPASFSPTNIKASAYLGQLISSSLWHSRLGHPTNNIVSTMLSKANIRCSKDD 471

Query: 195 IG-VCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSR 254
           +  VC  CL+GK +K PF  S+  S  P E++HSD WGPAP  SI+G ++YV+ +D+ +R
Sbjct: 472 VPIVCHSCLEGKFTKLPFQSSTHQSQIPFEVVHSDLWGPAPCNSIDGFKFYVTIIDECTR 531

Query: 255 YTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQ 314
           + W+FP+  KSD F  F  F  F +   S+ + + +SDGGGEY+++ L+     +G++H 
Sbjct: 532 FCWVFPLINKSDFFYTFVSFYAFVQAQFSATIKILQSDGGGEYINHKLQAFLKVKGVVHH 591

Query: 315 KSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHK 374
            SCPYTPEQNG+ ERKHRH++   ++LL  + +P +FW FA   A YLINR+P+P L HK
Sbjct: 592 ISCPYTPEQNGLAERKHRHLIETTVTLLQYAKLPSQFWSFACQAAAYLINRMPTPILKHK 651

Query: 375 SPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           SPFELLF   P  T L VFGC+C+ LL+PY
Sbjct: 652 SPFELLFGTSPVITHLRVFGCSCFPLLKPY 681

BLAST of Lag0031657 vs. NCBI nr
Match: KAA8524269.1 (hypothetical protein F0562_010692 [Nyssa sinensis])

HSP 1 Score: 364.0 bits (933), Expect = 1.6e-96
Identity = 189/412 (45.87%), Postives = 247/412 (59.95%), Query Frame = 0

Query: 17  SPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNA 76
           S    N W +D G   H+T+DLANL     Y G++NIT+ NGQ+L ISH G   +   + 
Sbjct: 381 SDCSPNYWYTDTGATNHITADLANLNFPVEYQGDDNITIANGQALDISHSGQSSIHANDH 440

Query: 77  SFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYP 136
           +F L+N+  VP ++TNLLSVHQ C DN+C FIFDS  F IQDK+T ++LF GPS +GLYP
Sbjct: 441 TFRLNNVLCVPSMATNLLSVHQFCKDNHCRFIFDSEMFQIQDKATKQLLFQGPSDHGLYP 500

Query: 137 L----VAKSPSPA-QVTL-------------------------TAQVGIKASTIVWHDWL 196
           L    + K  +P+ Q  L                         TA +G + ST++WHD L
Sbjct: 501 LPTSSITKHSAPSLQPPLHFQHYNKHCANHSPLQRNNYSDSPHTAYLGKQVSTVLWHDRL 560

Query: 197 GHPCLSILNSVLNSSSIPVSRSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWG 256
           GHP  + L S+L+S+SI   R    +C+HCL GK++K PF LS++ S +PL+L+HSD WG
Sbjct: 561 GHPSTATLQSILSSASITAPRDSAPLCQHCLIGKMTKLPFPLSTTESTAPLQLVHSDLWG 620

Query: 257 PAPDKSINGHRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSD 316
           PAP  S +   YYVSFVDDFS                                    RSD
Sbjct: 621 PAPHTSFDNFTYYVSFVDDFS------------------------------------RSD 680

Query: 317 GGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFW 376
           GGGEY   +L  L    GI H++SCP+TP+QNGI ERKHRHIV   L+LLS++S+P+++W
Sbjct: 681 GGGEYNKTELTQLLTQSGIHHERSCPHTPQQNGIAERKHRHIVETGLTLLSRASLPLKYW 740

Query: 377 FFAFATAEYLINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
             AF+TA YLINR+P+  L+H SP+E LF  PPDYT L  FG ACY LL+PY
Sbjct: 741 TLAFSTATYLINRMPTKVLSHLSPYEKLFHSPPDYTILKTFGYACYPLLKPY 756

BLAST of Lag0031657 vs. NCBI nr
Match: PKU75882.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum])

HSP 1 Score: 361.7 bits (927), Expect = 8.2e-96
Identity = 178/391 (45.52%), Postives = 241/391 (61.64%), Query Frame = 0

Query: 10  PGTLLNTSPS--DSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFG 69
           P T   +SPS   S+ W  D G + HLTSD      S  Y G   + +GNG  LPI + G
Sbjct: 323 PNTAFFSSPSPHTSSEWYLDSGASTHLTSDQTQFQSSQPYTGSSQVILGNGNQLPIHNTG 382

Query: 70  PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFH 129
            G L  P  S  L NL  VP++S NLLSV+QL  DNNC   F S  F I+D  T +VL  
Sbjct: 383 KGILPTPQGSLQLKNLNLVPNLSFNLLSVYQLTRDNNCLITFSSCGFEIKDMMTHQVLLK 442

Query: 130 GPSVNGLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSR 189
           GP +NGLY + A SP+ ++  L A + ++A   +WH  LGHP  S L+S+    S     
Sbjct: 443 GPCINGLYSIRATSPNLSKTEL-ALISVQAIPDLWHRRLGHPSASTLSSLAKHFSDICIS 502

Query: 190 SDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFS 249
           S   +C  C   K  ++PF +S S S SP +L+HSD WGP+P  S  G+RYYVSF+D+FS
Sbjct: 503 SASSMCNSCQMAKSHRKPFPVSQSTSVSPFDLVHSDVWGPSPSTSCQGYRYYVSFIDNFS 562

Query: 250 RYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILH 309
           ++TW++P+ +KS+VF  F +F    K    + + + R+DGGGE+++N    L  N GI+H
Sbjct: 563 KFTWVYPLIHKSEVFQKFCEFQKMIKCQFKTDIRILRTDGGGEFINNKFTALLKNLGIIH 622

Query: 310 QKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAH 369
           Q +CPY+P QNG+ ERKHRH+     SLL ++S+P  FW     TA YLINRLPSPN +H
Sbjct: 623 QFTCPYSPPQNGVAERKHRHLAETIRSLLLEASLPHTFWVDTLFTATYLINRLPSPNTSH 682

Query: 370 KSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           KSP+E+L+++ P+Y  L VFGC CY  L+PY
Sbjct: 683 KSPYEILYRRSPNYKFLKVFGCLCYPWLKPY 712

BLAST of Lag0031657 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 309.7 bits (792), Expect = 4.8e-83
Identity = 171/384 (44.53%), Postives = 225/384 (58.59%), Query Frame = 0

Query: 17  SPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLPNA 76
           SP  SN WL D G   H+TSD  NL +   Y G +++ V +G ++PISH G   LS  + 
Sbjct: 324 SPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSR 383

Query: 77  SFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYP 136
              L N+  VP+I  NL+SV++LC  N     F  +SF ++D +TG  L  G + + LY 
Sbjct: 384 PLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYE 443

Query: 137 LVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD--IGVCK 196
               S  P  V+L A    KA+   WH  LGHP  SILNSV+++ S+ V         C 
Sbjct: 444 WPIASSQP--VSLFASPSSKATHSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLSCS 503

Query: 197 HCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFP 256
            CL  K +K PF  S+  S  PLE ++SD W  +P  S + +RYYV FVD F+RYTW++P
Sbjct: 504 DCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYP 563

Query: 257 MCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYT 316
           +  KS V   F  F    +N   +R+  F SD GGE+++  L   F+  GI H  S P+T
Sbjct: 564 LKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVA--LWEYFSQHGISHLTSPPHT 623

Query: 317 PEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELL 376
           PE NG+ ERKHRHIV   L+LLS +SIP  +W +AFA A YLINRLP+P L  +SPF+ L
Sbjct: 624 PEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKL 683

Query: 377 FKKPPDYTSLHVFGCACYHLLRPY 399
           F   P+Y  L VFGCACY  LRPY
Sbjct: 684 FGTSPNYDKLRVFGCACYPWLRPY 702

BLAST of Lag0031657 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 2.4e-82
Identity = 170/403 (42.18%), Postives = 236/403 (58.56%), Query Frame = 0

Query: 4   STSHSSP----GTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQ 63
           STS  +P      L   SP ++N WL D G   H+TSD  NL     Y G +++ + +G 
Sbjct: 286 STSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGS 345

Query: 64  SLPISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDK 123
           ++PI+H G   L   + S  L+ +  VP+I  NL+SV++LC  N     F  +SF ++D 
Sbjct: 346 TIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDL 405

Query: 124 STGKVLFHGPSVNGLY--PLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSV 183
           +TG  L  G + + LY  P+     S   V++ A    KA+   WH  LGHP L+ILNSV
Sbjct: 406 NTGVPLLQGKTKDELYEWPIA----SSQAVSMFASPCSKATHSSWHSRLGHPSLAILNSV 465

Query: 184 LNSSSIPVSRSD--IGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSING 243
           +++ S+PV      +  C  C   K  K PF  S+  S  PLE ++SD W  +P  SI+ 
Sbjct: 466 ISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSIDN 525

Query: 244 HRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSND 303
           +RYYV FVD F+RYTW++P+  KS V   F  F    +N   +R+    SD GGE++   
Sbjct: 526 YRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVV-- 585

Query: 304 LKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEY 363
           L++  +  GI H  S P+TPE NG+ ERKHRHIV M L+LLS +S+P  +W +AF+ A Y
Sbjct: 586 LRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVY 645

Query: 364 LINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           LINRLP+P L  +SPF+ LF +PP+Y  L VFGCACY  LRPY
Sbjct: 646 LINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPY 681

BLAST of Lag0031657 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 1.6e-38
Identity = 104/317 (32.81%), Postives = 160/317 (50.47%), Query Frame = 0

Query: 77  SFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGLYP 136
           +  L ++  VPD+  NL+S   L  D    + F +  + +   S   V+  G +   LY 
Sbjct: 347 TLVLKDVRHVPDLRMNLISGIALDRDGYESY-FANQKWRLTKGSL--VIAKGVARGTLYR 406

Query: 137 LVAKSPSPAQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSR-SDIGVCKH 196
             A+     Q  L A    + S  +WH  +GH     L  +   S I  ++ + +  C +
Sbjct: 407 TNAEI---CQGELNAAQD-EISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDY 466

Query: 197 CLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRYTWIFPM 256
           CL GK  +  F  SS    + L+L++SD  GP   +S+ G++Y+V+F+DD SR  W++ +
Sbjct: 467 CLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIL 526

Query: 257 CYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQKSCPYTP 316
             K  VF +F +F    +     +L   RSD GGEY S + +   ++ GI H+K+ P TP
Sbjct: 527 KTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTP 586

Query: 317 EQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKSPFELLF 376
           + NG+ ER +R IV    S+L  + +P  FW  A  TA YLINR PS  LA + P  +  
Sbjct: 587 QHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWT 646

Query: 377 KKPPDYTSLHVFGCACY 393
            K   Y+ L VFGC  +
Sbjct: 647 NKEVSYSHLKVFGCRAF 656

BLAST of Lag0031657 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 142.9 bits (359), Expect = 7.8e-33
Identity = 117/396 (29.55%), Postives = 190/396 (47.98%), Query Frame = 0

Query: 15  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVG-NGQSLPISHFGPGQLSL 74
           NTS  D+  ++ D G + HL +D +    S        I V   G+ +  +  G  +L  
Sbjct: 280 NTSVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLR- 339

Query: 75  PNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQD------KSTGKVLFH 134
            +   TL ++    + + NL+SV +L  +      FD S  TI        K++G +L +
Sbjct: 340 NDHEITLEDVLFCKEAAGNLMSVKRL-QEAGMSIEFDKSGVTISKNGLMVVKNSG-MLNN 399

Query: 135 GPSVN-GLYPLVAKSPSPAQVTLTAQVGIKASTIVWHDWLGH----PCLSILNSVLNSSS 194
            P +N   Y + AK               K +  +WH+  GH      L I    + S  
Sbjct: 400 VPVINFQAYSINAKH--------------KNNFRLWHERFGHISDGKLLEIKRKNMFSDQ 459

Query: 195 IPVSRSDIG--VCKHCLDGKLSKQPF--LLSSSLSCSPLELLHSDAWGPAPDKSINGHRY 254
             ++  ++   +C+ CL+GK ++ PF  L   +    PL ++HSD  GP    +++   Y
Sbjct: 460 SLLNNLELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNY 519

Query: 255 YVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKN 314
           +V FVD F+ Y   + + YKSDVFS+F  F+  ++   + ++     D G EYLSN+++ 
Sbjct: 520 FVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQ 579

Query: 315 LFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLIN 374
               +GI +  + P+TP+ NG+ ER  R I   A +++S + +   FW  A  TA YLIN
Sbjct: 580 FCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLIN 639

Query: 375 RLPSPNL--AHKSPFELLFKKPPDYTSLHVFGCACY 393
           R+PS  L  + K+P+E+   K P    L VFG   Y
Sbjct: 640 RIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVY 658

BLAST of Lag0031657 vs. ExPASy Swiss-Prot
Match: Q07791 (Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-DR3 PE=3 SV=1)

HSP 1 Score: 118.2 bits (295), Expect = 2.1e-25
Identity = 103/378 (27.25%), Postives = 168/378 (44.44%), Query Frame = 0

Query: 8   SSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFG 67
           S P   ++++    +  L D G +  L      L  +T  N E NI     Q +PI+  G
Sbjct: 438 SKPTRTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATP-NSEINIVDAQKQDIPINAIG 497

Query: 68  PGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDN-NCCFIFDSSSFTIQDKSTGKVLF 127
               +  N + T       P+I+ +LLS+ +L   N   CF  ++      ++S G VL 
Sbjct: 498 NLHFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRNT-----LERSDGTVLA 557

Query: 128 HGPSVNGLYPLVAKSPSPAQVT-LTAQVGIKASTI------VWHDWLGHPCL-SILNSV- 187
                   Y L  K   P+ ++ LT     K+ ++      + H  LGH    SI  S+ 
Sbjct: 558 PIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLK 617

Query: 188 ------LNSSSIPVSRSDIGVCKHCLDGKLSKQPFLLSSSL----SCSPLELLHSDAWGP 247
                 L  S I  S +    C  CL GK +K   +  S L    S  P + LH+D +GP
Sbjct: 618 KNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHIKGSRLKYQESYEPFQYLHTDIFGP 677

Query: 248 APDKSINGHRYYVSFVDDFSRYTWIFPMCYKSD--VFSIFSQFLPFAKNLLSSRLNVFRS 307
                 +   Y++SF D+ +R+ W++P+  + +  + ++F+  L F KN  ++R+ V + 
Sbjct: 678 VHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQM 737

Query: 308 DGGGEYLSNDLKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRF 364
           D G EY +  L   F N+GI    +       +G+ ER +R ++N   +LL  S +P   
Sbjct: 738 DRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHL 797

BLAST of Lag0031657 vs. ExPASy TrEMBL
Match: A0A2N9GRJ0 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS30097 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 7.4e-103
Identity = 188/395 (47.59%), Postives = 260/395 (65.82%), Query Frame = 0

Query: 15  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLP 74
           N +PS +  W+SD G   H T DL NL     Y G + +++GNG  LPI+H G  QL   
Sbjct: 377 NQAPS-TTTWVSDTGATDHFTPDLTNLNNPMDYPGSDQVSIGNGTGLPITHIGHSQLKAS 436

Query: 75  NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGL 134
           +  F L  + RVP + TNLLSV++ C DN C F FD++ F+IQD  +G+ L+ G S +GL
Sbjct: 437 SHIFNLRKILRVPCMKTNLLSVNKFCCDNACSFYFDANKFSIQDIFSGRTLYKGSSKDGL 496

Query: 135 YPLVAKSPSPAQVT--------LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSS---SI 194
           YP++  S S    T         +A +G K +  VWH  LGHP   +L+SVLN     S+
Sbjct: 497 YPILGLSSSQRHSTPCHSSTPPNSAFLGTKGTKSVWHSRLGHPQDCVLHSVLNKQPWLSV 556

Query: 195 PVSRSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFV 254
             ++     C HC+ GKL + PF  SS  + +PLEL+HSD WGPAP  SING R+YVSFV
Sbjct: 557 NTAKFSSDCCTHCVQGKLHQFPFPSSSFTATAPLELVHSDVWGPAPVTSINGTRFYVSFV 616

Query: 255 DDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQ 314
           D F+R+TW+FP+ +KS V + F  F    +N+L++R+ V R+D GGEY ++  ++  + +
Sbjct: 617 DHFTRFTWLFPIKHKSQVLATFQHFTATMENILNTRIKVLRTDCGGEYTNSAFESFCSTR 676

Query: 315 GILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSP 374
           GILHQ SCP+TP+QNG+ ERKHRHIV  AL+L+S+SS+P+++W +AF+TA YLINR+P+P
Sbjct: 677 GILHQFSCPHTPQQNGVAERKHRHIVETALTLISESSLPLQYWPYAFSTAIYLINRMPTP 736

Query: 375 NLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           NL   SP++LLF   PDY+ L  FGC C+ LLRPY
Sbjct: 737 NLKFTSPWQLLFHTNPDYSFLKTFGCLCFPLLRPY 770

BLAST of Lag0031657 vs. ExPASy TrEMBL
Match: A0A2N9FMC6 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS16294 PE=4 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 7.4e-103
Identity = 188/395 (47.59%), Postives = 260/395 (65.82%), Query Frame = 0

Query: 15  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLP 74
           N +PS +  W+SD G   H T DL NL     Y G + +++GNG  LPI+H G  QL   
Sbjct: 204 NQAPS-TTTWVSDTGATDHFTPDLTNLNNPMDYPGSDQVSIGNGTGLPITHIGHSQLKAS 263

Query: 75  NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGL 134
           +  F L  + RVP + TNLLSV++ C DN C F FD++ F+IQD  +G+ L+ G S +GL
Sbjct: 264 SHIFNLRKILRVPCMKTNLLSVNKFCCDNACSFYFDANKFSIQDIFSGRTLYKGSSKDGL 323

Query: 135 YPLVAKSPSPAQVT--------LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSS---SI 194
           YP++  S S    T         +A +G K +  VWH  LGHP   +L+SVLN     S+
Sbjct: 324 YPILGLSSSQRHSTPCHSSTPPNSAFLGTKGTKSVWHSRLGHPQDCVLHSVLNKQPWLSV 383

Query: 195 PVSRSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFV 254
             ++     C HC+ GKL + PF  SS  + +PLEL+HSD WGPAP  SING R+YVSFV
Sbjct: 384 NTAKFSSDCCTHCVQGKLHQFPFPSSSFTATAPLELVHSDVWGPAPVTSINGTRFYVSFV 443

Query: 255 DDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQ 314
           D F+R+TW+FP+ +KS V + F  F    +N+L++R+ V R+D GGEY ++  ++  + +
Sbjct: 444 DHFTRFTWLFPIKHKSQVLATFQHFTATMENILNTRIKVLRTDCGGEYTNSAFESFCSTR 503

Query: 315 GILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSP 374
           GILHQ SCP+TP+QNG+ ERKHRHIV  AL+L+S+SS+P+++W +AF+TA YLINR+P+P
Sbjct: 504 GILHQFSCPHTPQQNGVAERKHRHIVETALTLISESSLPLQYWPYAFSTAIYLINRMPTP 563

Query: 375 NLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           NL   SP++LLF   PDY+ L  FGC C+ LLRPY
Sbjct: 564 NLKFTSPWQLLFHTNPDYSFLKTFGCLCFPLLRPY 597

BLAST of Lag0031657 vs. ExPASy TrEMBL
Match: A0A2N9G7E3 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS23257 PE=4 SV=1)

HSP 1 Score: 383.3 bits (983), Expect = 1.3e-102
Identity = 197/402 (49.00%), Postives = 260/402 (64.68%), Query Frame = 0

Query: 15  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLP 74
           +++ S SN W+SD G   H T DLANL  +  YNG + +TVGNGQ LPI+H G  QL   
Sbjct: 334 SSNASSSNCWVSDTGATDHFTPDLANLQQARDYNGNDAVTVGNGQQLPITHIGNSQLRAS 393

Query: 75  NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGL 134
                L    RVP++ TNLLSV + C DNNCCF FD+S F+IQD  +GKVL+ G +  GL
Sbjct: 394 KHILHLRQALRVPNMKTNLLSVFKCCKDNNCCFHFDASKFSIQDIPSGKVLYKGFNEAGL 453

Query: 135 YPLV--------AKSPSPA-------QVTLTAQVGIKASTIVWHDWLGHPCLSILNSV-- 194
           YP+          ++P P            +A    K S+  WH  LGHP   IL SV  
Sbjct: 454 YPIYGDPFHSTKVQTPCPTFTKSALPSFHKSAYTVTKVSSSTWHSRLGHPNSKILQSVFK 513

Query: 195 -LNSSSIPVSRSDIGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGH 254
            L +S I  S S+   CKHC  GK+S+ PF  S + +  PL+L+HSD WGPAP  SING 
Sbjct: 514 HLPTSPIDSSSSN-SFCKHCTLGKMSQLPFSHSCTHATEPLQLVHSDVWGPAPITSINGT 573

Query: 255 RYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDL 314
           RYYVSF+DDFS++TW FP+ +KS V S F  F    +NLL+ +L V R+D GGEY  +  
Sbjct: 574 RYYVSFIDDFSKFTWFFPLKHKSQVLSTFVHFKSTLENLLNYKLKVLRTDCGGEYTDSAF 633

Query: 315 KNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYL 374
           ++  ++QGI HQ SCP+TP+QNG+ ERKHRHI+  AL+L+S+SS+P+ +W +AFA++ +L
Sbjct: 634 QHYCSSQGIFHQFSCPHTPQQNGVAERKHRHIIETALTLISQSSLPLSYWPYAFASSIFL 693

Query: 375 INRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           INRLP+ +L  KSP+E+LF  PPDY+   VFGC+CY LL PY
Sbjct: 694 INRLPTVSLHLKSPWEVLFHTPPDYSFFKVFGCSCYPLLTPY 734

BLAST of Lag0031657 vs. ExPASy TrEMBL
Match: A0A2N9IEP2 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50263 PE=4 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 1.8e-101
Identity = 182/389 (46.79%), Postives = 254/389 (65.30%), Query Frame = 0

Query: 15  NTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLPISHFGPGQLSLP 74
           N+  SD + W+SD G   H T DL+ +     Y G +  TVGNGQ++PI+H G  QL   
Sbjct: 274 NSQHSDQSYWISDTGATDHFTPDLSTIPDHQEYTGTDLATVGNGQAIPITHIGNSQLKAS 333

Query: 75  NASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTGKVLFHGPSVNGL 134
           +  F L  + RVP +++NLLSV++ C DNNCCF+FD++ F I+D  TGK+L+ GPS NGL
Sbjct: 334 SHLFHLRKVLRVPSMASNLLSVNKFCRDNNCCFLFDANQFKIKDMPTGKLLYRGPSKNGL 393

Query: 135 YPLVAKS-PSPAQVT--LTAQVGIKASTIVWHDWLGHPCLSILNSVLNSSSIPVSRSD-- 194
           YP+   S P P   +   + Q     S+ VWHD LGHP   +   + ++S +  S S+  
Sbjct: 394 YPIDGVSLPPPCHTSNFSSIQSTKSVSSKVWHDRLGHPNSQVQQRIFSNSPVHNSSSNKT 453

Query: 195 IGVCKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSINGHRYYVSFVDDFSRY 254
              C HC+ GK++  PF  S S +C PLE++HSD WGP+P  S  G R+YV FVD+F+R+
Sbjct: 454 ESACTHCIQGKMTHLPFHKSVSKACKPLEIIHSDVWGPSPITSDGGTRFYVIFVDEFTRF 513

Query: 255 TWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSNDLKNLFANQGILHQK 314
           TW +P+  KS V S F  F    +NLL+ ++ + R+D GGEY SN+  +   + GI HQ 
Sbjct: 514 TWFYPIRNKSQVLSCFVSFSNTMQNLLNHKIKILRTDCGGEYASNEFHSFCISHGITHQY 573

Query: 315 SCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEYLINRLPSPNLAHKS 374
           +CP+T +QNG+ ERKHRHIV++AL+L+S+SS+P+ FW +AF+TA YLINR+P  N    S
Sbjct: 574 TCPHTSQQNGLAERKHRHIVDIALTLISQSSLPLSFWPYAFSTAVYLINRVPPSNSKTSS 633

Query: 375 PFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           P+ELLF + P+Y SL  FGC CY L+RPY
Sbjct: 634 PWELLFHRQPNYASLRTFGCLCYPLMRPY 662

BLAST of Lag0031657 vs. ExPASy TrEMBL
Match: A0A2N9I765 (Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS49729 PE=3 SV=1)

HSP 1 Score: 377.9 bits (969), Expect = 5.3e-101
Identity = 199/403 (49.38%), Postives = 256/403 (63.52%), Query Frame = 0

Query: 3   ASTSHSSPGTLLNTSPSDSNVWLSDIGCNAHLTSDLANLGISTAYNGEENITVGNGQSLP 62
           ASTS+ + G          + WL+D G   HLT+++ NL + T Y G + + VGNGQS+P
Sbjct: 18  ASTSNGAQG---------GDTWLTDTGATDHLTANMNNLQVQTPYKGTDQVAVGNGQSIP 77

Query: 63  ISHFGPGQLSLPNASFTLSNLFRVPDISTNLLSVHQLCIDNNCCFIFDSSSFTIQDKSTG 122
           I++ G GQL+     F L+NL     IS+NLLSVH+LC DN C   FDS+ F IQD  +G
Sbjct: 78  INNIGHGQLNTSFYKFRLNNLLHSSKISSNLLSVHKLCKDNTCSCYFDSNKFLIQDLHSG 137

Query: 123 KVLFHGPSVNGLYPLVAKSPSP----AQVTLTAQVGIKASTIVWHDWLGHPCLSILNSVL 182
           KVL+ G S NGLYP +   PSP    A  T++A +  K    +WH  LGHP   +L S L
Sbjct: 138 KVLYKGLSSNGLYP-IHTQPSPSFTTASPTVSAFLSSKNKWQLWHHRLGHPSDRVLVSTL 197

Query: 183 NSSSIPVSRSDIGV---CKHCLDGKLSKQPFLLSSSLSCSPLELLHSDAWGPAPDKSING 242
            S S  +S  +  V   CKHCL GK+ K PF  S   S  PLEL+HSD WGPAP  S NG
Sbjct: 198 PSLSSCISVRNKHVQHHCKHCLIGKMHKLPFAHSQFQSTQPLELVHSDVWGPAPVSSSNG 257

Query: 243 HRYYVSFVDDFSRYTWIFPMCYKSDVFSIFSQFLPFAKNLLSSRLNVFRSDGGGEYLSND 302
           ++YY+ FVDDFS+Y+W+F +  KSDV + F  F    +  LS+++   R+D GGEY SN 
Sbjct: 258 YKYYLLFVDDFSKYSWLFLLKQKSDVLATFKHFKATVETQLSAQIKFLRTDCGGEYTSNA 317

Query: 303 LKNLFANQGILHQKSCPYTPEQNGIDERKHRHIVNMALSLLSKSSIPMRFWFFAFATAEY 362
             +  ++ GI HQ SCP+TP+QNGI ERKHRHI+  AL+LLS +S+P   W +A  TA +
Sbjct: 318 FTDFCSSHGITHQFSCPHTPQQNGIVERKHRHIIECALTLLSHASLPTTHWTYAVTTAIH 377

Query: 363 LINRLPSPNLAHKSPFELLFKKPPDYTSLHVFGCACYHLLRPY 399
           LINRLPSP+L+HKSP+E LF K PD T L  FGC CY  LRPY
Sbjct: 378 LINRLPSPHLSHKSPWEHLFHKAPDITHLRTFGCLCYPYLRPY 410

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RWR76373.14.2e-10046.44putative polyprotein [Cinnamomum micranthum f. kanehirae][more]
TQE09310.11.3e-9846.85hypothetical protein C1H46_005046 [Malus baccata][more]
KAB2610253.14.3e-9744.10hypothetical protein D8674_018285 [Pyrus ussuriensis x Pyrus communis][more]
KAA8524269.11.6e-9645.87hypothetical protein F0562_010692 [Nyssa sinensis][more]
PKU75882.18.2e-9645.52Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatu... [more]
Match NameE-valueIdentityDescription
Q94HW24.8e-8344.53Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT942.4e-8242.18Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109781.6e-3832.81Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041467.8e-3329.55Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q077912.1e-2527.25Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ... [more]
Match NameE-valueIdentityDescription
A0A2N9GRJ07.4e-10347.59Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS30097 PE=4 SV=1[more]
A0A2N9FMC67.4e-10347.59Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
A0A2N9G7E31.3e-10249.00Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
A0A2N9IEP21.8e-10146.79Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS50263 PE=4 SV=1[more]
A0A2N9I7655.3e-10149.38Integrase catalytic domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 133..200
e-value: 7.6E-9
score: 35.3
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 209..386
e-value: 3.8E-35
score: 123.0
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 159..398
NoneNo IPR availablePANTHERPTHR11439:SF324RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATEDcoord: 159..398
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 212..378
score: 20.668083
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 216..387

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0031657.1Lag0031657.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding