Lag0027536 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0027536
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Location: chr8: 1827956 .. 1830226 (+)
RNA-Seq Expression: Lag0027536
Synteny: Lag0027536
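As a quick consistency check on the location given above, the span of the feature can be computed from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (an assumption; the page does not state its coordinate convention). The helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split a string like 'chr8: 1827956 .. 1830226 (+)' into its parts."""
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates.
    return end - start + 1

chrom, start, end, strand = parse_location("chr8: 1827956 .. 1830226 (+)")
length = span_length(start, end)
print(chrom, strand, length, length % 3 == 0)
```

Under that assumption the span is 2271 bp, which is a multiple of three (757 codons, counting a stop codon).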
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACACCTGAAATTGCCACACAAGTCATGGGATTCGACAATGCGAAAGATCTCTGGAGTGCTATTCAGAGTTTATTTGGTATTCAATCAAGAGCAGAGGAAGATTTCCTCAGACAAACCTTTCAACAATCTAGAAAAGGTACGTCCAAAATGACTGATTATTTACGATTAATGAAGTCTCATGCCGATAATCTAGGGCAAGCAGGAAGTCCAGTTTCGACAAGGAACTTAGTATCTCAAGTATTGCTCGGACTCGATGAGGAGTATAATCCGATTGTAGCCATGATCCATGGAAGGGGAGACATCTCGTGGTCTGAAATGCAGGCCGAACTCCTTGTGTTTGAGAAGAGATTGGAACTACAGAATACTCAAAAAGCCGTTGTCTCCTTTAATCACACTCCCACCGTCAATGTGGCAAATAACAAGAACAACATGAATCAAAACAACAATAGAGGCTGGAATTACAGTCACAGTAATGGCCAGAGAGGACAGTTCTATAACAACAATCAACGTGGGGGTTCAAATTTTAACAATGGCAGGGGACGAGGAGGCCGTGGCAGAGGATATGGAGGATATGGCAACTCAAATAATCGCCAAGTTTGCCAAGTATGTGGAAGACCTGGTCATTCGGCACTTATGTGTTATCACAGATTTGATAAAGAGTTCAGCCCAAATGTGAACAGAGGTGGCAATCAGAATCCAAATAACTCAGGTAACTCAGGGAACACTCAGCCACCGTCTGCTTTTGTGGCCAACTCAAACAGTCAATATGCTTGTCCTGAGACAGTAATAGACTCCAACTGGTACGCTGACAGTGGAGCTTCGAATCATGTCACCGGAGACTTCAACAATCTTGCTAATACCAAGGAATATGGAGGTAATGAACAAGTGGTCATAGGTAATGGAGAATCTCTCCCTATTACTTTCACTGGAGATACTTATTTATCTAATGGTGCTGCTATTCTTAGTCTCAATAACGTTTTGTGTGTTCCTGAAATAACTAAAAACCTAGTTAGTGTATCAAAACTAGCTCAGGACAATGACGTTTTCATTGAATTTCATGGTGATTGTTGTATTATTAAGGACAAGCGTTCGGGTCAGGAGGTGCTGAAAGGAGTACTTAGGGACGGTCTCTACCAGCTTAACAATGTCACGAGGGTACCAGGAGTGAATGAAGGATGTTCTGAGTCAATTTCCAAGAATTCTACGGCCAATAATCATTCCTCAGTTTTTGTTGTTTCTCGTTATCCACTGAGTGTTAATATTATTGTGTCTAAGAATGTATGGCACAAACGTCTGGGTCATCCAGCATCTCGGGTTTTAGATTTTGTTATCAATGATTGTAAGCTTCAAGTTAAAGAGAATGAGATGCTCAGTTTTTGCGAGTCATGTCAATTTGGCAAGTCACACAATTTACCTTTCCCTCTATCTTCAAGTCGGGCAAAGTATCCATTTGAATTGATTCACTCGGACCTTTGGGGTCCTGCTCCGGTCTTGTCTACTGATGGCTTTCAATATTATGTTTTATTTCTAGATGATTACAGCAGATATCTATGGCTTTATCCATTGAAAAAGAAAAATGATGCGCTTGCTGCCTTTCACCACTTTATCTCTATGGTCAGGAATCAGTTTGGTTGTCAAATAAAGATTCTTCAGTCTGACAATGGTGGCGAATACGCTAGGATTCATCAGGAATGTTATCGACTTGGTATTATCTCTCGATATTCTTGTCCCTACACGTTTGCACAAAATGGAAGAGCAGAACGGAAGCATAGACACGTTGTTGAAACTGGTCTGACATTGCTTGCTCAGGCGTCAATGCCTCTTCAGTATTGGTGGGATGCGTTTTTAGCGGCTGTCCAGCTGATAAATGGTTTACCAACCTCAGTTCTTGAAGGTAAGTCACCATTGGAAGTGTTACATCACAAGAAACTTAATTTTGCAGGTCTACGCTCATTTGGATGTGCCTGTTATCCATGCCTGAGGCCTTACCATAACCA
CAAATTTCAGTTTCACTCCGAGAGGTGCGTTTATCTTGGCTTCAGCCCCTCTCATAAAGGACATAAATGCCTTAGTGCTTCTGGTCGTGTATTTATTTCTCGACATGTGCAGTTTAATGAACTCATGTTTCCATTTGCACTTGATTTTGGAAAACCCTCAAGCTCCCCAACATTTTCACCCTCTCATGGTCCATCTATCTTAACCTGGTTTCAATCTCTAGAACACAGTCACTTATCCCAAGAAAATACCAACAGGCAATTGAAATATTGA

mRNA sequence

ATGACACCTGAAATTGCCACACAAGTCATGGGATTCGACAATGCGAAAGATCTCTGGAGTGCTATTCAGAGTTTATTTGGTATTCAATCAAGAGCAGAGGAAGATTTCCTCAGACAAACCTTTCAACAATCTAGAAAAGGTACGTCCAAAATGACTGATTATTTACGATTAATGAAGTCTCATGCCGATAATCTAGGGCAAGCAGGAAGTCCAGTTTCGACAAGGAACTTAGTATCTCAAGTATTGCTCGGACTCGATGAGGAGTATAATCCGATTGTAGCCATGATCCATGGAAGGGGAGACATCTCGTGGTCTGAAATGCAGGCCGAACTCCTTGTGTTTGAGAAGAGATTGGAACTACAGAATACTCAAAAAGCCGTTGTCTCCTTTAATCACACTCCCACCGTCAATGTGGCAAATAACAAGAACAACATGAATCAAAACAACAATAGAGGCTGGAATTACAGTCACAGTAATGGCCAGAGAGGACAGTTCTATAACAACAATCAACGTGGGGGTTCAAATTTTAACAATGGCAGGGGACGAGGAGGCCGTGGCAGAGGATATGGAGGATATGGCAACTCAAATAATCGCCAAGTTTGCCAAGTATGTGGAAGACCTGGTCATTCGGCACTTATGTGTTATCACAGATTTGATAAAGAGTTCAGCCCAAATGTGAACAGAGGTGGCAATCAGAATCCAAATAACTCAGGTAACTCAGGGAACACTCAGCCACCGTCTGCTTTTGTGGCCAACTCAAACAGTCAATATGCTTGTCCTGAGACAGTAATAGACTCCAACTGGTACGCTGACAGTGGAGCTTCGAATCATGTCACCGGAGACTTCAACAATCTTGCTAATACCAAGGAATATGGAGGTAATGAACAAGTGGTCATAGGTAATGGAGAATCTCTCCCTATTACTTTCACTGGAGATACTTATTTATCTAATGGTGCTGCTATTCTTAGTCTCAATAACGTTTTGTGTGTTCCTGAAATAACTAAAAACCTAGTTAGTGTATCAAAACTAGCTCAGGACAATGACGTTTTCATTGAATTTCATGGTGATTGTTGTATTATTAAGGACAAGCGTTCGGGTCAGGAGGTGCTGAAAGGAGTACTTAGGGACGGTCTCTACCAGCTTAACAATGTCACGAGGGTACCAGGAGTGAATGAAGGATGTTCTGAGTCAATTTCCAAGAATTCTACGGCCAATAATCATTCCTCAGTTTTTGTTGTTTCTCGTTATCCACTGAGTGTTAATATTATTGTGTCTAAGAATGTATGGCACAAACGTCTGGGTCATCCAGCATCTCGGGTTTTAGATTTTGTTATCAATGATTGTAAGCTTCAAGTTAAAGAGAATGAGATGCTCAGTTTTTGCGAGTCATGTCAATTTGGCAAGTCACACAATTTACCTTTCCCTCTATCTTCAAGTCGGGCAAAGTATCCATTTGAATTGATTCACTCGGACCTTTGGGGTCCTGCTCCGGTCTTGTCTACTGATGGCTTTCAATATTATGTTTTATTTCTAGATGATTACAGCAGATATCTATGGCTTTATCCATTGAAAAAGAAAAATGATGCGCTTGCTGCCTTTCACCACTTTATCTCTATGGTCAGGAATCAGTTTGGTTGTCAAATAAAGATTCTTCAGTCTGACAATGGTGGCGAATACGCTAGGATTCATCAGGAATGTTATCGACTTGGTATTATCTCTCGATATTCTTGTCCCTACACGTTTGCACAAAATGGAAGAGCAGAACGGAAGCATAGACACGTTGTTGAAACTGGTCTGACATTGCTTGCTCAGGCGTCAATGCCTCTTCAGTATTGGTGGGATGCGTTTTTAGCGGCTGTCCAGCTGATAAATGGTTTACCAACCTCAGTTCTTGAAGGTAAGTCACCATTGGAAGTGTTACATCACAAGAAACTTAATTTTGCAGGTCTACGCTCATTTGGATGTGCCTGTTATCCATGCCTGAGGCCTTACCATAACCA
CAAATTTCAGTTTCACTCCGAGAGGTGCGTTTATCTTGGCTTCAGCCCCTCTCATAAAGGACATAAATGCCTTAGTGCTTCTGGTCGTGTATTTATTTCTCGACATGTGCAGTTTAATGAACTCATGTTTCCATTTGCACTTGATTTTGGAAAACCCTCAAGCTCCCCAACATTTTCACCCTCTCATGGTCCATCTATCTTAACCTGGTTTCAATCTCTAGAACACAGTCACTTATCCCAAGAAAATACCAACAGGCAATTGAAATATTGA

Coding sequence (CDS)

ATGACACCTGAAATTGCCACACAAGTCATGGGATTCGACAATGCGAAAGATCTCTGGAGTGCTATTCAGAGTTTATTTGGTATTCAATCAAGAGCAGAGGAAGATTTCCTCAGACAAACCTTTCAACAATCTAGAAAAGGTACGTCCAAAATGACTGATTATTTACGATTAATGAAGTCTCATGCCGATAATCTAGGGCAAGCAGGAAGTCCAGTTTCGACAAGGAACTTAGTATCTCAAGTATTGCTCGGACTCGATGAGGAGTATAATCCGATTGTAGCCATGATCCATGGAAGGGGAGACATCTCGTGGTCTGAAATGCAGGCCGAACTCCTTGTGTTTGAGAAGAGATTGGAACTACAGAATACTCAAAAAGCCGTTGTCTCCTTTAATCACACTCCCACCGTCAATGTGGCAAATAACAAGAACAACATGAATCAAAACAACAATAGAGGCTGGAATTACAGTCACAGTAATGGCCAGAGAGGACAGTTCTATAACAACAATCAACGTGGGGGTTCAAATTTTAACAATGGCAGGGGACGAGGAGGCCGTGGCAGAGGATATGGAGGATATGGCAACTCAAATAATCGCCAAGTTTGCCAAGTATGTGGAAGACCTGGTCATTCGGCACTTATGTGTTATCACAGATTTGATAAAGAGTTCAGCCCAAATGTGAACAGAGGTGGCAATCAGAATCCAAATAACTCAGGTAACTCAGGGAACACTCAGCCACCGTCTGCTTTTGTGGCCAACTCAAACAGTCAATATGCTTGTCCTGAGACAGTAATAGACTCCAACTGGTACGCTGACAGTGGAGCTTCGAATCATGTCACCGGAGACTTCAACAATCTTGCTAATACCAAGGAATATGGAGGTAATGAACAAGTGGTCATAGGTAATGGAGAATCTCTCCCTATTACTTTCACTGGAGATACTTATTTATCTAATGGTGCTGCTATTCTTAGTCTCAATAACGTTTTGTGTGTTCCTGAAATAACTAAAAACCTAGTTAGTGTATCAAAACTAGCTCAGGACAATGACGTTTTCATTGAATTTCATGGTGATTGTTGTATTATTAAGGACAAGCGTTCGGGTCAGGAGGTGCTGAAAGGAGTACTTAGGGACGGTCTCTACCAGCTTAACAATGTCACGAGGGTACCAGGAGTGAATGAAGGATGTTCTGAGTCAATTTCCAAGAATTCTACGGCCAATAATCATTCCTCAGTTTTTGTTGTTTCTCGTTATCCACTGAGTGTTAATATTATTGTGTCTAAGAATGTATGGCACAAACGTCTGGGTCATCCAGCATCTCGGGTTTTAGATTTTGTTATCAATGATTGTAAGCTTCAAGTTAAAGAGAATGAGATGCTCAGTTTTTGCGAGTCATGTCAATTTGGCAAGTCACACAATTTACCTTTCCCTCTATCTTCAAGTCGGGCAAAGTATCCATTTGAATTGATTCACTCGGACCTTTGGGGTCCTGCTCCGGTCTTGTCTACTGATGGCTTTCAATATTATGTTTTATTTCTAGATGATTACAGCAGATATCTATGGCTTTATCCATTGAAAAAGAAAAATGATGCGCTTGCTGCCTTTCACCACTTTATCTCTATGGTCAGGAATCAGTTTGGTTGTCAAATAAAGATTCTTCAGTCTGACAATGGTGGCGAATACGCTAGGATTCATCAGGAATGTTATCGACTTGGTATTATCTCTCGATATTCTTGTCCCTACACGTTTGCACAAAATGGAAGAGCAGAACGGAAGCATAGACACGTTGTTGAAACTGGTCTGACATTGCTTGCTCAGGCGTCAATGCCTCTTCAGTATTGGTGGGATGCGTTTTTAGCGGCTGTCCAGCTGATAAATGGTTTACCAACCTCAGTTCTTGAAGGTAAGTCACCATTGGAAGTGTTACATCACAAGAAACTTAATTTTGCAGGTCTACGCTCATTTGGATGTGCCTGTTATCCATGCCTGAGGCCTTACCATAACCA
CAAATTTCAGTTTCACTCCGAGAGGTGCGTTTATCTTGGCTTCAGCCCCTCTCATAAAGGACATAAATGCCTTAGTGCTTCTGGTCGTGTATTTATTTCTCGACATGTGCAGTTTAATGAACTCATGTTTCCATTTGCACTTGATTTTGGAAAACCCTCAAGCTCCCCAACATTTTCACCCTCTCATGGTCCATCTATCTTAACCTGGTTTCAATCTCTAGAACACAGTCACTTATCCCAAGAAAATACCAACAGGCAATTGAAATATTGA

Protein sequence

MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLELQNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPFALDFGKPSSSPTFSPSHGPSILTWFQSLEHSHLSQENTNRQLKY
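The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used as input here.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove whitespace so a sequence wrapped over several lines still works.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGACACCTGAAATTGCC"))  # first six codons of the CDS
print(translate("CAATTGAAATATTGA"))     # last five codons, including the stop
```

Passing the full CDS (both wrapped lines concatenated) to `translate` should reproduce the protein sequence listed above if the standard-code assumption holds.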
Homology
BLAST of Lag0027536 vs. NCBI nr
Match: KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 618.6 bits (1594), Expect = 7.0e-173
Identity = 344/761 (45.20%), Positives = 467/761 (61.37%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT E+ATQ++  + ++ +W   QSL G  +R+   FL+  F ++RKG  KM +YL  MK 
Sbjct: 1   MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            AD+L  AGS VST +LV+Q L GLD EYNPIV  +  +  ++W EMQA+LL +E RLE 
Sbjct: 61  IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
            N Q  +       T+N ++N + +   N RG + +   G+ GQ             N  
Sbjct: 121 INNQSNL-------TLNPSSNISTI-LYNRRGKSNAFGGGRGGQI------------NRG 180

Query: 181 GRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNS 240
            RGGRGRG      + +R VCQVC +PGH+A  CYHRF+K +   + +  ++  +     
Sbjct: 181 ARGGRGRGRA----TKDRIVCQVCCKPGHAASHCYHRFNKNY---IGQNSDEQKSEKDKE 240

Query: 241 GNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG 300
            N         N N+  A P TV D +WY DSGASNHVT D N +    E  G   + +G
Sbjct: 241 QN--------YNFNAYVASPSTVEDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVG 300

Query: 301 NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCII 360
           NG +L I   GD+ L      L+L ++L VP+ITKNL+S+SKL  DND+++EFH   C +
Sbjct: 301 NGANLKIIACGDSSLDTQQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFV 360

Query: 361 KDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSV 420
           KDK +G+ +L+G ++DGLYQL      PG           +++ N    VF         
Sbjct: 361 KDKLTGRILLEGKIKDGLYQL------PG----------GSTSTNKRPHVF--------- 420

Query: 421 NIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSR 480
                K  WH++LGHP S+VL+ V+  C ++    E   FCE+CQFGK+HNLPF  S S 
Sbjct: 421 --FSIKETWHRKLGHPNSKVLNEVMKLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSC 480

Query: 481 AKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMV 540
           AK P +L+HSD+WGPAP+ S  GF+YYVLFLDD+SR+ W+YPLK+K+D   AF  F ++V
Sbjct: 481 AKEPLDLVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLV 540

Query: 541 RNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT 600
            NQF  +IK LQ D GGE+  + +   + GI  R SCPYT AQNGRAERKHRHVVE+GLT
Sbjct: 541 ENQFNKRIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLT 600

Query: 601 LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPC 660
           LLAQA MPL YWW+AF  AV LIN LPT V++ KSP + L  K  ++  +++FGCACYPC
Sbjct: 601 LLAQAKMPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMKTFGCACYPC 660

Query: 661 LRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF---ALDFG 720
           L+PY+ HK QFH+ +CV+LG+S SHKG+KCL+++GR+FISRHV FNE  FPF    L+  
Sbjct: 661 LKPYNQHKLQFHTTKCVFLGYSGSHKGYKCLNSTGRIFISRHVVFNEHHFPFHDGFLNTR 698

Query: 721 KPS---SSPT-----FSPSHGPSILTWFQSLEHSHLSQENT 751
           KP+   + PT      SP+ G ++    Q L  ++ S  NT
Sbjct: 721 KPAEIITDPTSLLFPISPT-GSNVANEEQRLHTNNNSSSNT 698
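The Identity and Positives percentages in the HSP header for this hit are ratios of matching (or positively scoring) columns to the alignment length. A minimal sketch that recomputes them from the counts reported above; `percent` is a hypothetical helper name, and the two-decimal rounding is assumed from the way the values are displayed.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage, rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)

identity = percent(344, 761)   # identical columns / alignment length
positives = percent(467, 761)  # positively scoring columns / alignment length
print(identity, positives)
```

The same calculation applies to the other hits listed on this page, using the counts given in their own HSP headers.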

BLAST of Lag0027536 vs. NCBI nr
Match: GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])

HSP 1 Score: 614.0 bits (1582), Expect = 1.7e-171
Identity = 325/712 (45.65%), Positives = 441/712 (61.94%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT EIATQ++  + +K LW   QSL G  +R++  +L+  F   RKG  KM DYL  MK+
Sbjct: 88  MTTEIATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKN 147

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
             D L  AG+PVST +L+ Q L GLD EYNP+V  +  +  +SW ++QA+LL FE R+E 
Sbjct: 148 LVDKLKLAGNPVSTSDLIIQTLNGLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFESRIEQ 207

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
            N    + +     T NVAN  ++  +++N  W  S+S G                    
Sbjct: 208 LNN---LTNLTLNATANVANRSDHRGKSSNNNWRGSNSRG-------------------- 267

Query: 181 GRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNS 240
            RGGRGRG  G      +  CQVCG   H A+ C+HRFDK +S           N+S   
Sbjct: 268 WRGGRGRGKSG------KNPCQVCGLSNHIAIDCFHRFDKTYS---------RSNHSAGH 327

Query: 241 GNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG 300
                 +AF+A+ NS       V D +WY DSGASNHVT       +  E+ G   +V+G
Sbjct: 328 DKQGSHNAFLASQNS-------VEDYDWYFDSGASNHVTHQTEKFQDLTEHHGKNSLVVG 387

Query: 301 NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCII 360
           NGE L I  TG + L +    L+L+++L VP ITKNL+SVSKLA DN++ +EF  +CC +
Sbjct: 388 NGEKLAILATGSSKLKS----LNLHDILYVPNITKNLLSVSKLAADNNILVEFDENCCFV 447

Query: 361 KDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSV 420
           KDK +G+ +LKG+L+DGLYQL+   R P                    S FV      SV
Sbjct: 448 KDKLTGKVILKGLLKDGLYQLSGTKRNP--------------------SAFV------SV 507

Query: 421 NIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSR 480
                K  WH+RLGHP ++VLD V+  CK++V  ++  SFCE+CQ+GK H LPF  SSS 
Sbjct: 508 -----KESWHRRLGHPNNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSH 567

Query: 481 AKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMV 540
           A+ P EL+H+D+WGPAP++++ GF+YYV F+DD+SR+ W+YPLK+K++ + AF  F ++ 
Sbjct: 568 AQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLT 627

Query: 541 RNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT 600
            NQF  +IK++Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Sbjct: 628 ENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLT 687

Query: 601 LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPC 660
           LLAQA MPL YWW+AF  AV LIN LP+ V + +SP  ++  K+ ++  L++FGCACYPC
Sbjct: 688 LLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLKTFGCACYPC 719

Query: 661 LRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF 713
           L+PY+ HK Q+H+ RCV+LG+S SHKG+KCL++ GR+FISRHV FNE  FPF
Sbjct: 748 LKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPF 719

BLAST of Lag0027536 vs. NCBI nr
Match: PNX76291.1 (gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense])

HSP 1 Score: 612.5 bits (1578), Expect = 5.1e-171
Identity = 329/731 (45.01%), Positives = 455/731 (62.24%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT  IATQ++  + +  LW   QSL G  +R++  +L+  F  +RKG  KM DYL  MK+
Sbjct: 88  MTVGIATQLLHCETSMQLWDEAQSLAGAHTRSQITYLKSEFHSTRKGEMKMEDYLIKMKN 147

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            AD L  AG+P+ST +L+ Q L GLD EYNP+V  +  +  +SW ++QA+LL FE R+E 
Sbjct: 148 LADKLKLAGNPISTSDLIIQTLNGLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFENRIEQ 207

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
            N   ++ +     T NVA       ++++RG  ++ +N  RG   NNN R GSNF   R
Sbjct: 208 LN---SLTNLTLNATANVA------KKSDHRGNRFNSNNNWRGS--NNNWR-GSNFRGWR 267

Query: 181 GRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNS 240
           G  GRGR +        +  CQVCG   H A+ C++RFDK +S           N+S N+
Sbjct: 268 GGRGRGRSF--------KTTCQVCGLDNHIAIDCFYRFDKTYS---------RSNHSANN 327

Query: 241 GNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG 300
                 +AF+A+ NS       + D +WY DSGASNHVT   +   N  E+ G   +++G
Sbjct: 328 DKQGSHNAFLASQNS-------IEDYDWYFDSGASNHVTHQTDKFQNLSEHHGKNSLIVG 387

Query: 301 NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCII 360
           NGE L I  TG + L +    L+L+++L VP+ITKNL+SVSKLA DN++ +EF  +CC +
Sbjct: 388 NGEKLEIVATGSSKLKS----LNLHDILYVPKITKNLLSVSKLAADNNILVEFDENCCFV 447

Query: 361 KDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSV 420
           KDK +G+ +L+G+L+DGLYQL+                 K+S+A                
Sbjct: 448 KDKLTGKAILRGILKDGLYQLS----------------EKDSSA---------------- 507

Query: 421 NIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSR 480
             +  K  WH++LGHP ++VLD V+  C +++  ++  SFCE+CQ+GK H LPF  S S 
Sbjct: 508 -YVSIKESWHRKLGHPNNKVLDIVLKSCNVKLSPSDQFSFCEACQYGKMHFLPFKTSFSH 567

Query: 481 AKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMV 540
           AK   EL+H+D+WGPAP++S+ GF+YYV F+DD++R+ W+YPLK+K+D   AF  F +MV
Sbjct: 568 AKEILELVHTDVWGPAPIISSSGFKYYVHFIDDFTRFTWIYPLKQKSDTAHAFIQFKNMV 627

Query: 541 RNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT 600
            NQF  +IK +Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Sbjct: 628 ENQFSKKIKTIQCDGGGEYKPVQKHAIEAGIQFRMSCPYTSQQNGRAERKHRHIAEFGLT 687

Query: 601 LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPC 660
           LLAQA MPL YWW+AF  AV LIN LP+SV   KSP  +LH ++ ++  L+ FGCACYP 
Sbjct: 688 LLAQAKMPLNYWWEAFSTAVYLINRLPSSVTHNKSPYSLLHKREPDYNSLKPFGCACYPF 745

Query: 661 LRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF---ALDFG 720
           L+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GR+FISRHV FNE  FPF    L+  
Sbjct: 748 LKPYNKHKLQFHTTRCVFLGYSNSHKGYKCVNSHGRIFISRHVVFNEDHFPFHDGFLNTR 745

Query: 721 KPSSSPTFSPS 729
            P  + T SPS
Sbjct: 808 VPLKTLTESPS 745

BLAST of Lag0027536 vs. NCBI nr
Match: PNX94503.1 (putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense])

HSP 1 Score: 581.6 bits (1498), Expect = 9.5e-162
Identity = 321/717 (44.77%), Positives = 437/717 (60.95%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT +IATQV+  + +K LW   QSL G  +R+   +L+  F  + K   KM  YL  MK+
Sbjct: 87  MTVDIATQVLHCETSKQLWDEAQSLAGAHTRSRIIYLKSEFHNTHKREMKMEQYLAKMKN 146

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            AD L  AGSP+S+ +L+ Q L GLD EYNP+V  +  + +ISW + QA+LL FE RL+ 
Sbjct: 147 LADKLKLAGSPISSSDLMIQTLNGLDSEYNPVVVKLSDQTNISWVDFQAQLLAFESRLD- 206

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
                              NN NN+N N +   N++  N   G  + +  RGG   +N R
Sbjct: 207 -----------------QLNNFNNINLNASA--NFASKNESGGNKFGS--RGGWRGSNSR 266

Query: 181 G-RGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGN 240
           G RGGRGR      +   R +CQ+CG+ GH+A  CY+RFDK ++        +N    G 
Sbjct: 267 GMRGGRGR---ARMSKPPRPICQICGKFGHTAAQCYYRFDKSYT-------EKNHYAEGE 326

Query: 241 SGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI 300
             +    SAFVA+       P    D  WY DSGASNHVT     L +  E  G   +++
Sbjct: 327 GSH----SAFVAS-------PYHGQDYEWYFDSGASNHVTHQSGQLQDLNENNGKNSLLV 386

Query: 301 GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCI 360
           GNGE L I  +G T L++    ++L NVL VPEITKNL+SVSKL  DN+  +EF  + C 
Sbjct: 387 GNGEKLKILASGSTKLND----VNLRNVLYVPEITKNLLSVSKLTIDNNALVEFDENYCY 446

Query: 361 IKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLS 420
           +KDK +G+ +LKG L+DGLYQL+     P   + C+                        
Sbjct: 447 VKDKLTGKALLKGRLKDGLYQLSANKEPPTNKDPCA------------------------ 506

Query: 421 VNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSS 480
              I  K +WH++LGHP ++VL+ V+ D  +++  ++  +FCE+CQFGK H LPF  SSS
Sbjct: 507 --YISLKEIWHRKLGHPNNKVLEKVLKDNNVKISPSDKFTFCEACQFGKLHLLPFKTSSS 566

Query: 481 RAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISM 540
            AK P +LIH+D+WGPAP+LS   F+YYV FLDD+SR+ W++PLK+K++ + AF+ F ++
Sbjct: 567 HAKEPLDLIHTDVWGPAPILSQSNFKYYVHFLDDFSRFTWIFPLKQKSETIHAFNQFKNL 626

Query: 541 VRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL 600
           V NQF  +IK+++ D GGEY  + +     GI  + SCPYT  QNGRAERKHRHV E GL
Sbjct: 627 VENQFNKKIKVIRCDGGGEYKPVQKCAIDSGIQFQMSCPYTSQQNGRAERKHRHVTELGL 686

Query: 601 TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYP 660
           TLLAQA MPL YWW+AF  AV LIN LP+SV   +SP  ++  K+ ++  L+ FGCACYP
Sbjct: 687 TLLAQAKMPLSYWWEAFSTAVYLINRLPSSVNPNESPYTLVFKKEPDYTALKPFGCACYP 730

Query: 661 CLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPFALDF 717
           CL+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GRVF+SRHV FNE  FPF   F
Sbjct: 747 CLKPYNQHKLQFHTTRCVFLGYSNSHKGYKCVNSHGRVFVSRHVVFNENHFPFQEGF 730

BLAST of Lag0027536 vs. NCBI nr
Match: GAU51268.1 (hypothetical protein TSUD_412550 [Trifolium subterraneum])

HSP 1 Score: 576.2 bits (1484), Expect = 4.0e-160
Identity = 316/713 (44.32%), Positives = 433/713 (60.73%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           M  +IATQ++  + +K LW   QSL G  +++   +L+  F  +RKG  KM +YL  MK+
Sbjct: 88  MAIDIATQLLHCETSKQLWDETQSLAGAHTKSRITYLKSEFHNTRKGEMKMEEYLIKMKN 147

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            +D L  AGSP+S  +L+ Q L GLD EYNP+V  +  + ++SW ++QA+LL FE RL+ 
Sbjct: 148 LSDKLKLAGSPISNSDLMIQTLNGLDAEYNPVVVKLSDQINLSWVDVQAQLLAFESRLD- 207

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
                    FN+   + +  + N  N+   RG  +             N RG    +N R
Sbjct: 208 --------QFNNFSGLTLNASANFANKTEFRGNKF-------------NSRGNWRRSNFR 267

Query: 181 G-RGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGN 240
           G RGGRG+G      SN +  CQVC   GH A+ C +RFD+   P   R  +   +  G+
Sbjct: 268 GMRGGRGKG----RMSNTK--CQVCNGTGHIAVDCSYRFDR---PYTGRNYSTEADKQGS 327

Query: 241 SGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI 300
                  SAF+A+       P    D  WY DSGA+NHVT   +      E+ G   +++
Sbjct: 328 H------SAFIAS-------PYHGQDYEWYFDSGANNHVTHQTDKFQGFNEHNGKNSLMV 387

Query: 301 GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCI 360
           GNGE L I  +G T L+N    L+L++VL VP+ITKNL+SVSKL  DN++ +EF  +CC 
Sbjct: 388 GNGEKLKIVASGSTKLNN----LNLHDVLYVPQITKNLLSVSKLTADNNILVEFDANCCS 447

Query: 361 IKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLS 420
           +KDK +GQ +LKG L+DGLYQL+N        E C     K S                 
Sbjct: 448 VKDKLTGQTLLKGRLKDGLYQLSN-------KEPCVYMSVKES----------------- 507

Query: 421 VNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSS 480
                    WH++LGHP ++VLD V+ DC +++  ++  SFCE+CQFGK H LPF  SSS
Sbjct: 508 ---------WHRKLGHPNNKVLDKVLKDCNVKISHSDQFSFCEACQFGKLHLLPFKPSSS 567

Query: 481 RAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISM 540
             + P  LIHSD+WGPAP+LS  GF+YYV F+DD+SR+ W++PLK+K+D + AF  F ++
Sbjct: 568 HVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNL 627

Query: 541 VRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL 600
             NQF  +IKI+Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRHV E GL
Sbjct: 628 AENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVAELGL 687

Query: 601 TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYP 660
           TLLAQA MPL+YWW+AF  AV LIN LP+SV   +SP  ++  ++ ++  L+ FGCACYP
Sbjct: 688 TLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKREPDYNALKPFGCACYP 719

Query: 661 CLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF 713
           CL+PY+ HK QFH+ RCV++G+S SHKG+KC+++ GR+F+SRHV FNE  FPF
Sbjct: 748 CLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVIFNENHFPF 719

BLAST of Lag0027536 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 385.2 bits (988), Expect = 1.7e-105
Identity = 251/709 (35.40%), Positives = 369/709 (52.05%), Query Frame = 0

Query: 15  AKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKSHADNLGQAGSPVST 74
           A  +W  ++ ++   S      LR   +Q  KGT  + DY++ + +  D L   G P+  
Sbjct: 106 AAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDH 165

Query: 75  RNLVSQVLLGLDEEYNPIVAMIHGRG-DISWSEMQAELLVFEKRLELQNTQKAVVSFNHT 134
              V +VL  L EEY P++  I  +    + +E+   LL  E ++       AV S    
Sbjct: 166 DEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKI------LAVSSATVI 225

Query: 135 P-TVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGRGRGGRGRGYGGY 194
           P T N  +++N    NNN       +NG R   Y+N        NN   +  +      +
Sbjct: 226 PITANAVSHRNTTTTNNN-------NNGNRNNRYDNRN------NNNNSKPWQQSSTNFH 285

Query: 195 GNSNNRQ----VCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSA 254
            N+N  +     CQ+CG  GHSA  C     + F  +VN              + QPPS 
Sbjct: 286 PNNNQSKPYLGKCQICGVQGHSAKRCSQL--QHFLSSVN--------------SQQPPSP 345

Query: 255 FVA-NSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPI 314
           F      +  A       +NW  DSGA++H+T DFNNL+  + Y G + V++ +G ++PI
Sbjct: 346 FTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPI 405

Query: 315 TFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQ 374
           + TG T LS  +  L+L+N+L VP I KNL+SV +L   N V +EF      +KD  +G 
Sbjct: 406 SHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGV 465

Query: 375 EVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKN 434
            +L+G  +D LY+    +  P          +  S+   HSS                  
Sbjct: 466 PLLQGKTKDELYEWPIASSQP------VSLFASPSSKATHSS------------------ 525

Query: 435 VWHKRLGHPASRVLDFVINDCKLQV--KENEMLSFCESCQFGKSHNLPFPLSSSRAKYPF 494
            WH RLGHPA  +L+ VI++  L V    ++ LS C  C   KS+ +PF  S+  +  P 
Sbjct: 526 -WHARLGHPAPSILNSVISNYSLSVLNPSHKFLS-CSDCLINKSNKVPFSQSTINSTRPL 585

Query: 495 ELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMVRNQFG 554
           E I+SD+W  +P+LS D ++YYV+F+D ++RY WLYPLK+K+     F  F +++ N+F 
Sbjct: 586 EYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQ 645

Query: 555 CQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLTLLAQA 614
            +I    SDNGGE+  + +   + GI    S P+T   NG +ERKHRH+VETGLTLL+ A
Sbjct: 646 TRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHA 705

Query: 615 SMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPCLRPYH 674
           S+P  YW  AF  AV LIN LPT +L+ +SP + L     N+  LR FGCACYP LRPY+
Sbjct: 706 SIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYN 752

Query: 675 NHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQFNELMFPFA 714
            HK    S +CV+LG+S +   + CL   + R++ISRHV+F+E  FPF+
Sbjct: 766 QHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFS 752

BLAST of Lag0027536 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 1.0e-97
Identity = 240/673 (35.66%), Positives = 353/673 (52.45%), Query Frame = 0

Query: 63  DNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRG-DISWSEMQAELLVFEKRLELQ 122
           D L   G P+     V +VL  L ++Y P++  I  +    S +E+   L+  E +L   
Sbjct: 135 DQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLAL 194

Query: 123 NTQKAV-VSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 182
           N+ + V ++ N     N   N+N  N+ +NR  NY+++N +   +  ++   GS  +N +
Sbjct: 195 NSAEVVPITANVVTHRNTNTNRNQNNRGDNR--NYNNNNNRSNSWQPSS--SGSRSDNRQ 254

Query: 183 GRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMC--YHRFDKEFSPNVNRGGNQNPNNSG 242
            +   GR             CQ+C   GHSA  C   H+F             Q+  N  
Sbjct: 255 PKPYLGR-------------CQICSVQGHSAKRCPQLHQF-------------QSTTNQQ 314

Query: 243 NSGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVV 302
            S  T P + +   +N     P     +NW  DSGA++H+T DFNNL+  + Y G + V+
Sbjct: 315 QS--TSPFTPWQPRANLAVNSPYNA--NNWLLDSGATHHITSDFNNLSFHQPYTGGDDVM 374

Query: 303 IGNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCC 362
           I +G ++PIT TG   L   +  L LN VL VP I KNL+SV +L   N V +EF     
Sbjct: 375 IADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASF 434

Query: 363 IIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPL 422
            +KD  +G  +L+G  +D LY+       P  +       +   +   HSS         
Sbjct: 435 QVKDLNTGVPLLQGKTKDELYEW------PIASSQAVSMFASPCSKATHSS--------- 494

Query: 423 SVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQV-KENEMLSFCESCQFGKSHNLPFPLS 482
                     WH RLGHP+  +L+ VI++  L V   +  L  C  C   KSH +PF  S
Sbjct: 495 ----------WHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNS 554

Query: 483 SSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFI 542
           +  +  P E I+SD+W  +P+LS D ++YYV+F+D ++RY WLYPLK+K+     F  F 
Sbjct: 555 TITSSKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFK 614

Query: 543 SMVRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVET 602
           S+V N+F  +I  L SDNGGE+  +     + GI    S P+T   NG +ERKHRH+VE 
Sbjct: 615 SLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEM 674

Query: 603 GLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCAC 662
           GLTLL+ AS+P  YW  AF  AV LIN LPT +L+ +SP + L  +  N+  L+ FGCAC
Sbjct: 675 GLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCAC 734

Query: 663 YPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFISRHVQFNELMFPFA-LD 722
           YP LRPY+ HK +  S++C ++G+S +   + CL   +GR++ SRHVQF+E  FPF+  +
Sbjct: 735 YPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTN 747

Query: 723 FGKPSSSPTFSPS 729
           FG  +S    S S
Sbjct: 795 FGVSTSQEQRSDS 747

BLAST of Lag0027536 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 1.1e-46
Identity = 157/558 (28.14%), Positives = 239/558 (42.83%), Query Frame = 0

Query: 184 GRGRGY----------GGYGNSNNR-----QVCQVCGRPGHSALMCYHRFDKEFSPNVNR 243
           GRGR Y          G  G S NR     + C  C +PGH    C         PN  +
Sbjct: 200 GRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDC---------PNPRK 259

Query: 244 GGNQ---NPNNSGNSGNTQPPSAFVANSNSQYACPE-TVIDSNWYADSGASNHVTGDFNN 303
           G  +     N+   +   Q     V   N +  C   +  +S W  D+ AS+H T    +
Sbjct: 260 GKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHAT-PVRD 319

Query: 304 LANTKEYGGNEQVVIGNGESLPITFTGDTYL-SNGAAILSLNNVLCVPEITKNLVSVSKL 363
           L      G    V +GN     I   GD  + +N    L L +V  VP++  NL  +S +
Sbjct: 320 LFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNL--ISGI 379

Query: 364 AQDNDVFIEFHGDCCIIKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNST 423
           A D D +  +  +      K S   + KGV R  LY+ N       +N    E       
Sbjct: 380 ALDRDGYESYFANQKWRLTKGS-LVIAKGVARGTLYRTNAEICQGELNAAQDE------- 439

Query: 424 ANNHSSVFVVSRYPLSVNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCES 483
                               +S ++WHKR+GH + + L  +     +   +   +  C+ 
Sbjct: 440 --------------------ISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDY 499

Query: 484 CQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPL 543
           C FGK H + F  SS R     +L++SD+ GP  + S  G +Y+V F+DD SR LW+Y L
Sbjct: 500 CLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYIL 559

Query: 544 KKKNDALAAFHHFISMVRNQFGCQIKILQSDNGGEYA--RIHQECYRLGIISRYSCPYTF 603
           K K+     F  F ++V  + G ++K L+SDNGGEY      + C   GI    + P T 
Sbjct: 560 KTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTP 619

Query: 604 AQNGRAERKHRHVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLH 663
             NG AER +R +VE   ++L  A +P  +W +A   A  LIN  P+  L  + P  V  
Sbjct: 620 QHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWT 679

Query: 664 HKKLNFAGLRSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLS-ASGRVFIS 719
           +K+++++ L+ FGC  +  +      K    S  C+++G+     G++       +V  S
Sbjct: 680 NKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRS 717

BLAST of Lag0027536 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 164.5 bits (415), Expect = 4.8e-39
Identity = 147/529 (27.79%), Positives = 229/529 (43.29%), Query Frame = 0

Query: 193 GNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNSGNTQPPSAFVAN 252
           GNS  +  C  CGR GH    C+H   K    N N+   +    + + G      AF+  
Sbjct: 224 GNSKYKVKCHHCGREGHIKKDCFHY--KRILNNKNKENEKQVQTATSHG-----IAFMVK 283

Query: 253 SNSQYACPETVIDS-NWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG-NGESLPITFT 312
             +      +V+D+  +  DSGAS+H+  D +   ++ E     ++ +   GE +  T  
Sbjct: 284 EVNN----TSVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKR 343

Query: 313 GDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKDKRSGQEVL 372
           G   L N   I +L +VL   E   NL+SV +L Q+  + IEF          +SG  + 
Sbjct: 344 GIVRLRNDHEI-TLEDVLFCKEAAGNLMSVKRL-QEAGMSIEF---------DKSGVTIS 403

Query: 373 KGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWH 432
           K  L                       + KNS   N+  V     Y ++     +  +WH
Sbjct: 404 KNGL----------------------MVVKNSGMLNNVPVINFQAYSINAKHKNNFRLWH 463

Query: 433 KRLGHPASRVLDFVINDCK-LQVKENEMLS-------------FCESCQFGKSHNLPFP- 492
           +R GH         I+D K L++K   M S              CE C  GK   LPF  
Sbjct: 464 ERFGH---------ISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQ 523

Query: 493 -LSSSRAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFH 552
               +  K P  ++HSD+ GP   ++ D   Y+V+F+D ++ Y   Y +K K+D  + F 
Sbjct: 524 LKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQ 583

Query: 553 HFISMVRNQFGCQIKILQSDNGGEYA--RIHQECYRLGIISRYSCPYTFAQNGRAERKHR 612
            F++     F  ++  L  DNG EY    + Q C + GI    + P+T   NG +ER  R
Sbjct: 584 DFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIR 643

Query: 613 HVVETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVL--EGKSPLEVLHHKKLNFAGL 672
            + E   T+++ A +   +W +A L A  LIN +P+  L    K+P E+ H+KK     L
Sbjct: 644 TITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHL 696

Query: 673 RSFGCACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFI 700
           R FG   Y  ++     KF   S + +++G+ P+  G K   A    FI
Sbjct: 704 RVFGATVYVHIKNKQG-KFDDKSFKSIFVGYEPN--GFKLWDAVNEKFI 696

BLAST of Lag0027536 vs. ExPASy Swiss-Prot
Match: Q07791 (Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-DR3 PE=3 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 2.4e-22
Identity = 113/454 (24.89%), Positives = 186/454 (40.97%), Query Frame = 0

Query: 260 PETVIDSN------WYADSGASNHVTGDFNNLANTKEYGGNEQVVIGNGESLPITFTGDT 319
           P   IDSN         DSGAS  +    + L +         +V    + +PI   G+ 
Sbjct: 440 PTRTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATP-NSEINIVDAQKQDIPINAIGNL 499

Query: 320 YLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCIIKD--KRSGQEVLK 379
           + +      +    L  P I  +L+S+S+LA  N          C  ++  +RS   VL 
Sbjct: 500 HFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNIT-------ACFTRNTLERSDGTVLA 559

Query: 380 GVLRDG-LYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSVNIIVSKNVWH 439
            +++ G  Y L+    +P         ISK  T NN +    V++YP          + H
Sbjct: 560 PIVKHGDFYWLSKKYLIP-------SHISK-LTINNVNKSKSVNKYPYP--------LIH 619

Query: 440 KRLGHPASRVLDFVI-NDCKLQVKE------NEMLSFCESCQFGKSHNLPFPLSSSRAKY 499
           + LGH   R +   +  +    +KE      N     C  C  GKS      +  SR KY
Sbjct: 620 RMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRH-IKGSRLKY 679

Query: 500 -----PFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPL--KKKNDALAAFHHF 559
                PF+ +H+D++GP   L      Y++ F D+ +R+ W+YPL  +++   L  F   
Sbjct: 680 QESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSI 739

Query: 560 ISMVRNQFGCQIKILQSDNGGEYAR--IHQECYRLGIISRYSCPYTFAQNGRAERKHRHV 619
           ++ ++NQF  ++ ++Q D G EY    +H+     GI + Y+       +G AER +R +
Sbjct: 740 LAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTL 799

Query: 620 VETGLTLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFG 679
           +    TLL  + +P   W+ A   +  + N L  S    KS  +      L+   +  FG
Sbjct: 800 LNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSL-VSPKNDKSARQHAGLAGLDITTILPFG 859

Query: 680 CACYPCLRPYHNHKFQFHSERCVYLGFSPSHKGH 689
               P +   HN   + H          PS   +
Sbjct: 860 ---QPVIVNNHNPDSKIHPRGIPGYALHPSRNSY 864

BLAST of Lag0027536 vs. ExPASy TrEMBL
Match: A0A151S6M8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_027809 PE=4 SV=1)

HSP 1 Score: 618.6 bits (1594), Expect = 3.4e-173
Identity = 344/761 (45.20%), Positives = 467/761 (61.37%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT E+ATQ++  + ++ +W   QSL G  +R+   FL+  F ++RKG  KM +YL  MK 
Sbjct: 1   MTQEVATQLLHCETSQQIWEDAQSLAGAHTRSRITFLKTEFHRTRKGGLKMEEYLTKMKE 60

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            AD+L  AGS VST +LV+Q L GLD EYNPIV  +  +  ++W EMQA+LL +E RLE 
Sbjct: 61  IADDLALAGSSVSTMDLVTQTLAGLDNEYNPIVVQLSDKEHLTWVEMQAQLLTYENRLEQ 120

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
            N Q  +       T+N ++N + +   N RG + +   G+ GQ             N  
Sbjct: 121 INNQSNL-------TLNPSSNISTI-LYNRRGKSNAFGGGRGGQI------------NRG 180

Query: 181 GRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNS 240
            RGGRGRG      + +R VCQVC +PGH+A  CYHRF+K +   + +  ++  +     
Sbjct: 181 ARGGRGRGRA----TKDRIVCQVCCKPGHAASHCYHRFNKNY---IGQNSDEQKSEKDKE 240

Query: 241 GNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG 300
            N         N N+  A P TV D +WY DSGASNHVT D N +    E  G   + +G
Sbjct: 241 QN--------YNFNAYVASPSTVEDLDWYFDSGASNHVTYDQNKVQEVNENDGKSFLTVG 300

Query: 301 NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCII 360
           NG +L I   GD+ L      L+L ++L VP+ITKNL+S+SKL  DND+++EFH   C +
Sbjct: 301 NGANLKIIACGDSSLDTQQKSLNLKDILYVPKITKNLLSISKLTFDNDIYVEFHDVACFV 360

Query: 361 KDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSV 420
           KDK +G+ +L+G ++DGLYQL      PG           +++ N    VF         
Sbjct: 361 KDKLTGRILLEGKIKDGLYQL------PG----------GSTSTNKRPHVF--------- 420

Query: 421 NIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSR 480
                K  WH++LGHP S+VL+ V+  C ++    E   FCE+CQFGK+HNLPF  S S 
Sbjct: 421 --FSIKETWHRKLGHPNSKVLNEVMKLCNIEASPCENFEFCEACQFGKAHNLPFQNSVSC 480

Query: 481 AKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMV 540
           AK P +L+HSD+WGPAP+ S  GF+YYVLFLDD+SR+ W+YPLK+K+D   AF  F ++V
Sbjct: 481 AKEPLDLVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLKQKSDVFQAFIQFRNLV 540

Query: 541 RNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT 600
            NQF  +IK LQ D GGE+  + +   + GI  R SCPYT AQNGRAERKHRHVVE+GLT
Sbjct: 541 ENQFNKRIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLT 600

Query: 601 LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPC 660
           LLAQA MPL YWW+AF  AV LIN LPT V++ KSP + L  K  ++  +++FGCACYPC
Sbjct: 601 LLAQAKMPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDKNPDYTAMKTFGCACYPC 660

Query: 661 LRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF---ALDFG 720
           L+PY+ HK QFH+ +CV+LG+S SHKG+KCL+++GR+FISRHV FNE  FPF    L+  
Sbjct: 661 LKPYNQHKLQFHTTKCVFLGYSGSHKGYKCLNSTGRIFISRHVVFNEHHFPFHDGFLNTR 698

Query: 721 KPS---SSPT-----FSPSHGPSILTWFQSLEHSHLSQENT 751
           KP+   + PT      SP+ G ++    Q L  ++ S  NT
Sbjct: 721 KPAEIITDPTSLLFPISPT-GSNVANEEQRLHTNNNSSSNT 698

BLAST of Lag0027536 vs. ExPASy TrEMBL
Match: A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)

HSP 1 Score: 614.0 bits (1582), Expect = 8.4e-172
Identity = 325/712 (45.65%), Positives = 441/712 (61.94%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT EIATQ++  + +K LW   QSL G  +R++  +L+  F   RKG  KM DYL  MK+
Sbjct: 88  MTTEIATQLLHCETSKQLWDEAQSLAGAHTRSQIIYLKSEFHSIRKGEMKMEDYLIKMKN 147

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
             D L  AG+PVST +L+ Q L GLD EYNP+V  +  +  +SW ++QA+LL FE R+E 
Sbjct: 148 LVDKLKLAGNPVSTSDLIIQTLNGLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFESRIEQ 207

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
            N    + +     T NVAN  ++  +++N  W  S+S G                    
Sbjct: 208 LNN---LTNLTLNATANVANRSDHRGKSSNNNWRGSNSRG-------------------- 267

Query: 181 GRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNS 240
            RGGRGRG  G      +  CQVCG   H A+ C+HRFDK +S           N+S   
Sbjct: 268 WRGGRGRGKSG------KNPCQVCGLSNHIAIDCFHRFDKTYS---------RSNHSAGH 327

Query: 241 GNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG 300
                 +AF+A+ NS       V D +WY DSGASNHVT       +  E+ G   +V+G
Sbjct: 328 DKQGSHNAFLASQNS-------VEDYDWYFDSGASNHVTHQTEKFQDLTEHHGKNSLVVG 387

Query: 301 NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCII 360
           NGE L I  TG + L +    L+L+++L VP ITKNL+SVSKLA DN++ +EF  +CC +
Sbjct: 388 NGEKLAILATGSSKLKS----LNLHDILYVPNITKNLLSVSKLAADNNILVEFDENCCFV 447

Query: 361 KDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSV 420
           KDK +G+ +LKG+L+DGLYQL+   R P                    S FV      SV
Sbjct: 448 KDKLTGKVILKGLLKDGLYQLSGTKRNP--------------------SAFV------SV 507

Query: 421 NIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSR 480
                K  WH+RLGHP ++VLD V+  CK++V  ++  SFCE+CQ+GK H LPF  SSS 
Sbjct: 508 -----KESWHRRLGHPNNKVLDKVLESCKVKVPPSDNFSFCEACQYGKMHLLPFKSSSSH 567

Query: 481 AKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMV 540
           A+ P EL+H+D+WGPAP++++ GF+YYV F+DD+SR+ W+YPLK+K++ + AF  F ++ 
Sbjct: 568 AQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDDFSRFTWIYPLKQKSETVQAFIQFKNLT 627

Query: 541 RNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT 600
            NQF  +IK++Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Sbjct: 628 ENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLT 687

Query: 601 LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPC 660
           LLAQA MPL YWW+AF  AV LIN LP+ V + +SP  ++  K+ ++  L++FGCACYPC
Sbjct: 688 LLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQNESPYSLMLQKEPDYKLLKTFGCACYPC 719

Query: 661 LRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF 713
           L+PY+ HK Q+H+ RCV+LG+S SHKG+KCL++ GR+FISRHV FNE  FPF
Sbjct: 748 LKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPF 719

BLAST of Lag0027536 vs. ExPASy TrEMBL
Match: A0A2K3LCM1 (Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g032236 PE=4 SV=1)

HSP 1 Score: 612.5 bits (1578), Expect = 2.4e-171
Identity = 329/731 (45.01%), Positives = 455/731 (62.24%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT  IATQ++  + +  LW   QSL G  +R++  +L+  F  +RKG  KM DYL  MK+
Sbjct: 88  MTVGIATQLLHCETSMQLWDEAQSLAGAHTRSQITYLKSEFHSTRKGEMKMEDYLIKMKN 147

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            AD L  AG+P+ST +L+ Q L GLD EYNP+V  +  +  +SW ++QA+LL FE R+E 
Sbjct: 148 LADKLKLAGNPISTSDLIIQTLNGLDSEYNPVVVKLSDQTTLSWVDLQAQLLTFENRIEQ 207

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
            N   ++ +     T NVA       ++++RG  ++ +N  RG   NNN R GSNF   R
Sbjct: 208 LN---SLTNLTLNATANVA------KKSDHRGNRFNSNNNWRGS--NNNWR-GSNFRGWR 267

Query: 181 GRGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGNS 240
           G  GRGR +        +  CQVCG   H A+ C++RFDK +S           N+S N+
Sbjct: 268 GGRGRGRSF--------KTTCQVCGLDNHIAIDCFYRFDKTYS---------RSNHSANN 327

Query: 241 GNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVIG 300
                 +AF+A+ NS       + D +WY DSGASNHVT   +   N  E+ G   +++G
Sbjct: 328 DKQGSHNAFLASQNS-------IEDYDWYFDSGASNHVTHQTDKFQNLSEHHGKNSLIVG 387

Query: 301 NGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCII 360
           NGE L I  TG + L +    L+L+++L VP+ITKNL+SVSKLA DN++ +EF  +CC +
Sbjct: 388 NGEKLEIVATGSSKLKS----LNLHDILYVPKITKNLLSVSKLAADNNILVEFDENCCFV 447

Query: 361 KDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLSV 420
           KDK +G+ +L+G+L+DGLYQL+                 K+S+A                
Sbjct: 448 KDKLTGKAILRGILKDGLYQLS----------------EKDSSA---------------- 507

Query: 421 NIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSR 480
             +  K  WH++LGHP ++VLD V+  C +++  ++  SFCE+CQ+GK H LPF  S S 
Sbjct: 508 -YVSIKESWHRKLGHPNNKVLDIVLKSCNVKLSPSDQFSFCEACQYGKMHFLPFKTSFSH 567

Query: 481 AKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISMV 540
           AK   EL+H+D+WGPAP++S+ GF+YYV F+DD++R+ W+YPLK+K+D   AF  F +MV
Sbjct: 568 AKEILELVHTDVWGPAPIISSSGFKYYVHFIDDFTRFTWIYPLKQKSDTAHAFIQFKNMV 627

Query: 541 RNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGLT 600
            NQF  +IK +Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRH+ E GLT
Sbjct: 628 ENQFSKKIKTIQCDGGGEYKPVQKHAIEAGIQFRMSCPYTSQQNGRAERKHRHIAEFGLT 687

Query: 601 LLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYPC 660
           LLAQA MPL YWW+AF  AV LIN LP+SV   KSP  +LH ++ ++  L+ FGCACYP 
Sbjct: 688 LLAQAKMPLNYWWEAFSTAVYLINRLPSSVTHNKSPYSLLHKREPDYNSLKPFGCACYPF 745

Query: 661 LRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF---ALDFG 720
           L+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GR+FISRHV FNE  FPF    L+  
Sbjct: 748 LKPYNKHKLQFHTTRCVFLGYSNSHKGYKCVNSHGRIFISRHVVFNEDHFPFHDGFLNTR 745

Query: 721 KPSSSPTFSPS 729
            P  + T SPS
Sbjct: 808 VPLKTLTESPS 745

BLAST of Lag0027536 vs. ExPASy TrEMBL
Match: A0A2K3MUJ9 (Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g017679 PE=4 SV=1)

HSP 1 Score: 581.6 bits (1498), Expect = 4.6e-162
Identity = 321/717 (44.77%), Positives = 437/717 (60.95%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           MT +IATQV+  + +K LW   QSL G  +R+   +L+  F  + K   KM  YL  MK+
Sbjct: 87  MTVDIATQVLHCETSKQLWDEAQSLAGAHTRSRIIYLKSEFHNTHKREMKMEQYLAKMKN 146

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            AD L  AGSP+S+ +L+ Q L GLD EYNP+V  +  + +ISW + QA+LL FE RL+ 
Sbjct: 147 LADKLKLAGSPISSSDLMIQTLNGLDSEYNPVVVKLSDQTNISWVDFQAQLLAFESRLD- 206

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
                              NN NN+N N +   N++  N   G  + +  RGG   +N R
Sbjct: 207 -----------------QLNNFNNINLNASA--NFASKNESGGNKFGS--RGGWRGSNSR 266

Query: 181 G-RGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGN 240
           G RGGRGR      +   R +CQ+CG+ GH+A  CY+RFDK ++        +N    G 
Sbjct: 267 GMRGGRGR---ARMSKPPRPICQICGKFGHTAAQCYYRFDKSYT-------EKNHYAEGE 326

Query: 241 SGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI 300
             +    SAFVA+       P    D  WY DSGASNHVT     L +  E  G   +++
Sbjct: 327 GSH----SAFVAS-------PYHGQDYEWYFDSGASNHVTHQSGQLQDLNENNGKNSLLV 386

Query: 301 GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCI 360
           GNGE L I  +G T L++    ++L NVL VPEITKNL+SVSKL  DN+  +EF  + C 
Sbjct: 387 GNGEKLKILASGSTKLND----VNLRNVLYVPEITKNLLSVSKLTIDNNALVEFDENYCY 446

Query: 361 IKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLS 420
           +KDK +G+ +LKG L+DGLYQL+     P   + C+                        
Sbjct: 447 VKDKLTGKALLKGRLKDGLYQLSANKEPPTNKDPCA------------------------ 506

Query: 421 VNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSS 480
              I  K +WH++LGHP ++VL+ V+ D  +++  ++  +FCE+CQFGK H LPF  SSS
Sbjct: 507 --YISLKEIWHRKLGHPNNKVLEKVLKDNNVKISPSDKFTFCEACQFGKLHLLPFKTSSS 566

Query: 481 RAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISM 540
            AK P +LIH+D+WGPAP+LS   F+YYV FLDD+SR+ W++PLK+K++ + AF+ F ++
Sbjct: 567 HAKEPLDLIHTDVWGPAPILSQSNFKYYVHFLDDFSRFTWIFPLKQKSETIHAFNQFKNL 626

Query: 541 VRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL 600
           V NQF  +IK+++ D GGEY  + +     GI  + SCPYT  QNGRAERKHRHV E GL
Sbjct: 627 VENQFNKKIKVIRCDGGGEYKPVQKCAIDSGIQFQMSCPYTSQQNGRAERKHRHVTELGL 686

Query: 601 TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYP 660
           TLLAQA MPL YWW+AF  AV LIN LP+SV   +SP  ++  K+ ++  L+ FGCACYP
Sbjct: 687 TLLAQAKMPLSYWWEAFSTAVYLINRLPSSVNPNESPYTLVFKKEPDYTALKPFGCACYP 730

Query: 661 CLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPFALDF 717
           CL+PY+ HK QFH+ RCV+LG+S SHKG+KC+++ GRVF+SRHV FNE  FPF   F
Sbjct: 747 CLKPYNQHKLQFHTTRCVFLGYSNSHKGYKCVNSHGRVFVSRHVVFNENHFPFQEGF 730

BLAST of Lag0027536 vs. ExPASy TrEMBL
Match: A0A2Z6P4D5 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_412550 PE=4 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 1.9e-160
Identity = 316/713 (44.32%), Positives = 433/713 (60.73%), Query Frame = 0

Query: 1   MTPEIATQVMGFDNAKDLWSAIQSLFGIQSRAEEDFLRQTFQQSRKGTSKMTDYLRLMKS 60
           M  +IATQ++  + +K LW   QSL G  +++   +L+  F  +RKG  KM +YL  MK+
Sbjct: 88  MAIDIATQLLHCETSKQLWDETQSLAGAHTKSRITYLKSEFHNTRKGEMKMEEYLIKMKN 147

Query: 61  HADNLGQAGSPVSTRNLVSQVLLGLDEEYNPIVAMIHGRGDISWSEMQAELLVFEKRLEL 120
            +D L  AGSP+S  +L+ Q L GLD EYNP+V  +  + ++SW ++QA+LL FE RL+ 
Sbjct: 148 LSDKLKLAGSPISNSDLMIQTLNGLDAEYNPVVVKLSDQINLSWVDVQAQLLAFESRLD- 207

Query: 121 QNTQKAVVSFNHTPTVNVANNKNNMNQNNNRGWNYSHSNGQRGQFYNNNQRGGSNFNNGR 180
                    FN+   + +  + N  N+   RG  +             N RG    +N R
Sbjct: 208 --------QFNNFSGLTLNASANFANKTEFRGNKF-------------NSRGNWRRSNFR 267

Query: 181 G-RGGRGRGYGGYGNSNNRQVCQVCGRPGHSALMCYHRFDKEFSPNVNRGGNQNPNNSGN 240
           G RGGRG+G      SN +  CQVC   GH A+ C +RFD+   P   R  +   +  G+
Sbjct: 268 GMRGGRGKG----RMSNTK--CQVCNGTGHIAVDCSYRFDR---PYTGRNYSTEADKQGS 327

Query: 241 SGNTQPPSAFVANSNSQYACPETVIDSNWYADSGASNHVTGDFNNLANTKEYGGNEQVVI 300
                  SAF+A+       P    D  WY DSGA+NHVT   +      E+ G   +++
Sbjct: 328 H------SAFIAS-------PYHGQDYEWYFDSGANNHVTHQTDKFQGFNEHNGKNSLMV 387

Query: 301 GNGESLPITFTGDTYLSNGAAILSLNNVLCVPEITKNLVSVSKLAQDNDVFIEFHGDCCI 360
           GNGE L I  +G T L+N    L+L++VL VP+ITKNL+SVSKL  DN++ +EF  +CC 
Sbjct: 388 GNGEKLKIVASGSTKLNN----LNLHDVLYVPQITKNLLSVSKLTADNNILVEFDANCCS 447

Query: 361 IKDKRSGQEVLKGVLRDGLYQLNNVTRVPGVNEGCSESISKNSTANNHSSVFVVSRYPLS 420
           +KDK +GQ +LKG L+DGLYQL+N        E C     K S                 
Sbjct: 448 VKDKLTGQTLLKGRLKDGLYQLSN-------KEPCVYMSVKES----------------- 507

Query: 421 VNIIVSKNVWHKRLGHPASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSS 480
                    WH++LGHP ++VLD V+ DC +++  ++  SFCE+CQFGK H LPF  SSS
Sbjct: 508 ---------WHRKLGHPNNKVLDKVLKDCNVKISHSDQFSFCEACQFGKLHLLPFKPSSS 567

Query: 481 RAKYPFELIHSDLWGPAPVLSTDGFQYYVLFLDDYSRYLWLYPLKKKNDALAAFHHFISM 540
             + P  LIHSD+WGPAP+LS  GF+YYV F+DD+SR+ W++PLK+K+D + AF  F ++
Sbjct: 568 HVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDDFSRFTWIFPLKQKSDTIHAFIQFKNL 627

Query: 541 VRNQFGCQIKILQSDNGGEYARIHQECYRLGIISRYSCPYTFAQNGRAERKHRHVVETGL 600
             NQF  +IKI+Q D GGEY  + +     GI  R SCPYT  QNGRAERKHRHV E GL
Sbjct: 628 AENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQFRMSCPYTSQQNGRAERKHRHVAELGL 687

Query: 601 TLLAQASMPLQYWWDAFLAAVQLINGLPTSVLEGKSPLEVLHHKKLNFAGLRSFGCACYP 660
           TLLAQA MPL+YWW+AF  AV LIN LP+SV   +SP  ++  ++ ++  L+ FGCACYP
Sbjct: 688 TLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNPNESPYSLMFKREPDYNALKPFGCACYP 719

Query: 661 CLRPYHNHKFQFHSERCVYLGFSPSHKGHKCLSASGRVFISRHVQFNELMFPF 713
           CL+PY+ HK QFH+ RCV++G+S SHKG+KC+++ GR+F+SRHV FNE  FPF
Sbjct: 748 CLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCINSHGRIFVSRHVIFNENHFPF 719

BLAST of Lag0027536 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 62.8 bits (151), Expect = 1.4e-09
Identity = 34/122 (27.87%), Positives = 52/122 (42.62%), Query Frame = 0

Query: 390 VNEGCSESISK-------NSTANNHSSVFVV------SRYPLSVNIIVSKNVWHKRLGHP 449
           V   CSE + K           N H S++++          L+        +WH RL H 
Sbjct: 20  VEASCSEGVLKVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAKDETRLWHSRLAHM 79

Query: 450 ASRVLDFVINDCKLQVKENEMLSFCESCQFGKSHNLPFPLSSSRAKYPFELIHSDLWGPA 499
           + R ++ ++    L   +   L FCE C +GK+H + F       K P + +HSDLWG  
Sbjct: 80  SQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFSTGQHTTKNPLDYVHSDLWGAP 139

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
KYP50444.1   7.0e-173  45.20     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
GAU19483.1   1.7e-171  45.65     hypothetical protein TSUD_77270 [Trifolium subterraneum]
PNX76291.1   5.1e-171  45.01     gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium praten...
PNX94503.1   9.5e-162  44.77     putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense...
GAU51268.1   4.0e-160  44.32     hypothetical protein TSUD_412550 [Trifolium subterraneum]

Match Name   E-value   Identity  Description
Q94HW2       1.7e-105  35.40     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94       1.0e-97   35.66     Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978       1.1e-46   28.14     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146       4.8e-39   27.79     Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q07791       2.4e-22   24.89     Transposon Ty2-DR3 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...

Match Name   E-value   Identity  Description
A0A151S6M8   3.4e-173  45.20     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A2Z6MBG6   8.4e-172  45.65     Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A2K3LCM1   2.4e-171  45.01     Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) OS=Trifolium prat...
A0A2K3MUJ9   4.6e-162  44.77     Putative retrotransposon Ty1-copia subclass protein (Fragment) OS=Trifolium prat...
A0A2Z6P4D5   1.9e-160  44.32     Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...

Match Name   E-value   Identity  Description
ATMG00300.1  1.4e-09   27.87     Gag-Pol-related retrotransposon family protein
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                  Source       Source Term     Source Description   Alignment
IPR025724  GAG-pre-integrase domain         PFAM         PF13976         gag_pre-integrs      coord: 406..469; e-value: 7.1E-11; score: 41.8
None       No IPR available                 PFAM         PF14223         Retrotran_gag_2      coord: 1..118; e-value: 6.8E-10; score: 38.9
None       No IPR available                 MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 137..177
None       No IPR available                 MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 225..252
None       No IPR available                 MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 137..192
None       No IPR available                 PANTHER      PTHR34222       FAMILY NOT NAMED     coord: 2..316
None       No IPR available                 PANTHER      PTHR34222:SF48  SUBFAMILY NOT NAMED  coord: 2..316
IPR001584  Integrase, catalytic core        PFAM         PF00665         rve                  coord: 484..579; e-value: 4.2E-12; score: 46.3
IPR001584  Integrase, catalytic core        PROSITE      PS50994         INTEGRASE            coord: 480..644; score: 20.427382
IPR036397  Ribonuclease H superfamily       GENE3D       3.30.420.10     -                    coord: 476..652; e-value: 9.0E-32; score: 112.0
IPR012337  Ribonuclease H-like superfamily  SUPERFAMILY  53098           Ribonuclease H-like  coord: 480..640

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Lag0027536.1   Lag0027536.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding