Lag0006769 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0006769
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionIntegrase catalytic domain-containing protein
Locationchr6: 45678367 .. 45680574 (-)
RNA-Seq ExpressionLag0006769
SyntenyLag0006769
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATTCAAGGGAGACAACGAGGATTTGGAGGAAGGCCTTTATGGAAAATCAGAATGCAAATCCGTTTGTTGCTTCCCCTGAAACTGTTGTTGATCCTAATTGGTATGTGGATAGCCGGAGCTTCCAATCATGTGACTGCTAAATTTGTTTGTTGCTTCCCCTAAAACTGTTGTTGATCCTAATTGGTATGTGGATAGCCGGAGCTTCCAATCATGTGACTACTGACTACAACAACTTGGCTAATCCAACCGAATATGAAGGTAAAGAACAAGTATCTGCTAGTAATGGTAGTCAACTTAAAATAGCTTTTGTTGGTAATGCTTGTCTATCGGCCGGAAATATGAAGTTTAATTTGGAAAAAGTTTTGTGTGTTCCAAATATAGTTAAAAATCTTGTTAGTGTATCCAAGTTGGCTAGAGATAATGGTGTTTTTGTGGAATTTCATGAAAATTTTTGTGTTGTTAAGGACAAGGTTTCGGGCAAGGAGTTGCTGAGAGGGGTGTTCAGTGAAGGGCTCTATTGGTTTAATGGTGCAAAGACAACTGCAATAGATATATCTAGTTCAACTACTAACGACAACAAAATCTATAGTATAAATAATGCTGAGCTTTCTGCTTTTGTTGTGTCTCATTCAGTAAATATTACTTTGTCTATAGTAATATGGCATAGGCGTCTTGGCCATCCATCAGAAAAAGTTTTTGAATCATTTGCTCAACGATGTTATCTCTCATATAGAGTTAATGAAAAGTCCAAGTTTTGTGAGGCCTGTGAGTTTGGTAAAATTAAACACATGCTCTACCCTTTTCCAACTCTAATTCTCATGCATCAAGTAGGTTTGATCTTATTCATATCGATATCTGGAGGCCAGTACCCATAAGTTTAGTTAAAGGGTTTAAATATTACATTCTGTTTGTTGACGATTTTAGCCGTTTTGTATGGATTTATCCACTGAAACAGAAAAGTGAAGCATTAGCAGCATTATTTCTACACTTTACAACAATGGTGAAAAATCAGTTTAATAGTAGTATAAAGGCTTTACAACCAGATAATGGCGGGGAATATGCTAGAATACTTAAACTGTGTAATGAATGTGGTGTTCAAACAAGACTCACCCGTCCCTACACCTCTCAGCAAAACGGTAGAGCAGAAAGAAAGCATAGGCACATTGTGGAGACCGATTTAACACTTCTTGCCCAAACATCAATGCCTCTAAGGTACTGGTGGGATGCGTTTTTAGTTGCCTCTATGCTAATAAATGGTCTTCCTTCACAGGTCATTAATGGAAAATCTCCAATGGAGATAATGTATGGCAAGAGCATTGATTTTAGTGCACTCAGGACATTCGGTTGTACGTGTTATTCCTGTCTCCGTCCATATCAAACACACAAATTTCAGTTTCATACTGAAAAGTGTGCTTACTTGGGGCCAAGTCCGATTCATAAAGGTCATATATGCCTCAGTTCCAGCGGTCGGGTGTATGTTTCGCGCCATGTCTCTTCTGTTATTTTCTTAAGTTGGTATCAGAGCCCATAAAACCCAAACGAGTATTCGGTCCAAGAAAAAAAATTGGGATCTGATCCAAGAATGGTGAACCCAAAGATTCACCATCTTGAGGGAGCATATTGAGGATCCCACATTGAAAAGATTAGTGAAGACCTCACAATTTATAAGCTACAAGAGTCACCCCACTCATTGTCAATTGGTTTTGAGATGGAAGCGTCCATGTTATTTGATATAGGATGGTAGAAAAGCAATTTTGTCCAACCTTTTCAATTCTTTAAAAGTTTGATCCATTAAATGGAACCAATTTCCTCATCTCCCTCTGGACATCGTTCTACAATTCGAATGCTCAAAATACCAATTGTTAGGGGATAAAATATAATTCAAGAATTCGTTGTGGCGGTCTTTCATCTCAGCTAAATATCAGGCTTCCTATCCGGGGGATATTCCAACCAAGTCCAAGTATTCTAGCTCCCGAGCCCCATAATTCTCTATTTGTAAAATGCATGGTGTTTTTACTAGAAATACCAAATGGGTGCTTTGTGATGGCAGTAGACTCCGATTTTGGCATGACAGTTGGATTGGTAATGGCCCGCTTAAAGAGGCACATCCGTGCATGTTTCTTCGCTATCACCATCAACAAGGACCTCCTAGTCTCCGAAGATCTCTCGGAGCTTCCCGCCCACGTCCCCTTCCTTGGGGCCTGA

mRNA sequence

ATGATTCAAGGGAGACAACGAGGATTTGGAGGAAGGCCTTTATGGAAAATCAGAATGCAAATCCGTTTGTTGCTTCCCCTGAAACTGTTGTTGATCCTAATTGCCGGAGCTTCCAATCATGTGACTACTGACTACAACAACTTGGCTAATCCAACCGAATATGAAGGTAAAGAACAAGTATCTGCTAGTAATGGTAGTCAACTTAAAATAGCTTTTGTTGGTAATGCTTGTCTATCGGCCGGAAATATGAAGTTTAATTTGGAAAAAGTTTTGTGTGTTCCAAATATAGTTAAAAATCTTGTTAGTGTATCCAAGTTGGCTAGAGATAATGGTGTTTTTGTGGAATTTCATGAAAATTTTTGTGTTGTTAAGGACAAGGTTTCGGGCAAGGAGTTGCTGAGAGGGGTGTTCAGTGAAGGGCTCTATTGGTTTAATGGTGCAAAGACAACTGCAATAGATATATCTAGTTCAACTACTAACGACAACAAAATCTATAGTATAAATAATGCTGAGCTTTCTGCTTTTGTTGTGTCTCATTCAGTAAATATTACTTTGTCTATAGTAATATGGCATAGGCGTCTTGGCCATCCATCAGAAAAAGTTTTTGAATCATTTGCTCAACGATGTTATCTCTCATATAGAGTTAATGAAAAGTCCAAGTTTTGTGAGGCCTTACCCATAAGTTTAGTTAAAGGGTTTAAATATTACATTCTGTTTGTTGACGATTTTAGCCGTTTTGTATGGATTTATCCACTGAAACAGAAAAGTGAAGCATTAGCAGCATTATTTCTACACTTTACAACAATGGTGAAAAATCAGTTTAATAGTAGTATAAAGGCTTTACAACCAGATAATGGCGGGGAATATGCTAGAATACTTAAACTGTGTAATGAATGTGGTGTTCAAACAAGACTCACCCGTCCCTACACCTCTCAGCAAAACGGTAGAGCAGAAAGAAAGCATAGGCACATTGTGGAGACCGATTTAACACTTCTTGCCCAAACATCAATGCCTCTAAGGTACTGGTGGGATGCGTTTTTAGTTGCCTCTATGCTAATAAATGGTCTTCCTTCACAGGTCATTAATGGAAAATCTCCAATGGAGATAATGTATGGCAAGAGCATTGATTTTAGTGCACTCAGGACATTCGGTTGTACGTGTTATTCCTGTCTCCGTCCATATCAAACACACAAATTTCAGTTTCATACTGAAAAGTGTGCTTACTTGGGGCCAAGTCCGATTCATAAAGGTCATATATGCCTCAGTTCCAGCGGTCGGGTGTATGTTTCGCGCCATGTCTCTTCTGTTATTTTCTTAAGTTGGCTTCCTATCCGGGGGATATTCCAACCAAGTCCAAGTATTCTAGCTCCCGAGCCCCATAATTCTCTATTTGTAAAATGCATGGTGTTTTTACTAGAAATACCAAATGGGTGCTTTGTGATGGCAGTAGACTCCGATTTTGGCATGACAGTTGGATTGGTAATGGCCCGCTTAAAGAGGCACATCCGTGCATGTTTCTTCGCTATCACCATCAACAAGGACCTCCTAGTCTCCGAAGATCTCTCGGAGCTTCCCGCCCACGTCCCCTTCCTTGGGGCCTGA

Coding sequence (CDS)

ATGATTCAAGGGAGACAACGAGGATTTGGAGGAAGGCCTTTATGGAAAATCAGAATGCAAATCCGTTTGTTGCTTCCCCTGAAACTGTTGTTGATCCTAATTGCCGGAGCTTCCAATCATGTGACTACTGACTACAACAACTTGGCTAATCCAACCGAATATGAAGGTAAAGAACAAGTATCTGCTAGTAATGGTAGTCAACTTAAAATAGCTTTTGTTGGTAATGCTTGTCTATCGGCCGGAAATATGAAGTTTAATTTGGAAAAAGTTTTGTGTGTTCCAAATATAGTTAAAAATCTTGTTAGTGTATCCAAGTTGGCTAGAGATAATGGTGTTTTTGTGGAATTTCATGAAAATTTTTGTGTTGTTAAGGACAAGGTTTCGGGCAAGGAGTTGCTGAGAGGGGTGTTCAGTGAAGGGCTCTATTGGTTTAATGGTGCAAAGACAACTGCAATAGATATATCTAGTTCAACTACTAACGACAACAAAATCTATAGTATAAATAATGCTGAGCTTTCTGCTTTTGTTGTGTCTCATTCAGTAAATATTACTTTGTCTATAGTAATATGGCATAGGCGTCTTGGCCATCCATCAGAAAAAGTTTTTGAATCATTTGCTCAACGATGTTATCTCTCATATAGAGTTAATGAAAAGTCCAAGTTTTGTGAGGCCTTACCCATAAGTTTAGTTAAAGGGTTTAAATATTACATTCTGTTTGTTGACGATTTTAGCCGTTTTGTATGGATTTATCCACTGAAACAGAAAAGTGAAGCATTAGCAGCATTATTTCTACACTTTACAACAATGGTGAAAAATCAGTTTAATAGTAGTATAAAGGCTTTACAACCAGATAATGGCGGGGAATATGCTAGAATACTTAAACTGTGTAATGAATGTGGTGTTCAAACAAGACTCACCCGTCCCTACACCTCTCAGCAAAACGGTAGAGCAGAAAGAAAGCATAGGCACATTGTGGAGACCGATTTAACACTTCTTGCCCAAACATCAATGCCTCTAAGGTACTGGTGGGATGCGTTTTTAGTTGCCTCTATGCTAATAAATGGTCTTCCTTCACAGGTCATTAATGGAAAATCTCCAATGGAGATAATGTATGGCAAGAGCATTGATTTTAGTGCACTCAGGACATTCGGTTGTACGTGTTATTCCTGTCTCCGTCCATATCAAACACACAAATTTCAGTTTCATACTGAAAAGTGTGCTTACTTGGGGCCAAGTCCGATTCATAAAGGTCATATATGCCTCAGTTCCAGCGGTCGGGTGTATGTTTCGCGCCATGTCTCTTCTGTTATTTTCTTAAGTTGGCTTCCTATCCGGGGGATATTCCAACCAAGTCCAAGTATTCTAGCTCCCGAGCCCCATAATTCTCTATTTGTAAAATGCATGGTGTTTTTACTAGAAATACCAAATGGGTGCTTTGTGATGGCAGTAGACTCCGATTTTGGCATGACAGTTGGATTGGTAATGGCCCGCTTAAAGAGGCACATCCGTGCATGTTTCTTCGCTATCACCATCAACAAGGACCTCCTAGTCTCCGAAGATCTCTCGGAGCTTCCCGCCCACGTCCCCTTCCTTGGGGCCTGA

Protein sequence

MIQGRQRGFGGRPLWKIRMQIRLLLPLKLLLILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYRVNEKSKFCEALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLSSSGRVYVSRHVSSVIFLSWLPIRGIFQPSPSILAPEPHNSLFVKCMVFLLEIPNGCFVMAVDSDFGMTVGLVMARLKRHIRACFFAITINKDLLVSEDLSELPAHVPFLGA
Homology
BLAST of Lag0006769 vs. NCBI nr
Match: GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])

HSP 1 Score: 374.0 bits (959), Expect = 2.1e-99
Identity = 199/431 (46.17%), Postives = 248/431 (57.54%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT       + TE+ GK  +   NG +L I   G++ L +     NL  +L VP
Sbjct: 314 SGASNHVTHQTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKLKS----LNLHDILYVP 373

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
           NI KNL+SVSKLA DN + VEF EN C VKDK++GK +L+G+  +GLY  +G K      
Sbjct: 374 NITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLSGTKRNP--- 433

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
                             SAFV   SV  +     WHRRLGHP+ KV +   + C +   
Sbjct: 434 ------------------SAFV---SVKES-----WHRRLGHPNNKVLDKVLESCKVKVP 493

Query: 215 VNEKSKFCEA--------------------------------LPISLVKGFKYYILFVDD 274
            ++   FCEA                                 PI    GFKYY+ FVDD
Sbjct: 494 PSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDD 553

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           FSRF WIYPLKQKSE + A F+ F  + +NQFN  IK +Q D GGEY  + KL  E G+Q
Sbjct: 554 FSRFTWIYPLKQKSETVQA-FIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQ 613

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPSQV  
Sbjct: 614 FRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQ 673

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
            +SP  +M  K  D+  L+TFGC CY CL+PY  HK Q+HT +C +LG S  HKG+ CL+
Sbjct: 674 NESPYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLN 710

BLAST of Lag0006769 vs. NCBI nr
Match: PNY02796.1 (copia protein (gag-int-pol protein), partial [Trifolium pratense])

HSP 1 Score: 357.5 bits (916), Expect = 2.1e-94
Identity = 192/431 (44.55%), Postives = 244/431 (56.61%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT   +   + T + GK  +   NG +LKI   G+  L       NL  VL VP
Sbjct: 41  SGASNHVTHQTDKFQDLTGHNGKNSLMVGNGEKLKIVASGSTKLK----NLNLYDVLYVP 100

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
            I KNL+SVSKL  DN + VEF  + C VKDK++GK LL+G   EGLY  +       ++
Sbjct: 101 EITKNLLSVSKLTADNNIIVEFDADCCSVKDKLTGKALLKGKLKEGLYQVS-------NV 160

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
           SS +  D   Y           V  S         WHR+LGHP+ KV +   + C +   
Sbjct: 161 SSQSNKDACTY---------MSVKES---------WHRKLGHPNNKVLDKVLKHCNVKTS 220

Query: 215 VNEKSKFCEA--------------------------------LPISLVKGFKYYILFVDD 274
            +++ KFCEA                                 PI    GFKYY+ F+DD
Sbjct: 221 SSDQFKFCEACQFGKLHLLPFKSSYSHAQEPLDLIHTDVWGPAPIMSNSGFKYYVHFIDD 280

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           FSRF WIYPLKQKSE + A F  F T+V+NQFN  IK +Q D GGEY  + KL  E G+Q
Sbjct: 281 FSRFTWIYPLKQKSETIHA-FTQFKTLVENQFNKRIKIVQCDGGGEYKAVQKLALEAGIQ 340

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRH+ E  LT+LAQ  MPL YWW+AF  +  LIN LPS +  
Sbjct: 341 FRMSCPYTSQQNGRAERKHRHVAELGLTMLAQARMPLCYWWEAFSTSVYLINRLPSSINQ 400

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
              P  ++Y K  D+S L+ FGC CY CL+PY  HK QFHT +C +LG S  HKG+ C++
Sbjct: 401 NACPYTLIYKKEPDYSVLKPFGCACYPCLKPYNKHKLQFHTTRCVFLGYSNSHKGYKCIN 441

BLAST of Lag0006769 vs. NCBI nr
Match: PNX76291.1 (gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense])

HSP 1 Score: 356.3 bits (913), Expect = 4.6e-94
Identity = 188/431 (43.62%), Postives = 250/431 (58.00%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT   +   N +E+ GK  +   NG +L+I   G++ L +     NL  +L VP
Sbjct: 323 SGASNHVTHQTDKFQNLSEHHGKNSLIVGNGEKLEIVATGSSKLKS----LNLHDILYVP 382

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
            I KNL+SVSKLA DN + VEF EN C VKDK++GK +LRG+  +GL             
Sbjct: 383 KITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKAILRGILKDGL------------- 442

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
                     Y ++  + SA+     V+I  S   WHR+LGHP+ KV +   + C +   
Sbjct: 443 ----------YQLSEKDSSAY-----VSIKES---WHRKLGHPNNKVLDIVLKSCNVKLS 502

Query: 215 VNEKSKFCEA--------------------------------LPISLVKGFKYYILFVDD 274
            +++  FCEA                                 PI    GFKYY+ F+DD
Sbjct: 503 PSDQFSFCEACQYGKMHFLPFKTSFSHAKEILELVHTDVWGPAPIISSSGFKYYVHFIDD 562

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           F+RF WIYPLKQKS+  A  F+ F  MV+NQF+  IK +Q D GGEY  + K   E G+Q
Sbjct: 563 FTRFTWIYPLKQKSDT-AHAFIQFKNMVENQFSKKIKTIQCDGGGEYKPVQKHAIEAGIQ 622

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPS V +
Sbjct: 623 FRMSCPYTSQQNGRAERKHRHIAEFGLTLLAQAKMPLNYWWEAFSTAVYLINRLPSSVTH 682

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
            KSP  +++ +  D+++L+ FGC CY  L+PY  HK QFHT +C +LG S  HKG+ C++
Sbjct: 683 NKSPYSLLHKREPDYNSLKPFGCACYPFLKPYNKHKLQFHTTRCVFLGYSNSHKGYKCVN 717

BLAST of Lag0006769 vs. NCBI nr
Match: KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 356.3 bits (913), Expect = 4.6e-94
Identity = 194/420 (46.19%), Postives = 253/420 (60.24%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT D N +    E +GK  ++  NG+ LKI   G++ L       NL+ +L VP
Sbjct: 237 SGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIACGDSSLDTQQKSLNLKDILYVP 296

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTA--- 154
            I KNL+S+SKL  DN ++VEFH+  C VKDK++G+ LL G   +GLY   G  T+    
Sbjct: 297 KITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDGLYQLPGGSTSTNKR 356

Query: 155 --IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRC 214
             +  S   T   K+   N+  L+   V    NI  S          P E  FE F + C
Sbjct: 357 PHVFFSIKETWHRKLGHPNSKVLNE--VMKLCNIEAS----------PCEN-FE-FCEAC 416

Query: 215 YLSYRVN---EKSKFCE-------------ALPISLVKGFKYYILFVDDFSRFVWIYPLK 274
                 N   + S  C                PIS V GFKYY+LF+DD+SRF WIYPLK
Sbjct: 417 QFGKAHNLPFQNSVSCAKEPLDLVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLK 476

Query: 275 QKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQ 334
           QKS+   A F+ F  +V+NQFN  IK LQ D GGE+  + K+  + G+Q R + PYTS Q
Sbjct: 477 QKSDVFQA-FIQFRNLVENQFNKRIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQ 536

Query: 335 NGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGK 394
           NGRAERKHRH+VE+ LTLLAQ  MPL YWW+AF  A  LIN LP+QVI  KSP + ++ K
Sbjct: 537 NGRAERKHRHVVESGLTLLAQAKMPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDK 596

Query: 395 SIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLSSSGRVYVSRHV 434
           + D++A++TFGC CY CL+PY  HK QFHT KC +LG S  HKG+ CL+S+GR+++SRHV
Sbjct: 597 NPDYTAMKTFGCACYPCLKPYNQHKLQFHTTKCVFLGYSGSHKGYKCLNSTGRIFISRHV 641

BLAST of Lag0006769 vs. NCBI nr
Match: GAU51268.1 (hypothetical protein TSUD_412550 [Trifolium subterraneum])

HSP 1 Score: 350.1 bits (897), Expect = 3.3e-92
Identity = 185/431 (42.92%), Postives = 248/431 (57.54%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GA+NHVT   +      E+ GK  +   NG +LKI   G+  L+      NL  VL VP
Sbjct: 316 SGANNHVTHQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLN----NLNLHDVLYVP 375

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
            I KNL+SVSKL  DN + VEF  N C VKDK++G+ LL+G   +GL             
Sbjct: 376 QITKNLLSVSKLTADNNILVEFDANCCSVKDKLTGQTLLKGRLKDGL------------- 435

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
                     Y ++N E   ++   SV  +     WHR+LGHP+ KV +   + C +   
Sbjct: 436 ----------YQLSNKEPCVYM---SVKES-----WHRKLGHPNNKVLDKVLKDCNVKIS 495

Query: 215 VNEKSKFCEAL-------------------PISLV-------------KGFKYYILFVDD 274
            +++  FCEA                    P++L+              GFKYY+ F+DD
Sbjct: 496 HSDQFSFCEACQFGKLHLLPFKPSSSHVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDD 555

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           FSRF WI+PLKQKS+ + A F+ F  + +NQFN  IK +Q D GGEY  + K+  E G+Q
Sbjct: 556 FSRFTWIFPLKQKSDTIHA-FIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQ 615

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRH+ E  LTLLAQ  MPLRYWW+AF  A  LIN LPS V  
Sbjct: 616 FRMSCPYTSQQNGRAERKHRHVAELGLTLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNP 675

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
            +SP  +M+ +  D++AL+ FGC CY CL+PY  HK QFHT +C ++G S  HKG+ C++
Sbjct: 676 NESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCIN 710

BLAST of Lag0006769 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 261.5 bits (667), Expect = 2.0e-68
Identity = 151/436 (34.63%), Postives = 228/436 (52.29%), Query Frame = 0

Query: 32  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVL 91
           +L +GA++H+T+D+NNL+    Y G + V  ++GS + I+  G+  LS  +   NL  +L
Sbjct: 332 LLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNIL 391

Query: 92  CVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTA 151
            VPNI KNL+SV +L   NGV VEF      VKD  +G  LL+G   + LY +  A +  
Sbjct: 392 YVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQP 451

Query: 152 IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYL 211
           + + +S ++                 +HS         WH RLGHP+  +  S      L
Sbjct: 452 VSLFASPSSK---------------ATHS--------SWHARLGHPAPSILNSVISNYSL 511

Query: 212 SYRVNEKSKFCE---------------------------------ALPISLVKGFKYYIL 271
           S  +N   KF                                   + PI     ++YY++
Sbjct: 512 SV-LNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSSPILSHDNYRYYVI 571

Query: 272 FVDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNE 331
           FVD F+R+ W+YPLKQKS+ +   F+ F  +++N+F + I     DNGGE+  + +  ++
Sbjct: 572 FVDHFTRYTWLYPLKQKSQ-VKETFITFKNLLENRFQTRIGTFYSDNGGEFVALWEYFSQ 631

Query: 332 CGVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPS 391
            G+    + P+T + NG +ERKHRHIVET LTLL+  S+P  YW  AF VA  LIN LP+
Sbjct: 632 HGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPT 691

Query: 392 QVINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGH 434
            ++  +SP + ++G S ++  LR FGC CY  LRPY  HK    + +C +LG S     +
Sbjct: 692 PLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAY 742

BLAST of Lag0006769 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 1.3e-67
Identity = 154/435 (35.40%), Postives = 235/435 (54.02%), Query Frame = 0

Query: 32  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVL 91
           +L +GA++H+T+D+NNL+    Y G + V  ++GS + I   G+A L   +   +L KVL
Sbjct: 311 LLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNKVL 370

Query: 92  CVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTA 151
            VPNI KNL+SV +L   N V VEF      VKD  +G  LL+G   + LY +  A + A
Sbjct: 371 YVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASSQA 430

Query: 152 IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQR--- 211
           + + +S  +                 +HS         WH RLGHPS  +  S       
Sbjct: 431 VSMFASPCSK---------------ATHS--------SWHSRLGHPSLAILNSVISNHSL 490

Query: 212 --------------CYL--SYRVN------EKSKFCEAL-------PISLVKGFKYYILF 271
                         C++  S++V         SK  E +       PI  +  ++YY++F
Sbjct: 491 PVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPILSIDNYRYYVIF 550

Query: 272 VDDFSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNEC 331
           VD F+R+ W+YPLKQKS+ +   F+ F ++V+N+F + I  L  DNGGE+  +    ++ 
Sbjct: 551 VDHFTRYTWLYPLKQKSQ-VKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQH 610

Query: 332 GVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQ 391
           G+    + P+T + NG +ERKHRHIVE  LTLL+  S+P  YW  AF VA  LIN LP+ 
Sbjct: 611 GISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTP 670

Query: 392 VINGKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHI 434
           ++  +SP + ++G+  ++  L+ FGC CY  LRPY  HK +  +++CA++G S     ++
Sbjct: 671 LLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYL 721

BLAST of Lag0006769 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 4.9e-30
Identity = 105/386 (27.20%), Postives = 167/386 (43.26%), Query Frame = 0

Query: 60  VSASNGSQLKIAFVGNACLSAG-NMKFNLEKVLCVPNIVKNLVSVSKLARDNGVFVEFHE 119
           V   N S  KIA +G+ C+         L+ V  VP++  NL+S   L RD       ++
Sbjct: 322 VKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQ 381

Query: 120 NFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDISSSTTNDNKIYSINNAELSAFVVS 179
            + + K  +    + +GV    LY  N                     I   EL+A    
Sbjct: 382 KWRLTKGSL---VIAKGVARGTLYRTNA-------------------EICQGELNA---- 441

Query: 180 HSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSY------------------RVNEK-- 239
                 +S+ +WH+R+GH SEK  +  A++  +SY                  RV+ +  
Sbjct: 442 --AQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTS 501

Query: 240 ------------SKFCEALPISLVKGFKYYILFVDDFSRFVWIYPLKQKSEALAALFLHF 299
                       S  C  + I  + G KY++ F+DD SR +W+Y LK K +    +F  F
Sbjct: 502 SERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVF-QVFQKF 561

Query: 300 TTMVKNQFNSSIKALQPDNGGEYA--RILKLCNECGVQTRLTRPYTSQQNGRAERKHRHI 359
             +V+ +    +K L+ DNGGEY      + C+  G++   T P T Q NG AER +R I
Sbjct: 562 HALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTI 621

Query: 360 VETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGKSIDFSALRTFG 411
           VE   ++L    +P  +W +A   A  LIN  PS  +  + P  +   K + +S L+ FG
Sbjct: 622 VEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFG 678

BLAST of Lag0006769 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 99.0 bits (245), Expect = 1.7e-19
Identity = 98/408 (24.02%), Postives = 170/408 (41.67%), Query Frame = 0

Query: 32  ILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVL 91
           +L +GAS+H+  D +   +  E     +++ +   +   A          + +  LE VL
Sbjct: 290 VLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVL 349

Query: 92  CVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKD------KVSGKELLRGVFSEGLYWFN 151
                  NL+SV +L ++ G+ +EF ++   +        K SG      V +   Y  N
Sbjct: 350 FCKEAAGNLMSVKRL-QEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNVPVINFQAYSIN 409

Query: 152 GAKTTAIDI---SSSTTNDNKIYSI------------NNAELSAFVVSHSVNITLSIVIW 211
                   +        +D K+  I            NN ELS  +    +N        
Sbjct: 410 AKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLN-------- 469

Query: 212 HRRLGHPSEKVFESFAQRCYLSYRV-NEKSKFCEALPISLVKGFKYYILFVDDFSRFVWI 271
               G  +   F+    + ++   +    S  C  +    +    Y+++FVD F+ +   
Sbjct: 470 ----GKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVT 529

Query: 272 YPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYA--RILKLCNECGVQTRLTR 331
           Y +K KS+   ++F  F    +  FN  +  L  DNG EY    + + C + G+   LT 
Sbjct: 530 YLIKYKSDVF-SMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTV 589

Query: 332 PYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVI--NGKS 391
           P+T Q NG +ER  R I E   T+++   +   +W +A L A+ LIN +PS+ +  + K+
Sbjct: 590 PHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKT 649

Query: 392 PMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSP 414
           P E+ + K      LR FG T Y  ++  Q  KF   + K  ++G  P
Sbjct: 650 PYEMWHNKKPYLKHLRVFGATVYVHIKNKQ-GKFDDKSFKSIFVGYEP 682

BLAST of Lag0006769 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 3.7e-14
Identity = 96/401 (23.94%), Postives = 157/401 (39.15%), Query Frame = 0

Query: 31  LILIAGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKV 90
           L++ +GAS  +    + L + T       V A     + I  +GN   +  N      K 
Sbjct: 454 LLIDSGASQTLVRSAHYLHHATPNSEINIVDAQK-QDIPINAIGNLHFNFQNGTKTSIKA 513

Query: 91  LCVPNIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTT 150
           L  PNI  +L+S+S+LA  N +   F  N     ++  G  L   V     YW +     
Sbjct: 514 LHTPNIAYDLLSLSELANQN-ITACFTRN---TLERSDGTVLAPIVKHGDFYWLSKKYLI 573

Query: 151 AIDISSSTTND-NKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRC 210
              IS  T N+ NK  S+N                    + HR LGH + +  +   ++ 
Sbjct: 574 PSHISKLTINNVNKSKSVNK---------------YPYPLIHRMLGHANFRSIQKSLKKN 633

Query: 211 YLSYRVNEKSKFCEA----LPISL---------VKGFK---------------------- 270
            ++Y      ++  A     P  L         VKG +                      
Sbjct: 634 AVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVH 693

Query: 271 --------YYILFVDDFSRFVWIYPL-KQKSEALAALFLHFTTMVKNQFNSSIKALQPDN 330
                   Y+I F D+ +RF W+YPL  ++ E++  +F      +KNQFN+ +  +Q D 
Sbjct: 694 HLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDR 753

Query: 331 GGEYAR--ILKLCNECGVQTRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWW 385
           G EY    + K     G+    T    S+ +G AER +R ++    TLL  + +P   W+
Sbjct: 754 GSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWF 813

BLAST of Lag0006769 vs. ExPASy TrEMBL
Match: A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)

HSP 1 Score: 374.0 bits (959), Expect = 1.0e-99
Identity = 199/431 (46.17%), Postives = 248/431 (57.54%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT       + TE+ GK  +   NG +L I   G++ L +     NL  +L VP
Sbjct: 314 SGASNHVTHQTEKFQDLTEHHGKNSLVVGNGEKLAILATGSSKLKS----LNLHDILYVP 373

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
           NI KNL+SVSKLA DN + VEF EN C VKDK++GK +L+G+  +GLY  +G K      
Sbjct: 374 NITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKVILKGLLKDGLYQLSGTKRNP--- 433

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
                             SAFV   SV  +     WHRRLGHP+ KV +   + C +   
Sbjct: 434 ------------------SAFV---SVKES-----WHRRLGHPNNKVLDKVLESCKVKVP 493

Query: 215 VNEKSKFCEA--------------------------------LPISLVKGFKYYILFVDD 274
            ++   FCEA                                 PI    GFKYY+ FVDD
Sbjct: 494 PSDNFSFCEACQYGKMHLLPFKSSSSHAQEPLELVHTDVWGPAPIMTSSGFKYYVHFVDD 553

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           FSRF WIYPLKQKSE + A F+ F  + +NQFN  IK +Q D GGEY  + KL  E G+Q
Sbjct: 554 FSRFTWIYPLKQKSETVQA-FIQFKNLTENQFNKRIKVIQCDGGGEYKPVQKLAVEAGIQ 613

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPSQV  
Sbjct: 614 FRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAFSTAVYLINRLPSQVTQ 673

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
            +SP  +M  K  D+  L+TFGC CY CL+PY  HK Q+HT +C +LG S  HKG+ CL+
Sbjct: 674 NESPYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQYHTTRCVFLGYSNSHKGYKCLN 710

BLAST of Lag0006769 vs. ExPASy TrEMBL
Match: A0A2K3NIC3 (Copia protein (Gag-int-pol protein) (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g026116 PE=4 SV=1)

HSP 1 Score: 357.5 bits (916), Expect = 1.0e-94
Identity = 192/431 (44.55%), Postives = 244/431 (56.61%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT   +   + T + GK  +   NG +LKI   G+  L       NL  VL VP
Sbjct: 41  SGASNHVTHQTDKFQDLTGHNGKNSLMVGNGEKLKIVASGSTKLK----NLNLYDVLYVP 100

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
            I KNL+SVSKL  DN + VEF  + C VKDK++GK LL+G   EGLY  +       ++
Sbjct: 101 EITKNLLSVSKLTADNNIIVEFDADCCSVKDKLTGKALLKGKLKEGLYQVS-------NV 160

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
           SS +  D   Y           V  S         WHR+LGHP+ KV +   + C +   
Sbjct: 161 SSQSNKDACTY---------MSVKES---------WHRKLGHPNNKVLDKVLKHCNVKTS 220

Query: 215 VNEKSKFCEA--------------------------------LPISLVKGFKYYILFVDD 274
            +++ KFCEA                                 PI    GFKYY+ F+DD
Sbjct: 221 SSDQFKFCEACQFGKLHLLPFKSSYSHAQEPLDLIHTDVWGPAPIMSNSGFKYYVHFIDD 280

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           FSRF WIYPLKQKSE + A F  F T+V+NQFN  IK +Q D GGEY  + KL  E G+Q
Sbjct: 281 FSRFTWIYPLKQKSETIHA-FTQFKTLVENQFNKRIKIVQCDGGGEYKAVQKLALEAGIQ 340

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRH+ E  LT+LAQ  MPL YWW+AF  +  LIN LPS +  
Sbjct: 341 FRMSCPYTSQQNGRAERKHRHVAELGLTMLAQARMPLCYWWEAFSTSVYLINRLPSSINQ 400

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
              P  ++Y K  D+S L+ FGC CY CL+PY  HK QFHT +C +LG S  HKG+ C++
Sbjct: 401 NACPYTLIYKKEPDYSVLKPFGCACYPCLKPYNKHKLQFHTTRCVFLGYSNSHKGYKCIN 441

BLAST of Lag0006769 vs. ExPASy TrEMBL
Match: A0A151S6M8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_027809 PE=4 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 2.2e-94
Identity = 194/420 (46.19%), Postives = 253/420 (60.24%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT D N +    E +GK  ++  NG+ LKI   G++ L       NL+ +L VP
Sbjct: 237 SGASNHVTYDQNKVQEVNENDGKSFLTVGNGANLKIIACGDSSLDTQQKSLNLKDILYVP 296

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTA--- 154
            I KNL+S+SKL  DN ++VEFH+  C VKDK++G+ LL G   +GLY   G  T+    
Sbjct: 297 KITKNLLSISKLTFDNDIYVEFHDVACFVKDKLTGRILLEGKIKDGLYQLPGGSTSTNKR 356

Query: 155 --IDISSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRC 214
             +  S   T   K+   N+  L+   V    NI  S          P E  FE F + C
Sbjct: 357 PHVFFSIKETWHRKLGHPNSKVLNE--VMKLCNIEAS----------PCEN-FE-FCEAC 416

Query: 215 YLSYRVN---EKSKFCE-------------ALPISLVKGFKYYILFVDDFSRFVWIYPLK 274
                 N   + S  C                PIS V GFKYY+LF+DD+SRF WIYPLK
Sbjct: 417 QFGKAHNLPFQNSVSCAKEPLDLVHSDVWGPAPISSVSGFKYYVLFLDDWSRFTWIYPLK 476

Query: 275 QKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQTRLTRPYTSQQ 334
           QKS+   A F+ F  +V+NQFN  IK LQ D GGE+  + K+  + G+Q R + PYTS Q
Sbjct: 477 QKSDVFQA-FIQFRNLVENQFNKRIKTLQCDGGGEFKSLSKVLIKTGIQLRESCPYTSAQ 536

Query: 335 NGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVINGKSPMEIMYGK 394
           NGRAERKHRH+VE+ LTLLAQ  MPL YWW+AF  A  LIN LP+QVI  KSP + ++ K
Sbjct: 537 NGRAERKHRHVVESGLTLLAQAKMPLHYWWEAFSTAVFLINRLPTQVIKNKSPYQQLFDK 596

Query: 395 SIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLSSSGRVYVSRHV 434
           + D++A++TFGC CY CL+PY  HK QFHT KC +LG S  HKG+ CL+S+GR+++SRHV
Sbjct: 597 NPDYTAMKTFGCACYPCLKPYNQHKLQFHTTKCVFLGYSGSHKGYKCLNSTGRIFISRHV 641

BLAST of Lag0006769 vs. ExPASy TrEMBL
Match: A0A2K3LCM1 (Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g032236 PE=4 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 2.2e-94
Identity = 188/431 (43.62%), Postives = 250/431 (58.00%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GASNHVT   +   N +E+ GK  +   NG +L+I   G++ L +     NL  +L VP
Sbjct: 323 SGASNHVTHQTDKFQNLSEHHGKNSLIVGNGEKLEIVATGSSKLKS----LNLHDILYVP 382

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
            I KNL+SVSKLA DN + VEF EN C VKDK++GK +LRG+  +GL             
Sbjct: 383 KITKNLLSVSKLAADNNILVEFDENCCFVKDKLTGKAILRGILKDGL------------- 442

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
                     Y ++  + SA+     V+I  S   WHR+LGHP+ KV +   + C +   
Sbjct: 443 ----------YQLSEKDSSAY-----VSIKES---WHRKLGHPNNKVLDIVLKSCNVKLS 502

Query: 215 VNEKSKFCEA--------------------------------LPISLVKGFKYYILFVDD 274
            +++  FCEA                                 PI    GFKYY+ F+DD
Sbjct: 503 PSDQFSFCEACQYGKMHFLPFKTSFSHAKEILELVHTDVWGPAPIISSSGFKYYVHFIDD 562

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           F+RF WIYPLKQKS+  A  F+ F  MV+NQF+  IK +Q D GGEY  + K   E G+Q
Sbjct: 563 FTRFTWIYPLKQKSDT-AHAFIQFKNMVENQFSKKIKTIQCDGGGEYKPVQKHAIEAGIQ 622

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRHI E  LTLLAQ  MPL YWW+AF  A  LIN LPS V +
Sbjct: 623 FRMSCPYTSQQNGRAERKHRHIAEFGLTLLAQAKMPLNYWWEAFSTAVYLINRLPSSVTH 682

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
            KSP  +++ +  D+++L+ FGC CY  L+PY  HK QFHT +C +LG S  HKG+ C++
Sbjct: 683 NKSPYSLLHKREPDYNSLKPFGCACYPFLKPYNKHKLQFHTTRCVFLGYSNSHKGYKCVN 717

BLAST of Lag0006769 vs. ExPASy TrEMBL
Match: A0A2Z6P4D5 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_412550 PE=4 SV=1)

HSP 1 Score: 350.1 bits (897), Expect = 1.6e-92
Identity = 185/431 (42.92%), Postives = 248/431 (57.54%), Query Frame = 0

Query: 35  AGASNHVTTDYNNLANPTEYEGKEQVSASNGSQLKIAFVGNACLSAGNMKFNLEKVLCVP 94
           +GA+NHVT   +      E+ GK  +   NG +LKI   G+  L+      NL  VL VP
Sbjct: 316 SGANNHVTHQTDKFQGFNEHNGKNSLMVGNGEKLKIVASGSTKLN----NLNLHDVLYVP 375

Query: 95  NIVKNLVSVSKLARDNGVFVEFHENFCVVKDKVSGKELLRGVFSEGLYWFNGAKTTAIDI 154
            I KNL+SVSKL  DN + VEF  N C VKDK++G+ LL+G   +GL             
Sbjct: 376 QITKNLLSVSKLTADNNILVEFDANCCSVKDKLTGQTLLKGRLKDGL------------- 435

Query: 155 SSSTTNDNKIYSINNAELSAFVVSHSVNITLSIVIWHRRLGHPSEKVFESFAQRCYLSYR 214
                     Y ++N E   ++   SV  +     WHR+LGHP+ KV +   + C +   
Sbjct: 436 ----------YQLSNKEPCVYM---SVKES-----WHRKLGHPNNKVLDKVLKDCNVKIS 495

Query: 215 VNEKSKFCEAL-------------------PISLV-------------KGFKYYILFVDD 274
            +++  FCEA                    P++L+              GFKYY+ F+DD
Sbjct: 496 HSDQFSFCEACQFGKLHLLPFKPSSSHVQEPLALIHSDVWGPAPILSPSGFKYYVHFIDD 555

Query: 275 FSRFVWIYPLKQKSEALAALFLHFTTMVKNQFNSSIKALQPDNGGEYARILKLCNECGVQ 334
           FSRF WI+PLKQKS+ + A F+ F  + +NQFN  IK +Q D GGEY  + K+  E G+Q
Sbjct: 556 FSRFTWIFPLKQKSDTIHA-FIQFKNLAENQFNKKIKIIQCDGGGEYKAVQKVSIEAGIQ 615

Query: 335 TRLTRPYTSQQNGRAERKHRHIVETDLTLLAQTSMPLRYWWDAFLVASMLINGLPSQVIN 394
            R++ PYTSQQNGRAERKHRH+ E  LTLLAQ  MPLRYWW+AF  A  LIN LPS V  
Sbjct: 616 FRMSCPYTSQQNGRAERKHRHVAELGLTLLAQAKMPLRYWWEAFSTAVYLINRLPSSVNP 675

Query: 395 GKSPMEIMYGKSIDFSALRTFGCTCYSCLRPYQTHKFQFHTEKCAYLGPSPIHKGHICLS 434
            +SP  +M+ +  D++AL+ FGC CY CL+PY  HK QFHT +C ++G S  HKG+ C++
Sbjct: 676 NESPYSLMFKREPDYNALKPFGCACYPCLKPYNQHKLQFHTTRCVFVGYSNSHKGYKCIN 710

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GAU19483.12.1e-9946.17hypothetical protein TSUD_77270 [Trifolium subterraneum][more]
PNY02796.12.1e-9444.55copia protein (gag-int-pol protein), partial [Trifolium pratense][more]
PNX76291.14.6e-9443.62gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium praten... [more]
KYP50444.14.6e-9446.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
GAU51268.13.3e-9242.92hypothetical protein TSUD_412550 [Trifolium subterraneum][more]
Match NameE-valueIdentityDescription
Q94HW22.0e-6834.63Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.3e-6735.40Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109784.9e-3027.20Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041461.7e-1924.02Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q124913.7e-1423.94Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A0A2Z6MBG61.0e-9946.17Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ... [more]
A0A2K3NIC31.0e-9444.55Copia protein (Gag-int-pol protein) (Fragment) OS=Trifolium pratense OX=57577 GN... [more]
A0A151S6M82.2e-9446.19Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A2K3LCM12.2e-9443.62Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) OS=Trifolium prat... [more]
A0A2Z6P4D51.6e-9242.92Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 226..310
e-value: 7.3E-10
score: 39.1
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 193..374
score: 19.207825
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 165..224
e-value: 3.2E-8
score: 33.3
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 219..383
e-value: 6.1E-28
score: 99.5
NoneNo IPR availablePANTHERPTHR11439:SF324RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATEDcoord: 118..430
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 118..430
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 230..382

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0006769.1Lag0006769.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding