Lag0000341 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0000341
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr4: 4668578 .. 4670897 (+)
RNA-Seq ExpressionLag0000341
SyntenyLag0000341
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTTGTCAGAGTGGTATCAGTGCTGCGATGGAGGAGTCTTCAGCTTCCTCTCAAATCTTTGGCTCTGGTAATAAGGTTTTGATTGTGAAACTTACTGATGATAACTTCCTTTTGTGGAAATTTCATATTCAATTTGCTCTTGAGGGTTACGATTTGGAATCACATTTGCATGATAATTCTCCCTCTTAATACTTACAATCCCCTCTGATTCTACCACGGGTGAAGGTTCTTCGGTGGTAAAAACCCCAAATCTAGCTTATACTAAATGTAAACGTCATGACAAAATAATTTCATCATGGCTTGTCGATTCGATGACAGAGGAAATTATTCATCAAATGCTTCACTGTGGAACAACGAAGGAAATCTGGGATTGTCTTGCTCAAATTTTCTCTTCTTGGGATCTTGTCCAGGTGATGAAATTCAAAACTAAATTACAAACTATCCAGAAGGGAGGTATGTCTCTAAAAGAGTATTTCTAAAAAAGACAACAATATGTTGATGCATTGACTGGTGCTGGTAAAACTGTTGAAGTTGAGGATCATATTTTATATATCTTATATGGTCTCGGTTCTGAATACGAATCGATGGTTTCGGTTATTTCGGCAAAGGTAGGTCCTCAATCGGTCCATGAGGTCATGGCCCTCTTGTTTACTCAAGAGAATAGGATTGAGAGTAAACTTGTTCACACTGATACTTCTCTACCATATGTAAATCTCTCCGTTCAATCAAAACCTGCTGATAACGATGCTCAGAAATATAATCCACCTTCGTTTCCTCCTCATTTTAGTGGTGGTAACAGAGGACGTTGGGGTGACCAGTCCAATCGAGGAGGTAGAACATGGAATAATCGAAATAGAATTCAATGCCAATTGTGTGGGAAATTCAATCATACTGCAGTGAAGTGCTATTTTCGATATGCTCCTCCCAGTGCTCCTCCCAATCCAAGTTCGTTTGCTCCTTCTTATAACCAATTTAATCGATCTCCTTCATTTCCTCGGATGAATGTTATGCTCACTGCTCCTGATATTAACCAAGATACGACTTGGTACCCTGACTCCGATACCACGAATCACCTTACTCATAACTTTGGGAATCTCTTCGTTGGAACTGAGTATGGTGGTGTGAATTAAGTCCATGTGGGAAATGGACCAGGTTTGCCTATACTTAACTTTGGATATTCCTCATTTTCTTCTCCTCTTTGTACTGACCGAATGTTCTACTTACATAATCTTCTTCACGTTCCTTCTATTACTAAGAACCTTATTAGTGTCAGTCAATTTGCGCGTAATAATGGTGTCTTTTTTTAATTTCCTCCCACACTATGCTATGTGAAGGATCAAGCGTTTGGTCGAGTTCTGCTCCAAGGGACTCTCCATGAGGGACTTTATCGATTCAATGTCTCTATCTCTCAACAACCATCTCATAAACCAACGGTTCAAGCTCTTCATTCCACCACCACTATTCTGATTCATACTGCTTATCTTTCTGTTTACTCTGTTTCAAATAGTTCTAAATTAGATATTTGGCATAGACGTCTAGGCCATCCAAGTTTGTCCACTGTCAAACATGTGTTACAGTTGTTTAAACCAAATATGTCTATAAATAATATGAAGTTTCAATTCTGTGATGCTTGTGCAATGGAAAAAACTCACTCCCTACCCTTCTCTCCTTCCTCTACTACTTACACTGCTCCTCTTCAGTTAGTTGTATCTGATTTGTGGGGACCTACCTATATTCCTTTAGCAAATGGTTATAGGTACTATATAAGTTTTGTTGATGTGTCTTTAGTAAATATACCTGGATTTATTTCTTAAAATCGAAGTCGGATGCCTTTGATGCTTTTGTTCATATTGAGAAACTTCTAAACTTACCAATTGTGCAATTTCCATCTGATAATGGTGGTGAGTTCCTATGTTTCAAACCATTTTTGGAGTCTCATGGCATTACTCGTAGGTTTTCTTATCCTCACACATCCCAACAAAATGGGATTGCAGAATGCACGCACAGACACATTGTTGATACTGACCTTGCCTTACTCTCTCATTCCTCAATGCCTCTAAAATTCCAGGATGAAGCGTTTTCTATCGCTCTGTTTTTAATTAATAGGCTGCCTTCTGAAGTTCTTCATGGTAGGAGTCCCTTGGAAATCATCTTTAACACTAAACCTGACTATTCTTTTCTTAAGGCCTTTGGTTTCCAATGTTTTCCTTGTCTCTGTCCACATAATCGATCTCATAAGTTAGCCTACAGGTCTACTCCTAGCACCTTTATTGGTTACAACTCATCTCATAAAGGTTTATTGTTGTTTGTCTTCTAA

mRNA sequence

ATGTTTTGTCAGAGTGGTATCAGTGCTGCGATGGAGGAGTCTTCAGCTTCCTCTCAAATCTTTGGCTCTGGTAATAAGGTTTTGATTGTGAAACTTACTGATGATAACTTCCTTTTGTGGAAATTTCATATTCAATTTGCTCTTGAGGGTTCTTCGGTGGTAAAAACCCCAAATCTAGCTTATACTAAATGTAAACGTCATGACAAAATAATTTCATCATGGCTTGTCGATTCGATGACAGAGGAAATTATTCATCAAATGCTTCACTGTGGAACAACGAAGGAAATCTGGGATTGTCTTGCTCAAATTTTCTCTTCTTGGGATCTTGTCCAGGTGATGAAATTCAAAACTAAATTACAAACTATCCAGAAGGGAGTTGAGGATCATATTTTATATATCTTATATGGTCTCGGTTCTGAATACGAATCGATGGTTTCGGTTATTTCGGCAAAGGTAGGTCCTCAATCGGTCCATGAGGTCATGGCCCTCTTGTTTACTCAAGAGAATAGGATTGAGAGTAAACTTGTTCACACTGATACTTCTCTACCATATGTAAATCTCTCCGTTCAATCAAAACCTGCTGATAACGATGCTCAGAAATATAATCCACCTTCGTTTCCTCCTCATTTTAGTGGTGGTAACAGAGGACGTTGGGGTGACCAGTCCAATCGAGGAGGTAGAACATGGAATAATCGAAATAGAATTCAATGCCAATTGTGTGGGAAATTCAATCATACTGCAGTGAAGTGCTATTTTCGATATGCTCCTCCCAGTGCTCCTCCCAATCCAAGTTCGTTTGCTCCTTCTTATAACCAATTTAATCGATCTCCTTCATTTCCTCGGATGAATGTTATGCTCACTGCTCCTGATATTAACCAAGATACGACTTGGTACCCTGACTCCGATACCACGAATCACCTTACTCATAACTTTGGGAATCTCTTCGTTGGAACTGAGTATGGTGGTGATCAAGCGTTTGGTCGAGTTCTGCTCCAAGGGACTCTCCATGAGGGACTTTATCGATTCAATGTCTCTATCTCTCAACAACCATCTCATAAACCAACGGTTCAAGCTCTTCATTCCACCACCACTATTCTGATTCATACTGCTTATCTTTCTGTTTACTCTGTTTCAAATAGTTCTAAATTAGATATTTGGCATAGACGTCTAGGCCATCCAAGTTTGTCCACTGTCAAACATGTGTTACAGTTGTTTAAACCAAATATGTCTATAAATAATATGAAGTTTCAATTCTGTGATGCTTGTGCAATGGAAAAAACTCACTCCCTACCCTTCTCTCCTTCCTCTACTACTTACACTGCTCCTCTTCATAAATATACCTGGATTTATTTCTTAAAATCGAAGTCGGATGCCTTTGATGCTTTTGTTCATATTGAGAAACTTCTAAACTTACCAATTGTGCAATTTCCATCTGATAATGGTGGTGAGTTCCTATGTTTCAAACCATTTTTGGAGTCTCATGGCATTACTCGTAGGTTTTCTTATCCTCACACATCCCAACAAAATGGGATTGCAGAATGCACGCACAGACACATTGTTGATACTGACCTTGCCTTACTCTCTCATTCCTCAATGCCTCTAAAATTCCAGGATGAAGCGTTTTCTATCGCTCTGTTTTTAATTAATAGGCTGCCTTCTGAAGTTCTTCATGGTAGGAGTCCCTTGGAAATCATCTTTAACACTAAACCTGACTATTCTTTTCTTAAGGCCTTTGGTTTCCAATGTTTTCCTTGTCTCTGTCCACATAATCGATCTCATAAGTTAGCCTACAGGTCTACTCCTAGCACCTTTATTGGTTACAACTCATCTCATAAAGGTTTATTGTTGTTTGTCTTCTAA

Coding sequence (CDS)

ATGTTTTGTCAGAGTGGTATCAGTGCTGCGATGGAGGAGTCTTCAGCTTCCTCTCAAATCTTTGGCTCTGGTAATAAGGTTTTGATTGTGAAACTTACTGATGATAACTTCCTTTTGTGGAAATTTCATATTCAATTTGCTCTTGAGGGTTCTTCGGTGGTAAAAACCCCAAATCTAGCTTATACTAAATGTAAACGTCATGACAAAATAATTTCATCATGGCTTGTCGATTCGATGACAGAGGAAATTATTCATCAAATGCTTCACTGTGGAACAACGAAGGAAATCTGGGATTGTCTTGCTCAAATTTTCTCTTCTTGGGATCTTGTCCAGGTGATGAAATTCAAAACTAAATTACAAACTATCCAGAAGGGAGTTGAGGATCATATTTTATATATCTTATATGGTCTCGGTTCTGAATACGAATCGATGGTTTCGGTTATTTCGGCAAAGGTAGGTCCTCAATCGGTCCATGAGGTCATGGCCCTCTTGTTTACTCAAGAGAATAGGATTGAGAGTAAACTTGTTCACACTGATACTTCTCTACCATATGTAAATCTCTCCGTTCAATCAAAACCTGCTGATAACGATGCTCAGAAATATAATCCACCTTCGTTTCCTCCTCATTTTAGTGGTGGTAACAGAGGACGTTGGGGTGACCAGTCCAATCGAGGAGGTAGAACATGGAATAATCGAAATAGAATTCAATGCCAATTGTGTGGGAAATTCAATCATACTGCAGTGAAGTGCTATTTTCGATATGCTCCTCCCAGTGCTCCTCCCAATCCAAGTTCGTTTGCTCCTTCTTATAACCAATTTAATCGATCTCCTTCATTTCCTCGGATGAATGTTATGCTCACTGCTCCTGATATTAACCAAGATACGACTTGGTACCCTGACTCCGATACCACGAATCACCTTACTCATAACTTTGGGAATCTCTTCGTTGGAACTGAGTATGGTGGTGATCAAGCGTTTGGTCGAGTTCTGCTCCAAGGGACTCTCCATGAGGGACTTTATCGATTCAATGTCTCTATCTCTCAACAACCATCTCATAAACCAACGGTTCAAGCTCTTCATTCCACCACCACTATTCTGATTCATACTGCTTATCTTTCTGTTTACTCTGTTTCAAATAGTTCTAAATTAGATATTTGGCATAGACGTCTAGGCCATCCAAGTTTGTCCACTGTCAAACATGTGTTACAGTTGTTTAAACCAAATATGTCTATAAATAATATGAAGTTTCAATTCTGTGATGCTTGTGCAATGGAAAAAACTCACTCCCTACCCTTCTCTCCTTCCTCTACTACTTACACTGCTCCTCTTCATAAATATACCTGGATTTATTTCTTAAAATCGAAGTCGGATGCCTTTGATGCTTTTGTTCATATTGAGAAACTTCTAAACTTACCAATTGTGCAATTTCCATCTGATAATGGTGGTGAGTTCCTATGTTTCAAACCATTTTTGGAGTCTCATGGCATTACTCGTAGGTTTTCTTATCCTCACACATCCCAACAAAATGGGATTGCAGAATGCACGCACAGACACATTGTTGATACTGACCTTGCCTTACTCTCTCATTCCTCAATGCCTCTAAAATTCCAGGATGAAGCGTTTTCTATCGCTCTGTTTTTAATTAATAGGCTGCCTTCTGAAGTTCTTCATGGTAGGAGTCCCTTGGAAATCATCTTTAACACTAAACCTGACTATTCTTTTCTTAAGGCCTTTGGTTTCCAATGTTTTCCTTGTCTCTGTCCACATAATCGATCTCATAAGTTAGCCTACAGGTCTACTCCTAGCACCTTTATTGGTTACAACTCATCTCATAAAGGTTTATTGTTGTTTGTCTTCTAA

Protein sequence

MFCQSGISAAMEESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALEGSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKGVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGGDQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLHKYTWIYFLKSKSDAFDAFVHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFLKAFGFQCFPCLCPHNRSHKLAYRSTPSTFIGYNSSHKGLLLFVF
Homology
BLAST of Lag0000341 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 488.8 bits (1257), Expect = 6.9e-134
Identity = 312/763 (40.89%), Postives = 409/763 (53.60%), Query Frame = 0

Query: 12  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE---------------------- 71
           E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                      
Sbjct: 14  EASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKYLIST 73

Query: 72  ---GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSS 131
               +S   TPN AY   KR D++ISSWL+ SM+EEI++QMLHC + KEIW+ L  IFSS
Sbjct: 74  ESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSS 133

Query: 132 WDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSE 191
             L Q M+FK KL  I+KG                           +DHILYIL GLGS+
Sbjct: 134 RYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSD 193

Query: 192 YESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQK 251
           Y+SM+SVISA+    SV EVM+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ 
Sbjct: 194 YQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI-SETALPSVNIVTQT--TEKGAES 253

Query: 252 Y---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP 311
           Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Sbjct: 254 YIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR--GNRNKPQCQICAKLGYSADRCFFRYTP 313

Query: 312 --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNL 371
              S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL
Sbjct: 314 RSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNL 373

Query: 372 FVGTEYGG---------------------------------------------------- 431
            +G+EYGG                                                    
Sbjct: 374 SIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQ 433

Query: 432 -------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTT 491
                              D   G+VLLQG L++GLY+F +    +PSHK    +  +T 
Sbjct: 434 FAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI----EPSHKRLHHSNSNTK 493

Query: 492 TILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACA 551
            +     + +V   SN+  LD+WHRRLGHP L  VK VL     N S    K  FC+ACA
Sbjct: 494 PV-----FNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHI-DNSSGTINKLNFCEACA 553

Query: 552 MEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS 611
           + K H+LPFS S T YT PL                              +YTWIYFL S
Sbjct: 554 LGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNS 613

Query: 612 KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNG 614
           KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN 
Sbjct: 614 KSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQND 673

BLAST of Lag0000341 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 488.8 bits (1257), Expect = 6.9e-134
Identity = 312/763 (40.89%), Postives = 409/763 (53.60%), Query Frame = 0

Query: 12  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE---------------------- 71
           E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                      
Sbjct: 14  EASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKYLIST 73

Query: 72  ---GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSS 131
               +S   TPN AY   KR D++ISSWL+ SM+EEI++QMLHC + KEIW+ L  IFSS
Sbjct: 74  ESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSS 133

Query: 132 WDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSE 191
             L Q M+FK KL  I+KG                           +DHILYIL GLGS+
Sbjct: 134 RYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSD 193

Query: 192 YESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQK 251
           Y+SM+SVISA+    SV EVM+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ 
Sbjct: 194 YQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI-SETALPSVNIVTQT--TEKGAES 253

Query: 252 Y---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP 311
           Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Sbjct: 254 YIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR--GNRNKPQCQICAKLGYSADRCFFRYTP 313

Query: 312 --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNL 371
              S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL
Sbjct: 314 RSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNL 373

Query: 372 FVGTEYGG---------------------------------------------------- 431
            +G+EYGG                                                    
Sbjct: 374 SIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQ 433

Query: 432 -------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTT 491
                              D   G+VLLQG L++GLY+F +    +PSHK    +  +T 
Sbjct: 434 FAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI----EPSHKRLHHSNSNTK 493

Query: 492 TILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACA 551
            +     + +V   SN+  LD+WHRRLGHP L  VK VL     N S    K  FC+ACA
Sbjct: 494 PV-----FNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHI-DNSSGTINKLNFCEACA 553

Query: 552 MEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS 611
           + K H+LPFS S T YT PL                              +YTWIYFL S
Sbjct: 554 LGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNS 613

Query: 612 KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNG 614
           KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN 
Sbjct: 614 KSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQND 673

BLAST of Lag0000341 vs. NCBI nr
Match: KZV26181.1 (hypothetical protein F511_06348 [Dorcoceras hygrometricum])

HSP 1 Score: 362.1 bits (928), Expect = 9.7e-96
Identity = 242/697 (34.72%), Postives = 343/697 (49.21%), Query Frame = 0

Query: 50  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDL 109
           G++ V  PN  +    R D+++ S+L+ SM+E    QM+ C T+ ++W  + Q+F++   
Sbjct: 6   GAAEVMNPN--FVTWNRQDQLLFSFLLASMSESAQSQMIGCQTSSQLWTRVTQLFATRSK 65

Query: 110 VQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYES 169
            +VM++K +LQT++KG                           +D IL+IL G+G EYES
Sbjct: 66  ARVMQYKLQLQTLKKGNLSMKDYLGKMKGYIDILAACGNSIPEDDQILHILGGVGPEYES 125

Query: 170 MVSVISAKVGPQSVHEVMALLFTQENRIES-KLVHTDTSLPYVNLSVQSKPADNDAQKYN 229
           +V  ++++V   S+ EV ALL   E RIE+  +    T+ P VN  V + P+   A+  N
Sbjct: 126 VVVHVTSRVESLSLSEVGALLLAHEGRIETYNITGGHTASPSVN--VTTAPSQRKAE--N 185

Query: 230 PPSFPPHFSGGNRGRWGDQSNRGGR-TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPP 289
                P + G  RGR G    RGGR  W+N  R  CQ+CG   H A  CY+R+     P 
Sbjct: 186 TSQSQPVYRGRGRGRNG----RGGRKPWHNNGRPVCQICGIPGHVAEICYYRFDKEFVPK 245

Query: 290 NPSSFAPSYNQFNR-SPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEY 349
           +      S  QFNR SPS+P      T  +   +  WYPDS  ++H+T++ GNL V +EY
Sbjct: 246 SSGVSRTSQQQFNRSSPSYPPSAFASTKSESASEEWWYPDSGASHHVTNDLGNLSVSSEY 305

Query: 350 GG---------------------------------------------------------- 409
            G                                                          
Sbjct: 306 TGGSKVQVGNGAGLSISNIGESNLNMFPSSRPFLLKNLLHVPLITKNLISVSKFAYDNHV 365

Query: 410 ------------DQAFGRVLLQGTLHEGLYRFNV-SISQQPSHKPTVQALHSTTTILIHT 469
                       D A   VLL+GTLH GLYRFN+ S    P H P    L S+ + +   
Sbjct: 366 YFEFHPSFCLVKDPATHVVLLRGTLHNGLYRFNLKSRISGPLHSPA--CLQSSVSPIKVP 425

Query: 470 AYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHS 529
               +    N+  LD WH RLGHPS++TVK VL      +S N+    FC +C + K H 
Sbjct: 426 DQSPLCLPQNT--LDKWHLRLGHPSIATVKQVLLDCNERISKND-NISFCSSCQLGKNHL 485

Query: 530 LPFSPSSTTYTAPLH-----------------------------KYTWIYFLKSKSDAFD 589
           LPF  S+T ++AP                               +YTWIYFLK KS+   
Sbjct: 486 LPFPQSTTNFSAPFEVVYSDLWGPAHIPSRNGSRYYISFVDAYTRYTWIYFLKLKSEVTQ 545

Query: 590 AFVHIEKL----LNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTH 614
            F++ +K      N  I    +D GGEF     + +S+GI  RFS P+TS+QNG+ E  H
Sbjct: 546 TFINFQKYTELHFNAKIKTLQTDGGGEFRSLTAYCQSNGILHRFSCPYTSKQNGVVERKH 605

BLAST of Lag0000341 vs. NCBI nr
Match: RVW60229.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 353.2 bits (905), Expect = 4.5e-93
Identity = 247/782 (31.59%), Postives = 354/782 (45.27%), Query Frame = 0

Query: 25  NKVLIVKLTDDNFLLWKFHIQFALEGSSV--------------------VKTPNLAYTKC 84
           ++++ ++L DDNFL+WK+ I+ A+ G  +                    V  PN  +   
Sbjct: 147 SQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMVTDKIGVLVPNPKFRDY 206

Query: 85  KRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQK 144
           +R D ++ SWL+ S+    + Q++ C +  E+W+ ++Q F+S    +VM +K+++Q ++K
Sbjct: 207 QRQDHLLISWLLSSIGSAFLPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYKSQMQMLKK 266

Query: 145 --------------------------GVEDHILYILYGLGSEYESMVSVISAKVGPQSVH 204
                                        DHIL I+ GLG EYES+++VIS+K    S+ 
Sbjct: 267 DGLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQ 326

Query: 205 EVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGG--NRG 264
            V + L   E RI  K+   D S+ Y +      P+ +    +N   +P   S G  NR 
Sbjct: 327 YVTSTLIAHEGRIAHKISSNDLSVNYTSQYSNRGPSSS----WNSNGYP---SSGFQNRN 386

Query: 265 RWGDQSNRGGRTWNNRNR----------IQCQLCGKFNHTAVKCYFRYAP------PSAP 324
           ++G      G   +NR R           QCQLC KF HT  +C++RY P      P+  
Sbjct: 387 QFGGNQVTRGSFVHNRGRGRGRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANG 446

Query: 325 PNPS--------------SFAPSYN----QFNRSPSFPRMNVMLTAPDINQDTTWYPDSD 384
           P P               S A + N        +  +  M  M+  P+  Q+  W+PDS 
Sbjct: 447 PTPGVLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSG 506

Query: 385 TTNHLTHNFGNLFVGTEYGG---------------------------------------- 444
            TNH+TH+ GNL  G EY G                                        
Sbjct: 507 ATNHVTHDLGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRV 566

Query: 445 -------------------------------DQAFGRVLLQGTLHEGLYRFNVS-----I 504
                                          D++   +LLQG LH+GLY+FN+S      
Sbjct: 567 PAIKKNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGK 626

Query: 505 SQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSS--KLDIWHRRLGHPSLSTVKHVLQL 564
           +   S       L      L+H         +NSS    D+WH+RLGHP+   V  VL  
Sbjct: 627 ASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLND 686

Query: 565 FKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-------------------- 614
            K   S  +     C AC + K+H+LPF  S T YT PL                     
Sbjct: 687 NKIPFSTKSGS-SICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTY 746

BLAST of Lag0000341 vs. NCBI nr
Match: KAF7814697.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora])

HSP 1 Score: 347.8 bits (891), Expect = 1.9e-91
Identity = 242/730 (33.15%), Postives = 344/730 (47.12%), Query Frame = 0

Query: 30  VKLTDDNFLLWKFHI---------------------QFALEGSSVVKTPNLAYTKCKRHD 89
           +KLT++NFL+WK  I                     +FA    +  +  N  Y      D
Sbjct: 34  IKLTEENFLVWKMQITTTINGFNLQKYLIGDKFIPDKFATAEDAAAEKINQDYLHWVNQD 93

Query: 90  KIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQKG--- 149
           +++ SWL+ SMTEE++++ + C TTK++W+ L   +S+    +  + + +L+  +KG   
Sbjct: 94  QLLMSWLISSMTEEMVNKFVECSTTKDLWEQLRTYYSTNTKPKERQLRNQLRETKKGNSA 153

Query: 150 -----------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMA 209
                                    DH+  I  GL  EYES V+ IS +    SV E+ A
Sbjct: 154 MNDYLSKIKKITNALASIGASLSTHDHVETIFDGLSEEYESFVTSISLRTEEYSVSEIEA 213

Query: 210 LLFTQENRIE--SKLVHT-DTSLPYVNLSVQSKPADNDA--QKYNPPSFPPHFSGGNRGR 269
           LL  QE R+E   K V +   ++  V+LS ++ P+ N    Q +N        +G  RG 
Sbjct: 214 LLLAQEARVEKFKKTVESVSANMTLVDLS-KNPPSRNQTSNQSFN------RNNGQGRGN 273

Query: 270 WGDQSNRGGR------TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYN 329
           + + + + GR      TW   NR QCQ+CGK  H AV CY R+       +         
Sbjct: 274 FRNNNFQRGRGRSSISTWQG-NRPQCQVCGKMGHIAVNCYNRFNQRYTEASLLQQQIVNQ 333

Query: 330 QFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNL-----FVGTE---YGGD 389
           Q NR P+   M  M+  P+   D  WYPDS  +NHLT++  NL     + GTE    G  
Sbjct: 334 QTNRQPAPSTMEAMVATPETLFDAAWYPDSGASNHLTNDSTNLQHKHPYDGTEKVYVGNG 393

Query: 390 QAFG--------------RVLLQGTLHEGLYRFN-VSISQQPSHKPT----------VQA 449
           Q                  + L+  +H      N +S+S+                 V++
Sbjct: 394 QGMSISHIGEASIVTKNKPLTLKQLIHAPQITKNLISVSKLAKDNKVYFEFHANHCLVKS 453

Query: 450 LHSTTTILIHTAYLSVYSVSNSSKL----------------------DIWHRRLGHPSLS 509
             +  T+L  +    +Y V + S L                      ++WH RLGHPS  
Sbjct: 454 QETNETLLKGSFRNGLYCVDDLSLLHHQPQPHITAHIASTQSTIKDFNVWHSRLGHPSSR 513

Query: 510 TVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH------------ 569
            V HVL     ++ +N      C AC + K+H+LPFS S T+Y+APL             
Sbjct: 514 VVSHVLSTCNISIPMNKTNNSTCHACCLGKSHTLPFSLSQTSYSAPLQLVHTDIWGHAPI 573

Query: 570 -----------------KYTWIYFLKSKSDAFDAFVHIEKL----LNLPIVQFPSDNGGE 614
                            K+TW+Y LK+K DA  AF+  + L    LN  I    SD GGE
Sbjct: 574 LSSTGYTYYISFIDAYSKFTWLYLLKTKGDALQAFIQFKNLAENQLNTKIKAVQSDFGGE 633

BLAST of Lag0000341 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 228.0 bits (580), Expect = 2.9e-58
Identity = 188/700 (26.86%), Postives = 310/700 (44.29%), Query Frame = 0

Query: 29  IVKLTDDNFLLW--KFHIQF-----------------ALEGSSVVKTPNLAYTKCKRHDK 88
           + KLT  N+L+W  + H  F                 A  G+  V   N  YT+ +R DK
Sbjct: 23  VTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDAVPRVNPDYTRWRRQDK 82

Query: 89  IISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFS--SWDLVQVMKFKTKLQTI----- 148
           +I S ++ +++  +   +    T  +IW+ L +I++  S+  V  ++F T+   +     
Sbjct: 83  LIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRFITRFDQLALLGK 142

Query: 149 QKGVEDHILYILYGLGSEYESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSL 208
               ++ +  +L  L  +Y+ ++  I+AK  P S+ E+   L  +E+++ +  +++   +
Sbjct: 143 PMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLA--LNSAEVV 202

Query: 209 PYVNLSVQSKPADNDAQKYNPPSFPPHFSGGNRGRWGDQSNRGGRTWNNRNRI---QCQL 268
           P     V  +  + +  + N      + +  NR      S+ G R+ N + +    +CQ+
Sbjct: 203 PITANVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQI 262

Query: 269 CGKFNHTAVKCYFRYAPPSAPPNPSSFAPSYNQFNRSPSF----PRMNVMLTAPDINQDT 328
           C    H+A +C          P    F  + NQ   +  F    PR N+ + +P      
Sbjct: 263 CSVQGHSAKRC----------PQLHQFQSTTNQQQSTSPFTPWQPRANLAVNSP--YNAN 322

Query: 329 TWYPDSDTTNHLTHNFGNLFVGTEY-GGDQA--------------------------FGR 388
            W  DS  T+H+T +F NL     Y GGD                              +
Sbjct: 323 NWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLDLNK 382

Query: 389 VLLQGTLHEGL---YRF----NVSISQQPSHKPTVQALHSTTTILIHTAYLSVYS--VSN 448
           VL    +H+ L   YR      VS+   P+    V+ L++   +L       +Y   +++
Sbjct: 383 VLYVPNIHKNLISVYRLCNTNRVSVEFFPA-SFQVKDLNTGVPLLQGKTKDELYEWPIAS 442

Query: 449 SSKLDI------------WHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKT 508
           S  + +            WH RLGHPSL+ +  V+      +   + K   C  C + K+
Sbjct: 443 SQAVSMFASPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFINKS 502

Query: 509 HSLPFSPSSTTYTAPLH----------------------------KYTWIYFLKSKSDAF 568
           H +PFS S+ T + PL                             +YTW+Y LK KS   
Sbjct: 503 HKVPFSNSTITSSKPLEYIYSDVWSSPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVK 562

Query: 569 DAFV----HIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECT 616
           D F+     +E      I    SDNGGEF+  + +L  HGI+   S PHT + NG++E  
Sbjct: 563 DTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERK 622

BLAST of Lag0000341 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 223.4 bits (568), Expect = 7.1e-57
Identity = 183/679 (26.95%), Postives = 298/679 (43.89%), Query Frame = 0

Query: 50  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDL 109
           G+      N  YT+ KR DK+I S ++ +++  +   +    T  +IW+ L +I+++   
Sbjct: 63  GTDAAPRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSY 122

Query: 110 VQVMKFKTKLQTIQKGV--------------------------EDHILYILYGLGSEYES 169
             V + +T+L+   KG                           ++ +  +L  L  EY+ 
Sbjct: 123 GHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKP 182

Query: 170 MVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYNP 229
           ++  I+AK  P ++ E+   L   E++I +    T   +    +S ++    N+    N 
Sbjct: 183 VIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNR 242

Query: 230 PSFPPHFSGGNRGRWGDQSNRGGRTWNNRNRI---QCQLCGKFNHTAVKC----YFRYAP 289
            +   + +  N  +   QS+      NN+++    +CQ+CG   H+A +C    +F  + 
Sbjct: 243 NNRYDNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSV 302

Query: 290 PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFV 349
            S  P PS F P           PR N+ L +P       W  DS  T+H+T +F NL +
Sbjct: 303 NSQQP-PSPFTPWQ---------PRANLALGSP--YSSNNWLLDSGATHHITSDFNNLSL 362

Query: 350 GTEY-GGDQA--------------------------FGRVLLQGTLHEGL---YRF---- 409
              Y GGD                               +L    +H+ L   YR     
Sbjct: 363 HQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNAN 422

Query: 410 NVSISQQPSHKPTVQALHSTTTILIHTAYLSVY--------------SVSNSSKLDIWHR 469
            VS+   P+    V+ L++   +L       +Y              S S+ +    WH 
Sbjct: 423 GVSVEFFPA-SFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSKATHSSWHA 482

Query: 470 RLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH---- 529
           RLGHP+ S +  V+  +  ++   + KF  C  C + K++ +PFS S+   T PL     
Sbjct: 483 RLGHPAPSILNSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYS 542

Query: 530 ------------------------KYTWIYFLKSKSDAFDAFVHIEKLL----NLPIVQF 589
                                   +YTW+Y LK KS   + F+  + LL       I  F
Sbjct: 543 DVWSSPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTF 602

Query: 590 PSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTHRHIVDTDLALLSHSSMPLKF 616
            SDNGGEF+    +   HGI+   S PHT + NG++E  HRHIV+T L LLSH+S+P  +
Sbjct: 603 YSDNGGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTY 662

BLAST of Lag0000341 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 92.0 bits (227), Expect = 2.5e-17
Identity = 76/278 (27.34%), Postives = 114/278 (41.01%), Query Frame = 0

Query: 375 YSVS--NSSKLDIWHRRLGHPSLSTVKHVLQ--LFKPNMSINNMKF--QFCDACAMEKTH 434
           YS++  + +   +WH R GH S   +  + +  +F     +NN++   + C+ C   K  
Sbjct: 405 YSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQA 464

Query: 435 SLPFS--PSSTTYTAPLH-----------------------------KYTWIYFLKSKSD 494
            LPF      T    PL                               Y   Y +K KSD
Sbjct: 465 RLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSD 524

Query: 495 AFDAF----VHIEKLLNLPIVQFPSDNGGEFLC--FKPFLESHGITRRFSYPHTSQQNGI 554
            F  F       E   NL +V    DNG E+L    + F    GI+   + PHT Q NG+
Sbjct: 525 VFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGV 584

Query: 555 AECTHRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVL--HGRSPLEIIFNTK 608
           +E   R I +    ++S + +   F  EA   A +LINR+PS  L    ++P E+  N K
Sbjct: 585 SERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKK 644

BLAST of Lag0000341 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 3.3e-134
Identity = 312/763 (40.89%), Postives = 409/763 (53.60%), Query Frame = 0

Query: 12  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE---------------------- 71
           E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                      
Sbjct: 14  EASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKYLIST 73

Query: 72  ---GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSS 131
               +S   TPN AY   KR D++ISSWL+ SM+EEI++QMLHC + KEIW+ L  IFSS
Sbjct: 74  ESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSS 133

Query: 132 WDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSE 191
             L Q M+FK KL  I+KG                           +DHILYIL GLGS+
Sbjct: 134 RYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSD 193

Query: 192 YESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQK 251
           Y+SM+SVISA+    SV EVM+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ 
Sbjct: 194 YQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI-SETALPSVNIVTQT--TEKGAES 253

Query: 252 Y---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP 311
           Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Sbjct: 254 YIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR--GNRNKPQCQICAKLGYSADRCFFRYTP 313

Query: 312 --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNL 371
              S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL
Sbjct: 314 RSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNL 373

Query: 372 FVGTEYGG---------------------------------------------------- 431
            +G+EYGG                                                    
Sbjct: 374 SIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQ 433

Query: 432 -------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTT 491
                              D   G+VLLQG L++GLY+F +    +PSHK    +  +T 
Sbjct: 434 FAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI----EPSHKRLHHSNSNTK 493

Query: 492 TILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACA 551
            +     + +V   SN+  LD+WHRRLGHP L  VK VL     N S    K  FC+ACA
Sbjct: 494 PV-----FNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHI-DNSSGTINKLNFCEACA 553

Query: 552 MEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS 611
           + K H+LPFS S T YT PL                              +YTWIYFL S
Sbjct: 554 LGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNS 613

Query: 612 KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNG 614
           KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN 
Sbjct: 614 KSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQND 673

BLAST of Lag0000341 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 3.3e-134
Identity = 312/763 (40.89%), Postives = 409/763 (53.60%), Query Frame = 0

Query: 12  EESSASSQIFGSGNKVLIVKLTDDNFLLWKFHIQFALE---------------------- 71
           E SS  +QIFGSGNK+ +VKL DD FLLWKF I  ALE                      
Sbjct: 14  EASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKYLIST 73

Query: 72  ---GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSS 131
               +S   TPN AY   KR D++ISSWL+ SM+EEI++QMLHC + KEIW+ L  IFSS
Sbjct: 74  ESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQGIFSS 133

Query: 132 WDLVQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSE 191
             L Q M+FK KL  I+KG                           +DHILYIL GLGS+
Sbjct: 134 RYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAGLGSD 193

Query: 192 YESMVSVISAKVGPQSVHEVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQK 251
           Y+SM+SVISA+    SV EVM+LL TQE++ ESKL+ ++T+LP VN+  Q+   +  A+ 
Sbjct: 194 YQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI-SETALPSVNIVTQT--TEKGAES 253

Query: 252 Y---NPPSFPPHFSGGNR-GRWGDQSNRGGRTWNNRNRIQCQLCGKFNHTAVKCYFRYAP 311
           Y   N  ++  + S   R GR   +SNRG R   NRN+ QCQ+C K  ++A +C+FRY P
Sbjct: 254 YIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR--GNRNKPQCQICAKLGYSADRCFFRYTP 313

Query: 312 --PSAPPNPSSFAPSYNQFNRSPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNL 371
              S+  +P+S   SY   N   + P+M+ M+ A D+N D+ WYPDS  TNHLTH+  NL
Sbjct: 314 RSNSSGYSPNSHNTSYTNMN---NHPQMSAMVAALDLNIDSNWYPDSGATNHLTHSLSNL 373

Query: 372 FVGTEYGG---------------------------------------------------- 431
            +G+EYGG                                                    
Sbjct: 374 SIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNLISVSQ 433

Query: 432 -------------------DQAFGRVLLQGTLHEGLYRFNVSISQQPSHKPTVQALHSTT 491
                              D   G+VLLQG L++GLY+F +    +PSHK    +  +T 
Sbjct: 434 FAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI----EPSHKRLHHSNSNTK 493

Query: 492 TILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACA 551
            +     + +V   SN+  LD+WHRRLGHP L  VK VL     N S    K  FC+ACA
Sbjct: 494 PV-----FNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHI-DNSSGTINKLNFCEACA 553

Query: 552 MEKTHSLPFSPSSTTYTAPLH-----------------------------KYTWIYFLKS 611
           + K H+LPFS S T YT PL                              +YTWIYFL S
Sbjct: 554 LGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNS 613

Query: 612 KSDAFDAF----VHIEKLLNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNG 614
           KSDAF AF      +EK L   I    +D G EF  FKPFL+ HGI  R + P+TS+QN 
Sbjct: 614 KSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQND 673

BLAST of Lag0000341 vs. ExPASy TrEMBL
Match: A0A2Z7AWA7 (Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_06348 PE=4 SV=1)

HSP 1 Score: 362.1 bits (928), Expect = 4.7e-96
Identity = 242/697 (34.72%), Postives = 343/697 (49.21%), Query Frame = 0

Query: 50  GSSVVKTPNLAYTKCKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDL 109
           G++ V  PN  +    R D+++ S+L+ SM+E    QM+ C T+ ++W  + Q+F++   
Sbjct: 6   GAAEVMNPN--FVTWNRQDQLLFSFLLASMSESAQSQMIGCQTSSQLWTRVTQLFATRSK 65

Query: 110 VQVMKFKTKLQTIQKG--------------------------VEDHILYILYGLGSEYES 169
            +VM++K +LQT++KG                           +D IL+IL G+G EYES
Sbjct: 66  ARVMQYKLQLQTLKKGNLSMKDYLGKMKGYIDILAACGNSIPEDDQILHILGGVGPEYES 125

Query: 170 MVSVISAKVGPQSVHEVMALLFTQENRIES-KLVHTDTSLPYVNLSVQSKPADNDAQKYN 229
           +V  ++++V   S+ EV ALL   E RIE+  +    T+ P VN  V + P+   A+  N
Sbjct: 126 VVVHVTSRVESLSLSEVGALLLAHEGRIETYNITGGHTASPSVN--VTTAPSQRKAE--N 185

Query: 230 PPSFPPHFSGGNRGRWGDQSNRGGR-TWNNRNRIQCQLCGKFNHTAVKCYFRYAPPSAPP 289
                P + G  RGR G    RGGR  W+N  R  CQ+CG   H A  CY+R+     P 
Sbjct: 186 TSQSQPVYRGRGRGRNG----RGGRKPWHNNGRPVCQICGIPGHVAEICYYRFDKEFVPK 245

Query: 290 NPSSFAPSYNQFNR-SPSFPRMNVMLTAPDINQDTTWYPDSDTTNHLTHNFGNLFVGTEY 349
           +      S  QFNR SPS+P      T  +   +  WYPDS  ++H+T++ GNL V +EY
Sbjct: 246 SSGVSRTSQQQFNRSSPSYPPSAFASTKSESASEEWWYPDSGASHHVTNDLGNLSVSSEY 305

Query: 350 GG---------------------------------------------------------- 409
            G                                                          
Sbjct: 306 TGGSKVQVGNGAGLSISNIGESNLNMFPSSRPFLLKNLLHVPLITKNLISVSKFAYDNHV 365

Query: 410 ------------DQAFGRVLLQGTLHEGLYRFNV-SISQQPSHKPTVQALHSTTTILIHT 469
                       D A   VLL+GTLH GLYRFN+ S    P H P    L S+ + +   
Sbjct: 366 YFEFHPSFCLVKDPATHVVLLRGTLHNGLYRFNLKSRISGPLHSPA--CLQSSVSPIKVP 425

Query: 470 AYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHVLQLFKPNMSINNMKFQFCDACAMEKTHS 529
               +    N+  LD WH RLGHPS++TVK VL      +S N+    FC +C + K H 
Sbjct: 426 DQSPLCLPQNT--LDKWHLRLGHPSIATVKQVLLDCNERISKND-NISFCSSCQLGKNHL 485

Query: 530 LPFSPSSTTYTAPLH-----------------------------KYTWIYFLKSKSDAFD 589
           LPF  S+T ++AP                               +YTWIYFLK KS+   
Sbjct: 486 LPFPQSTTNFSAPFEVVYSDLWGPAHIPSRNGSRYYISFVDAYTRYTWIYFLKLKSEVTQ 545

Query: 590 AFVHIEKL----LNLPIVQFPSDNGGEFLCFKPFLESHGITRRFSYPHTSQQNGIAECTH 614
            F++ +K      N  I    +D GGEF     + +S+GI  RFS P+TS+QNG+ E  H
Sbjct: 546 TFINFQKYTELHFNAKIKTLQTDGGGEFRSLTAYCQSNGILHRFSCPYTSKQNGVVERKH 605

BLAST of Lag0000341 vs. ExPASy TrEMBL
Match: A0A438FJP6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1134 PE=4 SV=1)

HSP 1 Score: 353.2 bits (905), Expect = 2.2e-93
Identity = 247/782 (31.59%), Postives = 354/782 (45.27%), Query Frame = 0

Query: 25  NKVLIVKLTDDNFLLWKFHIQFALEGSSV--------------------VKTPNLAYTKC 84
           ++++ ++L DDNFL+WK+ I+ A+ G  +                    V  PN  +   
Sbjct: 147 SQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMVTDKIGVLVPNPKFRDY 206

Query: 85  KRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQK 144
           +R D ++ SWL+ S+    + Q++ C +  E+W+ ++Q F+S    +VM +K+++Q ++K
Sbjct: 207 QRQDHLLISWLLSSIGSAFLPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYKSQMQMLKK 266

Query: 145 --------------------------GVEDHILYILYGLGSEYESMVSVISAKVGPQSVH 204
                                        DHIL I+ GLG EYES+++VIS+K    S+ 
Sbjct: 267 DGLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQ 326

Query: 205 EVMALLFTQENRIESKLVHTDTSLPYVNLSVQSKPADNDAQKYNPPSFPPHFSGG--NRG 264
            V + L   E RI  K+   D S+ Y +      P+ +    +N   +P   S G  NR 
Sbjct: 327 YVTSTLIAHEGRIAHKISSNDLSVNYTSQYSNRGPSSS----WNSNGYP---SSGFQNRN 386

Query: 265 RWGDQSNRGGRTWNNRNR----------IQCQLCGKFNHTAVKCYFRYAP------PSAP 324
           ++G      G   +NR R           QCQLC KF HT  +C++RY P      P+  
Sbjct: 387 QFGGNQVTRGSFVHNRGRGRGRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANG 446

Query: 325 PNPS--------------SFAPSYN----QFNRSPSFPRMNVMLTAPDINQDTTWYPDSD 384
           P P               S A + N        +  +  M  M+  P+  Q+  W+PDS 
Sbjct: 447 PTPGVLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSG 506

Query: 385 TTNHLTHNFGNLFVGTEYGG---------------------------------------- 444
            TNH+TH+ GNL  G EY G                                        
Sbjct: 507 ATNHVTHDLGNLNSGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRV 566

Query: 445 -------------------------------DQAFGRVLLQGTLHEGLYRFNVS-----I 504
                                          D++   +LLQG LH+GLY+FN+S      
Sbjct: 567 PAIKKNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGK 626

Query: 505 SQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSS--KLDIWHRRLGHPSLSTVKHVLQL 564
           +   S       L      L+H         +NSS    D+WH+RLGHP+   V  VL  
Sbjct: 627 ASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLND 686

Query: 565 FKPNMSINNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH-------------------- 614
            K   S  +     C AC + K+H+LPF  S T YT PL                     
Sbjct: 687 NKIPFSTKSGS-SICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTY 746

BLAST of Lag0000341 vs. ExPASy TrEMBL
Match: A0A438K147 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2516 PE=4 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 1.1e-89
Identity = 239/726 (32.92%), Postives = 349/726 (48.07%), Query Frame = 0

Query: 25  NKVLIVKLTDDNFLLWKFHIQFALEGSSVVK---------------------TPNLAYTK 84
           N  L VKL + NFL+WK  I  A+ G  + K                          + +
Sbjct: 26  NHSLSVKLDNKNFLIWKQQIVSAIRGYGLQKFVFSDDEVPVQFLTREDARSGKATKEFLE 85

Query: 85  CKRHDKIISSWLVDSMTEEIIHQMLHCGTTKEIWDCLAQIFSSWDLVQVMKFKTKLQTIQ 144
            ++ D+++ SWL+ S++E I+ +++ C T+  +W  L Q F+S    +  +FKT+LQ  +
Sbjct: 86  WEQQDQLLLSWLLSSVSESILPRLVGCDTSSLLWGRLEQYFASQTRAKAKQFKTQLQHTK 145

Query: 145 KG--------------------------VEDHILYILYGLGSEYESMVSVISAKVGPQSV 204
           KG                           +DH+  IL GL ++YES ++ +  +    SV
Sbjct: 146 KGGSTIDEYLAKIKVCVDSLASVGVSLSTKDHVESILDGLPNDYESFITSVILRNDDFSV 205

Query: 205 HEVMALLFTQENRIESKLVHTDTS-LPYVNLSVQSKPADNDAQKYNPPSFPPHFSG--GN 264
            E+ ALL   E+R+E      D+S   +V  S   +  +   Q Y   +   + SG  G+
Sbjct: 206 EEIEALLMAHESRVEKNNSSLDSSPSAHVASSNAVEKGNRFKQDYYAANSQGNHSGYNGS 265

Query: 265 RGRWGD-------------------QSNRGG-------------------RTWNNRNRIQ 324
            GR GD                   +SNRGG                     WN+ N+ +
Sbjct: 266 FGRGGDFGRRGGFNGGRGFNWNYNGRSNRGGFRGRGGFRGRGNRGNFQARPPWNSDNQNE 325

Query: 325 ---CQLCGKFNHTAVKCYFRYAPP-SAPPNPSSFAPS---YNQFNRSPSFPRMNVMLTAP 384
              CQLCGK  H   +CY+R+      P N S   PS   Y  F+     P++N ++   
Sbjct: 326 KPACQLCGKIGHVVAQCYYRFDHTFQVPQNLSGRNPSPRAYYSFS-----PQVNGVIPTS 385

Query: 385 DINQDTTWYPDSDTTNHLTHNFGNLFVGTEYGGD------QAFGRVLLQ--GTLHEGLYR 444
           ++  D  WYPDS  +NH+T N  NL    E+ G          G       G + +GLY 
Sbjct: 386 EVFSDDNWYPDSGASNHVTPNPANLMKSVEFAGQNQVHVGNGTGNPSCSNVGKVRDGLYA 445

Query: 445 FNVSISQQPSHKPTVQALHSTTTILIHTAYLSVYSVSNSSKLDIWHRRLGHPSLSTVKHV 504
           F+   S   + +PT Q+L  + +++  +    V   S SS  D+WH+RLG PS +T+K+V
Sbjct: 446 FD---SSHLALRPT-QSLSKSPSVVASSFSSKVCIASLSSTFDLWHKRLGQPSAATIKNV 505

Query: 505 LQLFKPNMS-INNMKFQFCDACAMEKTHSLPFSPSSTTYTAPLH---------------- 564
           L   K N++ IN M   FC +C + K H  PFS S TTYT PL                 
Sbjct: 506 LS--KCNVAHINKMDSNFCSSCCLGKIHMFPFSLSHTTYTKPLELIHSDLWGPAPVLSNS 565

Query: 565 -------------KYTWIYFLKSKSDAFDAFVH----IEKLLNLPIVQFPSDNGGEFLCF 614
                        +++WI+ L++KS+A   FV+    +E   +L I    +D GGEF  F
Sbjct: 566 GYRYYIHFVDAFSRFSWIFLLRNKSEAIKTFVNFKTQVELQFDLKIKSLQTDWGGEFRAF 625

BLAST of Lag0000341 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 45.1 bits (105), Expect = 2.5e-04
Identity = 20/64 (31.25%), Postives = 38/64 (59.38%), Query Frame = 0

Query: 516 HRHIVDTDLALLSHSSMPLKFQDEAFSIALFLINRLPSEVLHGRSPLEIIFNTKPDYSFL 575
           +R I++   ++L    +P  F+ +A + A+ +IN+ PS  ++   P E+ F + P YS+L
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 576 KAFG 580
           + FG
Sbjct: 62  RRFG 65

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0048297.16.9e-13440.89Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
TYK10642.16.9e-13440.89Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KZV26181.19.7e-9634.72hypothetical protein F511_06348 [Dorcoceras hygrometricum][more]
RVW60229.14.5e-9331.59Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
KAF7814697.11.9e-9133.15Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora][more]
Match NameE-valueIdentityDescription
Q9ZT942.9e-5826.86Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW27.1e-5726.95Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P041462.5e-1727.34Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Match NameE-valueIdentityDescription
A0A5A7U2333.3e-13440.89Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5D3CH973.3e-13440.89Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A2Z7AWA74.7e-9634.72Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472... [more]
A0A438FJP62.2e-9331.59Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438K1471.1e-8932.92Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
ATMG00710.12.5e-0431.25Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 339..426
e-value: 1.1E-11
score: 44.4
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 437..573
e-value: 3.2E-19
score: 71.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 194..229
NoneNo IPR availablePANTHERPTHR34222:SF48SUBFAMILY NOT NAMEDcoord: 52..325
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 52..325
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 403..569
score: 13.719819
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 444..563

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0000341.1Lag0000341.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding