Lag0018094 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0018094
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionIntegrase catalytic domain-containing protein
Locationchr5: 15997413 .. 16000061 (-)
RNA-Seq ExpressionLag0018094
SyntenyLag0018094
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTGTGCTCAAGGATACACCAGAAGAAACACATACACCAACAACACCAACAGATCAAGACGTCAACACAACTATAATCCTCAATTTGAGACATGATACTCCAATTTTGCAAGTGAACCAAGGCATATTTGAGACCAATGAAACTGATATATCAAAATTGGGCAAAGACTTCACTCTCTTGGAAAATCAAGTTCATTACCAGGATGGAACATATGCTCATGACGATTCCATGACCATGACACCTACGCTCATAACTCCTAATGAAGCAGCTAAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGGTTTTAGCTCTCACCACATGGTCACAACAAGCAAGTTAGCAAAGAATCTGGTCCTTGATCCCACTCTTGCACATCAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTAATAGCAATGAGTCACAAATCTGAGCCACAAAACTACAAAATGGCCCTATCTTTGCCTCATTGGAAGGATGCCATGGATGAAGAAATGAAGGCTTTAATGTTGAATAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTCGATCTTCAAGATAAAGTGCAAGGAAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGGTTAATTCTATCCTTAGCTACTAGTGCAGGATGGTCCCTAAGACAACTAGATGTTAAAAATGCTTTCTTGCATGGTCATTTGAAGGAACAACTAAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGATATTAGCTCTCACCACATGGTCACAAGAAGCAAGTTATTAGCAAAGAATCCAGTCCTTGATCCCACTCTTGCACATGAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTTATAGCAATGAGTCACAAATATGAGCCAAAAAACTACAAAATGGCCCTATCTTTTCCTCATTGGAAGGATGCCATGGAGGAAGAAATGAAGGCTTTAATGTTGAACAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTGGATCTTCAAGACAAAGTGCAAGGGAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGGTTAATTTATCCTTAGCTAATAGTGCAGGATGGTCCCTAAGACAACTAGATGTTAAAAATGTTTCTTGCATGGTCATTTGAAGGAACAAGTCTTTATGGAACAACCTCTAGGTTTATTAATCCTCACCATCCAAACTATGTTTGTAAGCTTCAAAAGTCCATCTATGGCCTCAAACAGGCCCCACGAGCATGGCTTGATAGACTAGCTAATTTTCTCCTCCATATTGGTTTTAACAGTTCTCATTCTGACCCATCTCTTTTTATATTTAGAGACAAATCTATTTAACACTAATGCTTAATTACGTTGATGAAATTATATTGACAGGAAACAACTCATCACATATCCAACAACTCATAGGAACACTACGTTCACAATTTTGCCTTGAAAGACTTGGACAACTTACACTATTTCTTAGCATAGAAGTGAAGCACACTGCAAAAGGAATTATGCTATCACAAGGAAAATATGCTCGTGAAATACTCACCAAGTCAGGAATGTTGGGAGTTGCTGCCATTAATACTCCAATCATGACCTCACCTCAAGATACCCATAAGGACACTCAACCCACTGATGCAAAGGAATACAGGAGCATCGTTGGATTGTTGCAATACCTTACACTCACACGACCTGATATAGTATATGCAGTTAATTGTGTATGCCAACACCTACAACAGCCCATCGTTAAGGATCTCAAAGCTGTGAAAAGGATTCTTCGATACATACAAGGAACCCTCGACTACGATATCTCTCTTTATAAAAACAATTCACTAAATTTATATGCTTTTTGTGATGCAGATTGGGGTGGCTGCCCTCTTACACGAAGTACTACAGGCTTTTGTGTATTCCTCAGATCCAATTGTATCTCATCACTACAAGAATCTCGGGATTCGTCGACGCAAACAGACGTCGGCAAATCCCTCTTTTGCGTCGGCAAAAAGGAAACGACGATGCAATCATGCGTCGGCTTAAATGTCGCTAAATCCTTGTTGGCATCCAAAAAACTGACGAAAATGGCGGTTCGTGGGCAGAAAGGTACATTTTCGCCAACGCAAACTCTTGCGTTGCCTAAACCTGTAATAACCGATGCAACATGCGTCGGTTATAATACGGTTTCATTTATCAGATTTTTGAGTTTAGCCCACGCAAGAAGCGTCGGTGAAAGTCATAAATTGCGTCGGCTACAATACGACTTCATTCCACAGATTTTCGAGATGAGGCCACGAAATTGCGTCGGTGAAAATAAAAATTGCGTCGGCTAA

mRNA sequence

ATGGTTGTGCTCAAGGATACACCAGAAGAAACACATACACCAACAACACCAACAGATCAAGACGTCAACACAACTATAATCCTCAATTTGAGACATGATACTCCAATTTTGCAAGTGAACCAAGGCATATTTGAGACCAATGAAACTGATATATCAAAATTGGGCAAAGACTTCACTCTCTTGGAAAATCAAGTTCATTACCAGGATGGAACATATGCTCATGACGATTCCATGACCATGACACCTACGCTCATAACTCCTAATGAAGCAGCTAAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGGTTTTAGCTCTCACCACATGGTCACAACAAGCAAGTTAGCAAAGAATCTGGTCCTTGATCCCACTCTTGCACATCAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTAATAGCAATGAGTCACAAATCTGAGCCACAAAACTACAAAATGGCCCTATCTTTGCCTCATTGGAAGGATGCCATGGATGAAGAAATGAAGGCTTTAATGTTGAATAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTCGATCTTCAAGATAAAGTGCAAGGAAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGATATTAGCTCTCACCACATGGTCACAAGAAGCAAGTTATTAGCAAAGAATCCAGTCCTTGATCCCACTCTTGCACATGAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTTATAGCAATGAGTCACAAATATGAGCCAAAAAACTACAAAATGGCCCTATCTTTTCCTCATTGGAAGGATGCCATGGAGGAAGAAATGAAGGCTTTAATGTTGAACAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTGGATCTTCAAGACAAAGTGCAAGGGAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGAGACAAATCTATTTAACACTAATGCTTAATTACGTTGATGAAATTATATTGACAGGAAACAACTCATCACATATCCAACAACTCATAGGAACACTACGTTCACAATTTTGCCTTGAAAGACTTGGACAACTTACACTATTTCTTAGCATAGAAGTGAAGCACACTGCAAAAGGAATTATGCTATCACAAGGAAAATATGCTCGTGAAATACTCACCAAGTCAGGAATGTTGGGAGTTGCTGCCATTAATACTCCAATCATGACCTCACCTCAAGATACCCATAAGGACACTCAACCCACTGATGCAAAGGAATACAGGAGCATCGTTGGATTGTTGCAATACCTTACACTCACACGACCTGATATAGTATATGCAGTTAATTGTGTATGCCAACACCTACAACAGCCCATCGTTAAGGATCTCAAAGCTGTGAAAAGGATTCTTCGATACATACAAGGAACCCTCGACTACGATATCTCTCTTTATAAAAACAATTCACTAAATTTATATGCTTTTTGTGATGCAGATTGGGGTGGCTGCCCTCTTACACGAAGTACTACAGGCTTTTGTGTATTCCTCAGATCCAATTGTATCTCATCACTACAAGAATCTCGGGATTCGTCGACGCAAACAGACGTCGGCAAATCCCTCTTTTGCGTCGGCAAAAAGGAAACGACGATGCAATCATGCGTCGGCTTAAATGTCGCTAAATCCTTGTTGGCATCCAAAAAACTGACGAAAATGGCGGTTCGTGGGCAGAAAGGTACATTTTCGCCAACGCAAACTCTTGCGTTGCCTAAACCTGTAATAACCGATGCAACATGCGTCGGTTATAATACGGTTTCATTTATCAGATTTTTGAGTTTAGCCCACGCAAGAAGCGTCGGTGAAAGTCATAAATTGCGTCGGCTACAATACGACTTCATTCCACAGATTTTCGAGATGAGGCCACGAAATTGCGTCGGTGAAAATAAAAATTGCGTCGGCTAA

Coding sequence (CDS)

ATGGTTGTGCTCAAGGATACACCAGAAGAAACACATACACCAACAACACCAACAGATCAAGACGTCAACACAACTATAATCCTCAATTTGAGACATGATACTCCAATTTTGCAAGTGAACCAAGGCATATTTGAGACCAATGAAACTGATATATCAAAATTGGGCAAAGACTTCACTCTCTTGGAAAATCAAGTTCATTACCAGGATGGAACATATGCTCATGACGATTCCATGACCATGACACCTACGCTCATAACTCCTAATGAAGCAGCTAAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGGTTTTAGCTCTCACCACATGGTCACAACAAGCAAGTTAGCAAAGAATCTGGTCCTTGATCCCACTCTTGCACATCAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTAATAGCAATGAGTCACAAATCTGAGCCACAAAACTACAAAATGGCCCTATCTTTGCCTCATTGGAAGGATGCCATGGATGAAGAAATGAAGGCTTTAATGTTGAATAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTCGATCTTCAAGATAAAGTGCAAGGAAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGTTATCTGATATTAGTAAAAATCTTCATATTGACTTGCCCACTTTGATTGCAGGAACTATACCAAAGCAAGGAGTATTGGCACAAGTCAATGATATTAGCTCTCACCACATGGTCACAAGAAGCAAGTTATTAGCAAAGAATCCAGTCCTTGATCCCACTCTTGCACATGAAATTAGCAAACTCAAAAACTCAAGACAACAAAGAACACTTATAGCAATGAGTCACAAATATGAGCCAAAAAACTACAAAATGGCCCTATCTTTTCCTCATTGGAAGGATGCCATGGAGGAAGAAATGAAGGCTTTAATGTTGAACAACACATGGATCCTAGTTCAAAGACCACAAGATACAAATGTAGTAGAGTCAAAGTGGATCTTCAAGACAAAGTGCAAGGGAGATGGAACGATTAATCGCTACAAGGCTAGACTTGTAGCTCAAGGATACACACAAATTGAGGGTCTTAACTATGAAGAAACTTATAGCCCAGTAAATCAAACCAACAACAATCAGAGACAAATCTATTTAACACTAATGCTTAATTACGTTGATGAAATTATATTGACAGGAAACAACTCATCACATATCCAACAACTCATAGGAACACTACGTTCACAATTTTGCCTTGAAAGACTTGGACAACTTACACTATTTCTTAGCATAGAAGTGAAGCACACTGCAAAAGGAATTATGCTATCACAAGGAAAATATGCTCGTGAAATACTCACCAAGTCAGGAATGTTGGGAGTTGCTGCCATTAATACTCCAATCATGACCTCACCTCAAGATACCCATAAGGACACTCAACCCACTGATGCAAAGGAATACAGGAGCATCGTTGGATTGTTGCAATACCTTACACTCACACGACCTGATATAGTATATGCAGTTAATTGTGTATGCCAACACCTACAACAGCCCATCGTTAAGGATCTCAAAGCTGTGAAAAGGATTCTTCGATACATACAAGGAACCCTCGACTACGATATCTCTCTTTATAAAAACAATTCACTAAATTTATATGCTTTTTGTGATGCAGATTGGGGTGGCTGCCCTCTTACACGAAGTACTACAGGCTTTTGTGTATTCCTCAGATCCAATTGTATCTCATCACTACAAGAATCTCGGGATTCGTCGACGCAAACAGACGTCGGCAAATCCCTCTTTTGCGTCGGCAAAAAGGAAACGACGATGCAATCATGCGTCGGCTTAAATGTCGCTAAATCCTTGTTGGCATCCAAAAAACTGACGAAAATGGCGGTTCGTGGGCAGAAAGGTACATTTTCGCCAACGCAAACTCTTGCGTTGCCTAAACCTGTAATAACCGATGCAACATGCGTCGGTTATAATACGGTTTCATTTATCAGATTTTTGAGTTTAGCCCACGCAAGAAGCGTCGGTGAAAGTCATAAATTGCGTCGGCTACAATACGACTTCATTCCACAGATTTTCGAGATGAGGCCACGAAATTGCGTCGGTGAAAATAAAAATTGCGTCGGCTAA

Protein sequence

MVVLKDTPEETHTPTTPTDQDVNTTIILNLRHDTPILQVNQGIFETNETDISKLGKDFTLLENQVHYQDGTYAHDDSMTMTPTLITPNEAAKLSDISKNLHIDLPTLIAGTIPKQGVLAQVNGFSSHHMVTTSKLAKNLVLDPTLAHQISKLKNSRQQRTLIAMSHKSEPQNYKMALSLPHWKDAMDEEMKALMLNNTWILVQRPQDTNVVESKSIFKIKCKEDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNNNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNNNQRQIYLTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLTRSTTGFCVFLRSNCISSLQESRDSSTQTDVGKSLFCVGKKETTMQSCVGLNVAKSLLASKKLTKMAVRGQKGTFSPTQTLALPKPVITDATCVGYNTVSFIRFLSLAHARSVGESHKLRRLQYDFIPQIFEMRPRNCVGENKNCVG
Homology
BLAST of Lag0018094 vs. NCBI nr
Match: GAU44375.1 (hypothetical protein TSUD_243070 [Trifolium subterraneum])

HSP 1 Score: 300.8 bits (769), Expect = 3.3e-77
Identity = 168/384 (43.75%), Postives = 211/384 (54.95%), Query Frame = 0

Query: 339  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTIN 398
            EPK YK AL + +W+ AM++E+ AL  NNTW LVQRP D NV+ SKW+F+TK   DG+I+
Sbjct: 713  EPKTYKTALKYSNWQAAMQDEIDALHSNNTWTLVQRPLDANVIGSKWVFRTKLNEDGSID 772

Query: 399  RYKARLVAQGYTQIEGLNYEETYSPV---------------------------------- 458
            R+KARLVA+GYTQI GL++ ET+SPV                                  
Sbjct: 773  RFKARLVAKGYTQIPGLDFGETFSPVIKAPTIRIILSLAVHFKWPLKQLDVKNAFLHGTL 832

Query: 459  ------------------NQTNNNQRQIY------------------------------- 518
                              N      + +Y                               
Sbjct: 833  NERVYMEQPPGFEHPHLPNHVCQLHKSLYGLKQAPRAWFEKLSACLISLGFICSKADPSL 892

Query: 519  --------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTA 578
                     TL+L YVD+IILTGN  S I  L+  L  +F L+ LGQL  FL IE+KH  
Sbjct: 893  FIHRYDTNFTLLLVYVDDIILTGNAPSFISHLVKQLHEKFALKDLGQLHYFLGIEIKHFC 952

Query: 579  KGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT 631
             GI +SQ KYA ++L ++ MLG + INTPI + P +   D  P DA EYR + G LQYLT
Sbjct: 953  GGITISQTKYAHDLLKRAHMLGASKINTPIASKPNELPDDNNPVDATEYRRLCGSLQYLT 1012

BLAST of Lag0018094 vs. NCBI nr
Match: RVX04589.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 296.6 bits (758), Expect = 6.2e-76
Identity = 184/464 (39.66%), Postives = 243/464 (52.37%), Query Frame = 0

Query: 259 NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHE 318
           ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  +
Sbjct: 461 DDEVPDSSKQIVVDIS-------PPQG---QHTDNKGTHMITRSKL--KN---DPSLKSQ 520

Query: 319 ISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDT 378
           +     +R        S   EPK Y+  L  PHW  AM+EE+KAL+ N TW LV RP  T
Sbjct: 521 MVTFAATR--------SDISEPKTYRTTLKIPHWLKAMQEEIKALIQNRTWDLVPRPPTT 580

Query: 379 NVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV-------------- 438
           N+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV              
Sbjct: 581 NIVGSKWVFKTKLKEDGTIDRYKARLVARGFSQIPGLDFGETFSPVIKHTTIRMIFSLAV 640

Query: 439 --------------------------------------NQTNNNQRQIY----------- 498
                                                 N      R +Y           
Sbjct: 641 TLGWKMRQLDVKNAFLHGFLKEEVFMEQPPGFINEDLPNHVCKLNRSLYGLKQAPRAWFD 700

Query: 499 ----------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF 558
                                       + L+L YVD+II+TGN+++ I  LI TL S+F
Sbjct: 701 RLSQCLLHLGFCCGKADSSLFILRKGQSIVLLLIYVDDIIVTGNDNNIISDLINTLSSEF 760

Query: 559 CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKD 618
            L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D
Sbjct: 761 SLKDLGSLHYFLGLEVKYLPNGLFVSQTKYIRDLLEHTKMMECTPINTPMALKSIITSFD 820

Query: 619 TQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLD 631
            QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P   DL+AVKRILRY++GT++
Sbjct: 821 EQPIDPTQYRQLVGSLQYLTFTRPDIVHAVNKACQHFQAPTKADLRAVKRILRYLKGTME 880

BLAST of Lag0018094 vs. NCBI nr
Match: RVW43526.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 296.2 bits (757), Expect = 8.1e-76
Identity = 184/464 (39.66%), Postives = 243/464 (52.37%), Query Frame = 0

Query: 259  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHE 318
            ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  +
Sbjct: 876  DDEVPDNSKQIVVDIS-------PPQG---QHTDNKGTHMITRSKL--KN---DPSLKSQ 935

Query: 319  ISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDT 378
            +     +R        S   EPK Y+  L  PHW  AM+EE+KAL+ N TW LV RP  T
Sbjct: 936  MVTFAATR--------SDISEPKTYRTTLKIPHWLKAMQEEIKALIQNRTWDLVPRPPTT 995

Query: 379  NVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV-------------- 438
            N+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV              
Sbjct: 996  NIVGSKWVFKTKLKEDGTIDRYKARLVARGFSQIPGLDFGETFSPVIKHTTIRMIFSLAV 1055

Query: 439  --------------------------------------NQTNNNQRQIY----------- 498
                                                  N      R +Y           
Sbjct: 1056 TLGWKMRQLDVKNAFLHGFLKEEVFMEQPPGFINEDLSNHVCKLNRSLYGLKQAPRAWFD 1115

Query: 499  ----------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF 558
                                        + L+L YVD+II+TGN+++ I  LI TL S+F
Sbjct: 1116 RLSQCLLHLGFCCGKADSSLFILRKCQSIVLLLIYVDDIIVTGNDNNIISDLISTLSSEF 1175

Query: 559  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKD 618
             L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D
Sbjct: 1176 SLKDLGSLHYFLGLEVKYLPNGLFVSQTKYIRDLLEHTKMMECTPINTPMALKSIITSFD 1235

Query: 619  TQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLD 631
             QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P   DL+AVKRILRY++GT++
Sbjct: 1236 EQPIDPTQYRQLVGSLQYLTFTRPDIVHAVNKACQHFQAPTKADLRAVKRILRYLKGTME 1295

BLAST of Lag0018094 vs. NCBI nr
Match: RVW19921.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 292.0 bits (746), Expect = 1.5e-74
Identity = 175/431 (40.60%), Postives = 239/431 (55.45%), Query Frame = 0

Query: 259  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHE 318
            ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  +
Sbjct: 853  DDEVLDSSKQIVVDIS-------PPQG---QHTDNKGTHMITRSKL--KN---DPSLKSQ 912

Query: 319  ISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDT 378
            +     +R        S   EPK Y+ AL  PHW  AM+EE+KAL+ N TW LV RP  T
Sbjct: 913  MVTFAATR--------SDISEPKTYRTALKIPHWLKAMQEEIKALIQNRTWDLVPRPPTT 972

Query: 379  NVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN--------- 438
            N+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV +            
Sbjct: 973  NIVGSKWVFKTKLKEDGTIDRYKARLVARGFSQIPGLDFGETFSPVIKHTTIRMIFSLAV 1032

Query: 439  -------------------NQRQIYLTLMLNYVDE------------------------- 498
                                + ++++     +++E                         
Sbjct: 1033 TLGWKMRQLDVKNAFLHGFLKEEVFMEQPPGFINEDLPNHVCKLNRSLYGLKQAPRAWFD 1092

Query: 499  -----IILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYARE 558
                   + GN+++ I  LI TL S+F L+ LG L  FL +EVK+   G+ +SQ KY R+
Sbjct: 1093 RLSNVFFIWGNDNNIISDLISTLSSEFSLKDLGSLHYFLGLEVKYLPNGLFVSQTKYIRD 1152

Query: 559  ILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCV 618
            +L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  
Sbjct: 1153 LLEHTKMMECTPINTPMALKSIITSFDEQPIDPTQYRQLVGSLQYLTFTRPDIVHAVNKA 1212

Query: 619  CQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTG 631
            CQH Q P   DL+AVKRILRY++GT+++ I  +K +SL L  FCDADW GC  T RST+G
Sbjct: 1213 CQHFQAPTKADLRAVKRILRYLKGTMEHGIRFFKQSSLRLTGFCDADWAGCTNTRRSTSG 1260

BLAST of Lag0018094 vs. NCBI nr
Match: PNY16899.1 (copia-like polyprotein, partial [Trifolium pratense])

HSP 1 Score: 291.6 bits (745), Expect = 2.0e-74
Identity = 166/384 (43.23%), Postives = 207/384 (53.91%), Query Frame = 0

Query: 339 EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTIN 398
           EPK YK AL + +W++AM+EE+ AL  NNTW LV RP D NVV SKW+F+TK   DG+I+
Sbjct: 276 EPKTYKTALKYSNWQEAMKEEINALHSNNTWTLVPRPLDANVVGSKWVFRTKLNEDGSID 335

Query: 399 RYKARLVAQGYTQIEGLNYEETYSPV---------------------------------- 458
           R+KARLVA+GYTQI GL++ ET+SPV                                  
Sbjct: 336 RFKARLVAKGYTQIHGLDFGETFSPVVKAPTIRVILSLAVHFKWPLKQLDVKNAFLHGTL 395

Query: 459 ------------------NQTNNNQRQIY------------------------------- 518
                             N      + +Y                               
Sbjct: 396 NERVYMEQPPGFEHPHLKNHVCQLHKSLYGLKQAPRAWFEKLSTCLFSLGFICSKADPSL 455

Query: 519 --------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTA 578
                    TL+L YVD+IILTGN  S I  LI  L  +F L+ LGQL  FL IE+K+  
Sbjct: 456 FILRRDTNFTLLLVYVDDIILTGNTPSFISHLIKQLHEKFALKDLGQLHYFLGIEIKYFC 515

Query: 579 KGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT 631
            GI +SQ KYA ++L ++ ML  + INTPI   P     D +  DA EYR + G LQYLT
Sbjct: 516 GGITISQTKYAHDLLKRAHMLDASKINTPIAPKPNQLPDDNKLVDATEYRRLCGSLQYLT 575

BLAST of Lag0018094 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 189.5 bits (480), Expect = 1.4e-46
Identity = 143/451 (31.71%), Postives = 203/451 (45.01%), Query Frame = 0

Query: 274  PTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHEISKLKNSRQQRTLIA 333
            P L A  I +    A VN   +H M TR+K   + P  +   ++  S   NS        
Sbjct: 891  PVLPAPPIIQVNAQAPVN---THSMATRAKDGIRKP--NQKYSYATSLAANS-------- 950

Query: 334  MSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILV-QRPQDTNVVESKWIFKTKCK 393
                 EP+    A+    W+ AM  E+ A + N+TW LV   P    +V  +WIF  K  
Sbjct: 951  -----EPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFN 1010

Query: 394  GDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTN---------------------NN- 453
             DG++NRYKARLVA+GY Q  GL+Y ET+SPV ++                      NN 
Sbjct: 1011 SDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNA 1070

Query: 454  ------------------------------------------------------------ 513
                                                                        
Sbjct: 1071 FLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNS 1130

Query: 514  ---------QRQIYLTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSI 573
                     QR   +  ML YVD+I++TGN++  ++  +  L  +F ++    L  FL I
Sbjct: 1131 ISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGI 1190

Query: 574  EVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDT-HKDTQPTDAKEYRSIV 631
            E K   +G+ LSQ +Y  ++L ++ ML    + TP+ TSP+ T H  T+  D  EYR IV
Sbjct: 1191 EAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYRGIV 1250

BLAST of Lag0018094 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 188.0 bits (476), Expect = 4.1e-46
Identity = 143/502 (28.49%), Postives = 222/502 (44.22%), Query Frame = 0

Query: 252  SPVNQTNNNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVL 311
            SP    +++  S    ++ I  P  +A  +           +++H M TR    AK  ++
Sbjct: 887  SPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQA----PLNTHSMGTR----AKAGII 946

Query: 312  DPTLAHEISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWIL 371
             P   + ++           ++++ + EP+    AL    W++AM  E+ A + N+TW L
Sbjct: 947  KPNPKYSLA-----------VSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDL 1006

Query: 372  V-QRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTN-- 431
            V   P    +V  +WIF  K   DG++NRYKARLVA+GY Q  GL+Y ET+SPV ++   
Sbjct: 1007 VPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSI 1066

Query: 432  -------------------NN--------------------------------------- 491
                               NN                                       
Sbjct: 1067 RIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLK 1126

Query: 492  -------------------------------QRQIYLTLMLNYVDEIILTGNNSSHIQQL 551
                                           QR   +  ML YVD+I++TGN+ + +   
Sbjct: 1127 QAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNT 1186

Query: 552  IGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMT 611
            +  L  +F ++   +L  FL IE K    G+ LSQ +Y  ++L ++ M+    + TP+  
Sbjct: 1187 LDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVTTPMAP 1246

Query: 612  SPQ-DTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRI 660
            SP+   +  T+ TD  EYR IVG LQYL  TRPDI YAVN + Q +  P  + L+A+KRI
Sbjct: 1247 SPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQALKRI 1306

BLAST of Lag0018094 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 184.1 bits (466), Expect = 5.8e-45
Identity = 95/210 (45.24%), Postives = 137/210 (65.24%), Query Frame = 0

Query: 439 MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYA 498
           +L YVD+I+LTG++++ +  LI  L S F ++ LG +  FL I++K    G+ LSQ KYA
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 499 REILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVN 558
            +IL  +GML    ++TP+      +    +  D  ++RSIVG LQYLTLTRPDI YAVN
Sbjct: 63  EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVN 122

Query: 559 CVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RST 618
            VCQ + +P + D   +KR+LRY++GT+ + + ++KN+ LN+ AFCD+DW GC  T RST
Sbjct: 123 IVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRST 182

Query: 619 TGFCVFLRSNCISSLQESRD----SSTQTD 644
           TGFC FL  N IS   + +     SST+T+
Sbjct: 183 TGFCTFLGCNIISWSAKRQPTVSRSSTETE 212

BLAST of Lag0018094 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 127.5 bits (319), Expect = 6.5e-28
Identity = 90/364 (24.73%), Postives = 161/364 (44.23%), Query Frame = 0

Query: 352  WKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQ 411
            W++A+  E+ A  +NNTW + +RP++ N+V+S+W+F  K    G   RYKARLVA+G+TQ
Sbjct: 906  WEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQ 965

Query: 412  IEGLNYEETYSPVNQTNN----------------------------NQRQIYLTL----- 471
               ++YEET++PV + ++                             + +IY+ L     
Sbjct: 966  KYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGIS 1025

Query: 472  ----------------------------------------------------------ML 531
                                                                      +L
Sbjct: 1026 CNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVL 1085

Query: 532  NYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYARE 591
             YVD++++   + + +      L  +F +  L ++  F+ I ++     I LSQ  Y ++
Sbjct: 1086 LYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKK 1145

Query: 592  ILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTL-TRPDIVYAVNC 621
            IL+K  M    A++TP+ +       ++        RS++G L Y+ L TRPD+  AVN 
Sbjct: 1146 ILSKFNMENCNAVSTPLPSKINYELLNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNI 1205

BLAST of Lag0018094 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 5.7e-24
Identity = 111/426 (26.06%), Postives = 176/426 (41.31%), Query Frame = 0

Query: 313  PTLAHEISKLKNSRQQRT-LIAMSHKYEPKNYKMALSFPHWKD---AMEEEMKALMLNNT 372
            P    E  ++++ R   T  + +S   EP++ K  LS P       AM+EEM++L  N T
Sbjct: 783  PLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGT 842

Query: 373  WILVQRPQDTNVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTN 432
            + LV+ P+    ++ KW+FK K  GD  + RYKARLV +G+ Q +G++++E +SPV +  
Sbjct: 843  YKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMT 902

Query: 433  NNQ--------------------------------------------------------- 492
            + +                                                         
Sbjct: 903  SIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYG 962

Query: 493  -----RQIYL------------------------------TLMLNYVDEIILTGNNSSHI 552
                 RQ Y+                               ++L YVD++++ G +   I
Sbjct: 963  LKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLI 1022

Query: 553  QQLIGTLRSQFCLERLG--QLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAIN 612
             +L G L   F ++ LG  Q  L + I  + T++ + LSQ KY   +L +  M     ++
Sbjct: 1023 AKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVS 1082

Query: 613  TPIMTSPQDTHKDTQPTDAKE--------YRSIVGLLQY-LTLTRPDIVYAVNCVCQHLQ 631
            TP +       K   PT  +E        Y S VG L Y +  TRPDI +AV  V + L+
Sbjct: 1083 TP-LAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLE 1142

BLAST of Lag0018094 vs. ExPASy TrEMBL
Match: A0A2Z6P7T0 (Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_243070 PE=4 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 1.6e-77
Identity = 168/384 (43.75%), Postives = 211/384 (54.95%), Query Frame = 0

Query: 339  EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTIN 398
            EPK YK AL + +W+ AM++E+ AL  NNTW LVQRP D NV+ SKW+F+TK   DG+I+
Sbjct: 713  EPKTYKTALKYSNWQAAMQDEIDALHSNNTWTLVQRPLDANVIGSKWVFRTKLNEDGSID 772

Query: 399  RYKARLVAQGYTQIEGLNYEETYSPV---------------------------------- 458
            R+KARLVA+GYTQI GL++ ET+SPV                                  
Sbjct: 773  RFKARLVAKGYTQIPGLDFGETFSPVIKAPTIRIILSLAVHFKWPLKQLDVKNAFLHGTL 832

Query: 459  ------------------NQTNNNQRQIY------------------------------- 518
                              N      + +Y                               
Sbjct: 833  NERVYMEQPPGFEHPHLPNHVCQLHKSLYGLKQAPRAWFEKLSACLISLGFICSKADPSL 892

Query: 519  --------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTA 578
                     TL+L YVD+IILTGN  S I  L+  L  +F L+ LGQL  FL IE+KH  
Sbjct: 893  FIHRYDTNFTLLLVYVDDIILTGNAPSFISHLVKQLHEKFALKDLGQLHYFLGIEIKHFC 952

Query: 579  KGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT 631
             GI +SQ KYA ++L ++ MLG + INTPI + P +   D  P DA EYR + G LQYLT
Sbjct: 953  GGITISQTKYAHDLLKRAHMLGASKINTPIASKPNELPDDNNPVDATEYRRLCGSLQYLT 1012

BLAST of Lag0018094 vs. ExPASy TrEMBL
Match: A0A438J6K3 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_103 PE=4 SV=1)

HSP 1 Score: 296.6 bits (758), Expect = 3.0e-76
Identity = 184/464 (39.66%), Postives = 243/464 (52.37%), Query Frame = 0

Query: 259 NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHE 318
           ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  +
Sbjct: 461 DDEVPDSSKQIVVDIS-------PPQG---QHTDNKGTHMITRSKL--KN---DPSLKSQ 520

Query: 319 ISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDT 378
           +     +R        S   EPK Y+  L  PHW  AM+EE+KAL+ N TW LV RP  T
Sbjct: 521 MVTFAATR--------SDISEPKTYRTTLKIPHWLKAMQEEIKALIQNRTWDLVPRPPTT 580

Query: 379 NVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV-------------- 438
           N+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV              
Sbjct: 581 NIVGSKWVFKTKLKEDGTIDRYKARLVARGFSQIPGLDFGETFSPVIKHTTIRMIFSLAV 640

Query: 439 --------------------------------------NQTNNNQRQIY----------- 498
                                                 N      R +Y           
Sbjct: 641 TLGWKMRQLDVKNAFLHGFLKEEVFMEQPPGFINEDLPNHVCKLNRSLYGLKQAPRAWFD 700

Query: 499 ----------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF 558
                                       + L+L YVD+II+TGN+++ I  LI TL S+F
Sbjct: 701 RLSQCLLHLGFCCGKADSSLFILRKGQSIVLLLIYVDDIIVTGNDNNIISDLINTLSSEF 760

Query: 559 CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKD 618
            L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D
Sbjct: 761 SLKDLGSLHYFLGLEVKYLPNGLFVSQTKYIRDLLEHTKMMECTPINTPMALKSIITSFD 820

Query: 619 TQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLD 631
            QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P   DL+AVKRILRY++GT++
Sbjct: 821 EQPIDPTQYRQLVGSLQYLTFTRPDIVHAVNKACQHFQAPTKADLRAVKRILRYLKGTME 880

BLAST of Lag0018094 vs. ExPASy TrEMBL
Match: A0A438E6Z5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1575 PE=4 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 3.9e-76
Identity = 184/464 (39.66%), Postives = 243/464 (52.37%), Query Frame = 0

Query: 259  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHE 318
            ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  +
Sbjct: 876  DDEVPDNSKQIVVDIS-------PPQG---QHTDNKGTHMITRSKL--KN---DPSLKSQ 935

Query: 319  ISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDT 378
            +     +R        S   EPK Y+  L  PHW  AM+EE+KAL+ N TW LV RP  T
Sbjct: 936  MVTFAATR--------SDISEPKTYRTTLKIPHWLKAMQEEIKALIQNRTWDLVPRPPTT 995

Query: 379  NVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPV-------------- 438
            N+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV              
Sbjct: 996  NIVGSKWVFKTKLKEDGTIDRYKARLVARGFSQIPGLDFGETFSPVIKHTTIRMIFSLAV 1055

Query: 439  --------------------------------------NQTNNNQRQIY----------- 498
                                                  N      R +Y           
Sbjct: 1056 TLGWKMRQLDVKNAFLHGFLKEEVFMEQPPGFINEDLSNHVCKLNRSLYGLKQAPRAWFD 1115

Query: 499  ----------------------------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQF 558
                                        + L+L YVD+II+TGN+++ I  LI TL S+F
Sbjct: 1116 RLSQCLLHLGFCCGKADSSLFILRKCQSIVLLLIYVDDIIVTGNDNNIISDLISTLSSEF 1175

Query: 559  CLERLGQLTLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKD 618
             L+ LG L  FL +EVK+   G+ +SQ KY R++L  + M+    INTP+      T  D
Sbjct: 1176 SLKDLGSLHYFLGLEVKYLPNGLFVSQTKYIRDLLEHTKMMECTPINTPMALKSIITSFD 1235

Query: 619  TQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLD 631
             QP D  +YR +VG LQYLT TRPDIV+AVN  CQH Q P   DL+AVKRILRY++GT++
Sbjct: 1236 EQPIDPTQYRQLVGSLQYLTFTRPDIVHAVNKACQHFQAPTKADLRAVKRILRYLKGTME 1295

BLAST of Lag0018094 vs. ExPASy TrEMBL
Match: A0A438C9J9 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_1040 PE=4 SV=1)

HSP 1 Score: 292.0 bits (746), Expect = 7.4e-75
Identity = 175/431 (40.60%), Postives = 239/431 (55.45%), Query Frame = 0

Query: 259  NNQLSDISKNLHIDLPTLIAGTIPKQGVLAQVNDISSHHMVTRSKLLAKNPVLDPTLAHE 318
            ++++ D SK + +D+        P QG   Q  D    HM+TRSKL  KN   DP+L  +
Sbjct: 853  DDEVLDSSKQIVVDIS-------PPQG---QHTDNKGTHMITRSKL--KN---DPSLKSQ 912

Query: 319  ISKLKNSRQQRTLIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDT 378
            +     +R        S   EPK Y+ AL  PHW  AM+EE+KAL+ N TW LV RP  T
Sbjct: 913  MVTFAATR--------SDISEPKTYRTALKIPHWLKAMQEEIKALIQNRTWDLVPRPPTT 972

Query: 379  NVVESKWIFKTKCKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQTNN--------- 438
            N+V SKW+FKTK K DGTI+RYKARLVA+G++QI GL++ ET+SPV +            
Sbjct: 973  NIVGSKWVFKTKLKEDGTIDRYKARLVARGFSQIPGLDFGETFSPVIKHTTIRMIFSLAV 1032

Query: 439  -------------------NQRQIYLTLMLNYVDE------------------------- 498
                                + ++++     +++E                         
Sbjct: 1033 TLGWKMRQLDVKNAFLHGFLKEEVFMEQPPGFINEDLPNHVCKLNRSLYGLKQAPRAWFD 1092

Query: 499  -----IILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYARE 558
                   + GN+++ I  LI TL S+F L+ LG L  FL +EVK+   G+ +SQ KY R+
Sbjct: 1093 RLSNVFFIWGNDNNIISDLISTLSSEFSLKDLGSLHYFLGLEVKYLPNGLFVSQTKYIRD 1152

Query: 559  ILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVNCV 618
            +L  + M+    INTP+      T  D QP D  +YR +VG LQYLT TRPDIV+AVN  
Sbjct: 1153 LLEHTKMMECTPINTPMALKSIITSFDEQPIDPTQYRQLVGSLQYLTFTRPDIVHAVNKA 1212

Query: 619  CQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RSTTG 631
            CQH Q P   DL+AVKRILRY++GT+++ I  +K +SL L  FCDADW GC  T RST+G
Sbjct: 1213 CQHFQAPTKADLRAVKRILRYLKGTMEHGIRFFKQSSLRLTGFCDADWAGCTNTRRSTSG 1260

BLAST of Lag0018094 vs. ExPASy TrEMBL
Match: A0A2K3PNP5 (Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013628 PE=4 SV=1)

HSP 1 Score: 291.6 bits (745), Expect = 9.6e-75
Identity = 166/384 (43.23%), Postives = 207/384 (53.91%), Query Frame = 0

Query: 339 EPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKCKGDGTIN 398
           EPK YK AL + +W++AM+EE+ AL  NNTW LV RP D NVV SKW+F+TK   DG+I+
Sbjct: 276 EPKTYKTALKYSNWQEAMKEEINALHSNNTWTLVPRPLDANVVGSKWVFRTKLNEDGSID 335

Query: 399 RYKARLVAQGYTQIEGLNYEETYSPV---------------------------------- 458
           R+KARLVA+GYTQI GL++ ET+SPV                                  
Sbjct: 336 RFKARLVAKGYTQIHGLDFGETFSPVVKAPTIRVILSLAVHFKWPLKQLDVKNAFLHGTL 395

Query: 459 ------------------NQTNNNQRQIY------------------------------- 518
                             N      + +Y                               
Sbjct: 396 NERVYMEQPPGFEHPHLKNHVCQLHKSLYGLKQAPRAWFEKLSTCLFSLGFICSKADPSL 455

Query: 519 --------LTLMLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTA 578
                    TL+L YVD+IILTGN  S I  LI  L  +F L+ LGQL  FL IE+K+  
Sbjct: 456 FILRRDTNFTLLLVYVDDIILTGNTPSFISHLIKQLHEKFALKDLGQLHYFLGIEIKYFC 515

Query: 579 KGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLT 631
            GI +SQ KYA ++L ++ ML  + INTPI   P     D +  DA EYR + G LQYLT
Sbjct: 516 GGITISQTKYAHDLLKRAHMLDASKINTPIAPKPNQLPDDNKLVDATEYRRLCGSLQYLT 575

BLAST of Lag0018094 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 184.1 bits (466), Expect = 4.2e-46
Identity = 95/210 (45.24%), Postives = 137/210 (65.24%), Query Frame = 0

Query: 439 MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQLTLFLSIEVKHTAKGIMLSQGKYA 498
           +L YVD+I+LTG++++ +  LI  L S F ++ LG +  FL I++K    G+ LSQ KYA
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 499 REILTKSGMLGVAAINTPIMTSPQDTHKDTQPTDAKEYRSIVGLLQYLTLTRPDIVYAVN 558
            +IL  +GML    ++TP+      +    +  D  ++RSIVG LQYLTLTRPDI YAVN
Sbjct: 63  EQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVN 122

Query: 559 CVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFCDADWGGCPLT-RST 618
            VCQ + +P + D   +KR+LRY++GT+ + + ++KN+ LN+ AFCD+DW GC  T RST
Sbjct: 123 IVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRST 182

Query: 619 TGFCVFLRSNCISSLQESRD----SSTQTD 644
           TGFC FL  N IS   + +     SST+T+
Sbjct: 183 TGFCTFLGCNIISWSAKRQPTVSRSSTETE 212

BLAST of Lag0018094 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 169.9 bits (429), Expect = 8.1e-42
Identity = 118/397 (29.72%), Postives = 176/397 (44.33%), Query Frame = 0

Query: 331 LIAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTK 390
           L+ ++   EP  Y  A  F  W  AM++E+ A+   +TW +   P +   +  KW++K K
Sbjct: 77  LVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIK 136

Query: 391 CKGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQ------------------------ 450
              DGTI RYKARLVA+GYTQ EG+++ ET+SPV +                        
Sbjct: 137 YNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDIS 196

Query: 451 ----TNNNQRQIYLTL-------------------------------------------- 510
                 +   +IY+ L                                            
Sbjct: 197 NAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIG 256

Query: 511 -----------------------MLNYVDEIILTGNNSSHIQQLIGTLRSQFCLERLGQL 570
                                  +L YVD+II+  NN + + +L   L+S F L  LG L
Sbjct: 257 FGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPL 316

Query: 571 TLFLSIEVKHTAKGIMLSQGKYAREILTKSGMLGVAAINTPIMTSPQ-DTHKDTQPTDAK 630
             FL +E+  +A GI + Q KYA ++L ++G+LG    + P+  S     H      DAK
Sbjct: 317 KYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAK 376

BLAST of Lag0018094 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 100.5 bits (249), Expect = 6.0e-21
Identity = 50/96 (52.08%), Postives = 65/96 (67.71%), Query Frame = 0

Query: 332 IAMSHKYEPKNYKMALSFPHWKDAMEEEMKALMLNNTWILVQRPQDTNVVESKWIFKTKC 391
           I  + K EPK+   AL  P W  AM+EE+ AL  N TWILV  P + N++  KW+FKTK 
Sbjct: 20  ITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKL 79

Query: 392 KGDGTINRYKARLVAQGYTQIEGLNYEETYSPVNQT 428
             DGT++R KARLVA+G+ Q EG+ + ETYSPV +T
Sbjct: 80  HSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRT 115

BLAST of Lag0018094 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 70.1 bits (170), Expect = 8.7e-12
Identity = 35/92 (38.04%), Postives = 54/92 (58.70%), Query Frame = 0

Query: 545 YLTLTRPDIVYAVNCVCQHLQQPIVKDLKAVKRILRYIQGTLDYDISLYKNNSLNLYAFC 604
           YLT+TRPD+ +AVN + Q         ++AV ++L Y++GT+   +     + L L AF 
Sbjct: 2   YLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFA 61

Query: 605 DADWGGCPLT-RSTTGFCVFLRSNCISSLQES 636
           D+DW  CP T RS TGFC  +    + +L++S
Sbjct: 62  DSDWASCPDTRRSVTGFCSLVPLWFLGALRKS 93

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GAU44375.13.3e-7743.75hypothetical protein TSUD_243070 [Trifolium subterraneum][more]
RVX04589.16.2e-7639.66Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera][more]
RVW43526.18.1e-7639.66Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
RVW19921.11.5e-7440.60Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera][more]
PNY16899.12.0e-7443.23copia-like polyprotein, partial [Trifolium pratense][more]
Match NameE-valueIdentityDescription
Q9ZT941.4e-4631.71Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW24.1e-4628.49Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P925195.8e-4545.24Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ... [more]
P041466.5e-2824.73Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P109785.7e-2426.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
A0A2Z6P7T01.6e-7743.75Reverse transcriptase Ty1/copia-type domain-containing protein OS=Trifolium subt... [more]
A0A438J6K33.0e-7639.66Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976... [more]
A0A438E6Z53.9e-7639.66Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438C9J97.4e-7540.60Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976... [more]
A0A2K3PNP59.6e-7543.23Copia-like polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g013628... [more]
Match NameE-valueIdentityDescription
ATMG00810.14.2e-4645.24DNA/RNA polymerases superfamily protein [more]
AT4G23160.18.1e-4229.72cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.16.0e-2152.08Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00240.18.7e-1238.04Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 441..517
e-value: 2.0E-11
score: 43.9
coord: 366..427
e-value: 1.2E-13
score: 51.2
coord: 196..258
e-value: 1.0E-11
score: 44.9
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..20
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 432..597
coord: 166..290
coord: 340..426
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 367..610

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0018094.1Lag0018094.1mRNA