Lag0018797 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0018797
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionReverse transcriptase domain-containing protein
Locationchr5: 34648109 .. 34650841 (-)
RNA-Seq ExpressionLag0018797
SyntenyLag0018797
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTGACTATTTTCAGCATCTGTTTTCCTCATCGGGTCCGAGTGTCCAGGATTTTGAGGTGGCGCTGCGAGATTTGGAGCCTTCTGTGGATGATGAGATGAACCAAACACTATTGCGACCTTTTACCAAAGAGGAGGTCTTGTTGGCTTTGAAGCAGACGCATCCTAACAATGCCCCAGGTCCAGATGGGCTGTCGGGGAGCTTTTACAAGTACCACTGGGACATTGTTGGGCCAGATATTATTCAGAGTTGTTTGACAGTTCTGAATCACGGATGCTCCCCAGGTGCTGTTAACGATACTATGATTGTGCTCATCCCAAAGATTAATGCAGCCCGACGGATGGTCGACTTTCGACCCATCTCCCTTTGTAATGTGAGTTACAAGCTGATCTCGAAGGTTTTGGTCAACCGCATGAAGTATATACTGCCTCAGCTGATCTCGCAGAATCAGAGTGCTTTTATTCCAAGCAGGTGTGTTGTGGACAATGCCATTCTGGGGTTTGAGTGTATCCATGAGCTGCGGAGGAGAAGTAGGGGGAGGGCCAAATGGGCGACGCTAAAACTAGATATGAGTAAAGCATATGACAGGGTGGAATGGGCTTTCCTCCGGGAGGTTATGCTACGACTGGGTTTTGCGCAACAGTGGGTTGATTTGATCCTCCGATGTGTCAGCTCGGTATCGTTTTCCTTCAGTCTGAATGGGGAAAAGGTGGGGCAGGTGGTACCGTCTAGGGGTCTCCGGCAGGGGGATCCCCTCTCCCCATACTTGTTTCTGTTATGTGCTGAAGGGTTGTCCAGTCTGCTGTGTGGAGCTGAGTGGAGGTCTCAGATTACGGGATTTCGGGTTGGACGCTCTAGCCCATCGATTTCACACCTTTTTTTTGCAGACGACAACCTCCTCTTCTTCAGGGCCATCGGGAGTGAAGCGCTGGTTATTCGAGAGCTGCTGGAACGGTATGAGAGGGCGTCAGGGCAGACTATCAATTATGATAAGTCTGTTGTTGCTTTCAGCCCAAATACGGGGGAAGAGGCTCAGCAGTATATCAGCCAAGTTCTTTCCGTGTCTCGCTGCCCTTGTCATCAGCAGTATCTAGGCCTACCCTCGTTTATGCCACGTAACCGGTCGAGGGCGTTGAAGTTTGTGAAGGATCGTATTTGGCGCCAAATTCAGGGATGGAAGGGCAAGTTCTTCTCAATGGCGGGGAAGGAGGTTCTCCTCAAGTCCATAGTTCAGGCGATTCCTTGCTACACGATGGACTGTTTTCGACTGCCCAGGGGTTTGATCAAGGAGATTCACAGGACTATGGCTAGATTTTGGTGGAGTGGTTCTGAAGAAGAGAGACAAATACATTGGCTGAGTTGGGATTCTCTATGTCTCCCAAAGTTCTTGGGTGGGTTGGGGTTCCGTAATATGGAGCTTTTTAACGAAGCCCTACTGGCTAAACAGTGTTGGCGTGTTCTCCAGGATCCTTCATCTCTTCTGGGCTCTGTGCTAAAGGGCCGCGATTTTCCCCAGTCGGGTTTCTTGGAGGCAGGTATTGGGTCACGCCCGTCTTTCGTCTGGCGCAGTTTGTTATGGGGGCAGGAGCTCTTAGTTCGTGGATGTCGTTGGAGGATTGGTAATGGGCGTGCTACGCCCATCTATGGCTCGAACTGGCTACCGAATGAGTTTTCGCTTCAAATACAGTCGGCTTCAGTGCTTTCTCCTGCTAGTACGGTGAGTGAGTTGTTCACTGCGTCTGGTGGATGGGATGTGGCTTTACTCAGGACGATTTTCAATGGGGCTGATTGTGAGGCTATTTTGAGAATTCCTCTACGACAGGGCTCGGGGGAGAATCGCTTAATCTGGCACTTTGAGAAGCATGAGAATTTTTCGGTGAAGAGTGGGTATCGGCTTGCTCATACATTGGCTACTCAGGACCGACCTGGTTCCTCCAACTTCGAGAGAGTGCGCATGTGGTGGTCTAGCCTCTGGAGGTTGAATGTGCCCAATAAGCATAGGTTCTTCCTCTGGCGTCTGTGCCACGACCGCTTGCCAACTAAGGTAAACCTTCTCAAACGTGGACTCACTGTATCCCCTTTGTGTGTTTTGTGTGATGATGATGCAGAAGACTGTCTCCATCTGTTTTGGACCTGCCCTGTGGTTAAGAGTATGTGGTTGGGCTCCAAATTTTCCCTCTTCCACCAATCTTTTTCCCATTTCAGGTTCGAGGAAATCATTGGGGCGATGAGGGAAAAACTGACAGGGCTGGATTTTGAGCTTATGGTCATTTTTTGGTGGTCTGTGTGGAATCTACGAAACAACATGTTTTCGGGTGGGCAGTCAGACGGTCGGGATCTCTGGGCATATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGGGACGTTGCGGGACAAGGGACTCATGGGCTCAATCGATAGAGCAGGAAGAGCGCGGTGTATGGAGACCGCACCCTAATAGGGAGCTGAAACTTAATATCGATGCTTCGGTACGGCCGGATACAGGGGAAGCGGGGGGTGGCTGTGTGCTGCGAGGGGCTGAGGGTGAGGTATTCATGGCAGCTTGTTTGAGCTTACAGAGGTGTTGGAGCGTGGATTTGGATGAGGGTTGGGTTGTGTATAGAGGGATCCAACTTGCTCGACAGTTGGGGTTTGTGGATTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTCTGAATGGGGAGCTGCATGATGTGTAG

mRNA sequence

ATGACTGACTATTTTCAGCATCTGTTTTCCTCATCGGGTCCGAGTGTCCAGGATTTTGAGGTGGCGCTGCGAGATTTGGAGCCTTCTGTGGATGATGAGATGAACCAAACACTATTGCGACCTTTTACCAAAGAGGAGGTCTTGTTGGCTTTGAAGCAGACGCATCCTAACAATGCCCCAGGTCCAGATGGGCTGTCGGGGAGCTTTTACAAGTACCACTGGGACATTGTTGGGCCAGATATTATTCAGAGTTGTTTGACAGTTCTGAATCACGGATGCTCCCCAGGTGCTGTTAACGATACTATGATTGTGCTCATCCCAAAGATTAATGCAGCCCGACGGATGGTCGACTTTCGACCCATCTCCCTTTGTAATGTGAGTTACAAGCTGATCTCGAAGGTTTTGGTCAACCGCATGAAGTATATACTGCCTCAGCTGATCTCGCAGAATCAGAGTGCTTTTATTCCAAGCAGGTGTGTTGTGGACAATGCCATTCTGGGGTTTGAGTGTATCCATGAGCTGCGGAGGAGAAGTAGGGGGAGGGCCAAATGGGCGACGCTAAAACTAGATATGAGTAAAGCATATGACAGGGTGGAATGGGCTTTCCTCCGGGAGGTTATGCTACGACTGGGTTTTGCGCAACAGTGGGTTGATTTGATCCTCCGATGTGTCAGCTCGGTATCGTTTTCCTTCAGTCTGAATGGGGAAAAGGTGGGGCAGGTGGTACCGTCTAGGGGTCTCCGGCAGGGGGATCCCCTCTCCCCATACTTGTTTCTGTTATGTGCTGAAGGGTTGTCCAGTCTGCTGTGTGGAGCTGAGTGGAGGTCTCAGATTACGGGATTTCGGGTTGGACGCTCTAGCCCATCGATTTCACACCTTTTTTTTGCAGACGACAACCTCCTCTTCTTCAGGGCCATCGGGAGTGAAGCGCTGGTTATTCGAGAGCTGCTGGAACGGTATGAGAGGGCGTCAGGGCAGACTATCAATTATGATAAGTCTGTTGTTGCTTTCAGCCCAAATACGGGGGAAGAGGCTCAGCAGTATATCAGCCAAGTTCTTTCCGTGTCTCGCTGCCCTTGTCATCAGCAGTATCTAGGCCTACCCTCGTTTATGCCACGTAACCGGTCGAGGGCGTTGAAGTTTGTGAAGGATCGTATTTGGCGCCAAATTCAGGGATGGAAGGGCAAGTTCTTCTCAATGGCGGGGAAGGAGGTTCTCCTCAAGTCCATAGTTCAGGCGATTCCTTGCTACACGATGGACTGTTTTCGACTGCCCAGGGGTTTGATCAAGGAGATTCACAGGACTATGGCTAGATTTTGGTGGAGTGGTTCTGAAGAAGAGAGACAAATACATTGGCTGAGTTGGGATTCTCTATGTCTCCCAAAGTTCTTGGGTGGGTTGGGGTTCCGTAATATGGAGCTTTTTAACGAAGCCCTACTGGCTAAACAGTGTTGGCGTGTTCTCCAGGATCCTTCATCTCTTCTGGGCTCTGTGCTAAAGGGCCGCGATTTTCCCCAGTCGGGTTTCTTGGAGGCAGGTATTGGGTCACGCCCGTCTTTCGTCTGGCGCAGTTTGTTATGGGGGCAGGAGCTCTTAGTTCGTGGATGTCGTTGGAGGATTGGTAATGGGCGTGCTACGCCCATCTATGGCTCGAACTGGCTACCGAATGAGTTTTCGCTTCAAATACAGTCGGCTTCAGTGCTTTCTCCTGCTAGTACGGTGAGTGAGTTGTTCACTGCGTCTGGTGGATGGGATGTGGCTTTACTCAGGACGATTTTCAATGGGGCTGATTGTGAGGCTATTTTGAGAATTCCTCTACGACAGGGCTCGGGGGAGAATCGCTTAATCTGGCACTTTGAGAAGCATGAGAATTTTTCGGTGAAGAGTGGGTATCGGCTTGCTCATACATTGGCTACTCAGGACCGACCTGGTTCCTCCAACTTCGAGAGAGTGCGCATGTGGTGGTCTAGCCTCTGGAGGTTGAATGTGCCCAATAAGCATAGGTTCTTCCTCTGGCGTCTGTGCCACGACCGCTTGCCAACTAAGGTAAACCTTCTCAAACGTGGACTCACTGTATCCCCTTTGTGTGTTTTGTGTGATGATGATGCAGAAGACTGTCTCCATCTGTTTTGGACCTGCCCTGTGGTTAAGAGTATGTGGTTGGGCTCCAAATTTTCCCTCTTCCACCAATCTTTTTCCCATTTCAGGTTCGAGGAAATCATTGGGGCGATGAGGGAAAAACTGACAGGGCTGGATTTTGAGCTTATGGTCATTTTTTGGTGGTCTGTGTGGAATCTACGAAACAACATGTTTTCGGGTGGGCAGTCAGACGGTCGGGATCTCTGGGCATATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGGGACGTTGCGGGACAAGGGACTCATGGGCTCAATCGATAGAGCAGGAAGAGCGCGGTGTATGGAGACCGCACCCTAATAGGGAGCTGAAACTTAATATCGATGCTTCGGTACGGCCGGATACAGGGGAAGCGGGGGGTGGCTGTGTGCTGCGAGGGGCTGAGGGTGAGGTATTCATGGCAGCTTGTTTGAGCTTACAGAGGTGTTGGAGCGTGGATTTGGATGAGGGTTGGGTTGTGTATAGAGGGATCCAACTTGCTCGACAGTTGGGGTTTGTGGATTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTCTGAATGGGGAGCTGCATGATGTGTAG

Coding sequence (CDS)

ATGACTGACTATTTTCAGCATCTGTTTTCCTCATCGGGTCCGAGTGTCCAGGATTTTGAGGTGGCGCTGCGAGATTTGGAGCCTTCTGTGGATGATGAGATGAACCAAACACTATTGCGACCTTTTACCAAAGAGGAGGTCTTGTTGGCTTTGAAGCAGACGCATCCTAACAATGCCCCAGGTCCAGATGGGCTGTCGGGGAGCTTTTACAAGTACCACTGGGACATTGTTGGGCCAGATATTATTCAGAGTTGTTTGACAGTTCTGAATCACGGATGCTCCCCAGGTGCTGTTAACGATACTATGATTGTGCTCATCCCAAAGATTAATGCAGCCCGACGGATGGTCGACTTTCGACCCATCTCCCTTTGTAATGTGAGTTACAAGCTGATCTCGAAGGTTTTGGTCAACCGCATGAAGTATATACTGCCTCAGCTGATCTCGCAGAATCAGAGTGCTTTTATTCCAAGCAGGTGTGTTGTGGACAATGCCATTCTGGGGTTTGAGTGTATCCATGAGCTGCGGAGGAGAAGTAGGGGGAGGGCCAAATGGGCGACGCTAAAACTAGATATGAGTAAAGCATATGACAGGGTGGAATGGGCTTTCCTCCGGGAGGTTATGCTACGACTGGGTTTTGCGCAACAGTGGGTTGATTTGATCCTCCGATGTGTCAGCTCGGTATCGTTTTCCTTCAGTCTGAATGGGGAAAAGGTGGGGCAGGTGGTACCGTCTAGGGGTCTCCGGCAGGGGGATCCCCTCTCCCCATACTTGTTTCTGTTATGTGCTGAAGGGTTGTCCAGTCTGCTGTGTGGAGCTGAGTGGAGGTCTCAGATTACGGGATTTCGGGTTGGACGCTCTAGCCCATCGATTTCACACCTTTTTTTTGCAGACGACAACCTCCTCTTCTTCAGGGCCATCGGGAGTGAAGCGCTGGTTATTCGAGAGCTGCTGGAACGGTATGAGAGGGCGTCAGGGCAGACTATCAATTATGATAAGTCTGTTGTTGCTTTCAGCCCAAATACGGGGGAAGAGGCTCAGCAGTATATCAGCCAAGTTCTTTCCGTGTCTCGCTGCCCTTGTCATCAGCAGTATCTAGGCCTACCCTCGTTTATGCCACGTAACCGGTCGAGGGCGTTGAAGTTTGTGAAGGATCGTATTTGGCGCCAAATTCAGGGATGGAAGGGCAAGTTCTTCTCAATGGCGGGGAAGGAGGTTCTCCTCAAGTCCATAGTTCAGGCGATTCCTTGCTACACGATGGACTGTTTTCGACTGCCCAGGGGTTTGATCAAGGAGATTCACAGGACTATGGCTAGATTTTGGTGGAGTGGTTCTGAAGAAGAGAGACAAATACATTGGCTGAGTTGGGATTCTCTATGTCTCCCAAAGTTCTTGGGTGGGTTGGGGTTCCGTAATATGGAGCTTTTTAACGAAGCCCTACTGGCTAAACAGTGTTGGCGTGTTCTCCAGGATCCTTCATCTCTTCTGGGCTCTGTGCTAAAGGGCCGCGATTTTCCCCAGTCGGGTTTCTTGGAGGCAGGTATTGGGTCACGCCCGTCTTTCGTCTGGCGCAGTTTGTTATGGGGGCAGGAGCTCTTAGTTCGTGGATGTCGTTGGAGGATTGGTAATGGGCGTGCTACGCCCATCTATGGCTCGAACTGGCTACCGAATGAGTTTTCGCTTCAAATACAGTCGGCTTCAGTGCTTTCTCCTGCTAGTACGGTGAGTGAGTTGTTCACTGCGTCTGGTGGATGGGATGTGGCTTTACTCAGGACGATTTTCAATGGGGCTGATTGTGAGGCTATTTTGAGAATTCCTCTACGACAGGGCTCGGGGGAGAATCGCTTAATCTGGCACTTTGAGAAGCATGAGAATTTTTCGGTGAAGAGTGGGTATCGGCTTGCTCATACATTGGCTACTCAGGACCGACCTGGTTCCTCCAACTTCGAGAGAGTGCGCATGTGGTGGTCTAGCCTCTGGAGGTTGAATGTGCCCAATAAGCATAGGTTCTTCCTCTGGCGTCTGTGCCACGACCGCTTGCCAACTAAGGTAAACCTTCTCAAACGTGGACTCACTGTATCCCCTTTGTGTGTTTTGTGTGATGATGATGCAGAAGACTGTCTCCATCTGTTTTGGACCTGCCCTGTGGTTAAGAGTATGTGGTTGGGCTCCAAATTTTCCCTCTTCCACCAATCTTTTTCCCATTTCAGGTTCGAGGAAATCATTGGGGCGATGAGGGAAAAACTGACAGGGCTGGATTTTGAGCTTATGGTCATTTTTTGGTGGTCTGTGTGGAATCTACGAAACAACATGTTTTCGGGTGGGCAGTCAGACGGTCGGGATCTCTGGGCATATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGGGACGTTGCGGGACAAGGGACTCATGGGCTCAATCGATAGAGCAGGAAGAGCGCGGTGTATGGAGACCGCACCCTAATAGGGAGCTGAAACTTAATATCGATGCTTCGGTACGGCCGGATACAGGGGAAGCGGGGGGTGGCTGTGTGCTGCGAGGGGCTGAGGGTGAGGTATTCATGGCAGCTTGTTTGAGCTTACAGAGGTGTTGGAGCGTGGATTTGGATGAGGGTTGGGTTGTGTATAGAGGGATCCAACTTGCTCGACAGTTGGGGTTTGTGGATTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTCTGAATGGGGAGCTGCATGATGTGTAG

Protein sequence

MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGPDGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRAKWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVVPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLFFRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEAILRIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRMWWSSLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSMWLGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQSDGRDLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDASVRPDTGEAGGGCVLRGAEGEVFMAACLSLQRCWSVDLDEGWVVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDV
Homology
BLAST of Lag0018797 vs. NCBI nr
Match: VVA32947.1 (PREDICTED: retrotransposon [Prunus dulcis])

HSP 1 Score: 770.0 bits (1987), Expect = 2.3e-218
Identity = 379/898 (42.20%), Postives = 535/898 (59.58%), Query Frame = 0

Query: 3    DYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGP 62
            DYF+ LFSS+G   Q  E  L ++ P +   MN  LL+ FT+EE+   L Q  P  APG 
Sbjct: 323  DYFKTLFSSTGG--QQMERILNEVRPVITSAMNDRLLQAFTREELEHTLFQMFPTKAPGH 382

Query: 63   DGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPIS 122
            DG+   F++ +W IVG  + + CL +LN   S    N T+I LIPK+     + +FRPIS
Sbjct: 383  DGMPALFFQKYWHIVGDKVAKKCLQILNGEGSVREFNHTLIALIPKVKMPTTVSEFRPIS 442

Query: 123  LCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRA 182
            LC   YK+I+K + NR+K +LP +I++NQSAF+P+R ++DN +  FE +H ++   +GR 
Sbjct: 443  LCTTVYKMIAKTIANRLKTVLPHVITENQSAFVPNRMILDNVMAAFEIMHTIKGVKKGRD 502

Query: 183  KWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVV 242
                LKLDM+KAYDRVEW FLRE+ML+LGF+  WV  ++ C+S+ +FS    G  VG ++
Sbjct: 503  VKMALKLDMAKAYDRVEWVFLREMMLKLGFSATWVAKVMDCISTTTFSMLWKGNPVGHIM 562

Query: 243  PSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLF 302
            P RGLRQG PLSPYLFL+C EG S LL GAE R  + G +V R  PS++HL FADD++LF
Sbjct: 563  PQRGLRQGCPLSPYLFLMCTEGFSCLLRGAERRGDLVGVQVARGGPSVTHLLFADDSILF 622

Query: 303  FRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQ 362
             +A       +  L + YE  SGQ INY KS  + SPN        I  VL+V    CH+
Sbjct: 623  MKATNEACRALETLFQTYEEVSGQQINYSKSAFSLSPNATRADFDMIKGVLNVPVVQCHE 682

Query: 363  QYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCF 422
            +YLGLP+   + R +  + +KD++W+ I GWK K  S AGKE+L+K+++QAIP Y+M CF
Sbjct: 683  KYLGLPTIAGKGRKQLFQHLKDKLWKHISGWKEKLLSRAGKEILMKAVLQAIPTYSMSCF 742

Query: 423  RLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAK 482
            R+P+GL KE++  MARFWW+ ++++R IHW+ W+ LC  KF GGLGFR++E FN+ALLAK
Sbjct: 743  RIPKGLCKELNGIMARFWWAKAKDKRGIHWVKWELLCKSKFAGGLGFRDLEAFNQALLAK 802

Query: 483  QCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGN 542
            QCWR+L+ P SL+  + + R  P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+GN
Sbjct: 803  QCWRILRTPESLVARIFRARYHPSVPFLEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGN 862

Query: 543  GRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEAIL 602
            G +  +Y   WLP     +I S   L  ++ V +LFT+SG W+V LL+ IF   + +A L
Sbjct: 863  GVSIQVYTDKWLPAPSFFKIMSPPQLPLSTLVCDLFTSSGQWNVPLLKDIFWDQEVDAKL 922

Query: 603  RIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRM---WWSSLW 662
            +IPL   +G + LIWH+E++  +SVKSGYRLA     +D+       RV +   +W  +W
Sbjct: 923  QIPLASLAGHDCLIWHYERNGMYSVKSGYRLA--CLEKDKMSGEPSVRVDLNSKFWKKIW 982

Query: 663  RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSM 722
             L +PNK +FFLWR   D LP    L  R +  +P+C  C   AE  LH  W C   K +
Sbjct: 983  ALKIPNKIKFFLWRCAWDFLPCGQILFNRKIAPTPICPNCHRKAESVLHAVWLCETAKEV 1042

Query: 723  WLGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQSDGR 782
            W  S +    + +    F E+  A++   +G +  L     W +WN RN+    G+S+  
Sbjct: 1043 WRNSAWGNVCEEWRVNSFRELWHALQLSSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETA 1102

Query: 783  DLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDASVRPDTGE 842
                +    L+                QS  Q     WRP P    K+N+D +V+     
Sbjct: 1103 TQLLHRMTKLAQEFSNANNLSHTIHGRQSSPQAPLHGWRPPPAGIYKINVDGAVKSGDSV 1162

Query: 843  AGGGCVLRGAEGEVFMAACL-SLQRCWSVDLDEGWVVYRGIQLARQLGFVDFVVETDS 897
             G G V+R A GE FMAAC+  +Q  +     E      G++ A  +GF   V+E D+
Sbjct: 1163 RGVGVVVRNANGE-FMAACVRRIQASYGARQTELMATIEGLRFAIDMGFTAAVLEMDA 1215

BLAST of Lag0018797 vs. NCBI nr
Match: ONI01138.1 (hypothetical protein PRUPE_6G123900 [Prunus persica])

HSP 1 Score: 747.7 bits (1929), Expect = 1.2e-211
Identity = 376/906 (41.50%), Postives = 535/906 (59.05%), Query Frame = 0

Query: 3   DYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGP 62
           DYF+ LFSSSG   Q  E  L ++ P +   MN  LL+ FT+EE+   L Q  P  APG 
Sbjct: 57  DYFKTLFSSSGG--QQMERILNEVRPVITSAMNAQLLQAFTREELEHTLFQMFPTKAPGH 116

Query: 63  DGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPIS 122
           DG+   F++ +W IVG  + + CL +LN   S    N T+I LIPK+     + +FRPIS
Sbjct: 117 DGMPALFFQKYWHIVGDKVAKKCLQILNGEGSVREFNHTLIALIPKVKMPTIVSEFRPIS 176

Query: 123 LCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRA 182
           LC   YK+I+K + NR+K +L  +I++ QSAF+P+R ++DN +  FE ++ ++   +GR 
Sbjct: 177 LCTTVYKMIAKTIANRLKTVLSHVITETQSAFVPNRMILDNVMAAFEIMNTIKGVKKGRD 236

Query: 183 KWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVV 242
               LKLDM+KAYDRVEW FLR +ML+LGF+  WV  ++ C+S+ +FS    G  VG ++
Sbjct: 237 VQMALKLDMAKAYDRVEWVFLRAMMLKLGFSATWVSKVMDCISTTTFSVLWKGTPVGHIM 296

Query: 243 PSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLF 302
           P RGLRQG PLSPYLFL+C EG S LL GAE R  + G +V R +PS++HL FADD++LF
Sbjct: 297 PQRGLRQGCPLSPYLFLICTEGFSCLLRGAERRGDLVGVQVARGAPSVTHLLFADDSILF 356

Query: 303 FRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQ 362
            +A   + + +  L + YE  +GQ INY KS ++ SPN        I  VL+V    CH+
Sbjct: 357 MKATNKDCMALETLFQTYEEVTGQQINYSKSALSLSPNATRADFDMIEGVLNVPVVRCHE 416

Query: 363 QYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCF 422
            YLGLP+   + R +  + +KD++W+ I GWK K  S AGKE+L+K+++QAIP Y+M CF
Sbjct: 417 NYLGLPTIAGKGRKQLFQHLKDKLWKHISGWKEKLLSRAGKEILIKAVLQAIPTYSMSCF 476

Query: 423 RLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAK 482
           R+P+GL KE++  MARFWW+ ++++R IHW+ W+ LC  KF GGLGFR++E FN+ALLAK
Sbjct: 477 RIPKGLCKELNGIMARFWWAKAKDKRGIHWVKWELLCKSKFAGGLGFRDLEAFNQALLAK 536

Query: 483 QCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGN 542
           QCWR+L+ P SL+  + + R  P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+G+
Sbjct: 537 QCWRILRTPESLVARIFRARYHPSVPFLEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGS 596

Query: 543 GRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEAIL 602
           G +  +Y   WLP     +I S   L  ++ V +LFT+SG W+V LL+ IF   + +AIL
Sbjct: 597 GVSIQVYTDKWLPAPSCFKIMSPPQLPLSTRVCDLFTSSGQWNVPLLKDIFWDQEVDAIL 656

Query: 603 RIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRM---WWSSLW 662
           +IPL   +G + LIWH+E++  +SVKSGYRLA     +D+       RV +   +W  +W
Sbjct: 657 QIPLASLAGHDCLIWHYERNGMYSVKSGYRLAG--LEKDKMSGEPSARVDLNSKFWKKIW 716

Query: 663 RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSM 722
            L +PNK +FFLWR   D LP    L  R +  +P+C  C   AE  LH  W C   K +
Sbjct: 717 ALKIPNKIKFFLWRCAWDFLPCGQILFNRKIAPTPICPKCHRKAESVLHAVWLCEAAKEV 776

Query: 723 WLGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQSDG- 782
           W  S +    + +    F E+  A++   +G +  L     W +WN RN+    G+S+  
Sbjct: 777 WRNSAWGNVCEVWRVNSFRELWHALQLSSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETA 836

Query: 783 -------RDLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDA 842
                    L    SD  +  H   GR        QS  Q     WRP P          
Sbjct: 837 IQLLSRMTKLAQEFSDANNILHTIHGR--------QSSPQAPLQGWRPPP---------- 896

Query: 843 SVRPDTGEAGGGCVLRGAEGEVFMAACL-SLQRCWSVDLDEGWVVYRGIQLARQLGFVDF 897
           +V+      G G V+R A GE FMAAC+  +   +     E      G++ A  +GF D 
Sbjct: 897 AVKSGDSVRGVGVVVRNANGE-FMAACVRRIHASYGARQTELMATIEGLRFAIDMGFTDA 939

BLAST of Lag0018797 vs. NCBI nr
Match: XP_023909336.1 (uncharacterized protein LOC112020997 [Quercus suber])

HSP 1 Score: 740.7 bits (1911), Expect = 1.5e-209
Identity = 381/910 (41.87%), Postives = 543/910 (59.67%), Query Frame = 0

Query: 1    MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAP 60
            M+DYF  LF+++ PS  D +  L+ ++  V  +MNQ L R FT  EV  ALKQ    +AP
Sbjct: 397  MSDYFSDLFTTATPS--DLDSILQGIDRKVTPQMNQELTREFTANEVEAALKQMKSISAP 456

Query: 61   GPDGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRP 120
            GPDG+   F+K++W+ VGPD++ + L+VLN G  P  +N T I LIPK  +     DFRP
Sbjct: 457  GPDGMPPIFFKHYWNTVGPDVLSATLSVLNSGIIPPNINHTFISLIPKTKSPETAKDFRP 516

Query: 121  ISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRG 180
            ISLCNV YKLISK + NR+K  LP+LIS +QSAF+ +R + DN ++ FE +H L+ + +G
Sbjct: 517  ISLCNVIYKLISKTIANRLKKCLPKLISDSQSAFLSNRLITDNILIAFETLHHLKNKRKG 576

Query: 181  RAKWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQ 240
            +  +  LKLDMSKAYDRVEW FL  +M +LGFA++W+DLI  C+S+VSFS  +NG   G 
Sbjct: 577  KTGYMALKLDMSKAYDRVEWTFLENLMDKLGFARKWIDLIKSCISTVSFSILINGAPYGL 636

Query: 241  VVPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNL 300
            + P RGLRQGDPLSPYLFLLCAEGL +L+  A     I+G  + R  P ++HL FADD+L
Sbjct: 637  IHPQRGLRQGDPLSPYLFLLCAEGLHALIKQAATNGTISGVSLCREGPRVTHLLFADDSL 696

Query: 301  LFFRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPC 360
            L  +A   E   + ELLE+YERASGQ IN DK+ + FS NT ++ +  I   L V+    
Sbjct: 697  LLCKANSRECNSVLELLEKYERASGQRINRDKTQLFFSSNTNQQTRNSIKSSLGVAVSHQ 756

Query: 361  HQQYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMD 420
              +YLGLPSF+ R + ++  ++++RIW++IQGWK K  S AGKEVL+KSI+QA+P Y+M+
Sbjct: 757  LDKYLGLPSFVGRGKKQSFSYIRERIWQKIQGWKEKLLSQAGKEVLIKSILQAMPTYSMN 816

Query: 421  CFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALL 480
            CF+LPR L K+I   + +FWW    E+R+ HW++W+ +CLPK  GGLGFR++E FN ALL
Sbjct: 817  CFKLPRSLCKDIESLIRKFWWGYRGEQRKTHWVAWNKMCLPKCQGGLGFRDIENFNLALL 876

Query: 481  AKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRI 540
             KQ WR+L +  SL   V K R FP    ++ G+ +  S+ W+S+L  ++++  G  WRI
Sbjct: 877  GKQVWRLLHNQDSLFYKVFKARFFPNCSIMDEGVKTNGSYAWQSILQARKVVDMGSYWRI 936

Query: 541  GNGRATPIYGSNWLPNEFSLQIQSASVLSPAS-TVSELFTASG-GWDVALLRTIFNGADC 600
            G+GR+  I G  WLP     ++ S     P +  V  L   +G  WD   +R+ F   + 
Sbjct: 937  GDGRSVLIRGDKWLPGSHHSKVLSPQNHFPMNMKVCALLNENGTSWDADRIRSEFLPCEA 996

Query: 601  EAILRIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRMWWSSL 660
            + IL IPL      +  IW   K+  +S KS YRL    A  ++PG+SN   +  +W+++
Sbjct: 997  QEILSIPLSSRRPVDGRIWKETKNGVYSTKSAYRLLSKTAISNQPGTSNPSMLNSFWTNI 1056

Query: 661  WRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKS 720
            W+LN+PNK + FLWR C D LPTK+NL++R +  +  C LC D  ED +H  W C  VK 
Sbjct: 1057 WKLNIPNKVKHFLWRACSDSLPTKMNLVRRKIITNVTCDLCRDQPEDAIHALWDCHGVKE 1116

Query: 721  MWLGSKF--SLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQS 780
            +W   +       + F +F+ +  +G ++     L  E +    WS+W  RN + +G  S
Sbjct: 1117 IWWKEEVCKPFLLERFVNFQ-DLFLGILKAHDPHL-AERVAFIAWSIWYKRNAVRAGSPS 1176

Query: 781  DGRDLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDASVRPD 840
                   YS  +  A          ++     I + E   W P PN   K N D ++  +
Sbjct: 1177 -----LPYSMIHTEAMERLQEFQRVQEIPTTPIHEAEPIRWSPPPNSWCKANFDGAIFQE 1236

Query: 841  TGEAGGGCVLRGAEGEVFMAACLSLQRCWSVDLDEGWVVYRGIQLARQLGFVDFVVETDS 900
             G AG G V+R  EG+V  A    +    SVD  E     R I  A++LG    + E D+
Sbjct: 1237 LGAAGLGVVIRDHEGKVVGALSERIVLPTSVDDVEAMAGRRAISFAKELGLPKVIFEGDA 1296

Query: 901  LRLVKILNGE 907
            + ++  LN E
Sbjct: 1297 VGIIHSLNAE 1297

BLAST of Lag0018797 vs. NCBI nr
Match: XP_030508852.1 (uncharacterized protein LOC115723496 [Cannabis sativa])

HSP 1 Score: 738.4 bits (1905), Expect = 7.3e-209
Identity = 378/911 (41.49%), Postives = 532/911 (58.40%), Query Frame = 0

Query: 1    MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAP 60
            ++DY+  LF+S G   +  ++ L  +  ++DD     +  PFT  +V  ALK    + +P
Sbjct: 400  ISDYYNDLFTSRGADQESLDIILDSIPSTLDDTARTFISAPFTAADVYDALKTMSDDKSP 459

Query: 61   GPDGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRP 120
            G DG+S  FY  +W IVGP +  + L VLN+G  P + N T++ LIPK+    ++  +RP
Sbjct: 460  GIDGMSVMFYTNYWHIVGPLVTAAVLNVLNNGADPSSFNSTLVTLIPKVKKPSQISQYRP 519

Query: 121  ISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRG 180
            ISLCNV YKL+SK +V R+K  L Q+IS+ QSAF+  R + DN ++ FE +H L+ R RG
Sbjct: 520  ISLCNVLYKLVSKAIVMRLKPFLSQVISEYQSAFLSQRLITDNILVAFELLHSLKNRKRG 579

Query: 181  RAKWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQ 240
               +A +KLDMSKA+DRVEW F+ +VM+++GF    V+LILRC+ SVS+SF LNG   GQ
Sbjct: 580  SKGFAAIKLDMSKAFDRVEWHFVAQVMIKMGFGTVMVELILRCLQSVSYSFLLNGTIQGQ 639

Query: 241  VVPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNL 300
            V+PSRG+RQGDPLSPYLFL+CAEGLS LL   E    + G ++ RS+PS+SHLFFADD++
Sbjct: 640  VIPSRGIRQGDPLSPYLFLICAEGLSRLLQYEELAGSLEGLKISRSAPSVSHLFFADDSV 699

Query: 301  LFFRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPC 360
            LF RA    A  I   L  Y RASGQ IN +K V++FS NT +  Q +   +L +   PC
Sbjct: 700  LFCRANQQSARAIHRCLITYSRASGQVINPEKCVLSFSENTRQHEQIFFKDLLGMPIQPC 759

Query: 361  HQQYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMD 420
            H+QYLGLPSF  +N+ +    + D+IW+ +  WK   FS  GKEVLLK++VQAIP Y M 
Sbjct: 760  HEQYLGLPSFSGKNKKQLFGGITDKIWKLLSSWKEHLFSAGGKEVLLKAVVQAIPTYAMS 819

Query: 421  CFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALL 480
            CFRLP  L  +I   MARFWW  +   + IHW +W+ LC  K  GGLGFRN   FN+ALL
Sbjct: 820  CFRLPVTLCHQIESMMARFWWGSTATGKTIHWKNWNFLCKAKVQGGLGFRNFIHFNQALL 879

Query: 481  AKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRI 540
            AKQ WR+L+ P+SLL ++L+ R F    +L AG+GS PS  WRSL+WG+ELL++G RWR+
Sbjct: 880  AKQAWRILEFPNSLLSNLLRHRYFSNGNYLIAGLGSNPSLTWRSLVWGKELLLKGLRWRV 939

Query: 541  GNGRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEA 600
            G+G        +WLP   + +        P   V++L T    WD+  L T FN AD   
Sbjct: 940  GSGERINCKTDSWLPGHTTFKPYFFKGPDPNLLVADLITEHRTWDMISLETNFNQADINR 999

Query: 601  ILRIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRMWWSSLWR 660
            +L IPL     ++ LIW+      ++VKSGY  A +LA QD    SN   +  WWS+ W+
Sbjct: 1000 VLSIPLSPYPHDDVLIWNQSFTGVYNVKSGYHFAVSLAEQDDSTCSN--SIEHWWSNFWK 1059

Query: 661  LNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSMW 720
            L +P K R F+W++ H  LP    L +R +  SP C +C+   E   H  ++CP  K++W
Sbjct: 1060 LKLPPKVRIFVWKVFHTSLPVAAELYRRHIAFSPYCTICNSCEETVHHALFSCPRAKAVW 1119

Query: 721  LGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQ-SDGR 780
              S FS+  Q+       + +  +   L+  + EL ++  WS+W+ RN ++ G       
Sbjct: 1120 ELSNFSIDFQTIERSSTADTLLLLSTSLSSSELELFLVLCWSIWHERNAIYHGNSVRTPA 1179

Query: 781  DLWAYSSDYLSAFHVGGGR-------CGTRDSWAQSIEQEERGVWRPHPNRELKLNIDAS 840
             + AY+  YL+ F     +        G       S E      W   P   LKLN DA+
Sbjct: 1180 AVAAYAPSYLTEFQQARAKNAKPVTASGAATPSRPSSEFIHAPKWTTPPRGRLKLNTDAA 1239

Query: 841  VRPDTGEAGGGCVLRGAEGEVFMAACLSLQRCWSVDLDE--GWVVYRGIQLARQLGFVDF 900
            +  +    G G VLR ++G +  A     +  +  +  E  G  +     L+  L  VDF
Sbjct: 1240 IDKERNTIGIGAVLRNSDGIIVAALSKPFRGNFKAEEMEALGLALSLNWLLSHNLS-VDF 1299

Query: 901  VVETDSLRLVK 902
             +ETDSL +V+
Sbjct: 1300 -IETDSLLVVQ 1306

BLAST of Lag0018797 vs. NCBI nr
Match: XP_030939975.1 (uncharacterized protein LOC115964883 [Quercus lobata])

HSP 1 Score: 724.5 bits (1869), Expect = 1.1e-204
Identity = 377/918 (41.07%), Postives = 534/918 (58.17%), Query Frame = 0

Query: 1    MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAP 60
            M DYF+ +F+S+ PS  +F+  L+ ++  V   MN  L R FT +EV  ALKQ  P  AP
Sbjct: 368  MVDYFKQIFASTMPS--NFDQILQGIDTKVTPAMNADLTREFTADEVEFALKQMKPLTAP 427

Query: 61   GPDGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRP 120
            G DG+S  FYK  W+ +G D+I + L +LN G  P ++N T I LIPKI +  +  DFRP
Sbjct: 428  GLDGMSPIFYKSCWNFIGHDVIDASLAILNSGNMPASLNHTYISLIPKIKSPEKATDFRP 487

Query: 121  ISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRG 180
            ISLCNV YK++SK + NR+K +LP+L+S++QSAF+  R + DN ++ FE +H L+ +++G
Sbjct: 488  ISLCNVLYKIVSKTIANRLKKLLPKLVSESQSAFMSDRLISDNILVAFETLHHLKTKTKG 547

Query: 181  RAKWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQ 240
            ++ +  +KLDMSKAYDRVEWAFL +VM +LGF  +W+ L+  C+ SVSFS  +NGE  G 
Sbjct: 548  KSGFMAIKLDMSKAYDRVEWAFLEKVMEKLGFDNRWITLVSSCIRSVSFSVLVNGEPHGN 607

Query: 241  VVPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNL 300
              P+RGLRQGDPLSPYLFLLCAEGL SL+  AE    I G  +  + P +SHLFFADD+L
Sbjct: 608  FTPNRGLRQGDPLSPYLFLLCAEGLHSLIQQAEISGTIKGVSLCSTGPKVSHLFFADDSL 667

Query: 301  LFFRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPC 360
            LF RA   EA  I E+L++YE ASGQ IN +K+ + FSPNT    Q+ I  +L V+    
Sbjct: 668  LFCRANSQEASSIMEILKQYEEASGQQINREKTQLFFSPNTDPHVQEEIKTLLGVATTTN 727

Query: 361  HQQYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMD 420
            +++YLGLPSF+ R + ++  ++++RIW ++QGWK +  S  G+EVL+K+++QA+P +TM 
Sbjct: 728  YEKYLGLPSFVGRGKKQSFGYIRERIWHKMQGWKERLLSQGGREVLIKAVLQAMPTFTMG 787

Query: 421  CFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALL 480
            CF++P+ L K+I   + +FWW    E R+IHW+ W  LC  K  GGLGF+++ELFN A+L
Sbjct: 788  CFKIPKSLCKDIESLIRKFWWGYKGEARKIHWVGWKKLCKSKSHGGLGFKDIELFNIAML 847

Query: 481  AKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRI 540
             KQ WR++ +  SL   V K + FP    L+ G+    S+ W+S+L  + ++  G +WRI
Sbjct: 848  GKQVWRLIHNKDSLFYKVFKAKFFPNCSILDEGVKENGSYAWQSILKARGVVRMGSKWRI 907

Query: 541  GNGRATPIYGSNWLPNEFSLQIQSASVLSPAST-VSELFTASGG-WDVALLRTIFNGADC 600
            G+G +  I G  WLP+ FS ++ S     P +T V  L       W    +R  F   + 
Sbjct: 908  GDGHSVRIRGDKWLPDLFSSRVVSPQKNFPNNTRVCALIDEENRCWMEDRIREEFLPHEA 967

Query: 601  EAILRIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRMWWSSL 660
            EAIL +PL     E+RLIW    +  ++ KS YRL    A    PG+SN    + +W  L
Sbjct: 968  EAILSLPLSFNGKEDRLIWAETANGYYTTKSAYRLLLQAAEAAAPGTSNPADQKPFWQEL 1027

Query: 661  WRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKS 720
            W LNVPNK R FLWR  +D LPTK NLLKR +     C  C  + ED +H  W C ++K 
Sbjct: 1028 WSLNVPNKIRHFLWRAANDSLPTKKNLLKRNIIQDTTCERCGGEIEDGIHAIWGCQMIKQ 1087

Query: 721  MW---------LGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNN 780
            +W         L  KF+ FH        + + G + +K+     EL     WS+W+ RN 
Sbjct: 1088 VWWELEKCREFLNEKFASFH--------DLLQGILAQKIPNF-AELFAFIGWSIWHDRNA 1147

Query: 781  MFSGGQS-DGRDLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLN 840
               G  S     ++  + + L  FH        ++     +       W P      K+N
Sbjct: 1148 RRLGSPSLPTEKIYRVAVERLREFH------SVQEDPRPQLPVHHPTHWLPPSPSVYKVN 1207

Query: 841  IDASVRPDTGEAGGGCVLRGAEGEVFMAACLSLQRCWSVDLDEGWVVYRGIQLARQLGFV 900
             D +   D   AG G V+R +EG V  A    +    +V   E     R I  AR+LG  
Sbjct: 1208 FDGATFLDIAAAGLGVVIRDSEGLVIAALSERIHLPPTVAALEALACRRSIVFARELGLQ 1267

Query: 901  DFVVETDSLRLVKILNGE 907
            D V E DS  + K+L  E
Sbjct: 1268 DVVFEGDSEVVFKLLTAE 1268

BLAST of Lag0018797 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 2.4e-45
Identity = 143/556 (25.72%), Postives = 242/556 (43.53%), Query Frame = 0

Query: 367 LPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCFRLPR 426
           +P    R        + +R+  ++ GW+ K  S AG+  L K+++ ++P ++M    LP+
Sbjct: 1   MPVLQKRINKDTFGEILERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQ 60

Query: 427 GLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAKQCWR 486
            ++  + +    F W  + E+++ H + W  +C PK  GGLG R  +  N AL++K  WR
Sbjct: 61  SILNRLDQLSRTFLWGSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWR 120

Query: 487 VLQDPSSLLGSVLKGR----DFPQSGFLEAGIGSRPSFVWRSLLWG-QELLVRGCRWRIG 546
           +LQ+ +SL   VL+ +    +   S +L    GS  S  WRS+  G ++++  G  W  G
Sbjct: 121 LLQEKNSLWTLVLQKKYHVGEIRDSRWL-IPKGSWSS-TWRSIAIGLRDVVSHGVGWIPG 180

Query: 547 NGRATPIYGSNWLPNEFSLQIQSASVLSPASTV--SELFTASGGWDVALLRTIFNGADCE 606
           +G+    +   W+  +  L++ +    +   TV   +L+    GWD A +          
Sbjct: 181 DGQQIRFWTDRWVSGKPLLELDNGERPTDCDTVVAKDLWIPGRGWDFAKIDPYTTNNTRL 240

Query: 607 AILRIPLRQGSG-ENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRMWWSSL 666
            +  + L   +G  +RL W F +   FSV+S Y +  T+    RP  ++F      ++ L
Sbjct: 241 ELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEML-TVDEVPRPNMASF------FNCL 300

Query: 667 WRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKS 726
           W++ VP + + FLW + +  + T+    +R L+ S +C +C    E  LH+   CP    
Sbjct: 301 WKVRVPERVKTFLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLG 360

Query: 727 MWLGSKFSLFHQS-FSHFRFEEIIGAMREKLTGLDFE----LMVIFWWSVWNLRNNMFSG 786
           +W+        Q  FS   FE +   + ++    D        VI WW  W  R     G
Sbjct: 361 IWVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIPWSTIFAVIIWWG-WKWRCGNIFG 420

Query: 787 GQSDGRDLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDASV 846
             +  RD   +  ++  A  V     G           E    W       +K+N D + 
Sbjct: 421 ENTKCRDRVKFVKEW--AVEVYRAHSGNVLVGITQPRVERMIGWVSPCVGWVKVNTDGAS 480

Query: 847 RPDTGEAGGGCVLRGAEGEVFMAACLSLQRCWSVDLDEGWVVYRGIQLARQLGFVDFVVE 906
           R + G A  G VLR   G       L++ RC S    E W VY G+  A +       +E
Sbjct: 481 RGNPGLASAGGVLRDCTGAWCGGFSLNIGRC-SAPQAELWGVYYGLYFAWEKKVPRVELE 540

Query: 907 TDSLRLVKILNGELHD 910
            DS  +V  L   + D
Sbjct: 541 VDSEVIVGFLKTGISD 543

BLAST of Lag0018797 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 1.0e-40
Identity = 132/481 (27.44%), Postives = 212/481 (44.07%), Query Frame = 0

Query: 4   YFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGPD 63
           ++Q+LFS   P   D    L D  P V +   + L  P T +E+  AL+    N +PG D
Sbjct: 412 FYQNLFSPD-PISPDACEELWDGLPVVSERRKERLETPITLDELSQALRLMPHNKSPGLD 471

Query: 64  GLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPISL 123
           GL+  F+++ WD +GPD  +        G  P +    ++ L+PK    R + ++RP+SL
Sbjct: 472 GLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGDLRLIKNWRPVSL 531

Query: 124 CNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRAK 183
            +  YK+++K +  R+K +L ++I  +QS  +P R + DN  L  + +H  RR       
Sbjct: 532 LSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLLHFARRTG---LS 591

Query: 184 WATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVVP 243
            A L LD  KA+DRV+  +L   +    F  Q+V  +    +S      +N      +  
Sbjct: 592 LAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAF 651

Query: 244 SRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLFF 303
            RG+RQG PLS  L+ L  E    LL     R ++TG  +      +    +ADD +L  
Sbjct: 652 GRGVRQGCPLSGQLYSLAIEPFLCLL-----RKRLTGLVLKEPDMRVVLSAYADDVILVA 711

Query: 304 RAIGSEALVIRELLERYERASGQTINYDKS--------VVAFSPNTGEEAQ------QYI 363
           + +  +    +E  E Y  AS   IN+ KS         V F P    +        +Y+
Sbjct: 712 QDL-VDLERAQECQEVYAAASSARINWSKSSGLLEGSLKVDFLPPAFRDISWESKIIKYL 771

Query: 364 SQVLSVSRCPCHQQYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKG--KFFSMAGKEVLL 423
              LS    P  Q ++ L               ++ +  ++  WKG  K  SM G+ +++
Sbjct: 772 GVYLSAEEYPVSQNFIEL---------------EECVLTRLGKWKGFAKVLSMRGRALVI 831

Query: 424 KSIVQAIPCYTMDCFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGL 469
             +V +   Y + C    +  I +I R +  F W G       HW+S     LP   GG 
Sbjct: 832 NQLVASQIWYRLICLSPTQEFIAKIQRRLLDFLWIGK------HWVSAGVSSLPLKEGGQ 861

BLAST of Lag0018797 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 156.4 bits (394), Expect = 1.6e-36
Identity = 118/493 (23.94%), Postives = 227/493 (46.04%), Query Frame = 0

Query: 4   YFQHLFSSSGPSVQDFEVAL-RDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGP 63
           +++ L+S+   ++ + +  L R   P ++ +    L  P + +E+   +       +PGP
Sbjct: 421 FYKRLYSTKLENLDEMDKFLDRYQVPKLNQDQVDHLNSPISPKEIEAVINSLPTKKSPGP 480

Query: 64  DGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINA-ARRMVDFRPI 123
           DG S  FY+   + + P + +    +   G  P +  +  I LIPK      ++ +FRPI
Sbjct: 481 DGFSAEFYQTFKEDLIPILHKLFHKIEVEGTLPNSFYEATITLIPKPQKDPTKIENFRPI 540

Query: 124 SLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGR 183
           SL N+  K+++K+L NR++  +  +I  +Q  FIP      N       IH + +     
Sbjct: 541 SLMNIDAKILNKILANRIQEHIKAIIHPDQVGFIPGMQGWFNIRKSINVIHYINKLK--D 600

Query: 184 AKWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQV 243
                + LD  KA+D+++  F+ +V+ R G    ++++I    S    +  +NGEK+  +
Sbjct: 601 KNHMIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIKVNGEKLEAI 660

Query: 244 VPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLL 303
               G RQG PLSPYLF +  E L+  +     + +I G ++G+    IS L  ADD ++
Sbjct: 661 PLKSGTRQGCPLSPYLFNIVLEVLARAI---RQQKEIKGIQIGKEEVKISLL--ADDMIV 720

Query: 304 FFRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCH 363
           +     +    +  L+  +    G  IN +KS +AF     ++A++ I +    S    +
Sbjct: 721 YISDPKNSTRELLNLINSFGEVVGYKINSNKS-MAFLYTKNKQAEKEIRETTPFSIVTNN 780

Query: 364 QQYLG--LPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIV--QAIPCY 423
            +YLG  L   +     +  K +K  I   ++ WK    S  G+  ++K  +  +AI  +
Sbjct: 781 IKYLGVTLTKEVKDLYDKNFKSLKKEIKEDLRRWKDLPCSWIGRINIVKMAILPKAIYRF 840

Query: 424 TMDCFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNE 483
                ++P     E+   + +F W+  +       ++   L   +  GG+   +++L+  
Sbjct: 841 NAIPIKIPTQFFNELEGAICKFVWNNKKPR-----IAKSLLKDKRTSGGITMPDLKLYYR 900

Query: 484 ALLAKQCWRVLQD 491
           A++ K  W   +D
Sbjct: 901 AIVIKTAWYWYRD 900

BLAST of Lag0018797 vs. ExPASy Swiss-Prot
Match: P93295 (Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 GN=AtMg00310 PE=4 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 8.6e-35
Identity = 67/149 (44.97%), Postives = 96/149 (64.43%), Query Frame = 0

Query: 413 AIPCYTMDCFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPK-FLGGLGFRN 472
           A+P Y M CFRL + L K++   M  FWWS  E +R+I W++W  LC  K   GGLGFR+
Sbjct: 2   ALPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRD 61

Query: 473 MELFNEALLAKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQEL 532
           +  FN+ALLAKQ +R++  P +LL  +L+ R FP S  +E  +G+RPS+ WRS++ G+EL
Sbjct: 62  LGWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGREL 121

Query: 533 LVRGCRWRIGNGRATPIYGSNWLPNEFSL 561
           L RG    IG+G  T ++   W+ +E  L
Sbjct: 122 LSRGLLRTIGDGIHTKVWLDRWIMDETPL 150

BLAST of Lag0018797 vs. ExPASy Swiss-Prot
Match: P16423 (Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 89.7 bits (221), Expect = 1.8e-16
Identity = 89/334 (26.65%), Postives = 155/334 (46.41%), Query Frame = 0

Query: 1   MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTK-EEVLLALKQTHPNNA 60
           M  Y++ + +   PS    EV           +M+ +L R ++   E  L   +   +++
Sbjct: 310 MVPYWREVMTQPSPSSCSGEVI----------QMDHSLERVWSAITEQDLRASRVSLSSS 369

Query: 61  PGPDGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFR 120
           PGPDG++    K   ++    +++    +L  G  P ++     V IPK   A+R  DFR
Sbjct: 370 PGPDGITP---KSAREVPSGIMLRIMNLILWCGNLPHSIRLARTVFIPKTVTAKRPQDFR 429

Query: 121 PISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSR 180
           PIS+ +V  + ++ +L  R+   +       Q  F+P+    DNA +  + +  LR   +
Sbjct: 430 PISVPSVLVRQLNAILATRLNSSINW--DPRQRGFLPTDGCADNATI-VDLV--LRHSHK 489

Query: 181 GRAKWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVG 240
                    LD+SKA+D +  A + + +   G  + +VD +         S + +G    
Sbjct: 490 HFRSCYIANLDVSKAFDSLSHASIYDTLRAYGAPKGFVDYVQNTYEGGGTSLNGDGWSSE 549

Query: 241 QVVPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDN 300
           + VP+RG++QGDPLSP LF L  + L   L      S+I G +VG +  + +   FADD 
Sbjct: 550 EFVPARGVKQGDPLSPILFNLVMDRLLRTL-----PSEI-GAKVGNAITNAA--AFADDL 609

Query: 301 LLFFRA-IGSEALVIRELLERYERASGQTINYDK 333
           +LF    +G + L+ + L   +    G  +N DK
Sbjct: 610 VLFAETRMGLQVLLDKTL--DFLSIVGLKLNADK 615

BLAST of Lag0018797 vs. ExPASy TrEMBL
Match: A0A5E4FZN9 (PREDICTED: retrotransposon OS=Prunus dulcis OX=3755 GN=ALMOND_2B007697 PE=4 SV=1)

HSP 1 Score: 770.0 bits (1987), Expect = 1.1e-218
Identity = 379/898 (42.20%), Postives = 535/898 (59.58%), Query Frame = 0

Query: 3    DYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGP 62
            DYF+ LFSS+G   Q  E  L ++ P +   MN  LL+ FT+EE+   L Q  P  APG 
Sbjct: 323  DYFKTLFSSTGG--QQMERILNEVRPVITSAMNDRLLQAFTREELEHTLFQMFPTKAPGH 382

Query: 63   DGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPIS 122
            DG+   F++ +W IVG  + + CL +LN   S    N T+I LIPK+     + +FRPIS
Sbjct: 383  DGMPALFFQKYWHIVGDKVAKKCLQILNGEGSVREFNHTLIALIPKVKMPTTVSEFRPIS 442

Query: 123  LCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRA 182
            LC   YK+I+K + NR+K +LP +I++NQSAF+P+R ++DN +  FE +H ++   +GR 
Sbjct: 443  LCTTVYKMIAKTIANRLKTVLPHVITENQSAFVPNRMILDNVMAAFEIMHTIKGVKKGRD 502

Query: 183  KWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVV 242
                LKLDM+KAYDRVEW FLRE+ML+LGF+  WV  ++ C+S+ +FS    G  VG ++
Sbjct: 503  VKMALKLDMAKAYDRVEWVFLREMMLKLGFSATWVAKVMDCISTTTFSMLWKGNPVGHIM 562

Query: 243  PSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLF 302
            P RGLRQG PLSPYLFL+C EG S LL GAE R  + G +V R  PS++HL FADD++LF
Sbjct: 563  PQRGLRQGCPLSPYLFLMCTEGFSCLLRGAERRGDLVGVQVARGGPSVTHLLFADDSILF 622

Query: 303  FRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQ 362
             +A       +  L + YE  SGQ INY KS  + SPN        I  VL+V    CH+
Sbjct: 623  MKATNEACRALETLFQTYEEVSGQQINYSKSAFSLSPNATRADFDMIKGVLNVPVVQCHE 682

Query: 363  QYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCF 422
            +YLGLP+   + R +  + +KD++W+ I GWK K  S AGKE+L+K+++QAIP Y+M CF
Sbjct: 683  KYLGLPTIAGKGRKQLFQHLKDKLWKHISGWKEKLLSRAGKEILMKAVLQAIPTYSMSCF 742

Query: 423  RLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAK 482
            R+P+GL KE++  MARFWW+ ++++R IHW+ W+ LC  KF GGLGFR++E FN+ALLAK
Sbjct: 743  RIPKGLCKELNGIMARFWWAKAKDKRGIHWVKWELLCKSKFAGGLGFRDLEAFNQALLAK 802

Query: 483  QCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGN 542
            QCWR+L+ P SL+  + + R  P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+GN
Sbjct: 803  QCWRILRTPESLVARIFRARYHPSVPFLEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGN 862

Query: 543  GRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEAIL 602
            G +  +Y   WLP     +I S   L  ++ V +LFT+SG W+V LL+ IF   + +A L
Sbjct: 863  GVSIQVYTDKWLPAPSFFKIMSPPQLPLSTLVCDLFTSSGQWNVPLLKDIFWDQEVDAKL 922

Query: 603  RIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRM---WWSSLW 662
            +IPL   +G + LIWH+E++  +SVKSGYRLA     +D+       RV +   +W  +W
Sbjct: 923  QIPLASLAGHDCLIWHYERNGMYSVKSGYRLA--CLEKDKMSGEPSVRVDLNSKFWKKIW 982

Query: 663  RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSM 722
             L +PNK +FFLWR   D LP    L  R +  +P+C  C   AE  LH  W C   K +
Sbjct: 983  ALKIPNKIKFFLWRCAWDFLPCGQILFNRKIAPTPICPNCHRKAESVLHAVWLCETAKEV 1042

Query: 723  WLGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQSDGR 782
            W  S +    + +    F E+  A++   +G +  L     W +WN RN+    G+S+  
Sbjct: 1043 WRNSAWGNVCEEWRVNSFRELWHALQLSSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETA 1102

Query: 783  DLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDASVRPDTGE 842
                +    L+                QS  Q     WRP P    K+N+D +V+     
Sbjct: 1103 TQLLHRMTKLAQEFSNANNLSHTIHGRQSSPQAPLHGWRPPPAGIYKINVDGAVKSGDSV 1162

Query: 843  AGGGCVLRGAEGEVFMAACL-SLQRCWSVDLDEGWVVYRGIQLARQLGFVDFVVETDS 897
             G G V+R A GE FMAAC+  +Q  +     E      G++ A  +GF   V+E D+
Sbjct: 1163 RGVGVVVRNANGE-FMAACVRRIQASYGARQTELMATIEGLRFAIDMGFTAAVLEMDA 1215

BLAST of Lag0018797 vs. ExPASy TrEMBL
Match: M5W5F3 (Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa026368mg PE=4 SV=1)

HSP 1 Score: 747.7 bits (1929), Expect = 5.9e-212
Identity = 376/906 (41.50%), Postives = 535/906 (59.05%), Query Frame = 0

Query: 3   DYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGP 62
           DYF+ LFSSSG   Q  E  L ++ P +   MN  LL+ FT+EE+   L Q  P  APG 
Sbjct: 94  DYFKTLFSSSGG--QQMERILNEVRPVITSAMNAQLLQAFTREELEHTLFQMFPTKAPGH 153

Query: 63  DGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPIS 122
           DG+   F++ +W IVG  + + CL +LN   S    N T+I LIPK+     + +FRPIS
Sbjct: 154 DGMPALFFQKYWHIVGDKVAKKCLQILNGEGSVREFNHTLIALIPKVKMPTIVSEFRPIS 213

Query: 123 LCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRA 182
           LC   YK+I+K + NR+K +L  +I++ QSAF+P+R ++DN +  FE ++ ++   +GR 
Sbjct: 214 LCTTVYKMIAKTIANRLKTVLSHVITETQSAFVPNRMILDNVMAAFEIMNTIKGVKKGRD 273

Query: 183 KWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVV 242
               LKLDM+KAYDRVEW FLR +ML+LGF+  WV  ++ C+S+ +FS    G  VG ++
Sbjct: 274 VQMALKLDMAKAYDRVEWVFLRAMMLKLGFSATWVSKVMDCISTTTFSVLWKGTPVGHIM 333

Query: 243 PSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLF 302
           P RGLRQG PLSPYLFL+C EG S LL GAE R  + G +V R +PS++HL FADD++LF
Sbjct: 334 PQRGLRQGCPLSPYLFLICTEGFSCLLRGAERRGDLVGVQVARGAPSVTHLLFADDSILF 393

Query: 303 FRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQ 362
            +A   + + +  L + YE  +GQ INY KS ++ SPN        I  VL+V    CH+
Sbjct: 394 MKATNKDCMALETLFQTYEEVTGQQINYSKSALSLSPNATRADFDMIEGVLNVPVVRCHE 453

Query: 363 QYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCF 422
            YLGLP+   + R +  + +KD++W+ I GWK K  S AGKE+L+K+++QAIP Y+M CF
Sbjct: 454 NYLGLPTIAGKGRKQLFQHLKDKLWKHISGWKEKLLSRAGKEILIKAVLQAIPTYSMSCF 513

Query: 423 RLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAK 482
           R+P+GL KE++  MARFWW+ ++++R IHW+ W+ LC  KF GGLGFR++E FN+ALLAK
Sbjct: 514 RIPKGLCKELNGIMARFWWAKAKDKRGIHWVKWELLCKSKFAGGLGFRDLEAFNQALLAK 573

Query: 483 QCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGN 542
           QCWR+L+ P SL+  + + R  P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+G+
Sbjct: 574 QCWRILRTPESLVARIFRARYHPSVPFLEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGS 633

Query: 543 GRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEAIL 602
           G +  +Y   WLP     +I S   L  ++ V +LFT+SG W+V LL+ IF   + +AIL
Sbjct: 634 GVSIQVYTDKWLPAPSCFKIMSPPQLPLSTRVCDLFTSSGQWNVPLLKDIFWDQEVDAIL 693

Query: 603 RIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRM---WWSSLW 662
           +IPL   +G + LIWH+E++  +SVKSGYRLA     +D+       RV +   +W  +W
Sbjct: 694 QIPLASLAGHDCLIWHYERNGMYSVKSGYRLAG--LEKDKMSGEPSARVDLNSKFWKKIW 753

Query: 663 RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSM 722
            L +PNK +FFLWR   D LP    L  R +  +P+C  C   AE  LH  W C   K +
Sbjct: 754 ALKIPNKIKFFLWRCAWDFLPCGQILFNRKIAPTPICPKCHRKAESVLHAVWLCEAAKEV 813

Query: 723 WLGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQSDG- 782
           W  S +    + +    F E+  A++   +G +  L     W +WN RN+    G+S+  
Sbjct: 814 WRNSAWGNVCEVWRVNSFRELWHALQLSSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETA 873

Query: 783 -------RDLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDA 842
                    L    SD  +  H   GR        QS  Q     WRP P          
Sbjct: 874 IQLLSRMTKLAQEFSDANNILHTIHGR--------QSSPQAPLQGWRPPP---------- 933

Query: 843 SVRPDTGEAGGGCVLRGAEGEVFMAACL-SLQRCWSVDLDEGWVVYRGIQLARQLGFVDF 897
           +V+      G G V+R A GE FMAAC+  +   +     E      G++ A  +GF D 
Sbjct: 934 AVKSGDSVRGVGVVVRNANGE-FMAACVRRIHASYGARQTELMATIEGLRFAIDMGFTDA 976

BLAST of Lag0018797 vs. ExPASy TrEMBL
Match: A0A251NPF0 (Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRUPE_6G123900 PE=4 SV=1)

HSP 1 Score: 747.7 bits (1929), Expect = 5.9e-212
Identity = 376/906 (41.50%), Postives = 535/906 (59.05%), Query Frame = 0

Query: 3   DYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGP 62
           DYF+ LFSSSG   Q  E  L ++ P +   MN  LL+ FT+EE+   L Q  P  APG 
Sbjct: 57  DYFKTLFSSSGG--QQMERILNEVRPVITSAMNAQLLQAFTREELEHTLFQMFPTKAPGH 116

Query: 63  DGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPIS 122
           DG+   F++ +W IVG  + + CL +LN   S    N T+I LIPK+     + +FRPIS
Sbjct: 117 DGMPALFFQKYWHIVGDKVAKKCLQILNGEGSVREFNHTLIALIPKVKMPTIVSEFRPIS 176

Query: 123 LCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRA 182
           LC   YK+I+K + NR+K +L  +I++ QSAF+P+R ++DN +  FE ++ ++   +GR 
Sbjct: 177 LCTTVYKMIAKTIANRLKTVLSHVITETQSAFVPNRMILDNVMAAFEIMNTIKGVKKGRD 236

Query: 183 KWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVV 242
               LKLDM+KAYDRVEW FLR +ML+LGF+  WV  ++ C+S+ +FS    G  VG ++
Sbjct: 237 VQMALKLDMAKAYDRVEWVFLRAMMLKLGFSATWVSKVMDCISTTTFSVLWKGTPVGHIM 296

Query: 243 PSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLF 302
           P RGLRQG PLSPYLFL+C EG S LL GAE R  + G +V R +PS++HL FADD++LF
Sbjct: 297 PQRGLRQGCPLSPYLFLICTEGFSCLLRGAERRGDLVGVQVARGAPSVTHLLFADDSILF 356

Query: 303 FRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQ 362
            +A   + + +  L + YE  +GQ INY KS ++ SPN        I  VL+V    CH+
Sbjct: 357 MKATNKDCMALETLFQTYEEVTGQQINYSKSALSLSPNATRADFDMIEGVLNVPVVRCHE 416

Query: 363 QYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCF 422
            YLGLP+   + R +  + +KD++W+ I GWK K  S AGKE+L+K+++QAIP Y+M CF
Sbjct: 417 NYLGLPTIAGKGRKQLFQHLKDKLWKHISGWKEKLLSRAGKEILIKAVLQAIPTYSMSCF 476

Query: 423 RLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAK 482
           R+P+GL KE++  MARFWW+ ++++R IHW+ W+ LC  KF GGLGFR++E FN+ALLAK
Sbjct: 477 RIPKGLCKELNGIMARFWWAKAKDKRGIHWVKWELLCKSKFAGGLGFRDLEAFNQALLAK 536

Query: 483 QCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGN 542
           QCWR+L+ P SL+  + + R  P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+G+
Sbjct: 537 QCWRILRTPESLVARIFRARYHPSVPFLEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGS 596

Query: 543 GRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEAIL 602
           G +  +Y   WLP     +I S   L  ++ V +LFT+SG W+V LL+ IF   + +AIL
Sbjct: 597 GVSIQVYTDKWLPAPSCFKIMSPPQLPLSTRVCDLFTSSGQWNVPLLKDIFWDQEVDAIL 656

Query: 603 RIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRM---WWSSLW 662
           +IPL   +G + LIWH+E++  +SVKSGYRLA     +D+       RV +   +W  +W
Sbjct: 657 QIPLASLAGHDCLIWHYERNGMYSVKSGYRLAG--LEKDKMSGEPSARVDLNSKFWKKIW 716

Query: 663 RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSM 722
            L +PNK +FFLWR   D LP    L  R +  +P+C  C   AE  LH  W C   K +
Sbjct: 717 ALKIPNKIKFFLWRCAWDFLPCGQILFNRKIAPTPICPKCHRKAESVLHAVWLCEAAKEV 776

Query: 723 WLGSKFSLFHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQSDG- 782
           W  S +    + +    F E+  A++   +G +  L     W +WN RN+    G+S+  
Sbjct: 777 WRNSAWGNVCEVWRVNSFRELWHALQLSSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETA 836

Query: 783 -------RDLWAYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDA 842
                    L    SD  +  H   GR        QS  Q     WRP P          
Sbjct: 837 IQLLSRMTKLAQEFSDANNILHTIHGR--------QSSPQAPLQGWRPPP---------- 896

Query: 843 SVRPDTGEAGGGCVLRGAEGEVFMAACL-SLQRCWSVDLDEGWVVYRGIQLARQLGFVDF 897
           +V+      G G V+R A GE FMAAC+  +   +     E      G++ A  +GF D 
Sbjct: 897 AVKSGDSVRGVGVVVRNANGE-FMAACVRRIHASYGARQTELMATIEGLRFAIDMGFTDA 939

BLAST of Lag0018797 vs. ExPASy TrEMBL
Match: A0A2N9IP69 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS54170 PE=4 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 4.6e-209
Identity = 373/911 (40.94%), Postives = 536/911 (58.84%), Query Frame = 0

Query: 3    DYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAPGP 62
            +Y+Q LF+++   + D EV L  +  SV +EMNQTL  PFT+ E+  A+KQ  P  APGP
Sbjct: 241  EYYQSLFTAA--PLVDAEVVLDGINRSVSEEMNQTLTSPFTEAEITAAIKQMAPLKAPGP 300

Query: 63   DGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRPIS 122
            DG+   FY+ +W ++G D+IQ+ L+ LN G    ++N T + LIPK+    ++ D+RPIS
Sbjct: 301  DGMPPVFYQSYWHVIGTDVIQAVLSSLNSGTLLPSINHTFVTLIPKVKNPEQVTDYRPIS 360

Query: 123  LCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRA 182
            LCNV Y+LISKVL NR K +LP +IS+ QSAF+P R + DN ++ FE +H +  +  G+ 
Sbjct: 361  LCNVIYELISKVLANRFKKVLPYIISETQSAFVPGRLITDNILIAFETLHYMNNQRSGKV 420

Query: 183  KWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVV 242
                LKLDMSKAYDRVEWAFL++VML++GF   WV LI+ C+S+VS+S  +NGE  G ++
Sbjct: 421  GSMALKLDMSKAYDRVEWAFLKQVMLKMGFHSHWVSLIMECISTVSYSLLINGEPTGNII 480

Query: 243  PSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNLLF 302
            PSRGLRQGDP+SPYLFLLCAEGL+ LL  A  +  I G  + R  P ++HLFFADD+LLF
Sbjct: 481  PSRGLRQGDPISPYLFLLCAEGLNGLLNKAASKGDIHGVSICRRGPKLTHLFFADDSLLF 540

Query: 303  FRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQ 362
             RA  +E   I+E+L+ YER SGQ +N  K+ + FS NT +  Q  I  +L V     ++
Sbjct: 541  CRATQAECGKIQEVLQVYERVSGQQLNKAKTTLFFSRNTPQATQDDIKDILGVPSIQQYE 600

Query: 363  QYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCF 422
            +YL LPS + + +      +K+R+W +++GWK K  S AG+E+L+K++VQAIP YTM+CF
Sbjct: 601  KYLRLPSLVGKKKISCFAQIKERVWSKVKGWKEKLLSQAGREILIKAVVQAIPSYTMNCF 660

Query: 423  RLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAK 482
            +LP GL K+I   + RFWW   E  R+IHW+ W+ LC PK +GGLGFR ++ FN ALLAK
Sbjct: 661  KLPVGLCKDIEAIIRRFWWGEKENNRKIHWIRWEKLCQPKGVGGLGFRELQNFNLALLAK 720

Query: 483  QCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGN 542
            Q WR++   +SLL  V   + FP    +EA   +R SF WRS+L  ++L++ G  WR+G+
Sbjct: 721  QFWRLMHCKNSLLFKVFSAKFFPSGNIMEASTNNRGSFAWRSILKAKDLIIAGSSWRVGD 780

Query: 543  GRATPIYGSNWLPNEFSLQIQS--ASVLSPASTVSELFTASGGWDVALLRTIFNGADCEA 602
            G+  PI  +NWL  E   ++ S   ++ + A     + ++   WD A +R++F   D +A
Sbjct: 781  GKQIPIKDTNWLLEEGHRRVISPLPNLHADAKVADLIQSSPPAWDEAKIRSLFLPYDSDA 840

Query: 603  ILRIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRMWWSSLWR 662
            IL+IPL      ++L WH   +  +SV+SGY+L         P SSN       W  +W 
Sbjct: 841  ILQIPLSDRCPSDKLYWHATTNGKYSVRSGYQLLLRERMISNPNSSNQGEPNTLWKQIWS 900

Query: 663  LNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSMW 722
            L  P K + F+WR C + LPTK  L +R +  SP C  C   AEDCLH   +CPV+  +W
Sbjct: 901  LRTPTKVKSFMWRACQEALPTKAGLFRRKVIPSPGCDNCGTGAEDCLHALLSCPVISQVW 960

Query: 723  LGSKFSLFH--QSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSG-GQSD 782
              S     H  Q  S+  F +++  +  K T L  E   +  W +W+ RN  +     +D
Sbjct: 961  --SLVPALHEAQQKSYTSFYDLVHHVALKPTDLILEKFAVLSWFIWHKRNQAWLRLPSTD 1020

Query: 783  GRDLW----AYSSDYLSAFHVGGGRCGTRDSWAQSIEQEERGVWRPHPNRELKLNIDASV 842
               LW    AY +++L A  +        D   +      R  W P  +   K+N D ++
Sbjct: 1021 YNQLWTNAHAYLNEFLEATQI--------DKTVKPAPPLVR--WSPPMHNGFKVNFDGAL 1080

Query: 843  RPDTGEAGGGCVLRGAEGEVFMAACLSLQRCWSVDLDEGWVVYRGIQLARQLGFVDFVVE 902
              D  E G G V+R   G V       ++    VDL E     R I  A ++G  D   E
Sbjct: 1081 FKDKNEGGIGVVIRDCSGLVIATLSQRVKTGALVDLIEALAAKRAITFAMEVGVTDVEFE 1137

Query: 903  TDSLRLVKILN 905
             DS  +++ L+
Sbjct: 1141 GDSENVIQDLS 1137

BLAST of Lag0018797 vs. ExPASy TrEMBL
Match: A0A803NGJ4 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 7.9e-209
Identity = 377/927 (40.67%), Postives = 537/927 (57.93%), Query Frame = 0

Query: 1    MTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQTLLRPFTKEEVLLALKQTHPNNAP 60
            ++ YF  LF +S    Q  +  L  +  +V  EMN +L +P+T +EV+ AL+   P+ +P
Sbjct: 278  ISSYFAALFKASPVDPQALQTTLNTIPTTVTAEMNNSLTQPYTSQEVITALRLMSPDKSP 337

Query: 61   GPDGLSGSFYKYHWDIVGPDIIQSCLTVLNHGCSPGAVNDTMIVLIPKINAARRMVDFRP 120
            G DG+S  FY+ +WDIVG D+ +  L VLN G S  ++N ++I LIPKI     M  FRP
Sbjct: 338  GSDGMSAMFYQQYWDIVGNDVTKVVLAVLNEGYSMDSINRSLITLIPKIKLPSDMNAFRP 397

Query: 121  ISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRG 180
            ISLCNV YKLISKVL NR K +LP +IS+NQSAF  +R + DN ++ FE IH L+ ++ G
Sbjct: 398  ISLCNVIYKLISKVLANRFKEVLPSVISENQSAFFANRLITDNILVAFELIHHLKHKTHG 457

Query: 181  RAKWATLKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQ 240
               ++ LKLDMSKA+DRVEW ++ EVM ++GF   W++ I  C++S SFSF LNGE VG 
Sbjct: 458  SKGFSALKLDMSKAFDRVEWVYICEVMRKMGFHTTWIETIHNCLNSTSFSFMLNGEAVGN 517

Query: 241  VVPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSISHLFFADDNL 300
            V P+RGLRQGDPLSPYLFL+C+EGLS LL   E    + G R+ R SPSISHL FADD+L
Sbjct: 518  VQPTRGLRQGDPLSPYLFLICSEGLSRLLQYEESLGNLKGLRLTRHSPSISHLLFADDSL 577

Query: 301  LFFRAIGSEALVIRELLERYERASGQTINYDKSVVAFSPNTGEEAQQYISQVLSVSRCPC 360
            LF  A  S A  ++ +L+ Y RAS Q +N +KSV++FSPNT + A+     +L +    C
Sbjct: 578  LFCEATQSSASALKRILDIYHRASDQLLNNNKSVMSFSPNTTQAAKDLFHHILGMPIAEC 637

Query: 361  HQQYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMD 420
            H++YLGLP++  R++      VK+RIW+++  W  K FS+ GKEVLLK++VQ+IP Y M 
Sbjct: 638  HERYLGLPAYASRDKKEMFSDVKERIWQKLHAWNEKLFSVGGKEVLLKAVVQSIPTYAMS 697

Query: 421  CFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALL 480
            CFRLP     ++   +A FWW  +++  +IHW  W  LC  KF GG+GF +   FN+ALL
Sbjct: 698  CFRLPITFCNQLESMVANFWWGANKDGSKIHWKRWKLLCKSKFEGGMGFCSFVHFNQALL 757

Query: 481  AKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRI 540
            AKQ WR+ + P+SLL  +LK R F  + FLEA +G  PS  W+ + WG+ELL+ G R++I
Sbjct: 758  AKQAWRIFEYPNSLLSRLLKHRYFSNNSFLEARLGHSPSLTWQGIHWGRELLIEGLRFKI 817

Query: 541  GNGRATPIYGSNWLPNEFSLQIQSASVLSPAS-TVSELFTASGGWDVALLRTIFNGADCE 600
            GNG         W+P  +S   Q  S   P+S TV+ L T S  W++ LL   F   D +
Sbjct: 818  GNGHNVQARIDKWIPGHYS--FQPISFNGPSSLTVAALITESREWNINLLHQYFQPIDID 877

Query: 601  AILRIPLRQGSGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRMWWSSLW 660
             IL IPL      +RLIWH      ++V SG+ LA  +       +SN      WW S W
Sbjct: 878  KILSIPLSFFPTPDRLIWHHTTTRIYTVNSGFHLACNIEESMNTSASNSHSA--WWKSFW 937

Query: 661  RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTCPVVKSM 720
            +LN+P K + F W++  + LP    L KR +  S  C +C    E   H  +TC   +S+
Sbjct: 938  QLNLPPKIKIFAWKVMQNALPVAATLHKRKVIDSASCSMCTTAWESIGHALFTCHSARSV 997

Query: 721  WLGSKFSL-FHQSFSHFRFEEIIGAMREKLTGLDFELMVIFWWSVWNLRNNMFSGGQS-D 780
            W  S+FS+ FH + + +  + +I  +  + +  DFEL++   W++W  RN +  GGQ  +
Sbjct: 998  WKKSRFSIDFHNARNMYNGDYLI-HLSNQYSKPDFELLICIMWAIWGERNKVLHGGQKRE 1057

Query: 781  GRDLWAYSSDYLSAF--------------HVGGGRCGTRDSWAQSIEQEERGVWRPHPNR 840
            G   + ++++YL  +                  G         +  +Q     WRP    
Sbjct: 1058 GLHTFIFANNYLDKYKQATDAPNQSSLPLSSSNGHVQQTTQTPEPAQQPFGAQWRPPDPL 1117

Query: 841  ELKLNIDASVRPDTGEAGGGCVLRGAEGEVFMAACLSLQRCWSVDLDEGWVVYRGIQLAR 900
             LKLN+DA++  D    G G V+R  +G+V  A    +Q C+  D  E   ++  I    
Sbjct: 1118 GLKLNVDAALHSDKKILGVGAVVRNHQGQVIAAFSKPVQGCFRSDEMEAKALFHSINWVM 1177

Query: 901  QLGFVDFVVETDSLRLVKILNGELHDV 911
            Q       +ETD+LR+   LN    D+
Sbjct: 1178 QQQLPITHIETDALRVSMALNSSSIDL 1199

BLAST of Lag0018797 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 227.3 bits (578), Expect = 5.1e-59
Identity = 163/524 (31.11%), Postives = 250/524 (47.71%), Query Frame = 0

Query: 413 AIPCYTMDCFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNM 472
           A+P YTM CF LP+ + K+I   +A FWW   +E + +HW +WD L   K  GG+GF+++
Sbjct: 2   ALPTYTMACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDI 61

Query: 473 ELFNEALLAKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELL 532
           E FN ALL KQ WR+L  P SL+  V K R F +S  L A +GSRPSFVW+S+   QE+L
Sbjct: 62  EAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEIL 121

Query: 533 VRGCRWRIGNGRATPIYGSNWL---PNEFSLQIQSASVLSPAST-----VSELFTASG-G 592
            +G R  +GNG    I+   WL   P   +L++Q       AS      VS+L   SG  
Sbjct: 122 RQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGRE 181

Query: 593 WDVALLRTIFNGADCEAILRIPLRQGSGE--NRLIWHFEKHENFSVKSGY-RLAHTLATQ 652
           W   ++  +F   + E  L   LR G     +   W +    +++VKSGY  L   +  +
Sbjct: 182 WRKDVIEMLF--PEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKR 241

Query: 653 DRPGSSNFERVRMWWSSLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCD 712
             P   +   +   +  +W+     K + FLW+   + LP    L  R L+    C+ C 
Sbjct: 242 SSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACIRCP 301

Query: 713 DDAEDCLHLFWTCPVVKSMW--------LGSKF--SLFHQSFSHFRFEEIIGAMREKLTG 772
              E   HL + C   +  W        LG ++  S++   +  F          EK + 
Sbjct: 302 SCKETVNHLLFKCTFARLTWAISSIPIPLGGEWADSIYVNLYWVFNLGN-GNPQWEKAS- 361

Query: 773 LDFELMVIFWWSVWNLRNNM-FSGGQSDGRDLWAYSSDYLSAFHV--GGGRCGTRDSWAQ 832
              +L+    W +W  RN + F G + + +++   + D L  + +      CGT+     
Sbjct: 362 ---QLVPWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTK----P 421

Query: 833 SIEQEERGVWRPHPNRELKLNIDASVRPDTGEAGGGCVLRGAEGEVFMAACLSLQRCWSV 892
            + +   G WRP P++ +K N DA+   D    G G VLR  +GEV      +L +  SV
Sbjct: 422 QVNRSSCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLKSV 481

Query: 893 ---DLDE-GWVVYRGIQLAR-QLGFVDFVVETDSLRLVKILNGE 907
              +L+   W V   + L+R Q  +V F  E+DS  L++ILN +
Sbjct: 482 LEAELEAMRWAV---LSLSRFQYNYVIF--ESDSQVLIEILNND 509

BLAST of Lag0018797 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 150.6 bits (379), Expect = 6.1e-36
Identity = 67/149 (44.97%), Postives = 96/149 (64.43%), Query Frame = 0

Query: 413 AIPCYTMDCFRLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPK-FLGGLGFRN 472
           A+P Y M CFRL + L K++   M  FWWS  E +R+I W++W  LC  K   GGLGFR+
Sbjct: 2   ALPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRD 61

Query: 473 MELFNEALLAKQCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQEL 532
           +  FN+ALLAKQ +R++  P +LL  +L+ R FP S  +E  +G+RPS+ WRS++ G+EL
Sbjct: 62  LGWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGREL 121

Query: 533 LVRGCRWRIGNGRATPIYGSNWLPNEFSL 561
           L RG    IG+G  T ++   W+ +E  L
Sbjct: 122 LSRGLLRTIGDGIHTKVWLDRWIMDETPL 150

BLAST of Lag0018797 vs. TAIR 10
Match: AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 98.2 bits (243), Expect = 3.6e-20
Identity = 89/361 (24.65%), Postives = 140/361 (38.78%), Query Frame = 0

Query: 363 QYLGLPSFMPRNRSRALKFVKDRIWRQIQGWKGKFFSMAGKEVLLKSIVQAIPCYTMDCF 422
           +YLGLP    +  +     + ++I  +I  W  +  S AG+  L+ S++ ++  + M  F
Sbjct: 26  RYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSLTNFWMSAF 85

Query: 423 RLPRGLIKEIHRTMARFWWSGSEEERQIHWLSWDSLCLPKFLGGLGFRNMELFNEALLAK 482
           RLP   IKEI    + F WSG E   +   ++W  +C PK  GGLG R+++  N      
Sbjct: 86  RLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKEAN------ 145

Query: 483 QCWRVLQDPSSLLGSVLKGRDFPQSGFLEAGIGSRPSFVWRSLLWGQELLVRGCRWRIGN 542
                            KG  +  SG    G     S++W+ +L  + L     +  I N
Sbjct: 146 -----------------KGSFWSISGNTTLG-----SWMWKKILKHRALASGFVKHDIHN 205

Query: 543 GRATPIYGSNWLPNEFSLQIQSASVLSPASTVSELFTASGGWDVALLRTIFNGADCEAIL 602
           G  T  +  NW     S   +   V      +    T       A++         + +L
Sbjct: 206 GSNTSFWFDNW-----SKIGRLIDVTGHRGCIDMGITLHASVAEAVVNHRPRRHRHDTLL 265

Query: 603 RIP------LRQG--SGENRLIWHFEKHENFSVKSGYRLAHTLATQDRPGSSNFERVRM- 662
           RI         QG  SGE+ + W   K      K  +    T A    P      ++++ 
Sbjct: 266 RIEDVIAEVRHQGLTSGEDTVRW---KGNGDIFKPCFNTKETWAATREP------KLKVN 325

Query: 663 WWSSLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLTVSPLCVLCDDDAEDCLHLFWTC 715
           W+  +W  +   K+    W    +RL T   +L         CVLC    E   HLF+TC
Sbjct: 326 WYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHHLVETRDHLFFTC 344

BLAST of Lag0018797 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 85.9 bits (211), Expect = 1.8e-16
Identity = 40/68 (58.82%), Postives = 48/68 (70.59%), Query Frame = 0

Query: 231 FSLNGEKVGQVVPSRGLRQGDPLSPYLFLLCAEGLSSLLCGAEWRSQITGFRVGRSSPSI 290
           F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L   A+ + ++ G RV  +SP I
Sbjct: 12  FIINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRI 71

Query: 291 SHLFFADD 299
           +HL FADD
Sbjct: 72  NHLLFADD 79

BLAST of Lag0018797 vs. TAIR 10
Match: AT4G20520.1 (RNA binding;RNA-directed DNA polymerases )

HSP 1 Score: 78.2 bits (191), Expect = 3.8e-14
Identity = 46/145 (31.72%), Postives = 73/145 (50.34%), Query Frame = 0

Query: 135 LVNRMKYILPQLISQNQSAFIPSRCVVDNAILGFECIHELRRRSRGRAKWATLKLDMSKA 194
           +V R+K ++  LI   Q++FIP R   DN +   E +H +RR+ +G   W  LKLD+ KA
Sbjct: 1   MVERLKPLMTNLIGPAQASFIPGRVSTDNIVFVQEAVHSMRRK-KGVKGWMLLKLDLEKA 60

Query: 195 YDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFSLNGEKVGQVVPSR--------- 254
           YDR+ W +L + ++  GF + W+  I R     +F       +VG+   S+         
Sbjct: 61  YDRIRWDYLEDTLISAGFPEVWLPEIARS----TFGARRVAPEVGRADASKRPRVSDHRW 120

Query: 255 GLRQGDPLSPYL--FLLCAEGLSSL 269
           G R  D  +P+    + CAE L  +
Sbjct: 121 GFRYDDMAAPFTSNSVACAELLREI 140

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
VVA32947.12.3e-21842.20PREDICTED: retrotransposon [Prunus dulcis][more]
ONI01138.11.2e-21141.50hypothetical protein PRUPE_6G123900 [Prunus persica][more]
XP_023909336.11.5e-20941.87uncharacterized protein LOC112020997 [Quercus suber][more]
XP_030508852.17.3e-20941.49uncharacterized protein LOC115723496 [Cannabis sativa][more]
XP_030939975.11.1e-20441.07uncharacterized protein LOC115964883 [Quercus lobata][more]
Match NameE-valueIdentityDescription
P0C2F62.4e-4525.72Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... [more]
P143811.0e-4027.44Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV... [more]
P113691.6e-3623.94LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE... [more]
P932958.6e-3544.97Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 ... [more]
P164231.8e-1626.65Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM OS... [more]
Match NameE-valueIdentityDescription
A0A5E4FZN91.1e-21842.20PREDICTED: retrotransposon OS=Prunus dulcis OX=3755 GN=ALMOND_2B007697 PE=4 SV=1[more]
M5W5F35.9e-21241.50Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=... [more]
A0A251NPF05.9e-21241.50Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRU... [more]
A0A2N9IP694.6e-20940.94Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A803NGJ47.9e-20940.67Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G29090.15.1e-5931.11Ribonuclease H-like superfamily protein [more]
ATMG00310.16.1e-3644.97RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT3G24255.13.6e-2024.65RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
ATMG01250.11.8e-1658.82RNA-directed DNA polymerase (reverse transcriptase) [more]
AT4G20520.13.8e-1431.72RNA binding;RNA-directed DNA polymerases [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 625..720
e-value: 1.3E-22
score: 80.3
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 828..906
e-value: 2.4E-11
score: 43.5
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 106..338
e-value: 5.5E-38
score: 130.7
IPR000477Reverse transcriptase domainPROSITEPS50878RT_POLcoord: 87..353
score: 16.275757
NoneNo IPR availablePANTHERPTHR19446REVERSE TRANSCRIPTASEScoord: 3..835
NoneNo IPR availablePANTHERPTHR19446:SF440SUBFAMILY NOT NAMEDcoord: 3..835
NoneNo IPR availableCDDcd01650RT_nLTR_likecoord: 103..368
e-value: 1.99526E-48
score: 169.394
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 827..909
e-value: 3.59554E-10
score: 56.5536
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 825..905

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0018797.1Lag0018797.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity