Lag0018493 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0018493
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionReverse transcriptase domain-containing protein
Locationchr5: 28291429 .. 28295633 (-)
RNA-Seq ExpressionLag0018493
SyntenyLag0018493
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGCTTCCGCTGCGTGGGGGCTTTTATTCCCTTCACGAAGGCCCTTAAAGAGTGGGGTTACAAAAAAAACAGAGTTCGATGGGATAATATTCGCCAAGTTAAGGACAAGATTAAGGTAGCTTATGATAGACCTACGCTAATTGATTTCACTATTGTGCATCGGCTTGAGAGTCACCTCAATAAGTTATTGTTGGACGAGGAGATATATTGGAAATAACGCTCGCGTGAGAATTGGCTCAAATGGGGAGACCGTAACACTAAATGGTTCCATCACCGTGCCTCTTATAGACATAAAAGAAACACAATCAGTGGTGTGGAAGGCGCAGATGGGTTGTGGCTTACAGAGGCGGACCAGATTCACAAGGCCTTTGAATCGTATTTTAAAGACATTTTCAATTCCCAATCTCCTTCTGCCACGGATTTCGAAAAAGTGTTGAAGTTTATTCCTCGAAAAGTCACACCAGATATGAGTCACACGCTTACTCAAGATTACTCGCGCGAGGAGGTGGAAGGAGTGGTTCGAAAGTTTTATCCTACAAAAGCGCCAGGTCCAGATGGTTTCCCTACCTTATTCTACCAAAAATATTGGGATATGGTAGGACCTCAAACGGTGGATGAATGTTTAGCAATTCTGAATCGTAAGCGTTCAGTCAAGGATTGGAACCATACCAATATTGTGCTCATCCCAAAGGTTCCAAACCCACGGTCAGTAACTGATTTTCGACCTATTAGCCTGTGTAACAGTTGTTATAAGATTGTTACTAAAGTGATTGCCAACCGACTGAAAAATGTGCTTATTTTGTAATTGATGAGTGTCAGTCGGCGTTTCTTCCTGGTAGATCTATTTCAGATAATATGATTGTGGGCCATGAATTGTTGCACTTTATTAATACTAGGAGGAAGGGGAAGCAGGGCTATGCGGCTCTTAAGCTTGATATAAGTAAGACCTATGATAGAGTAGAATGGTCTTTTATTAGGGCGGTTATGGAAAGATTGGGTTTCCCATGCGACTGGATTAATCTGATTAATGATTGCATTTCTACTGCTTCTTTTTCTATTATTATTAATGGGGAGGCTAAAGGGCATTTTTACCCGTCAAGAGGTTTGAGACAAGGAGACCCTTTGTCCCCGTACTTGTTTTTGCTATGCTCAGAGGGTTTGTCTGCTATTTTGGGTACTGCCAAGAGGAATGGTCTCATGGGGATTGCTATGACCCCGTCATCACCAAAAATCTTCCATCTATTTTTTGCAGACGATAGTCTCATCTTTCTAAAGGCCTCAACGGAGGAATTTGGTCATTTAAAGATTATCATGGCTGACTATGAATGTGCGTCGGGTCAAAGCATTAATGTGGATAAATCTCAAATATGTTTCTTCAGGAATGTGCCAAGCGATACTCAGTCTTACCTTAGCTCAATTTTGCAAATGAAGTCTGCGGACAATTTGGGATCTTACCTTGGCTTGCCTTCGTCTTTCCACCGTAGTCGGAGCAAGGATTTCAAGGGTATTCTTGATCGTGTATGGTCATATCTTCAAGGGTGGAAAAAGAATTCTTATCAAGGGTGTGATTCAGGCCATACCGACTTATGCCATGAATTGTTTCAGGCTACCGAAAGGTCTTCTAGACAGTATTTCTTCATTATGTGCGAGGTTTTGGTGGGGCTCTTCTGATACGAAAAAGCGTATTCATTGGAAGAAGTGGAAGGAGTTGTGTAAGCCAGAAGAGCAGGGCGGTCTGAATTTTTGGGACTTAGAGATATTCATTCAAGCAATGCTCGCAAAACAAGCCTGGCGTGTCCTGACCCTTCCAGAGTCAACGGTGGCAAGAGTTCTTAAAGGGAGGTATTTTCCATCCTCAGACGTTTTAGAATCAGAGGTACGCTCAAACTCTTCTTATTTTTGGAAAGGTTTTATATGGGGATTGGATTTGTTGAAATCTGGAATTAGGAAAAAAATTGGTAATGGAAATTCTGTTCGAGTAATGGTGGACCCTTGGATTCCTCGTCCTTATACTTTTAAAGTGCTTGGTTACAAGATATTTGATCCAGAGTTGACAGTAGTTCTTAAAGGGATTGGATTTTCCATCCTCAGACGTTTTAGAATCAGAGGTACGCTCAAACTCTTCTTATTTTTGGAAAGGTTTTATATGAGGATTGGATTTGTTGAAATCTGGAATTAGGAAAAAAATTGGTAATGGAAATTCTGTTCGAGTAATGGTGGACCCTTGGATTCCTCGTCCTTATACTTTTAAAGTGCTTGGTTACAAGATATTTGATCCAGAGTTGACAGTAGTGGATTGTATTCTGCCGTCTATTCAATGGGATATTCCGAAACTCCAACATGTTCTTTTGGATGAGGATGTTCAAGAGATTATAAGACTCCCAGCTAGCGAGACAACTCCGGATAGATGGATTTGGCATTTTGATAAATTTGGAGGTTATATGGTTAAGAGTGGGTACAAATTGGGTATGTATCAGAGAATAGAGGAGTCACCTTCGGACACTGATATCAGTTCCAAGTGGTGGAAGAGACTTTGGTCGACTTTGGGGTATAGTGATATGGTGAGGGCAGAGTTCATTATGAATATTCAGGACCGGTGGATACATATCTGCAATACTGTTTCAATATTGGATCTGGAAAGGATCTGTTTGGGATCTTGGGCTCTATGGAACGATCGGAATTGCGTGTTTCATAAGCGGCCAATTCCTCCGGTGGGGGTCCGGTGTGATTGGACTTTGGATTATCTTTCTGAATACCAATCAGCTCACCGGTCCAACGATCGAATATTCCAAACGAGGGATATGGTCTCTCAGATGATTTCAGGTGGGGAGGATTTTATTCTAAATGTCGACGCAGTGTGGTCCAAACATACCACGACCAGTGGAGTGAGGGTAGTATTACACACAAAGTCGGGTAAGTTGGTGGCTATTTTACAAAAAGGGATTCCTTTACCTTCTTCTCCATTATGTGTTGAGGTGATTGCGATACTTGAAGGTCTTAATATGACTTCTTCTTTGAGGATTAGTAAGGTAACGGTATGCTCGAATTCCCTATCGTTGATTACCATGCTTCGTAATAAAGATCGGTGTCAGGCAGATTGCTTCCCAGTGGTGGCAGATATTCATTGTCTAATTGGTTCTTTCGAAAAGATTTCATACTGTCATATAAAGAGGGAGTATAATTTAATGTCGCATGAGCTAGCTAGGTTAGGTATGGGGTCTTCTACTCGATTTTGGAGTAGGAATTTCCCTCAGTGGGTGTTAGACTTGGCTAGTAGAGAATTGGCATTTTTTGTAGCCCCATGTGGGAATCTTTGTTCTTGAATGAAATATTTGTTATTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCTTGGGATAACAACACAAATGGACCTTTTTATACACCTATATTTCATTAATATCCAAAAAGTATAGATTTTACACTTCTATCCAATGGTTACAGGCTTTAATGACTAAAATAGGAGTTCAATTATCACTAAGCCCTGGACACACCACCTATGGAAAAAAAAATAATAAAAGAAAGAAATTGTTTTCAATTAAGGAAAAGTTGCTCCTTTCCATAACTGTTTGTGTTTTTCACTTTCCTTCCGCTTTCCTTACATGTGCAATTTCAAATTTTCTCCCATATTTTTAATTTGCCTCCTTCCCTTACCATATTATTTGAAGTTTTCTCTACGTATGGCTTGCACTTGCAAACAAAAAATTTAGTGGGTTTTGATTTTTTTTAATTTTTAATGCGTATTGTTATTTTTTTTTTGGGTATCTACAAGAATGGGCTGAAATCAGCCCATATATTTCATTTTTGTCCAAAAAGATTGAAAATTTCATTTCAGTCCAGCCCGTTTTAATTAATTGGAGCCCATTTACCATTTTGACCCTTTGATAAAAAAAAAAAAAACAGAAAAGGTAAAAACGGATCGTATTTTGCATTTGAAAAAATCCCCAAAAACACGCTTCCTTCCGAAAATGAGCACTCTATCGTCTTCTTCCTTTCGACTTGCCATTTTTTGCAGCACGGTCTTCTCCATTAGATTGAGGCCACCATTGTGATAATCCCAGAAAGCGTTGTCTCTGGCGACGACAAGGAAGCTCACAGCGGCACTGCGACTGCGTTGGTCTCCAGTGAAGAAGCTATATCAAATGTTAATCGAGAAACCAAGGAGGATGTTTCTTGA

mRNA sequence

ATGCGCTTCCGCTGCGTGGGGGCTTTTATTCCCTTCACGAAGGCCCTTAAAGAGTGGGGTTACAAAAAAAACAGAGTTCGATGGGATAATATTCGCCAAGTTAAGGACAAGATTAAGGTAGCTTATGATAGACCTACGCTAATTGATTTCACTATTGTGCATCGGCTTGAGAGTCACCTCAATAAACATAAAAGAAACACAATCAGTGGTGTGGAAGGCGCAGATGGGTTGTGGCTTACAGAGGCGGACCAGATTCACAAGGCCTTTGAATCGTATTTTAAAGACATTTTCAATTCCCAATCTCCTTCTGCCACGGATTTCGAAAAAGTGTTGAAGTTTATTCCTCGAAAAGTCACACCAGATATGAGTCACACGCTTACTCAAGATTACTCGCGCGAGGAGGTGGAAGGAGTGGTTCGAAAGTTTTATCCTACAAAAGCGCCAGGTCCAGATGGTTTCCCTACCTTATTCTACCAAAAATATTGGGATATGGTAGGACCTCAAACGGTGGATGAATGTTTAGCAATTCTGAATCGTAAGCGTTCAGTCAAGGATTGGAACCATACCAATATTGTGCTCATCCCAAAGGTTCCAAACCCACGGAGGAAGGGGAAGCAGGGCTATGCGGCTCTTAAGCTTGATATAAGTAAGACCTATGATAGAGTAGAATGGTCTTTTATTAGGGCGGTTATGGAAAGATTGGGTTTCCCATGCGACTGGATTAATCTGATTAATGATTGCATTTCTACTGCTTCTTTTTCTATTATTATTAATGGGGAGGCTAAAGGGCATTTTTACCCGTCAAGAGGTTTGAGACAAGGAGACCCTTTGTCCCCGTACTTGTTTTTGCTATGCTCAGAGGGTTTGTCTGCTATTTTGGGTACTGCCAAGAGGAATGGTCTCATGGGGATTGCTATGACCCCGTCATCACCAAAAATCTTCCATCTATTTTTTGCAGACGATAGTCTCATCTTTCTAAAGGCCTCAACGGAGGAATTTGGTCATTTAAAGATTATCATGGCTGACTATGAATGTGCGTCGGGTCAAAGCATTAATGTGGATAAATCTCAAATATGTTTCTTCAGGAATGTGCCAAGCGATACTCAGTCTTACCTTAGCTCAATTTTGCAAATGAAGTCTGCGGACAATTTGGGATCTTACCTTGGCTTGCCTTCGTCTTTCCACCGTAGTCGGAGCAAGGATTTCAAGGGTATTCTTGATCGTGTATGGCTACCGAAAGGTCTTCTAGACAGTATTTCTTCATTATGTGCGAGGTTTTGGTGGGGCTCTTCTGATACGAAAAAGCGTATTCATTGGAAGAAGTGGAAGGAGTTGTGTAAGCCAGAAGAGCAGGGCGGTCTGAATTTTTGGGACTTAGAGATATTCATTCAAGCAATGCTCGCAAAACAAGCCTGGCGTGTCCTGACCCTTCCAGAGTCAACGGTGGCAAGAGTTCTTAAAGGGAGGTATTTTCCATCCTCAGACGTTTTAGAATCAGAGGTACGCTCAAACTCTTCTTATTTTTGGAAAGGTTTTATATGGGGATTGGATTTGTTGAAATCTGGAATTAGGAAAAAAATTGGTAATGGAAATTCTGTTCGAGTAATGGTGGACCCTTGGATTCCTCGTCCTTATACTTTTAAAGTGCTTGGTTACAAGATATTTGATCCAGAGTTGACAGTAGTTCTTAAAGGGATTGGATTTTCCATCCTCAGACGTTTTAGAATCAGAGTGCTTGGTTACAAGATATTTGATCCAGAGTTGACAGTAGTGGATTGTATTCTGCCGTCTATTCAATGGGATATTCCGAAACTCCAACATGTTCTTTTGGATGAGGATGTTCAAGAGATTATAAGACTCCCAGCTAGCGAGACAACTCCGGATAGATGGATTTGGCATTTTGATAAATTTGGAGGTTATATGGTTAAGAGTGGGTACAAATTGGGTATGTATCAGAGAATAGAGGAGTCACCTTCGGACACTGATATCAGTTCCAAGTGGTGGAAGAGACTTTGGTCGACTTTGGGGTATAGTGATATGGTGAGGGCAGAGTTCATTATGAATATTCAGGACCGGTGGATACATATCTGCAATACTGTTTCAATATTGGATCTGGAAAGGATCTGTTTGGGATCTTGGGCTCTATGGAACGATCGGAATTGCGTGTTTCATAAGCGGCCAATTCCTCCGGTGGGGGTCCGGTGTGATTGGACTTTGGATTATCTTTCTGAATACCAATCAGCTCACCGGTCCAACGATCGAATATTCCAAACGAGGGATATGGTCTCTCAGATGATTTCAGGTGGGGAGGATTTTATTCTAAATGTCGACGCAGTGTGGTCCAAACATACCACGACCAGTGGAGTGAGGGTAGTATTACACACAAAGTCGGGTAAGTTGGTGGCTATTTTACAAAAAGGGATTCCTTTACCTTCTTCTCCATTATGTGTTGAGGTGATTGCGATACTTGAAGGTCTTAATATGACTTCTTCTTTGAGGATTAGTAAGATTGAGGCCACCATTGTGATAATCCCAGAAAGCGTTGTCTCTGGCGACGACAAGGAAGCTCACAGCGGCACTGCGACTGCGTTGGTCTCCAGTGAAGAAGCTATATCAAATGTTAATCGAGAAACCAAGGAGGATGTTTCTTGA

Coding sequence (CDS)

ATGCGCTTCCGCTGCGTGGGGGCTTTTATTCCCTTCACGAAGGCCCTTAAAGAGTGGGGTTACAAAAAAAACAGAGTTCGATGGGATAATATTCGCCAAGTTAAGGACAAGATTAAGGTAGCTTATGATAGACCTACGCTAATTGATTTCACTATTGTGCATCGGCTTGAGAGTCACCTCAATAAACATAAAAGAAACACAATCAGTGGTGTGGAAGGCGCAGATGGGTTGTGGCTTACAGAGGCGGACCAGATTCACAAGGCCTTTGAATCGTATTTTAAAGACATTTTCAATTCCCAATCTCCTTCTGCCACGGATTTCGAAAAAGTGTTGAAGTTTATTCCTCGAAAAGTCACACCAGATATGAGTCACACGCTTACTCAAGATTACTCGCGCGAGGAGGTGGAAGGAGTGGTTCGAAAGTTTTATCCTACAAAAGCGCCAGGTCCAGATGGTTTCCCTACCTTATTCTACCAAAAATATTGGGATATGGTAGGACCTCAAACGGTGGATGAATGTTTAGCAATTCTGAATCGTAAGCGTTCAGTCAAGGATTGGAACCATACCAATATTGTGCTCATCCCAAAGGTTCCAAACCCACGGAGGAAGGGGAAGCAGGGCTATGCGGCTCTTAAGCTTGATATAAGTAAGACCTATGATAGAGTAGAATGGTCTTTTATTAGGGCGGTTATGGAAAGATTGGGTTTCCCATGCGACTGGATTAATCTGATTAATGATTGCATTTCTACTGCTTCTTTTTCTATTATTATTAATGGGGAGGCTAAAGGGCATTTTTACCCGTCAAGAGGTTTGAGACAAGGAGACCCTTTGTCCCCGTACTTGTTTTTGCTATGCTCAGAGGGTTTGTCTGCTATTTTGGGTACTGCCAAGAGGAATGGTCTCATGGGGATTGCTATGACCCCGTCATCACCAAAAATCTTCCATCTATTTTTTGCAGACGATAGTCTCATCTTTCTAAAGGCCTCAACGGAGGAATTTGGTCATTTAAAGATTATCATGGCTGACTATGAATGTGCGTCGGGTCAAAGCATTAATGTGGATAAATCTCAAATATGTTTCTTCAGGAATGTGCCAAGCGATACTCAGTCTTACCTTAGCTCAATTTTGCAAATGAAGTCTGCGGACAATTTGGGATCTTACCTTGGCTTGCCTTCGTCTTTCCACCGTAGTCGGAGCAAGGATTTCAAGGGTATTCTTGATCGTGTATGGCTACCGAAAGGTCTTCTAGACAGTATTTCTTCATTATGTGCGAGGTTTTGGTGGGGCTCTTCTGATACGAAAAAGCGTATTCATTGGAAGAAGTGGAAGGAGTTGTGTAAGCCAGAAGAGCAGGGCGGTCTGAATTTTTGGGACTTAGAGATATTCATTCAAGCAATGCTCGCAAAACAAGCCTGGCGTGTCCTGACCCTTCCAGAGTCAACGGTGGCAAGAGTTCTTAAAGGGAGGTATTTTCCATCCTCAGACGTTTTAGAATCAGAGGTACGCTCAAACTCTTCTTATTTTTGGAAAGGTTTTATATGGGGATTGGATTTGTTGAAATCTGGAATTAGGAAAAAAATTGGTAATGGAAATTCTGTTCGAGTAATGGTGGACCCTTGGATTCCTCGTCCTTATACTTTTAAAGTGCTTGGTTACAAGATATTTGATCCAGAGTTGACAGTAGTTCTTAAAGGGATTGGATTTTCCATCCTCAGACGTTTTAGAATCAGAGTGCTTGGTTACAAGATATTTGATCCAGAGTTGACAGTAGTGGATTGTATTCTGCCGTCTATTCAATGGGATATTCCGAAACTCCAACATGTTCTTTTGGATGAGGATGTTCAAGAGATTATAAGACTCCCAGCTAGCGAGACAACTCCGGATAGATGGATTTGGCATTTTGATAAATTTGGAGGTTATATGGTTAAGAGTGGGTACAAATTGGGTATGTATCAGAGAATAGAGGAGTCACCTTCGGACACTGATATCAGTTCCAAGTGGTGGAAGAGACTTTGGTCGACTTTGGGGTATAGTGATATGGTGAGGGCAGAGTTCATTATGAATATTCAGGACCGGTGGATACATATCTGCAATACTGTTTCAATATTGGATCTGGAAAGGATCTGTTTGGGATCTTGGGCTCTATGGAACGATCGGAATTGCGTGTTTCATAAGCGGCCAATTCCTCCGGTGGGGGTCCGGTGTGATTGGACTTTGGATTATCTTTCTGAATACCAATCAGCTCACCGGTCCAACGATCGAATATTCCAAACGAGGGATATGGTCTCTCAGATGATTTCAGGTGGGGAGGATTTTATTCTAAATGTCGACGCAGTGTGGTCCAAACATACCACGACCAGTGGAGTGAGGGTAGTATTACACACAAAGTCGGGTAAGTTGGTGGCTATTTTACAAAAAGGGATTCCTTTACCTTCTTCTCCATTATGTGTTGAGGTGATTGCGATACTTGAAGGTCTTAATATGACTTCTTCTTTGAGGATTAGTAAGATTGAGGCCACCATTGTGATAATCCCAGAAAGCGTTGTCTCTGGCGACGACAAGGAAGCTCACAGCGGCACTGCGACTGCGTTGGTCTCCAGTGAAGAAGCTATATCAAATGTTAATCGAGAAACCAAGGAGGATGTTTCTTGA

Protein sequence

MRFRCVGAFIPFTKALKEWGYKKNRVRWDNIRQVKDKIKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPRRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRLPASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIEESPSDTDISSKWWKRLWSTLGYSDMVRAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEYQSAHRSNDRIFQTRDMVSQMISGGEDFILNVDAVWSKHTTTSGVRVVLHTKSGKLVAILQKGIPLPSSPLCVEVIAILEGLNMTSSLRISKIEATIVIIPESVVSGDDKEAHSGTATALVSSEEAISNVNRETKEDVS
Homology
BLAST of Lag0018493 vs. NCBI nr
Match: XP_022158377.1 (uncharacterized protein LOC111024874 [Momordica charantia])

HSP 1 Score: 513.5 bits (1321), Expect = 3.7e-141
Identity = 315/961 (32.78%), Postives = 448/961 (46.62%), Query Frame = 0

Query: 13   TKALKEWGYKKNRVRWDNIRQVKDK---IKVAYDRPTLIDFTIVHRLESHLNKHKRNTIS 72
            + AL+ WG  ++ V WD  +Q+K +   I  AY++P  +DFTI+H LE        N ++
Sbjct: 488  SSALRHWG--RSNV-WDLFKQIKAQKAAIIDAYNQPLPLDFTIIHALE--------NDLA 547

Query: 73   GVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQD 132
            G+     L L E     ++ E + K  +     +A D E ++  IP ++T +++  L   
Sbjct: 548  GL-----LELEEIFWKQRSREDWLK--WGIAILNALDIEAIINLIPTRITSEVNEQLLAP 607

Query: 133  YSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHT 192
            Y++EE+E  +R+ +PTKA GPDGFP LFYQ YW +VGP+T++ CL  LN    +K WN T
Sbjct: 608  YTKEEIELAIRQMFPTKALGPDGFPALFYQTYWHVVGPKTLEACLNALNNGDDIKKWNST 667

Query: 193  NIVLIPKVPNPR------------------------------------------------ 252
             I LIPK+  PR                                                
Sbjct: 668  YIALIPKIKQPRSISDFRPISLCNVSYKIISKSITNRLKNVIGLVISDAQSAFVPSRAIS 727

Query: 253  ----------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLIN 312
                            + G  G AALKLD+SK +DRVEW+++  +M ++GF   WI  I 
Sbjct: 728  DNVIIGHECLHTINSCKSGLIGMAALKLDLSKAFDRVEWTYLECIMRKMGFNEGWIQAII 787

Query: 313  DCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGI 372
             CIST  FSI +NG   G F PSRG+RQGDPLSPYLFLLC+EGLSA++     +G L GI
Sbjct: 788  QCISTVRFSIHLNGSPGGCFQPSRGIRQGDPLSPYLFLLCAEGLSALINHENNSGRLTGI 847

Query: 373  AMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNV 432
                ++  I HL FADDSLIFL++   E   L+ ++  Y  ASGQ IN  KS + F  NV
Sbjct: 848  HFEENNTSITHLLFADDSLIFLRSLESECLALRRLLDSYGRASGQCINFSKSALLFSPNV 907

Query: 433  PSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCA 492
              + Q YL  IL +K   + G+YLGLPS F R R +                        
Sbjct: 908  HPERQQYLQCILNVKLVSHFGNYLGLPSHFTRRRGE------------------------ 967

Query: 493  RFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVAR 552
                      +++HW KW  +C P+E GGLNF DLE F QA++AK  WR L  P   V++
Sbjct: 968  ---------SRKLHWMKWGRMCYPKECGGLNFRDLEGFNQALVAKHVWRFLQHPNLLVSK 1027

Query: 553  VLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRP 612
            VLK +YF  + +L++   S SSYFWKGF+WG DLL  G+R ++GNG++++   DPW+PRP
Sbjct: 1028 VLKHKYFKDTSLLQASNNSKSSYFWKGFLWGRDLLVKGLRLRVGNGSTIKAFSDPWLPRP 1087

Query: 613  YTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPK 672
             TFK L                      RF    L       + TV   I     WD+  
Sbjct: 1088 TTFKPL----------------------RFNNGAL-------DTTVASFITADGNWDVTS 1147

Query: 673  LQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIEESPSDTDI 732
            + H   +ED   I+ +P +S    D W+WH+DK G Y V+SGYKL M+ +   + + T+ 
Sbjct: 1148 ISHSFCNEDRDLILSMPISSYNLQDSWLWHYDKRGNYSVRSGYKLYMHLKCNATSASTNY 1207

Query: 733  SSKWW------------------------------------------------------- 792
                W                                                       
Sbjct: 1208 RGTQWNSIWKLTVPTKIKIFIWRSAHEHIPTAQNLLLRGIGELPACTICGDRRESIIHAF 1267

Query: 793  ------KRLWSTL-GYSDMVRAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNC 839
                  +++W TL  +   + AE  ++  + W  +   +   DL    +  W +WNDRN 
Sbjct: 1268 FHCKRARQIWRTLFPFLTCLSAEDNISFLELWSSLTEQLEPKDLNLAAITGWGIWNDRNS 1327

BLAST of Lag0018493 vs. NCBI nr
Match: XP_023878301.1 (uncharacterized protein LOC111990748 [Quercus suber])

HSP 1 Score: 472.2 bits (1214), Expect = 9.5e-129
Identity = 300/946 (31.71%), Postives = 452/946 (47.78%), Query Frame = 0

Query: 62   KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPD 121
            + K+NTI G+    G W    + I +A  SYF +I++S  PS    E+V + IP KVT +
Sbjct: 326  RRKQNTIVGIWDEQGRWCDNEESIAQAAISYFNNIYSSSHPS--QIEEVTEAIPFKVTEE 385

Query: 122  MSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKR 181
            M+ +L +++++EEV   +++ +P KAPGPDG   +F+QKYW +VG    D  L +LN   
Sbjct: 386  MNESLIREFTKEEVAVALKQIHPNKAPGPDGMSAVFFQKYWSIVGNNVTDMVLNVLNHNL 445

Query: 182  SVKDWNHTNIVLIPKVPNPRR--------------------------------------- 241
             + + N TNI LIPK  NP+R                                       
Sbjct: 446  PIPELNKTNISLIPKTNNPKRMTDFRPISLCNVVYKLISKILANRLKPLLPHIISENQSA 505

Query: 242  -------------------------KGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFP 301
                                      GK+G+ A+KLD+SK +DRVEW FI  VME++GF 
Sbjct: 506  FTSDRLITDNVLVAFELMHYLDHKTAGKEGFMAIKLDMSKAFDRVEWGFIAKVMEQMGFC 565

Query: 302  CDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAK 361
              W +L+  CI++ S+SI+ING A G+ YPSRGLRQGDPLSP LFLLC+EGLSA++  A 
Sbjct: 566  NRWRDLVMQCITSVSYSILINGVAHGNIYPSRGLRQGDPLSPSLFLLCAEGLSALINQAA 625

Query: 362  RNGLM-GIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKS 421
            RN L+ GI++    PK+ HLFFADDS++F KA+ EE   L+ I+  YE ASGQ IN DKS
Sbjct: 626  RNKLITGISINRGCPKVTHLFFADDSILFCKAAYEECHLLRSILGQYEEASGQKINTDKS 685

Query: 422  QICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRV------W 481
             I F  N   +T+  + +IL          YLGLPS   RS+S+ F  + ++V      W
Sbjct: 686  SIFFSPNTAQETRDEIFNILGPMQNSRHTKYLGLPSLIGRSKSQVFAMLKEKVGHKLAGW 745

Query: 482  ------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHWK 541
                                          LP+GL D +  +   FWWG  + + ++ W 
Sbjct: 746  KGKLLSMGGKEILIKAVAQAIPTYTMSCFLLPQGLCDDMERMMKNFWWGQRNQETKMGWI 805

Query: 542  KWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESE 601
             WK +C  +  GGL F +L+ F  AMLAKQAWR+L  P S V RVLK RYFP+ D+L ++
Sbjct: 806  SWKRMCNSKASGGLGFRNLKAFNLAMLAKQAWRILYNPNSLVGRVLKARYFPTGDLLNAK 865

Query: 602  VRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPELT 661
            + S+ SY W+     L++++ G R ++GNG  + +  D W+P P T+KV+  +I + E  
Sbjct: 866  LGSSPSYSWRSIHSSLEVIRRGTRWRVGNGKQIHIWEDRWLPTPSTYKVISPQIHNFEFP 925

Query: 662  VVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIRL 721
            +V                    + DP+         +  W +  L+ + L  +V+ I+R+
Sbjct: 926  LV------------------SSLIDPD---------TKWWKVEALRSIFLPFEVETILRI 985

Query: 722  PASETTP-DRWIWHFDKFGGYMVKSGYKLGMYQRIEESP----SDTDISSKWWKRLW--- 781
            P S   P D+ IW  +K G + VKS Y +  +  I+ +     S+ D     WK+LW   
Sbjct: 986  PLSYNLPEDKLIWIGNKKGEFSVKSAYHIA-HSIIDPNERGECSNGDPYRLLWKKLWLLN 1045

Query: 782  -------------------------------STLGYSDMV------------RAEFIM-- 838
                                           ST     +V             A  +   
Sbjct: 1046 LPGKIKIFAWRACVDGLPTYDNISKRGICCSSTCPICGLVTEDVNHALLYCEAASLVWCF 1105

BLAST of Lag0018493 vs. NCBI nr
Match: ONI01138.1 (hypothetical protein PRUPE_6G123900 [Prunus persica])

HSP 1 Score: 462.2 bits (1188), Expect = 9.9e-126
Identity = 249/718 (34.68%), Postives = 371/718 (51.67%), Query Frame = 0

Query: 61  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTP 120
           ++ KRN + G+  A+  W TE  +I   F  YFK +F+S        E++L  +   +T 
Sbjct: 27  SRSKRNRVCGIFDANQAWQTEEQRIGDLFCDYFKTLFSSS--GGQQMERILNEVRPVITS 86

Query: 121 DMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRK 180
            M+  L Q ++REE+E  + + +PTKAPG DG P LF+QKYW +VG +   +CL ILN +
Sbjct: 87  AMNAQLLQAFTREELEHTLFQMFPTKAPGHDGMPALFFQKYWHIVGDKVAKKCLQILNGE 146

Query: 181 RSVKDWNHTNIVLIPKVPNPR--------------------------------------- 240
            SV+++NHT I LIPKV  P                                        
Sbjct: 147 GSVREFNHTLIALIPKVKMPTIVSEFRPISLCTTVYKMIAKTIANRLKTVLSHVITETQS 206

Query: 241 -------------------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGF 300
                                    +KG+    ALKLD++K YDRVEW F+RA+M +LGF
Sbjct: 207 AFVPNRMILDNVMAAFEIMNTIKGVKKGRDVQMALKLDMAKAYDRVEWVFLRAMMLKLGF 266

Query: 301 PCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA 360
              W++ + DCIST +FS++  G   GH  P RGLRQG PLSPYLFL+C+EG S +L  A
Sbjct: 267 SATWVSKVMDCISTTTFSVLWKGTPVGHIMPQRGLRQGCPLSPYLFLICTEGFSCLLRGA 326

Query: 361 KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDK 420
           +R G L+G+ +   +P + HL FADDS++F+KA+ ++   L+ +   YE  +GQ IN  K
Sbjct: 327 ERRGDLVGVQVARGAPSVTHLLFADDSILFMKATNKDCMALETLFQTYEEVTGQQINYSK 386

Query: 421 SQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVW----- 480
           S +    N        +  +L +       +YLGLP+   + R + F+ + D++W     
Sbjct: 387 SALSLSPNATRADFDMIEGVLNVPVVRCHENYLGLPTIAGKGRKQLFQHLKDKLWKHISG 446

Query: 481 -------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHW 540
                                          +PKGL   ++ + ARFWW  +  K+ IHW
Sbjct: 447 WKEKLLSRAGKEILIKAVLQAIPTYSMSCFRIPKGLCKELNGIMARFWWAKAKDKRGIHW 506

Query: 541 KKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLES 600
            KW+ LCK +  GGL F DLE F QA+LAKQ WR+L  PES VAR+ + RY PS   LE+
Sbjct: 507 VKWELLCKSKFAGGLGFRDLEAFNQALLAKQCWRILRTPESLVARIFRARYHPSVPFLEA 566

Query: 601 EVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL 660
           EV +N S+ W+   WG +LL  G+R ++G+G S++V  D W+P P  FK++      P+L
Sbjct: 567 EVGTNPSFIWRSLQWGKELLNKGLRWRVGSGVSIQVYTDKWLPAPSCFKIMS----PPQL 626

Query: 661 TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIR 674
            +  +                         V D    S QW++P L+ +  D++V  I++
Sbjct: 627 PLSTR-------------------------VCDLFTSSGQWNVPLLKDIFWDQEVDAILQ 686

BLAST of Lag0018493 vs. NCBI nr
Match: XP_024156142.1 (uncharacterized protein LOC112164137 [Rosa chinensis])

HSP 1 Score: 456.8 bits (1174), Expect = 4.1e-124
Identity = 277/836 (33.13%), Postives = 399/836 (47.73%), Query Frame = 0

Query: 61  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTP 120
           N+ KRN ISG+   DG+W TE   +      YF  +F++ SP   D        P+ VT 
Sbjct: 170 NRKKRNAISGLFNNDGVWCTEDSDLENIVLDYFGTLFSTSSPKNMDL--FTNLFPQVVTG 229

Query: 121 DMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRK 180
           +M+  L +++  EE+   + + +P KAPGPDGF  +FYQ+YW +VG   +      +N +
Sbjct: 230 EMNSELVREFGEEEILQALNQMHPLKAPGPDGFSPIFYQRYWSVVGRDVIAAVRCFMNSE 289

Query: 181 RSVKDWNHTNIVLIPKVP------------------------------------------ 240
             +++ N T + LIPKV                                           
Sbjct: 290 DFLREVNGTYVTLIPKVKEVENMQQLRPISLCNVIYKLGSKVLANRLKPLLQDIIAPTQS 349

Query: 241 ----------------------NPRRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGF 300
                                   R  G  GY ALKLD+SK YDRVEW FI AVM  +GF
Sbjct: 350 AFVPGRQISDNSLLAFELSHFLKRRTGGSHGYGALKLDMSKAYDRVEWEFIEAVMRSMGF 409

Query: 301 PCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILG-T 360
              WIN I  C++T S+S ++NGE +GH  P+RGLRQGD +SPYLFLLC+EGLS +L   
Sbjct: 410 DQIWINWIMGCVTTVSYSFLLNGEPRGHLIPTRGLRQGDSISPYLFLLCAEGLSRMLSYE 469

Query: 361 AKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDK 420
            +++ L GIA+   +P I HLFFADDS +F+KA  EE   +K I+  YE ASGQ +N  K
Sbjct: 470 EEQHRLHGIAIAMGAPSINHLFFADDSFVFMKAEREECARVKEILKWYEDASGQQVNFQK 529

Query: 421 SQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRV------ 480
           S+I F +NV    Q  L+ +  ++  D    YLGLP+    S+ + F+ I+++       
Sbjct: 530 SKISFSKNVDIGCQEELAEVFGVERVDKHDKYLGLPTKVSYSKIEAFQFIMEKTKNKMKN 589

Query: 481 W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHW 540
           W                              LPK L   +    A FWWG S+  ++IHW
Sbjct: 590 WKDKTLSVAGKEVMIKSVVQSVPTYVMSCFELPKHLCQEMHRCMAEFWWGDSEKGRKIHW 649

Query: 541 KKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLES 600
             W ++C P+E+GGL F ++E F QA+LAKQ WR+L  P+S + + LK +YFP++D + +
Sbjct: 650 LAWDKMCVPKEEGGLGFRNMEYFNQALLAKQGWRILRHPDSLLGKTLKAKYFPNNDFIHA 709

Query: 601 EVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL 660
            V    SY W+  + G  LL+ G+R ++G G  + V  DPWIPRPY+F+           
Sbjct: 710 SVNQGDSYTWRSLMKGKVLLEKGLRFQVGLGTRISVWFDPWIPRPYSFR---------PY 769

Query: 661 TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP-SIQWDIPKLQHVLLDEDVQEII 720
           + V++G+                    +LTV D I P S  W +  L+ +   ++V  I 
Sbjct: 770 STVMEGL-------------------EDLTVADLIDPDSKDWMVDWLEELFFADEVDLIR 829

Query: 721 RLPASETTP-DRWIWHFDKFGGYMVKSGY-------KLGMYQRIEESPSDTDISSKWWKR 780
           ++P S   P DR IWHFDK G Y VKSGY        L  +     S  D D+    W+R
Sbjct: 830 KIPLSLRNPEDRLIWHFDKRGLYSVKSGYHVARCVASLSSHVSTSNSQGDKDL----WRR 889

Query: 781 LWSTLGYSDMVRA---EFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRP 783
           +W        VR      + NI    +++   V+ LD ERIC            VF +  
Sbjct: 890 VWHA-RVQPKVRNFVWRLVKNIVPTKVNLGRRVN-LD-ERICPFCRCESETTLHVFMECN 949

BLAST of Lag0018493 vs. NCBI nr
Match: XP_024172304.2 (uncharacterized protein LOC112178381 [Rosa chinensis])

HSP 1 Score: 455.7 bits (1171), Expect = 9.2e-124
Identity = 276/836 (33.01%), Postives = 400/836 (47.85%), Query Frame = 0

Query: 61   NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTP 120
            N+ KRN ISG+   DG+W TE   +      YF  +F++ SP   + E      P+ VT 
Sbjct: 378  NRKKRNAISGLFNNDGVWCTEDSDLENIVLDYFGTLFSTSSPK--NMELFTNLFPQVVTG 437

Query: 121  DMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRK 180
             M+  L +++  EE+   + + +P KAPGPDGF  +FYQ+YW +VG   +      +N +
Sbjct: 438  AMNSELVREFGEEEILQALNQMHPLKAPGPDGFSPIFYQRYWSVVGRDVIAAVRCFMNSE 497

Query: 181  RSVKDWNHTNIVLIPKVP------------------------------------------ 240
              +++ N T + LIPKV                                           
Sbjct: 498  DFLREVNGTYVTLIPKVKEVENMQQLRPISLCNVIYKLGSKVLANRLKPLLQDIIAPTQS 557

Query: 241  ----------------------NPRRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGF 300
                                    R  G  GY ALKLD+SK YDRVEW FI AVM  +GF
Sbjct: 558  AFVPGRQISDNSLLAFELSHFLKRRTGGSHGYGALKLDMSKAYDRVEWEFIEAVMRSMGF 617

Query: 301  PCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILG-T 360
               WI  I  C++T S+S ++NGE +GH  P+RGLRQGD +SPYLFLLC+EGLS +L   
Sbjct: 618  DQIWIKWIMGCVTTVSYSFLLNGEPRGHLIPTRGLRQGDSISPYLFLLCAEGLSRMLSYE 677

Query: 361  AKRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDK 420
             +++ L GIA+   +P I HLFFADDS +F+KA  EE   +K I+  YE ASGQ +N  K
Sbjct: 678  EEQHRLHGIAIAMGAPSINHLFFADDSFVFMKAEREECARVKEILKWYEDASGQQVNFQK 737

Query: 421  SQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRV------ 480
            S+I F +NV    Q  L+ +  ++  D    YLGLP+    S+++ F+ I+++       
Sbjct: 738  SKISFSKNVDIGCQEELAEVFGVERVDKHDKYLGLPTEVSYSKTEAFQFIMEKTRNKMKN 797

Query: 481  W------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHW 540
            W                              LPK L   +    A FWWG S+  ++IHW
Sbjct: 798  WKDKTLSVAGKEVMIKSVVQSVPTYVMSCFELPKHLCQEMHRCMAEFWWGDSEKGRKIHW 857

Query: 541  KKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLES 600
              W ++C P+E+GGL F ++E F QA+LAKQ WR+L  P+S + + LK +YFP++D + +
Sbjct: 858  LAWDKMCVPKEKGGLGFRNMEYFNQALLAKQGWRILRHPDSLLGKTLKAKYFPNNDFIHA 917

Query: 601  EVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL 660
             V    SY W+  + G  LL+ G+R ++G+G  + V  DPWIPRPY+F+           
Sbjct: 918  SVNQGDSYTWRSLMKGKVLLEKGLRFQVGSGTRISVWFDPWIPRPYSFR---------PY 977

Query: 661  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP-SIQWDIPKLQHVLLDEDVQEII 720
            + V++G+                    +LTV D I P S  W +  L+ +   ++V  I 
Sbjct: 978  STVMEGL-------------------EDLTVADLIDPDSKDWMVDWLEELFFADEVDLIR 1037

Query: 721  RLPASETTP-DRWIWHFDKFGGYMVKSGY-------KLGMYQRIEESPSDTDISSKWWKR 780
            ++P S   P DR IWHFDK G Y VKSGY        L  +     S  D D+    W+R
Sbjct: 1038 KIPLSLRNPEDRLIWHFDKRGLYSVKSGYHVARCVASLSSHVSTSNSQGDKDL----WRR 1097

Query: 781  LWSTLGYSDMVRA---EFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNCVFHKRP 783
            +W        VR      + NI    +++   V+ LD ERIC            VF +  
Sbjct: 1098 VWHA-RVQPKVRNFVWRLVKNIVPTKVNLGRRVN-LD-ERICPFCRCESETTLHVFMECN 1157

BLAST of Lag0018493 vs. ExPASy Swiss-Prot
Match: P93295 (Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 GN=AtMg00310 PE=4 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 5.4e-26
Identity = 55/132 (41.67%), Postives = 83/132 (62.88%), Query Frame = 0

Query: 411 LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCK-PEEQGGLNFWDLEIFIQAMLAK 470
           L K L   ++S    FWW S + K++I W  W++LCK  E+ GGL F DL  F QA+LAK
Sbjct: 13  LSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDLGWFNQALLAK 72

Query: 471 QAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGN 530
           Q++R++  P + ++R+L+ RYFP S ++E  V +  SY W+  I G +LL  G+ + IG+
Sbjct: 73  QSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGRELLSRGLLRTIGD 132

Query: 531 GNSVRVMVDPWI 542
           G   +V +D WI
Sbjct: 133 GIHTKVWLDRWI 144

BLAST of Lag0018493 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 7.6e-20
Identity = 87/424 (20.52%), Postives = 166/424 (39.15%), Query Frame = 0

Query: 31  IRQVKDKIKVAYDRPTLIDFTIVHRLESHLNKHKRNTISGVEGADGLWLTEADQIHKAFE 90
           ++++ +     ++R   ID  +   ++    K ++N I  ++   G   T+  +I     
Sbjct: 356 LQKINESRSWFFERINKIDRPLARLIK---KKREKNQIDTIKNDKGDITTDPTEIQTTIR 415

Query: 91  SYFKDIFNSQSPSATDFEKVL-KFIPRKVTPDMSHTLTQDYSREEVEGVVRKFYPTKAPG 150
            Y+K ++ ++  +  + +  L  +   ++  +   +L +  +  E+  ++      K+PG
Sbjct: 416 EYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPG 475

Query: 151 PDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHTNIVLIPKVPNPRRK------ 210
           PDGF   FYQ+Y + + P  +    +I         +   +I+LIPK      K      
Sbjct: 476 PDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRP 535

Query: 211 ------------------------------------GKQG-------------------- 270
                                               G QG                    
Sbjct: 536 ISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDK 595

Query: 271 -YAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLINDCISTASFSIIINGEAKGHFY 330
            +  + +D  K +D+++  F+   + +LG    ++ +I       + +II+NG+    F 
Sbjct: 596 NHVIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFP 655

Query: 331 PSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNGLMGIAMTPSSPKIFHLFFADDSLIFL 390
              G RQG PLSP LF +  E L+  +   K   + GI +     K+    FADD +++L
Sbjct: 656 LKTGTRQGCPLSPLLFNIVLEVLARAIRQEKE--IKGIQLGKEEVKL--SLFADDMIVYL 715

BLAST of Lag0018493 vs. ExPASy Swiss-Prot
Match: P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)

HSP 1 Score: 96.7 bits (239), Expect = 1.4e-18
Identity = 89/395 (22.53%), Postives = 156/395 (39.49%), Query Frame = 0

Query: 62  KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLK--FIPRKVT 121
           K  ++ IS +   +    T+  +I K    Y+K +++ +  +  + ++ L+   +PR   
Sbjct: 383 KRVKSLISSIRNGNDEITTDPSEIQKILNEYYKKLYSHKYENLKEIDQYLEACHLPRLSQ 442

Query: 122 PDMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNR 181
            ++   L +  S  E+   ++     K+PGPDGF + FYQ + + + P  ++    I   
Sbjct: 443 KEV-EMLNRPISSSEIASTIQNLPKKKSPGPDGFTSEFYQTFKEELVPILLNLFQNIEKE 502

Query: 182 KRSVKDWNHTNIVLIPKV-PNPRRK----------------------------------- 241
                 +   NI LIPK   +P RK                                   
Sbjct: 503 GILPNTFYEANITLIPKPGKDPTRKENYRPISLMNIDAKILNKILTNRIQQHIKKIIHHD 562

Query: 242 ------GKQG---------------------YAALKLDISKTYDRVEWSFIRAVMERLGF 301
                 G QG                     +  L +D  K +D ++  F+   ++++G 
Sbjct: 563 QVGFIPGSQGWFNIRKSINVIQHINKLKNKDHMILSIDAEKAFDNIQHPFMIRTLKKIGI 622

Query: 302 PCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA 361
              ++ LI    S  + +II+NG     F    G RQG PLSP LF +  E L+  +   
Sbjct: 623 EGTFLKLIEAIYSKPTANIILNGVKLKSFPLRSGTRQGCPLSPLLFNIVMEVLA--IAIR 682

Query: 362 KRNGLMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKS 389
           +   + GI +   S +I    FADD +++L+ + +    L  ++ +Y   SG  IN  KS
Sbjct: 683 EEKAIKGIHI--GSEEIKLSLFADDMIVYLENTRDSTTKLLEVIKEYSNVSGYKINTHKS 742

BLAST of Lag0018493 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 4.8e-14
Identity = 61/250 (24.40%), Postives = 107/250 (42.80%), Query Frame = 0

Query: 406 LDRVWLPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQA 465
           +  + LP+ +L+ +  L   F WGS+  KK+ H  KW ++C P+++GGL     +   +A
Sbjct: 53  MSTILLPQSILNRLDQLSRTFLWGSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRA 112

Query: 466 MLAKQAWRVLTLPESTVARVLKGRYFPSSDVLESE---VRSNSSYFWKGFIWGL-DLLKS 525
           +++K  WR+L    S    VL+ +Y    ++ +S     + + S  W+    GL D++  
Sbjct: 113 LISKVGWRLLQEKNSLWTLVLQKKYH-VGEIRDSRWLIPKGSWSSTWRSIAIGLRDVVSH 172

Query: 526 GIRKKIGNGNSVRVMVDPWIP-RPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLG 585
           G+    G+G  +R   D W+  +P      G +  D + TVV K                
Sbjct: 173 GVGWIPGDGQQIRFWTDRWVSGKPLLELDNGERPTDCD-TVVAK---------------- 232

Query: 586 YKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEI--IRLPASETTPDRWIWHFDKFG 645
                      D  +P   WD  K+     +    E+  + L       DR  W F + G
Sbjct: 233 -----------DLWIPGRGWDFAKIDPYTTNNTRLELRAVVLDLVTGARDRLSWKFSQDG 273

Query: 646 GYMVKSGYKL 649
            + V+S Y++
Sbjct: 293 QFSVRSAYEM 273

BLAST of Lag0018493 vs. ExPASy Swiss-Prot
Match: P92555 (Uncharacterized mitochondrial protein AtMg01250 OS=Arabidopsis thaliana OX=3702 GN=AtMg01250 PE=4 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 1.4e-13
Identity = 40/68 (58.82%), Postives = 50/68 (73.53%), Query Frame = 0

Query: 256 IINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIF 315
           IING  +G   PSRGLRQGDPLSPYLF+LC+E LS +   A+  G L GI ++ +SP+I 
Sbjct: 13  IINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRIN 72

Query: 316 HLFFADDS 323
           HL FADD+
Sbjct: 73  HLLFADDT 80

BLAST of Lag0018493 vs. ExPASy TrEMBL
Match: A0A6J1DX30 (uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024874 PE=4 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 1.8e-141
Identity = 315/961 (32.78%), Postives = 448/961 (46.62%), Query Frame = 0

Query: 13   TKALKEWGYKKNRVRWDNIRQVKDK---IKVAYDRPTLIDFTIVHRLESHLNKHKRNTIS 72
            + AL+ WG  ++ V WD  +Q+K +   I  AY++P  +DFTI+H LE        N ++
Sbjct: 488  SSALRHWG--RSNV-WDLFKQIKAQKAAIIDAYNQPLPLDFTIIHALE--------NDLA 547

Query: 73   GVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPDMSHTLTQD 132
            G+     L L E     ++ E + K  +     +A D E ++  IP ++T +++  L   
Sbjct: 548  GL-----LELEEIFWKQRSREDWLK--WGIAILNALDIEAIINLIPTRITSEVNEQLLAP 607

Query: 133  YSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKRSVKDWNHT 192
            Y++EE+E  +R+ +PTKA GPDGFP LFYQ YW +VGP+T++ CL  LN    +K WN T
Sbjct: 608  YTKEEIELAIRQMFPTKALGPDGFPALFYQTYWHVVGPKTLEACLNALNNGDDIKKWNST 667

Query: 193  NIVLIPKVPNPR------------------------------------------------ 252
             I LIPK+  PR                                                
Sbjct: 668  YIALIPKIKQPRSISDFRPISLCNVSYKIISKSITNRLKNVIGLVISDAQSAFVPSRAIS 727

Query: 253  ----------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWINLIN 312
                            + G  G AALKLD+SK +DRVEW+++  +M ++GF   WI  I 
Sbjct: 728  DNVIIGHECLHTINSCKSGLIGMAALKLDLSKAFDRVEWTYLECIMRKMGFNEGWIQAII 787

Query: 313  DCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGI 372
             CIST  FSI +NG   G F PSRG+RQGDPLSPYLFLLC+EGLSA++     +G L GI
Sbjct: 788  QCISTVRFSIHLNGSPGGCFQPSRGIRQGDPLSPYLFLLCAEGLSALINHENNSGRLTGI 847

Query: 373  AMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKSQICFFRNV 432
                ++  I HL FADDSLIFL++   E   L+ ++  Y  ASGQ IN  KS + F  NV
Sbjct: 848  HFEENNTSITHLLFADDSLIFLRSLESECLALRRLLDSYGRASGQCINFSKSALLFSPNV 907

Query: 433  PSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLLDSISSLCA 492
              + Q YL  IL +K   + G+YLGLPS F R R +                        
Sbjct: 908  HPERQQYLQCILNVKLVSHFGNYLGLPSHFTRRRGE------------------------ 967

Query: 493  RFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVAR 552
                      +++HW KW  +C P+E GGLNF DLE F QA++AK  WR L  P   V++
Sbjct: 968  ---------SRKLHWMKWGRMCYPKECGGLNFRDLEGFNQALVAKHVWRFLQHPNLLVSK 1027

Query: 553  VLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRP 612
            VLK +YF  + +L++   S SSYFWKGF+WG DLL  G+R ++GNG++++   DPW+PRP
Sbjct: 1028 VLKHKYFKDTSLLQASNNSKSSYFWKGFLWGRDLLVKGLRLRVGNGSTIKAFSDPWLPRP 1087

Query: 613  YTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPK 672
             TFK L                      RF    L       + TV   I     WD+  
Sbjct: 1088 TTFKPL----------------------RFNNGAL-------DTTVASFITADGNWDVTS 1147

Query: 673  LQHVLLDEDVQEIIRLP-ASETTPDRWIWHFDKFGGYMVKSGYKLGMYQRIEESPSDTDI 732
            + H   +ED   I+ +P +S    D W+WH+DK G Y V+SGYKL M+ +   + + T+ 
Sbjct: 1148 ISHSFCNEDRDLILSMPISSYNLQDSWLWHYDKRGNYSVRSGYKLYMHLKCNATSASTNY 1207

Query: 733  SSKWW------------------------------------------------------- 792
                W                                                       
Sbjct: 1208 RGTQWNSIWKLTVPTKIKIFIWRSAHEHIPTAQNLLLRGIGELPACTICGDRRESIIHAF 1267

Query: 793  ------KRLWSTL-GYSDMVRAEFIMNIQDRWIHICNTVSILDLERICLGSWALWNDRNC 839
                  +++W TL  +   + AE  ++  + W  +   +   DL    +  W +WNDRN 
Sbjct: 1268 FHCKRARQIWRTLFPFLTCLSAEDNISFLELWSSLTEQLEPKDLNLAAITGWGIWNDRNS 1327

BLAST of Lag0018493 vs. ExPASy TrEMBL
Match: M5VU98 (Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRUPE_ppa022115mg PE=4 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 1.2e-132
Identity = 294/900 (32.67%), Postives = 425/900 (47.22%), Query Frame = 0

Query: 61   NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTP 120
            N+ +RN I G+E ++G W T    I      YF D+F S   S    E++L  +  KVT 
Sbjct: 779  NRRRRNIIKGLEDSNGCWRTSRQGITSIVIDYFGDLFRSSGSSM--MEEILSALEPKVTA 838

Query: 121  DMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRK 180
            DM   L  D+S +E++  V +  P+KAPGPDG P LFYQKYW +VG   V    A L   
Sbjct: 839  DMQQVLIADFSYQEIKDAVFQMQPSKAPGPDGLPPLFYQKYWRIVGDDVVAAVRAFLQSN 898

Query: 181  RSVKDWNHTNIVLIPKVPNP---------------------------------------- 240
              ++  NHT + LIPKV  P                                        
Sbjct: 899  EMLRQLNHTFVTLIPKVKEPRTMAQLRPISLCNVLYRIGAKTLANRMKFVMQSVISESQS 958

Query: 241  ------------------------RRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGF 300
                                    RR+G++G  ALKLD+SK YDRVEW F+  +M  +GF
Sbjct: 959  AFVPGRLITDNSIVAFEIAHFLKQRRRGRKGSLALKLDMSKAYDRVEWEFLEKMMLAMGF 1018

Query: 301  PCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA 360
            P  W+ ++ DC++T S+S ++NGE     YP+RGLRQGDPLSPYLFLLC+EG + +L  A
Sbjct: 1019 PILWVRMVMDCVTTVSYSFLVNGEPTRILYPTRGLRQGDPLSPYLFLLCAEGFTTLLSKA 1078

Query: 361  KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDK 420
            +R G L GI +   +P + HLFFADDS +F KA+    G LK I   YE ASGQ IN  K
Sbjct: 1079 ERQGQLQGIVICRGAPTVSHLFFADDSFVFAKATDNNCGVLKHIFEVYEHASGQQINCQK 1138

Query: 421  SQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVW----- 480
            S + F  N+  DTQS L+S+L +   D+  +YLGLP    R+++  F+ + +RVW     
Sbjct: 1139 SCVAFSANIHMDTQSRLASVLGVPRVDSHATYLGLPMMLGRNKTVCFRYLKERVWKKLQG 1198

Query: 481  -------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHW 540
                                           LP+GL   I  + ARFWWG     ++IHW
Sbjct: 1199 WREQTLSIAGKEVLLKVVAQSIPLYVMSCFLLPQGLCHEIEQMMARFWWGQQGENRKIHW 1258

Query: 541  KKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLES 600
             +W+ LCK + +GG+ F  L+ F  AMLAKQ WR++  P S  +R+LK +YFP ++  E+
Sbjct: 1259 MRWERLCKAKTEGGMGFRCLQAFNMAMLAKQGWRLVHNPHSLASRLLKAKYFPQTNFWEA 1318

Query: 601  EVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL 660
             + S  S  WK       +L+ G R +IG+G SVR+  D W+PRP TF V+   +   E 
Sbjct: 1319 TLGSRPSCVWKSIWTARKVLEMGSRFQIGDGKSVRIWGDKWVPRPATFAVITSPLDGMEN 1378

Query: 661  TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIR 720
            T V + I                          C   S QWD+ KL ++ L  DV +I+R
Sbjct: 1379 TKVSELI--------------------------CNEGSPQWDLQKLNNLFLPVDVVDIVR 1438

Query: 721  LPAS-ETTPDRWIWHFDKFGGYMVKSGYKLGMYQRI---EESPSDTDISSKWWKRLWSTL 780
            +P S    PDR +W++DK G + VKS Y++ +       +ES S    +   W+ +W+  
Sbjct: 1439 IPLSIRAPPDRIVWNYDKHGLFTVKSAYRVALRVTSGDEDESSSSNSDTGMLWRHIWNAT 1498

Query: 781  GYSDM-------------VRAEFI---MNIQDRWIHICN-TVSILDLERICLGSWALWND 833
              + +              +A  I   +++QD  +   + T S L +  +C  + A WN 
Sbjct: 1499 VPTKLKIFAWRVAHDILPTKANLIKKGVDMQDMCMFCGDITESALHVLAMCPFAVATWNI 1558

BLAST of Lag0018493 vs. ExPASy TrEMBL
Match: A0A803PI64 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 463.4 bits (1191), Expect = 2.1e-126
Identity = 287/860 (33.37%), Postives = 418/860 (48.60%), Query Frame = 0

Query: 62   KHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTPD 121
            + K+N I G+    G+W +EA  + +  E+YF +IF S S     FE+V+  IP KVT D
Sbjct: 612  RRKKNAIKGLMDDIGVWHSEAGMVQRLVENYFWNIFCSSSMPHDVFEEVINVIPPKVTDD 671

Query: 122  MSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRKR 181
            M+  L +D++ EE+   V+   PTKAPG DG P LFY K+W  +    +  CL +LN   
Sbjct: 672  MNEMLLEDFTAEEIVKAVKDMNPTKAPGCDGLPALFYHKFWSNLKQDVIGMCLKVLNHGA 731

Query: 182  SVKDWNHTNIVLIPKVPNPR---------------------------------------- 241
            +++  N T I LIPKV  P+                                        
Sbjct: 732  NLECLNETIIALIPKVEKPKKVEEFRPISLCNVIYKIVSKCLVARLSGVMDLVISDTQSA 791

Query: 242  ---------------------RKGK---QGYAALKLDISKTYDRVEWSFIRAVMERLGFP 301
                                 RK +    G  ALKLD++K YDRVEW F+ AVM RLGF 
Sbjct: 792  FIKDRLIHDNAIIGYESLHCMRKNRFQNGGKMALKLDMAKAYDRVEWLFLSAVMSRLGFA 851

Query: 302  CDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAK 361
              W++ I  C+++ SFS +INGE KG   P RGLRQGDPLSP+LFL C+E LS+++   +
Sbjct: 852  QCWVDKIMRCVTSTSFSFLINGETKGKLIPERGLRQGDPLSPFLFLFCAEALSSLIQQEE 911

Query: 362  RNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDKS 421
              G L GI        + HLFFADDSL+F+ A  +     + I+  Y  ASGQ +N  KS
Sbjct: 912  SAGRLRGIRFNRLGVSVSHLFFADDSLVFIDADMDSCLQFQQILTKYTAASGQIVNYHKS 971

Query: 422  QICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVWLPKGLL 481
            + CF  NV ++T+  L+ ++ ++  DN G YLGLPS   R++              K  L
Sbjct: 972  EACFGCNVSAETRFQLAGMMGVREVDNHGKYLGLPSFVGRNK--------------KEFL 1031

Query: 482  DSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLT 541
            D I            + +K+IHW KW+ LC+P+++GGL F DL +F QA+LAKQ WR + 
Sbjct: 1032 DEI-----------KNKEKKIHWCKWRYLCRPKDKGGLGFRDLGMFNQALLAKQIWRCIR 1091

Query: 542  LPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVM 601
             P+   +RVLK  YFP    LE+   +N+S+ W+  +WG  L+  G R ++GNG SVRV+
Sbjct: 1092 HPQQLCSRVLKASYFPHKGFLEAGCGANASFVWRSLVWGKKLILKGYRWRVGNGESVRVL 1151

Query: 602  VDPWIPRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILP 661
             DPW+PRP TFKV       PE                             L V D    
Sbjct: 1152 EDPWLPRPLTFKVYDQPTL-PE----------------------------NLYVTDLKRA 1211

Query: 662  SIQWDIPKLQHVLLDEDVQEIIRLPASETT-PDRWIWHFDKFGGYMVKSGYKLGMYQRIE 721
              QWD   ++ V    D + I+ +P S+    D+ + H+ K G Y VKSGY++      E
Sbjct: 1212 DGQWDESFIRSVFNTIDAELILAIPFSDCDFEDKILLHYSKNGEYTVKSGYRMASSLITE 1271

Query: 722  ESPSDTDISSKWWKRLWSTLGYSDMVRAEFIMN-IQDRWIHICNT--VSIL--------- 781
               S      +WWK+LW         +  ++++ + D +  +  T  +S+L         
Sbjct: 1272 HHQSSDHSLVQWWKKLWRLKIPPKASKGYWVVSGVYDEFKKMLGTDNLSLLMRMAAEWEK 1331

Query: 782  -DLERICLGSWALWNDRNCVFHKRPIPPVGVRCDWTLDYLSEY--QSAHRSNDRIFQTRD 837
              LE   L SW +WN RN V H    P      +W   YL E+  ++  RSN  + + R 
Sbjct: 1332 EKLEFFLLVSWNVWNVRNSVVHGSYHPKPEDMIEWCGRYLDEFRGEAGSRSNSAMVEERR 1391

BLAST of Lag0018493 vs. ExPASy TrEMBL
Match: M5W5F3 (Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa026368mg PE=4 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 4.8e-126
Identity = 249/718 (34.68%), Postives = 371/718 (51.67%), Query Frame = 0

Query: 61  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTP 120
           ++ KRN + G+  A+  W TE  +I   F  YFK +F+S        E++L  +   +T 
Sbjct: 64  SRSKRNRVCGIFDANQAWQTEEQRIGDLFCDYFKTLFSSS--GGQQMERILNEVRPVITS 123

Query: 121 DMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRK 180
            M+  L Q ++REE+E  + + +PTKAPG DG P LF+QKYW +VG +   +CL ILN +
Sbjct: 124 AMNAQLLQAFTREELEHTLFQMFPTKAPGHDGMPALFFQKYWHIVGDKVAKKCLQILNGE 183

Query: 181 RSVKDWNHTNIVLIPKVPNPR--------------------------------------- 240
            SV+++NHT I LIPKV  P                                        
Sbjct: 184 GSVREFNHTLIALIPKVKMPTIVSEFRPISLCTTVYKMIAKTIANRLKTVLSHVITETQS 243

Query: 241 -------------------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGF 300
                                    +KG+    ALKLD++K YDRVEW F+RA+M +LGF
Sbjct: 244 AFVPNRMILDNVMAAFEIMNTIKGVKKGRDVQMALKLDMAKAYDRVEWVFLRAMMLKLGF 303

Query: 301 PCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA 360
              W++ + DCIST +FS++  G   GH  P RGLRQG PLSPYLFL+C+EG S +L  A
Sbjct: 304 SATWVSKVMDCISTTTFSVLWKGTPVGHIMPQRGLRQGCPLSPYLFLICTEGFSCLLRGA 363

Query: 361 KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDK 420
           +R G L+G+ +   +P + HL FADDS++F+KA+ ++   L+ +   YE  +GQ IN  K
Sbjct: 364 ERRGDLVGVQVARGAPSVTHLLFADDSILFMKATNKDCMALETLFQTYEEVTGQQINYSK 423

Query: 421 SQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVW----- 480
           S +    N        +  +L +       +YLGLP+   + R + F+ + D++W     
Sbjct: 424 SALSLSPNATRADFDMIEGVLNVPVVRCHENYLGLPTIAGKGRKQLFQHLKDKLWKHISG 483

Query: 481 -------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHW 540
                                          +PKGL   ++ + ARFWW  +  K+ IHW
Sbjct: 484 WKEKLLSRAGKEILIKAVLQAIPTYSMSCFRIPKGLCKELNGIMARFWWAKAKDKRGIHW 543

Query: 541 KKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLES 600
            KW+ LCK +  GGL F DLE F QA+LAKQ WR+L  PES VAR+ + RY PS   LE+
Sbjct: 544 VKWELLCKSKFAGGLGFRDLEAFNQALLAKQCWRILRTPESLVARIFRARYHPSVPFLEA 603

Query: 601 EVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL 660
           EV +N S+ W+   WG +LL  G+R ++G+G S++V  D W+P P  FK++      P+L
Sbjct: 604 EVGTNPSFIWRSLQWGKELLNKGLRWRVGSGVSIQVYTDKWLPAPSCFKIMS----PPQL 663

Query: 661 TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIR 674
            +  +                         V D    S QW++P L+ +  D++V  I++
Sbjct: 664 PLSTR-------------------------VCDLFTSSGQWNVPLLKDIFWDQEVDAILQ 723

BLAST of Lag0018493 vs. ExPASy TrEMBL
Match: A0A251NPF0 (Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRUPE_6G123900 PE=4 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 4.8e-126
Identity = 249/718 (34.68%), Postives = 371/718 (51.67%), Query Frame = 0

Query: 61  NKHKRNTISGVEGADGLWLTEADQIHKAFESYFKDIFNSQSPSATDFEKVLKFIPRKVTP 120
           ++ KRN + G+  A+  W TE  +I   F  YFK +F+S        E++L  +   +T 
Sbjct: 27  SRSKRNRVCGIFDANQAWQTEEQRIGDLFCDYFKTLFSSS--GGQQMERILNEVRPVITS 86

Query: 121 DMSHTLTQDYSREEVEGVVRKFYPTKAPGPDGFPTLFYQKYWDMVGPQTVDECLAILNRK 180
            M+  L Q ++REE+E  + + +PTKAPG DG P LF+QKYW +VG +   +CL ILN +
Sbjct: 87  AMNAQLLQAFTREELEHTLFQMFPTKAPGHDGMPALFFQKYWHIVGDKVAKKCLQILNGE 146

Query: 181 RSVKDWNHTNIVLIPKVPNPR--------------------------------------- 240
            SV+++NHT I LIPKV  P                                        
Sbjct: 147 GSVREFNHTLIALIPKVKMPTIVSEFRPISLCTTVYKMIAKTIANRLKTVLSHVITETQS 206

Query: 241 -------------------------RKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGF 300
                                    +KG+    ALKLD++K YDRVEW F+RA+M +LGF
Sbjct: 207 AFVPNRMILDNVMAAFEIMNTIKGVKKGRDVQMALKLDMAKAYDRVEWVFLRAMMLKLGF 266

Query: 301 PCDWINLINDCISTASFSIIINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTA 360
              W++ + DCIST +FS++  G   GH  P RGLRQG PLSPYLFL+C+EG S +L  A
Sbjct: 267 SATWVSKVMDCISTTTFSVLWKGTPVGHIMPQRGLRQGCPLSPYLFLICTEGFSCLLRGA 326

Query: 361 KRNG-LMGIAMTPSSPKIFHLFFADDSLIFLKASTEEFGHLKIIMADYECASGQSINVDK 420
           +R G L+G+ +   +P + HL FADDS++F+KA+ ++   L+ +   YE  +GQ IN  K
Sbjct: 327 ERRGDLVGVQVARGAPSVTHLLFADDSILFMKATNKDCMALETLFQTYEEVTGQQINYSK 386

Query: 421 SQICFFRNVPSDTQSYLSSILQMKSADNLGSYLGLPSSFHRSRSKDFKGILDRVW----- 480
           S +    N        +  +L +       +YLGLP+   + R + F+ + D++W     
Sbjct: 387 SALSLSPNATRADFDMIEGVLNVPVVRCHENYLGLPTIAGKGRKQLFQHLKDKLWKHISG 446

Query: 481 -------------------------------LPKGLLDSISSLCARFWWGSSDTKKRIHW 540
                                          +PKGL   ++ + ARFWW  +  K+ IHW
Sbjct: 447 WKEKLLSRAGKEILIKAVLQAIPTYSMSCFRIPKGLCKELNGIMARFWWAKAKDKRGIHW 506

Query: 541 KKWKELCKPEEQGGLNFWDLEIFIQAMLAKQAWRVLTLPESTVARVLKGRYFPSSDVLES 600
            KW+ LCK +  GGL F DLE F QA+LAKQ WR+L  PES VAR+ + RY PS   LE+
Sbjct: 507 VKWELLCKSKFAGGLGFRDLEAFNQALLAKQCWRILRTPESLVARIFRARYHPSVPFLEA 566

Query: 601 EVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRVMVDPWIPRPYTFKVLGYKIFDPEL 660
           EV +N S+ W+   WG +LL  G+R ++G+G S++V  D W+P P  FK++      P+L
Sbjct: 567 EVGTNPSFIWRSLQWGKELLNKGLRWRVGSGVSIQVYTDKWLPAPSCFKIMS----PPQL 626

Query: 661 TVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWDIPKLQHVLLDEDVQEIIR 674
            +  +                         V D    S QW++P L+ +  D++V  I++
Sbjct: 627 PLSTR-------------------------VCDLFTSSGQWNVPLLKDIFWDQEVDAILQ 686

BLAST of Lag0018493 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 124.8 bits (312), Expect = 3.5e-28
Identity = 84/276 (30.43%), Postives = 119/276 (43.12%), Query Frame = 0

Query: 411 LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCKPEEQGGLNFWDLEIFIQAMLAKQ 470
           LPK +   I S+ A FWW +    K +HWK W  L   + +GG+ F D+E F  A+L KQ
Sbjct: 13  LPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDIEAFNLALLGKQ 72

Query: 471 AWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNG 530
            WR+L+ PES +A+V K RYF  SD L + + S  S+ WK      ++L+ G R  +GNG
Sbjct: 73  MWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEILRQGARAVVGNG 132

Query: 531 NSVRVMVDPWI-PRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELT 590
             + +    W+  +P +                      + LR  R+    Y      L 
Sbjct: 133 EDIIIWRHKWLDSKPAS----------------------AALRMQRVPPQEYASVSSILK 192

Query: 591 VVDCILPS--------IQWDIPKLQHVLLDEDVQEIIRLPASETTPDRWIWHFDKFGGYM 650
           V D I  S        I+   P+++  L+ E        P      D + W +   G Y 
Sbjct: 193 VSDLIDESGREWRKDVIEMLFPEVERKLIGE------LRPGGRRILDSYTWDYTSSGDYT 252

Query: 651 VKSGY--------KLGMYQRIEESPSDTDISSKWWK 670
           VKSGY        K    Q + E PS   I  K WK
Sbjct: 253 VKSGYWVLTQIINKRSSPQEVSE-PSLNPIYQKIWK 259

BLAST of Lag0018493 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 121.3 bits (303), Expect = 3.8e-27
Identity = 55/132 (41.67%), Postives = 83/132 (62.88%), Query Frame = 0

Query: 411 LPKGLLDSISSLCARFWWGSSDTKKRIHWKKWKELCK-PEEQGGLNFWDLEIFIQAMLAK 470
           L K L   ++S    FWW S + K++I W  W++LCK  E+ GGL F DL  F QA+LAK
Sbjct: 13  LSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDLGWFNQALLAK 72

Query: 471 QAWRVLTLPESTVARVLKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGN 530
           Q++R++  P + ++R+L+ RYFP S ++E  V +  SY W+  I G +LL  G+ + IG+
Sbjct: 73  QSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGRELLSRGLLRTIGD 132

Query: 531 GNSVRVMVDPWI 542
           G   +V +D WI
Sbjct: 133 GIHTKVWLDRWI 144

BLAST of Lag0018493 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 80.1 bits (196), Expect = 9.8e-15
Identity = 40/68 (58.82%), Postives = 50/68 (73.53%), Query Frame = 0

Query: 256 IINGEAKGHFYPSRGLRQGDPLSPYLFLLCSEGLSAILGTAKRNG-LMGIAMTPSSPKIF 315
           IING  +G   PSRGLRQGDPLSPYLF+LC+E LS +   A+  G L GI ++ +SP+I 
Sbjct: 13  IINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRIN 72

Query: 316 HLFFADDS 323
           HL FADD+
Sbjct: 73  HLLFADDT 80

BLAST of Lag0018493 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 63.5 bits (153), Expect = 9.5e-10
Identity = 48/168 (28.57%), Postives = 75/168 (44.64%), Query Frame = 0

Query: 486 LKGRYFPSSDVLESEVRSNSSYFWKGFIWGLDLLKSGIRKKIGNGNSVRV----MVDPWI 545
           +K RYF    +L+++VR   SY W   + G+ LLK G R  IG+G ++R+    +VD   
Sbjct: 1   MKARYFKDVSILDAKVRKQQSYGWASLLDGIALLKKGTRHLIGDGQNIRIGLDNIVDSHP 60

Query: 546 PRPYTFKVLGYKIFDPELTVVLKGIGFSILRRFRIRVLGYKIFDPELTVVDCILPSIQWD 605
           PRP   +   YK    E+T+                     +F+ + +          WD
Sbjct: 61  PRPLNTEET-YK----EMTI-------------------NNLFERKGSY-------YFWD 120

Query: 606 IPKLQHVLLDEDVQEIIRL-PASETTPDRWIWHFDKFGGYMVKSGYKL 649
             K+   +   D   I R+  A    PD+ IW+++  G Y V+SGY L
Sbjct: 121 DSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWL 137

BLAST of Lag0018493 vs. TAIR 10
Match: AT4G20520.1 (RNA binding;RNA-directed DNA polymerases )

HSP 1 Score: 48.1 bits (113), Expect = 4.1e-05
Identity = 20/54 (37.04%), Postives = 32/54 (59.26%), Query Frame = 0

Query: 190 NIVLIPKVPNP--RRKGKQGYAALKLDISKTYDRVEWSFIRAVMERLGFPCDWI 242
           NIV + +  +   R+KG +G+  LKLD+ K YDR+ W ++   +   GFP  W+
Sbjct: 29  NIVFVQEAVHSMRRKKGVKGWMLLKLDLEKAYDRIRWDYLEDTLISAGFPEVWL 82

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022158377.13.7e-14132.78uncharacterized protein LOC111024874 [Momordica charantia][more]
XP_023878301.19.5e-12931.71uncharacterized protein LOC111990748 [Quercus suber][more]
ONI01138.19.9e-12634.68hypothetical protein PRUPE_6G123900 [Prunus persica][more]
XP_024156142.14.1e-12433.13uncharacterized protein LOC112164137 [Rosa chinensis][more]
XP_024172304.29.2e-12433.01uncharacterized protein LOC112178381 [Rosa chinensis][more]
Match NameE-valueIdentityDescription
P932955.4e-2641.67Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 ... [more]
O003707.6e-2020.52LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1[more]
P085481.4e-1822.53LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1[more]
P0C2F64.8e-1424.40Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... [more]
P925551.4e-1358.82Uncharacterized mitochondrial protein AtMg01250 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A6J1DX301.8e-14132.78uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
M5VU981.2e-13232.67Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRU... [more]
A0A803PI642.1e-12633.37Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
M5W5F34.8e-12634.68Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=... [more]
A0A251NPF04.8e-12634.68Reverse transcriptase domain-containing protein OS=Prunus persica OX=3760 GN=PRU... [more]
Match NameE-valueIdentityDescription
AT4G29090.13.5e-2830.43Ribonuclease H-like superfamily protein [more]
ATMG00310.13.8e-2741.67RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
ATMG01250.19.8e-1558.82RNA-directed DNA polymerase (reverse transcriptase) [more]
AT3G09510.19.5e-1028.57Ribonuclease H-like superfamily protein [more]
AT4G20520.14.1e-0537.04RNA binding;RNA-directed DNA polymerases [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 775..838
e-value: 2.4E-5
score: 24.1
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 205..373
e-value: 1.0E-27
score: 97.1
NoneNo IPR availablePANTHERPTHR19446:SF440SUBFAMILY NOT NAMEDcoord: 206..548
NoneNo IPR availablePANTHERPTHR19446:SF440SUBFAMILY NOT NAMEDcoord: 61..201
NoneNo IPR availablePANTHERPTHR19446REVERSE TRANSCRIPTASEScoord: 61..201
NoneNo IPR availablePANTHERPTHR19446REVERSE TRANSCRIPTASEScoord: 206..548
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 84..363

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0018493.1Lag0018493.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity