Lag0032313 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0032313
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionReverse transcriptase domain-containing protein
Locationchr11: 30308480 .. 30311705 (-)
RNA-Seq ExpressionLag0032313
SyntenyLag0032313
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATTCAACACTCAAATTCAAAAGACAATCATCCAAATTCCATACTTTTAGATGCTCTCTTTTGATGGTGTATGAAGACTTGCTCGAGAACTGGGAAAGATTCAGCCTATCAGCTGTAGAGGAGGCCACTGAAGTTGATGTCGATCGCCAAGCTGCAGTAGTCACGAGACAATCACTGGGTTTCAGTTTGACCGGGAAGCTTCTAGCTCCCTGTATCATTTATGGGGACGTTATGCATCGAACGTTTAAACCTGCTTGGAATATACCCAATGGTCTAATTATGGAAAAACTTGAAGCAAATCTATTCCTTTTCTCGCTGAGGTTAGAAGTCGACCAAATGAGGGTGCTGAGGCAAGAACCGTGTCTGTTCGACAAATTCCTTCTTTTCCTTTCGAAGCCAATCCCTATGGTAAAACCTACGGCCATGGAATTTAAGTTCGCAGCTTTCTGGGTCCACTTCTGCGAGCTCCCAATGGATCTCTACAACCGGTCAATGGCGGAACGACTTGGTAATGCCATAGGACAATTTTAGAGCTATCACAATGGCGGGTGAGGATACGGGTGGAAGGAGAGCCTCTGAGTTCGCGTTAATTTGGATATCACGCAACCCCTACGACGAGGTATCAAAGTCCATCTTGATGAACCGTTAGGCAGCGTATGGACACCGATCAAGTATGAAAAGCTTCCGGACATTTGTGCATTTTGCGGCCGCATCGACCATGGAATGAGAGATTGTACTTTTAATTATCTTGAGTCTGGTTCATCTTCACGCCGACAAGAGTACGGTATGTGGATGGCCTTCAACGACAGGACTTCCAGCGTGTTTCGTTCGCCTAGCAACAGTCCAATCGGTAATCAACAGTTGATGGTAGCTTCTCCAAATCGAAATCCTTCACTTCAGCCATCTTTGACGGTCGAAACTGGTACTCGATTGGATAATTCAAGGAAATCCTCTCCGATGACTGGATCTGGTAGCCGACCAATGGATATCTCGCCGGTGATGGAGGAAGACGGTCCAGTTTCGGCGATCGGTAACATACCGTCAATTAATGCAGGTATTTATGGCATTAATGATTCTGATTTCACCAAAGCTAAAAAGATTTTTTTTTTTTGGAGTGGAGTCGGAAACGACTGAGGCAAAACGGAAAGGGAAATAAGTCATAACGGAAGAAATTAAGGGATTCAAATTTAAAGCTCAAGTTGCAGATCACAAGCAGCCAGATTTCTCGACAAATTCTTTCTCGATGGTTGACTCGATCTCGGGCCAAAAGAACGTTGGAGTTACAACTGCAATTTACTCCCTTTTAGGTGGGCTTAATGGGCAGCAACAAATGACTCAAGCCCAATTCTAATCGGTGGATCCAGTTGGTGGGTTGAATTATCAATTATCTCTTAATGCCCAATCCAATCCATGGGCCAATTTTAACAATAAGCCCACGTTTGAGATAGGTGAAGCCACAGTTACAGCAATGTACTTATCTAAAATCGGGCCAAAATTGAAGCATTGGAAACGCAAAGCTCGTAAGAATGTTGGAGGCTCTATTTCAAGTGAAGCATCTGAGAAAAAATGTATTGGTGAAGTGCTGGCAGATGGTCCTATGAAACGCGCTAAAGAGGATGGTGATGCTACTACCAATGTGATGGAACCAACGGCGGAGGCTGACAATCAACCCCGTCGAGAGCCATGAAAATCATGTGTTGGAATGTCTGTGGTCTGGGGAATCCATGGACATTTCGAGCTGTTAGTGACAGTATACGCCACTATAATCCTCATTTAATTTTTTTGCAAGAGACAAAATGTAGTTCCAATGCTCGTAAATTAATGAGAGATTGGATGGTTTCTAGCAAATGAAGAATTCCTTCGCCTATTCCCTCAGGGATCAGTGCAACATCTGAATTGGGCTCAATCGAATCACCGCCCAATTCTGTTCAATACATGTCAGGTGCATGGAAATGGTTATCAGAACAAAAGACCTCGTCTGTTTCGTTTCGAAGAAGTATGGACTCAACATCCAAAATGCAAAGAAATCATTACTCATCAGGGTTGTTGGACAGGACAAGGCAATAGTAGCAGCCGATTTGAGAGTTGTCTCCAAAGTTGCCGATCACGTTTAAAGCATTGGGGTAGAGGCACATTCTCATCCATTTGGAGACAGATTGAAACTAACCAGAGGATTCTTCAAGACCTCTATAGTAAGCCTCCACCTTGGGATTTTCATGAGATAAAACGTGTAGAAGACCAATTAGACCAGGCTCTAGAAGAGGATGAAATATACTGGAAACAACGGTCACGTGAGAACTGGCTTCAATGGGGGGATAAGAACACACGTTGGTTCCATAACCAAGCCACTATAAGAAGGAAGAGAAATGAAATTCGTGAAGTGCAGGATATAAACGGTAACCTGATTGTTGACCAAAGACAAATGGAGGAGGCTTTTGAATTGTATTTTTCAAATATGTTTTCATTGTCTAATCCAAATTCGGAAGACATTGACATTGCATTGCAGGATATTCCAGTTAGGGTAACGCAAAGCATGAATGAAAGACTGTTAGCCCCATTCACTAGATGCGAGATCGAGTGAGCCATTAAGCAAATGCATCCTTCCAAGGTGTCTGGACCTGATGGCTTCTCTTCATGTTTTTACCAGAAATTCTAGAACGAGGTTGGTTACATTACTGTTCTTAATTGTCTGGATATGCTTAATATGGTTAGATCGATTAGACCATGGAATGATACATTCATTGCCTTAATTTCTAAAGTTAAGCATCCAAAGCTTATCTCTGATTTCAGACCTATAAGCCTCTGTAATGTCTCATACAAAATAATAGCTAAAGTCCTGGTAAATCGTATGAAATGGGCTCTTCAGGAGATAATATCTGAAAACCAATCTGCTTTTGTACCTAGTAGATCAATTCATGGTAATGTAATTATAGGATATGAATGCCTGCATACGATCAAGAGTAAAAGAACAAGTCATGGGGGATGGATAGCCTTGAAATTAGACATGAGTAAGGCATATGACAGAGTGGAATGGTGCTTTTTAGAAAGGCTTTTGTTAAAAATTGGGTTCCACTCCCAATGGGTTAAAGTTGATTATGGAATGTGTTCGGACTCCATCCTTTTCAATATAACTGAATGGGGTCCCTTCGAGGCAAATCATTCCTCAAAGAGGACTTCGTCAGGGGGATCCCTTATCTCCCTATTTGCTTTTGCTTGTATCTGA

mRNA sequence

ATGGATTCAACACTCAAATTCAAAAGACAATCATCCAAATTCCATACTTTTAGATGCTCTCTTTTGATGGTGTATGAAGACTTGCTCGAGAACTGGGAAAGATTCAGCCTATCAGCTGTAGAGGAGGCCACTGAAGTTGATGTCGATCGCCAAGCTGCAGTAGTCACGAGACAATCACTGGGTTTCAGTTTGACCGGGAAGCTTCTAGCTCCCTGTATCATTTATGGGGACGTTATGCATCGAACGTTTAAACCTGCTTGGAATATACCCAATGGTCTAATTATGGAAAAACTTGAAGCAAATCTATTCCTTTTCTCGCTGAGGTTAGAAGTCGACCAAATGAGGGTGCTGAGGCAAGAACCGTGTCTGTTCGACAAATTCCTTCTTTTCCTTTCGAAGCCAATCCCTATGGTAAAACCTACGGCCATGGAATTTAAGTTCGCAGCTTTCTGGGTCCACTTCTGCGAGCTCCCAATGGATCTCTACAACCGGTCAATGGCGGAACGACTTGGCAGCGTATGGACACCGATCAAGTATGAAAAGCTTCCGGACATTTGTGCATTTTGCGGCCGCATCGACCATGGAATGAGAGATTGTACTTTTAATTATCTTGAGTCTGGTTCATCTTCACGCCGACAAGAGTACGGTATGTGGATGGCCTTCAACGACAGGACTTCCAGCGTGTTTCGTTCGCCTAGCAACAGTCCAATCGGTAATCAACAGTTGATGGTAGCTTCTCCAAATCGAAATCCTTCACTTCAGCCATCTTTGACGGTCGAAACTGGTACTCGATTGGATAATTCAAGGAAATCCTCTCCGATGACTGGATCTGGTAGCCGACCAATGGATATCTCGCCGGTGATGGAGGAAGACGGTCCAGTTTCGGCGATCGGTAACATACCGTCAATTAATGCAGGTGAAGCCACAGTTACAGCAATGTACTTATCTAAAATCGGGCCAAAATTGAAGCATTGGAAACGCAAAGCTCGTAAGAATGTTGGAGGCTCTATTTCAAGTGAAGCATCTGAGAAAAAATGTATTGGTGAAGTGCTGGCAGATGGTCCTATGAAACGCGCTAAAGAGGATGGTGATGCTACTACCAATGGATCAGTGCAACATCTGAATTGGGCTCAATCGAATCACCGCCCAATTCTGTTCAATACATGTCAGGTGCATGGAAATGGTTATCAGAACAAAAGACCTCGTCTGTTTCGTTTCGAAGAAGTATGGACTCAACATCCAAAATGCAAAGAAATCATTACTCATCAGGGTTGTTGGACAGGACAAGGCAATAGTAGCAGCCGATTTGAGAGTTGTCTCCAAAGTTGCCGATCACGTTTAAAGCATTGGGGTAGAGGCACATTCTCATCCATTTGGAGACAGATTGAAACTAACCAGAGGATTCTTCAAGACCTCTATAGTAAGCCTCCACCTTGGGATTTTCATGAGATAAAACGTGTAGAAGACCAATTAGACCAGGCTCTAGAAGAGGATGAAATATACTGGAAACAACGGTCACGTGAGAACTGGCTTCAATGGGGGGATAAGAACACACGTTGGTTCCATAACCAAGCCACTATAAGAAGGAAGAGAAATGAAATTCGTGAAGTGCAGGATATAAACGGTAACCTGATTGTTGACCAAAGACAAATGGAGGAGGCTTTTGAATTGTATTTTTCAAATATGTTTTCATTGTCTAATCCAAATTCGGAAGACATTGACATTGCATTGCAGGATATTCCAGTTAGGAACGAGGTTGGTTACATTACTGTTCTTAATTGTCTGGATATGCTTAATATGGTTAGATCGATTAGACCATGGAATGATACATTCATTGCCTTAATTTCTAAAGTTAAGCATCCAAAGCTTATCTCTGATTTCAGACCTATAAGCCTCTGTAATGTCTCATACAAAATAATAGCTAAAGTCCTGGTAAATCGTATGAAATGGGCTCTTCAGGAGATAATATCTGAAAACCAATCTGCTTTTGTACCTAGTAGATCAATTCATGGTAATGTAATTATAGGATATGAATGCCTGCATACGATCAAGAGTAAAAGAACAAGTCATGGGGGATGGATAGCCTTGAAATTAGACATGAGTAAGGCATATGACAGAGTGGAATGGTGCTTTTTAGAAAGGCTTTTGTTAAAAATTGGGTTCCACTCCCAATGGGTTAAAGTTGATTATGGAATGTGTTCGGACTCCATCCTTTTCAATATAACTGAATGGGGTCCCTTCGAGGCAAATCATTCCTCAAAGAGGACTTCGTCAGGGGGATCCCTTATCTCCCTATTTGCTTTTGCTTGTATCTGA

Coding sequence (CDS)

ATGGATTCAACACTCAAATTCAAAAGACAATCATCCAAATTCCATACTTTTAGATGCTCTCTTTTGATGGTGTATGAAGACTTGCTCGAGAACTGGGAAAGATTCAGCCTATCAGCTGTAGAGGAGGCCACTGAAGTTGATGTCGATCGCCAAGCTGCAGTAGTCACGAGACAATCACTGGGTTTCAGTTTGACCGGGAAGCTTCTAGCTCCCTGTATCATTTATGGGGACGTTATGCATCGAACGTTTAAACCTGCTTGGAATATACCCAATGGTCTAATTATGGAAAAACTTGAAGCAAATCTATTCCTTTTCTCGCTGAGGTTAGAAGTCGACCAAATGAGGGTGCTGAGGCAAGAACCGTGTCTGTTCGACAAATTCCTTCTTTTCCTTTCGAAGCCAATCCCTATGGTAAAACCTACGGCCATGGAATTTAAGTTCGCAGCTTTCTGGGTCCACTTCTGCGAGCTCCCAATGGATCTCTACAACCGGTCAATGGCGGAACGACTTGGCAGCGTATGGACACCGATCAAGTATGAAAAGCTTCCGGACATTTGTGCATTTTGCGGCCGCATCGACCATGGAATGAGAGATTGTACTTTTAATTATCTTGAGTCTGGTTCATCTTCACGCCGACAAGAGTACGGTATGTGGATGGCCTTCAACGACAGGACTTCCAGCGTGTTTCGTTCGCCTAGCAACAGTCCAATCGGTAATCAACAGTTGATGGTAGCTTCTCCAAATCGAAATCCTTCACTTCAGCCATCTTTGACGGTCGAAACTGGTACTCGATTGGATAATTCAAGGAAATCCTCTCCGATGACTGGATCTGGTAGCCGACCAATGGATATCTCGCCGGTGATGGAGGAAGACGGTCCAGTTTCGGCGATCGGTAACATACCGTCAATTAATGCAGGTGAAGCCACAGTTACAGCAATGTACTTATCTAAAATCGGGCCAAAATTGAAGCATTGGAAACGCAAAGCTCGTAAGAATGTTGGAGGCTCTATTTCAAGTGAAGCATCTGAGAAAAAATGTATTGGTGAAGTGCTGGCAGATGGTCCTATGAAACGCGCTAAAGAGGATGGTGATGCTACTACCAATGGATCAGTGCAACATCTGAATTGGGCTCAATCGAATCACCGCCCAATTCTGTTCAATACATGTCAGGTGCATGGAAATGGTTATCAGAACAAAAGACCTCGTCTGTTTCGTTTCGAAGAAGTATGGACTCAACATCCAAAATGCAAAGAAATCATTACTCATCAGGGTTGTTGGACAGGACAAGGCAATAGTAGCAGCCGATTTGAGAGTTGTCTCCAAAGTTGCCGATCACGTTTAAAGCATTGGGGTAGAGGCACATTCTCATCCATTTGGAGACAGATTGAAACTAACCAGAGGATTCTTCAAGACCTCTATAGTAAGCCTCCACCTTGGGATTTTCATGAGATAAAACGTGTAGAAGACCAATTAGACCAGGCTCTAGAAGAGGATGAAATATACTGGAAACAACGGTCACGTGAGAACTGGCTTCAATGGGGGGATAAGAACACACGTTGGTTCCATAACCAAGCCACTATAAGAAGGAAGAGAAATGAAATTCGTGAAGTGCAGGATATAAACGGTAACCTGATTGTTGACCAAAGACAAATGGAGGAGGCTTTTGAATTGTATTTTTCAAATATGTTTTCATTGTCTAATCCAAATTCGGAAGACATTGACATTGCATTGCAGGATATTCCAGTTAGGAACGAGGTTGGTTACATTACTGTTCTTAATTGTCTGGATATGCTTAATATGGTTAGATCGATTAGACCATGGAATGATACATTCATTGCCTTAATTTCTAAAGTTAAGCATCCAAAGCTTATCTCTGATTTCAGACCTATAAGCCTCTGTAATGTCTCATACAAAATAATAGCTAAAGTCCTGGTAAATCGTATGAAATGGGCTCTTCAGGAGATAATATCTGAAAACCAATCTGCTTTTGTACCTAGTAGATCAATTCATGGTAATGTAATTATAGGATATGAATGCCTGCATACGATCAAGAGTAAAAGAACAAGTCATGGGGGATGGATAGCCTTGAAATTAGACATGAGTAAGGCATATGACAGAGTGGAATGGTGCTTTTTAGAAAGGCTTTTGTTAAAAATTGGGTTCCACTCCCAATGGGTTAAAGTTGATTATGGAATGTGTTCGGACTCCATCCTTTTCAATATAACTGAATGGGGTCCCTTCGAGGCAAATCATTCCTCAAAGAGGACTTCGTCAGGGGGATCCCTTATCTCCCTATTTGCTTTTGCTTGTATCTGA

Protein sequence

MDSTLKFKRQSSKFHTFRCSLLMVYEDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGSVWTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSISSEASEKKCIGEVLADGPMKRAKEDGDATTNGSVQHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAFACI
Homology
BLAST of Lag0032313 vs. NCBI nr
Match: XP_022158377.1 (uncharacterized protein LOC111024874 [Momordica charantia])

HSP 1 Score: 320.9 bits (821), Expect = 3.1e-83
Identity = 244/806 (30.27%), Postives = 367/806 (45.53%), Query Frame = 0

Query: 27  DLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPA 86
           DLLE W+ F L++ EE T +DVD  A   T   L   L GKL     I   VM  T + A
Sbjct: 5   DLLEEWKNFKLTSEEEETAIDVDASAPATTGSRLEQILVGKLFIKRPITCPVMKNTMRTA 64

Query: 87  WNIPNGLI-MEKLEANLFLFSLRLEVDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTAMEF 146
           W + N    ++ L  NLFLFS    +D+ ++ +  P  FD+ L+ ++KP+ ++ P+ ++F
Sbjct: 65  WKLENNAFEVQSLGYNLFLFSFARALDRNKIYKSGPWTFDRTLVLINKPVALIPPSELDF 124

Query: 147 KFAAFWVHFCELPMDLYNRSMAERLGS--------------------------------- 206
                WV F +LP+    R MA RLG+                                 
Sbjct: 125 TKLPIWVRFFDLPLGCITRDMAIRLGNALGGFEEADCDDLNPDWGSNLRVRVMLDISKPL 184

Query: 207 --------------VWTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMW 266
                          W PI+YE+LPD C  CG                 SS ++ +YG W
Sbjct: 185 RRGIKLNLDGPIGGAWIPIQYERLPDFCYHCG---------------LSSSRKKHQYGSW 244

Query: 267 MAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLT-VETGTRLDNSRKSSPMTGS 326
           + +          P+   +   Q  +   + N S   S + V  G++     +S+P TG 
Sbjct: 245 LRYQGTV-----KPTMPQMKQPQEDLLDKSGNNSFSSSTSPVGAGSQ---GVQSAPATGP 304

Query: 327 GSRPMDISPVME---EDGPVSAIGNIP-SINAGEATVTAMYLSKIGPKLKHWKRKARKNV 386
            + PM+ SPV E   +    S  G  P  I+ GE  +    +S + P LK      + + 
Sbjct: 305 IAIPME-SPVTETPKKGAEPSQQGKSPVLIDEGEQRINVKEISNLNPPLKSGAPSMQPSY 364

Query: 387 GGSISSEASEKKCIGEVLA------------DGPMKRAKEDGDAT---TNGSVQHLNWAQ 446
             S++         G                   + R   + DA+     G +  + W  
Sbjct: 365 SDSLTRMDLSPGFSGRFTGFYGHPAAHKRHLTWELLRRISNLDASPWLIGGDMNAILWNY 424

Query: 447 SNHRPILFNTCQVHG----------------NGYQNKRPRLFRFEEVWTQHPK--CKEII 506
                  ++T Q+                   G        F  +++W +  +  C +  
Sbjct: 425 EASYTSSYDTSQIEAFRNIMDACSLTDMGFKGGIFTWCNNRFAGDQLWKRLDRFLCNDTF 484

Query: 507 TH-----QGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSK 566
            +      G W+   ++ S F   +Q+  S L+HWGR     +++QI+  +  + D Y++
Sbjct: 485 NYVFPDASGHWSNATHNYS-FSDSIQASSSALRHWGRSNVWDLFKQIKAQKAAIIDAYNQ 544

Query: 567 PPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIR 626
           P P DF  I  +E+ L   LE +EI+WKQRSRE+WL+WG         +A I      I 
Sbjct: 545 PLPLDFTIIHALENDLAGLLELEEIFWKQRSREDWLKWGIAILNALDIEAIINLIPTRI- 604

Query: 627 EVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCL 686
              ++N  L+      +E  EL    MF       +    AL      + VG  T+  CL
Sbjct: 605 -TSEVNEQLLAP--YTKEEIELAIRQMFPTKALGPDGFP-ALFYQTYWHVVGPKTLEACL 664

Query: 687 DMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEI 742
           + LN    I+ WN T+IALI K+K P+ ISDFRPISLCNVSYKII+K + NR+K  +  +
Sbjct: 665 NALNNGDDIKKWNSTYIALIPKIKQPRSISDFRPISLCNVSYKIISKSITNRLKNVIGLV 724

BLAST of Lag0032313 vs. NCBI nr
Match: XP_024038343.1 (uncharacterized protein LOC112097373 [Citrus clementina])

HSP 1 Score: 256.5 bits (654), Expect = 7.2e-64
Identity = 149/417 (35.73%), Postives = 226/417 (54.20%), Query Frame = 0

Query: 364 DATTNGSVQHLNWAQSNHRPILFNTCQVHGNG--YQNKRPRLFRFEEVWTQHPKCKEII- 423
           D  ++G+  +++   S+H P++    QV G+G  +  +R  L  +E++W+ +  CKEII 
Sbjct: 161 DVFSDGAATNIDSWTSDHCPVVMEV-QVRGSGMNFNQRRATLIHYEDMWSPYDTCKEIIE 220

Query: 424 ---THQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDL-YSKP 483
              + QGCW    N  S F+   ++  +RL  W +  F    +++E     L+ L  S+ 
Sbjct: 221 KEWSLQGCWNAV-NPVSMFQKVSKNSMARLILWSKEEFRGRQKKLEKLMNQLRSLKLSRV 280

Query: 484 PPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIRE 543
                ++IK VE Q+   L +DEIYWKQRSR +WL+ GDKNT++FH++A+ R+K+N I  
Sbjct: 281 QYVKGNKIKEVERQIQYMLADDEIYWKQRSRADWLKGGDKNTKFFHHKASSRKKKNRIWG 340

Query: 544 VQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVR------------- 603
           +++  GN I +   +E  F  YF+N+F+ S PN + I  AL  I  R             
Sbjct: 341 IENAAGNWIENAEGVEFEFNKYFTNLFTTSKPNQDQIAAALSGISRRVSTEMNESLEMPF 400

Query: 604 --------------------------------NEVGYITVLNCLDMLNMVRSIRPWNDTF 663
                                             V    +  CL +LN    + P+N T+
Sbjct: 401 TPEEVVEALTQMCPTKAPGPDGLPAVFFQKHWQRVKQGVLSTCLHILNKQGDVAPFNHTY 460

Query: 664 IALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHG 723
           I LISK   P+ ++DFRPISLCNV Y+I+AK + NR+K  L  +IS  QSAF+P+  I  
Sbjct: 461 IVLISKKGKPRKVTDFRPISLCNVIYRIVAKAIANRLKNVLPNLISPMQSAFIPNWLITD 520

Query: 724 NVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKV 729
           N+I+GYECLH I+  +    G +ALKLD+SKAYD++EW FLE+ +  +GF   WV +
Sbjct: 521 NIIVGYECLHKIRHCKGRKNGLVALKLDVSKAYDKLEWVFLEQTMKSLGFSQNWVSL 575

BLAST of Lag0032313 vs. NCBI nr
Match: XP_042962496.1 (uncharacterized protein LOC122296768 [Carya illinoinensis])

HSP 1 Score: 249.2 bits (635), Expect = 1.1e-61
Identity = 215/840 (25.60%), Postives = 347/840 (41.31%), Query Frame = 0

Query: 23  MVYEDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRT 82
           M  EDL++ WER  L+  EE+    V+ + +    +     + G+ +    +  +    T
Sbjct: 1   MEAEDLVKRWERLQLT-TEESIPFHVNPEGSKKEDRMSEHCIVGRAMVERPVNSEAFRTT 60

Query: 83  FKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTA 142
               W +   +   ++    F+   +   D+ +VL   P  FD+ LL L +    V    
Sbjct: 61  MSQVWRLDGWIRFIEIGDQSFIIEFQKLEDKDKVLGGRPWFFDRCLLSLQEVDDTVSINK 120

Query: 143 MEFKFAAFWVHFCELPMDLYNRSM----AERLGSV------------------------- 202
            +F++  FWV    LP+   N  +    A  +G V                         
Sbjct: 121 TQFRYEPFWVQLHNLPLATMNEEVGSQFAASIGHVIRVETESDGRGWGRCLRVRVAVDIL 180

Query: 203 ----------------WTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGM 262
                           W   KYE+L + C  CG ++H  ++C     E+  + +      
Sbjct: 181 KPLLRGKWMRFEEEEHWISFKYERLQNFCFHCGILNHKGKNCNKLRFENQDAEQAP---- 240

Query: 263 WMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGS 322
            + F  R +S            Q L +  P       P +        +         G+
Sbjct: 241 -LQFAKRKTSW-----------QLLEMLKP------PPPMAWLCAGDFNEILHQGEKQGA 300

Query: 323 GSRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSI 382
           GSRP      +EE   V  + ++  I+      T             W    R       
Sbjct: 301 GSRPY---KQIEEFRKVVEVCDLRDIHHQGHYFT-------------WSNNRR------- 360

Query: 383 SSEASEKKCIGEVLADGPMKRAKEDGDATTNGSVQHLNWAQSNHRP---ILFNTCQVHGN 442
                 K+ I   LA+      KE  +  +N S   L   QS+H P   +L N+ +V+  
Sbjct: 361 -GRHFTKERIDRALAN------KEWHELFSNASCTTLAAIQSDHSPLSILLQNSAKVY-- 420

Query: 443 GYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQG---NSSSRFESCLQSCRSRLKHWG 502
                  R FR+E  W     CK+++ +   W G       ++     L +C+  L  W 
Sbjct: 421 ---RHEARCFRYEVAWDLKEDCKKVVEYS--WKGVSLGLEGANTVRQRLSACQRNLSQWS 480

Query: 503 RGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWL 562
           +   +   + I    R ++ L          E++ ++  ++ AL  +E+ W+QR++++W+
Sbjct: 481 QTEKTMKHKDINQAVRRIRHLQEMGTGNHITEMQGLQKGVELALSTEEMKWRQRAKQHWM 540

Query: 563 QWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSE 622
           + GD+NT +FH QA+ RRK N I  ++D  G ++  Q  +  AF  YFS++F+ S+P++ 
Sbjct: 541 KVGDRNTTFFHLQASHRRKANSIVNLEDPQGRILTKQEDIGSAFTGYFSSLFTTSSPSNY 600

Query: 623 DIDIALQDIPVRNE-------------------------------------------VGY 682
           +  +   +  + NE                                           VG 
Sbjct: 601 ETCLDALETKLSNEMENWLLLPFTREEIYTSVTQMNPLGSPGPDGFPASFYQKHWGVVGE 660

Query: 683 ITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRM 742
                 L +LN   S    NDT I+LI KVK+P  I DFRPISLCNV YKII+K + NR 
Sbjct: 661 EVCSYALQVLNHGGSCTDVNDTLISLIPKVKNPIKIPDFRPISLCNVLYKIISKTIANRF 720

Query: 743 KWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVE 769
           K  L ++IS NQ+AFVP R I  NV++ YE LH++ ++     G +ALKLDMSKAYDR+E
Sbjct: 721 KTILPKLISLNQTAFVPGRLITDNVLVAYETLHSMSTRMKGKKGCLALKLDMSKAYDRIE 779

BLAST of Lag0032313 vs. NCBI nr
Match: XP_017217082.1 (PREDICTED: uncharacterized protein LOC108194640 [Daucus carota subsp. sativus])

HSP 1 Score: 248.4 bits (633), Expect = 2.0e-61
Identity = 136/397 (34.26%), Postives = 218/397 (54.91%), Query Frame = 0

Query: 372 QHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGN 431
           + + W       ++           + K+   F +E+ W +   C ++I++     G  N
Sbjct: 127 KEMTWVGKFSNGVVMERLDSEVKSNKRKKNNRFHYEDAWGEDGDCSKVISNFWENVGSSN 186

Query: 432 SSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQL 491
           S    +  L  C ++LK W       + R I+ +++ + DL     P  + EIK  E +L
Sbjct: 187 SPKELKQKLVGCGAKLKWWNENKRKELRRNIDVSKKKISDLSLANTPSLWREIKEEEKRL 246

Query: 492 DQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQM 551
           +  LE++E+YW+QRSR  WL+ GDKNTR+FH++A+ R+K+NEI+ ++D +G   V++ ++
Sbjct: 247 NLLLEKEEVYWRQRSRALWLKCGDKNTRFFHHKASSRKKKNEIKGLKDQDGCWQVEKSRV 306

Query: 552 EEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTF 611
                 +F             +D+   D+          +  CL +LN    +  +NDT 
Sbjct: 307 SNIIFSWF---------YQTHLDVVKDDV----------IRICLHILNNNGPVDYFNDTL 366

Query: 612 IALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHG 671
           +ALI K++ P+ + +FRPISLCNV YKI++K + NR+K +L E+ISENQSAFV  R IH 
Sbjct: 367 VALIPKIEKPERVENFRPISLCNVIYKIVSKCIANRLKKSLDEVISENQSAFVGGRIIHD 426

Query: 672 NVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYG 731
           NVIIG+E LH  K  R  +G  +ALKLDM+KAYDRVEW F+E +++K+G++  WV     
Sbjct: 427 NVIIGFEGLHCTKKDRFHNGSKVALKLDMAKAYDRVEWRFIEAVMIKLGYNKNWV-TKVM 486

Query: 732 MCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAF 769
            C  S++++    G        +R    G  +S + F
Sbjct: 487 RCVTSVVYSFLVNGEITGKVIPQRGLRQGDPLSPYLF 503

BLAST of Lag0032313 vs. NCBI nr
Match: XP_042962692.1 (uncharacterized protein LOC122296963 [Carya illinoinensis])

HSP 1 Score: 246.5 bits (628), Expect = 7.4e-61
Identity = 224/926 (24.19%), Postives = 364/926 (39.31%), Query Frame = 0

Query: 26  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKP 85
           EDL E W+   L   EE   +++D + +         SL GKL +  +I  +VM  T   
Sbjct: 2   EDLEEKWKELRLRE-EEKMVIEIDDEVSGDLLMKEQRSLLGKLCSNRLISKEVMESTLAK 61

Query: 86  AWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTAMEF 145
            W I       ++  N F        D+ +V    P LFD  L+ L +         + F
Sbjct: 62  IWRISKKAQFTEVSPNTFAIVFGNIADKQKVWSGRPWLFDNQLVVLKEFDGFTPLKQVNF 121

Query: 146 KFAAFWVHFCELPMDLYNRSMAERLGS--------------------------------- 205
              +FWV F  LP+        E++GS                                 
Sbjct: 122 TSESFWVRFHNLPLSCMTEVRGEQIGSTVGRVDVQGDGSGWGKFLRVQIHMDLNQPLARG 181

Query: 206 ---------VWTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMW----- 265
                    +W P  YEK+P IC  CG I+HG+ +C       G  +   +YG W     
Sbjct: 182 RTLMVKGNEIWIPFSYEKMPRICFSCGCIEHGLNECKEGGRIVGDEN---QYGQWLRAKQ 241

Query: 266 ---------MAFNDRTSSVFRSPSNSPI-------------------------------- 325
                    M  N+  +  ++   N P                                 
Sbjct: 242 DYRNNFFVKMQKNEERNGWWKEKENMPYAEGSGGSNKEGGSRVREESVEGIEVNEKGKER 301

Query: 326 ---GNQQ--------------------------LMVASPNRNPSLQPSLT---------- 385
              GN++                          +++    +  +++  L           
Sbjct: 302 GLKGNEKTKEGEDSGELGSSNGVEGVKERDESTMLITMHGQKENMETGLQDVAVQFETMG 361

Query: 386 VETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKI 445
            E G+ L +      + GS S  + +  V      +SA   +      +  +T  Y +  
Sbjct: 362 KEVGSELRSKEVGERVEGSRSEKLGLDIVNYSQRHISAW--VIEEGVSKWLLTCFYGAPE 421

Query: 446 GPK-------LKHWKRKARKN---VGGSISSEASEKKCIGEVLADGPMKRAKE---DGDA 505
             K       LK  K K  +    VG       +++K  G+V  DG M+  +E   +GD 
Sbjct: 422 TAKRKDTWVMLKSLKPKGGEGWLIVGDFNEILTADEKWGGKVRPDGQMELFREVMSEGDL 481

Query: 506 TTNG------------------------SVQHLNWAQ--------------SNHRPIL-- 565
              G                        +V +  W                S+H+P+L  
Sbjct: 482 HDLGWRGDKYTWSNSHADSTFTKKRLDRAVANPKWMDIYSEAWVEVLVARTSDHKPLLVH 541

Query: 566 FNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRS 625
            N           ++ R F++E  W    +C+ ++  +  W   G  +      L + R 
Sbjct: 542 LNRQDQRVVAQLRQKKRGFKYEACWALEKECEVVL--RKAWGEDGPHNQTVMGLLNNSRK 601

Query: 626 RLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQR 685
            L+ W +       ++IE   + L+ L ++    +  EI++    L   LE+++++WKQR
Sbjct: 602 ALQLWSKSKRQRSGKEIEDKTKYLKRLQAEESRGNVEEIRKTTTDLHLLLEKEDLWWKQR 661

Query: 686 SRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSL 727
           ++ NW ++GD+NT++FH  A  R+KRN I+E++D + N++   +Q+EE F  YF  +F  
Sbjct: 662 AKTNWYKYGDRNTKYFHACANQRKKRNFIKEIEDPSNNMVCGFKQVEETFRNYFGTVFQS 721

BLAST of Lag0032313 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 68.2 bits (165), Expect = 4.7e-10
Identity = 42/115 (36.52%), Postives = 63/115 (54.78%), Query Frame = 0

Query: 612 IALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHG 671
           ++L+ K    +LI ++RP+SL +  YKI+AK +  R+K  L E+I  +QS  VP R+I  
Sbjct: 510 LSLLPKKGDLRLIKNWRPVSLLSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFD 569

Query: 672 NVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWV 727
           NV +  + LH  +    S      L LD  KA+DRV+  +L   L    F  Q+V
Sbjct: 570 NVFLIRDLLHFARRTGLS---LAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFV 621

BLAST of Lag0032313 vs. ExPASy Swiss-Prot
Match: P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 2.4e-09
Identity = 68/311 (21.86%), Postives = 135/311 (43.41%), Query Frame = 0

Query: 466 QRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQA 525
           +++ ++ +S P P    EI ++  +L++   +  I    +S+  + +  +K  +   N  
Sbjct: 321 KQLEKEEHSNPKPSRRKEITKIRAELNEIENKRIIQQINKSKSWFFEKINKIDKPLANLT 380

Query: 526 TIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDID----------- 585
             +R ++ I  +++ N  +  D  ++++    Y+  ++S    N ++ID           
Sbjct: 381 RKKRVKSLISSIRNGNDEITTDPSEIQKILNEYYKKLYSHKYENLKEIDQYLEACHLPRL 440

Query: 586 ------------------IALQDIPVRNEVG--------YITVLNCL--DMLNMVRSI-- 645
                               +Q++P +   G        Y T    L   +LN+ ++I  
Sbjct: 441 SQKEVEMLNRPISSSEIASTIQNLPKKKSPGPDGFTSEFYQTFKEELVPILLNLFQNIEK 500

Query: 646 -----RPWNDTFIALISKV-KHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISE 705
                  + +  I LI K  K P    ++RPISL N+  KI+ K+L NR++  +++II  
Sbjct: 501 EGILPNTFYEANITLIPKPGKDPTRKENYRPISLMNIDAKILNKILTNRIQQHIKKIIHH 560

Query: 706 NQSAFVPSRSIHGNVIIGYECL-HTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLL 729
           +Q  F+P      N+      + H  K K   H   + L +D  KA+D ++  F+ R L 
Sbjct: 561 DQVGFIPGSQGWFNIRKSINVIQHINKLKNKDH---MILSIDAEKAFDNIQHPFMIRTLK 620

BLAST of Lag0032313 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 63.2 bits (152), Expect = 1.5e-08
Identity = 39/119 (32.77%), Postives = 65/119 (54.62%), Query Frame = 0

Query: 612 IALISK-VKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIH 671
           I LI K  K P  I +FRPISL N+  KI+ K+L NR++  ++ II  +Q  F+P     
Sbjct: 521 ITLIPKPQKDPTKIENFRPISLMNIDAKILNKILANRIQEHIKAIIHPDQVGFIPGMQGW 580

Query: 672 GNVIIGYECLHTI-KSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKV 729
            N+      +H I K K  +H   + + LD  KA+D+++  F+ ++L + G    ++ +
Sbjct: 581 FNIRKSINVIHYINKLKDKNH---MIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNM 636

BLAST of Lag0032313 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 5.8e-08
Identity = 30/104 (28.85%), Postives = 60/104 (57.69%), Query Frame = 0

Query: 626 DFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECL-HTIK 685
           +FRPISL N+  KI+ K+L NR++  ++++I  +Q  F+P      N+      + H  +
Sbjct: 529 NFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINR 588

Query: 686 SKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKV 729
           +K  +H   + + +D  KA+D+++  F+ + L K+G    ++K+
Sbjct: 589 AKDKNH---VIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKI 629

BLAST of Lag0032313 vs. ExPASy TrEMBL
Match: A0A6J1DX30 (uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024874 PE=4 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 1.5e-83
Identity = 244/806 (30.27%), Postives = 367/806 (45.53%), Query Frame = 0

Query: 27  DLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPA 86
           DLLE W+ F L++ EE T +DVD  A   T   L   L GKL     I   VM  T + A
Sbjct: 5   DLLEEWKNFKLTSEEEETAIDVDASAPATTGSRLEQILVGKLFIKRPITCPVMKNTMRTA 64

Query: 87  WNIPNGLI-MEKLEANLFLFSLRLEVDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTAMEF 146
           W + N    ++ L  NLFLFS    +D+ ++ +  P  FD+ L+ ++KP+ ++ P+ ++F
Sbjct: 65  WKLENNAFEVQSLGYNLFLFSFARALDRNKIYKSGPWTFDRTLVLINKPVALIPPSELDF 124

Query: 147 KFAAFWVHFCELPMDLYNRSMAERLGS--------------------------------- 206
                WV F +LP+    R MA RLG+                                 
Sbjct: 125 TKLPIWVRFFDLPLGCITRDMAIRLGNALGGFEEADCDDLNPDWGSNLRVRVMLDISKPL 184

Query: 207 --------------VWTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMW 266
                          W PI+YE+LPD C  CG                 SS ++ +YG W
Sbjct: 185 RRGIKLNLDGPIGGAWIPIQYERLPDFCYHCG---------------LSSSRKKHQYGSW 244

Query: 267 MAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLT-VETGTRLDNSRKSSPMTGS 326
           + +          P+   +   Q  +   + N S   S + V  G++     +S+P TG 
Sbjct: 245 LRYQGTV-----KPTMPQMKQPQEDLLDKSGNNSFSSSTSPVGAGSQ---GVQSAPATGP 304

Query: 327 GSRPMDISPVME---EDGPVSAIGNIP-SINAGEATVTAMYLSKIGPKLKHWKRKARKNV 386
            + PM+ SPV E   +    S  G  P  I+ GE  +    +S + P LK      + + 
Sbjct: 305 IAIPME-SPVTETPKKGAEPSQQGKSPVLIDEGEQRINVKEISNLNPPLKSGAPSMQPSY 364

Query: 387 GGSISSEASEKKCIGEVLA------------DGPMKRAKEDGDAT---TNGSVQHLNWAQ 446
             S++         G                   + R   + DA+     G +  + W  
Sbjct: 365 SDSLTRMDLSPGFSGRFTGFYGHPAAHKRHLTWELLRRISNLDASPWLIGGDMNAILWNY 424

Query: 447 SNHRPILFNTCQVHG----------------NGYQNKRPRLFRFEEVWTQHPK--CKEII 506
                  ++T Q+                   G        F  +++W +  +  C +  
Sbjct: 425 EASYTSSYDTSQIEAFRNIMDACSLTDMGFKGGIFTWCNNRFAGDQLWKRLDRFLCNDTF 484

Query: 507 TH-----QGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSK 566
            +      G W+   ++ S F   +Q+  S L+HWGR     +++QI+  +  + D Y++
Sbjct: 485 NYVFPDASGHWSNATHNYS-FSDSIQASSSALRHWGRSNVWDLFKQIKAQKAAIIDAYNQ 544

Query: 567 PPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIR 626
           P P DF  I  +E+ L   LE +EI+WKQRSRE+WL+WG         +A I      I 
Sbjct: 545 PLPLDFTIIHALENDLAGLLELEEIFWKQRSREDWLKWGIAILNALDIEAIINLIPTRI- 604

Query: 627 EVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCL 686
              ++N  L+      +E  EL    MF       +    AL      + VG  T+  CL
Sbjct: 605 -TSEVNEQLLAP--YTKEEIELAIRQMFPTKALGPDGFP-ALFYQTYWHVVGPKTLEACL 664

Query: 687 DMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEI 742
           + LN    I+ WN T+IALI K+K P+ ISDFRPISLCNVSYKII+K + NR+K  +  +
Sbjct: 665 NALNNGDDIKKWNSTYIALIPKIKQPRSISDFRPISLCNVSYKIISKSITNRLKNVIGLV 724

BLAST of Lag0032313 vs. ExPASy TrEMBL
Match: A0A2N9INH4 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS54999 PE=4 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 3.4e-67
Identity = 223/811 (27.50%), Postives = 356/811 (43.90%), Query Frame = 0

Query: 26  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKP 85
           E L + W+RFSLS  ++  +VD+    A  T+QS    L  K L   ++  D + RTFKP
Sbjct: 2   EPLEDMWKRFSLSN-KKGFDVDL----AHTTQQSENI-LVAKCLTLPVLNIDSVARTFKP 61

Query: 86  AWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTAMEF 145
            W      +++ L  N         +D  RVL  EP  FDKFL+   +    +    + F
Sbjct: 62  LWKTKKSFLVQDLGKNRTALVFEDALDLKRVLANEPWSFDKFLVVFERLGEDINVDDLLF 121

Query: 146 KFAAFWVHFCELPMDLYNRSMAERLGSV-------------------------------- 205
               FWV    LP+       A  +G                                  
Sbjct: 122 SHVTFWVQIHNLPVRRMTEESAAVIGKTLGKVERVADKDDERGGENCMRVQVRLDVTIPL 181

Query: 206 --------------WTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESG-SSSRRQEYGMW 265
                         W   +YE+LP+ C  CG +DH  +DC     +    SS   +YG W
Sbjct: 182 CRGRMIKMEEGKKSWIAFRYERLPNFCYLCGCLDHAEKDCDDGLKKKNVVSSEGFQYGAW 241

Query: 266 MAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGSG 325
           M      + + R P  + I +   ++A+ + +  L  S+ +   T +     S  ++ S 
Sbjct: 242 M-----RAEMDRPPRKTMIVS---LMATQHSHGDLHASMGLLKHTSVRIHGTSFELSRSQ 301

Query: 326 SRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKR-KARKNVGGSI 385
                               ++P    G+          IG + ++ ++ +  +NV    
Sbjct: 302 F-------------------SLPWCCTGDFNEIVRCSESIGRRSRNDRQMQGFRNVIDDC 361

Query: 386 SSEASEKKCIGEVLADGPMKRA----KEDGDATTNG--------SVQHLNWAQSNHRPIL 445
                  + +     +     A    + D   TTN          V HL   +S+H+PI 
Sbjct: 362 EFLDLGYRGLPFTWCNNRRGEATTWLRLDRFMTTNEWLMHFHSVVVYHLECTESDHKPIW 421

Query: 446 FNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRS 505
             T  +     Q  +P+LFRF+E+W     C+E IT       +G+   + +  +  C  
Sbjct: 422 LTTAPMQS---QRPKPKLFRFKEMWRTEVGCEETITKAWVPKVRGSPMVQVQDMIHRCGK 481

Query: 506 RLKHWGRGTFSSIWRQIETNQRIL-----QDLYSKPPPWDFHEIKRVEDQLDQALEEDEI 565
            L  W R  F S+ + I   +  L     Q + S+       ++  +  +L+    ++  
Sbjct: 482 DLTAWSRVHFGSVTQNIRKKKAELKRAEEQSILSR----GHDQVLSIRKELNSLYCKEGK 541

Query: 566 YWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFS 625
            W+QRSR  WL+ GD+NT++FHN+AT R++RN +  ++D  G LI D  ++ + F  Y+ 
Sbjct: 542 MWQQRSRALWLKDGDQNTKYFHNRATYRKRRNSLVGLRDDTGGLITDIHKIGDQFVRYYD 601

Query: 626 NMFSLS------------NPN--------------SEDIDIALQDI-PVR---------- 685
           ++F  +            NP+               +++++AL+ + P++          
Sbjct: 602 DLFQAAPLAEVEHVLNGINPSVTVEMNSKLIRPYTEQEVEVALKQMAPLKAPGPDGMPPA 661

Query: 686 ------NEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSY 729
                 N VG  TV   L  +N    +   N TF+ALI KVK+P+ ++D+R ISLCNV Y
Sbjct: 662 FYQSYWNVVGKETVQAVLSSINSGTLLPSINHTFVALIPKVKNPEHVTDYRTISLCNVIY 721

BLAST of Lag0032313 vs. ExPASy TrEMBL
Match: A0A2N9IWN7 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56441 PE=4 SV=1)

HSP 1 Score: 262.7 bits (670), Expect = 4.8e-66
Identity = 155/423 (36.64%), Postives = 238/423 (56.26%), Query Frame = 0

Query: 371  VQHLNWAQSNHRPILFNTCQVHGNGYQNKRP---RLFRFEEVWTQHPKCKEIITHQGCWT 430
            V HL+   S+H PI      +  +   N RP   R+FRF+E+W  H  CKE IT    W 
Sbjct: 871  VHHLHAVSSDHCPI-----SIQFSHPLNSRPRPNRIFRFKEMWLSHTGCKETIT--SAWQ 930

Query: 431  GQGNSSSRFE--SCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHE-I 490
             Q + ++ F+    L+SCR+ L+ W R +F ++ R+++    +L++  S+      HE  
Sbjct: 931  TQKHGTAMFQVHDKLRSCRNSLRQWSRDSFGNVTRELKKKTLMLREAESESMKGKGHEKA 990

Query: 491  KRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNL 550
              ++ ++   L  +E  W+QRSR+ WL+WGDK+T +FH+ AT RR+RN I E+QDI+GN 
Sbjct: 991  HALKREVSTLLNREECMWRQRSRKKWLRWGDKDTSFFHHSATQRRRRNLISEIQDIHGNR 1050

Query: 551  IVDQRQMEEAFELYFSNMFSLSNP--------------------------NSEDIDIALQ 610
                  +   FE +F  +FS S+P                           +E++D AL+
Sbjct: 1051 YNSDEDIARTFEDHFVLLFSSSHPTEFDSALSGVHRVVTDEMNAELVREFTAEEVDSALK 1110

Query: 611  DIPVRNEVG--------------------YITVLNCLDMLNMVRSIRPWNDTFIALISKV 670
             +      G                       +L+CL+  ++++++   N T+I LI K 
Sbjct: 1111 QMAPSTAPGPDGMSPLFYQSCWGLVGSDVSQAILSCLNSGSLLKAV---NHTYITLIPKT 1170

Query: 671  KHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYE 730
            + P+ +SDFRPISLCNV YKI++KV+ NR+K  L +IISE QSAFVP R I  N+++ +E
Sbjct: 1171 QTPQKVSDFRPISLCNVIYKILSKVITNRLKHILPKIISETQSAFVPCRLITDNILVAFE 1230

Query: 731  CLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSIL 742
             LH +K+ R+   G +ALKLDMSKAYDRVEW FL++++LK+GF +QWV +    C  ++ 
Sbjct: 1231 TLHHMKTARSGRPGSMALKLDMSKAYDRVEWVFLKQIMLKMGFATQWVNLVL-ECISTVS 1282

BLAST of Lag0032313 vs. ExPASy TrEMBL
Match: A0A2N9FMJ0 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS19898 PE=4 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 1.4e-65
Identity = 250/975 (25.64%), Postives = 398/975 (40.82%), Query Frame = 0

Query: 26  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKP 85
           E+L   W+RFSL      TE + D+     +  S  F+L  K     II  + + RTFKP
Sbjct: 2   EELEGQWQRFSL------TEGEGDKFNLDPSYPSETFTLAAKFFTRRIINVEAITRTFKP 61

Query: 86  AWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTAMEF 145
            W    G +   +  N+ LF    + D  RVL  EP  +DK+L+   +    ++  ++ F
Sbjct: 62  LWRAEKGFMARDMGDNVVLFEFEEKADLERVLLLEPWSYDKYLVAFRRLEEDMEAASLAF 121

Query: 146 KFAAFWVHFCELPMDLYNRSMAERLGSV-------------------------------- 205
           + A FWV    LP+    R +AE LGS                                 
Sbjct: 122 EHAVFWVQIENLPILSQKREVAEALGSTIGEVLKTTDSDAELGGGKGMRIRVRINITQPL 181

Query: 206 --------------WTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESGS--SSRRQEYGM 265
                         W   KYE+LP+ C +CG + HG +DC + +L +    +   Q YG 
Sbjct: 182 LRGRKIGMAKGREGWVSFKYERLPNFCYWCGILTHGDKDCEY-WLRNHDRLNKTEQGYGP 241

Query: 266 WM-AFNDRTSSVF--------RSPSNSPIGNQQLMVA----------SPNRN-------- 325
           W+ A  DR +           +S +N P  +Q+   A          SPN          
Sbjct: 242 WLKAELDRPNRKVEVHVEGRNQSSANRPKTHQKTTPAMGVPPSSAAKSPNPKAMPASNME 301

Query: 326 ----PSLQPSLT----VETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGP--VSAIGNI 385
               P L P  +    +   T  +  R+     G     +D+  V E   P  ++    I
Sbjct: 302 KADIPGLDPKSSEIREIHRATFEEQLREIDREMGFLKENLDVLKVAENQSPPCITESVII 361

Query: 386 PSINAGEATVTAMYLSKIG---------PKLKHWKRKARKNVGGS--------------- 445
           PS+       T + L ++          P    WK+KAR    G                
Sbjct: 362 PSVVNSPNDSTRIPLQELSNAPTTKVFKPGSGSWKKKARAKGNGPGPLFLPLTEKRPSDA 421

Query: 446 --ISSEASEKK---------------------------------------------CIG- 505
             I S+A                                                 C+G 
Sbjct: 422 MLIDSDAENNSRPEKILRTDSAWRLSFIYGEPVTHKRMVTWNLLRRLHNQYSLPWCCLGD 481

Query: 506 --EVLADGPMKRAK----------------------------------EDGDATT----- 565
             E++ +  M+  +                                   D   TT     
Sbjct: 482 FNEIIKNEEMQGRRPRPDRQMQAFRDAIDDCNLLDMGYSGFPFTWCNNRDPPHTTWVRLD 541

Query: 566 -------------NGSVQHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHP 625
                        +  ++H++   S+H+ +L  T + H   +  ++P  FRFEEVWT   
Sbjct: 542 RGLANMDWLQQHPHTIMEHIDVTSSDHKCLLM-TWEPHPTSHFQRKP--FRFEEVWTSDE 601

Query: 626 KCKEIITHQGCWTGQGNSSSRFE--SCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDL 685
            C+  IT +  W  + + ++ F+  + L  C+  L +W R +F +I +Q+   +R+L+  
Sbjct: 602 GCE--ITIKDSWECRVDGTAMFKVANKLTQCKRGLGNWSRRSFGNISKQLAEKKRLLKSA 661

Query: 686 -YSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKR 742
                   +   +K ++ +++  L ++E  W+QRSR NWL+ GD+NTR+FH +A+ RR+R
Sbjct: 662 ELEAVRSGNMSAVKDLKMEVNSLLGKEERLWRQRSRSNWLREGDQNTRFFHGRASQRRRR 721

BLAST of Lag0032313 vs. ExPASy TrEMBL
Match: A0A803PWX1 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 7.0e-65
Identity = 159/448 (35.49%), Postives = 240/448 (53.57%), Query Frame = 0

Query: 371  VQHLNWAQSNHRPILFN-TCQVHGNGY-QNKRPRLFRFEEVWTQHPKCKEIITHQGCW-- 430
            VQ L+W +S+HR ++ N   +V G+   + KR   F FEE W Q  +C EII +   W  
Sbjct: 749  VQLLDWWESDHRALVINIPVRVDGDKCGKAKRKNRFHFEEAWCQEEECTEIIDNM--WKE 808

Query: 431  -TGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIK 490
              G+G   S F   +  C   L+ W +   + +  +I   ++IL +L  +  P  +  I+
Sbjct: 809  RQGRGRPVS-FRCKINKCGKALQDWNKKKKARLNNEIAKTKKILHELTMQQQPGVWEAIQ 868

Query: 491  RVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLI 550
             +ED+L+  LE+DE YW+QRSR  WLQWGD+NT++FH++A+ RRK+NEI+ +QD  G   
Sbjct: 869  HMEDKLNGLLEKDEQYWRQRSRALWLQWGDRNTKYFHHKASSRRKKNEIKGLQDQMGVWQ 928

Query: 551  VDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQ----------------DIPVRNEVGYI 610
             D+  + +  E Y+  +F  S+ +   +   L+                D  V   V  +
Sbjct: 929  DDKLLVCQIVEDYYKGLFMGSDIDQGVMQEVLEVVQPKVSMSMNEELMVDFSVEEVVQAV 988

Query: 611  TVLN-----------------------------CLDMLNMVRSIRPWNDTFIALISKVKH 670
              +N                             CL++LN    +   NDT +ALI KV  
Sbjct: 989  KGMNPTKAPGADGLPALFYQKFWSKLKDEVIAVCLNVLNNGADLSCLNDTVVALIPKVDK 1048

Query: 671  PKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECL 730
            P+ I +FRPISLCNV YKI++K L NR++ +L +++S++QSAF+  R IH N I+GYECL
Sbjct: 1049 PQKIEEFRPISLCNVIYKIVSKCLANRLRHSLDQVVSDSQSAFLKGRLIHDNAIVGYECL 1108

Query: 731  HTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSILFN 769
            H ++  R  +G  +ALKLDM+KAYDRVEW FLE ++LK+G+   WV      C  S+ F+
Sbjct: 1109 HVMRKNRFRNGTKVALKLDMAKAYDRVEWRFLEAMMLKLGYDVPWVS-KIMRCLTSVQFS 1168

BLAST of Lag0032313 vs. TAIR 10
Match: AT4G20520.1 (RNA binding;RNA-directed DNA polymerases )

HSP 1 Score: 68.9 bits (167), Expect = 2.0e-11
Identity = 30/83 (36.14%), Postives = 50/83 (60.24%), Query Frame = 0

Query: 644 LVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKA 703
           +V R+K  +  +I   Q++F+P R    N++   E +H+++ K+    GW+ LKLD+ KA
Sbjct: 1   MVERLKPLMTNLIGPAQASFIPGRVSTDNIVFVQEAVHSMRRKK-GVKGWMLLKLDLEKA 60

Query: 704 YDRVEWCFLERLLLKIGFHSQWV 727
           YDR+ W +LE  L+  GF   W+
Sbjct: 61  YDRIRWDYLEDTLISAGFPEVWL 82

BLAST of Lag0032313 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 46.6 bits (109), Expect = 1.1e-04
Identity = 45/190 (23.68%), Postives = 78/190 (41.05%), Query Frame = 0

Query: 499 EIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELY 558
           E +++Q+SR  WLQ GD NTR+FH      + +N I+ ++  +   + +  Q++E    Y
Sbjct: 432 ESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAY 491

Query: 559 FSNMFSLSN------------------------------PNSEDIDIALQDIPVRNE--- 618
           ++++    +                              P+ ++I  A+  +P RN+   
Sbjct: 492 YTHLLGSDSDILTPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMP-RNKAPG 551

Query: 619 ---------------VGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPI 641
                          V   T+    +       ++ +N T I LI KV     +S FRP+
Sbjct: 552 PDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVTGVDQLSMFRPV 611

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022158377.13.1e-8330.27uncharacterized protein LOC111024874 [Momordica charantia][more]
XP_024038343.17.2e-6435.73uncharacterized protein LOC112097373 [Citrus clementina][more]
XP_042962496.11.1e-6125.60uncharacterized protein LOC122296768 [Carya illinoinensis][more]
XP_017217082.12.0e-6134.26PREDICTED: uncharacterized protein LOC108194640 [Daucus carota subsp. sativus][more]
XP_042962692.17.4e-6124.19uncharacterized protein LOC122296963 [Carya illinoinensis][more]
Match NameE-valueIdentityDescription
P143814.7e-1036.52Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV... [more]
P085482.4e-0921.86LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1[more]
P113691.5e-0832.77LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE... [more]
O003705.8e-0828.85LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1DX301.5e-8330.27uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
A0A2N9INH43.4e-6727.50Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9IWN74.8e-6636.64Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56441 PE=4 SV=1[more]
A0A2N9FMJ01.4e-6525.64Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A803PWX17.0e-6535.49Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G20520.12.0e-1136.14RNA binding;RNA-directed DNA polymerases [more]
AT1G43760.11.1e-0423.68DNAse I-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025558Domain of unknown function DUF4283PFAMPF14111DUF4283coord: 60..172
e-value: 2.0E-15
score: 56.6
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 621..727
e-value: 1.5E-18
score: 67.2
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 247..284
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 247..279
NoneNo IPR availablePANTHERPTHR19446:SF440SUBFAMILY NOT NAMEDcoord: 590..741
coord: 377..578
NoneNo IPR availablePANTHERPTHR19446REVERSE TRANSCRIPTASEScoord: 590..741
coord: 377..578
NoneNo IPR availableCDDcd01650RT_nLTR_likecoord: 609..716
e-value: 3.2158E-29
score: 113.925
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 605..722

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0032313.1Lag0032313.1mRNA