Lag0021974 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0021974
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionReverse transcriptase domain-containing protein
Locationchr7: 15250899 .. 15252457 (+)
RNA-Seq ExpressionLag0021974
SyntenyLag0021974
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGATGGATATGGCGGTGGCTGGATCCCAACCCCACCCGGGATTATGAGTCTGATATTTTGAAATGTTCGGGGGTTGGGGTCTCCCCGTTCATTCCGACGCTTGGCCAAGTTGGTGCAAGCAAAATACCTTTGGTGCTTTTTCTGTCTGAGACTAAATTGTCATCAAACAGAATGGCGCCAGCGAAGCGAGCTCTAGGCTTCGAGAACTGTTTTTGTGTTGACAATAGAGGGAAAAGTGGTGGTTTGGCTCTTTTGTGGAATTCGTCTGTCACTTTCAGCCTCCTGTCGTATTCGAATAATCACATTGATGGGTGGATCACGTGGGATGATTATCATTGGCGTCTCACTTGCATAGGTTTCCCTACAGCGGACATGCGAGACCAGACGTGGGCCCTTCTCTCTAAGTTAAGGGGTGGCCCTGAAACTCCTTGGCTTATAGGGGGACTTTAATGCCCTGTTATATCAGCACGAAAAGGAGGGTGGCAGAGATTAATGCCCTGTTTATTCTTCAGTTGTTGACTGATTATGTCCAGCAGTTCTTCTTGTCATTAGATCCCAATGACCAGGATTTTGATATATCTCTCAGGGACCTTCCTCGCTTTGTGGATAGTGAGATGAATGTCGTTCTAATGCAGCCTTTTACAGAGGATGAGATTCTTCGGGCTTTGAAGCAGTCTCACCCCCATAAGGCCCCAGGCCCAGATGGGTTATCTGGCAGTTTCTACAAAAACCACTGGTCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTTTGGCTGTGTTGAATCAGGGATGCTCCCCGGGGTCAATCAATGAGACTATGATTGTCCTCGTTTCGAAGGTCAAGGCCCCTCGTCGGGTATCTTATTTTCGACCCATCTCTCTCTACAATGTTAGTTATAAGCTAATATCGAAGGCCTTGGTCAACAGGATGAAACATATTCTTCCAAAACTTATTTCTCCCAACCAGAGTGCCTTTATCCCAGGGAGGTGTGTTGTGGATAATGCCATCTTGGGGTTTGAGTGCATCCATGAGTTGAGGAGGCAGACTGGAGGGAAATCTAAATGGGCTACTCTAAAACTTGACATGAGCAAAGCTTATGACATGATAGAATGGTCTTTTTTGCGAATAGTTATGGCTAGAATGGGTTTCGCTCAGCAGTGAATTGATTTAATTCTCCGGTGTGTTAGCTCGGTCTCCTTTTCTTTTAACCTAAACGGGGAGAGGTTGGGACATGTGACTCCTTCCCATGGACTCAGGCAGGGGGATCCGCGGTCCCTATATCTGTTTTTACTCTGTGTTGAGGGTTTATCGAGCCTGTTGCGAGGAGCAGAGCAGCAATATTTGATATCTGGGTTTCGATTTGCACGGAGTAGCCCCCCGATTTCTCATCTATTTTTTGCGAATGATAGCCTCCTGTTCTTCAGGGCAAATGCTATGGGAGTTGTGGCTATCCGGGACCTATTGATCCGCTATGAACGAACCTCAGGACAGGTGGTCAATTATGAGAAGTCAGTGGTTGCATTCAGCCCAAATACTAGAAAGGACACACAATAG

mRNA sequence

ATGCGATGGATATGGCGGTGGCTGGATCCCAACCCCACCCGGGATTATGAGTCTGATATTTTGAAATGTTCGGGGGTTGGGGTCTCCCCGTTCATTCCGACGCTTGGCCAAGTTGGTGCAAGCAAAATACCTTTGGTGCTTTTTCTGTCTGAGACTAAATTGTCATCAAACAGAATGGCGCCAGCGAAGCGAGCTCTAGGCTTCGAGAACTGTTTTTGTGTTGACAATAGAGGGAAAAGTGGTGGTTTGGCTCTTTTGTGGAATTCGTCTGTCACTTTCAGCCTCCTGTCGTATTCGAATAATCACATTGATGGGTGGATCACGTGGGATGATTATCATTGGCGTCTCACTTGCATAGGTTTCCCTACAGCGGACATGCGAGACCAGACCACGAAAAGGAGGGTGGCAGAGATTAATGCCCTGTTTATTCTTCAGTTGTTGACTGATTATGTCCAGCAGTTCTTCTTGTCATTAGATCCCAATGACCAGGATTTTGATATATCTCTCAGGGACCTTCCTCGCTTTGTGGATAGTGAGATGAATGTCGTTCTAATGCAGCCTTTTACAGAGGATGAGATTCTTCGGGCTTTGAAGCAGTCTCACCCCCATAAGGCCCCAGGCCCAGATGGGTTATCTGGCAGTTTCTACAAAAACCACTGGTCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTTTGGCTGTGTTGAATCAGGGATGCTCCCCGGGGTCAATCAATGAGACTATGATTGTCCTCGTTTCGAAGGTCAAGGCCCCTCGTCGGGTATCTTATTTTCGACCCATCTCTCTCTACAATGTTAGTTATAAGCTAATATCGAAGGCCTTGGTCAACAGGATGAAACATATTCTTCCAAAACTTATTTCTCCCAACCAGAGTGCCTTTATCCCAGGGAGGTGTGTTGTGGATAATGCCATCTTGGGGTTTGAGTGCATCCATGAGTTGAGGAGGCAGACTGGAGGGAAATCTAAATGGGCTACTCTAAAACTTGACATGAGCAAAGCTTATGACATGATAGAATGGTCTTTTTTGCGAATAGTTATGGCTAGAATGGGTTTCGCTCAGCAGCAGGGGGATCCGCGGTCCCTATATCTGTTTTTACTCTGTGTTGAGGGTTTATCGAGCCTGTTGCGAGGAGCAGAGCAGCAATATTTGATATCTGGGTTTCGATTTGCACGGAGTAGCCCCCCGATTTCTCATCTATTTTTTGCGAATGATAGCCTCCTGTTCTTCAGGGCAAATGCTATGGGAGTTGTGGCTATCCGGGACCTATTGATCCGCTATGAACGAACCTCAGGACAGGTGGTCAATTATGAGAAGTCAGTGGTTGCATTCAGCCCAAATACTAGAAAGGACACACAATAG

Coding sequence (CDS)

ATGCGATGGATATGGCGGTGGCTGGATCCCAACCCCACCCGGGATTATGAGTCTGATATTTTGAAATGTTCGGGGGTTGGGGTCTCCCCGTTCATTCCGACGCTTGGCCAAGTTGGTGCAAGCAAAATACCTTTGGTGCTTTTTCTGTCTGAGACTAAATTGTCATCAAACAGAATGGCGCCAGCGAAGCGAGCTCTAGGCTTCGAGAACTGTTTTTGTGTTGACAATAGAGGGAAAAGTGGTGGTTTGGCTCTTTTGTGGAATTCGTCTGTCACTTTCAGCCTCCTGTCGTATTCGAATAATCACATTGATGGGTGGATCACGTGGGATGATTATCATTGGCGTCTCACTTGCATAGGTTTCCCTACAGCGGACATGCGAGACCAGACCACGAAAAGGAGGGTGGCAGAGATTAATGCCCTGTTTATTCTTCAGTTGTTGACTGATTATGTCCAGCAGTTCTTCTTGTCATTAGATCCCAATGACCAGGATTTTGATATATCTCTCAGGGACCTTCCTCGCTTTGTGGATAGTGAGATGAATGTCGTTCTAATGCAGCCTTTTACAGAGGATGAGATTCTTCGGGCTTTGAAGCAGTCTCACCCCCATAAGGCCCCAGGCCCAGATGGGTTATCTGGCAGTTTCTACAAAAACCACTGGTCGATAGTGGGGCCTTCAGTGATCCAGAGTTGTTTGGCTGTGTTGAATCAGGGATGCTCCCCGGGGTCAATCAATGAGACTATGATTGTCCTCGTTTCGAAGGTCAAGGCCCCTCGTCGGGTATCTTATTTTCGACCCATCTCTCTCTACAATGTTAGTTATAAGCTAATATCGAAGGCCTTGGTCAACAGGATGAAACATATTCTTCCAAAACTTATTTCTCCCAACCAGAGTGCCTTTATCCCAGGGAGGTGTGTTGTGGATAATGCCATCTTGGGGTTTGAGTGCATCCATGAGTTGAGGAGGCAGACTGGAGGGAAATCTAAATGGGCTACTCTAAAACTTGACATGAGCAAAGCTTATGACATGATAGAATGGTCTTTTTTGCGAATAGTTATGGCTAGAATGGGTTTCGCTCAGCAGCAGGGGGATCCGCGGTCCCTATATCTGTTTTTACTCTGTGTTGAGGGTTTATCGAGCCTGTTGCGAGGAGCAGAGCAGCAATATTTGATATCTGGGTTTCGATTTGCACGGAGTAGCCCCCCGATTTCTCATCTATTTTTTGCGAATGATAGCCTCCTGTTCTTCAGGGCAAATGCTATGGGAGTTGTGGCTATCCGGGACCTATTGATCCGCTATGAACGAACCTCAGGACAGGTGGTCAATTATGAGAAGTCAGTGGTTGCATTCAGCCCAAATACTAGAAAGGACACACAATAG

Protein sequence

MRWIWRWLDPNPTRDYESDILKCSGVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDYHWRLTCIGFPTADMRDQTTKRRVAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ
Homology
BLAST of Lag0021974 vs. NCBI nr
Match: XP_018815621.1 (uncharacterized protein LOC108987197 [Juglans regia] >KAF5442204.1 hypothetical protein F2P56_034891 [Juglans regia])

HSP 1 Score: 292.0 bits (746), Expect = 9.2e-75
Identity = 170/454 (37.44%), Postives = 240/454 (52.86%), Query Frame = 0

Query: 70  NCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDYHWRLTCIGFPTADMRDQ 129
           +C  VD  G+ GG+ALLW   V+ S+LSYS+ HID  I  D           P   +   
Sbjct: 17  DCSTVDGVGRKGGIALLWGRDVSLSILSYSHYHIDAAIEDD-----------PAKGLNLS 76

Query: 130 TTKRRVAEINALFIL------------------------------------QLLTDYVQQ 189
             ++ + ++NAL  L                                    +++ DY ++
Sbjct: 77  FARKHMEKVNALDSLGDNIELLQKARVDDQKWLERDELLWKQRSKVGEQRDKIILDYFEE 136

Query: 190 FFLSLDP-NDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLS 249
            F S +P    DF   L  L   V S MN  L QP+TE E+  +L Q HP KAPGPDG+S
Sbjct: 137 LFTSSNPVGSTDF---LCSLAGKVTSSMNKGLAQPYTEAEVTASLAQMHPSKAPGPDGMS 196

Query: 250 GSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNV 309
             FY+ +W +VG SV ++ L  LN G  P ++N T I+L+ K K P +V  +RPISL NV
Sbjct: 197 PMFYQKYWDVVGISVTKAVLTALNSGSFPSTLNHTNIILIPKKKFPEKVDDYRPISLCNV 256

Query: 310 SYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWAT 369
           +YKLI+K + NR+K +LP +I  +QSAF+PGR + DN ++ +E +H L  +T GK  + +
Sbjct: 257 AYKLIAKVVSNRLKFVLPSVIEESQSAFVPGRLITDNVLIAYELVHYLNHKTKGKKGYMS 316

Query: 370 LKLDMSKAYDMIEWSFLRIVMARMGFAQ-------------------------------- 429
           +KLDMSKAYD +EW F+R VM  MGF +                                
Sbjct: 317 IKLDMSKAYDRVEWGFIRTVMTVMGFDRTFIEMIMFCVSSVSFSVLINGESKGSIKPTRG 376

Query: 430 -QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRAN 454
            +Q DP S YLFLLC EGL +LL+ A  +  IS  +  + +P I+HL FA+DS++F RA+
Sbjct: 377 LRQRDPLSPYLFLLCTEGLIALLKEAGIRKKISSIQICKGAPHINHLLFADDSVVFCRAD 436

BLAST of Lag0021974 vs. NCBI nr
Match: XP_028068804.1 (uncharacterized protein LOC114271378 [Camellia sinensis])

HSP 1 Score: 284.6 bits (727), Expect = 1.5e-72
Identity = 162/351 (46.15%), Postives = 207/351 (58.97%), Query Frame = 0

Query: 144 LQLLTDYVQQFFLSLDPNDQDFDIS--LRDLPRFVDSEMNVVLMQPFTEDEILRALKQSH 203
           L+ L   V  +F  L    Q  DI   L  +   V  E NV L +P+T +E+  AL Q H
Sbjct: 325 LEDLERIVVGYFAELFDAGQQCDIGEVLASIHVVVPPERNVELARPYTAEEVSYALFQMH 384

Query: 204 PHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRV 263
           P KAPGPDG    FY+  W IVG  V ++ L VLN+G +  ++N+T IVL+ KVK+P+R+
Sbjct: 385 PTKAPGPDGKPTLFYQRFWPIVGAQVTRAVLGVLNEGKAVTAMNDTFIVLIPKVKSPKRM 444

Query: 264 SYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELR 323
           S FRPISL NV YKL+SK L NRM+ ILP +IS NQSAF+ GR + DN +  FE  H L+
Sbjct: 445 SQFRPISLCNVVYKLVSKVLANRMRKILPDIISMNQSAFVAGRLISDNMLASFEIFHFLK 504

Query: 324 RQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ--------------------- 383
            +  GK     LKLDMSKAYD +EWSFLR VM RMGF Q                     
Sbjct: 505 NKRHGKEGHFALKLDMSKAYDRVEWSFLRGVMERMGFNQLFVDTIVHCISSVSYSVLVNG 564

Query: 384 ------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFF 443
                       +QGDP S YLF+LC EGLS+L++ AE +  ++G    R SP +SHL F
Sbjct: 565 SPIKKFVPTRGLRQGDPLSPYLFVLCAEGLSALIKRAEGEGKLTGVAVCRGSPRVSHLLF 624

Query: 444 ANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ 460
           A+DSLLF  AN   +V ++D+L +YE  SGQ +N EKS + FS N   D Q
Sbjct: 625 ADDSLLFGAANMQELVVVQDILGKYELVSGQKINLEKSAICFSKNVGIDIQ 675

BLAST of Lag0021974 vs. NCBI nr
Match: XP_017250619.1 (PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus])

HSP 1 Score: 283.9 bits (725), Expect = 2.5e-72
Identity = 143/347 (41.21%), Postives = 207/347 (59.65%), Query Frame = 0

Query: 146 LLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKA 205
           ++  + +  F +  P+    D  L  +   V  +MN +L + FT  E+ RA+    P K+
Sbjct: 601 IIQRFYENLFTTSSPSADKIDELLAAVQPLVTYKMNRILQRDFTMGEVKRAIFSMSPDKS 660

Query: 206 PGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFR 265
           PGPDG++  F++ HW IVGP V ++ L  LN+      IN T++ L+ KVK P+ V  FR
Sbjct: 661 PGPDGMNAMFFQQHWEIVGPVVSKAILDCLNEEAGLEMINSTLVTLIPKVKEPKNVGDFR 720

Query: 266 PISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTG 325
           PISL NV YK I+K L+NR+K ILP +I+  QSAF+PGR + DNA++ +EC+H LR    
Sbjct: 721 PISLCNVIYKAIAKVLINRLKPILPLIINGTQSAFVPGRLITDNALIAYECLHHLRHMRS 780

Query: 326 GKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQQ------------------------ 385
           GK  +  +KLDMSKAYD +EW F+  ++ ++GF+QQ                        
Sbjct: 781 GKKCFVAMKLDMSKAYDRVEWIFIEKMLTKLGFSQQWVKKIMKCVTSVNYSFQVNGQIYG 840

Query: 386 ---------QGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS 445
                    QGDP S YLFL+C EG S+LLR AE +  I G + AR++P ISHLFFA+DS
Sbjct: 841 KVFPSRGLRQGDPLSPYLFLICAEGFSALLRQAENRSDILGLKIARNAPSISHLFFADDS 900

Query: 446 LLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ 460
           LLF +A+   + +I+++   Y   SGQ++N+ KS++ FSPNT  D +
Sbjct: 901 LLFLKASTRSLNSIQNIFSLYSECSGQMINFNKSLLFFSPNTTNDVR 947

BLAST of Lag0021974 vs. NCBI nr
Match: XP_030505314.1 (uncharacterized protein LOC115720302 [Cannabis sativa])

HSP 1 Score: 283.5 bits (724), Expect = 3.3e-72
Identity = 151/339 (44.54%), Postives = 200/339 (59.00%), Query Frame = 0

Query: 149 DYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGP 208
           DY    F +   +    D+ L  +P  +  EMN +L QPFT DEIL AL      K+PGP
Sbjct: 402 DYYANLFATAGVDATAMDLVLDTIPTTITPEMNFILQQPFTSDEILAALNSMGNDKSPGP 461

Query: 209 DGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPIS 268
           DG+S  F+ NHWS +GP +  + L VLN    P  IN+T+I L+ KVK P  V+ +RPIS
Sbjct: 462 DGMSVMFFTNHWSTIGPHITTAVLDVLNNRSDPSCINKTLITLIPKVKHPTSVTQYRPIS 521

Query: 269 LYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKS 328
           L NV YKLISKA+V RMK +L  +IS  QSAFI  R ++DN ++ +E +H LR +T G+ 
Sbjct: 522 LCNVIYKLISKAIVLRMKPVLHLVISEFQSAFIFDRLIIDNVLVAYELLHCLRNKTRGRV 581

Query: 329 KWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ---------------------------- 388
            +A LKLDMSKA+D +EW FL  V+ +MGF                              
Sbjct: 582 SYAALKLDMSKAFDRVEWHFLERVLLKMGFGHGLVGLIMRCISSTSFSFLINGHVTGQLQ 641

Query: 389 -----QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLF 448
                 QGDP S YLFL+C E LS LL+  EQQ +++G    R +P +SHL FANDSLLF
Sbjct: 642 PQRGIPQGDPLSPYLFLVCSEALSRLLQLHEQQGMLTGLGVTRQAPKVSHLLFANDSLLF 701

Query: 449 FRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNT 455
            R +A  +  ++  L  Y   SGQ++NY+KSV++FSPNT
Sbjct: 702 CRTDARSLAVVKHTLDVYSNASGQLINYDKSVMSFSPNT 740

BLAST of Lag0021974 vs. NCBI nr
Match: XP_030505314.1 (uncharacterized protein LOC115720302 [Cannabis sativa])

HSP 1 Score: 65.1 bits (157), Expect = 1.8e-06
Identity = 40/121 (33.06%), Postives = 63/121 (52.07%), Query Frame = 0

Query: 25  GVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLA 84
           G+G       L  + + + P +LFL ETKL +  +   + AL F N F V  RG  GGL 
Sbjct: 10  GLGKPSKFRQLRLLNSQQAPHLLFLMETKLPTGSINKIRAALNFPNGFEVPRRGLGGGLM 69

Query: 85  LLWNSSVTFSLLSYSNNHIDGWITWDDY--HWRLTCIGFPTADMRDQTTK--RRVAEINA 142
           LLW  ++  +LL+YS NHI  ++  D+    +     G P   +R  T +  +R+A+I+ 
Sbjct: 70  LLWKDNIDVTLLNYSMNHITCYVQCDNIPKFYFSGFYGAPETQLRPHTWRVLKRLADISP 129


HSP 2 Score: 280.8 bits (717), Expect = 2.1e-71
Identity = 151/348 (43.39%), Postives = 208/348 (59.77%), Query Frame = 0

Query: 145 QLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHK 204
           + + +Y +Q F S  P+  +FD  L+ +   V   MN  L + FT DE+  ALKQ  P  
Sbjct: 395 EAMVEYFKQIFASTMPS--NFDQILQGIDTKVTPAMNADLTREFTADEVEFALKQMKPLT 454

Query: 205 APGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYF 264
           APGPDG+S  FYK+ W+ +G  VI + LA+LN G  P S+N T I L+ K+K+P + + F
Sbjct: 455 APGPDGMSPIFYKSCWNFIGHDVIDASLAILNSGNMPASLNHTFISLIPKIKSPEKATDF 514

Query: 265 RPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQT 324
           RPISL NV YK++SK + NR+K +LPKL+S +QSAF+  R + DN ++ FE +H L+ +T
Sbjct: 515 RPISLCNVLYKIVSKTIANRLKKLLPKLVSESQSAFMSDRLISDNILVAFETLHHLKTKT 574

Query: 325 GGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGF-------------------------- 384
            GK+ +  +KLDMSKAYD +EW+FL  VM ++GF                          
Sbjct: 575 KGKTGFMAIKLDMSKAYDRVEWAFLEKVMEKLGFDNRWITLVSSCIRSVSFSVLVNGEPH 634

Query: 385 -------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFAND 444
                    +QGDP S YLFLLC EGL SL++  E    I G     + P +SHLFFA+D
Sbjct: 635 GNFTPNRGLRQGDPLSPYLFLLCAEGLHSLIQQVEISGSIKGVSLCSTVPKVSHLFFADD 694

Query: 445 SLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ 460
           SLLF RAN+  V +I ++L +YE  SGQ +N EK+ + FSPNT    Q
Sbjct: 695 SLLFCRANSQEVSSIMEILKQYEEASGQQINREKTQLFFSPNTDPHVQ 740

BLAST of Lag0021974 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 106.3 bits (264), Expect = 9.4e-22
Identity = 78/269 (29.00%), Postives = 129/269 (47.96%), Query Frame = 0

Query: 145 QLLTDYVQQFFLSL---DPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSH 204
           + + D  + F+ +L   DP   D    L D    V       L  P T DE+ +AL+   
Sbjct: 403 EAIRDRARSFYQNLFSPDPISPDACEELWDGLPVVSERRKERLETPITLDELSQALRLMP 462

Query: 205 PHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRV 264
            +K+PG DGL+  F++  W  +GP   +       +G  P S    ++ L+ K    R +
Sbjct: 463 HNKSPGLDGLTIEFFQFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGDLRLI 522

Query: 265 SYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELR 324
             +RP+SL +  YK+++KA+  R+K +L ++I P+QS  +PGR + DN  L  + +H  R
Sbjct: 523 KNWRPVSLLSTDYKIVAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLLHFAR 582

Query: 325 RQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQQ-QGDPRSLYLFLLCVEGLSS 384
           R TG     A L LD  KA+D ++  +L   +    F  Q  G  +++Y    C+  ++ 
Sbjct: 583 R-TG--LSLAFLSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINW 642

Query: 385 LL-------RGAEQQYLISGFRFARSSPP 403
            L       RG  Q   +SG  ++ +  P
Sbjct: 643 SLTAPLAFGRGVRQGCPLSGQLYSLAIEP 668

BLAST of Lag0021974 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 102.1 bits (253), Expect = 1.8e-20
Identity = 84/349 (24.07%), Postives = 150/349 (42.98%), Query Frame = 0

Query: 147 LTDYVQQFFLSLDPNDQDFD--ISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHK 206
           + +Y +  + +   N ++ D  +    LPR    E+   L +P T  EI+  +      K
Sbjct: 411 IREYYKHLYANKLENLEEMDTFLDTYTLPRLNQEEVE-SLNRPITGSEIVAIINSLPTKK 470

Query: 207 APGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKV-KAPRRVSY 266
           +PGPDG +  FY+ +   + P +++   ++  +G  P S  E  I+L+ K  +   +   
Sbjct: 471 SPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKEN 530

Query: 267 FRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQ 326
           FRPISL N+  K+++K L NR++  + KLI  +Q  FIPG     N       I  + R 
Sbjct: 531 FRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINR- 590

Query: 327 TGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGF------------------------- 386
                    + +D  KA+D I+  F+   + ++G                          
Sbjct: 591 -AKDKNHVIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQK 650

Query: 387 --------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFAN 446
                     +QG P S  LF + +E L+  +R   Q+  I G +  +    +S   FA+
Sbjct: 651 LEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIR---QEKEIKGIQLGKEEVKLS--LFAD 710

Query: 447 DSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKDTQ 460
           D +++     +    +  L+  + + SG  +N +KS  AF  N  + T+
Sbjct: 711 DMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKS-QAFLYNNNRQTE 750

BLAST of Lag0021974 vs. ExPASy Swiss-Prot
Match: P08548 (LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 2.5e-19
Identity = 84/350 (24.00%), Postives = 144/350 (41.14%), Query Frame = 0

Query: 143 ILQLLTDYVQQFFLSLDPNDQDFDISLR--DLPRFVDSEMNVVLMQPFTEDEILRALKQS 202
           I ++L +Y ++ +     N ++ D  L    LPR    E+  +L +P +  EI   ++  
Sbjct: 406 IQKILNEYYKKLYSHKYENLKEIDQYLEACHLPRLSQKEVE-MLNRPISSSEIASTIQNL 465

Query: 203 HPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKV-KAPR 262
              K+PGPDG +  FY+     + P ++     +  +G  P +  E  I L+ K  K P 
Sbjct: 466 PKKKSPGPDGFTSEFYQTFKEELVPILLNLFQNIEKEGILPNTFYEANITLIPKPGKDPT 525

Query: 263 RVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHE 322
           R   +RPISL N+  K+++K L NR++  + K+I  +Q  FIPG     N       I  
Sbjct: 526 RKENYRPISLMNIDAKILNKILTNRIQQHIKKIIHHDQVGFIPGSQGWFNIRKSINVIQH 585

Query: 323 LRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGF--------------------- 382
           + +          L +D  KA+D I+  F+   + ++G                      
Sbjct: 586 INKLK--NKDHMILSIDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIIL 645

Query: 383 ------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHL 442
                         +QG P S  LF + +E L+  +R   ++  I G      S  I   
Sbjct: 646 NGVKLKSFPLRSGTRQGCPLSPLLFNIVMEVLAIAIR---EEKAIKGIHI--GSEEIKLS 705

Query: 443 FFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRK 457
            FA+D +++          + +++  Y   SG  +N  KSV     N  +
Sbjct: 706 LFADDMIVYLENTRDSTTKLLEVIKEYSNVSGYKINTHKSVAFIYTNNNQ 747

BLAST of Lag0021974 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 95.9 bits (237), Expect = 1.3e-18
Identity = 79/298 (26.51%), Postives = 124/298 (41.61%), Query Frame = 0

Query: 184 LMQPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGS 243
           L  P +  EI   +      K+PGPDG S  FY+     + P + +    +  +G  P S
Sbjct: 456 LNSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQTFKEDLIPILHKLFHKIEVEGTLPNS 515

Query: 244 INETMIVLVSK-VKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIP 303
             E  I L+ K  K P ++  FRPISL N+  K+++K L NR++  +  +I P+Q  FIP
Sbjct: 516 FYEATITLIPKPQKDPTKIENFRPISLMNIDAKILNKILANRIQEHIKAIIHPDQVGFIP 575

Query: 304 GRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGF---- 363
           G     N       IH + +          + LD  KA+D I+  F+  V+ R G     
Sbjct: 576 GMQGWFNIRKSINVIHYINKLK--DKNHMIISLDAEKAFDKIQHPFMIKVLERSGIQGPY 635

Query: 364 -----------------------------AQQQGDPRSLYLFLLCVEGLSSLLRGAEQQY 423
                                          +QG P S YLF + +E L+  +R   QQ 
Sbjct: 636 LNMIKAIYSKPVANIKVNGEKLEAIPLKSGTRQGCPLSPYLFNIVLEVLARAIR---QQK 695

Query: 424 LISGFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSV 448
            I G +  +    IS L  A+D +++          + +L+  +    G  +N  KS+
Sbjct: 696 EIKGIQIGKEEVKISLL--ADDMIVYISDPKNSTRELLNLINSFGEVVGYKINSNKSM 746

BLAST of Lag0021974 vs. ExPASy Swiss-Prot
Match: P92555 (Uncharacterized mitochondrial protein AtMg01250 OS=Arabidopsis thaliana OX=3702 GN=AtMg01250 PE=4 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 2.9e-07
Identity = 27/52 (51.92%), Postives = 37/52 (71.15%), Query Frame = 0

Query: 361 QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS 413
           +QGDP S YLF+LC E LS L R A++Q  + G R + +SP I+HL FA+D+
Sbjct: 29  RQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRINHLLFADDT 80

BLAST of Lag0021974 vs. ExPASy TrEMBL
Match: A0A2N9GJ35 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1)

HSP 1 Score: 294.7 bits (753), Expect = 6.9e-76
Identity = 161/367 (43.87%), Postives = 218/367 (59.40%), Query Frame = 0

Query: 126  MRDQTTKRRVAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLM 185
            +RDQ +  R      L + Q+  DY    F S +P  +  D  L ++   V   MN VLM
Sbjct: 713  LRDQQSNWRT---EPLEVEQIAVDYFSSLFASSNP--RAIDEVLHEVEGVVTPGMNNVLM 772

Query: 186  QPFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSIN 245
            +PFT++EI RAL Q HP K+PGPDG+S  F++ +W IV   V  + L  L  G   GSIN
Sbjct: 773  RPFTQEEIKRALFQMHPSKSPGPDGMSALFFQKYWHIVDSDVSNAVLDFLKNGRMLGSIN 832

Query: 246  ETMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRC 305
             T +VL+ KV AP  ++ FRPISL NV YK++SK LVNRMK ILP++IS +QSAF+PGR 
Sbjct: 833  FTHLVLIPKVAAPENITQFRPISLCNVIYKIVSKVLVNRMKTILPQVISDSQSAFVPGRM 892

Query: 306  VVDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQQ---- 365
            + DN I+ FE IH L+    G +    +KLDMSKAYD +EW +L+ +M ++GF  Q    
Sbjct: 893  ITDNVIIAFETIHYLKNLQNGNNVQMAVKLDMSKAYDRVEWDYLQAIMIKLGFHAQWVKL 952

Query: 366  -----------------------------QGDPRSLYLFLLCVEGLSSLLRGAEQQYLIS 425
                                         QGDP S YLFLLC EGLS++LR AE++ L+ 
Sbjct: 953  VMACVKTATYSILVNGEPKGYITPQRGLRQGDPLSPYLFLLCTEGLSAVLRKAERESLLK 1012

Query: 426  GFRFARSSPPISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSP 460
            G    R  P +SHLFFA+DS++F RA     V +++LL +Y   SGQVVN +K+ + FSP
Sbjct: 1013 GVSICRGGPRVSHLFFADDSIVFCRATNADCVTLQNLLTKYAHASGQVVNSDKTALFFSP 1072

BLAST of Lag0021974 vs. ExPASy TrEMBL
Match: A0A2N9GJ35 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 3.5e-11
Identity = 42/112 (37.50%), Postives = 65/112 (58.04%), Query Frame = 0

Query: 21  LKCSGVGVSPFIPTLGQVGASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKS 80
           L C G+G    +  L  +   + P ++FL ET+L+   +   +  LG + C  V+  G+ 
Sbjct: 337 LNCRGLGNPQTVNELHNLVKKEGPNIVFLMETRLNVRNLEWLRVRLGMKGCLGVERHGQG 396

Query: 81  GGLALLWNSSVTFSLLSYSNNHIDGWITWDD-YHWRLT-CIGFPTADMRDQT 131
           GGLALLW+SSV  ++ SYS +HIDG +  +D   WRLT   G+P A +R ++
Sbjct: 397 GGLALLWDSSVMINIQSYSEHHIDGEVVQNDGLRWRLTGFYGYPEAHLRHRS 448


HSP 2 Score: 292.0 bits (746), Expect = 4.4e-75
Identity = 170/454 (37.44%), Postives = 240/454 (52.86%), Query Frame = 0

Query: 70  NCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWITWDDYHWRLTCIGFPTADMRDQ 129
           +C  VD  G+ GG+ALLW   V+ S+LSYS+ HID  I  D           P   +   
Sbjct: 17  DCSTVDGVGRKGGIALLWGRDVSLSILSYSHYHIDAAIEDD-----------PAKGLNLS 76

Query: 130 TTKRRVAEINALFIL------------------------------------QLLTDYVQQ 189
             ++ + ++NAL  L                                    +++ DY ++
Sbjct: 77  FARKHMEKVNALDSLGDNIELLQKARVDDQKWLERDELLWKQRSKVGEQRDKIILDYFEE 136

Query: 190 FFLSLDP-NDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLS 249
            F S +P    DF   L  L   V S MN  L QP+TE E+  +L Q HP KAPGPDG+S
Sbjct: 137 LFTSSNPVGSTDF---LCSLAGKVTSSMNKGLAQPYTEAEVTASLAQMHPSKAPGPDGMS 196

Query: 250 GSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNV 309
             FY+ +W +VG SV ++ L  LN G  P ++N T I+L+ K K P +V  +RPISL NV
Sbjct: 197 PMFYQKYWDVVGISVTKAVLTALNSGSFPSTLNHTNIILIPKKKFPEKVDDYRPISLCNV 256

Query: 310 SYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWAT 369
           +YKLI+K + NR+K +LP +I  +QSAF+PGR + DN ++ +E +H L  +T GK  + +
Sbjct: 257 AYKLIAKVVSNRLKFVLPSVIEESQSAFVPGRLITDNVLIAYELVHYLNHKTKGKKGYMS 316

Query: 370 LKLDMSKAYDMIEWSFLRIVMARMGFAQ-------------------------------- 429
           +KLDMSKAYD +EW F+R VM  MGF +                                
Sbjct: 317 IKLDMSKAYDRVEWGFIRTVMTVMGFDRTFIEMIMFCVSSVSFSVLINGESKGSIKPTRG 376

Query: 430 -QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRAN 454
            +Q DP S YLFLLC EGL +LL+ A  +  IS  +  + +P I+HL FA+DS++F RA+
Sbjct: 377 LRQRDPLSPYLFLLCTEGLIALLKEAGIRKKISSIQICKGAPHINHLLFADDSVVFCRAD 436

BLAST of Lag0021974 vs. ExPASy TrEMBL
Match: A0A803PQT0 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 3.8e-74
Identity = 169/486 (34.77%), Postives = 254/486 (52.26%), Query Frame = 0

Query: 40  ASKIPLVLFLSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYS 99
           + + P VLFL ETKL S  ++  + +L F +   V   G SGG+ LLW S    ++ +YS
Sbjct: 346 SEQAPCVLFLMETKLQSGSISKFRNSLHFPHGIEVPQIGLSGGVMLLWKSDKDIAINNYS 405

Query: 100 NNHIDGWITWDD---YHWRLTCIGFPTADMRDQTTKR-----RVAEINALFIL----QLL 159
           +NHID ++ + D   +H+     G P  + R  T  +      VA +    ++    +  
Sbjct: 406 SNHIDCFVEFHDGQSFHF-TGFYGHPQVNQRIHTWTKLKRCFNVAPLRPWLVMGDFNENF 465

Query: 160 TDYVQQFFLSLDPNDQDFD---------------------ISLRDLPRFVDSEMNVVLMQ 219
            D  Q  F +L  ++ + D                       L  +P  +  E + +L Q
Sbjct: 466 ADTSQSHFTALKHSEANLDELLSQEETYWHHRGIDQFALEAVLSTIPTTISIENHAILSQ 525

Query: 220 PFTEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINE 279
           P+T +++  ALK    +++PG DG+S  FY N+W IVG  V  + L VLN GC P  +N 
Sbjct: 526 PYTSEDVYHALKSMSEYRSPGLDGMSVMFYTNYWPIVGSLVTSAVLDVLNNGCDPTLLNR 585

Query: 280 TMIVLVSKVKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCV 339
           T+I L+ KVK P +++ +RPISL NV YKL+SK +V R++  L  +IS  QSAF+    +
Sbjct: 586 TLITLIPKVKKPTKITQYRPISLCNVLYKLVSKTIVLRLQPFLHLVISEFQSAFLSQHLI 645

Query: 340 VDNAILGFECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ------ 399
            DN ++ FE +H ++ +  G   +A +KLDMSKA+D +EW  ++ VM +MGF        
Sbjct: 646 TDNVLVAFEVLHSIKNRKRGNKGFAAMKLDMSKAFDRVEWHLIQQVMLKMGFGDTIVQII 705

Query: 400 ---------------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISG 459
                                      +QGDP S YLFL+C E  S LL+  E Q  + G
Sbjct: 706 SRCVQSVSYSFLLNGAIHGNITPHRGIRQGDPLSPYLFLICSECFSRLLQHEENQGNLEG 765

BLAST of Lag0021974 vs. ExPASy TrEMBL
Match: A0A2N9I335 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48349 PE=4 SV=1)

HSP 1 Score: 285.8 bits (730), Expect = 3.2e-73
Identity = 151/356 (42.42%), Postives = 210/356 (58.99%), Query Frame = 0

Query: 135 VAEINALFILQLLTDYVQQFFLSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEIL 194
           V + + + +  +  DY Q  F S +P D+  +  L  L R V  EMN +L++ F  +E+ 
Sbjct: 279 VLQTDKIKMANIAVDYFQSIFSSSNPGDETINSCLDGLERVVTEEMNNMLLEDFNSEEVS 338

Query: 195 RALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSK 254
           +ALKQ +P KAPGPDG+S  FY+ +W IVGP V Q+ L++L+ G     IN T I L+ K
Sbjct: 339 QALKQMYPTKAPGPDGMSAVFYQTYWDIVGPEVTQAILSILHSGYMVNKINYTHIALIPK 398

Query: 255 VKAPRRVSYFRPISLYNVSYKLISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGF 314
           VK P R++ FRPISL NV YK++SK L NR+K +LP +IS +QSAF+PGR + DN ++ F
Sbjct: 399 VKNPERITDFRPISLCNVIYKIVSKILANRLKKVLPYVISESQSAFVPGRLITDNVLVAF 458

Query: 315 ECIHELRRQTGGKSKWATLKLDMSKAYDMIEWSFLRIVMARMGFAQ-------------- 374
           E +H +  +  G+     LKLDMSKAYD +EW F+  +M R+GFA+              
Sbjct: 459 EVMHSMSLKRIGRRGQMALKLDMSKAYDRVEWVFVEAIMRRLGFAEDWINLIMMCLKSVS 518

Query: 375 -------------------QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSP 434
                              +QGD  S YLFLLC EGLS LLR A  +  ISG   +R  P
Sbjct: 519 YSVLINGEQHGYFKASRGIRQGDSLSPYLFLLCAEGLSFLLRKAVMEKKISGVAASRGGP 578

Query: 435 PISHLFFANDSLLFFRANAMGVVAIRDLLIRYERTSGQVVNYEKSVVAFSPNTRKD 458
            ++HLFFA+DSLLF +A     +A+  +L +YE  SGQ +N  K+ + F+ NT  D
Sbjct: 579 KLTHLFFADDSLLFCQATMANCLAVSHILQQYEMVSGQQLNRAKTSLFFTRNTSSD 634

BLAST of Lag0021974 vs. ExPASy TrEMBL
Match: A0A803QP43 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 4.2e-73
Identity = 168/444 (37.84%), Postives = 237/444 (53.38%), Query Frame = 0

Query: 49  LSETKLSSNRMAPAKRALGFENCFCVDNRGKSGGLALLWNSSVTFSLLSYSNNHIDGWIT 108
           +++   + N  A    +LG   CF VD +GKSGGLALLW       + SY+++HID  + 
Sbjct: 418 MTDLTFAMNLEAANSTSLG---CFSVDAKGKSGGLALLWTKDYAVHINSYTSSHIDALVD 477

Query: 109 WD-DYHWRLTCIGFPTADMRDQTTKRRVAEINALF------------ILQLLTDYVQQFF 168
            +  + WR T  GF  +     +T+++   I  LF            I  +   Y +Q F
Sbjct: 478 NNLGFSWRFT--GFYGSPDPGASTRQKKNTIEGLFDDNQQWKTNEEDIENIAITYFKQLF 537

Query: 169 LSLDPNDQDFDISLRDLPRFVDSEMNVVLMQPFTEDEILRALKQSHPHKAPGPDGLSGSF 228
              +      +I  R +P  ++ + N  L++PFT DE+  ++   HP KAPG DGL G F
Sbjct: 538 TKANGGVAIQEILNRCVPNRLNIDDNSKLLEPFTRDEVQASMFHIHPLKAPGKDGLPGLF 597

Query: 229 YKNHWSIVGPSVIQSCLAVLNQGCSPGSINETMIVLVSKVKAPRRVSYFRPISLYNVSYK 288
           ++  W  VG  VI +CL +LN       INET+I L+ KV  P ++  FRPISL NV YK
Sbjct: 598 FQKSWDTVGKEVINACLDILNNNADCSPINETLICLMPKVPKPTKMFEFRPISLCNVVYK 657

Query: 289 LISKALVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKL 348
           ++SK L NRMK  L  +IS NQSAFI GR + DNAI+GFE +H +R+   G  +   +KL
Sbjct: 658 VVSKCLANRMKKCLNTVISANQSAFIGGRIIQDNAIIGFESLHCMRKVRFGNGRKMAVKL 717

Query: 349 DMSKAYDMIEWSFLRIVMARMGF---------------------------------AQQQ 408
           DMSKAYD +EW FL  +M  +GF                                   +Q
Sbjct: 718 DMSKAYDRVEWDFLEAMMISLGFDHKWITKIMNCVRTVSFSILINGSIKGFFIPERGLRQ 777

Query: 409 GDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDSLLFFRANAMG 447
           GDP SL+LFLLC EGLS ++  AE+   I G RF      +SHL FA++SL+F  A    
Sbjct: 778 GDPLSLFLFLLCSEGLSCMIFEAERAGKIHGLRFGNMEQRLSHLLFADNSLVFIDATMEE 837

BLAST of Lag0021974 vs. TAIR 10
Match: AT4G20520.1 (RNA binding;RNA-directed DNA polymerases )

HSP 1 Score: 75.1 bits (183), Expect = 1.6e-13
Identity = 34/80 (42.50%), Postives = 48/80 (60.00%), Query Frame = 0

Query: 281 LVNRMKHILPKLISPNQSAFIPGRCVVDNAILGFECIHELRRQTGGKSKWATLKLDMSKA 340
           +V R+K ++  LI P Q++FIPGR   DN +   E +H +RR+ G K  W  LKLD+ KA
Sbjct: 1   MVERLKPLMTNLIGPAQASFIPGRVSTDNIVFVQEAVHSMRRKKGVKG-WMLLKLDLEKA 60

Query: 341 YDMIEWSFLRIVMARMGFAQ 361
           YD I W +L   +   GF +
Sbjct: 61  YDRIRWDYLEDTLISAGFPE 79

BLAST of Lag0021974 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 58.2 bits (139), Expect = 2.1e-08
Identity = 27/52 (51.92%), Postives = 37/52 (71.15%), Query Frame = 0

Query: 361 QQGDPRSLYLFLLCVEGLSSLLRGAEQQYLISGFRFARSSPPISHLFFANDS 413
           +QGDP S YLF+LC E LS L R A++Q  + G R + +SP I+HL FA+D+
Sbjct: 29  RQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRINHLLFADDT 80

BLAST of Lag0021974 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 53.9 bits (128), Expect = 3.9e-07
Identity = 30/90 (33.33%), Postives = 44/90 (48.89%), Query Frame = 0

Query: 189 TEDEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVIQSCLAVLNQGCSPGSINETM 248
           ++ EI  A+     +KAPGPD  +  F+   W +V  S I +       G      N T 
Sbjct: 532 SDKEITAAVFAMPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATA 591

Query: 249 IVLVSKVKAPRRVSYFRPISLYNVSYKLIS 279
           I L+ KV    ++S FRP+S   V YK+I+
Sbjct: 592 ITLIPKVTGVDQLSMFRPVSCCTVVYKIIT 621

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_018815621.19.2e-7537.44uncharacterized protein LOC108987197 [Juglans regia] >KAF5442204.1 hypothetical ... [more]
XP_028068804.11.5e-7246.15uncharacterized protein LOC114271378 [Camellia sinensis][more]
XP_017250619.12.5e-7241.21PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus][more]
XP_030505314.13.3e-7244.54uncharacterized protein LOC115720302 [Cannabis sativa][more]
XP_030505314.11.8e-0633.06uncharacterized protein LOC115720302 [Cannabis sativa][more]
Match NameE-valueIdentityDescription
P143819.4e-2229.00Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV... [more]
O003701.8e-2024.07LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1[more]
P085482.5e-1924.00LINE-1 reverse transcriptase homolog OS=Nycticebus coucang OX=9470 PE=4 SV=1[more]
P113691.3e-1826.51LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE... [more]
P925552.9e-0751.92Uncharacterized mitochondrial protein AtMg01250 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A2N9GJ356.9e-7643.87Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1[more]
A0A2N9GJ353.5e-1137.50Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27322 PE=3 SV=1[more]
A0A803PQT03.8e-7434.77Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A2N9I3353.2e-7342.42Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A803QP434.2e-7337.84Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G20520.11.6e-1342.50RNA binding;RNA-directed DNA polymerases [more]
ATMG01250.12.1e-0851.92RNA-directed DNA polymerase (reverse transcriptase) [more]
AT1G43760.13.9e-0733.33DNAse I-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 262..360
e-value: 2.6E-16
score: 59.8
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 36..200
e-value: 2.5E-6
score: 29.4
NoneNo IPR availablePANTHERPTHR19446:SF440SUBFAMILY NOT NAMEDcoord: 162..455
NoneNo IPR availablePANTHERPTHR19446REVERSE TRANSCRIPTASEScoord: 162..455
NoneNo IPR availableCDDcd01650RT_nLTR_likecoord: 249..459
e-value: 9.42479E-42
score: 145.897
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 187..359

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0021974.1Lag0021974.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090304 nucleic acid metabolic process
molecular_function GO:0003824 catalytic activity