Lcy07g007050 (gene) Sponge gourd (P93075) v1

Overview
Name: Lcy07g007050
Type: gene
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Location: Chr07: 33565738 .. 33567359 (-)
RNA-Seq Expression: Lcy07g007050
Synteny: Lcy07g007050
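The Location field can be read programmatically. The sketch below parses the string into chromosome, start, end, and strand, and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation); the page itself does not state the convention.

```python
import re


def parse_location(text):
    """Split a location such as 'Chr07: 33565738 .. 33567359 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("Chr07: 33565738 .. 33567359 (-)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1,622 bp on the minus strand of Chr07.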
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATTTGGCTTCTGCGAGGTTGAAAATAGTCCATCTTGGATACTTGAGTTCGGATCATAGGTCAATCTCGGCTGAGTTGGAATGGGAAATAGTCACAAAAAGGAAAAAAGGAAGGAGCAAAAGATTACGTTTCGAAGAAGGGTGGATAAAGATAAAAGACACAAGGAAGATAATAGCAGAAAGTTGGGGGAATGGGGCCATAAGTGATGCAAAACAGTTCACGACCAACATGGACAGATGCATTAAAACCCTTCACAATTGGAGTAAAAGAAGGCTGGAAGGAAGTCTGAAAGCGGCCATTGGCAGAAGAGAAAAGGAAATAAAGGCTATACGAGGAGGGGGAACAGGAGAATTGTGAAATGATATCTACTATGGAAGAGGAGCTGGATGACCTCCTAAGGGAAGAGGAGTAGAGGACTACTGGAGAAGTAGATCCAGGGAAGCTTGGTTACAACATGGTGACAAGAACACAAAGTGGTTCCACTCCAAGGCTAACCAAAGGAAGAAACGAAATGAGATAAGGGGGATATTGAACAGGGAAGGTCAATGGGTTGAGGATGAAGATGAAATCGGTGAGACTGCCTCTGAGTACTTTAGAGAACTCTTCCAATCATCCTATCCTGACTTGGAAGCTATTGCCTCAACAACAGAAGGGATAAATAATTGCTTATCGGTTCCGGATGTTAGGGAGCTGGAGAGGCAGTTTACTCGTGGGGAAGTTGAAAAAGCTATCAAGATGATGAACCCCAGCAAAGCCCCCGGTATAGATGGGATGCATGCGGCCTTCTACCAGAATTACTGGAGCATTGTAGGGGATGATACCGTTAGTATGTGCTTACAAATTTTAAACCAAGAAGGAGATATTGCCCCTCTGAACAAAACGCAGATAGCACTAATCTCCAAGGTCAAAGACCCAAAGTTAATGAGCGATTTCGGACCGATTAGTCTATGCAATGTCAGCTATAAGATTATAGCCAAAACCATTGCGAATAGGCTAAAAAGAGTCCTTGACAAGATCATCTCCCCTACCCAAGCAGCATTCGTGCCAGGCAGACAAATCTCAGACAATGTCCTGGTCGGATTCGAATGTATTCATGCGATCAACAGTAGAAGAAAAGGGAAAGAGGGTCAGATAGCCATAAAGCTAGACATGAACAAGGCATATGACAGGGTCGAATGGGTGTTTATTAGGAGCATGCTAGCGAAGATGGGGTTCAGTGAGAAGTGGGTTAATTTGATTATGAGGTGTGTTGAACCAGTGTCTTTTTCGATTTTGGTAAACGGATATCCCCAGAATGAATTCTCCCCAGGCAGAGGAATCAGACAAGGAGATCCTCTATCGCCGTATCTTTCCTCATATGCGCTGAAGGCTTCTCCAACATTCTAAATTGGGAAGTTGAAAATCGAAACCTTCAAGGTTTTTGCATTAACAACTTTTGTCCCCCTTTATCCCACCTATTTTTTGCTGACGATAGTCTCATTTTTTGCAGGGCAACGATGGAGGAGTGCACATCGATTAAAAGAGCGTGCCAGCTATATGAGAGAGCTTCCGGGCAAAAGATCAATTTTGATAAGTCAAGACTAATGGTCAATAAGAATGTTAATGAAGAAAAGGGCTAA

mRNA sequence

ATGGATTTGGCTTCTGCGAGGTTGAAAATAGTCCATCTTGGATACTTGAGTTCGGATCATAGGTCAATCTCGGCTGAAGGAGTAGAGGACTACTGGAGAAGTAGATCCAGGGAAGCTTGGTTACAACATGGTGACAAGAACACAAAGTGGTTCCACTCCAAGGCTAACCAAAGGAAGAAACGAAATGAGATAAGGGGGATATTGAACAGGGAAGGTCAATGGGTTGAGGATGAAGATGAAATCGGTGAGACTGCCTCTGAGTACTTTAGAGAACTCTTCCAATCATCCTATCCTGACTTGGAAGCTATTGCCTCAACAACAGAAGGGATAAATAATTGCTTATCGGTTCCGGATGTTAGGGAGCTGGAGAGGCAGTTTACTCGTGGGGAAGTTGAAAAAGCTATCAAGATGATGAACCCCAGCAAAGCCCCCGGTATAGATGGGATGCATGCGGCCTTCTACCAGAATTACTGGAGCATTGTAGGGGATGATACCGTTAGTATGTGCTTACAAATTTTAAACCAAGAAGGAGATATTGCCCCTCTGAACAAAACGCAGATAGCACTAATCTCCAAGGTCAAAGACCCAAAGTTAATGAGCGATTTCGGACCGATTAGTCTATGCAATGTCAGCTATAAGATTATAGCCAAAACCATTGCGAATAGGCTAAAAAGAGTCCTTGACAAGATCATCTCCCCTACCCAAGCAGCATTCGTGCCAGGCAGACAAATCTCAGACAATGTCCTGGTCGGATTCGAATGTATTCATGCGATCAACAGTAGAAGAAAAGGGAAAGAGGGTCAGATAGCCATAAAGCTAGACATGAACAAGGCATATGACAGGGTCGAATGGGTGTTTATTAGGAGCATGCTAGCGAAGATGGGGTTCAGTGAGAAGTGGGTTAATTTGATTATGAGGGCAACGATGGAGGAGTGCACATCGATTAAAAGAGCGTGCCAGCTATATGAGAGAGCTTCCGGGCAAAAGATCAATTTTGATAAGTCAAGACTAATGGTCAATAAGAATGTTAATGAAGAAAAGGGCTAA

Coding sequence (CDS)

ATGGATTTGGCTTCTGCGAGGTTGAAAATAGTCCATCTTGGATACTTGAGTTCGGATCATAGGTCAATCTCGGCTGAAGGAGTAGAGGACTACTGGAGAAGTAGATCCAGGGAAGCTTGGTTACAACATGGTGACAAGAACACAAAGTGGTTCCACTCCAAGGCTAACCAAAGGAAGAAACGAAATGAGATAAGGGGGATATTGAACAGGGAAGGTCAATGGGTTGAGGATGAAGATGAAATCGGTGAGACTGCCTCTGAGTACTTTAGAGAACTCTTCCAATCATCCTATCCTGACTTGGAAGCTATTGCCTCAACAACAGAAGGGATAAATAATTGCTTATCGGTTCCGGATGTTAGGGAGCTGGAGAGGCAGTTTACTCGTGGGGAAGTTGAAAAAGCTATCAAGATGATGAACCCCAGCAAAGCCCCCGGTATAGATGGGATGCATGCGGCCTTCTACCAGAATTACTGGAGCATTGTAGGGGATGATACCGTTAGTATGTGCTTACAAATTTTAAACCAAGAAGGAGATATTGCCCCTCTGAACAAAACGCAGATAGCACTAATCTCCAAGGTCAAAGACCCAAAGTTAATGAGCGATTTCGGACCGATTAGTCTATGCAATGTCAGCTATAAGATTATAGCCAAAACCATTGCGAATAGGCTAAAAAGAGTCCTTGACAAGATCATCTCCCCTACCCAAGCAGCATTCGTGCCAGGCAGACAAATCTCAGACAATGTCCTGGTCGGATTCGAATGTATTCATGCGATCAACAGTAGAAGAAAAGGGAAAGAGGGTCAGATAGCCATAAAGCTAGACATGAACAAGGCATATGACAGGGTCGAATGGGTGTTTATTAGGAGCATGCTAGCGAAGATGGGGTTCAGTGAGAAGTGGGTTAATTTGATTATGAGGGCAACGATGGAGGAGTGCACATCGATTAAAAGAGCGTGCCAGCTATATGAGAGAGCTTCCGGGCAAAAGATCAATTTTGATAAGTCAAGACTAATGGTCAATAAGAATGTTAATGAAGAAAAGGGCTAA

Protein sequence

MDLASARLKIVHLGYLSSDHRSISAEGVEDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEYFRELFQSSYPDLEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPGIDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPISLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKEGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIMRATMEECTSIKRACQLYERASGQKINFDKSRLMVNKNVNEEKG
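The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch using the standard genetic code; the function name `translate` is hypothetical, and only short fragments from the start and end of the CDS above are used for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
print(translate("ATGGATTTGGCTTCTGCGAGG"))
print(translate("GAAGAAAAGGGCTAA"))
```

The two fragments translate to the first residues (MDLASAR) and last residues (EEKG) of the protein sequence shown above, with the final TAA acting as the stop codon.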
Homology
BLAST of Lcy07g007050 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 121.3 bits (303), Expect = 2.1e-26
Identity = 77/270 (28.52%), Positives = 140/270 (51.85%), Query Frame = 0

Query: 35  RSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEYFRELFQ 94
           RSR   L   D+ +++F++   ++  R +I  +   +G  +ED + I + A  +++ LF 
Sbjct: 359 RSRMQLLCDMDRGSRFFYALEKKKGNRKQITCLFAEDGTPLEDPEAIRDRARSFYQNLFS 418

Query: 95  SSYPDLEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPGIDGMHAAFY 154
                 +A     +G+   +S      LE   T  E+ +A+++M  +K+PG+DG+   F+
Sbjct: 419 PDPISPDACEELWDGL-PVVSERRKERLETPITLDELSQALRLMPHNKSPGLDGLTIEFF 478

Query: 155 QNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPISLCNVSYKI 214
           Q +W  +G D   +  +   +        +  ++L+ K  D +L+ ++ P+SL +  YKI
Sbjct: 479 QFFWDTLGPDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGDLRLIKNWRPVSLLSTDYKI 538

Query: 215 IAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKEGQIAIKLD 274
           +AK I+ RLK VL ++I P Q+  VPGR I DNV +  + +H   +RR G      + LD
Sbjct: 539 VAKAISLRLKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLLHF--ARRTGL-SLAFLSLD 598

Query: 275 MNKAYDRVEWVFIRSMLAKMGFSEKWVNLI 305
             KA+DRV+  ++   L    F  ++V  +
Sbjct: 599 QEKAFDRVDHQYLIGTLQAYSFGPQFVGYL 624
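The percentages in each HSP header are the match counts divided by the alignment length. The sketch below re-derives them from the header line of the P14381 alignment; the function name `hsp_percentages` is hypothetical, and rounding to two decimals is an assumption about how the report formats the values.

```python
import re


def hsp_percentages(header):
    """Return (count, length, recomputed percent, reported percent) per ratio."""
    results = []
    for count, length, reported in re.findall(
        r"(\d+)/(\d+) \((\d+(?:\.\d+)?)%\)", header
    ):
        recomputed = round(100.0 * int(count) / int(length), 2)
        results.append((int(count), int(length), recomputed, float(reported)))
    return results


line = "Identity = 77/270 (28.52%), Positives = 140/270 (51.85%), Query Frame = 0"
for count, length, recomputed, reported in hsp_percentages(line):
    print(f"{count}/{length}: recomputed {recomputed}, reported {reported}")
```

For this alignment, both recomputed values agree with the reported ones: 77 identical and 140 similar positions over 270 aligned columns.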

BLAST of Lcy07g007050 vs. ExPASy Swiss-Prot
Match: O00370 (LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 1.4e-22
Identity = 81/288 (28.12%), Positives = 139/288 (48.26%), Query Frame = 0

Query: 41  LQHGDKNTKWFHSKAN-----------QRKKRNEIRGILNREGQWVEDEDEIGETASEYF 100
           LQ  +++  WF  + N           +++++N+I  I N +G    D  EI  T  EY+
Sbjct: 356 LQKINESRSWFFERINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQTTIREYY 415

Query: 101 RELFQSSYPDLEAIASTTEGIN-NCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPGIDG 160
           + L+ +   +LE + +  +      L+  +V  L R  T  E+   I  +   K+PG DG
Sbjct: 416 KHLYANKLENLEEMDTFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDG 475

Query: 161 MHAAFYQNYWSIVGDDTVSMCLQILNQEGDIA-PLNKTQIALISKV-KDPKLMSDFGPIS 220
             A FYQ Y   +    + +  Q + +EG +     +  I LI K  +D     +F PIS
Sbjct: 476 FTAEFYQRYKEELVPFLLKL-FQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPIS 535

Query: 221 LCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKE 280
           L N+  KI+ K +ANR+++ + K+I   Q  F+PG Q   N+      I  IN  R   +
Sbjct: 536 LMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHIN--RAKDK 595

Query: 281 GQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIMRATMEECTS 315
             + I +D  KA+D+++  F+   L K+G    ++ +I RA  ++ T+
Sbjct: 596 NHVIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKII-RAIYDKPTA 639

BLAST of Lcy07g007050 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 105.5 bits (262), Expect = 1.2e-21
Identity = 78/293 (26.62%), Positives = 135/293 (46.08%), Query Frame = 0

Query: 41  LQHGDKNTKWFHSKANQ-----------RKKRNEIRGILNREGQWVEDEDEIGETASEYF 100
           +Q  ++   WF  K N+            + +  I  I N +G    D +EI  T   ++
Sbjct: 363 IQRINQTRSWFFEKINKIDKPLARLTKGHRDKILINKIRNEKGDITTDPEEIQNTIRSFY 422

Query: 101 RELFQSSYPDLEAIASTTEGIN-NCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPGIDG 160
           + L+ +   +L+ +    +      L+   V  L    +  E+E  I  +   K+PG DG
Sbjct: 423 KRLYSTKLENLDEMDKFLDRYQVPKLNQDQVDHLNSPISPKEIEAVINSLPTKKSPGPDG 482

Query: 161 MHAAFYQNYWSIVGDDTVSMCLQILNQ---EGDIA-PLNKTQIALISK-VKDPKLMSDFG 220
             A FYQ +     +D + +  ++ ++   EG +     +  I LI K  KDP  + +F 
Sbjct: 483 FSAEFYQTF----KEDLIPILHKLFHKIEVEGTLPNSFYEATITLIPKPQKDPTKIENFR 542

Query: 221 PISLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRK 280
           PISL N+  KI+ K +ANR++  +  II P Q  F+PG Q   N+      IH IN  + 
Sbjct: 543 PISLMNIDAKILNKILANRIQEHIKAIIHPDQVGFIPGMQGWFNIRKSINVIHYINKLK- 602

Query: 281 GKEGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIMRATMEECTSIK 317
             +  + I LD  KA+D+++  F+  +L + G    ++N+I     +   +IK
Sbjct: 603 -DKNHMIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIK 649

BLAST of Lcy07g007050 vs. ExPASy TrEMBL
Match: A0A2N9GPZ7 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS29430 PE=4 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 8.0e-77
Identity = 141/280 (50.36%), Positives = 203/280 (72.50%), Query Frame = 0

Query: 26  EGVEDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETA 85
           E  E +WR RSR AW+  GDKNTK+FH++ N+R++ N I G+ +R+G W  ++ +I E A
Sbjct: 203 EKEEIFWRQRSRVAWMSEGDKNTKFFHAQCNERRRTNHISGLRDRDGVWQTEKTKIAEIA 262

Query: 86  SEYFRELFQSSYPDLEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPG 145
            +YF+ +F SS P  E+I +  +G+ + ++     +L+ +FT+ EV  A+K M P+KAPG
Sbjct: 263 VDYFQGIFTSSNPSAESITTVLQGMESVVTNAMNDQLQAEFTKDEVSLALKQMYPTKAPG 322

Query: 146 IDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPI 205
            DGM A FYQ YW IVG +     L IL+    +  +N T IALI KVK+P+ ++DF PI
Sbjct: 323 PDGMSAIFYQTYWDIVGPEVTQAILSILHSGYMLRKINYTHIALIPKVKNPENITDFRPI 382

Query: 206 SLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGK 265
           SLCNV YKI++K +ANRLK+VL  +IS  Q+AFVPGR I+DNVLV FE +H+++ +RKGK
Sbjct: 383 SLCNVIYKIVSKVLANRLKKVLPFVISEAQSAFVPGRLITDNVLVAFEVMHSMSLKRKGK 442

Query: 266 EGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM 306
           +GQ+A+KLDM+KAYDRVEWVF+ S++  MGF+++W+ L+M
Sbjct: 443 KGQMALKLDMSKAYDRVEWVFLESIMRGMGFAKEWIRLMM 482

BLAST of Lcy07g007050 vs. ExPASy TrEMBL
Match: A0A2N9IPS8 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS55418 PE=4 SV=1)

HSP 1 Score: 297.4 bits (760), Expect = 8.0e-77
Identity = 141/280 (50.36%), Positives = 203/280 (72.50%), Query Frame = 0

Query: 26   EGVEDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETA 85
            E  E +WR RSR AW+  GDKNTK+FH++ N+R++ N I G+ +R+G W  ++ +I E A
Sbjct: 778  EKEEIFWRQRSRVAWMSEGDKNTKFFHAQCNERRRTNHISGLRDRDGVWQTEKTKIAEIA 837

Query: 86   SEYFRELFQSSYPDLEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPG 145
             +YF+ +F SS P  E+I +  +G+ + ++     +L+ +FT+ EV  A+K M P+KAPG
Sbjct: 838  VDYFQGIFTSSNPSAESITTVLQGMESVVTNAMNDQLQAEFTKDEVSLALKQMYPTKAPG 897

Query: 146  IDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPI 205
             DGM A FYQ YW IVG +     L IL+    +  +N T IALI KVK+P+ ++DF PI
Sbjct: 898  PDGMSAIFYQTYWDIVGPEVTQAILSILHSGYMLRKINYTHIALIPKVKNPENITDFRPI 957

Query: 206  SLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGK 265
            SLCNV YKI++K +ANRLK+VL  +IS  Q+AFVPGR I+DNVLV FE +H+++ +RKGK
Sbjct: 958  SLCNVIYKIVSKVLANRLKKVLPFVISEAQSAFVPGRLITDNVLVAFEVMHSMSLKRKGK 1017

Query: 266  EGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM 306
            +GQ+A+KLDM+KAYDRVEWVF+ S++  MGF+++W+ L+M
Sbjct: 1018 KGQMALKLDMSKAYDRVEWVFLESIMRGMGFAKEWIRLMM 1057

BLAST of Lcy07g007050 vs. ExPASy TrEMBL
Match: A0A2N9GM07 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS28341 PE=4 SV=1)

HSP 1 Score: 286.2 bits (731), Expect = 1.8e-73
Identity = 140/279 (50.18%), Positives = 190/279 (68.10%), Query Frame = 0

Query: 29  EDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEY 88
           E  W+ R+R  WL  GDKNT++FH KA+QR++RNEI+G+    G WV ++ +I  T  EY
Sbjct: 94  EQMWKQRARVQWLMEGDKNTRYFHIKASQRRQRNEIKGLFTTTGVWVTNKYDIQHTVVEY 153

Query: 89  FRELFQSSYPD--LEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPGI 148
           F ++F +S P    E + +    +   ++     +L R FT  EV +A++ M+P+KAPG 
Sbjct: 154 FEQMFTTSMPSHVQEGVRAIQLKVTRAMN----DQLCRNFTAEEVHQALQQMHPTKAPGP 213

Query: 149 DGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPIS 208
           DGM A F+Q YW+IVG +     LQ+LN     A  NKT IALI K K P+ M++F PIS
Sbjct: 214 DGMSAVFFQKYWAIVGKEVTEEVLQVLNTNASAAAYNKTNIALIPKTKTPQRMTEFRPIS 273

Query: 209 LCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKE 268
           LCNV+YK+I+K IANRLK VL  +IS TQ+AFVPGR I+DN LV FE +H    +R GK+
Sbjct: 274 LCNVTYKLISKVIANRLKSVLSDLISETQSAFVPGRNITDNALVAFEIMHYFQQKRSGKD 333

Query: 269 GQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM 306
             +A+KLDM+KAYDRVEWVFI  ++ K+GF EKW++LIM
Sbjct: 334 TYMALKLDMSKAYDRVEWVFIEQVMKKLGFCEKWISLIM 368

BLAST of Lcy07g007050 vs. ExPASy TrEMBL
Match: A0A2N9I335 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS48349 PE=4 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 2.0e-72
Identity = 138/280 (49.29%), Positives = 194/280 (69.29%), Query Frame = 0

Query: 26  EGVEDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETA 85
           E  E YWR RSR +W++ GDKNTK+FH+  N R++ N I+G+ + EG    D+ ++   A
Sbjct: 232 ENEEIYWRQRSRVSWMREGDKNTKFFHAHCNHRREINLIKGLRDNEGVLQTDKIKMANIA 291

Query: 86  SEYFRELFQSSYPDLEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPG 145
            +YF+ +F SS P  E I S  +G+   ++      L   F   EV +A+K M P+KAPG
Sbjct: 292 VDYFQSIFSSSNPGDETINSCLDGLERVVTEEMNNMLLEDFNSEEVSQALKQMYPTKAPG 351

Query: 146 IDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPI 205
            DGM A FYQ YW IVG +     L IL+    +  +N T IALI KVK+P+ ++DF PI
Sbjct: 352 PDGMSAVFYQTYWDIVGPEVTQAILSILHSGYMVNKINYTHIALIPKVKNPERITDFRPI 411

Query: 206 SLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGK 265
           SLCNV YKI++K +ANRLK+VL  +IS +Q+AFVPGR I+DNVLV FE +H+++ +R G+
Sbjct: 412 SLCNVIYKIVSKILANRLKKVLPYVISESQSAFVPGRLITDNVLVAFEVMHSMSLKRIGR 471

Query: 266 EGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM 306
            GQ+A+KLDM+KAYDRVEWVF+ +++ ++GF+E W+NLIM
Sbjct: 472 RGQMALKLDMSKAYDRVEWVFVEAIMRRLGFAEDWINLIM 511

BLAST of Lcy07g007050 vs. ExPASy TrEMBL
Match: A0A7N2LIH6 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1)

HSP 1 Score: 282.0 bits (720), Expect = 3.5e-72
Identity = 152/400 (38.00%), Positives = 224/400 (56.00%), Query Frame = 0

Query: 29   EDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEY 88
            E  W+ RSR +WLQ+GDKN+K+FH+ A+QR+++N I G+++  G W ED++   +   +Y
Sbjct: 1029 EVMWKQRSRVSWLQYGDKNSKFFHATASQRRQKNRIGGLMDDLGVWHEDQETTEKLILDY 1088

Query: 89   FRELFQSSYPDLEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPGIDG 148
            F++++ S+ P   +   + E ++  ++     EL+++F   EV +A++ M+P+KAPG DG
Sbjct: 1089 FKDIYSSNQP--TSFDVSLEAMDERVTPEMNDELQKEFKAVEVWQALQQMHPTKAPGPDG 1148

Query: 149  MHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPISLC 208
            M   FYQ YW IVG    +  LQ LN       +NKT I LI K K+P+ +++F PISLC
Sbjct: 1149 MSPIFYQKYWDIVGSSVTNCVLQALNSGVMPKDINKTYICLIPKTKNPQKITEFRPISLC 1208

Query: 209  NVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKEGQ 268
            NV YKII+K +ANRLK+VL  +I   Q+AFVPGR I+DNV+V FE +H+IN RRKGKEG 
Sbjct: 1209 NVIYKIISKVLANRLKKVLHGVIDEAQSAFVPGRMITDNVIVAFESMHSINQRRKGKEGL 1268

Query: 269  IAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM----------------------- 328
            +AIKLDM+KAYDRVEW ++ SM+ KMGF ++W++LIM                       
Sbjct: 1269 MAIKLDMSKAYDRVEWAYLESMMKKMGFGDRWISLIMMCVTSVSFSVLINGEPKGSFTPS 1328

Query: 329  -----------------------------------------------------------R 347
                                                                       R
Sbjct: 1329 RGLRQGDPISPYLFLLCGEGLSAMIKKKEREGLIRGVVAARQAPRISHLFFADDSIIFCR 1388

BLAST of Lcy07g007050 vs. NCBI nr
Match: XP_030923330.1 (uncharacterized protein LOC115950239 [Quercus lobata])

HSP 1 Score: 299.3 bits (765), Expect = 4.4e-77
Identity = 149/288 (51.74%), Positives = 206/288 (71.53%), Query Frame = 0

Query: 29   EDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEY 88
            E YW  RSR  WL+HGD+NTK+FH+KA+QR+++N IRGI N +GQWVE+ +E+G+ A++Y
Sbjct: 734  EIYWAQRSRINWLRHGDRNTKFFHAKASQRRRKNFIRGIRNSQGQWVENLEEVGQVAADY 793

Query: 89   FRELFQSSYPDLEAIASTTEGINNCLSVPDVRE-LERQFTRGEVEKAIKMMNPSKAPGID 148
            F  LFQ+   D   +    + ++  ++  D+RE L  QFT  EV+ A+  M P+KAPG D
Sbjct: 794  FDNLFQAGAGD--QMEECLDAVDTKVT-EDMREFLSNQFTAEEVQAALFQMGPTKAPGPD 853

Query: 149  GMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPISL 208
            GM+A FYQ +W IVGD  VS  L  LN    +  +N T I LI KV++P+ MS+F PISL
Sbjct: 854  GMNALFYQKFWHIVGDSVVSAVLDFLNNGNMLPEINHTNIVLIPKVQNPERMSEFRPISL 913

Query: 209  CNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKEG 268
            CNV YKII+K +ANRLK+VL +IIS TQ+AFVPGR I+DNVLV +E +H +++R+KGK+G
Sbjct: 914  CNVIYKIISKVLANRLKQVLPQIISSTQSAFVPGRLITDNVLVAYETLHTMHARKKGKKG 973

Query: 269  QIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIMRATMEECTSI 316
             +A+KLD++KAYDRVEW F++S++ KMGF   W+  +M        SI
Sbjct: 974  DVALKLDISKAYDRVEWHFLQSIMEKMGFPAGWIERVMSCVTTPSFSI 1018

BLAST of Lcy07g007050 vs. NCBI nr
Match: XP_024038343.1 (uncharacterized protein LOC112097373 [Citrus clementina])

HSP 1 Score: 295.0 bits (754), Expect = 8.2e-76
Identity = 151/355 (42.54%), Positives = 213/355 (60.00%), Query Frame = 0

Query: 29  EDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEY 88
           E YW+ RSR  WL+ GDKNTK+FH KA+ RKK+N I GI N  G W+E+ + +    ++Y
Sbjct: 301 EIYWKQRSRADWLKGGDKNTKFFHHKASSRKKKNRIWGIENAAGNWIENAEGVEFEFNKY 360

Query: 89  FRELFQSSYPDLEAIASTTEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPGIDG 148
           F  LF +S P+ + IA+   GI+  +S      LE  FT  EV +A+  M P+KAPG DG
Sbjct: 361 FTNLFTTSKPNQDQIAAALSGISRRVSTEMNESLEMPFTPEEVVEALTQMCPTKAPGPDG 420

Query: 149 MHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPISLC 208
           + A F+Q +W  V    +S CL ILN++GD+AP N T I LISK   P+ ++DF PISLC
Sbjct: 421 LPAVFFQKHWQRVKQGVLSTCLHILNKQGDVAPFNHTYIVLISKKGKPRKVTDFRPISLC 480

Query: 209 NVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKEGQ 268
           NV Y+I+AK IANRLK VL  +ISP Q+AF+P   I+DN++VG+EC+H I   +  K G 
Sbjct: 481 NVIYRIVAKAIANRLKNVLPNLISPMQSAFIPNWLITDNIIVGYECLHKIRHCKGRKNGL 540

Query: 269 IAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM----------------------- 328
           +A+KLD++KAYD++EWVF+   +  +GFS+ WV+LIM                       
Sbjct: 541 VALKLDVSKAYDKLEWVFLEQTMKSLGFSQNWVSLIMSLLLSSPKAEHQRLIHGLFFGNE 600

Query: 329 ----------------RATMEECTSIKRACQLYERASGQKINFDKSRLMVNKNVN 345
                           RA+  +C ++K+    Y   SGQ  NF+KS + +N N++
Sbjct: 601 LKISHLLFADDSLVFTRASDTDCQNLKKIFDCYSATSGQLFNFEKSSMFLNGNIS 655

BLAST of Lcy07g007050 vs. NCBI nr
Match: XP_023913142.1 (uncharacterized protein LOC112024740 [Quercus suber])

HSP 1 Score: 290.8 bits (743), Expect = 1.5e-74
Identity = 144/284 (50.70%), Positives = 193/284 (67.96%), Query Frame = 0

Query: 29  EDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEY 88
           E +W   SR +WL+HGD+NTK+FHSKA+QR+KRN I GI N+ G WVED  E+ E A  Y
Sbjct: 185 EIFWAQHSRVSWLKHGDRNTKFFHSKASQRRKRNFIHGIQNQHGNWVEDIGEVAEVAINY 244

Query: 89  FRELFQSSYPDLEAIASTTEGINNCLSVPDVR-------ELERQFTRGEVEKAIKMMNPS 148
           F  +F S          T E +  CL+    R       EL + +T  EV+ A+  M P+
Sbjct: 245 FETIFHS---------GTCERMEECLNTVPQRMTTNMKEELSKPYTGEEVKAALFQMGPT 304

Query: 149 KAPGIDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSD 208
           KAPG DGM+A FYQ +W IVG+D  S  L  LN    +  +N T I LI KVK P+ M+D
Sbjct: 305 KAPGPDGMNALFYQRFWHIVGNDVSSAVLDFLNSGTMLPEINYTHIVLIPKVKSPEKMTD 364

Query: 209 FGPISLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSR 268
           F PISLCNV YKII+K +ANRLK +L ++ISPTQ+AFVPGR I+DNVL+ +E +HA++ R
Sbjct: 365 FRPISLCNVIYKIISKVLANRLKTILPQLISPTQSAFVPGRLITDNVLLAYETLHAMHGR 424

Query: 269 RKGKEGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM 306
           +KGK   +A+KLD++KAYDRVEW F++ M+ ++GF E+W+N +M
Sbjct: 425 KKGKTRALALKLDVSKAYDRVEWDFLKGMMIRLGFPEEWINRVM 459

BLAST of Lcy07g007050 vs. NCBI nr
Match: XP_042939444.1 (uncharacterized protein LOC122274474 [Carya illinoinensis])

HSP 1 Score: 287.0 bits (733), Expect = 2.2e-73
Identity = 153/330 (46.36%), Positives = 209/330 (63.33%), Query Frame = 0

Query: 32  WRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEYFRE 91
           W+ R+++AWL+ GD+NTK+FH  ++QRK+ N I  I  + G   +D   I  T  E+F E
Sbjct: 299 WQQRAKQAWLKDGDRNTKFFHQCSSQRKRTNSILKIQTKSGLLTQDPQTICHTLLEFFTE 358

Query: 92  LFQSSYPDLEAIASTTEGINNCLS-----VPD--VRELERQFTRGEVEKAIKMMNPSKAP 151
           LF SS+P          GI++CLS     + D     L   FT  EV++A   MNP  +P
Sbjct: 359 LFTSSHP---------SGIDDCLSPLQKIITDDMFTSLSAVFTEVEVKEAAFSMNPLGSP 418

Query: 152 GIDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGP 211
           G DG  A F+Q YW IVG       L++LN       LN+T IALI K  +P L++DF P
Sbjct: 419 GPDGFPALFFQRYWDIVGSSVTKASLEVLNGGKWNNTLNETLIALIPKKHNPSLVTDFRP 478

Query: 212 ISLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKG 271
           ISLCNV YKIIAKT+ANRLK++L  IISPTQ AFVPGR I+DN++V FE +H + +R KG
Sbjct: 479 ISLCNVLYKIIAKTLANRLKKILPAIISPTQTAFVPGRLITDNIIVAFEALHTMKARLKG 538

Query: 272 KEGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM--------RATMEECTSIK 331
           +EG +A+KLDM+KAYDR+EW F+RS++ KMGF  KW+ L+M        +A   E + ++
Sbjct: 539 REGYMALKLDMSKAYDRIEWAFLRSVMLKMGFPVKWIELVMKYDSLLFCKANSIEWSQLQ 598

Query: 332 RACQLYERASGQKINFDKSRLMVNKNVNEE 347
                YERASGQ++N DK+ +  + N  E+
Sbjct: 599 FLLASYERASGQRLNKDKTSIFFSSNTREQ 619

BLAST of Lcy07g007050 vs. NCBI nr
Match: XP_023881891.1 (uncharacterized protein LOC111994244 [Quercus suber])

HSP 1 Score: 287.0 bits (733), Expect = 2.2e-73
Identity = 162/404 (40.10%), Positives = 222/404 (54.95%), Query Frame = 0

Query: 32  WRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGETASEYFRE 91
           W+ RSR  WL  GD+NTK+FH+KA+ R++RN I GI++  G W +  + I + A  YF+ 
Sbjct: 176 WQQRSRVQWLGLGDRNTKYFHTKASDRRRRNTINGIMDENGNWQDSTEGIAKVAVSYFQT 235

Query: 92  LFQSSYPD-----LEAIAST-TEGINNCLSVPDVRELERQFTRGEVEKAIKMMNPSKAPG 151
           ++ SS P      L+AI +T TE +N+         L ++FTR E+E A+  M+P+KAPG
Sbjct: 236 IYSSSVPTRISEVLDAIPTTVTEEMNH--------SLIQEFTREEIETALNQMHPTKAPG 295

Query: 152 IDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPKLMSDFGPI 211
            DGM A F+Q YW+IVG+D V M L +LN    +  +NKT I L+ K+K+P  MSDF PI
Sbjct: 296 PDGMSAIFFQKYWNIVGNDIVCMVLDVLNSNMSMVEINKTNITLVPKIKNPTKMSDFRPI 355

Query: 212 SLCNVSYKIIAKTIANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGK 271
           SLCNV YK+I+K +ANRLK +L +IIS  Q+AF+ GR I+DNVLV FE +H +  +++GK
Sbjct: 356 SLCNVVYKLISKVLANRLKNILPQIISENQSAFLSGRLITDNVLVAFELMHYLEHKKEGK 415

Query: 272 EGQIAIKLDMNKAYDRVEWVFIRSMLAKMGFSEKWVNLIM-------------------- 331
           EG  AIKLDM+KAYDRVEW FI+ ++ KMGF EKW+ L+M                    
Sbjct: 416 EGFAAIKLDMSKAYDRVEWGFIKQVMEKMGFHEKWIKLVMHCITSVSYSILVNGGAYGSI 475

Query: 332 ------------------------------------------------------------ 348
                                                                       
Sbjct: 476 TPTRGLRQGDPISPYIFLLCADGFSSLLNDVARKLRISGVSICRGCPKITHLFFADDSLL 535

BLAST of Lcy07g007050 vs. TAIR 10
Match: AT4G20520.1 (RNA binding;RNA-directed DNA polymerases )

HSP 1 Score: 80.9 bits (198), Expect = 2.3e-15
Identity = 36/90 (40.00%), Positives = 58/90 (64.44%), Query Frame = 0

Query: 219 IANRLKRVLDKIISPTQAAFVPGRQISDNVLVGFECIHAINSRRKGKEGQIAIKLDMNKA 278
           +  RLK ++  +I P QA+F+PGR  +DN++   E +H++  R+KG +G + +KLD+ KA
Sbjct: 1   MVERLKPLMTNLIGPAQASFIPGRVSTDNIVFVQEAVHSMR-RKKGVKGWMLLKLDLEKA 60

Query: 279 YDRVEWVFIRSMLAKMGFSEKWVNLIMRAT 309
           YDR+ W ++   L   GF E W+  I R+T
Sbjct: 61  YDRIRWDYLEDTLISAGFPEVWLPEIARST 89

BLAST of Lcy07g007050 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 79.3 bits (194), Expect = 6.6e-15
Identity = 57/198 (28.79%), Positives = 87/198 (43.94%), Query Frame = 0

Query: 25  AEGVEDYWRSRSREAWLQHGDKNTKWFHSKANQRKKRNEIRGILNREGQWVEDEDEIGET 84
           A  +E ++R +SR  WLQ GD NT++FH      + +N I+ +   +   VE+  ++ E 
Sbjct: 428 AAALESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEM 487

Query: 85  ASEYFRELFQSSYPDLEAIASTTEGINNCLSVPDVR-------ELERQFTRGEVEKAIKM 144
              Y+  L  S    L     T + +     +   R        L    +  E+  A+  
Sbjct: 488 IVAYYTHLLGSDSDIL-----TPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFA 547

Query: 145 MNPSKAPGIDGMHAAFYQNYWSIVGDDTVSMCLQILNQEGDIAPLNKTQIALISKVKDPK 204
           M  +KAPG D   A F+   W +V D T++   +       +   N T I LI KV    
Sbjct: 548 MPRNKAPGPDSFTAEFFWESWFVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVTGVD 607

Query: 205 LMSDFGPISLCNVSYKII 216
            +S F P+S C V YKII
Sbjct: 608 QLSMFRPVSCCTVVYKII 620

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
P14381          2.1e-26   28.52     Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV...
O00370          1.4e-22   28.13     LINE-1 retrotransposable element ORF2 protein OS=Homo sapiens OX=9606 PE=1 SV=1
P11369          1.2e-21   26.62     LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A2N9GPZ7      8.0e-77   50.36     Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A2N9IPS8      8.0e-77   50.36     Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A2N9GM07      1.8e-73   50.18     Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A2N9I335      2.0e-72   49.29     Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F...
A0A7N2LIH6      3.5e-72   38.00     Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1

NCBI nr
Match Name      E-value   Identity  Description
XP_030923330.1  4.4e-77   51.74     uncharacterized protein LOC115950239 [Quercus lobata]
XP_024038343.1  8.2e-76   42.54     uncharacterized protein LOC112097373 [Citrus clementina]
XP_023913142.1  1.5e-74   50.70     uncharacterized protein LOC112024740 [Quercus suber]
XP_042939444.1  2.2e-73   46.36     uncharacterized protein LOC122274474 [Carya illinoinensis]
XP_023881891.1  2.2e-73   40.10     uncharacterized protein LOC111994244 [Quercus suber]

TAIR 10
Match Name      E-value   Identity  Description
AT4G20520.1     2.3e-15   40.00     RNA binding;RNA-directed DNA polymerases
AT1G43760.1     6.6e-15   28.79     DNAse I-like superfamily protein

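The summary rows can be ranked or filtered by E-value once they are held as records. The sketch below copies the accession, E-value, and identity columns from the BLAST results for this feature; the names `HITS`, `best_hit`, and `filter_hits` are hypothetical, and the 1e-50 cutoff is an arbitrary illustration rather than a threshold used by the database. E-values depend on database size, so comparisons across databases are only approximate.

```python
# (database, accession, E-value as reported, percent identity)
HITS = [
    ("Swiss-Prot", "P14381", "2.1e-26", 28.52),
    ("Swiss-Prot", "O00370", "1.4e-22", 28.13),
    ("Swiss-Prot", "P11369", "1.2e-21", 26.62),
    ("TrEMBL", "A0A2N9GPZ7", "8.0e-77", 50.36),
    ("TrEMBL", "A0A2N9IPS8", "8.0e-77", 50.36),
    ("TrEMBL", "A0A2N9GM07", "1.8e-73", 50.18),
    ("TrEMBL", "A0A2N9I335", "2.0e-72", 49.29),
    ("TrEMBL", "A0A7N2LIH6", "3.5e-72", 38.00),
    ("NCBI nr", "XP_030923330.1", "4.4e-77", 51.74),
    ("NCBI nr", "XP_024038343.1", "8.2e-76", 42.54),
    ("NCBI nr", "XP_023913142.1", "1.5e-74", 50.70),
    ("NCBI nr", "XP_042939444.1", "2.2e-73", 46.36),
    ("NCBI nr", "XP_023881891.1", "2.2e-73", 40.10),
    ("TAIR 10", "AT4G20520.1", "2.3e-15", 40.00),
    ("TAIR 10", "AT1G43760.1", "6.6e-15", 28.79),
]


def best_hit(hits):
    """Return the record with the smallest E-value."""
    return min(hits, key=lambda hit: float(hit[2]))


def filter_hits(hits, max_evalue):
    """Keep records whose E-value is at or below max_evalue."""
    return [hit for hit in hits if float(hit[2]) <= max_evalue]


print(best_hit(HITS)[1])
print(len(filter_hits(HITS, 1e-50)))
```

With these records, the lowest E-value belongs to XP_030923330.1 (Quercus lobata), and only the TrEMBL and NCBI nr hits pass the illustrative 1e-50 cutoff.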
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (P93075) v1
Date Performed: 2021-12-06
IPR Term   IPR Description                 Source       Source Term      Source Description      Alignment
IPR000477  Reverse transcriptase domain    PFAM         PF00078          RVT_1                   coord: 199..307; e-value: 3.6E-19; score: 69.2
None       No IPR available                PANTHER      PTHR19446:SF440  SUBFAMILY NOT NAMED     coord: 42..339
None       No IPR available                PANTHER      PTHR19446        REVERSE TRANSCRIPTASES  coord: 42..339
None       No IPR available                CDD          cd01650          RT_nLTR_like            coord: 187..344; e-value: 2.47564E-32; score: 118.548
IPR043502  DNA/RNA polymerase superfamily  SUPERFAMILY  56672            DNA/RNA polymerases     coord: 110..309
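The `coord` ranges in the InterPro annotations can be compared to see how the predicted domains nest. The sketch below computes the length of each range and tests containment; the names `DOMAINS`, `domain_length`, and `contains` are hypothetical, and the arithmetic assumes the coordinates are 1-based inclusive residue positions on the protein, which the page does not state explicitly.

```python
# Source term -> (start, end), copied from the InterPro annotations above.
DOMAINS = {
    "PF00078": (199, 307),    # PFAM RVT_1
    "PTHR19446": (42, 339),   # PANTHER REVERSE TRANSCRIPTASES
    "cd01650": (187, 344),    # CDD RT_nLTR_like
    "56672": (110, 309),      # SUPERFAMILY DNA/RNA polymerases
}


def domain_length(coords):
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    start, end = coords
    return end - start + 1


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for term, coords in DOMAINS.items():
    print(term, domain_length(coords))

print(contains(DOMAINS["cd01650"], DOMAINS["PF00078"]))
```

Under that assumption, the Pfam RVT_1 match covers 109 residues and falls entirely inside the longer CDD RT_nLTR_like match.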

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lcy07g007050.1  Lcy07g007050.1  mRNA