Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGTAGGGATAAAGATTTAATTTTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGACAACAATCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGATTATTTCCTTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCTTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCCAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTAGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGATCTACACCTAAAAAGATTTTTGCTGGAGTGTTTGAGGATGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACTATTGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAACTAGAGGAGGTTGTCATTGCTATCAACTCAACAGTGAATGGCACAGTGCAGCCATCAAGAACATTGAAACTCAGCTGGGACAGTTGGTGA
mRNA sequence
ATGCGTAGGGATAAAGATTTAATTTTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGACAACAATCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGATTATTTCCTTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCTTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCCAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTAGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGATCTACACCTAAAAAGATTTTTGCTGGAGTGTTTGAGGATGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACTATTGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAACTAGAGGAGGTTGTCATTGCTATCAACTCAACAGTGAATGGCACAGTGCAGCCATCAAGAACATTGAAACTCAGCTGGGACAGTTGGTGA
Coding sequence (CDS)
ATGCGTAGGGATAAAGATTTAATTTTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGACAACAATCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGATTATTTCCTTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCTTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCCAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTAGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGATCTACACCTAAAAAGATTTTTGCTGGAGTGTTTGAGGATGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACTATTGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAACTAGAGGAGGTTGTCATTGCTATCAACTCAACAGTGAATGGCACAGTGCAGCCATCAAGAACATTGAAACTCAGCTGGGACAGTTGGTGA
Protein sequence
MRRDKDLILAPFDPEIERTILRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEVVIAINSTVNGTVQPSRTLKLSWDSW
Homology
BLAST of Lag0015504.1 vs. NCBI nr
Match:
WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])
HSP 1 Score: 433.7 bits (1114), Expect = 1.8e-117
Identity = 234/417 (56.12%), Postives = 288/417 (69.06%), Query Frame = 0
Query: 1 MRRDKDLILAPFDPEIERTILRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGI 60
M RD +L P DPEI+RT RR R ++ + E P+ IRDYFQP Q GI
Sbjct: 17 MPRDNTNLL-PLDPEIDRT---YRRNLRALLNQTTEMAEEIPKAIRDYFQPTLPASQPGI 76
Query: 61 VYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLR 120
+ PIN NNFELK GLIQMAR+ A+RG EDP+ HL+SFL+ICGTVK+NGVS DAI+LR
Sbjct: 77 MNVPINVNNFELKPGLIQMARELAFRGRTNEDPHKHLRSFLEICGTVKMNGVSNDAIKLR 136
Query: 121 LFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQL 180
LFPFSLQD+A+DWL++I SITTW+ L QAFL K+FPPAK+ +LRTEIGTF+Q DEQL
Sbjct: 137 LFPFSLQDRAKDWLETIPPDSITTWEILAQAFLNKYFPPAKSQRLRTEIGTFRQLEDEQL 196
Query: 181 FEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLL 240
+EAWER+K+LLR+CPQHGYPDWLQ+QLFYNGL STK+I+DA AGG++ SK + A T+L
Sbjct: 197 YEAWERYKDLLRRCPQHGYPDWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTIL 256
Query: 241 EDMATNSYQWPSERSTPK-KIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQ----S 300
ED+AT SY WP ER++P AG++E D+V++L+AQM SL NA K + G AQ S
Sbjct: 257 EDLATTSYNWPCERASPNIPKAAGLYEVDEVNSLKAQMASLTNALSKLTAGGQAQTNPPS 316
Query: 301 IESAAALASR-PQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLN 360
I S AALAS E YV + R Y + PTHYHPN RNHENFSYAN KNVL
Sbjct: 317 IASLAALASEMGVHGDNETANYVDRGHYRNYQHQQLPTHYHPNLRNHENFSYANNKNVLQ 376
Query: 361 -PPGF-APQTQDNKKLEDLVGAFIAESSNRTTKLEEVVIAINSTVNGTVQPSRTLKL 410
P GF LED++ F+ ES +RTT LE V AI STV + + L++
Sbjct: 377 APQGFNGAGNAKTSSLEDIMLDFVKESRSRTTTLENSVQAIASTVQSQGKALQNLEV 429
BLAST of Lag0015504.1 vs. NCBI nr
Match:
KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])
HSP 1 Score: 393.7 bits (1010), Expect = 2.0e-105
Identity = 206/393 (52.42%), Postives = 272/393 (69.21%), Query Frame = 0
Query: 1 MRRDKDLILAPFDPEIERTILRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGI 60
MRR + + P DPEIERT+ LRR +I+ MA+++ PR ++DY +PV G S I
Sbjct: 1 MRRARSRDIIPVDPEIERTLRSLRR--NKILAMAEEDREVLPRTLKDYVRPVVNGNYSSI 60
Query: 61 VYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLR 120
+ PINANNFELK LI M + + GSP +DPN HL FL+IC TVKINGV+ED IRLR
Sbjct: 61 MRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRLR 120
Query: 121 LFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQL 180
LFPFSL+DKAR WLQS+ GSI +W + + FL KFFPPAKT +LR+EIG F+Q E L
Sbjct: 121 LFPFSLRDKARGWLQSLQPGSIVSWQDMAERFLAKFFPPAKTAQLRSEIGQFKQNDFESL 180
Query: 181 FEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLL 240
+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL T+TIVDAA+GGTL+SKT E A LL
Sbjct: 181 YEAWERYKDLIRRCPQHGLPDWLQVQMFYNGLNGQTRTIVDAASGGTLMSKTAEGATALL 240
Query: 241 EDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQSIE--S 300
E+MA+N+YQWP+ER+ KK+ AG+ + + ++AL AQ+ +L++ + QS E +
Sbjct: 241 EEMASNNYQWPTERTLAKKV-AGIHDLEPIAALSAQVATLSHQISALTTQRIPQSTEYLA 300
Query: 301 AAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVL---NP 360
+ ++ E + EQVQYV+N N Y + P +YHP RNHEN SY NTKNVL +P
Sbjct: 301 STSMIVPSNEASQEQVQYVNNRN-YNYRGNPMPNYYHPGLRNHENLSYGNTKNVLQPQHP 360
Query: 361 PGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLE 388
PGF Q + K LED + +F+ E++ R K +
Sbjct: 361 PGFDSQPSERKMSLEDAMVSFVQETNARFKKTD 389
BLAST of Lag0015504.1 vs. NCBI nr
Match:
XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])
HSP 1 Score: 377.9 bits (969), Expect = 1.1e-100
Identity = 209/399 (52.38%), Postives = 269/399 (67.42%), Query Frame = 0
Query: 1 MRRDKDLILAPFDPEIERT--ILR-LRRENREIIQMAD---QNPPEEPRPIRDYFQPVFQ 60
MRR ++L L DPE ERT ILR ++R RE + D N + R IRDY +PV
Sbjct: 94 MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153
Query: 61 GQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSE 120
SGI I A NFELK GLI M + + G+ EDPN+HL SFL+IC TVK+NGV+E
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213
Query: 121 DAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQ 180
DAIRLRLF FSL+DKA+ W QS+ GSITTWD L Q FL K+FPP+K+ +LR EI F+Q
Sbjct: 214 DAIRLRLFSFSLRDKAKAWFQSLPYGSITTWDDLAQKFLTKYFPPSKSAQLRGEISQFKQ 273
Query: 181 QYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVE 240
E +EAWERFK+LLR+CPQHG+ W+Q+++FYNGL T+T+VDAAAGG L++KT E
Sbjct: 274 LDFEPFYEAWERFKDLLRRCPQHGFQKWVQIEIFYNGLNGQTRTMVDAAAGGILMAKTAE 333
Query: 241 NARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQ 300
A LL+D+ATNSYQWPSERS KK+ AG+ E D ++AL AQ+ SL N + + G+ Q
Sbjct: 334 AAYALLDDIATNSYQWPSERSGVKKV-AGLHEVDPITALAAQVASLTNQIVMLTTQGNQQ 393
Query: 301 SIESAAALASRPQEETI--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHENFSYANTK 360
+++S + +S QE + EQVQY+ N+N RG ++ HYHP RNHEN SY N +
Sbjct: 394 NVDSVISTSSSHQETEVANEQVQYIDSRNYNQRGGYQAN---HYHPGLRNHENLSYGNNR 453
Query: 361 NVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLE 388
N L PPGF Q D K LED++G FI+E+ +R K E
Sbjct: 454 NTLQPPPGFNTQNSDGKPPLEDILGTFISETRSRFNKNE 488
BLAST of Lag0015504.1 vs. NCBI nr
Match:
KAG7947748.1 (hypothetical protein I3843_14G109500 [Carya illinoinensis])
HSP 1 Score: 372.1 bits (954), Expect = 6.3e-99
Identity = 192/361 (53.19%), Postives = 252/361 (69.81%), Query Frame = 0
Query: 33 MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 92
MA+++ PR ++DY +PV G S I+ PINANNFELK LI M + + GSP +D
Sbjct: 1 MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60
Query: 93 PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAF 152
PN HL FL+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+ GSI +W + + F
Sbjct: 61 PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120
Query: 153 LKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 212
L KFFPPAKT +LR+EIG F+Q E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180
Query: 213 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSA 272
T+TIVDAA+GGTL+SKT E A LLE+MA+N+YQWP+ER+ KK+ AG+ E + ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240
Query: 273 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 332
L AQ+ +L++ + QS E ++ ++ E + EQVQYV+N N Y +
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300
Query: 333 PTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKL 388
P +YHP RNHEN SY NTKNVL +PPGF Q + K LED + +F+ E++ R K
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 359
BLAST of Lag0015504.1 vs. NCBI nr
Match:
KAG6734747.1 (hypothetical protein I3842_01G285500 [Carya illinoinensis])
HSP 1 Score: 372.1 bits (954), Expect = 6.3e-99
Identity = 192/361 (53.19%), Postives = 252/361 (69.81%), Query Frame = 0
Query: 33 MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 92
MA+++ PR ++DY +PV G S I+ PINANNFELK LI M + + GSP +D
Sbjct: 1 MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60
Query: 93 PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAF 152
PN HL FL+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+ GSI +W + + F
Sbjct: 61 PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120
Query: 153 LKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 212
L KFFPPAKT +LR+EIG F+Q E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180
Query: 213 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSA 272
T+TIVDAA+GGTL+SKT E A LLE+MA+N+YQWP+ER+ KK+ AG+ E + ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240
Query: 273 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 332
L AQ+ +L++ + QS E ++ ++ E + EQVQYV+N N Y +
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300
Query: 333 PTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKL 388
P +YHP RNHEN SY NTKNVL +PPGF Q + K LED + +F+ E++ R K
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 359
BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match:
A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)
HSP 1 Score: 335.1 bits (858), Expect = 4.1e-88
Identity = 205/450 (45.56%), Postives = 271/450 (60.22%), Query Frame = 0
Query: 1 MRRDKDLILAPFDPEIERTILRLRRENREII----QMADQN----------PPEEPRPIR 60
M+R +L L PFDP+IERT R RREN ++ MA+ N PE R +R
Sbjct: 1 MQRRNNLNLVPFDPDIERTFRRHRRENLQVATLNQTMAEDNNNNGNNAINLVPEANRALR 60
Query: 61 DYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICG 120
DY P+ QG I INANNFE+K IQM + + G P++DPNSHL +FL+IC
Sbjct: 61 DYVVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLEICD 120
Query: 121 TVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKL 180
T K NGV++DAIRLRLFPFSL+DKA+ WL S+ GSITTW+ L Q FL KFFPPAKT K+
Sbjct: 121 TFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKTAKM 180
Query: 181 RTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAG 240
R +I +F Q E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL S KTI+DAAAG
Sbjct: 181 RNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDAAAG 240
Query: 241 GTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFM 300
G L+SK +A LLE+MA+N+YQWPSERS +K G +E D + L Q+ +L+
Sbjct: 241 GALMSKNAVDAYNLLEEMASNNYQWPSERSGSRKA-VGAYEIDALGTLTTQVAALS---- 300
Query: 301 KFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPN 360
K T +++++ + + E VQ+V NFN R NN + T Y+P
Sbjct: 301 KKLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFN-RQQNNPYSNT-YNPG 360
Query: 361 NRNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAFIAESS------- 407
RNH NFS++N N PPGF PQ + K +LE+L+ +I+++
Sbjct: 361 WRNHPNFSWSNNAGPSNPKPIMPPGFQQQARPQIPEKKSQLEELLLQYISKTDAIIQSQG 420
BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match:
A0A6J0ZYV0 (uncharacterized protein LOC110413413 OS=Herrania umbratica OX=108875 GN=LOC110413413 PE=4 SV=1)
HSP 1 Score: 332.8 bits (852), Expect = 2.0e-87
Identity = 193/401 (48.13%), Postives = 246/401 (61.35%), Query Frame = 0
Query: 1 MRRDKDLILAPFDPEIERTILRLRRENREII----QMADQN----------PPEEPRPIR 60
M+R +L L PFDP+IERT R RREN ++ MA+ N PE R +R
Sbjct: 1 MQRRNNLNLVPFDPDIERTFRRHRRENLQVATLNQTMAEDNNNNGNNAINLVPEANRALR 60
Query: 61 DYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICG 120
DY P+ QG I INANNFE+K IQM + + G P++DPNSHL +FL+IC
Sbjct: 61 DYAVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLEICD 120
Query: 121 TVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKL 180
T K NGV++DAIRLRLFPFSL+DKA+ WL S+ GSITTW+ L Q FL KFFPPAKT K+
Sbjct: 121 TFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKTAKM 180
Query: 181 RTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAG 240
R +I +F Q E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL S KTI+DAAAG
Sbjct: 181 RNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDAAAG 240
Query: 241 GTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFM 300
G L+SK +A LLE+MA+N+YQWPSERS +K G +E D + L Q+ +L+
Sbjct: 241 GALMSKNAVDAYNLLEEMASNNYQWPSERSGSRKA-VGAYEIDALGTLTTQVAALS---- 300
Query: 301 KFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPN 360
K T +++++ + + E VQ+V NFN R NN + T Y+P
Sbjct: 301 KKLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFN-RQQNNPYSNT-YNPG 360
Query: 361 NRNHENFSYANTKNVLN-----PPGFAPQTQDNKKLEDLVG 374
RNH NFS++N N PPGF Q + + E +G
Sbjct: 361 WRNHPNFSWSNNAGPSNPKPIMPPGFQQQARPQFQKEVPIG 394
BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match:
A0A3S3N117 (Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_01212200 PE=4 SV=1)
HSP 1 Score: 323.9 bits (829), Expect = 9.5e-85
Identity = 185/399 (46.37%), Postives = 249/399 (62.41%), Query Frame = 0
Query: 1 MRRDKDLILAPFDPEIERTILRLRRENR-----EIIQMADQNPPEEPRPIRDYFQPVFQG 60
MRR+++L L P DPEIERT+ RL++E + EI +M +Q R + DY P+ G
Sbjct: 1 MRRNQNLNLVPLDPEIERTLRRLKKEKKQQSEFEITEMKEQ----ANRSLGDYAVPLVTG 60
Query: 61 QQSGIVYAPINANNFELKTGLIQM-ARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSE 120
S I I ANNFE+K +IQM A + G P +DPN+H+ +FL++C T K NGV++
Sbjct: 61 ATSSIRRPVIQANNFEIKPAIIQMVASTVQFSGLPDDDPNAHISNFLELCDTFKYNGVTD 120
Query: 121 DAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQ 180
DA+RLRL PFSL+DKA+ WL S+ +ITTWD L + FL KFFPP KTVK+R +I TF Q
Sbjct: 121 DAVRLRLLPFSLRDKAKAWLNSLPQSTITTWDELAKKFLAKFFPPTKTVKMRNDITTFAQ 180
Query: 181 QYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVE 240
E L+EAWER+KELLRKCP HG P W+QVQ FYNGL +T+T +DAA GGTL+ K+ E
Sbjct: 181 NEMESLYEAWERYKELLRKCPHHGLPLWIQVQTFYNGLQSATRTSIDAATGGTLMKKSPE 240
Query: 241 NARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAF--MKFSGTGS 300
A L+E+MATN+YQWPS+ KKI GV E D +SAL AQ+ +L+ MK S
Sbjct: 241 EAYELVEEMATNNYQWPSDHVQQKKI-QGVHELDSISALTAQVANLSKQIQSMKVHAVQS 300
Query: 301 AQSIESAAALASRPQE-------ETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFS 360
+ A + + EQV YVSN++ + S+T Y+P RNH NFS
Sbjct: 301 TNMVCEFCAGNHMGVDCQVGNPFNSQEQVHYVSNYSRQNNPYSNT---YNPGWRNHPNFS 360
Query: 361 YANTKN-VLNPPGF-APQTQDNKKLEDLVGAFIAESSNR 383
+ N +N PP F PQ ++ LE ++ FI++ ++
Sbjct: 361 WNNAQNSARQPPRFQQPQQEEKSGLEKMMAQFISKVDSK 391
BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match:
A0A6P6XAQ1 (Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1)
HSP 1 Score: 318.5 bits (815), Expect = 4.0e-83
Identity = 174/360 (48.33%), Postives = 232/360 (64.44%), Query Frame = 0
Query: 43 RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLD 102
R +RD+ P QG Q+ IV +NANNFE+K LIQM + Y G+ TEDPNSHL +FL+
Sbjct: 9 RILRDFALPGAQGSQTSIVRPTVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLE 68
Query: 103 ICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKT 162
IC T+K NGVSEDAI+LRLFPFSL+DKA+ WLQS + TTWD L +AFL KFFPP KT
Sbjct: 69 ICDTIKFNGVSEDAIKLRLFPFSLRDKAKVWLQSHPPNTFTTWDELAKAFLNKFFPPGKT 128
Query: 163 VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDA 222
KLR +I +F QQ E L+EAWER++EL R+CP HG PDWL VQ FYNGLT TKT VDA
Sbjct: 129 AKLRMDITSFSQQEGETLYEAWERYRELQRRCPHHGLPDWLVVQTFYNGLTYPTKTHVDA 188
Query: 223 AAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLAN 282
AAGG L+ KT E A+ L+E+MA N+YQW +ER ++ AG+ E D ++ L A+M ++
Sbjct: 189 AAGGALMGKTAEEAQQLIEEMAANNYQWANERGNSRRT-AGMLEVDTLNMLSAKMDNVVK 248
Query: 283 AFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN 342
+ G+ S Q + A+ + + EQVQY++N+N NN + T Y+P
Sbjct: 249 MLNRQVGSSSNQGVVVASCTICGGDHDDFMCSSSEQVQYLNNYNRPPQNNPYSNT-YNPG 308
Query: 343 NRNHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSNRTTKLEEVVIA 393
RNH NF + N + +NPPGF + ++K +L +A +SN K+E++ A
Sbjct: 309 WRNHPNFGWKDQGNQQRPVNPPGFQQKQTLHESKPAWELAIEKLANASN--DKIEKLASA 364
BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match:
A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)
HSP 1 Score: 310.5 bits (794), Expect = 1.1e-80
Identity = 178/366 (48.63%), Postives = 225/366 (61.48%), Query Frame = 0
Query: 45 IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDIC 104
IRDY QP F GI+ PINANN ELK GLIQM R+ +RG+ TEDPN+HL FLD+C
Sbjct: 27 IRDYCQPNFP-NHVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLTIFLDVC 86
Query: 105 GTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVK 164
GTVK+NGV +DAIRLRLFP SLQDK +VQAFL FFPPAKT +
Sbjct: 87 GTVKMNGVIDDAIRLRLFPLSLQDK-----------------EMVQAFLTNFFPPAKTTQ 146
Query: 165 LRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAA 224
LRTEI +F++ EQLFE WER+KELLRKCPQHG +WLQ+Q+FYNGL T+TI+DAAA
Sbjct: 147 LRTEIRSFRKYDYEQLFEVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRTILDAAA 206
Query: 225 GGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAF 284
GGTLLS+T ENA LL+DMA NS+QWPSERS KK+ AG++E D++S+L+AQ+ +L NA
Sbjct: 207 GGTLLSRTPENAYILLKDMADNSFQWPSERSNAKKV-AGMYEIDELSSLKAQVQALTNAV 266
Query: 285 MKFSGTGSAQSIESAAALASRP-QEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEN 344
K SG G++ S E AA + E TIEQ Q+ S HP
Sbjct: 267 SKLSGPGTSHSNELVAATDTYSYYEPTIEQAQFTS----------------HP------- 326
Query: 345 FSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEVVIAINSTVNGTVQP 404
+ LEDL+GAFI E +R +++E V + + G
Sbjct: 327 ------------------AEKKSSLEDLLGAFINECRSRASRIENQVEGMEVKLEGNTTS 332
Query: 405 SRTLKL 410
+ +++
Sbjct: 387 IKNMEV 332
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
WP_217833153.1 | 1.8e-117 | 56.12 | retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70... | [more] |
KAG7990634.1 | 2.0e-105 | 52.42 | hypothetical protein I3843_02G035100 [Carya illinoinensis] | [more] |
XP_022843226.1 | 1.1e-100 | 52.38 | uncharacterized protein LOC111366761 [Olea europaea var. sylvestris] | [more] |
KAG7947748.1 | 6.3e-99 | 53.19 | hypothetical protein I3843_14G109500 [Carya illinoinensis] | [more] |
KAG6734747.1 | 6.3e-99 | 53.19 | hypothetical protein I3842_01G285500 [Carya illinoinensis] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J0ZX64 | 4.1e-88 | 45.56 | LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ... | [more] |
A0A6J0ZYV0 | 2.0e-87 | 48.13 | uncharacterized protein LOC110413413 OS=Herrania umbratica OX=108875 GN=LOC11041... | [more] |
A0A3S3N117 | 9.5e-85 | 46.37 | Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae O... | [more] |
A0A6P6XAQ1 | 4.0e-83 | 48.33 | Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1 | [more] |
A0A6J1DU19 | 1.1e-80 | 48.63 | uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |