Lag0015504.1 (mRNA) Sponge gourd (AG‐4) v1

Overview
NameLag0015504.1
TypemRNA
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionRetrotrans_gag domain-containing protein
Locationchr12: 14533675 .. 14534919 (+)
Sequence length1245
RNA-Seq ExpressionLag0015504.1
SyntenyLag0015504.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGTAGGGATAAAGATTTAATTTTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGACAACAATCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGATTATTTCCTTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCTTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCCAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTAGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGATCTACACCTAAAAAGATTTTTGCTGGAGTGTTTGAGGATGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACTATTGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAACTAGAGGAGGTTGTCATTGCTATCAACTCAACAGTGAATGGCACAGTGCAGCCATCAAGAACATTGAAACTCAGCTGGGACAGTTGGTGA

mRNA sequence

ATGCGTAGGGATAAAGATTTAATTTTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGACAACAATCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGATTATTTCCTTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCTTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCCAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTAGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGATCTACACCTAAAAAGATTTTTGCTGGAGTGTTTGAGGATGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACTATTGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAACTAGAGGAGGTTGTCATTGCTATCAACTCAACAGTGAATGGCACAGTGCAGCCATCAAGAACATTGAAACTCAGCTGGGACAGTTGGTGA

Coding sequence (CDS)

ATGCGTAGGGATAAAGATTTAATTTTAGCACCATTTGATCCCGAGATAGAAAGAACAATTCTTAGGCTTCGAAGGGAAAATAGAGAAATTATTCAAATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGACAACAATCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAGACCGGCCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGATTATTTCCTTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCTTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCCAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTAGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGATCTACACCTAAAAAGATTTTTGCTGGAGTGTTTGAGGATGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACTATTGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAACTAGAGGAGGTTGTCATTGCTATCAACTCAACAGTGAATGGCACAGTGCAGCCATCAAGAACATTGAAACTCAGCTGGGACAGTTGGTGA

Protein sequence

MRRDKDLILAPFDPEIERTILRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEVVIAINSTVNGTVQPSRTLKLSWDSW
Homology
BLAST of Lag0015504.1 vs. NCBI nr
Match: WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])

HSP 1 Score: 433.7 bits (1114), Expect = 1.8e-117
Identity = 234/417 (56.12%), Postives = 288/417 (69.06%), Query Frame = 0

Query: 1   MRRDKDLILAPFDPEIERTILRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGI 60
           M RD   +L P DPEI+RT    RR  R ++    +   E P+ IRDYFQP     Q GI
Sbjct: 17  MPRDNTNLL-PLDPEIDRT---YRRNLRALLNQTTEMAEEIPKAIRDYFQPTLPASQPGI 76

Query: 61  VYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLR 120
           +  PIN NNFELK GLIQMAR+ A+RG   EDP+ HL+SFL+ICGTVK+NGVS DAI+LR
Sbjct: 77  MNVPINVNNFELKPGLIQMARELAFRGRTNEDPHKHLRSFLEICGTVKMNGVSNDAIKLR 136

Query: 121 LFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQL 180
           LFPFSLQD+A+DWL++I   SITTW+ L QAFL K+FPPAK+ +LRTEIGTF+Q  DEQL
Sbjct: 137 LFPFSLQDRAKDWLETIPPDSITTWEILAQAFLNKYFPPAKSQRLRTEIGTFRQLEDEQL 196

Query: 181 FEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLL 240
           +EAWER+K+LLR+CPQHGYPDWLQ+QLFYNGL  STK+I+DA AGG++ SK  + A T+L
Sbjct: 197 YEAWERYKDLLRRCPQHGYPDWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTIL 256

Query: 241 EDMATNSYQWPSERSTPK-KIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQ----S 300
           ED+AT SY WP ER++P     AG++E D+V++L+AQM SL NA  K +  G AQ    S
Sbjct: 257 EDLATTSYNWPCERASPNIPKAAGLYEVDEVNSLKAQMASLTNALSKLTAGGQAQTNPPS 316

Query: 301 IESAAALASR-PQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLN 360
           I S AALAS        E   YV   + R Y +   PTHYHPN RNHENFSYAN KNVL 
Sbjct: 317 IASLAALASEMGVHGDNETANYVDRGHYRNYQHQQLPTHYHPNLRNHENFSYANNKNVLQ 376

Query: 361 -PPGF-APQTQDNKKLEDLVGAFIAESSNRTTKLEEVVIAINSTVNGTVQPSRTLKL 410
            P GF          LED++  F+ ES +RTT LE  V AI STV    +  + L++
Sbjct: 377 APQGFNGAGNAKTSSLEDIMLDFVKESRSRTTTLENSVQAIASTVQSQGKALQNLEV 429

BLAST of Lag0015504.1 vs. NCBI nr
Match: KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])

HSP 1 Score: 393.7 bits (1010), Expect = 2.0e-105
Identity = 206/393 (52.42%), Postives = 272/393 (69.21%), Query Frame = 0

Query: 1   MRRDKDLILAPFDPEIERTILRLRRENREIIQMADQNPPEEPRPIRDYFQPVFQGQQSGI 60
           MRR +   + P DPEIERT+  LRR   +I+ MA+++    PR ++DY +PV  G  S I
Sbjct: 1   MRRARSRDIIPVDPEIERTLRSLRR--NKILAMAEEDREVLPRTLKDYVRPVVNGNYSSI 60

Query: 61  VYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLR 120
           +  PINANNFELK  LI M +   + GSP +DPN HL  FL+IC TVKINGV+ED IRLR
Sbjct: 61  MRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRLR 120

Query: 121 LFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQL 180
           LFPFSL+DKAR WLQS+  GSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L
Sbjct: 121 LFPFSLRDKARGWLQSLQPGSIVSWQDMAERFLAKFFPPAKTAQLRSEIGQFKQNDFESL 180

Query: 181 FEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLL 240
           +EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SKT E A  LL
Sbjct: 181 YEAWERYKDLIRRCPQHGLPDWLQVQMFYNGLNGQTRTIVDAASGGTLMSKTAEGATALL 240

Query: 241 EDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQSIE--S 300
           E+MA+N+YQWP+ER+  KK+ AG+ + + ++AL AQ+ +L++     +     QS E  +
Sbjct: 241 EEMASNNYQWPTERTLAKKV-AGIHDLEPIAALSAQVATLSHQISALTTQRIPQSTEYLA 300

Query: 301 AAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVL---NP 360
           + ++     E + EQVQYV+N N   Y  +  P +YHP  RNHEN SY NTKNVL   +P
Sbjct: 301 STSMIVPSNEASQEQVQYVNNRN-YNYRGNPMPNYYHPGLRNHENLSYGNTKNVLQPQHP 360

Query: 361 PGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLE 388
           PGF  Q  + K  LED + +F+ E++ R  K +
Sbjct: 361 PGFDSQPSERKMSLEDAMVSFVQETNARFKKTD 389

BLAST of Lag0015504.1 vs. NCBI nr
Match: XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])

HSP 1 Score: 377.9 bits (969), Expect = 1.1e-100
Identity = 209/399 (52.38%), Postives = 269/399 (67.42%), Query Frame = 0

Query: 1   MRRDKDLILAPFDPEIERT--ILR-LRRENREIIQMAD---QNPPEEPRPIRDYFQPVFQ 60
           MRR ++L L   DPE ERT  ILR ++R  RE +   D    N   + R IRDY +PV  
Sbjct: 94  MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153

Query: 61  GQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSE 120
              SGI    I A NFELK GLI M +   + G+  EDPN+HL SFL+IC TVK+NGV+E
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213

Query: 121 DAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQ 180
           DAIRLRLF FSL+DKA+ W QS+  GSITTWD L Q FL K+FPP+K+ +LR EI  F+Q
Sbjct: 214 DAIRLRLFSFSLRDKAKAWFQSLPYGSITTWDDLAQKFLTKYFPPSKSAQLRGEISQFKQ 273

Query: 181 QYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVE 240
              E  +EAWERFK+LLR+CPQHG+  W+Q+++FYNGL   T+T+VDAAAGG L++KT E
Sbjct: 274 LDFEPFYEAWERFKDLLRRCPQHGFQKWVQIEIFYNGLNGQTRTMVDAAAGGILMAKTAE 333

Query: 241 NARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFMKFSGTGSAQ 300
            A  LL+D+ATNSYQWPSERS  KK+ AG+ E D ++AL AQ+ SL N  +  +  G+ Q
Sbjct: 334 AAYALLDDIATNSYQWPSERSGVKKV-AGLHEVDPITALAAQVASLTNQIVMLTTQGNQQ 393

Query: 301 SIESAAALASRPQEETI--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHENFSYANTK 360
           +++S  + +S  QE  +  EQVQY+   N+N RG   ++   HYHP  RNHEN SY N +
Sbjct: 394 NVDSVISTSSSHQETEVANEQVQYIDSRNYNQRGGYQAN---HYHPGLRNHENLSYGNNR 453

Query: 361 NVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLE 388
           N L  PPGF  Q  D K  LED++G FI+E+ +R  K E
Sbjct: 454 NTLQPPPGFNTQNSDGKPPLEDILGTFISETRSRFNKNE 488

BLAST of Lag0015504.1 vs. NCBI nr
Match: KAG7947748.1 (hypothetical protein I3843_14G109500 [Carya illinoinensis])

HSP 1 Score: 372.1 bits (954), Expect = 6.3e-99
Identity = 192/361 (53.19%), Postives = 252/361 (69.81%), Query Frame = 0

Query: 33  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 92
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 93  PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAF 152
           PN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+  GSI +W  + + F
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 153 LKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 212
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 213 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSA 272
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E + ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 273 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 332
           L AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300

Query: 333 PTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKL 388
           P +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K 
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 359

BLAST of Lag0015504.1 vs. NCBI nr
Match: KAG6734747.1 (hypothetical protein I3842_01G285500 [Carya illinoinensis])

HSP 1 Score: 372.1 bits (954), Expect = 6.3e-99
Identity = 192/361 (53.19%), Postives = 252/361 (69.81%), Query Frame = 0

Query: 33  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 92
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 93  PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAF 152
           PN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+  GSI +W  + + F
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 153 LKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 212
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 213 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSA 272
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E + ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 273 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 332
           L AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300

Query: 333 PTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKL 388
           P +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K 
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 359

BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match: A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)

HSP 1 Score: 335.1 bits (858), Expect = 4.1e-88
Identity = 205/450 (45.56%), Postives = 271/450 (60.22%), Query Frame = 0

Query: 1   MRRDKDLILAPFDPEIERTILRLRRENREII----QMADQN----------PPEEPRPIR 60
           M+R  +L L PFDP+IERT  R RREN ++      MA+ N           PE  R +R
Sbjct: 1   MQRRNNLNLVPFDPDIERTFRRHRRENLQVATLNQTMAEDNNNNGNNAINLVPEANRALR 60

Query: 61  DYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICG 120
           DY  P+ QG    I    INANNFE+K   IQM +    + G P++DPNSHL +FL+IC 
Sbjct: 61  DYVVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLEICD 120

Query: 121 TVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKL 180
           T K NGV++DAIRLRLFPFSL+DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT K+
Sbjct: 121 TFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKTAKM 180

Query: 181 RTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAG 240
           R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAG
Sbjct: 181 RNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDAAAG 240

Query: 241 GTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFM 300
           G L+SK   +A  LLE+MA+N+YQWPSERS  +K   G +E D +  L  Q+ +L+    
Sbjct: 241 GALMSKNAVDAYNLLEEMASNNYQWPSERSGSRKA-VGAYEIDALGTLTTQVAALS---- 300

Query: 301 KFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPN 360
           K   T    +++++  +     +           E VQ+V NFN R  NN  + T Y+P 
Sbjct: 301 KKLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFN-RQQNNPYSNT-YNPG 360

Query: 361 NRNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAFIAESS------- 407
            RNH NFS++N     N     PPGF     PQ  + K +LE+L+  +I+++        
Sbjct: 361 WRNHPNFSWSNNAGPSNPKPIMPPGFQQQARPQIPEKKSQLEELLLQYISKTDAIIQSQG 420

BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match: A0A6J0ZYV0 (uncharacterized protein LOC110413413 OS=Herrania umbratica OX=108875 GN=LOC110413413 PE=4 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 2.0e-87
Identity = 193/401 (48.13%), Postives = 246/401 (61.35%), Query Frame = 0

Query: 1   MRRDKDLILAPFDPEIERTILRLRRENREII----QMADQN----------PPEEPRPIR 60
           M+R  +L L PFDP+IERT  R RREN ++      MA+ N           PE  R +R
Sbjct: 1   MQRRNNLNLVPFDPDIERTFRRHRRENLQVATLNQTMAEDNNNNGNNAINLVPEANRALR 60

Query: 61  DYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFLDICG 120
           DY  P+ QG    I    INANNFE+K   IQM +    + G P++DPNSHL +FL+IC 
Sbjct: 61  DYAVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLEICD 120

Query: 121 TVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKL 180
           T K NGV++DAIRLRLFPFSL+DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT K+
Sbjct: 121 TFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKTAKM 180

Query: 181 RTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAG 240
           R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAG
Sbjct: 181 RNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDAAAG 240

Query: 241 GTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAFM 300
           G L+SK   +A  LLE+MA+N+YQWPSERS  +K   G +E D +  L  Q+ +L+    
Sbjct: 241 GALMSKNAVDAYNLLEEMASNNYQWPSERSGSRKA-VGAYEIDALGTLTTQVAALS---- 300

Query: 301 KFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPN 360
           K   T    +++++  +     +           E VQ+V NFN R  NN  + T Y+P 
Sbjct: 301 KKLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFN-RQQNNPYSNT-YNPG 360

Query: 361 NRNHENFSYANTKNVLN-----PPGFAPQTQDNKKLEDLVG 374
            RNH NFS++N     N     PPGF  Q +   + E  +G
Sbjct: 361 WRNHPNFSWSNNAGPSNPKPIMPPGFQQQARPQFQKEVPIG 394

BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match: A0A3S3N117 (Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_01212200 PE=4 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 9.5e-85
Identity = 185/399 (46.37%), Postives = 249/399 (62.41%), Query Frame = 0

Query: 1   MRRDKDLILAPFDPEIERTILRLRRENR-----EIIQMADQNPPEEPRPIRDYFQPVFQG 60
           MRR+++L L P DPEIERT+ RL++E +     EI +M +Q      R + DY  P+  G
Sbjct: 1   MRRNQNLNLVPLDPEIERTLRRLKKEKKQQSEFEITEMKEQ----ANRSLGDYAVPLVTG 60

Query: 61  QQSGIVYAPINANNFELKTGLIQM-ARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSE 120
             S I    I ANNFE+K  +IQM A    + G P +DPN+H+ +FL++C T K NGV++
Sbjct: 61  ATSSIRRPVIQANNFEIKPAIIQMVASTVQFSGLPDDDPNAHISNFLELCDTFKYNGVTD 120

Query: 121 DAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQ 180
           DA+RLRL PFSL+DKA+ WL S+   +ITTWD L + FL KFFPP KTVK+R +I TF Q
Sbjct: 121 DAVRLRLLPFSLRDKAKAWLNSLPQSTITTWDELAKKFLAKFFPPTKTVKMRNDITTFAQ 180

Query: 181 QYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVE 240
              E L+EAWER+KELLRKCP HG P W+QVQ FYNGL  +T+T +DAA GGTL+ K+ E
Sbjct: 181 NEMESLYEAWERYKELLRKCPHHGLPLWIQVQTFYNGLQSATRTSIDAATGGTLMKKSPE 240

Query: 241 NARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAF--MKFSGTGS 300
            A  L+E+MATN+YQWPS+    KKI  GV E D +SAL AQ+ +L+     MK     S
Sbjct: 241 EAYELVEEMATNNYQWPSDHVQQKKI-QGVHELDSISALTAQVANLSKQIQSMKVHAVQS 300

Query: 301 AQSIESAAALASRPQE-------ETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFS 360
              +    A      +        + EQV YVSN++ +    S+T   Y+P  RNH NFS
Sbjct: 301 TNMVCEFCAGNHMGVDCQVGNPFNSQEQVHYVSNYSRQNNPYSNT---YNPGWRNHPNFS 360

Query: 361 YANTKN-VLNPPGF-APQTQDNKKLEDLVGAFIAESSNR 383
           + N +N    PP F  PQ ++   LE ++  FI++  ++
Sbjct: 361 WNNAQNSARQPPRFQQPQQEEKSGLEKMMAQFISKVDSK 391

BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match: A0A6P6XAQ1 (Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 4.0e-83
Identity = 174/360 (48.33%), Postives = 232/360 (64.44%), Query Frame = 0

Query: 43  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLD 102
           R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+
Sbjct: 9   RILRDFALPGAQGSQTSIVRPTVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLE 68

Query: 103 ICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKT 162
           IC T+K NGVSEDAI+LRLFPFSL+DKA+ WLQS    + TTWD L +AFL KFFPP KT
Sbjct: 69  ICDTIKFNGVSEDAIKLRLFPFSLRDKAKVWLQSHPPNTFTTWDELAKAFLNKFFPPGKT 128

Query: 163 VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDA 222
            KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDA
Sbjct: 129 AKLRMDITSFSQQEGETLYEAWERYRELQRRCPHHGLPDWLVVQTFYNGLTYPTKTHVDA 188

Query: 223 AAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLAN 282
           AAGG L+ KT E A+ L+E+MA N+YQW +ER   ++  AG+ E D ++ L A+M ++  
Sbjct: 189 AAGGALMGKTAEEAQQLIEEMAANNYQWANERGNSRRT-AGMLEVDTLNMLSAKMDNVVK 248

Query: 283 AFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN 342
              +  G+ S Q +  A+        +     + EQVQY++N+N    NN  + T Y+P 
Sbjct: 249 MLNRQVGSSSNQGVVVASCTICGGDHDDFMCSSSEQVQYLNNYNRPPQNNPYSNT-YNPG 308

Query: 343 NRNHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSNRTTKLEEVVIA 393
            RNH NF +    N +  +NPPGF  +    ++K   +L    +A +SN   K+E++  A
Sbjct: 309 WRNHPNFGWKDQGNQQRPVNPPGFQQKQTLHESKPAWELAIEKLANASN--DKIEKLASA 364

BLAST of Lag0015504.1 vs. ExPASy TrEMBL
Match: A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)

HSP 1 Score: 310.5 bits (794), Expect = 1.1e-80
Identity = 178/366 (48.63%), Postives = 225/366 (61.48%), Query Frame = 0

Query: 45  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDIC 104
           IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FLD+C
Sbjct: 27  IRDYCQPNFP-NHVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLTIFLDVC 86

Query: 105 GTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITLGSITTWDALVQAFLKKFFPPAKTVK 164
           GTVK+NGV +DAIRLRLFP SLQDK                  +VQAFL  FFPPAKT +
Sbjct: 87  GTVKMNGVIDDAIRLRLFPLSLQDK-----------------EMVQAFLTNFFPPAKTTQ 146

Query: 165 LRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAA 224
           LRTEI +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAA
Sbjct: 147 LRTEIRSFRKYDYEQLFEVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRTILDAAA 206

Query: 225 GGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIFAGVFEDDKVSALQAQMTSLANAF 284
           GGTLLS+T ENA  LL+DMA NS+QWPSERS  KK+ AG++E D++S+L+AQ+ +L NA 
Sbjct: 207 GGTLLSRTPENAYILLKDMADNSFQWPSERSNAKKV-AGMYEIDELSSLKAQVQALTNAV 266

Query: 285 MKFSGTGSAQSIESAAALASRP-QEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEN 344
            K SG G++ S E  AA  +    E TIEQ Q+ S                HP       
Sbjct: 267 SKLSGPGTSHSNELVAATDTYSYYEPTIEQAQFTS----------------HP------- 326

Query: 345 FSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEVVIAINSTVNGTVQP 404
                              +    LEDL+GAFI E  +R +++E  V  +   + G    
Sbjct: 327 ------------------AEKKSSLEDLLGAFINECRSRASRIENQVEGMEVKLEGNTTS 332

Query: 405 SRTLKL 410
            + +++
Sbjct: 387 IKNMEV 332

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
WP_217833153.11.8e-11756.12retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70... [more]
KAG7990634.12.0e-10552.42hypothetical protein I3843_02G035100 [Carya illinoinensis][more]
XP_022843226.11.1e-10052.38uncharacterized protein LOC111366761 [Olea europaea var. sylvestris][more]
KAG7947748.16.3e-9953.19hypothetical protein I3843_14G109500 [Carya illinoinensis][more]
KAG6734747.16.3e-9953.19hypothetical protein I3842_01G285500 [Carya illinoinensis][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J0ZX644.1e-8845.56LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ... [more]
A0A6J0ZYV02.0e-8748.13uncharacterized protein LOC110413413 OS=Herrania umbratica OX=108875 GN=LOC11041... [more]
A0A3S3N1179.5e-8546.37Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae O... [more]
A0A6P6XAQ14.0e-8348.33Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1[more]
A0A6J1DU191.1e-8048.63uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 121..213
e-value: 1.8E-18
score: 66.6
NoneNo IPR availablePANTHERPTHR24559:SF334SUBFAMILY NOT NAMEDcoord: 93..280
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 93..280

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Lag0015504Lag0015504gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Lag0015504.1.exon1Lag0015504.1.exon1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
cds.Lag0015504.1cds.Lag0015504.1CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Lag0015504.1Lag0015504.1-proteinpolypeptide