Lag0041579 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0041579
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Retrotrans_gag domain-containing protein
Location: chr13: 21088984 .. 21090624 (-)
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTAAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATATGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCTTTTTCTTTGCAGGATAAAGCACGTGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAATAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATACGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGACTGGCTTCAGGTTCAATTGTTTTATAATGGTTTAACTCCTAGTACAAAAACTATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAAGCCCAGATGACTTCCCTTGCCAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAACCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAGCTGGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCACAGTGCTGCCATAAAGAACATAGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAACAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGATGGAATATTGCAAGGCAATCACTGTTCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAACAGCCTAACCTTGAGCCTCCTATTCCTTCTCCTACACTGTTGGTTCCTAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAATAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGCAGAGGCATTAGAAATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCTTTAG

mRNA sequence

ATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTAAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATATGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCTTTTTCTTTGCAGGATAAAGCACGTGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAATAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATACGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGACTGGCTTCAGGTTCAATTGTTTTATAATGGTTTAACTCCTAGTACAAAAACTATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAAGCCCAGATGACTTCCCTTGCCAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAACCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAGCTGGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCACAGTGCTGCCATAAAGAACATAGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAACAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGATGGAATATTGCAAGGCAATCACTGTTCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAACAGCCTAACCTTGAGCCTCCTATTCCTTCTCCTACACTGTTGGTTCCTAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAATAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGCAGAGGCATTAGAAATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCTTTAG

Coding sequence (CDS)

ATGGCAGATCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGAATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTAAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCACCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATATGTGGGACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCTTTTTCTTTGCAGGATAAAGCACGTGATTGGTTGCAGTCTATTACCCCTGGGAGCATCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAATAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATACGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGACTGGCTTCAGGTTCAATTGTTTTATAATGGTTTAACTCCTAGTACAAAAACTATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAAGCCCAGATGACTTCCCTTGCCAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAACCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAGCTGGAGGAGGCAGTCATTGCCATCAATTCAACAGTGAATGGCCACAGTGCTGCCATAAAGAACATAGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGCACCATGAACAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGATGGAATATTGCAAGGCAATCACTGTTCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACACCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAACAGCCTAACCTTGAGCCTCCTATTCCTTCTCCTACACTGTTGGTTCCTAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAATAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAACATTAATATTCCTTTTGCAGAGGCATTAGAAATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAAGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCTTTAG

Protein sequence

MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLNKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVL
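
The coordinates in the overview and the sequences listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the location uses 1-based inclusive coordinates and that the gene is a single exon whose CDS ends in a stop codon (the gene, mRNA, and CDS sequences listed here are identical, which is consistent with that assumption). The function names are hypothetical and are not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, str]:
    """Parse a location string such as 'chr13: 21088984 .. 21090624 (-)'.

    Returns (chromosome, start, end, strand). The string format is taken
    from this record; other records may be formatted differently.
    """
    m = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if m is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that includes its stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


chrom, start, end, strand = parse_location("chr13: 21088984 .. 21090624 (-)")
gene_span = span_length(start, end)
print(chrom, strand, gene_span, expected_protein_length(gene_span))
```

For the location given in the overview this yields a span of 1641 nt, which under the stated assumptions corresponds to 546 residues plus a stop codon. Comparing that figure with the length of the listed protein sequence is a quick consistency check.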
Homology
BLAST of Lag0041579 vs. NCBI nr
Match: KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])

HSP 1 Score: 479.2 bits (1232), Expect = 4.8e-131
Identity = 270/570 (47.37%), Positives = 371/570 (65.09%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 60
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +D
Sbjct: 31  MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 90

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAF 120
           PN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+ PGSI +W  + + F
Sbjct: 91  PNIHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 150

Query: 121 LNKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 151 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 210

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++A
Sbjct: 211 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHDLEPIAA 270

Query: 241 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 300
           L AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Sbjct: 271 LSAQVATLSHQISALTTQRIPQSTEYLASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 330

Query: 301 PTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKL 360
           P +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K 
Sbjct: 331 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSERKMSLEDAMVSFVQETNARFKKT 390

Query: 361 EEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ- 420
           +  +  I +  +   AAIKNIE Q+GQL   ++   +G  P+  E    E CKAIT+   
Sbjct: 391 DSRLDNIETHCSNMGAAIKNIEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSG 450

Query: 421 EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PPIPSPTLL 480
           +E E  P  E   TPT    G+++     DE     LE             PPI +P L 
Sbjct: 451 KEIERSPLKESKSTPTAVNIGQSKNKVEEDEIVNDTLEETDFAPTISFPDNPPILAPPLP 510

Query: 481 VPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKK 540
            P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++
Sbjct: 511 YPQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEE 570

Query: 541 VDTVYLASTCSTRVQQKVPEKVADPGSFSV 546
            +TV L+  CS  +Q+K+P+K+ DPGSF++
Sbjct: 571 FETVKLSEECSAILQKKLPQKLKDPGSFTL 594

BLAST of Lag0041579 vs. NCBI nr
Match: KAG7947748.1 (hypothetical protein I3843_14G109500 [Carya illinoinensis])

HSP 1 Score: 476.1 bits (1224), Expect = 4.1e-130
Identity = 267/570 (46.84%), Positives = 371/570 (65.09%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 60
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAF 120
           PN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+ PGSI +W  + + F
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 121 LNKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 241 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 300
           L AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300

Query: 301 PTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKL 360
           P +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K 
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 360

Query: 361 EEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ- 420
           +  +  I +  +   A +KN+E Q+GQL   ++   +G  P+  E    E CKAIT+   
Sbjct: 361 DSRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSG 420

Query: 421 EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PPIPSPTLL 480
           +E E  P  E   TPT    G++++    +E     LE             PPI +P L 
Sbjct: 421 KEIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPPLP 480

Query: 481 VPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKK 540
            P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++
Sbjct: 481 YPQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEE 540

Query: 541 VDTVYLASTCSTRVQQKVPEKVADPGSFSV 546
            +TV L+  CS  +Q+K+P+K+ DPGSF++
Sbjct: 541 FETVKLSEECSAILQKKLPQKLKDPGSFTL 564

BLAST of Lag0041579 vs. NCBI nr
Match: KAG6734747.1 (hypothetical protein I3842_01G285500 [Carya illinoinensis])

HSP 1 Score: 473.0 bits (1216), Expect = 3.5e-129
Identity = 266/570 (46.67%), Positives = 370/570 (64.91%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 60
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAF 120
           PN HL  FL+IC TVKINGV+ED IRLRLFPFSL+DKAR WLQS+ PGSI +W  + + F
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 121 LNKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 241 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 300
           L AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300

Query: 301 PTHYHPNNRNHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKL 360
           P +YHP  RNHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K 
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 360

Query: 361 EEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ- 420
           +  +  I +  +   A +KN+E Q+GQL   ++   +G  P+  E    E CKAIT+   
Sbjct: 361 DSRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSG 420

Query: 421 EESEEEPESEDYETPT----GEAEEDTSSDEAEQPNLE-------------PPIPSPTLL 480
           +E E  P  E   TPT    G++++    +E     LE             PPI +P L 
Sbjct: 421 KEIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPPLP 480

Query: 481 VPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKK 540
            P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++
Sbjct: 481 YPQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEE 540

Query: 541 VDTVYLASTCSTRVQQKVPEKVADPGSFSV 546
            +TV L+  CS  +Q+K+P+K+ DP SF++
Sbjct: 541 FETVKLSEECSAILQKKLPQKLKDPESFTL 564

BLAST of Lag0041579 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 467.2 bits (1201), Expect = 1.9e-127
Identity = 261/564 (46.28%), Positives = 360/564 (63.83%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 60
           MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEGEQNAQPRTLKDYVRPIVNDNYSGIRRQTINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAF 120
           PN HL  FL+IC T+K+NGV+ED IRLRLFPFSL+DKAR WLQS+ PGSIT+W  + + F
Sbjct: 61  PNIHLAMFLEICDTIKMNGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSITSWQDMAEKF 120

Query: 121 LNKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFRQNDFESLYEAWERYKDLIRCCPQHGLPDWLQVQMFYNGL 180

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A +LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATSLLEEMASNNYQWPTERTMAKKV-AGIHELEPFAA 240

Query: 241 LQAQMTSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 300
           L AQ+ SL++     +     Q  E  +A+++     E + EQVQY++N N   Y  +  
Sbjct: 241 LSAQVASLSHQVSALTTQRIPQGAEYVAASSMTVPMNEASQEQVQYINNRN-YNYRGNPM 300

Query: 301 PTHYHPNNRNHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEE 360
           P +YHP  RNHENFSY NTKNVL  PPGF  Q  + K  LED + +F+ E+     K + 
Sbjct: 301 PNYYHPGLRNHENFSYGNTKNVLQPPPGFDSQPSEKKMSLEDAMVSFVEETKATFKKSDS 360

Query: 361 AVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ-EE 420
            +  I +  +   A +KN+E Q+GQL   ++   +G  P+  E    E CKAIT+    E
Sbjct: 361 QLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGTFPSNTEVNPKEQCKAITLRSGRE 420

Query: 421 SEEEPESEDYETPTG-------------EAEEDTSSDEAEQPNLEPPIPSPTLLVPKEKK 480
            E  P  E   TPT              E  EDT  +    P++  P   P L  P    
Sbjct: 421 IERSPSKETETTPTAPNNGQSKNKVEEEEIVEDTLRETDMPPSISFPDNPPILSTPLPYP 480

Query: 481 KKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYL 540
           ++ +K+    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L
Sbjct: 481 QRFQKQKLDKQFSKFLDIFKKIHINIPFADALEQMPNYAKFLKDIISKKRRLEEFETVKL 540

Query: 541 ASTCSTRVQQKVPEKVADPGSFSV 546
           +  CS  +Q+K+P+K+ DPGSF++
Sbjct: 541 SEECSAIIQKKLPQKLKDPGSFTL 562

BLAST of Lag0041579 vs. NCBI nr
Match: WP_217833153.1 (retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002])

HSP 1 Score: 455.7 bits (1171), Expect = 5.7e-124
Identity = 245/431 (56.84%), Positives = 301/431 (69.84%), Query Frame = 0

Query: 8   EEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKS 67
           E P+ IRDYFQP     Q GI+  PIN NNFELK GLIQMAR+ A+RG   EDP+ HL+S
Sbjct: 52  EIPKAIRDYFQPTLPASQPGIMNVPINVNNFELKPGLIQMARELAFRGRTNEDPHKHLRS 111

Query: 68  FLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLNKFFPP 127
           FL+ICGTVK+NGVS DAI+LRLFPFSLQD+A+DWL++I P SITTW+ L QAFLNK+FPP
Sbjct: 112 FLEICGTVKMNGVSNDAIKLRLFPFSLQDRAKDWLETIPPDSITTWEILAQAFLNKYFPP 171

Query: 128 AKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTI 187
           AK+ +LRTEIGTF+Q  DEQL+EAWER+K+LLR+CPQHGYPDWLQ+QLFYNGL  STK+I
Sbjct: 172 AKSQRLRTEIGTFRQLEDEQLYEAWERYKDLLRRCPQHGYPDWLQIQLFYNGLASSTKSI 231

Query: 188 VDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPK-KIAAGVFEVDKVSALQAQMT 247
           +DA AGG++ SK  + A T+LED+AT SY WP ER++P    AAG++EVD+V++L+AQM 
Sbjct: 232 LDATAGGSIFSKNAQEAYTILEDLATTSYNWPCERASPNIPKAAGLYEVDEVNSLKAQMA 291

Query: 248 SLANAFMKFSGTGSAQ----SIESAAALASR-PQEETIEQVQYVSNFNSRGYNNSSTPTH 307
           SL NA  K +  G AQ    SI S AALAS        E   YV   + R Y +   PTH
Sbjct: 292 SLTNALSKLTAGGQAQTNPPSIASLAALASEMGVHGDNETANYVDRGHYRNYQHQQLPTH 351

Query: 308 YHPNNRNHENFSYANTKNVLN-PPGF-APQTQDNKKLEDLVGAFIAESSNRTTKLEEAVI 367
           YHPN RNHENFSYAN KNVL  P GF          LED++  F+ ES +RTT LE +V 
Sbjct: 352 YHPNLRNHENFSYANNKNVLQAPQGFNGAGNAKTSSLEDIMLDFVKESRSRTTTLENSVQ 411

Query: 368 AINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEE 427
           AI STV     A++N+E QL Q+   + TM KGK P+  E    E CKA+T+   +    
Sbjct: 412 AIASTVQSQGKALQNLEVQLSQMKTSLQTMQKGKFPSCPEINPREECKAVTLRSGKKLST 471

BLAST of Lag0041579 vs. ExPASy TrEMBL
Match: A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)

HSP 1 Score: 400.6 bits (1028), Expect = 1.1e-107
Identity = 253/576 (43.92%), Positives = 347/576 (60.24%), Query Frame = 0

Query: 7   PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHL 66
           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    + G P++DPNSHL
Sbjct: 53  PEANRALRDYVVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHL 112

Query: 67  KSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLNKFF 126
            +FL+IC T K NGV++DAIRLRLFPFSL+DKA+ WL S+  GSITTW+ L Q FL KFF
Sbjct: 113 VNFLEICDTFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFF 172

Query: 127 PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTK 186
           PPAKT K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S K
Sbjct: 173 PPAKTAKMRNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIK 232

Query: 187 TIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQM 246
           TI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K A G +E+D +  L  Q+
Sbjct: 233 TIIDAAAGGALMSKNAVDAYNLLEEMASNNYQWPSERSGSRK-AVGAYEIDALGTLTTQV 292

Query: 247 TSLANAFMKFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSS 306
            +L+    K   T    +++++  +     +           E VQ+V NFN R  NN  
Sbjct: 293 AALS----KKLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFN-RQQNNPY 352

Query: 307 TPTHYHPNNRNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAFIAES 366
           + T Y+P  RNH NFS++N     N     PPGF     PQ  + K +LE+L+  +I+++
Sbjct: 353 SNT-YNPGWRNHPNFSWSNNAGPSNPKPIMPPGFQQQARPQIPEKKSQLEELLLQYISKT 412

Query: 367 SNRTTKLEEAVIAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ-MEY 426
                         ++ +    A+++N+ETQ+GQL N ++   +G  P++ Q  P+  E 
Sbjct: 413 --------------DAIIQSQGASLRNLETQVGQLANSINNRPQGSLPSDTQINPKGKEQ 472

Query: 427 CKAIT---------VHQEESEEEPESED------YETPTGEAEEDTSSDEAEQPNLEPPI 486
           C+AIT         V+Q+  E E E  D       E    + ++D + ++     + PP 
Sbjct: 473 CQAITLRSGKEIEGVNQKAVESEIEHVDKEGMCENEIEIQQKDDDKAENQGTSQVIHPPP 532

Query: 487 PSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAK 546
           P P  L   +K+K +K      QF KF+N F  L+INIPFAEALE MP Y +F+K+ L+K
Sbjct: 533 PFPQRL---QKQKLEK------QFQKFLNVFKKLHINIPFAEALEQMPSYVKFLKDILSK 592

BLAST of Lag0041579 vs. ExPASy TrEMBL
Match: A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)

HSP 1 Score: 389.0 bits (998), Expect = 3.2e-104
Identity = 238/537 (44.32%), Positives = 320/537 (59.59%), Query Frame = 0

Query: 13  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDIC 72
           IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FLD+C
Sbjct: 27  IRDYCQPNFP-NHVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLTIFLDVC 86

Query: 73  GTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLNKFFPPAKTVK 132
           GTVK+NGV +DAIRLRLFP SLQDK                  +VQAFL  FFPPAKT +
Sbjct: 87  GTVKMNGVIDDAIRLRLFPLSLQDK-----------------EMVQAFLTNFFPPAKTTQ 146

Query: 133 LRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAA 192
           LRTEI +F++   EQLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAA
Sbjct: 147 LRTEIRSFRKYDYEQLFEVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRTILDAAA 206

Query: 193 GGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF 252
           GGTLLS+T ENA  LL+DMA NS+QWPSERS  KK+ AG++E+D++S+L+AQ+ +L NA 
Sbjct: 207 GGTLLSRTPENAYILLKDMADNSFQWPSERSNAKKV-AGMYEIDELSSLKAQVQALTNAV 266

Query: 253 MKFSGTGSAQSIESAAALASRP-QEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEN 312
            K SG G++ S E  AA  +    E TIEQ Q+ S                HP       
Sbjct: 267 SKLSGPGTSHSNELVAATDTYSYYEPTIEQAQFTS----------------HP------- 326

Query: 313 FSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHSAA 372
                              +    LEDL+GAFI E  +R +++E  V  +   + G++ +
Sbjct: 327 ------------------AEKKSSLEDLLGAFINECRSRASRIENQVEGMEVKLEGNTTS 386

Query: 373 IKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGE 432
           IKN+E Q+GQ+   ++TM KGK P++ E    E+CKA+T+   +  +EPE +  E P   
Sbjct: 387 IKNMEVQIGQIAPTLNTMQKGKFPSDIEVKPREHCKAVTLRSGKELQEPEKKKMEEPVIT 446

Query: 433 AEEDTSSDEAEQPNLEPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNIN-IPF 492
            EE  + +E                         K+    +Q DK  ++ ++   N +P+
Sbjct: 447 TEERENKEEV-----------------------VKEATPALQADKPTSSIVSSPPNSLPY 480

Query: 493 AE-ALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSV 546
            + ALE MP Y RFMK+ +  KRK +  +TV L   CS  +Q+K+P+K+ DPGSF++
Sbjct: 507 PQHALEQMPNYVRFMKDIMTGKRKLEAYETVNLTEECSAILQRKLPQKLKDPGSFTI 480

BLAST of Lag0041579 vs. ExPASy TrEMBL
Match: A0A6P6XAQ1 (Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 3.6e-100
Identity = 233/550 (42.36%), Positives = 319/550 (58.00%), Query Frame = 0

Query: 11  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLD 70
           R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL+
Sbjct: 9   RILRDFALPGAQGSQTSIVRPTVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLE 68

Query: 71  ICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLNKFFPPAKT 130
           IC T+K NGVSEDAI+LRLFPFSL+DKA+ WLQS  P + TTWD L +AFLNKFFPP KT
Sbjct: 69  ICDTIKFNGVSEDAIKLRLFPFSLRDKAKVWLQSHPPNTFTTWDELAKAFLNKFFPPGKT 128

Query: 131 VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDA 190
            KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDA
Sbjct: 129 AKLRMDITSFSQQEGETLYEAWERYRELQRRCPHHGLPDWLVVQTFYNGLTYPTKTHVDA 188

Query: 191 AAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLAN 250
           AAGG L+ KT E A+ L+E+MA N+YQW +ER   ++  AG+ EVD ++ L A+M ++  
Sbjct: 189 AAGGALMGKTAEEAQQLIEEMAANNYQWANERGNSRR-TAGMLEVDTLNMLSAKMDNVVK 248

Query: 251 AFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN 310
              +  G+ S Q +  A+        +     + EQVQY++N+N    NN  + T Y+P 
Sbjct: 249 MLNRQVGSSSNQGVVVASCTICGGDHDDFMCSSSEQVQYLNNYNRPPQNNPYSNT-YNPG 308

Query: 311 NRNHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVI 370
            RNH NF +    N +  +NPPGF  +    ++K   +L    +A +SN +  KL  A  
Sbjct: 309 WRNHPNFGWKDQGNQQRPVNPPGFQQKQTLHESKPAWELAIEKLANASNDKIEKLASATT 368

Query: 371 AINSTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEE 430
                + G    +    +N+E QLGQ+ N V+  N+G  P++ E    E+ KAIT+   +
Sbjct: 369 QRFERIEGRMDQLTNMYRNVEVQLGQIANAVNNRNQGDLPSKTEVNPREHVKAITLRSGK 428

Query: 431 SEEEPESEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLLVPKEKKKKKKKKNNQVQFD 490
              EP         G   E    +  +   L+           KE+K K+K + N++Q  
Sbjct: 429 ELVEP------PVVGSGREFEKRENKKLSELKEG--------SKEEKGKEKIEENELQ-- 488

Query: 491 KFMNAFMNLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPE 546
                 M     IP      +P Y +F+KE + KKRK    +T+ L   CS  +Q K+P 
Sbjct: 489 ------MEDATPIP----PPIPSYAKFLKEIMTKKRKLVDSETIALTEECSAIIQNKLPP 530

BLAST of Lag0041579 vs. ExPASy TrEMBL
Match: A0A6P6X688 (uncharacterized protein LOC113739791 OS=Coffea arabica OX=13443 GN=LOC113739791 PE=4 SV=1)

HSP 1 Score: 369.8 bits (948), Expect = 2.0e-98
Identity = 232/567 (40.92%), Positives = 328/567 (57.85%), Query Frame = 0

Query: 13  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDIC 72
           +RD+  P  QG Q+ I    +NAN FE++  LIQM +   Y G+ TED +SHL +F +IC
Sbjct: 11  LRDFVLPGTQGSQTSIARPTVNANKFEIRPSLIQMVQQSQYGGNATEDLDSHLSTFFEIC 70

Query: 73  GTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLNKFFPPAKTVK 132
            T+K NGVS+DAI+ RLFPFSL+DKA+ WLQ  +P + T W  L + FLNKFF P KT K
Sbjct: 71  DTIKFNGVSDDAIKFRLFPFSLRDKAKIWLQFYSPNTFTIWAELAKPFLNKFFAPGKTAK 130

Query: 133 LRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAA 192
            R +I +F QQ +E L+E WER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAA
Sbjct: 131 FRMDITSFSQQEEETLYEVWERYRELQRRCPHHGLPDWLVVQTFYNGLTYPTKTHVDAAA 190

Query: 193 GGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAF 252
           GG L+ KTV+ A+ L+E+MA N+YQW +ER   ++  AG+ EVD ++ L A+M ++    
Sbjct: 191 GGALMRKTVDEAQQLIEEMAANNYQWANERDNTRR-TAGMLEVDTLNMLSAKMDNVVKML 250

Query: 253 MKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPNNR 312
            ++ G+ S + +  A         +     +  QVQY++N+N    NN  + T Y+P  R
Sbjct: 251 NRYVGSTSNRGVVVACCTTCDDDHDDSMCSSSGQVQYLNNYNRPPQNNPYSNT-YNPGWR 310

Query: 313 NHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSNRTTKLEEAVIA-- 372
           NH NF +    N +  +NPP F P+    ++K    L    +A  SN   K+E+ V A  
Sbjct: 311 NHPNFGWKDQGNQQRPVNPPDFQPKQPLPESKPTWKLAIEKLANVSN--DKIEKLVSATT 370

Query: 373 -----INSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEE 432
                I   ++  +   +N+E QLGQ+ NVV+  N+   P++ E    E+ KAIT+  ++
Sbjct: 371 QRFERIEGRMDQLTNMYRNVENQLGQITNVVNNRNQWDLPSKTEVDPKEHVKAITLRSDK 430

Query: 433 S-------EEEPESEDYET-PTGEAEED--------TSSDEAEQPNLEPPIPSPTLLVPK 492
                   E E E E  E     E  ED        T  +   Q     PIP P   VP 
Sbjct: 431 EVGELPVVEHERECERRENKQLSELVEDGKKIKRKETMEENELQMGDTTPIPPP---VPF 490

Query: 493 EKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEA-LEMPQYNRFMKEWLAKKRKEKKVDT 546
            ++ K  K  N  +F+KF+N F  L+INIPF +A L++P Y +F+KE + KKRK    +T
Sbjct: 491 PQRLKPSK--NDKEFEKFVNIFKQLHINIPFVDAILQVPSYAKFLKEIMTKKRKLVDGET 550

BLAST of Lag0041579 vs. ExPASy TrEMBL
Match: A0A3S3N117 (Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae OX=337451 GN=CKAN_01212200 PE=4 SV=1)

HSP 1 Score: 332.4 bits (851), Expect = 3.5e-87
Identity = 207/509 (40.67%), Positives = 291/509 (57.17%), Query Frame = 0

Query: 11  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQM-ARDCAYRGSPTEDPNSHLKSFL 70
           R + DY  P+  G  S I    I ANNFE+K  +IQM A    + G P +DPN+H+ +FL
Sbjct: 44  RSLGDYAVPLVTGATSSIRRPVIQANNFEIKPAIIQMVASTVQFSGLPDDDPNAHISNFL 103

Query: 71  DICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLNKFFPPAK 130
           ++C T K NGV++DA+RLRL PFSL+DKA+ WL S+   +ITTWD L + FL KFFPP K
Sbjct: 104 ELCDTFKYNGVTDDAVRLRLLPFSLRDKAKAWLNSLPQSTITTWDELAKKFLAKFFPPTK 163

Query: 131 TVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVD 190
           TVK+R +I TF Q   E L+EAWER+KELLRKCP HG P W+QVQ FYNGL  +T+T +D
Sbjct: 164 TVKMRNDITTFAQNEMESLYEAWERYKELLRKCPHHGLPLWIQVQTFYNGLQSATRTSID 223

Query: 191 AAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLA 250
           AA GGTL+ K+ E A  L+E+MATN+YQWPS+    KKI  GV E+D +SAL AQ+ +L+
Sbjct: 224 AATGGTLMKKSPEEAYELVEEMATNNYQWPSDHVQQKKI-QGVHELDSISALTAQVANLS 283

Query: 251 NAF--MKFSGTGSAQSIESAAALASRPQE-------ETIEQVQYVSNFNSRGYNNSSTPT 310
                MK     S   +    A      +        + EQV YVSN++ +    S+T  
Sbjct: 284 KQIQSMKVHAVQSTNMVCEFCAGNHMGVDCQVGNPFNSQEQVHYVSNYSRQNNPYSNT-- 343

Query: 311 HYHPNNRNHENFSYANTKN-VLNPPGF-APQTQDNKKLEDLVGAFIAESSNRTTKLEEAV 370
            Y+P  RNH NFS+ N +N    PP F  PQ ++   LE ++  FI++  ++    + A+
Sbjct: 344 -YNPGWRNHPNFSWNNAQNSARQPPRFQQPQQEEKSGLEKMMAQFISKVDSKLQDHDNAL 403

Query: 371 IAINSTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVH------ 430
               + +     AI+NIE  +GQL N+++   +G  P+  E    E  +AIT+       
Sbjct: 404 KCQENELKSQGIAIRNIERTMGQLANMMTERAQGSLPSTTENNPREQVRAITLRSGKELN 463

Query: 431 -----QEESEEEPESEDYETPTGEAEEDTSSDEAEQPNLEPPIPSPTLLVPKEKKKKKKK 490
                + E E+ P     +     +EE    ++   P       +P   VP     ++ +
Sbjct: 464 PNLRAKPEKEQSPHDRKVKVSNQPSEEKKEEEDDFVPCRITFPDNPAPYVPPIPYPQRLR 523

Query: 491 KNN-QVQFDKFMNAFMNLNINIPFAEALE 496
           KN    QF KF++ F  L++NIPFA+ALE
Sbjct: 524 KNKLDKQFAKFLDIFKKLHVNIPFADALE 548
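
The percentages in each HSP summary line follow directly from the counts printed next to them (identical or positive positions divided by the alignment length). The following is a minimal Python sketch that parses such a line and recomputes the percentages, assuming the line layout used in this record. The class and function names are hypothetical, and the pattern also accepts the misspelling "Postives" that some report templates emit.

```python
import re
from dataclasses import dataclass


@dataclass
class HspStats:
    identical: int   # number of identical aligned positions
    positives: int   # number of positively scoring positions
    align_len: int   # alignment length, gaps included

    @property
    def pct_identity(self) -> float:
        return round(100.0 * self.identical / self.align_len, 2)

    @property
    def pct_positives(self) -> float:
        return round(100.0 * self.positives / self.align_len, 2)


# Layout assumed from this record: "Identity = a/n (..%), Positives = b/n (..%)"
_HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> HspStats:
    """Extract identity/positive counts from one HSP summary line."""
    m = _HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, len_a, positives, len_b = map(int, m.groups())
    if len_a != len_b:
        raise ValueError("identity and positives report different lengths")
    return HspStats(identical, positives, len_a)


stats = parse_hsp_stats(
    "Identity = 270/570 (47.37%), Positives = 371/570 (65.09%), Query Frame = 0"
)
print(stats.pct_identity, stats.pct_positives)
```

Recomputing the values this way is a simple guard against transcription errors when the summary figures are copied into other tables.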

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
KAG7990634.1    4.8e-131  47.37         hypothetical protein I3843_02G035100 [Carya illinoinensis]
KAG7947748.1    4.1e-130  46.84         hypothetical protein I3843_14G109500 [Carya illinoinensis]
KAG6734747.1    3.5e-129  46.67         hypothetical protein I3842_01G285500 [Carya illinoinensis]
XP_023874613.1  1.9e-127  46.28         uncharacterized protein LOC111987139 [Quercus suber]
WP_217833153.1  5.7e-124  56.84         retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 70...

Match Name      E-value   Identity (%)  Description
A0A6J0ZX64      1.1e-107  43.92         LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ...
A0A6J1DU19      3.2e-104  44.32         uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6P6XAQ1      3.6e-100  42.36         Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1
A0A6P6X688      2.0e-98   40.92         uncharacterized protein LOC113739791 OS=Coffea arabica OX=13443 GN=LOC113739791 ...
A0A3S3N117      3.5e-87   40.67         Retrotrans_gag domain-containing protein OS=Cinnamomum micranthum f. kanehirae O...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term      Source Description                    Alignment
IPR005162  Retrotransposon gag domain  PFAM         PF03732          Retrotrans_gag                        coord: 89..181; e-value: 1.7E-18; score: 66.6
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 416..441
None       No IPR available            MOBIDB_LITE  mobidb-lite      disorder_prediction                   coord: 412..469
None       No IPR available            PANTHER      PTHR24559:SF334  SUBFAMILY NOT NAMED                   coord: 61..545
None       No IPR available            PANTHER      PTHR24559        TRANSPOSON TY3-I GAG-POL POLYPROTEIN  coord: 61..545

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0041579.1  Lag0041579.1  mRNA