Lag0035150 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0035150
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Reverse transcriptase
Location: chr3: 15767313 .. 15769398 (-)
RNA-Seq Expression: Lag0035150
Synteny: Lag0035150
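
The Location field packs chromosome, start, end, and strand into one string. A minimal sketch of how it could be split apart and the gene span derived follows. The helper `parse_location` is hypothetical (it is not part of the database), and the length assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re

# Hypothetical helper, not part of the database: parses a location value
# laid out as "chrom: start .. end (strand)", as in the Overview above.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Return chromosome, start, end, strand, and span length as a dict."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: coordinates are 1-based and inclusive at both ends.
        "length": end - start + 1,
    }


loc = parse_location("chr3: 15767313 .. 15769398 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 2086 bp, on the minus strand of chr3.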
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGCCTCATTCAGATGGCTCGAGACTGTGCATATAAAGAATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAAATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACTCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACATAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAGTGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACTATTGAACAAGTTCAGTATGTTTCAAATTCGAATTTTAGGGGTATAATAATAATTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCTATCAACACCACGGTGAATGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTAAATGTTGTAAGTACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACGTCAGCAGATGAGGCTGAAAAGCCTAACCTTCAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACTCGAGTCCAACAGAAGGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGCGCTAGCATTAATATCATTCCTCTATCGTTGTGTAAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTAATCAGAGTAGGTAGATTTTTTCTCCCTATTGATTTATATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGAAGATCATTCCTCGCTACTGGGCGAGTGATTATAGATATTGAGTGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACT
CTAAAGATGAAGTGCTTTTCATGGGATATAGGAAAGGTGCAAGAAAGAGCACCTCTTTTGGATTCATAGAACAAAAGCCTCCTTGA

mRNA sequence

ATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGCCTCATTCAGATGGCTCGAGACTGTGCATATAAAGAATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAAATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGCCTTTTTAAAGAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACATAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAGTGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACTATTGAACAAGTTCAGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCTATCAACACCACGGTGAATGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTAAATGTTGTAAGTACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACGTCAGCAGATGAGGCTGAAAAGCCTAACCTTCAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACTCGAGTCCAACAGAAGGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGCGCTAGCATTAATATCATTCCTCTATCGTTGTGTAAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTAATCAGAGTAGGTAGATTTTTTCTCCCTATTGATTTATATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGAAGATCATTCCTCGCTACTGGGCGAGTGATTATAGATATTGAGTGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGATATAGGAAAGGTGCAAGAAAGAGCACCTCTTTTGGATTCATAGAACAAAAGCCTCCTTGA

Coding sequence (CDS)

ATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTATTTTCAGCCTGTGTTTCAGGGGCAACAGTCGGGGATTGTCTATGCACCGATTAATGCCAACAACTTTGAGCTGAAAACCGGCCTCATTCAGATGGCTCGAGACTGTGCATATAAAGAATCACCCACCGAGGATCCAAATTCTCATCTTAAATCATTCTTGGATATTTGTGGGACGGTAAAAATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGCCTTTTTAAAGAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTACACATAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTCGCTAGTGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTGGCATCTAGACCTCAGGAGGAGACTATTGAACAAGTTCAGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAACTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCTATCAACACCACGGTGAATGGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTAAATGTTGTAAGTACCATGAATAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCTCAGATGGAGTACTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGACTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACGTCAGCAGATGAGGCTGAAAAGCCTAACCTTCAGCCTCCTATTCCTTCTCCCACACTGTTGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAGGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAAGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACTCGAGTCCAACAGAAGGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTCTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAGCATTATGTGATTTAGGCGCTAGCATTAATATCATTCCTCTATCGTTGTGTAAAAAGTTAGATATAGGTGAGATTAAGTCTACTCCTGTAAAGCTCCAATTGGCTGATCAATCTGTGGTGAGACCAGTTGGTATTGTAGAAAATGTGTTAATCAGAGTAGGTAGATTTTTTCTCCCTATTGATTTATATGTTATGGACATGATAGAAAATCCTTCAATGCCTGTCATATTAGGAAGATCATTCCTCGCTACTGGGCGAGTGATTATAGATATTGAGTGCAGGGAGCTCACTATTAGAGTCAAGAACGAAAAAGAAATCTTTAAAGCAGTTGAAGACTCTAAAGATGAAGTGCTTTTCATGGGATATAGGAAAGGTGCAAGAAAGAGCACCTCTTTTGGATTCATAGAACAAAAGCCTCCTTGA

Protein sequence

MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFKEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAALASRPQEETIEQVQNHENFSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECRELTIRVKNEKEIFKAVEDSKDEVLFMGYRKGARKSTSFGFIEQKPP
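
The protein sequence above is the conceptual translation of the CDS. As a rough illustration, the sketch below translates a nucleotide string with the standard genetic code (NCBI translation table 1). Using the standard table for this nuclear gene is an assumption, and `translate` is a hypothetical helper rather than anything the database provides.

```python
# Build the standard genetic code (NCBI translation table 1) from the
# conventional TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 18 bases of the CDS listed above.
print(translate("ATGGCTGACCAAAATCCACCTGAGGAGCCT"))
print(translate("GAACAAAAGCCTCCTTGA"))
```

The two fragments give `MADQNPPEEP` and `EQKPP`, which agree with the start and end of the protein sequence shown on this page.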
Homology
BLAST of Lag0035150 vs. NCBI nr
Match: KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])

HSP 1 Score: 515.4 bits (1326), Expect = 7.1e-142
Identity = 302/688 (43.90%), Positives = 421/688 (61.19%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTED 60
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   +  SP +D
Sbjct: 31  MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 90

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLF------------------------------ 120
           PN HL  FL+IC TVKINGV+ED IRLRLF                              
Sbjct: 91  PNIHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 150

Query: 121 -KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
             +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 151 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 210

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++A
Sbjct: 211 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHDLEPIAA 270

Query: 241 LQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ--------------- 300
           L AQ+ +L+      +     QS E  ++ ++     E + EQVQ               
Sbjct: 271 LSAQVATLSHQISALTTQRIPQSTEYLASTSMIVPSNEASQEQVQYVNNRNYNYRGNPMP 330

Query: 301 --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLE 360
                   NHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +
Sbjct: 331 NYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSERKMSLEDAMVSFVQETNARFKKTD 390

Query: 361 EAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ-E 420
             +  I T  +   AAIKNIE Q+GQL   ++   +G  P+  E    E CKAIT+   +
Sbjct: 391 SRLDNIETHCSNMGAAIKNIEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSGK 450

Query: 421 ESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLV 480
           E E  P  E   TPT    G+++     DE     L+             PPI +P L  
Sbjct: 451 EIERSPLKESKSTPTAVNIGQSKNKVEEDEIVNDTLEETDFAPTISFPDNPPILAPPLPY 510

Query: 481 PKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKV 540
           P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ 
Sbjct: 511 PQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEEF 570

Query: 541 DTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL 600
           +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+C+KL
Sbjct: 571 ETVKLSEECSAILQKKLPQKLKDPGSFTLPCTIGDSFFDKVLCDLGASINLMPLSVCRKL 630

Query: 601 DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRS 609
            + E+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR 
Sbjct: 631 GLEEMKPTTISLQLADRSIKYPRGIIEDVLVKVDKFIFPADFVVLDMEEDEEVPLILGRP 690
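
Each HSP header reports identity and positives as a count over the alignment length, followed by a percentage. The sketch below shows one way those percentages could be re-derived from the counts. `hsp_percentages` is a hypothetical helper, and the pattern is written only for the header layout seen on this page; it also accepts the spelling "Postives".

```python
import re

# Matches the statistics line of an HSP header as laid out on this page.
# "Posi?tives" also accepts the spelling "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Recompute identity and positives percentages from their counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the header, kept for comparison.
        "reported": (float(ident_pct), float(pos_pct)),
    }


stats = hsp_percentages(
    "Identity = 302/688 (43.90%), Positives = 421/688 (61.19%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the first hit (KAG7990634.1), 302/688 and 421/688 round to 43.9 and 61.19, matching the reported values.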

BLAST of Lag0035150 vs. NCBI nr
Match: KAG7947748.1 (hypothetical protein I3843_14G109500 [Carya illinoinensis])

HSP 1 Score: 513.1 bits (1320), Expect = 3.5e-141
Identity = 300/694 (43.23%), Positives = 422/694 (60.81%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTED 60
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   +  SP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLF------------------------------ 120
           PN HL  FL+IC TVKINGV+ED IRLRLF                              
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 121 -KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
             +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 241 LQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ--------------- 300
           L AQ+ +L+      +     QS E  ++ ++     E + EQVQ               
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRNYNYRGNPMP 300

Query: 301 --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLE 360
                   NHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +
Sbjct: 301 NYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKTD 360

Query: 361 EAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ-E 420
             +  I T  +   A +KN+E Q+GQL   ++   +G  P+  E    E CKAIT+   +
Sbjct: 361 SRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSGK 420

Query: 421 ESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLV 480
           E E  P  E   TPT    G++++    +E     L+             PPI +P L  
Sbjct: 421 EIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPPLPY 480

Query: 481 PKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKV 540
           P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ 
Sbjct: 481 PQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEEF 540

Query: 541 DTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL 600
           +TV L+  CS  +Q+K+P+K+ DPGSF++PC+ G   F R LCDLGASIN++P S+C+KL
Sbjct: 541 ETVKLSEECSAILQKKLPQKLKDPGSFTLPCTIGDSFFDRVLCDLGASINLMPFSVCRKL 600

Query: 601 DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRS 615
            +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR 
Sbjct: 601 GLGEMKHTTISLQLADRSIKYPRGIIEDVLVKVDKFIFPADFVVLDMEEDEDVPLILGRP 660

BLAST of Lag0035150 vs. NCBI nr
Match: KAG6734747.1 (hypothetical protein I3842_01G285500 [Carya illinoinensis])

HSP 1 Score: 507.7 bits (1306), Expect = 1.5e-139
Identity = 298/694 (42.94%), Positives = 420/694 (60.52%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTED 60
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   +  SP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLF------------------------------ 120
           PN HL  FL+IC TVKINGV+ED IRLRLF                              
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 121 -KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
             +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 241 LQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ--------------- 300
           L AQ+ +L+      +     QS E  ++ ++     E + EQVQ               
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRNYNYRGNPMP 300

Query: 301 --------NHENFSYANTKNVL---NPPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLE 360
                   NHEN SY NTKNVL   +PPGF  Q  + K  LED + +F+ E++ R  K +
Sbjct: 301 NYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKTD 360

Query: 361 EAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ-E 420
             +  I T  +   A +KN+E Q+GQL   ++   +G  P+  E    E CKAIT+   +
Sbjct: 361 SRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSGK 420

Query: 421 ESEEEPESEDYETPT----GEAEEDTSADEAEKPNLQ-------------PPIPSPTLLV 480
           E E  P  E   TPT    G++++    +E     L+             PPI +P L  
Sbjct: 421 EIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPPLPY 480

Query: 481 PKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKV 540
           P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ 
Sbjct: 481 PQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEEF 540

Query: 541 DTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKL 600
           +TV L+  CS  +Q+K+P+K+ DP SF++PC+ G   F R LCDLGASIN++P  +C+KL
Sbjct: 541 ETVKLSEECSAILQKKLPQKLKDPESFTLPCTIGDSFFDRVLCDLGASINLMPFFVCRKL 600

Query: 601 DIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRS 615
            +GE+K T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR 
Sbjct: 601 GLGEMKHTTISLQLADRSIKYPRGIIEDVLVKVDKFIFPADFVVLDMEEDEDVPLILGRP 660

BLAST of Lag0035150 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 500.4 bits (1287), Expect = 2.4e-137
Identity = 294/688 (42.73%), Positives = 409/688 (59.45%), Query Frame = 0

Query: 1   MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTED 60
           MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   +  SP +D
Sbjct: 1   MAEGEQNAQPRTLKDYVRPIVNDNYSGIRRQTINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 61  PNSHLKSFLDICGTVKINGVSEDAIRLRLF------------------------------ 120
           PN HL  FL+IC T+K+NGV+ED IRLRLF                              
Sbjct: 61  PNIHLAMFLEICDTIKMNGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSITSWQDMAEKF 120

Query: 121 -KEFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGL 180
             +FFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFRQNDFESLYEAWERYKDLIRCCPQHGLPDWLQVQMFYNGL 180

Query: 181 TPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSA 240
              T+TIVDAA+GGTL+SKT E A +LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATSLLEEMASNNYQWPTERTMAKKV-AGIHELEPFAA 240

Query: 241 LQAQMTSLASAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQ--------------- 300
           L AQ+ SL+      +     Q  E  +A+++     E + EQVQ               
Sbjct: 241 LSAQVASLSHQVSALTTQRIPQGAEYVAASSMTVPMNEASQEQVQYINNRNYNYRGNPMP 300

Query: 301 --------NHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEA 360
                   NHENFSY NTKNVL  PPGF  Q  + K  LED + +F+ E+     K +  
Sbjct: 301 NYYHPGLRNHENFSYGNTKNVLQPPPGFDSQPSEKKMSLEDAMVSFVEETKATFKKSDSQ 360

Query: 361 VIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQ-EES 420
           +  I T  +   A +KN+E Q+GQL   ++   +G  P+  E    E CKAIT+    E 
Sbjct: 361 LDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGTFPSNTEVNPKEQCKAITLRSGREI 420

Query: 421 EEEPESEDYETPTG-------------EAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKK 480
           E  P  E   TPT              E  EDT  +    P++  P   P L  P    +
Sbjct: 421 ERSPSKETETTPTAPNNGQSKNKVEEEEIVEDTLRETDMPPSISFPDNPPILSTPLPYPQ 480

Query: 481 KKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLA 540
           + +K+    QF KF++ F  ++INIPFA+ALE MP Y +F+K+ ++KKR+ ++ +TV L+
Sbjct: 481 RFQKQKLDKQFSKFLDIFKKIHINIPFADALEQMPNYAKFLKDIISKKRRLEEFETVKLS 540

Query: 541 STCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIK 600
             CS  +Q+K+P+K+ DPGSF++PC+ G   F + LCDLGASIN++PLS+ +KL +GE+K
Sbjct: 541 EECSAIIQKKLPQKLKDPGSFTLPCTIGNSFFDKVLCDLGASINLMPLSVYRKLGLGEMK 600

Query: 601 STPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGR 615
            T + LQLAD+S+  P GI+E+VL++V +F  P D  V+DM E+  +P+ILGR FLATGR
Sbjct: 601 QTTISLQLADRSIKYPRGIIEDVLVKVDKFIFPADFVVLDMEEDQEVPLILGRPFLATGR 660

BLAST of Lag0035150 vs. NCBI nr
Match: XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])

HSP 1 Score: 485.0 bits (1247), Expect = 1.0e-132
Identity = 289/663 (43.59%), Positives = 398/663 (60.03%), Query Frame = 0

Query: 5   NPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSH 64
           N   + R IRDY +PV     SGI    I A NFELK GLI M +   +  +  EDPN+H
Sbjct: 136 NEDNQQRAIRDYIRPVVNDNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAH 195

Query: 65  LKSFLDICGTVKINGVSEDAIRLRLFK-------------------------------EF 124
           L SFL+IC TVK+NGV+EDAIRLRLF                                ++
Sbjct: 196 LGSFLEICDTVKMNGVTEDAIRLRLFSFSLRDKAKAWFQSLPYGSITTWDDLAQKFLTKY 255

Query: 125 FPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPST 184
           FPP+K+ +LR EI  F+Q   E  +EAWERFK+LLR+CPQHG+  W+Q+++FYNGL   T
Sbjct: 256 FPPSKSAQLRGEISQFKQLDFEPFYEAWERFKDLLRRCPQHGFQKWVQIEIFYNGLNGQT 315

Query: 185 KTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQ 244
           +T+VDAAAGG L++KT E A  LL+D+ATNSYQWPSERS  KK+ AG+ EVD ++AL AQ
Sbjct: 316 RTMVDAAAGGILMAKTAEAAYALLDDIATNSYQWPSERSGVKKV-AGLHEVDPITALAAQ 375

Query: 245 MTSLASAFMKFSGTGSAQSIESAAALASRPQEETI--EQVQ------------------- 304
           + SL +  +  +  G+ Q+++S  + +S  QE  +  EQVQ                   
Sbjct: 376 VASLTNQIVMLTTQGNQQNVDSVISTSSSHQETEVANEQVQYIDSRNYNQRGGYQANHYH 435

Query: 305 ----NHENFSYANTKNVLN-PPGFAPQTQDNK-KLEDLVGAFIAESSNRTTKLEEAVIAI 364
               NHEN SY N +N L  PPGF  Q  D K  LED++G FI+E+ +R  K E  +  I
Sbjct: 436 PGLRNHENLSYGNNRNTLQPPPGFNTQNSDGKPPLEDILGTFISETRSRFNKNELRLDNI 495

Query: 365 NTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPE 424
            T V+   A +KN+E Q+GQL  ++ +  KGK P++ E    E+C AIT+   +  EE +
Sbjct: 496 ETHVSKIGATMKNLEVQIGQLATLMKSQQKGKFPSDTEVNPREHCNAITLRSGKMVEESK 555

Query: 425 SEDYETP------TGEAEEDTSADEAEKPNL----------QPPIPSPTLLVPKEKKKKK 484
            +    P      T E + +    EAE   +           PPI  P L  P+   KKK
Sbjct: 556 PKKIMVPTPDVIVTDERQSERQKTEAEGTKIYKPYSISFPDNPPILKPPLPFPQRFMKKK 615

Query: 485 KKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLAST 544
                  QF KF+  F  ++INIPFAE L +MP Y +F+KE ++ K+K ++ +T+ L   
Sbjct: 616 FDD----QFAKFLEVFKKIHINIPFAETLAQMPNYAKFLKEVMSNKKKLEEFETIKLTEG 675

Query: 545 CSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKST 592
           CS  + QK+P K+ DPGSF++PC+ G  +F RALCD GASIN++PLS+ KKL +GE+K T
Sbjct: 676 CSD-ILQKLPHKLKDPGSFNIPCNIGGITFDRALCDFGASINLMPLSVFKKLGLGEVKPT 735

BLAST of Lag0035150 vs. ExPASy TrEMBL
Match: A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 6.5e-117
Identity = 264/613 (43.07%), Positives = 361/613 (58.89%), Query Frame = 0

Query: 13  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLDIC 72
           IRDY QP F     GI+  PINANN ELK GLIQM R+  ++ + TEDPN+HL  FLD+C
Sbjct: 27  IRDYCQPNFP-NHVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLTIFLDVC 86

Query: 73  GTVKINGVSEDAIRLRLF--------------KEFFPPAKTVKLRTEIGTFQQQYDEQLF 132
           GTVK+NGV +DAIRLRLF                FFPPAKT +LRTEI +F++   EQLF
Sbjct: 87  GTVKMNGVIDDAIRLRLFPLSLQDKEMVQAFLTNFFPPAKTTQLRTEIRSFRKYDYEQLF 146

Query: 133 EAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLE 192
           E WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+T ENA  LL+
Sbjct: 147 EVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRTILDAAAGGTLLSRTPENAYILLK 206

Query: 193 DMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLASAFMKFSGTGSAQSIESAAA 252
           DMA NS+QWPSERS  KK+ AG++E+D++S+L+AQ+ +L +A  K SG G++ S E  AA
Sbjct: 207 DMADNSFQWPSERSNAKKV-AGMYEIDELSSLKAQVQALTNAVSKLSGPGTSHSNELVAA 266

Query: 253 LASRP-QEETIEQVQNHENFSYANTKNVLNPPGFAPQTQDNK-KLEDLVGAFIAESSNRT 312
             +    E TIEQ Q                  F     + K  LEDL+GAFI E  +R 
Sbjct: 267 TDTYSYYEPTIEQAQ------------------FTSHPAEKKSSLEDLLGAFINECRSRA 326

Query: 313 TKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITV 372
           +++E  V  +   + G++ +IKN+E Q+GQ+   ++TM KGK P++ E    E+CKA+T+
Sbjct: 327 SRIENQVEGMEVKLEGNTTSIKNMEVQIGQIAPTLNTMQKGKFPSDIEVKPREHCKAVTL 386

Query: 373 HQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQ 432
              +  +EPE +  E P    EE  + +E  K        +P L   K          N 
Sbjct: 387 RSGKELQEPEKKKMEEPVITTEERENKEEVVKE------ATPALQADKPTSSIVSSPPNS 446

Query: 433 VQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQ 492
           + + +                ALE MP Y RFMK+ +  KRK +  +TV L   CS  +Q
Sbjct: 447 LPYPQ---------------HALEQMPNYVRFMKDIMTGKRKLEAYETVNLTEECSAILQ 506

Query: 493 QKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQL 552
           +K+P+K+ DPGSF++PC+  + SF +ALCD+ ASIN++PL                    
Sbjct: 507 RKLPQKLKDPGSFTIPCTISSSSFNKALCDICASINLMPL-------------------- 566

Query: 553 ADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLATGRVIIDIECR 608
                    G++E+VL++V R   P D  V+   E+  +P+ILGR FLATG  +ID++  
Sbjct: 567 ---------GVIEDVLVKVDRLIFPADFVVLXXEEDSEIPIILGRXFLATGXALIDVQLG 569

BLAST of Lag0035150 vs. ExPASy TrEMBL
Match: A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 3.6e-115
Identity = 278/695 (40.00%), Positives = 387/695 (55.68%), Query Frame = 0

Query: 7   PEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YKESPTEDPNSHL 66
           PE  R +RDY  P+ QG    I    INANNFE+K   IQM +    +   P++DPNSHL
Sbjct: 53  PEANRALRDYVVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHL 112

Query: 67  KSFLDICGTVKINGVSEDAIRLRLF-------------------------------KEFF 126
            +FL+IC T K NGV++DAIRLRLF                                +FF
Sbjct: 113 VNFLEICDTFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFF 172

Query: 127 PPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTK 186
           PPAKT K+R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQVQ FYNGL  S K
Sbjct: 173 PPAKTAKMRNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIK 232

Query: 187 TIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQM 246
           TI+DAAAGG L+SK   +A  LLE+MA+N+YQWPSERS  +K A G +E+D +  L  Q+
Sbjct: 233 TIIDAAAGGALMSKNAVDAYNLLEEMASNNYQWPSERSGSRK-AVGAYEIDALGTLTTQV 292

Query: 247 TSLASAFMKFS-------------------------GTGSAQSIESAAALASRPQEETIE 306
            +L+                                 + S Q + +     + P   T  
Sbjct: 293 AALSKKLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFNRQQNNPYSNTYN 352

Query: 307 Q-VQNHENFSYANTKNVLN-----PPGF----APQTQDNK-KLEDLVGAFIAESSNRTTK 366
              +NH NFS++N     N     PPGF     PQ  + K +LE+L+  +I+++      
Sbjct: 353 PGWRNHPNFSWSNNAGPSNPKPIMPPGFQQQARPQIPEKKSQLEELLLQYISKT------ 412

Query: 367 LEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ-MEYCKAIT- 426
                   +  +    A+++N+ETQ+GQL N ++   +G  P++ Q  P+  E C+AIT 
Sbjct: 413 --------DAIIQSQGASLRNLETQVGQLANSINNRPQGSLPSDTQINPKGKEQCQAITL 472

Query: 427 --------VHQEESEEEPESED------YETPTGEAEEDTSADEAEKPNLQPPIPSPTLL 486
                   V+Q+  E E E  D       E    + ++D + ++     + PP P P  L
Sbjct: 473 RSGKEIEGVNQKAVESEIEHVDKEGMCENEIEIQQKDDDKAENQGTSQVIHPPPPFPQRL 532

Query: 487 VPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWLAKKRKEKK 546
              +K+K +K      QF KF+N F  L+INIPFAEALE MP Y +F+K+ L+KKRK  +
Sbjct: 533 ---QKQKLEK------QFQKFLNVFKKLHINIPFAEALEQMPSYVKFLKDILSKKRKLGE 592

Query: 547 VDTVYLASTCSTRVQQKVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKK 606
            +TV+L   CS  +Q K+P K+ DPGSF++PC+ G   F +AL DLGASIN++P S+ +K
Sbjct: 593 FETVFLTEECSAILQNKLPPKLKDPGSFTIPCTIGNLFFTKALSDLGASINLMPWSIFEK 652

Query: 607 LDIGEIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGR 615
           L +GE K T V LQLAD+S V P GI+E+VL++V +F  P+D  ++DM E+  +P+ILGR
Sbjct: 653 LGLGECKPTSVTLQLADRSYVYPRGIIEDVLVKVDKFIFPVDFLILDMEEDRQIPIILGR 712

BLAST of Lag0035150 vs. ExPASy TrEMBL
Match: A0A6P6XAQ1 (Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 6.1e-107
Identity = 256/667 (38.38%), Positives = 365/667 (54.72%), Query Frame = 0

Query: 11  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLD 70
           R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y  + TEDPNSHL +FL+
Sbjct: 9   RILRDFALPGAQGSQTSIVRPTVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLE 68

Query: 71  ICGTVKINGVSEDAIRLRLF-------------------------------KEFFPPAKT 130
           IC T+K NGVSEDAI+LRLF                                +FFPP KT
Sbjct: 69  ICDTIKFNGVSEDAIKLRLFPFSLRDKAKVWLQSHPPNTFTTWDELAKAFLNKFFPPGKT 128

Query: 131 VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDA 190
            KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDA
Sbjct: 129 AKLRMDITSFSQQEGETLYEAWERYRELQRRCPHHGLPDWLVVQTFYNGLTYPTKTHVDA 188

Query: 191 AAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLAS 250
           AAGG L+ KT E A+ L+E+MA N+YQW +ER   ++  AG+ EVD ++ L A+M ++  
Sbjct: 189 AAGGALMGKTAEEAQQLIEEMAANNYQWANERGNSRR-TAGMLEVDTLNMLSAKMDNVVK 248

Query: 251 AFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQ---------------------- 310
              +  G+ S Q +  A+        +     + EQVQ                      
Sbjct: 249 MLNRQVGSSSNQGVVVASCTICGGDHDDFMCSSSEQVQYLNNYNRPPQNNPYSNTYNPGW 308

Query: 311 -NHENFSY---ANTKNVLNPPGFAPQ--TQDNKKLEDLVGAFIAESSN-RTTKLEEAVIA 370
            NH NF +    N +  +NPPGF  +    ++K   +L    +A +SN +  KL  A   
Sbjct: 309 RNHPNFGWKDQGNQQRPVNPPGFQQKQTLHESKPAWELAIEKLANASNDKIEKLASATTQ 368

Query: 371 INTTVNGHSAAI----KNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEES 430
               + G    +    +N+E QLGQ+ N V+  N+G  P++ E    E+ KAIT+   + 
Sbjct: 369 RFERIEGRMDQLTNMYRNVEVQLGQIANAVNNRNQGDLPSKTEVNPREHVKAITLRSGKE 428

Query: 431 EEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKKKKKKKNNQVQFDK 490
             EP         G   E    +  +   L+           KE+K K+K + N++Q   
Sbjct: 429 LVEP------PVVGSGREFEKRENKKLSELKEG--------SKEEKGKEKIEENELQ--- 488

Query: 491 FMNAFMNLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEK 550
                M     IP      +P Y +F+KE + KKRK    +T+ L   CS  +Q K+P K
Sbjct: 489 -----MEDATPIP----PPIPSYAKFLKEIMTKKRKLVDSETIALTEECSAIIQNKLPPK 548

Query: 551 VADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIGEIKSTPVKLQLADQSVV 608
           + DPGSF+VPC+ G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+ 
Sbjct: 549 LKDPGSFTVPCTIGNVEFSKALCDLGASVSLIPLTVARQLGLKELKRTNISLQLADRSIR 608

BLAST of Lag0035150 vs. ExPASy TrEMBL
Match: A0A6P8DKJ2 (uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231 PE=4 SV=1)

HSP 1 Score: 365.2 bits (936), Expect = 5.8e-97
Identity = 246/691 (35.60%), Positives = 361/691 (52.24%), Query Frame = 0

Query: 11  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLD 70
           R +RDY  P   G  S I    I ANNFELK  LIQM +   +   P E P+ H+  FL 
Sbjct: 158 RALRDYAVPTIMG--SAIRRPTIPANNFELKPALIQMVQSNQFGGYPNESPDEHIAGFLQ 217

Query: 71  ICGTVKINGVSEDAIRLRLF-------------------------------KEFFPPAKT 130
            C TVK+N V++D IRL+LF                               + FFPPA+T
Sbjct: 218 YCNTVKMNNVTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPART 277

Query: 131 VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDA 190
            +LR EI  F +   E L+EAWERFKE +RKCP HG PD L +++FY  L  + +++VDA
Sbjct: 278 ARLRNEITNFTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVDA 337

Query: 191 AAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLAS 250
           AAGG L+ K  + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  Q+++L +
Sbjct: 338 AAGGALMGKNYDEASALIEEMASSAHNWQNERS--KSRVASVNDMDTIANLTTQISALTT 397

Query: 251 AFMKFSG---------------TGSAQSIESAAALAS-RPQEETIEQV------------ 310
              K +                +G   ++E  +   S  P  E +  V            
Sbjct: 398 QVSKLTSAHSFNTNQVAFCELCSGPHSTLECMSGNPSASPNGEQVNFVNNFQRSNQGPYS 457

Query: 311 -------QNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDLVGAFIAESSNR 370
                  +NH NFS+ N  N L  PPGF        AP  Q   ++E+L+ +++ ++   
Sbjct: 458 NTYNPGWRNHPNFSWRNENNALKPPPGFQKQGPAQNAPPQQSQSRMEELMLSYMQKT--- 517

Query: 371 TTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ------M 430
                      +T +    A I+N+E Q+ Q+   +S    G  P+  +E P+      +
Sbjct: 518 -----------DTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTEENPKGVNAIML 577

Query: 431 EYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKK 490
              K + +   +++ + ES + +    + EE        KP + PP+P P  L       
Sbjct: 578 RSGKELEIVNRKAQTQEESPEKDKGKQKVEEPRRKSLGVKPYV-PPVPFPGRL------- 637

Query: 491 KKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLA 550
             K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L 
Sbjct: 638 --KQQQLDAQFAKFLDVFKKLQINIPFAEALQQMPSYARFMKDLLTKKRKFDGSEPVMLT 697

Query: 551 STCSTRVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIG 610
             CS  +Q+    +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +G
Sbjct: 698 GECSMILQKDLPNLPRKQRDQGSFTVPCTIGNFHFENVLIDSGASINLMPLSIFRKLGLG 757

Query: 611 EIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLA 615
           E K T + LQLAD+S+  P GIVENVL++V +F  P+D  V++M E+  +P+ILGR FLA
Sbjct: 758 ECKKTHITLQLADRSIKYPKGIVENVLVKVDKFIFPVDFIVLEMEEDREVPMILGRPFLA 817

BLAST of Lag0035150 vs. ExPASy TrEMBL
Match: A0A6P8DD93 (uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453 PE=4 SV=1)

HSP 1 Score: 365.2 bits (936), Expect = 5.8e-97
Identity = 246/691 (35.60%), Positives = 362/691 (52.39%), Query Frame = 0

Query: 11  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYKESPTEDPNSHLKSFLD 70
           R +RDY  P   G  S I    I ANNFELK  LIQM +   +   P E P+ H+  FL 
Sbjct: 52  RALRDYAVPTIMG--SAIRRPTIPANNFELKPALIQMVQSNQFGGYPNESPDEHIAGFLQ 111

Query: 71  ICGTVKINGVSEDAIRLRLF-------------------------------KEFFPPAKT 130
            C TVK+N V++D IRL+LF                               + FFPPA+T
Sbjct: 112 YCNTVKMNNVTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPART 171

Query: 131 VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDA 190
            +LR EI  F +   E L+EAWERFKE +RKCP HG PD L +++FY  L  + +++VDA
Sbjct: 172 ARLRNEITNFTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVDA 231

Query: 191 AAGGTLLSKTVENARTLLEDMATNSYQWPSERSTHKKIAAGVFEVDKVSALQAQMTSLAS 250
           AAGG L+ K  + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  Q+++L +
Sbjct: 232 AAGGALMGKNYDEASALIEEMASSAHNWQNERS--KSRVASVNDMDTIANLTTQISALTT 291

Query: 251 AFMKFSG---------------TGSAQSIESAAALAS-RPQEETIEQV------------ 310
              K +                +G   ++E  +   S  P  E +  V            
Sbjct: 292 QVSKLTSAHSFNTNQVAFCELCSGPHSTLECMSGNPSASPNGEQVNFVNNFQRSNQGPYS 351

Query: 311 -------QNHENFSYANTKNVLN-PPGF--------APQTQDNKKLEDLVGAFIAESSNR 370
                  +NH NFS+ N  N L  PPGF        AP  Q   ++E+L+ +++ ++   
Sbjct: 352 NTYNPGWRNHPNFSWRNENNALKPPPGFQKQGPAQNAPPQQSQSRMEELMLSYMQKT--- 411

Query: 371 TTKLEEAVIAINTTVNGHSAAIKNIETQLGQLVNVVSTMNKGKAPAE-QEKPQ------M 430
                      +T +    A I+N+E Q+ Q+   +S    G  P+  +E P+      +
Sbjct: 412 -----------DTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTEENPKGVNAIML 471

Query: 431 EYCKAITVHQEESEEEPESEDYETPTGEAEEDTSADEAEKPNLQPPIPSPTLLVPKEKKK 490
              K + +   +++ + ES + +    + EE        KP + PP+P P         +
Sbjct: 472 RSGKELEIVNRKAQTQEESPEKDKGKQKVEEPRQKSLGVKPYV-PPVPFP---------R 531

Query: 491 KKKKKNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKEKKVDTVYLA 550
           + K++    QF KF++ F  L INIPFAEAL +MP Y RFMK+ L KKRK    + V L 
Sbjct: 532 RLKQQQLDAQFAKFLDVFKKLQINIPFAEALQQMPSYARFMKDLLTKKRKFDGSEPVMLT 591

Query: 551 STCSTRVQQ---KVPEKVADPGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLDIG 610
             CS  +Q+    +P K  D GSF+VPC+ G + F   L D GASIN++PLS+ +KL +G
Sbjct: 592 GECSMILQKDLPNLPRKQRDQGSFTVPCTIGNFHFENVLIDSGASINLMPLSIFRKLGLG 651

Query: 611 EIKSTPVKLQLADQSVVRPVGIVENVLIRVGRFFLPIDLYVMDMIENPSMPVILGRSFLA 615
           E K T V LQLAD+S+  P GIVENVL++V +F  P+D  V++M E+  +P+ILGR FLA
Sbjct: 652 ECKKTHVTLQLADRSIKYPKGIVENVLVKVDKFIFPVDFIVLEMEEDREVPMILGRPFLA 711
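
The percentages on the HSP statistics line follow directly from the residue counts (matches divided by alignment length). A minimal sketch of that arithmetic, assuming the statistics line is available as plain text; the helper name `hsp_percentages` is hypothetical and not part of the database:

```python
import re

# Hypothetical helper (not part of the database page): recompute the
# percentages reported on a BLAST HSP statistics line from its raw counts.
HSP_STATS = re.compile(r"(Identity|Positives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Return {field name: percentage rounded to two decimals}."""
    return {
        name: round(100 * int(matches) / int(length), 2)
        for name, matches, length in HSP_STATS.findall(line)
    }


# Statistics line for the A0A6P8DD93 match shown above.
line = "Identity = 246/691 (35.60%), Positives = 362/691 (52.39%), Query Frame = 0"
print(hsp_percentages(line))
```

The recomputed values should agree with the percentages in parentheses on the statistics line.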

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
KAG7990634.1 | 7.1e-142 | 43.90 | hypothetical protein I3843_02G035100 [Carya illinoinensis]
KAG7947748.1 | 3.5e-141 | 43.23 | hypothetical protein I3843_14G109500 [Carya illinoinensis]
KAG6734747.1 | 1.5e-139 | 42.94 | hypothetical protein I3842_01G285500 [Carya illinoinensis]
XP_023874613.1 | 2.4e-137 | 42.73 | uncharacterized protein LOC111987139 [Quercus suber]
XP_022843226.1 | 1.0e-132 | 43.59 | uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]
Match Name | E-value | Identity | Description
A0A6J1DU19 | 6.5e-117 | 43.07 | uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J0ZX64 | 3.6e-115 | 40.00 | LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ...
A0A6P6XAQ1 | 6.1e-107 | 38.38 | Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1
A0A6P8DKJ2 | 5.8e-97 | 35.60 | uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231...
A0A6P8DD93 | 5.8e-97 | 35.60 | uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR005162 | Retrotransposon gag domain | PFAM | PF03732 | Retrotrans_gag | coord: 86..150; e-value: 2.4E-7; score: 30.9
IPR021109 | Aspartic peptidase domain superfamily | GENE3D | 2.40.70.10 | Acid Proteases | coord: 467..608; e-value: 8.5E-29; score: 102.0
IPR021109 | Aspartic peptidase domain superfamily | SUPERFAMILY | 50630 | Acid proteases | coord: 485..590
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 361..383
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 357..414
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 383..607; coord: 88..197
None | No IPR available | PANTHER | PTHR24559:SF334 | SUBFAMILY NOT NAMED | coord: 383..607; coord: 88..197
None | No IPR available | CDD | cd00303 | retropepsin_like | coord: 490..583; e-value: 2.16933E-18; score: 78.5323
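
The `coord: start..end` entries in the Alignment column can be compared to see how the predicted domains relate to one another. A minimal sketch, assuming the coordinates are 1-based and inclusive protein positions (an assumption, not stated on this page); the helper names are hypothetical:

```python
import re

# Hypothetical helpers for the "coord: start..end" entries above.
# Assumption: coordinates are 1-based, inclusive protein positions.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 86..150'."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Number of residues covered by an inclusive range."""
    return end - start + 1


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


gene3d = parse_coord("coord: 467..608")  # GENE3D 2.40.70.10, Acid Proteases
cdd = parse_coord("coord: 490..583")     # CDD cd00303, retropepsin_like

print(span_length(*gene3d), span_length(*cdd), contains(gene3d, cdd))
```

Under that assumption, the CDD retropepsin_like hit falls inside the GENE3D acid protease region, consistent with both sources describing the same aspartic peptidase domain.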

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0035150.1 | Lag0035150.1 | mRNA