Lag0032111 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0032111
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionReverse transcriptase
Locationchr11: 25041045 .. 25043225 (+)
RNA-Seq ExpressionLag0032111
SyntenyLag0032111
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGTAGTAGTAAAGATTTAATTTTAGCACCATTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTAGAAGGGAGAATAGAGAAAACTTTCAAATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAACTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATATAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTTGACATTTGTGGGACGGTAAAAATAAATGGAGTTTCTGATGATGATATTCGCTTACGCTTATTTCCTTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAATCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGTTGTTCGAGGCCTGGGAGCGATTTAAAGAGTTGTTGAGGAAGTGCCCTCCGCATGGATATCCCGACTGGCTTCAGGTTCAATTTTTTATAATGGTTTAACTCCTAGTACAAAAACGATTGTTGATGCAGCTGCAGGTGGGATTCTGTTGTCCAAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTGCACCTAAAAAGATTTCTGCTGGAGTGTTTGAGGTTGACAAGGTAAGTGCACTCCAGGCCCAGATAACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAAATTCTCTTATGCAAATACTAAGAATGTTCTTAACCCCCCTGGTTTTGCCCCGCAAACTCAAGAAAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACATTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAGGGTTGTGAGCACTATGAATAAAGGTAAGGCCCTAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAAGGCAATCATTGTGCACCAGGAGGAAGCTGACGAGGAGCCTGAATCTGAGGACTATGACACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAAGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTTATGAAGGAGTGGTCAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAGGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTTGTTCCTTGTAGTTTTGGTACTTATTCTTTCAGAACATTATGTGATTTAGGTGCTAGCATTAATATTATTCCTATGTCCTTGTGTAAAAAGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTCATCAATCTGTGGTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGTAAATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTATCATATTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAGGAACGAAAAAGAAATTTTTAAAGCAGTGGAAGACTCTAAAGATGAAGTGCTTTTCATGGGTTACAAGAAAGGTGCAAGAAAAAGCACCTCTGTTGGATTCACAGAACAAAAGCCTCCTTGA

mRNA sequence

ATGCGTAGTAGTAAAGATTTAATTTTAGCACCATTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTAGAAGGGAGAATAGAGAAAACTTTCAAATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAACTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATATAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTTGACATTTGATAAAGCACGAGATTGGTTGCAATCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGTTGTTCGAGGCCTGGGAGCGATTTAAAGAGTTGTTGAGGAAGTGCCCTCCGCATGGATATCCCGACTGGCTTCAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTGCACCTAAAAAGATTTCTGCTGGAGTGTTTGAGGTTGACAAGGTAAGTGCACTCCAGGCCCAGATAACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAAATTCTCTTATGCAAATACTAAGAATGTTCTTAACCCCCCTGGTTTTGCCCCGCAAACTCAAGAAAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACATTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAGGGTTGTGAGCACTATGAATAAAGGTAAGGCCCTAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAAGGCAATCATTGTGCACCAGGAGGAAGCTGACGAGGAGCCTGAATCTGAGGACTATGACACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAAGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTTATGAAGGAGTGGTCAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAGGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTTGTTCCTTGTAGTTTTGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTCATCAATCTGTGGTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGTAAATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTATCATATTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAGGAACGAAAAAGAAATTTTTAAAGCAGTGGAAGACTCTAAAGATGAAGTGCTTTTCATGGGTTACAAGAAAGGTGCAAGAAAAAGCACCTCTGTTGGATTCACAGAACAAAAGCCTCCTTGA

Coding sequence (CDS)

ATGCGTAGTAGTAAAGATTTAATTTTAGCACCATTGGATCCCGAGATAGAAAGAACCATCCATAGGCTTAGAAGGGAGAATAGAGAAAACTTTCAAATGGCTGACCAAAATCCACCTGAGGAGCCTAGGCCTATTAGAGATTACTTTCAGCCCGTGTTTCAGGGGCAACAATCGGGGATTGTCTATGCCCCGATTAATGCCAACAACTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATATAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTTTTGACATTTGATAAAGCACGAGATTGGTTGCAATCTATTACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAGCAGTTGTTCGAGGCCTGGGAGCGATTTAAAGAGTTGTTGAGGAAGTGCCCTCCGCATGGATATCCCGACTGGCTTCAGACCGTGGAAAATGCTCGCATACTTCTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCTGCACCTAAAAAGATTTCTGCTGGAGTGTTTGAGGTTGACAAGGTAAGTGCACTCCAGGCCCAGATAACCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCTCAATCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATATAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAAATTCTCTTATGCAAATACTAAGAATGTTCTTAACCCCCCTGGTTTTGCCCCGCAAACTCAAGAAAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAATTAGAGGAGGCAGTCATTGCCATCAACTCAACAGTGAATGGCCACATTGCAGCTATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAGGGTTGTGAGCACTATGAATAAAGGTAAGGCCCTAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAAGGCAATCATTGTGCACCAGGAGGAAGCTGACGAGGAGCCTGAATCTGAGGACTATGACACGCCTACAGGGGAAGCTGAGGAGGACACATCATCTGATGAGGCTGAAAAGCCTAACCCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAGGAAAAGAAAAAGAAAAAGAAGAAAAAGAACAATCAAGTTCAGTTTGATAAATTTATGAATGCTTTTATGAATCTGAATATTAATATTCCTTTTGCAGAGGCATTAGAGATGCCCCAATACAACAGGTTTATGAAGGAGTGGTCAGCAAAGAAGCGAAAGGAAAAGAAGGTTGACACTGTATATCTTGCTTCCACATGCAGCACCAGAGTACAACAGAAGGTACCTGAAAAAGTAGCAGATCCAGGGAGTTTTTTGTTCCTTGTAGTTTTGTTAGACATAGGTGAGATTAAATCTACTCCTGTAAAGCTCCAATTGGCTCATCAATCTGTGGTTAGACCAGTTGGCATTGTAGAAAATGTTTTAATCAGAGTAGGTAAATTTTTCCTCCCTATTGACTTGTATGTTATGGACATGATAGAAAATCCTTCAATGCCTATCATATTAGGAAGACCATTCCTCGCTACTGGGCGAGTGATTATTGATATTGAGCGCAGGGAGCTCACTATTAGAGTCAGGAACGAAAAAGAAATTTTTAAAGCAGTGGAAGACTCTAAAGATGAAGTGCTTTTCATGGGTTACAAGAAAGGTGCAAGAAAAAGCACCTCTGTTGGATTCACAGAACAAAAGCCTCCTTGA

Protein sequence

MRSSKDLILAPLDPEIERTIHRLRRENRENFQMADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTFDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQTVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIESAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLNPPGFAPQTQENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALEMPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFLFLVVLLDIGEIKSTPVKLQLAHQSVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELTIRVRNEKEIFKAVEDSKDEVLFMGYKKGARKSTSVGFTEQKPP
Homology
BLAST of Lag0032111 vs. NCBI nr
Match: KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])

HSP 1 Score: 453.4 bits (1165), Expect = 3.4e-123
Identity = 287/721 (39.81%), Postives = 404/721 (56.03%), Query Frame = 0

Query: 1   MRSSKDLILAPLDPEIERTIHRLRRENRENFQMADQNPPEEPRPIRDYFQPVFQGQQSGI 60
           MR ++   + P+DPEIERT+  LRR   +   MA+++    PR ++DY +PV  G  S I
Sbjct: 1   MRRARSRDIIPVDPEIERTLRSLRR--NKILAMAEEDREVLPRTLKDYVRPVVNGNYSSI 60

Query: 61  VYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF----------------- 120
           +  PINANNFELK  LI M +   + GSP +DPN HL  FL                   
Sbjct: 61  MRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRLR 120

Query: 121 -------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQL 180
                  DKAR WLQS+ PGSI +W  + + FL KFFPPAKT +LR+EIG F+Q   E L
Sbjct: 121 LFPFSLRDKARGWLQSLQPGSIVSWQDMAERFLAKFFPPAKTAQLRSEIGQFKQNDFESL 180

Query: 181 FEAWERFKELLRKCPPHGYPDWLQ---------------------------TVENARILL 240
           +EAWER+K+L+R+CP HG PDWLQ                           T E A  LL
Sbjct: 181 YEAWERYKDLIRRCPQHGLPDWLQVQMFYNGLNGQTRTIVDAASGGTLMSKTAEGATALL 240

Query: 241 EDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIE--S 300
           E+MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ+ +L++     +     QS E  +
Sbjct: 241 EEMASNNYQWPTERTLAKKV-AGIHDLEPIAALSAQVATLSHQISALTTQRIPQSTEYLA 300

Query: 301 AAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVL---NP 360
           + ++     E + EQVQYV+N N   Y  +  P +YHP  RNHE  SY NTKNVL   +P
Sbjct: 301 STSMIVPSNEASQEQVQYVNNRN-YNYRGNPMPNYYHPGLRNHENLSYGNTKNVLQPQHP 360

Query: 361 PGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQL 420
           PGF  Q  E K  LED + +F+ E++ R  K +  +  I +  +   AAIKNIE Q+GQL
Sbjct: 361 PGFDSQPSERKMSLEDAMVSFVQETNARFKKTDSRLDNIETHCSNMGAAIKNIEVQIGQL 420

Query: 421 VRVVSTMNKGKALAEQEKTQMEYCKAIIVHQ-EEADEEPESEDYDTPT----GEAEEDTS 480
              ++   +G   +  E    E CKAI +   +E +  P  E   TPT    G+++    
Sbjct: 421 ATTINAQQRGAFPSNTEVNPKEQCKAITLRSGKEIERSPLKESKSTPTAVNIGQSKNKVE 480

Query: 481 SDEAEKPNPE-------------PPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNL 540
            DE      E             PPI +P L  P+  +K+K  K    QF KF++ F  +
Sbjct: 481 EDEIVNDTLEETDFAPTISFPDNPPILAPPLPYPQRFQKQKLDK----QFSKFLDIFKKI 540

Query: 541 NINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSF 600
           +INIPFA+ALE MP Y +F+K+  +KKR+ ++ +TV L+  CS  +Q+K+P+K+ DPGSF
Sbjct: 541 HINIPFADALEQMPNYVKFLKDIISKKRRLEEFETVKLSEECSAILQKKLPQKLKDPGSF 600

Query: 601 ---------LFLVVLLDIG-----------------EIKSTPVKLQLAHQSVVRPVGIVE 620
                     F  VL D+G                 E+K T + LQLA +S+  P GI+E
Sbjct: 601 TLPCTIGDSFFDKVLCDLGASINLMPLSVCRKLGLEEMKPTTISLQLADRSIKYPRGIIE 660

BLAST of Lag0032111 vs. NCBI nr
Match: KAG7947748.1 (hypothetical protein I3843_14G109500 [Carya illinoinensis])

HSP 1 Score: 428.3 bits (1100), Expect = 1.2e-115
Identity = 272/695 (39.14%), Postives = 386/695 (55.54%), Query Frame = 0

Query: 33  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 92
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 93  PNSHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAF 152
           PN HL  FL                          DKAR WLQS+ PGSI +W  + + F
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 153 LKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ-------- 212
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CP HG PDWLQ        
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 213 -------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSA 272
                              T E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 273 LQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 332
           L AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300

Query: 333 PTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKL 392
           P +YHP  RNHE  SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K 
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 360

Query: 393 EEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQ- 452
           +  +  I +  +   A +KN+E Q+GQL   ++   +G   +  E    E CKAI +   
Sbjct: 361 DSRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSG 420

Query: 453 EEADEEPESEDYDTPTGE---------AEEDTSSDEAEKPN--------PEPPIPSPTLM 512
           +E +  P  E   TPT            EE+  +D  E+ +          PPI +P L 
Sbjct: 421 KEIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPPLP 480

Query: 513 VPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKK 572
            P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+  +KKR+ ++
Sbjct: 481 YPQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEE 540

Query: 573 VDTVYLASTCSTRVQQKVPEKVADPGSF---------LFLVVLLD--------------- 626
            +TV L+  CS  +Q+K+P+K+ DPGSF          F  VL D               
Sbjct: 541 FETVKLSEECSAILQKKLPQKLKDPGSFTLPCTIGDSFFDRVLCDLGASINLMPFSVCRK 600

BLAST of Lag0032111 vs. NCBI nr
Match: KAG6734747.1 (hypothetical protein I3842_01G285500 [Carya illinoinensis])

HSP 1 Score: 426.4 bits (1095), Expect = 4.4e-115
Identity = 269/695 (38.71%), Postives = 384/695 (55.25%), Query Frame = 0

Query: 33  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 92
           MA+++    PR ++DY +PV  G  S I+  PINANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 93  PNSHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAF 152
           PN HL  FL                          DKAR WLQS+ PGSI +W  + + F
Sbjct: 61  PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120

Query: 153 LKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ-------- 212
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R+CP HG PDWLQ        
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180

Query: 213 -------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSA 272
                              T E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240

Query: 273 LQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 332
           L AQ+ +L++     +     QS E  ++ ++     E + EQVQYV+N N   Y  +  
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRN-YNYRGNPM 300

Query: 333 PTHYHPNNRNHEKFSYANTKNVL---NPPGFAPQTQENK-KLEDLVGAFIAESSNRTTKL 392
           P +YHP  RNHE  SY NTKNVL   +PPGF  Q  E K  LED + +F+ E++ R  K 
Sbjct: 301 PNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKKT 360

Query: 393 EEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQ- 452
           +  +  I +  +   A +KN+E Q+GQL   ++   +G   +  E    E CKAI +   
Sbjct: 361 DSRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRSG 420

Query: 453 EEADEEPESEDYDTPTGE---------AEEDTSSDEAEKPN--------PEPPIPSPTLM 512
           +E +  P  E   TPT            EE+  +D  E+ +          PPI +P L 
Sbjct: 421 KEIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPPLP 480

Query: 513 VPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKK 572
            P+  +K+K  K    QF KF++ F  ++INIPFA+ALE MP Y +F+K+  +KKR+ ++
Sbjct: 481 YPQRFQKQKLDK----QFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEE 540

Query: 573 VDTVYLASTCSTRVQQKVPEKVADPGSF--------------------------LFLVVL 626
            +TV L+  CS  +Q+K+P+K+ DP SF                           F+   
Sbjct: 541 FETVKLSEECSAILQKKLPQKLKDPESFTLPCTIGDSFFDRVLCDLGASINLMPFFVCRK 600

BLAST of Lag0032111 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 421.4 bits (1082), Expect = 1.4e-113
Identity = 268/689 (38.90%), Postives = 373/689 (54.14%), Query Frame = 0

Query: 33  MADQNPPEEPRPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTED 92
           MA+     +PR ++DY +P+     SGI    INANNFELK  LI M +   + GSP +D
Sbjct: 1   MAEGEQNAQPRTLKDYVRPIVNDNYSGIRRQTINANNFELKPALISMVQQAQFSGSPLDD 60

Query: 93  PNSHLKSFLTF------------------------DKARDWLQSITPGSITTWDALVQAF 152
           PN HL  FL                          DKAR WLQS+ PGSIT+W  + + F
Sbjct: 61  PNIHLAMFLEICDTIKMNGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSITSWQDMAEKF 120

Query: 153 LKKFFPPAKTVKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ-------- 212
           L KFFPPAKT +LR+EIG F+Q   E L+EAWER+K+L+R CP HG PDWLQ        
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFRQNDFESLYEAWERYKDLIRCCPQHGLPDWLQVQMFYNGL 180

Query: 213 -------------------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSA 272
                              T E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +A
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATSLLEEMASNNYQWPTERTMAKKV-AGIHELEPFAA 240

Query: 273 LQAQITSLANAFMKFSGTGSAQSIE--SAAALASRPQEETIEQVQYVSNFNSRGYNNSST 332
           L AQ+ SL++     +     Q  E  +A+++     E + EQVQY++N N   Y  +  
Sbjct: 241 LSAQVASLSHQVSALTTQRIPQGAEYVAASSMTVPMNEASQEQVQYINNRN-YNYRGNPM 300

Query: 333 PTHYHPNNRNHEKFSYANTKNVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEE 392
           P +YHP  RNHE FSY NTKNVL  PPGF  Q  E K  LED + +F+ E+     K + 
Sbjct: 301 PNYYHPGLRNHENFSYGNTKNVLQPPPGFDSQPSEKKMSLEDAMVSFVEETKATFKKSDS 360

Query: 393 AVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQ-EE 452
            +  I +  +   A +KN+E Q+GQL   ++   +G   +  E    E CKAI +    E
Sbjct: 361 QLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGTFPSNTEVNPKEQCKAITLRSGRE 420

Query: 453 ADEEPESEDYDTPTG-------------EAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKK 512
            +  P  E   TPT              E  EDT  +    P+   P   P L  P    
Sbjct: 421 IERSPSKETETTPTAPNNGQSKNKVEEEEIVEDTLRETDMPPSISFPDNPPILSTPLPYP 480

Query: 513 KKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYL 572
           ++ +K+    QF KF++ F  ++INIPFA+ALE MP Y +F+K+  +KKR+ ++ +TV L
Sbjct: 481 QRFQKQKLDKQFSKFLDIFKKIHINIPFADALEQMPNYAKFLKDIISKKRRLEEFETVKL 540

Query: 573 ASTCSTRVQQKVPEKVADPGSF---------LFLVVLLD-----------------IGEI 626
           +  CS  +Q+K+P+K+ DPGSF          F  VL D                 +GE+
Sbjct: 541 SEECSAIIQKKLPQKLKDPGSFTLPCTIGNSFFDKVLCDLGASINLMPLSVYRKLGLGEM 600

BLAST of Lag0032111 vs. NCBI nr
Match: XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])

HSP 1 Score: 416.4 bits (1069), Expect = 4.6e-112
Identity = 281/708 (39.69%), Postives = 383/708 (54.10%), Query Frame = 0

Query: 1   MRSSKDLILAPLDPEIERTIHRLRR-ENRENFQMADQ-----NPPEEPRPIRDYFQPVFQ 60
           MR +++L L  +DPE ERT   LR  +  E   MA+Q     N   + R IRDY +PV  
Sbjct: 94  MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153

Query: 61  GQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF----------- 120
              SGI    I A NFELK GLI M +   + G+  EDPN+HL SFL             
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213

Query: 121 -------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQ 180
                        DKA+ W QS+  GSITTWD L Q FL K+FPP+K+ +LR EI  F+Q
Sbjct: 214 DAIRLRLFSFSLRDKAKAWFQSLPYGSITTWDDLAQKFLTKYFPPSKSAQLRGEISQFKQ 273

Query: 181 QYDEQLFEAWERFKELLRKCPPHGYPDWLQ---------------------------TVE 240
              E  +EAWERFK+LLR+CP HG+  W+Q                           T E
Sbjct: 274 LDFEPFYEAWERFKDLLRRCPQHGFQKWVQIEIFYNGLNGQTRTMVDAAAGGILMAKTAE 333

Query: 241 NARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQ 300
            A  LL+D+ATNSYQWPSERS  KK+ AG+ EVD ++AL AQ+ SL N  +  +  G+ Q
Sbjct: 334 AAYALLDDIATNSYQWPSERSGVKKV-AGLHEVDPITALAAQVASLTNQIVMLTTQGNQQ 393

Query: 301 SIESAAALASRPQEETI--EQVQYVS--NFNSRGYNNSSTPTHYHPNNRNHEKFSYANTK 360
           +++S  + +S  QE  +  EQVQY+   N+N RG   ++   HYHP  RNHE  SY N +
Sbjct: 394 NVDSVISTSSSHQETEVANEQVQYIDSRNYNQRGGYQAN---HYHPGLRNHENLSYGNNR 453

Query: 361 NVLN-PPGFAPQTQENK-KLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIE 420
           N L  PPGF  Q  + K  LED++G FI+E+ +R  K E  +  I + V+   A +KN+E
Sbjct: 454 NTLQPPPGFNTQNSDGKPPLEDILGTFISETRSRFNKNELRLDNIETHVSKIGATMKNLE 513

Query: 421 TQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTP------TG 480
            Q+GQL  ++ +  KGK  ++ E    E+C AI +   +  EE + +    P      T 
Sbjct: 514 VQIGQLATLMKSQQKGKFPSDTEVNPREHCNAITLRSGKMVEESKPKKIMVPTPDVIVTD 573

Query: 481 EAEEDTSSDEAE-----KP-----NPEPPIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNA 540
           E + +    EAE     KP        PPI  P L  P+   KKK       QF KF+  
Sbjct: 574 ERQSERQKTEAEGTKIYKPYSISFPDNPPILKPPLPFPQRFMKKKFDD----QFAKFLEV 633

Query: 541 FMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPEKVAD 600
           F  ++INIPFAE L +MP Y +F+KE  + K+K ++ +T+ L   CS  + QK+P K+ D
Sbjct: 634 FKKIHINIPFAETLAQMPNYAKFLKEVMSNKKKLEEFETIKLTEGCSD-ILQKLPHKLKD 693

Query: 601 PGSF--------------------------LFLVVLLDIGEIKSTPVKLQLAHQSVVRPV 603
           PGSF                          L +   L +GE+K T + LQLA +S+  P 
Sbjct: 694 PGSFNIPCNIGGITFDRALCDFGASINLMPLSVFKKLGLGEVKPTTLTLQLADRSITYPK 753

BLAST of Lag0032111 vs. ExPASy TrEMBL
Match: A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)

HSP 1 Score: 361.3 bits (926), Expect = 8.5e-96
Identity = 268/747 (35.88%), Postives = 377/747 (50.47%), Query Frame = 0

Query: 1   MRSSKDLILAPLDPEIERTIHRLRRENRE----NFQMADQN----------PPEEPRPIR 60
           M+   +L L P DP+IERT  R RREN +    N  MA+ N           PE  R +R
Sbjct: 1   MQRRNNLNLVPFDPDIERTFRRHRRENLQVATLNQTMAEDNNNNGNNAINLVPEANRALR 60

Query: 61  DYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCA-YRGSPTEDPNSHLKSFL---- 120
           DY  P+ QG    I    INANNFE+K   IQM +    + G P++DPNSHL +FL    
Sbjct: 61  DYVVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLEICD 120

Query: 121 TF--------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKL 180
           TF                    DKA+ WL S+  GSITTW+ L Q FL KFFPPAKT K+
Sbjct: 121 TFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKTAKM 180

Query: 181 RTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWLQ--------------------- 240
           R +I +F Q   E L+EAWERFKELLR+CP HG PDWLQ                     
Sbjct: 181 RNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDAAAG 240

Query: 241 ------TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFM 300
                    +A  LLE+MA+N+YQWPSERS  +K + G +E+D +  L  Q+ +L+    
Sbjct: 241 GALMSKNAVDAYNLLEEMASNNYQWPSERSGSRK-AVGAYEIDALGTLTTQVAALS---- 300

Query: 301 KFSGTGSAQSIESAAALASRPQEE--------TIEQVQYVSNFNSRGYNNSSTPTHYHPN 360
           K   T    +++++  +     +           E VQ+V NFN R  NN  + T Y+P 
Sbjct: 301 KKLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFN-RQQNNPYSNT-YNPG 360

Query: 361 NRNHEKFSYANTKNVLN-----PPGF----APQTQENK-KLEDLVGAFIAESSNRTTKLE 420
            RNH  FS++N     N     PPGF     PQ  E K +LE+L+  +I+++        
Sbjct: 361 WRNHPNFSWSNNAGPSNPKPIMPPGFQQQARPQIPEKKSQLEELLLQYISKT-------- 420

Query: 421 EAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQE--KTQMEYCKAII--- 480
                 ++ +    A+++N+ETQ+GQL   ++   +G   ++ +      E C+AI    
Sbjct: 421 ------DAIIQSQGASLRNLETQVGQLANSINNRPQGSLPSDTQINPKGKEQCQAITLRS 480

Query: 481 ------VHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKK 540
                 V+Q+  + E E  D +   G  E +    + +    E    S  +  P    ++
Sbjct: 481 GKEIEGVNQKAVESEIEHVDKE---GMCENEIEIQQKDDDKAENQGTSQVIHPPPPFPQR 540

Query: 541 KKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWSAKKRKEKKVDTVYLAS 600
            +K+  + QF KF+N F  L+INIPFAEALE MP Y +F+K+  +KKRK  + +TV+L  
Sbjct: 541 LQKQKLEKQFQKFLNVFKKLHINIPFAEALEQMPSYVKFLKDILSKKRKLGEFETVFLTE 600

Query: 601 TCSTRVQQKVPEKVADPGSFLFLVVL--------------------------LDIGEIKS 626
            CS  +Q K+P K+ DPGSF     +                          L +GE K 
Sbjct: 601 ECSAILQNKLPPKLKDPGSFTIPCTIGNLFFTKALSDLGASINLMPWSIFEKLGLGECKP 660

BLAST of Lag0032111 vs. ExPASy TrEMBL
Match: A0A6P8DD93 (uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453 PE=4 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 1.6e-86
Identity = 258/747 (34.54%), Postives = 361/747 (48.33%), Query Frame = 0

Query: 1   MRSSKDLILAPLDPEIERTIHRLRRENREN-----FQMADQNPPEE----PRPIRDYFQP 60
           MR S+   L PLDPEIERT+HRLRRENR        +MAD +   +     R +RDY  P
Sbjct: 1   MRRSRSAELLPLDPEIERTLHRLRRENRRREELQVVEMADDDINRQIQGAARALRDYAVP 60

Query: 61  VFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF-------- 120
              G  S I    I ANNFELK  LIQM +   + G P E P+ H+  FL +        
Sbjct: 61  TIMG--SAIRRPTIPANNFELKPALIQMVQSNQFGGYPNESPDEHIAGFLQYCNTVKMNN 120

Query: 121 ----------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGT 180
                           DKAR W  S+   SITTW  L   FL++FFPPA+T +LR EI  
Sbjct: 121 VTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPARTARLRNEITN 180

Query: 181 FQQQYDEQLFEAWERFKELLRKCPPHGYPDWL---------------------------Q 240
           F +   E L+EAWERFKE +RKCP HG PD L                           +
Sbjct: 181 FTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVDAAAGGALMGK 240

Query: 241 TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTG 300
             + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  QI++L     K +   
Sbjct: 241 NYDEASALIEEMASSAHNWQNERS--KSRVASVNDMDTIANLTTQISALTTQVSKLTSAH 300

Query: 301 SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYH 360
           S  + + A   L S P                 EQV +V+NF   N   Y+N+     Y+
Sbjct: 301 SFNTNQVAFCELCSGPHSTLECMSGNPSASPNGEQVNFVNNFQRSNQGPYSNT-----YN 360

Query: 361 PNNRNHEKFSYANTKNVLN-PPGF--------APQTQENKKLEDLVGAFIAESSNRTTKL 420
           P  RNH  FS+ N  N L  PPGF        AP  Q   ++E+L+ +++ ++       
Sbjct: 361 PGWRNHPNFSWRNENNALKPPPGFQKQGPAQNAPPQQSQSRMEELMLSYMQKT------- 420

Query: 421 EEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKT-------QMEYCK 480
                  ++ +    A I+N+E Q+ Q+ + +S    G   +  E+         +   K
Sbjct: 421 -------DTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTEENPKGVNAIMLRSGK 480

Query: 481 AIIVHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKK 540
            + +   +A  + ES + D    + EE        KP   PP+P P         ++ K+
Sbjct: 481 ELEIVNRKAQTQEESPEKDKGKQKVEEPRQKSLGVKPY-VPPVPFP---------RRLKQ 540

Query: 541 KNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCS 600
           +    QF KF++ F  L INIPFAEAL +MP Y RFMK+   KKRK    + V L   CS
Sbjct: 541 QQLDAQFAKFLDVFKKLQINIPFAEALQQMPSYARFMKDLLTKKRKFDGSEPVMLTGECS 600

Query: 601 TRVQQ---KVPEKVADPGSFL---------FLVVLLD-----------------IGEIKS 626
             +Q+    +P K  D GSF          F  VL+D                 +GE K 
Sbjct: 601 MILQKDLPNLPRKQRDQGSFTVPCTIGNFHFENVLIDSGASINLMPLSIFRKLGLGECKK 660

BLAST of Lag0032111 vs. ExPASy TrEMBL
Match: A0A6P8DKJ2 (uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231 PE=4 SV=1)

HSP 1 Score: 328.2 bits (840), Expect = 7.9e-86
Identity = 257/747 (34.40%), Postives = 360/747 (48.19%), Query Frame = 0

Query: 1   MRSSKDLILAPLDPEIERTIHRLRRENREN-----FQMADQNPPEE----PRPIRDYFQP 60
           MR S+   L PLDPEIERT+HRLRRENR        +MAD +   +     R +RDY  P
Sbjct: 107 MRRSRSAELLPLDPEIERTLHRLRRENRRREELQVVEMADDDINRQIQGAARALRDYAVP 166

Query: 61  VFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLTF-------- 120
              G  S I    I ANNFELK  LIQM +   + G P E P+ H+  FL +        
Sbjct: 167 TIMG--SAIRRPTIPANNFELKPALIQMVQSNQFGGYPNESPDEHIAGFLQYCNTVKMNN 226

Query: 121 ----------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGT 180
                           DKAR W  S+   SITTW  L   FL++FFPPA+T +LR EI  
Sbjct: 227 VTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPARTARLRNEITN 286

Query: 181 FQQQYDEQLFEAWERFKELLRKCPPHGYPDWL---------------------------Q 240
           F +   E L+EAWERFKE +RKCP HG PD L                           +
Sbjct: 287 FTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVDAAAGGALMGK 346

Query: 241 TVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTG 300
             + A  L+E+MA++++ W +ERS  K   A V ++D ++ L  QI++L     K +   
Sbjct: 347 NYDEASALIEEMASSAHNWQNERS--KSRVASVNDMDTIANLTTQISALTTQVSKLTSAH 406

Query: 301 SAQSIESA-AALASRPQ------------EETIEQVQYVSNF---NSRGYNNSSTPTHYH 360
           S  + + A   L S P                 EQV +V+NF   N   Y+N+     Y+
Sbjct: 407 SFNTNQVAFCELCSGPHSTLECMSGNPSASPNGEQVNFVNNFQRSNQGPYSNT-----YN 466

Query: 361 PNNRNHEKFSYANTKNVLN-PPGF--------APQTQENKKLEDLVGAFIAESSNRTTKL 420
           P  RNH  FS+ N  N L  PPGF        AP  Q   ++E+L+ +++ ++       
Sbjct: 467 PGWRNHPNFSWRNENNALKPPPGFQKQGPAQNAPPQQSQSRMEELMLSYMQKT------- 526

Query: 421 EEAVIAINSTVNGHIAAIKNIETQLGQLVRVVSTMNKGKALAEQEKT-------QMEYCK 480
                  ++ +    A I+N+E Q+ Q+ + +S    G   +  E+         +   K
Sbjct: 527 -------DTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTEENPKGVNAIMLRSGK 586

Query: 481 AIIVHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKK 540
            + +   +A  + ES + D    + EE        KP   PP+P P          + K+
Sbjct: 587 ELEIVNRKAQTQEESPEKDKGKQKVEEPRRKSLGVKPY-VPPVPFP---------GRLKQ 646

Query: 541 KNNQVQFDKFMNAFMNLNINIPFAEAL-EMPQYNRFMKEWSAKKRKEKKVDTVYLASTCS 600
           +    QF KF++ F  L INIPFAEAL +MP Y RFMK+   KKRK    + V L   CS
Sbjct: 647 QQLDAQFAKFLDVFKKLQINIPFAEALQQMPSYARFMKDLLTKKRKFDGSEPVMLTGECS 706

Query: 601 TRVQQ---KVPEKVADPGSFL---------FLVVLLD-----------------IGEIKS 626
             +Q+    +P K  D GSF          F  VL+D                 +GE K 
Sbjct: 707 MILQKDLPNLPRKQRDQGSFTVPCTIGNFHFENVLIDSGASINLMPLSIFRKLGLGECKK 766

BLAST of Lag0032111 vs. ExPASy TrEMBL
Match: A0A6P6XAQ1 (Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1)

HSP 1 Score: 321.6 bits (823), Expect = 7.4e-84
Identity = 233/668 (34.88%), Postives = 334/668 (50.00%), Query Frame = 0

Query: 43  RPIRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLT 102
           R +RD+  P  QG Q+ IV   +NANNFE+K  LIQM +   Y G+ TEDPNSHL +FL 
Sbjct: 9   RILRDFALPGAQGSQTSIVRPTVNANNFEIKPSLIQMVQQSQYGGNATEDPNSHLSTFLE 68

Query: 103 F------------------------DKARDWLQSITPGSITTWDALVQAFLKKFFPPAKT 162
                                    DKA+ WLQS  P + TTWD L +AFL KFFPP KT
Sbjct: 69  ICDTIKFNGVSEDAIKLRLFPFSLRDKAKVWLQSHPPNTFTTWDELAKAFLNKFFPPGKT 128

Query: 163 VKLRTEIGTFQQQYDEQLFEAWERFKELLRKCPPHGYPDWL------------------- 222
            KLR +I +F QQ  E L+EAWER++EL R+CP HG PDWL                   
Sbjct: 129 AKLRMDITSFSQQEGETLYEAWERYRELQRRCPHHGLPDWLVVQTFYNGLTYPTKTHVDA 188

Query: 223 --------QTVENARILLEDMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLAN 282
                   +T E A+ L+E+MA N+YQW +ER   ++ +AG+ EVD ++ L A++ ++  
Sbjct: 189 AAGGALMGKTAEEAQQLIEEMAANNYQWANERGNSRR-TAGMLEVDTLNMLSAKMDNVVK 248

Query: 283 AFMKFSGTGSAQSIESAAALASRPQEE-----TIEQVQYVSNFNSRGYNNSSTPTHYHPN 342
              +  G+ S Q +  A+        +     + EQVQY++N+N    NN  + T Y+P 
Sbjct: 249 MLNRQVGSSSNQGVVVASCTICGGDHDDFMCSSSEQVQYLNNYNRPPQNNPYSNT-YNPG 308

Query: 343 NRNHEKFSY---ANTKNVLNPPGFAPQ--TQENKKLEDLVGAFIAESSN-RTTKLEEAVI 402
            RNH  F +    N +  +NPPGF  +    E+K   +L    +A +SN +  KL  A  
Sbjct: 309 WRNHPNFGWKDQGNQQRPVNPPGFQQKQTLHESKPAWELAIEKLANASNDKIEKLASATT 368

Query: 403 AINSTVNGHIAAI----KNIETQLGQLVRVVSTMNKGKALAEQEKTQMEYCKAIIVHQEE 462
                + G +  +    +N+E QLGQ+   V+  N+G   ++ E    E+ KAI +   +
Sbjct: 369 QRFERIEGRMDQLTNMYRNVEVQLGQIANAVNNRNQGDLPSKTEVNPREHVKAITLRSGK 428

Query: 463 ADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEPPIPSPTLMVPKEKKKKKKKKNNQVQFD 522
              EP          + E    S+  E                KE+K K+K + N++Q  
Sbjct: 429 ELVEPPVVGSGREFEKRENKKLSELKEG--------------SKEEKGKEKIEENELQ-- 488

Query: 523 KFMNAFMNLNINIPFAEALEMPQYNRFMKEWSAKKRKEKKVDTVYLASTCSTRVQQKVPE 582
                 M     IP      +P Y +F+KE   KKRK    +T+ L   CS  +Q K+P 
Sbjct: 489 ------MEDATPIP----PPIPSYAKFLKEIMTKKRKLVDSETIALTEECSAIIQNKLPP 548

Query: 583 KVADPGSFL---------FLVVLLDIG-----------------EIKSTPVKLQLAHQSV 619
           K+ DPGSF          F   L D+G                 E+K T + LQLA +S+
Sbjct: 549 KLKDPGSFTVPCTIGNVEFSKALCDLGASVSLIPLTVARQLGLKELKRTNISLQLADRSI 608

BLAST of Lag0032111 vs. ExPASy TrEMBL
Match: A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)

HSP 1 Score: 320.5 bits (820), Expect = 1.7e-83
Identity = 224/610 (36.72%), Postives = 313/610 (51.31%), Query Frame = 0

Query: 45  IRDYFQPVFQGQQSGIVYAPINANNFELKTGLIQMARDCAYRGSPTEDPNSHLKSFL--- 104
           IRDY QP F     GI+  PINANN ELK GLIQM R+  +RG+ TEDPN+HL  FL   
Sbjct: 27  IRDYCQPNFP-NHVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLTIFLDVC 86

Query: 105 ---TFDKARDWLQSITPGSITTWD-ALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDEQLF 164
                +   D    +    ++  D  +VQAFL  FFPPAKT +LRTEI +F++   EQLF
Sbjct: 87  GTVKMNGVIDDAIRLRLFPLSLQDKEMVQAFLTNFFPPAKTTQLRTEIRSFRKYDYEQLF 146

Query: 165 EAWERFKELLRKCPPHGYPDWLQ---------------------------TVENARILLE 224
           E WER+KELLRKCP HG  +WLQ                           T ENA ILL+
Sbjct: 147 EVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRTILDAAAGGTLLSRTPENAYILLK 206

Query: 225 DMATNSYQWPSERSAPKKISAGVFEVDKVSALQAQITSLANAFMKFSGTGSAQSIESAAA 284
           DMA NS+QWPSERS  KK+ AG++E+D++S+L+AQ+ +L NA  K SG G++ S E  AA
Sbjct: 207 DMADNSFQWPSERSNAKKV-AGMYEIDELSSLKAQVQALTNAVSKLSGPGTSHSNELVAA 266

Query: 285 LASRP-QEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHEKFSYANTKNVLNPPGFAP 344
             +    E TIEQ Q+ S                HP                        
Sbjct: 267 TDTYSYYEPTIEQAQFTS----------------HP------------------------ 326

Query: 345 QTQENKKLEDLVGAFIAESSNRTTKLEEAVIAINSTVNGHIAAIKNIETQLGQLVRVVST 404
             ++   LEDL+GAFI E  +R +++E  V  +   + G+  +IKN+E Q+GQ+   ++T
Sbjct: 327 -AEKKSSLEDLLGAFINECRSRASRIENQVEGMEVKLEGNTTSIKNMEVQIGQIAPTLNT 386

Query: 405 MNKGKALAEQEKTQMEYCKAIIVHQEEADEEPESEDYDTPTGEAEEDTSSDEAEKPNPEP 464
           M KGK  ++ E    E+CKA+ +   +  +EPE +  + P    EE  + +E  K     
Sbjct: 387 MQKGKFPSDIEVKPREHCKAVTLRSGKELQEPEKKKMEEPVITTEERENKEEVVKE---- 446

Query: 465 PIPSPTLMVPKEKKKKKKKKNNQVQFDKFMNAFMNLNINIPFAEALE-MPQYNRFMKEWS 524
              +P L   K          N + + +                ALE MP Y RFMK+  
Sbjct: 447 --ATPALQADKPTSSIVSSPPNSLPYPQ---------------HALEQMPNYVRFMKDIM 506

Query: 525 AKKRKEKKVDTVYLASTCSTRVQQKVPEKVADPGSFLFLVVLLDIGEIKSTPVKLQLAHQ 584
             KRK +  +TV L   CS  +Q+K+P+K+ DPGSF    +   I           +   
Sbjct: 507 TGKRKLEAYETVNLTEECSAILQRKLPQKLKDPGSF---TIPCTISSSSFNKALCDICAS 566

Query: 585 SVVRPVGIVENVLIRVGKFFLPIDLYVMDMIENPSMPIILGRPFLATGRVIIDIERRELT 619
             + P+G++E+VL++V +   P D  V+   E+  +PIILGR FLATG  +ID++   LT
Sbjct: 567 INLMPLGVIEDVLVKVDRLIFPADFVVLXXEEDSEIPIILGRXFLATGXALIDVQLGXLT 569

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7990634.13.4e-12339.81hypothetical protein I3843_02G035100 [Carya illinoinensis][more]
KAG7947748.11.2e-11539.14hypothetical protein I3843_14G109500 [Carya illinoinensis][more]
KAG6734747.14.4e-11538.71hypothetical protein I3842_01G285500 [Carya illinoinensis][more]
XP_023874613.11.4e-11338.90uncharacterized protein LOC111987139 [Quercus suber][more]
XP_022843226.14.6e-11239.69uncharacterized protein LOC111366761 [Olea europaea var. sylvestris][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J0ZX648.5e-9635.88LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ... [more]
A0A6P8DD931.6e-8634.54uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453... [more]
A0A6P8DKJ27.9e-8634.40uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231... [more]
A0A6P6XAQ17.4e-8434.88Reverse transcriptase OS=Coffea arabica OX=13443 GN=LOC113740608 PE=4 SV=1[more]
A0A6J1DU191.7e-8336.72uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 99..171
e-value: 1.4E-13
score: 50.9
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 527..619
e-value: 1.9E-14
score: 55.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 397..421
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 393..450
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 97..174
coord: 434..618
NoneNo IPR availablePANTHERPTHR24559:SF334SUBFAMILY NOT NAMEDcoord: 97..174
coord: 434..618
NoneNo IPR availableCDDcd00303retropepsin_likecoord: 520..594
e-value: 3.36392E-9
score: 52.3388

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0032111.1Lag0032111.1mRNA