Lag0030986 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0030986
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: protein LOW PSII ACCUMULATION 1, chloroplastic
Location: chr11: 3575919 .. 3580572 (-)
RNA-Seq Expression: Lag0030986
Synteny: Lag0030986
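The Location field packs chromosome, start, end, and strand into one string. The sketch below shows one way to split it and derive the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(text):
    """Split a string such as 'chr11: 3575919 .. 3580572 (-)' into its parts."""
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("chr11: 3575919 .. 3580572 (-)")
print(chrom, strand, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` needs to change.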
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTATGGTTTCTCTTCCTCTGTACCACCAGCTGCTCAGCCTCTCAAACCCCAAATCAAAAACCATTCTCAGGCCGCGGCTACCGAGCTCCACATCCACCCTCTTCAACTCCCAAAGGAATTTCCATCTCTCTATTGTGTATTGCTCTTCCACTTCTCAGTCCCCTGAAGCTAATGTCGAATCTGCAGAGTCCTGCGTCAATCTCGGCCTCCAGCTCTTTTCTAAAGGACGGGTGTGCGTTCTTTTCCTTTCTATTTGATTCGTTTTTCTCTTTCCAGTTTGTTTGTTGTGTGTTTTCTTGCTTTCTGCGAAATGGGTATTCTTTAATTCGTTTCAACTTGTTAGATAATTTGGGCATTTTAGTTGAATTTTCTTAGCATGTTCTTGAATGGTCACTTTTGGTACATGATATGTTATGACTGATACCGTAAATCAATGAAATAAGAATGAAAGGGCGAAGAAAACAACACACAAATTTACTTAGTTCACTAACGATGTGTTAGCTACATCCACGGACAGAGGGAGAGAACAATCTTATTAGTGAGGAGTAAAACATATACAAATTACCGAGATGGTGTATATATGGCACCACTCTTAACCCTAACATAATAGGTCCAAAATAAAAGTCCCAAACGGAAAATCATAGCTAATTGTTCATCTCACGTTGATCCAGAATGAGACTTGCAATGCCAATTAAGAACTCATCCGAGACCCCCGTACTTGGTTATTTGGAGCGCTTCATATCAAATTCAAGGCATATCAACATGATAAGTAAGAACCGAGTGAGACTTGCAATGCCAATTAAGAAATCGTCCAAAAGAAGAAGAAAAAGGTTCAATTATGAGTTCAGTCCAACTTTCAGGATTGAGTCTATTCAGTCCATATGCTTTTAAAAAGCTTCTAACAGGTCCAAACTTTCGTGGTTTGGTCTGTTTAGTCCATGAAAGTTTAAAGACCTACTGTAACCATTAAGTATAGCCACCAATTTGAGAGGAAATTCAAATTTATCTAATAGAAAATACGTCAGCTGCCCGAGCACTATTTGATCCATCTTTGCATTAAGTATGTGAGAGAGAAGAAATAAGGGAAAATGTTAGGACTGAAGAAAGACAGTGATGAAATTACAGAAGGAGAAAGGAGAAGAGTCTAAGGATTTTCATTTGTAGGCATTGTGGATTTTGTACATCAGCATGGAAACAAGATCAGTTTCCTTTATGATATGATAACAACTCACATCTTTACCCACCCACATTGTTGAAACAATCATTTCTAATTTGGTCCTTCAATTATTATTATTATTTTTTTATCTATACGCATCAATGTCATTCTGTTCTTTTCTAGTTTCTTTGTTTTTGCCGGACACATCCATGGGCCGTGGCCTTAAAGCATCTGTTAGGAATAGCTTTTTTGACAACTACATCTCATGGTCTAAATAGTTTTTAGTATCCCAATGACACCATAAACTTGGATACTAACTCAGTTATCATTATATTTTCGTCAACTCTTGCAATCTGATGAGATGCAATCTTGAATCACAGAGCCTTTCAAGCCTCTCTATTCTTGTGGATCATAGATACTAGTGAACATTATGAAAGTTTAGTTTCATTCATAGTTTTGTTTAATACTCAGAGATTGATTTGTTTGAAGGGATCAATGAATACAGTCATGTTGTGTTTCTGTGAAACAGCATCTTCTTTCTTCATTGATATTTCTTTTCCAGTTCTGTTCCTCAATTATTTTGAAAGAAATCTGTGTGTGTGTAGACACTTCATAGAACAAAAAAATCTTTCAGGTCAAAGAAGCTTTAGTCCAGTTTGAAGCAGCATTGAATATGAATCCCAACCCAATGGAGGCCCAAGCTGCTTTGTACAATAAAGCATGTTGTCATGCCTATCGGTATGCCTAAAGGTCTCTATCTTTTACTTATTAACGTAGCTTCTATTTCTTAAAATTTTTTCTCATGTTTGAAGAATTGTTCATCTCACGTTGATCCAGAAT
CATTGAAGAATTCAATATAATGGACTTAGTTTAATGCCACCTTTAAGTTGCTTGTAGACTCTTTTTCCTTTTTAGCTAGAGTTCTGATTTCGCAAAGATGATGACTTGTGACTTTGTGGATATTTTGAGGTAGTGGGGAAGGGAAGAAAGCTGCTGACTGTCTGCGTCTTGCATTAAGAGAATATAACCTGAAGTTTGGCACCATTCTGAATGATCCTGACTTGGCCTCATTCAGAGCCCTTCCTGAATTCAAGGAATTGCAAGAAGAGGTTCGTCACTAAATGGCCACTCTCTTCCTCCTGTCATCTTCAACTTATGAAAGTTCTCAATAACCGCCTTGAATATTAAAATATTTTAATGTGAATAGATGACTGTTTAATTAACAGATCTGCAATTACTCATTTTATTATATCTCTAAATAATTTGATGGACATTCAGGCTAGGATAGGTGGAGAGGATATAGGATATGGTTTTCGAAGAGATCTTAAACTCATTAGTGAAGTCCAAGCACCTTTTCGTGGGGTGCGGAAATTCTTTTATGTGGCACTATCTGCAGCAGCTGGAATATCGTTGTTGTTTACTCTACCCAGGTTATTTCGTGCTATTCAAGGTGGTGATGAAGCTCCTGATGTTTGGGAAACTGCTGGAAATTTAGCTGTTAATTTTGGAGGTACATCTGAACTTTTTATGAACTTCCATGGCTCGGTGAAGGGATATGAATAAAAAGTTACATGTTCACTACAATAAGCTAGCTGAAATACTGACTTTTCAATCAGCATTGAACAGGTATTGTTGTTCTTGTGGCATTGTTCTTATGGGACAACAAGAAAGAAGAGGAACAGCTTGCACAAATTTCAAGAAATGAAACATTATCGAGGTTGCCTCTTCGTCTTTCTACCAATCGGATTGTTGAACTTGTACAGCTTCGAGATACTGTAAGACCGGTGAGTATGATTATTCATCTTACCCCTTGACGACTCCATCAATAATTGGTAAGTCCAACTTTCATATTCTAGGGTCAATCACTCTTTCTCCAGTTCTGGTTATTTCAATTGATTTGCCCGTTTTGCCTTCATGCCCTAACTCATTGGTGCCTTGCCCTTGTCCATTTTTGCCTGAATCAACATTTCAGTTATCAATTGTGCTGATTCTTCTTGCTTCTATAACAATTTCCCTACCCCTTCTACTAAGGCTGCCAGTGCTGTTCGATCGATAAAATCCTCAGGACTTCCTTTGATAATATTCCTAATCATGTCTTTATTCCTTCGAGTTGATTATCCATCATGTAAATAATGCTTCAGAAGACAAATGGAAGACAATAAACTATTTTAGGAGACAAATTCAGAGAGGAATGGGGAGGGTTTATTGGTACAAGTATATTAATGGTGGGAGTTTCTGGAATAGGTTGTTTTTTACAGCCTCTTTTCTACCTTCATTGACCTAGAATGTTTTGGATAATCTATCCCTGATATTACCCAAAGAAAACATTTCTTCTCTAAGATTTTTTCTTTTCTTCTCATTCTGCCTCTAATCACCCAACCCTCTGTGTCCTCCTCCATTGAATTTCATGTTTCCAACAATCAACCAAAGTTTTGTGTTACCTTTCTGGAAAATGGAAATACATGATACTGCCTGTAGTTATGTACATTTTTACATTCCTTGAATCGATTTACCTTGCATTTAAGGTCATTTTAGCTGGGAAAAAGGAGACAGTTTCTTCAGCCATCCAAAAAGCAGAAAGGTTCAGAACTGAGCTCCTTAGACGAGGCGTTCTCGTAGTTCCTGTCGTATGGGGCGAAGGGAGGGAACCCCAAATAGAAAAGAAAGGGTTTGGTGCTCCAGCCACTGCAGCTGCTGCTCTGCCATCTATTGGGGTAAGATCAATAGATTTTCATCCTGTCCTACATTTGTCCAAGGATATATCTTGAATTATGCTGCTGTTTGCTCATGTCCCAATTAGAAAATTCGAAAGTTAATTGGTTCGTTCTCTTGTTCTCACG
GAACCCTTGTAGGAAGATTTTGAGAAACGAGCTCAGTCTATAACTGCAAAATCGAAGTTGAAAGCCGAAATTCGATTCAGGGCTGAGGTTATATCACCAGCAGAATGGGAAAGGTAAGTGTATACCTGTCTCCTTGGAATGCTTGCTTCCTATCATCTAATTTTCTTGTTAGTTTATTGGCTGAGCTTGCTCTCCATCATCTAATCTACTGATATAGAACTAACAAGAAGAAAGATCATAAGCGATTCTATCTCCGAGAACATGTTATTTTTCTCTTTCGAAGTTGAATCGTCTCGCCAATATTTTGAAAGCAATCATTCGGTTTAATCCATATGAGGTGGATATGATATTCTGTTTTTTTCACTCTGGATTCTGGCTCATTTTTGTTTCCTGAATTAGTTGGATAAGGGACCAGCAGAAGTCTGAAGGGGTTACTCCTGGGGAGGATGTCTACATTATATTGCGATTGGACGGTCGAGTTCGAAGATCAGGGAGAGTAAGTTCATATCATGAAAGTTGTAGCGTAATACCTTCTTCGTCATGTAGAAGTGTTCCTCATTTTTGGCAACGAATTTGCAGGGGATGCCTGACTGGCCAAAAATTATTGAAGAGCTACCACCAATGGAAGCTCTTCTAAGCAAGCTAGAAAGATGA

mRNA sequence

ATGGCTATGGTTTCTCTTCCTCTGTACCACCAGCTGCTCAGCCTCTCAAACCCCAAATCAAAAACCATTCTCAGGCCGCGGCTACCGAGCTCCACATCCACCCTCTTCAACTCCCAAAGGAATTTCCATCTCTCTATTGTGTATTGCTCTTCCACTTCTCAGTCCCCTGAAGCTAATGTCGAATCTGCAGAGTCCTGCGTCAATCTCGGCCTCCAGCTCTTTTCTAAAGGACGGGTCAAAGAAGCTTTAGTCCAGTTTGAAGCAGCATTGAATATGAATCCCAACCCAATGGAGGCCCAAGCTGCTTTGTACAATAAAGCATGTTGTCATGCCTATCGTGGGGAAGGGAAGAAAGCTGCTGACTGTCTGCGTCTTGCATTAAGAGAATATAACCTGAAGTTTGGCACCATTCTGAATGATCCTGACTTGGCCTCATTCAGAGCCCTTCCTGAATTCAAGGAATTGCAAGAAGAGGCTAGGATAGGTGGAGAGGATATAGGATATGGTTTTCGAAGAGATCTTAAACTCATTAGTGAAGTCCAAGCACCTTTTCGTGGGGTGCGGAAATTCTTTTATGTGGCACTATCTGCAGCAGCTGGAATATCGTTGTTGTTTACTCTACCCAGGTTATTTCGTGCTATTCAAGGTGGTGATGAAGCTCCTGATGTTTGGGAAACTGCTGGAAATTTAGCTGTTAATTTTGGAGGTATTGTTGTTCTTGTGGCATTGTTCTTATGGGACAACAAGAAAGAAGAGGAACAGCTTGCACAAATTTCAAGAAATGAAACATTATCGAGGTTGCCTCTTCGTCTTTCTACCAATCGGATTGTTGAACTTGTACAGCTTCGAGATACTGTAAGACCGGTCATTTTAGCTGGGAAAAAGGAGACAGTTTCTTCAGCCATCCAAAAAGCAGAAAGGTTCAGAACTGAGCTCCTTAGACGAGGCGTTCTCGTAGTTCCTGTCGTATGGGGCGAAGGGAGGGAACCCCAAATAGAAAAGAAAGGGTTTGGTGCTCCAGCCACTGCAGCTGCTGCTCTGCCATCTATTGGGGAAGATTTTGAGAAACGAGCTCAGTCTATAACTGCAAAATCGAAGTTGAAAGCCGAAATTCGATTCAGGGCTGAGGTTATATCACCAGCAGAATGGGAAAGTTGGATAAGGGACCAGCAGAAGTCTGAAGGGGTTACTCCTGGGGAGGATGTCTACATTATATTGCGATTGGACGGTCGAGTTCGAAGATCAGGGAGAGGGATGCCTGACTGGCCAAAAATTATTGAAGAGCTACCACCAATGGAAGCTCTTCTAAGCAAGCTAGAAAGATGA

Coding sequence (CDS)

ATGGCTATGGTTTCTCTTCCTCTGTACCACCAGCTGCTCAGCCTCTCAAACCCCAAATCAAAAACCATTCTCAGGCCGCGGCTACCGAGCTCCACATCCACCCTCTTCAACTCCCAAAGGAATTTCCATCTCTCTATTGTGTATTGCTCTTCCACTTCTCAGTCCCCTGAAGCTAATGTCGAATCTGCAGAGTCCTGCGTCAATCTCGGCCTCCAGCTCTTTTCTAAAGGACGGGTCAAAGAAGCTTTAGTCCAGTTTGAAGCAGCATTGAATATGAATCCCAACCCAATGGAGGCCCAAGCTGCTTTGTACAATAAAGCATGTTGTCATGCCTATCGTGGGGAAGGGAAGAAAGCTGCTGACTGTCTGCGTCTTGCATTAAGAGAATATAACCTGAAGTTTGGCACCATTCTGAATGATCCTGACTTGGCCTCATTCAGAGCCCTTCCTGAATTCAAGGAATTGCAAGAAGAGGCTAGGATAGGTGGAGAGGATATAGGATATGGTTTTCGAAGAGATCTTAAACTCATTAGTGAAGTCCAAGCACCTTTTCGTGGGGTGCGGAAATTCTTTTATGTGGCACTATCTGCAGCAGCTGGAATATCGTTGTTGTTTACTCTACCCAGGTTATTTCGTGCTATTCAAGGTGGTGATGAAGCTCCTGATGTTTGGGAAACTGCTGGAAATTTAGCTGTTAATTTTGGAGGTATTGTTGTTCTTGTGGCATTGTTCTTATGGGACAACAAGAAAGAAGAGGAACAGCTTGCACAAATTTCAAGAAATGAAACATTATCGAGGTTGCCTCTTCGTCTTTCTACCAATCGGATTGTTGAACTTGTACAGCTTCGAGATACTGTAAGACCGGTCATTTTAGCTGGGAAAAAGGAGACAGTTTCTTCAGCCATCCAAAAAGCAGAAAGGTTCAGAACTGAGCTCCTTAGACGAGGCGTTCTCGTAGTTCCTGTCGTATGGGGCGAAGGGAGGGAACCCCAAATAGAAAAGAAAGGGTTTGGTGCTCCAGCCACTGCAGCTGCTGCTCTGCCATCTATTGGGGAAGATTTTGAGAAACGAGCTCAGTCTATAACTGCAAAATCGAAGTTGAAAGCCGAAATTCGATTCAGGGCTGAGGTTATATCACCAGCAGAATGGGAAAGTTGGATAAGGGACCAGCAGAAGTCTGAAGGGGTTACTCCTGGGGAGGATGTCTACATTATATTGCGATTGGACGGTCGAGTTCGAAGATCAGGGAGAGGGATGCCTGACTGGCCAAAAATTATTGAAGAGCTACCACCAATGGAAGCTCTTCTAAGCAAGCTAGAAAGATGA

Protein sequence

MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANVESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAADCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEVQAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVLVALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSSAIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQSITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKIIEELPPMEALLSKLER
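The protein sequence above should follow from the coding sequence by codon-wise translation. The sketch below illustrates this on the first 30 and last 21 nucleotides of the CDS, copied from this record. It assumes the standard nuclear genetic code (the record does not name a translation table), and `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption here;
# the record does not say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGGCTATGGTTTCTCTTCCTCTGTACCAC"  # first 10 codons of the CDS
cds_end = "CTAAGCAAGCTAGAAAGATGA"            # last 6 codons plus the TGA stop

print(translate(cds_start))
print(translate(cds_end))
```

Running the same function on the full CDS string should reproduce the listed protein, provided the standard-code assumption holds.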
Homology
BLAST of Lag0030986 vs. NCBI nr
Match: XP_022143429.1 (protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 805.1 bits (2078), Expect = 3.1e-229
Identity = 406/441 (92.06%), Positives = 424/441 (96.15%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MA+ +LPLYH LL  SNPKS+T LRPRLP+ST   FN  +NFHLSI +CSSTSQSPEANV
Sbjct: 1   MAVATLPLYHHLLRFSNPKSRTTLRPRLPTST---FNFHKNFHLSIAFCSSTSQSPEANV 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           E+AESCVNLGLQLFSKGRVKEALVQF+AALN++PNP+EAQAA YNKACCHAYRGEGKKAA
Sbjct: 61  ETAESCVNLGLQLFSKGRVKEALVQFDAALNLDPNPLEAQAAFYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVRKFFYVALSAAAGISLLFT+PRLFRAIQGGDEAPDVWETAGNLAVN GGI+VL
Sbjct: 181 QAPFRGVRKFFYVALSAAAGISLLFTIPRLFRAIQGGDEAPDVWETAGNLAVNMGGIIVL 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQS 360
           AIQKAERFRTELLRRGVL+VPVVWGEGREPQIEK+GFGAP  A A LPSIGEDFEKRAQS
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVVWGEGREPQIEKRGFGAPTNATAVLPSIGEDFEKRAQS 360

Query: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420
           ITAKSKLKAEIRFRAEV+SPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP
Sbjct: 361 ITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420

Query: 421 DWPKIIEELPPMEALLSKLER 442
           DWPKIIEELPPMEALLSKLER
Sbjct: 421 DWPKIIEELPPMEALLSKLER 438
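The percentages in each HSP summary line are plain ratios of matching columns to alignment length. As a minimal sketch, the figures reported for the hit above can be recomputed as follows; `percent` is a hypothetical helper, and the counts are copied from the summary line of this alignment.

```python
def percent(matches, aligned_length):
    """Matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP summary line for XP_022143429.1.
identical_columns = 406
positive_columns = 424
alignment_length = 441

print(percent(identical_columns, alignment_length))
print(percent(positive_columns, alignment_length))
```

The same calculation applies to every other hit in this section, using that hit's own counts and alignment length.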

BLAST of Lag0030986 vs. NCBI nr
Match: XP_008454363.1 (PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 [Cucumis melo] >KAA0044367.1 protein LOW PSII ACCUMULATION 1 [Cucumis melo var. makuwa] >TYK29495.1 protein LOW PSII ACCUMULATION 1 [Cucumis melo var. makuwa])

HSP 1 Score: 791.6 bits (2043), Expect = 3.5e-225
Identity = 403/441 (91.38%), Positives = 420/441 (95.24%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MAM +LPL+H L +LSNPKS TILRPRLP+       SQR FHLSI+ CSSTSQSPEAN+
Sbjct: 1   MAMATLPLFHHLPTLSNPKSPTILRPRLPT-------SQRTFHLSILSCSSTSQSPEANL 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           +SAESCVNLGLQLFSKGRVKEALVQFEAALNM+PNPMEAQAA YNKACCHAYRGEGKKAA
Sbjct: 61  QSAESCVNLGLQLFSKGRVKEALVQFEAALNMDPNPMEAQAAFYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVR+FFYVALSAAAGISLLF +PRLFRAIQGGD APDVWETAGNLAVN GGI+V 
Sbjct: 181 QAPFRGVRRFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNVGGIIVF 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQS 360
           AIQKAERFRTELLRRGVL+VPV+WGEGREPQIEKKGFGAPA AA ALPSIGEDFEKRAQS
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVIWGEGREPQIEKKGFGAPAAAATALPSIGEDFEKRAQS 360

Query: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420
           ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGR+RRSGRGMP
Sbjct: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRIRRSGRGMP 420

Query: 421 DWPKIIEELPPMEALLSKLER 442
           DW KIIEELPPMEALLSKLE+
Sbjct: 421 DWQKIIEELPPMEALLSKLEK 434

BLAST of Lag0030986 vs. NCBI nr
Match: XP_004152258.2 (protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis sativus] >KGN52798.1 hypothetical protein Csa_014443 [Cucumis sativus])

HSP 1 Score: 783.5 bits (2022), Expect = 9.7e-223
Identity = 403/442 (91.18%), Positives = 420/442 (95.02%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MAM +LPL+H L +LSNPKS TILRPRLP+       SQR F LSI+ CSSTSQSPEAN+
Sbjct: 1   MAMATLPLFHHLPTLSNPKSLTILRPRLPT-------SQRTFRLSILSCSSTSQSPEANL 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           +SAESCVN GLQLFSKGRVKEALVQFEAALNM+PNPMEAQAALYNKACCHAYRGEGKKAA
Sbjct: 61  QSAESCVNFGLQLFSKGRVKEALVQFEAALNMDPNPMEAQAALYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVRKFFYVALSAAAGISLLF +PRLFRAIQGGD APDVWETAGNLAVN GGI+V 
Sbjct: 181 QAPFRGVRKFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNVGGIIVF 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPAT-AAAALPSIGEDFEKRAQ 360
           AIQKAERFRTELLRRGVL+VPV+WGEGREPQIEKKGFGAP T AAAALPSIGEDFEKRAQ
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVIWGEGREPQIEKKGFGAPTTAAAAALPSIGEDFEKRAQ 360

Query: 361 SITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM 420
           SITAKSKLKAEIRFRAEVISPAEWESWIR+QQ+SEGVTPGEDVYIILRLDGRVRRSGRGM
Sbjct: 361 SITAKSKLKAEIRFRAEVISPAEWESWIRNQQESEGVTPGEDVYIILRLDGRVRRSGRGM 420

Query: 421 PDWPKIIEELPPMEALLSKLER 442
           PDW KIIEELPPMEALLSKLE+
Sbjct: 421 PDWQKIIEELPPMEALLSKLEK 435

BLAST of Lag0030986 vs. NCBI nr
Match: XP_038905239.1 (protein LOW PSII ACCUMULATION 1, chloroplastic [Benincasa hispida])

HSP 1 Score: 780.0 bits (2013), Expect = 1.1e-221
Identity = 401/441 (90.93%), Positives = 417/441 (94.56%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MAM +LPL+H LL+ S+PKS TILRPR       L  SQR FH+SI+  SSTSQSPEAN+
Sbjct: 1   MAMATLPLFHHLLTFSSPKSATILRPR-------LLTSQRAFHVSILCFSSTSQSPEANL 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           ESAESCVNLGLQLFSKGRVKEALVQFEAALNM+PNPMEAQAALYNKACCHAYRGEGKKAA
Sbjct: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMDPNPMEAQAALYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVR+FF VALSAAAGISLLF +PRLFRAIQGGD APDVWETAGNLAVN GGIVV 
Sbjct: 181 QAPFRGVRRFFSVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNVGGIVVF 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQL+QISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLSQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQS 360
           AIQKAERFRTELLRRGVL+VPV+WGEGREPQIEKKGFGAPAT AAALPSIGEDFEKRAQS
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVIWGEGREPQIEKKGFGAPATPAAALPSIGEDFEKRAQS 360

Query: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420
           ITAKSKLKAEIRFRAEV+SPAEWESWIRDQQKSE VTPGEDVYIILRLDGRVRRSGRGMP
Sbjct: 361 ITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEEVTPGEDVYIILRLDGRVRRSGRGMP 420

Query: 421 DWPKIIEELPPMEALLSKLER 442
           DW KIIEELPPMEALLSKLER
Sbjct: 421 DWQKIIEELPPMEALLSKLER 434

BLAST of Lag0030986 vs. NCBI nr
Match: XP_022983449.1 (protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita maxima])

HSP 1 Score: 779.2 bits (2011), Expect = 1.8e-221
Identity = 400/442 (90.50%), Positives = 421/442 (95.25%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           + M +LP++HQLL+LSNPKS TILR RLP+S     NSQR FH+SI+ CSSTSQSPE NV
Sbjct: 3   LGMATLPVFHQLLTLSNPKSATILRQRLPTS-----NSQRAFHVSILCCSSTSQSPETNV 62

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           ESAES VNLGLQLFSKGRVKEALVQFEAAL+MNPNPMEAQAALYNKACCHAYRGEGKKAA
Sbjct: 63  ESAESSVNLGLQLFSKGRVKEALVQFEAALDMNPNPMEAQAALYNKACCHAYRGEGKKAA 122

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 123 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 182

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVR+FFYVALSAAAGISLLF LPRLFRAIQGG+EAPDVWET GNLAVN GGIVV 
Sbjct: 183 QAPFRGVRRFFYVALSAAAGISLLFNLPRLFRAIQGGNEAPDVWETVGNLAVNVGGIVVF 242

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNR+VELVQLRDTVRPVILAGKKETVSS
Sbjct: 243 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKETVSS 302

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATA-AAALPSIGEDFEKRAQ 360
           AIQKAERFRTELLRRGVL+VPV+W EGREP++EKKGFGAPA A +AALPSIGEDFEKRAQ
Sbjct: 303 AIQKAERFRTELLRRGVLLVPVIWREGREPRMEKKGFGAPAPAGSAALPSIGEDFEKRAQ 362

Query: 361 SITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM 420
           SITAKSKLKAEIRFRA+VISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM
Sbjct: 363 SITAKSKLKAEIRFRADVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM 422

Query: 421 PDWPKIIEELPPMEALLSKLER 442
           PDW KIIEELPPM+ALLSKLER
Sbjct: 423 PDWQKIIEELPPMDALLSKLER 439

BLAST of Lag0030986 vs. ExPASy Swiss-Prot
Match: Q9SRY4 (Protein LOW PSII ACCUMULATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LPA1 PE=1 SV=1)

HSP 1 Score: 629.0 bits (1621), Expect = 4.0e-179
Identity = 314/426 (73.71%), Positives = 365/426 (85.68%), Query Frame = 0

Query: 25  RPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANVES---------AESCVNLGLQLFS 84
           RP LP   +TLFNS+RN+   +   +S+S SP ++  S         AE CVN GL LF 
Sbjct: 28  RPWLPPGDATLFNSRRNWDSHLFVYASSSSSPSSSPPSPNSPTDDLTAELCVNTGLDLFK 87

Query: 85  KGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAADCLRLALREYNLKFG 144
           +GRVK+ALVQFE AL++ PNP+E+QAA YNKACCHAYRGEGKKA DCLR+ALR+YNLKF 
Sbjct: 88  RGRVKDALVQFETALSLAPNPIESQAAYYNKACCHAYRGEGKKAVDCLRIALRDYNLKFA 147

Query: 145 TILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEVQAPFRGVRKFFYVAL 204
           TILNDPDLASFRALPEFKELQEEAR+GGEDIG  FRRDLKLISEV+APFRGVRKFFY A 
Sbjct: 148 TILNDPDLASFRALPEFKELQEEARLGGEDIGDNFRRDLKLISEVRAPFRGVRKFFYFAF 207

Query: 205 SAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVLVALFLWDNKKEEEQL 264
           +AAAGIS+ FT+PRL +AI+GGD AP++ ET GN A+N GGIVV+V+LFLW+NKKEEEQ+
Sbjct: 208 AAAAGISMFFTVPRLVQAIRGGDGAPNLLETTGNAAINIGGIVVMVSLFLWENKKEEEQM 267

Query: 265 AQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSSAIQKAERFRTELLRR 324
            QI+R+ETLSRLPLRLSTNR+VELVQLRDTVRPVILAGKKETV+ A+QKA+RFRTELLRR
Sbjct: 268 VQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKETVTLAMQKADRFRTELLRR 327

Query: 325 GVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQSITAKSKLKAEIRFRA 384
           GVL+VPVVWGE + P+IEKKGFGA + AA +LPSIGEDF+ RAQS+ A+SKLK EIRF+A
Sbjct: 328 GVLLVPVVWGERKTPEIEKKGFGASSKAATSLPSIGEDFDTRAQSVVAQSKLKGEIRFKA 387

Query: 385 EVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKIIEELPPMEAL 442
           E +SP EWE WIRDQQ SEGV PG+DVYIILRLDGRVRRSGRGMPDW +I +ELPPM+ +
Sbjct: 388 ETVSPGEWERWIRDQQISEGVNPGDDVYIILRLDGRVRRSGRGMPDWAEISKELPPMDDV 447

BLAST of Lag0030986 vs. ExPASy TrEMBL
Match: A0A6J1CPA1 (protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111013307 PE=4 SV=1)

HSP 1 Score: 805.1 bits (2078), Expect = 1.5e-229
Identity = 406/441 (92.06%), Positives = 424/441 (96.15%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MA+ +LPLYH LL  SNPKS+T LRPRLP+ST   FN  +NFHLSI +CSSTSQSPEANV
Sbjct: 1   MAVATLPLYHHLLRFSNPKSRTTLRPRLPTST---FNFHKNFHLSIAFCSSTSQSPEANV 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           E+AESCVNLGLQLFSKGRVKEALVQF+AALN++PNP+EAQAA YNKACCHAYRGEGKKAA
Sbjct: 61  ETAESCVNLGLQLFSKGRVKEALVQFDAALNLDPNPLEAQAAFYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVRKFFYVALSAAAGISLLFT+PRLFRAIQGGDEAPDVWETAGNLAVN GGI+VL
Sbjct: 181 QAPFRGVRKFFYVALSAAAGISLLFTIPRLFRAIQGGDEAPDVWETAGNLAVNMGGIIVL 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQS 360
           AIQKAERFRTELLRRGVL+VPVVWGEGREPQIEK+GFGAP  A A LPSIGEDFEKRAQS
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVVWGEGREPQIEKRGFGAPTNATAVLPSIGEDFEKRAQS 360

Query: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420
           ITAKSKLKAEIRFRAEV+SPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP
Sbjct: 361 ITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420

Query: 421 DWPKIIEELPPMEALLSKLER 442
           DWPKIIEELPPMEALLSKLER
Sbjct: 421 DWPKIIEELPPMEALLSKLER 438

BLAST of Lag0030986 vs. ExPASy TrEMBL
Match: A0A5A7TR76 (Protein LOW PSII ACCUMULATION 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G00940 PE=4 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 1.7e-225
Identity = 403/441 (91.38%), Positives = 420/441 (95.24%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MAM +LPL+H L +LSNPKS TILRPRLP+       SQR FHLSI+ CSSTSQSPEAN+
Sbjct: 1   MAMATLPLFHHLPTLSNPKSPTILRPRLPT-------SQRTFHLSILSCSSTSQSPEANL 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           +SAESCVNLGLQLFSKGRVKEALVQFEAALNM+PNPMEAQAA YNKACCHAYRGEGKKAA
Sbjct: 61  QSAESCVNLGLQLFSKGRVKEALVQFEAALNMDPNPMEAQAAFYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVR+FFYVALSAAAGISLLF +PRLFRAIQGGD APDVWETAGNLAVN GGI+V 
Sbjct: 181 QAPFRGVRRFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNVGGIIVF 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQS 360
           AIQKAERFRTELLRRGVL+VPV+WGEGREPQIEKKGFGAPA AA ALPSIGEDFEKRAQS
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVIWGEGREPQIEKKGFGAPAAAATALPSIGEDFEKRAQS 360

Query: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420
           ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGR+RRSGRGMP
Sbjct: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRIRRSGRGMP 420

Query: 421 DWPKIIEELPPMEALLSKLER 442
           DW KIIEELPPMEALLSKLE+
Sbjct: 421 DWQKIIEELPPMEALLSKLEK 434

BLAST of Lag0030986 vs. ExPASy TrEMBL
Match: A0A1S3BYE8 (protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494784 PE=4 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 1.7e-225
Identity = 403/441 (91.38%), Positives = 420/441 (95.24%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MAM +LPL+H L +LSNPKS TILRPRLP+       SQR FHLSI+ CSSTSQSPEAN+
Sbjct: 1   MAMATLPLFHHLPTLSNPKSPTILRPRLPT-------SQRTFHLSILSCSSTSQSPEANL 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           +SAESCVNLGLQLFSKGRVKEALVQFEAALNM+PNPMEAQAA YNKACCHAYRGEGKKAA
Sbjct: 61  QSAESCVNLGLQLFSKGRVKEALVQFEAALNMDPNPMEAQAAFYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVR+FFYVALSAAAGISLLF +PRLFRAIQGGD APDVWETAGNLAVN GGI+V 
Sbjct: 181 QAPFRGVRRFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNVGGIIVF 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQS 360
           AIQKAERFRTELLRRGVL+VPV+WGEGREPQIEKKGFGAPA AA ALPSIGEDFEKRAQS
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVIWGEGREPQIEKKGFGAPAAAATALPSIGEDFEKRAQS 360

Query: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMP 420
           ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGR+RRSGRGMP
Sbjct: 361 ITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRIRRSGRGMP 420

Query: 421 DWPKIIEELPPMEALLSKLER 442
           DW KIIEELPPMEALLSKLE+
Sbjct: 421 DWQKIIEELPPMEALLSKLEK 434

BLAST of Lag0030986 vs. ExPASy TrEMBL
Match: A0A0A0KT96 (TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G001650 PE=4 SV=1)

HSP 1 Score: 783.5 bits (2022), Expect = 4.7e-223
Identity = 403/442 (91.18%), Positives = 420/442 (95.02%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           MAM +LPL+H L +LSNPKS TILRPRLP+       SQR F LSI+ CSSTSQSPEAN+
Sbjct: 1   MAMATLPLFHHLPTLSNPKSLTILRPRLPT-------SQRTFRLSILSCSSTSQSPEANL 60

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           +SAESCVN GLQLFSKGRVKEALVQFEAALNM+PNPMEAQAALYNKACCHAYRGEGKKAA
Sbjct: 61  QSAESCVNFGLQLFSKGRVKEALVQFEAALNMDPNPMEAQAALYNKACCHAYRGEGKKAA 120

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 121 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 180

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVRKFFYVALSAAAGISLLF +PRLFRAIQGGD APDVWETAGNLAVN GGI+V 
Sbjct: 181 QAPFRGVRKFFYVALSAAAGISLLFNIPRLFRAIQGGDGAPDVWETAGNLAVNVGGIIVF 240

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Sbjct: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPAT-AAAALPSIGEDFEKRAQ 360
           AIQKAERFRTELLRRGVL+VPV+WGEGREPQIEKKGFGAP T AAAALPSIGEDFEKRAQ
Sbjct: 301 AIQKAERFRTELLRRGVLLVPVIWGEGREPQIEKKGFGAPTTAAAAALPSIGEDFEKRAQ 360

Query: 361 SITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM 420
           SITAKSKLKAEIRFRAEVISPAEWESWIR+QQ+SEGVTPGEDVYIILRLDGRVRRSGRGM
Sbjct: 361 SITAKSKLKAEIRFRAEVISPAEWESWIRNQQESEGVTPGEDVYIILRLDGRVRRSGRGM 420

Query: 421 PDWPKIIEELPPMEALLSKLER 442
           PDW KIIEELPPMEALLSKLE+
Sbjct: 421 PDWQKIIEELPPMEALLSKLEK 435

BLAST of Lag0030986 vs. ExPASy TrEMBL
Match: A0A6J1J7F4 (protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111482050 PE=4 SV=1)

HSP 1 Score: 779.2 bits (2011), Expect = 8.8e-222
Identity = 400/442 (90.50%), Positives = 421/442 (95.25%), Query Frame = 0

Query: 1   MAMVSLPLYHQLLSLSNPKSKTILRPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANV 60
           + M +LP++HQLL+LSNPKS TILR RLP+S     NSQR FH+SI+ CSSTSQSPE NV
Sbjct: 3   LGMATLPVFHQLLTLSNPKSATILRQRLPTS-----NSQRAFHVSILCCSSTSQSPETNV 62

Query: 61  ESAESCVNLGLQLFSKGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAA 120
           ESAES VNLGLQLFSKGRVKEALVQFEAAL+MNPNPMEAQAALYNKACCHAYRGEGKKAA
Sbjct: 63  ESAESSVNLGLQLFSKGRVKEALVQFEAALDMNPNPMEAQAALYNKACCHAYRGEGKKAA 122

Query: 121 DCLRLALREYNLKFGTILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEV 180
           DCLR+ALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEV
Sbjct: 123 DCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARMGGEDIGYGFRRDLKLISEV 182

Query: 181 QAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVL 240
           QAPFRGVR+FFYVALSAAAGISLLF LPRLFRAIQGG+EAPDVWET GNLAVN GGIVV 
Sbjct: 183 QAPFRGVRRFFYVALSAAAGISLLFNLPRLFRAIQGGNEAPDVWETVGNLAVNVGGIVVF 242

Query: 241 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS 300
           VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNR+VELVQLRDTVRPVILAGKKETVSS
Sbjct: 243 VALFLWDNKKEEEQLAQISRNETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKETVSS 302

Query: 301 AIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATA-AAALPSIGEDFEKRAQ 360
           AIQKAERFRTELLRRGVL+VPV+W EGREP++EKKGFGAPA A +AALPSIGEDFEKRAQ
Sbjct: 303 AIQKAERFRTELLRRGVLLVPVIWREGREPRMEKKGFGAPAPAGSAALPSIGEDFEKRAQ 362

Query: 361 SITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM 420
           SITAKSKLKAEIRFRA+VISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM
Sbjct: 363 SITAKSKLKAEIRFRADVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGM 422

Query: 421 PDWPKIIEELPPMEALLSKLER 442
           PDW KIIEELPPM+ALLSKLER
Sbjct: 423 PDWQKIIEELPPMDALLSKLER 439

BLAST of Lag0030986 vs. TAIR 10
Match: AT1G02910.1 (tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 629.0 bits (1621), Expect = 2.8e-180
Identity = 314/426 (73.71%), Positives = 365/426 (85.68%), Query Frame = 0

Query: 25  RPRLPSSTSTLFNSQRNFHLSIVYCSSTSQSPEANVES---------AESCVNLGLQLFS 84
           RP LP   +TLFNS+RN+   +   +S+S SP ++  S         AE CVN GL LF 
Sbjct: 28  RPWLPPGDATLFNSRRNWDSHLFVYASSSSSPSSSPPSPNSPTDDLTAELCVNTGLDLFK 87

Query: 85  KGRVKEALVQFEAALNMNPNPMEAQAALYNKACCHAYRGEGKKAADCLRLALREYNLKFG 144
           +GRVK+ALVQFE AL++ PNP+E+QAA YNKACCHAYRGEGKKA DCLR+ALR+YNLKF 
Sbjct: 88  RGRVKDALVQFETALSLAPNPIESQAAYYNKACCHAYRGEGKKAVDCLRIALRDYNLKFA 147

Query: 145 TILNDPDLASFRALPEFKELQEEARIGGEDIGYGFRRDLKLISEVQAPFRGVRKFFYVAL 204
           TILNDPDLASFRALPEFKELQEEAR+GGEDIG  FRRDLKLISEV+APFRGVRKFFY A 
Sbjct: 148 TILNDPDLASFRALPEFKELQEEARLGGEDIGDNFRRDLKLISEVRAPFRGVRKFFYFAF 207

Query: 205 SAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAVNFGGIVVLVALFLWDNKKEEEQL 264
           +AAAGIS+ FT+PRL +AI+GGD AP++ ET GN A+N GGIVV+V+LFLW+NKKEEEQ+
Sbjct: 208 AAAAGISMFFTVPRLVQAIRGGDGAPNLLETTGNAAINIGGIVVMVSLFLWENKKEEEQM 267

Query: 265 AQISRNETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSSAIQKAERFRTELLRR 324
            QI+R+ETLSRLPLRLSTNR+VELVQLRDTVRPVILAGKKETV+ A+QKA+RFRTELLRR
Sbjct: 268 VQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGKKETVTLAMQKADRFRTELLRR 327

Query: 325 GVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIGEDFEKRAQSITAKSKLKAEIRFRA 384
           GVL+VPVVWGE + P+IEKKGFGA + AA +LPSIGEDF+ RAQS+ A+SKLK EIRF+A
Sbjct: 328 GVLLVPVVWGERKTPEIEKKGFGASSKAATSLPSIGEDFDTRAQSVVAQSKLKGEIRFKA 387

Query: 385 EVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKIIEELPPMEAL 442
           E +SP EWE WIRDQQ SEGV PG+DVYIILRLDGRVRRSGRGMPDW +I +ELPPM+ +
Sbjct: 388 ETVSPGEWERWIRDQQISEGVNPGDDVYIILRLDGRVRRSGRGMPDWAEISKELPPMDDV 447

BLAST of Lag0030986 vs. TAIR 10
Match: AT4G28740.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF3493 (InterPro:IPR021883); BEST Arabidopsis thaliana protein match is: tetratricopeptide repeat (TPR)-containing protein (TAIR:AT1G02910.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 137.1 bits (344), Expect = 3.4e-32
Identity = 84/268 (31.34%), Positives = 135/268 (50.37%), Query Frame = 0

Query: 173 DLKLISEVQAPFRGVRKFFYVALSAAAGISLLFTLPRLFRAIQGGDEAPDVWETAGNLAV 232
           D ++ SEV +PFR VR FFY+A  A+  +  L    RL  A+     + +V E    L V
Sbjct: 94  DARIRSEVLSPFRSVRMFFYLAFIASGSLGGLIATSRLIGALANPARSGEVLEIVKGLGV 153

Query: 233 NFGGIVVLVALFLWDNKKEEEQLAQISRNETLSRLPLRL-STNRIVELVQLRDTVRPVIL 292
           + G   +   L+  +NK +  Q+A++SR E L +L +R+   N+++ +  LR   R VI 
Sbjct: 154 DIGAASLFAFLYFNENKTKNAQMARLSREENLGKLKMRVEENNKVISVGDLRGVARLVIC 213

Query: 293 AGKKETVSSAIQKAERFRTELLRRGVLVVPVVWGEGREPQIEKKGFGAPATAAAALPSIG 352
           AG  E +  A ++++ +   L+ RGV+VV     +G  P +E   F     A        
Sbjct: 214 AGPAEFIEEAFKRSKEYTQGLVERGVVVVAYA-TDGNSPVLE---FDETDIA-------D 273

Query: 353 EDFEKRAQSITAKSKLKAEIRFRAEVISPAEWESWIRDQQKSEGVTPGEDVYIILRLDGR 412
           E+  +R + +           +R   +   EWE W+ +Q+K   V+    VY+ LRLDGR
Sbjct: 274 EEMSQRRKKL-----------WRVTPVFVPEWEKWLNEQKKLANVSSDSPVYLSLRLDGR 333

Query: 413 VRRSGRGMPDWPKIIEELPPMEALLSKL 440
           VR SG G P W   + +LPP++ + + L
Sbjct: 334 VRASGVGYPPWQAFVAQLPPVKGMWTGL 339
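
The identity and positives percentages in the HSP header above are plain ratios of matching (or conservatively substituted) positions to the alignment length. A minimal sketch that reproduces them from the reported counts follows; the helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the HSP header: Identity = 84/268, Positives = 135/268.
identity_pct = percent(84, 268)
positives_pct = percent(135, 268)

print(f"Identity:  {identity_pct:.2f}%")
print(f"Positives: {positives_pct:.2f}%")
```

The same helper can be applied to any other HSP on this page, as long as the count and the alignment length are read from the same header line.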

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022143429.1 | 3.1e-229 | 92.06 | protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 [Momordica charantia]
XP_008454363.1 | 3.5e-225 | 91.38 | PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 [Cucumis me...
XP_004152258.2 | 9.7e-223 | 91.18 | protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis sativus] >KGN52798.1 hyp...
XP_038905239.1 | 1.1e-221 | 90.93 | protein LOW PSII ACCUMULATION 1, chloroplastic [Benincasa hispida]
XP_022983449.1 | 1.8e-221 | 90.50 | protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita maxima]

Match Name | E-value | Identity (%) | Description
Q9SRY4 | 4.0e-179 | 73.71 | Protein LOW PSII ACCUMULATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 G...

Match Name | E-value | Identity (%) | Description
A0A6J1CPA1 | 1.5e-229 | 92.06 | protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 OS=Momordica charantia...
A0A5A7TR76 | 1.7e-225 | 91.38 | Protein LOW PSII ACCUMULATION 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3BYE8 | 1.7e-225 | 91.38 | protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 OS=Cucumis melo OX=365...
A0A0A0KT96 | 4.7e-223 | 91.18 | TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G001650 ...
A0A6J1J7F4 | 8.8e-222 | 90.50 | protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LO...

Match Name | E-value | Identity (%) | Description
AT1G02910.1 | 2.8e-180 | 73.71 | tetratricopeptide repeat (TPR)-containing protein
AT4G28740.1 | 3.4e-32 | 31.34 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR019734 | Tetratricopeptide repeat | SMART | SM00028 | tpr_5 | coord: 63..96, e-value: 0.0014, score: 27.8; coord: 100..133, e-value: 350.0, score: 1.4
IPR019734 | Tetratricopeptide repeat | PROSITE | PS50005 | TPR | coord: 63..96, score: 9.9714
IPR021883 | Protein LOW PSII ACCUMULATION 1-like | PFAM | PF11998 | DUF3493 | coord: 171..248, e-value: 1.2E-24, score: 86.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 35..159, e-value: 3.3E-10, score: 42.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 56..129
None | No IPR available | PANTHER | PTHR35498 | PROTEIN LOW PSII ACCUMULATION 1, CHLOROPLASTIC | coord: 36..441
None | No IPR available | PANTHER | PTHR35498:SF4 | PROTEIN LOW PSII ACCUMULATION 1, CHLOROPLASTIC | coord: 36..441
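
The `coord: start..end` values in the InterPro annotations are inclusive residue ranges on the protein, so they can be compared directly with BLAST query ranges. The sketch below, with hypothetical helper names `parse_coord` and `overlap`, checks how much of the Pfam DUF3493 domain (171..248) falls inside the query range of the AT4G28740.1 HSP, taken here as 173..440 from the first and last query coordinates printed in that alignment. This is illustrative arithmetic only, not output from InterProScan or BLAST.

```python
from typing import Tuple

Range = Tuple[int, int]


def parse_coord(text: str) -> Range:
    """Parse an inclusive 'start..end' coordinate string into a (start, end) tuple."""
    start, end = text.split("..")
    return int(start), int(end)


def overlap(a: Range, b: Range) -> int:
    """Number of residues shared by two inclusive ranges (0 if they are disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


duf3493 = parse_coord("171..248")  # Pfam PF11998 coordinates from the table
tpr_smart = parse_coord("63..96")  # first SMART TPR hit from the table
hsp_query = (173, 440)             # assumed query span of the AT4G28740.1 HSP

print("DUF3493 residues covered by HSP:", overlap(duf3493, hsp_query))
print("TPR residues covered by HSP:", overlap(tpr_smart, hsp_query))
```

Under these assumptions the weaker TAIR hit aligns over the DUF3493 region but not over the N-terminal TPR repeat.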

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0030986.1 | Lag0030986.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0010270 | photosystem II oxygen evolving complex assembly
cellular_component | GO:0016021 | integral component of membrane
molecular_function | GO:0005515 | protein binding