HG10012085 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10012085
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein LOW PSII ACCUMULATION 1, chloroplastic
Location: Chr01: 17335999 .. 17339870 (+)
RNA-Seq Expression: HG10012085
Synteny: HG10012085
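
As a quick check on the Location field, the span of the gene on Chr01 can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive; the record does not state its convention, so treat that as an assumption. `span_length` is a hypothetical helper name, not part of any database API.

```python
import re

def span_length(location: str) -> int:
    """Return the inclusive length of a 'Chr: start .. end (strand)' location string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no coordinate pair found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1

location = "Chr01: 17335999 .. 17339870 (+)"
print(span_length(location))  # 3872
```

Under that assumption the gene, introns included, spans 3872 bp on the plus strand.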
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGGCGGCAAATTCTGGGTATCGTGGCGGCATCGAGACCATCCCTCGAAATTTGCTTCCATTGACAAACTGTGAAACTACCTCACGTGTCAATGCTCAGTTTCTCGAGTCGTTAACCATGGTGGTTTCCCTTTCCATGCCCAACATTTCCTCCTGCAACAACATTGCTCGTCCTCTTCATAACACTCCTGATTTCAGATCCTGCCGCCGTTTCCGCTGCCATGTTCATCGGAAGGCGGCCTCCTTTAATTCTAGCCCTAGTCTACCAGATTCGCTTCGTTTTCTTCATTACGCCAGACGAAATTTAGCTAACTTCACTTGCTCGGCTGCTGATAAACCGGAAATCAGGTTCTTCTCTCTTTCTCTGCCGTGGTAGTGTCTACAATGGATCGATAATGGCTGATGATTTTAATCCTATTATGAATTTTACATCTAAGCACTTGACATTTGATTGATTTTAGGTAGTTTCATGTTATTTTTCTTTTGTTCATGAGATTATGTGTTTGGATGGCTGAAAACCAACATATATTTGTGGAAATTAGGAGGGTCATAACTGGGTTCTTATATGAATTTTGTATTTGTGTATTTTCCAATTGAGTATTTAAGTTGGTGACTTCGTATTTTATGCCATGAGCTACAGTGCTTGTCTGCGTTTCTCTTTCTCTTATTTGTTATTTGATCACAATTTGATTTGTGTATGAACAAATCTATAGATAGTTAAAAGGCTCATTTGGTACCCGTTTAGTTTTATGTTTTTGAAAATTAAGCTTATTTCCTCTCAATTTGTTACCATGGTTATCATCCTTCTTAAGTAAAATAGTTTAATTCTTTGCCGAATTCTAAAAACAAAAACAAGTTTTTATTTTTTTATAGAATCAACAACTTTCATTGATTAAAAATGAAAGAATGTGAGGGCATACAAAAAAAAAACGAAGCCCACAAAAAACACTCCTTAAAGAAAGGGAGACCAAGTAAAATGTTTCCTAGAGAGTAGTTACAAAAGGTGTTCGAAATCGAAGCTCAAAGGGAGATAAAACTTAAGCAAAGACCACACATCATGCTAGGAGTCCTCTCCAACCCTTTAAACACTTTGTTAATTCTCTCTCCCCATAAAACCCATAACAAAGCGCACACCCCAGCATCCCAAAGAAAGCGGTCCTTCTCCCTACAAAGCGAATTAAGGAGGAACTCCTCAATCAAGCCACTGACACTCCTCTGAGGAGCCAATTGAAACCTAACTCCTAAAAGAAGCAATTCCACAATGAACGCACAACTCCAGCGCAGATGATCCAGGTCTTCCTCCGCCTTCCAACAAAGAATACAACAGAAGGGACCAACTAATGAAGGTAACTTCCTCAAAAGTCGATCCATCAAGTTCGCACGACTAAGTAGTAAAACTTGCCAAGAAAAGAAATTAACTTTCTTAGGAATCTTTATCCTCCACAAGGCAACAAAATGGGACTTATGGTAGAGTGATCCAACAAATATCGAAAGAAAGACTTACAAGAGAAACCATATAATGGGTTCAGACTCCAAACACGAATGTCCTTTCTCCCAAGCCTAAAACCAAACTCTCCAATCAAAGACAAAAGTATAACAAACCTCCTTTATCGATCAATGAGCGACAAAAGCCAAACGAAAAAGAAACATAGCTCTCCAACCAAACAAGAAAATCAGATATAAAACAATGCTTAAGGGAAGACAAATGATAAAGACGTGGAAAACCGAGCAGAGGGGTCTATCCTCCACCCACTGATCCTCCCAAAAATACGTTTTTCCCCTGCCCCACCATGCAACGAACAAGATGGGAAAAAGAAGGGAGCTCAGAAGAAATGTCCTTCCATAAATTTCAAGAAATGCCTTTAACCTCACTCGCAAAAATCAAAGGGATGAGGGTCATGCTTGCTAACAACTCTATGCCATAAGGAATCAGACTCAGGGGGAAACGCCACATCCACTTGGCCAACAAGGCTTTGTTCCAAAACCTTAAAACCTTAA
ATTAGTTTTTGTTTTTGACATTTATGCTTGTTCTCCTAATTTCCTTACCATGTTTTTCACCTTTCTTAAAGAATCATTTGAATTCTTAGTCAAGTTTGAAAACAAAAACGAGCTTTTAAAATCTACTTTTTTCGGTCTTCAAAACTTGAATTGGTTTTTGAAAATATTGGTAGAAAGTGAATAGCAAAACAAAGATAAATATAAGTGGAAATAGTGTTTATAATCTTGATTTTTAAGCCTAAATGGTTATCAAACTGGGTCTGCAATTTTCTAATGTGGTGTTTATTTTTTTGGGGGGAGTTGAGTGAGTGAGCCTGGCTGTGTTCTAACCTAGTCTCTTTTTCCATTACGCCTGTTATTCAACCTGAAATATGGCGTTGTGTAACTGTATTTGGTTTTTTCTTAGTTCCACGGCCAAGATAAGAAGTGAAGTTCTTTCTCCATTTCGGTCTGTTCGGATGTTCTTTTATCTCACTTTCATTGCAAGTGGTACATTGGGAGGATTGATAGCAACCACTCAATTGCTTGCTGCATTGGCAAACTCATCAAGAGCTGAAGAAGTCCCTGATATTCTGAAAGGACTTGGAATAGACTTCGGAGCCGTAGCCCTTTTTGCATTTCTTTATTTCAGAGAGAACAATGCAAAAAATGCTCAGTTGGCTAGACTGTCAAGAGAAGAAAGCCTTTCCAATTTAAAGCTTCGAGTGGACCAAAACAAAGTTATTCCCATCAGCACTTTGCGTGGGATTGCTCGTCTGGTAATTTGTGCTGGCCCTGAATCCTTTATCATGGAAGCTTTTAAATCAAGTGAACCTTTCACTGAACGACTTCTAGAACGGGGTGTTTTAGTCGTACCCCTTGCCACAGATGTCAGTACACTGAAGTTTGAGTTTGATGAACGTGAAGAGGTGAAGGATATAACCACCAAAAGGAAAAGACTCTGGCGCTTGACTCCAGTTTACATGACTGAGTGGTCAGCGTAAGTCATTATAATTAACATCCATTTTCACAATTGTTCTCCTTCTCCTTCCCTCATGTAATTTTCCTTTTCTTGTGAACAATAGGTGGTTAGATGAACAGAAGAAGTTGGCTGGAGTCTCCTCCGACTCTCCAGTGTAAGTTGTCCTTGTTGTTAAACTTTGAATTCAAATTGATGAAGTGTAAAATGGTTTTCTTTTTAATGCTTGAACTCCCAATCAAGTTGTGTCGTAGGTATTGATTTTTTTTATTTAAGTACTACTTTGGCCCTATACTTTTAGTTTTGATTCATTTTAGTTGTCTGTTCAAAATGTTCATTTAGCCTCGTAGTTTTAACAAGTGGTCAATTTCTCCCTATACGCAGATGTTTAACACAACTTGTATAAATGAAGGGACCAAAATAGTTATTTTATGTAAGTATAGAGACAAAAATGGACATTTTCAAAGTACATGCACCAAAATGAACAATAGTTGAAATTGCATAAATTGAAAAGAACCAAAGTCGGAAGTATAGAAACTGGAGTAGTATTAGTTTTGTAATGAAATATATTGTTTCATTAATCATCATTCATGAGTTATCCAATTTGGAGTCGCATAGTTGGTAATGTAAACTGCACAGAAAAAAAAGGAAAGGAAAGGAAAGGAAATAAACTTTTTGCTGAATCTAATGCCAAAACTAGTTAATCTTACAGAAGTTAACCAAAGTAGAATTTAAAGCTAAACTTTTTCTATGTTAATGTTTGTTCATGCAGGTATCTATCTCTGCGAATGGATGGCCGTGTTCGTGGTAGCGGTGTTGGCTATCCTCCATGGAATGCTCTTGTAGCACAATTACCACCTGTAAAAGGACTATGGTCAGGTCTTCTAGATGGGATGGACGGGAGAGTTCTTTGA

mRNA sequence

ATGGTGGCGGCAAATTCTGGGTATCGTGGCGGCATCGAGACCATCCCTCGAAATTTGCTTCCATTGACAAACTGTGAAACTACCTCACGTGTCAATGCTCAGTTTCTCGAGTCGTTAACCATGGTGGTTTCCCTTTCCATGCCCAACATTTCCTCCTGCAACAACATTGCTCGTCCTCTTCATAACACTCCTGATTTCAGATCCTGCCGCCGTTTCCGCTGCCATGTTCATCGGAAGGCGGCCTCCTTTAATTCTAGCCCTAGTCTACCAGATTCGCTTCGTTTTCTTCATTACGCCAGACGAAATTTAGCTAACTTCACTTGCTCGGCTGCTGATAAACCGGAAATCAGTTCCACGGCCAAGATAAGAAGTGAAGTTCTTTCTCCATTTCGGTCTGTTCGGATGTTCTTTTATCTCACTTTCATTGCAAGTGGTACATTGGGAGGATTGATAGCAACCACTCAATTGCTTGCTGCATTGGCAAACTCATCAAGAGCTGAAGAAGTCCCTGATATTCTGAAAGGACTTGGAATAGACTTCGGAGCCGTAGCCCTTTTTGCATTTCTTTATTTCAGAGAGAACAATGCAAAAAATGCTCAGTTGGCTAGACTGTCAAGAGAAGAAAGCCTTTCCAATTTAAAGCTTCGAGTGGACCAAAACAAAGTTATTCCCATCAGCACTTTGCGTGGGATTGCTCGTCTGGTAATTTGTGCTGGCCCTGAATCCTTTATCATGGAAGCTTTTAAATCAAGTGAACCTTTCACTGAACGACTTCTAGAACGGGGTGTTTTAGTCGTACCCCTTGCCACAGATGTCAGTACACTGAAGTTTGAGTTTGATGAACGTGAAGAGGTGAAGGATATAACCACCAAAAGGAAAAGACTCTGGCGCTTGACTCCAGTTTACATGACTGAGTGGTCAGCGTGGTTAGATGAACAGAAGAAGTTGGCTGGAGTCTCCTCCGACTCTCCAGTGTATCTATCTCTGCGAATGGATGGCCGTGTTCGTGGTAGCGGTGTTGGCTATCCTCCATGGAATGCTCTTGTAGCACAATTACCACCTGTAAAAGGACTATGGTCAGGTCTTCTAGATGGGATGGACGGGAGAGTTCTTTGA

Coding sequence (CDS)

ATGGTGGCGGCAAATTCTGGGTATCGTGGCGGCATCGAGACCATCCCTCGAAATTTGCTTCCATTGACAAACTGTGAAACTACCTCACGTGTCAATGCTCAGTTTCTCGAGTCGTTAACCATGGTGGTTTCCCTTTCCATGCCCAACATTTCCTCCTGCAACAACATTGCTCGTCCTCTTCATAACACTCCTGATTTCAGATCCTGCCGCCGTTTCCGCTGCCATGTTCATCGGAAGGCGGCCTCCTTTAATTCTAGCCCTAGTCTACCAGATTCGCTTCGTTTTCTTCATTACGCCAGACGAAATTTAGCTAACTTCACTTGCTCGGCTGCTGATAAACCGGAAATCAGTTCCACGGCCAAGATAAGAAGTGAAGTTCTTTCTCCATTTCGGTCTGTTCGGATGTTCTTTTATCTCACTTTCATTGCAAGTGGTACATTGGGAGGATTGATAGCAACCACTCAATTGCTTGCTGCATTGGCAAACTCATCAAGAGCTGAAGAAGTCCCTGATATTCTGAAAGGACTTGGAATAGACTTCGGAGCCGTAGCCCTTTTTGCATTTCTTTATTTCAGAGAGAACAATGCAAAAAATGCTCAGTTGGCTAGACTGTCAAGAGAAGAAAGCCTTTCCAATTTAAAGCTTCGAGTGGACCAAAACAAAGTTATTCCCATCAGCACTTTGCGTGGGATTGCTCGTCTGGTAATTTGTGCTGGCCCTGAATCCTTTATCATGGAAGCTTTTAAATCAAGTGAACCTTTCACTGAACGACTTCTAGAACGGGGTGTTTTAGTCGTACCCCTTGCCACAGATGTCAGTACACTGAAGTTTGAGTTTGATGAACGTGAAGAGGTGAAGGATATAACCACCAAAAGGAAAAGACTCTGGCGCTTGACTCCAGTTTACATGACTGAGTGGTCAGCGTGGTTAGATGAACAGAAGAAGTTGGCTGGAGTCTCCTCCGACTCTCCAGTGTATCTATCTCTGCGAATGGATGGCCGTGTTCGTGGTAGCGGTGTTGGCTATCCTCCATGGAATGCTCTTGTAGCACAATTACCACCTGTAAAAGGACTATGGTCAGGTCTTCTAGATGGGATGGACGGGAGAGTTCTTTGA

Protein sequence

MVAANSGYRGGIETIPRNLLPLTNCETTSRVNAQFLESLTMVVSLSMPNISSCNNIARPLHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSLRFLHYARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTLKFEFDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
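
The relationship between the CDS and the protein sequence above can be spot-checked by translating a few codons. The following is a minimal sketch using the standard genetic code; `translate` is a hypothetical helper, and the two fragments are the first 33 nt and the last 27 nt of the CDS as listed on this page (the tail is assumed to be in frame, which holds if the CDS length is a multiple of three).

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_head = "ATGGTGGCGGCAAATTCTGGGTATCGTGGCGGC"  # first 33 nt of the CDS
cds_tail = "GATGGGATGGACGGGAGAGTTCTTTGA"        # last 27 nt, ending in the TGA stop
print(translate(cds_head))  # MVAANSGYRGG
print(translate(cds_tail))  # DGMDGRVL
```

Both fragments agree with the start (MVAANSGYRGG) and end (DGMDGRVL) of the protein sequence listed above.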
Homology
BLAST of HG10012085 vs. NCBI nr
Match: XP_038888898.1 (protein LOW PSII ACCUMULATION 1, chloroplastic [Benincasa hispida] >XP_038888899.1 protein LOW PSII ACCUMULATION 1, chloroplastic [Benincasa hispida])

HSP 1 Score: 609.0 bits (1569), Expect = 2.7e-170
Identity = 313/333 (93.99%), Positives = 316/333 (94.89%), Query Frame = 0

Query: 41  MVVSLSMPNISSCNNIARPLH--NTPDFRSCRRFRCHVHRKAASFNSSPSLPDSLRFLHY 100
           M VSLS PNISSCNNI RP H   T DF+  RR  CHVHRK  SFNSSP+LPDSL  LHY
Sbjct: 1   MAVSLSTPNISSCNNITRPPHFSRTSDFKFRRRICCHVHRKTLSFNSSPTLPDSLPSLHY 60

Query: 101 ARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLA 160
           ARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLA
Sbjct: 61  ARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLA 120

Query: 161 ALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVD 220
           ALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVD
Sbjct: 121 ALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVD 180

Query: 221 QNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTLKFE 280
           QNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTL FE
Sbjct: 181 QNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTLNFE 240

Query: 281 FDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRGS 340
           FDEREEVKD+TTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRGS
Sbjct: 241 FDEREEVKDMTTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRGS 300

Query: 341 GVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 372
           GVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 301 GVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 333
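
The percentages in each HSP header are the match counts divided by the alignment length. A small sketch, assuming the `Label = matches/length` layout used on this page (`hsp_percentages` is a hypothetical helper, not a BLAST API), recomputes them for the first hit:

```python
import re

def hsp_percentages(line: str) -> dict:
    """Recompute percentages from 'Label = matches/length' fields of an HSP header."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

header = "Identity = 313/333 (93.99%), Positives = 316/333 (94.89%), Query Frame = 0"
print(hsp_percentages(header))  # {'Identity': 93.99, 'Positives': 94.89}
```

The same function can be applied to the header line of any other HSP in this section.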

BLAST of HG10012085 vs. NCBI nr
Match: XP_022963501.1 (protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita moschata] >XP_022963502.1 protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita moschata])

HSP 1 Score: 606.7 bits (1563), Expect = 1.4e-169
Identity = 313/376 (83.24%), Positives = 331/376 (88.03%), Query Frame = 0

Query: 3   AANSGYRGGIETIPRNLLPLTNCETTSRVNAQFLESLTMVVSLSMPNISSCNNIARP--- 62
           AANSGYRG IET  RNLLPL   ETT  +N Q+LE   MVVS+++  I SCNN  RP   
Sbjct: 7   AANSGYRGAIETFLRNLLPLIIWETTLYINCQYLEWSIMVVSMAI--IPSCNNFGRPPHC 66

Query: 63  ----LHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSLRFLHYARRNLANFTCSAADKPE 122
                HN  + + CRR  CHVHR   SFN  PS+ DSLR+L Y RRNLA+ TCSA+DKPE
Sbjct: 67  SRTRSHNARNLKFCRRICCHVHRNPVSFNFGPSVADSLRYLQYRRRNLASVTCSASDKPE 126

Query: 123 ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILKG 182
           ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDIL G
Sbjct: 127 ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILNG 186

Query: 183 LGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISTLRGIARLV 242
           LGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVI ISTLRGIARLV
Sbjct: 187 LGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVITISTLRGIARLV 246

Query: 243 ICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTLKFEFDEREEVKDITTKRKRL 302
           ICAGPESFIMEAFK+SEPFTERLLERGVLV+P ATD S+L FEFDEREE+KDITTKRKRL
Sbjct: 247 ICAGPESFIMEAFKTSEPFTERLLERGVLVIPFATDASSLNFEFDEREEMKDITTKRKRL 306

Query: 303 WRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV 362
           WRLTPVYM++WSAWLD+QKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV
Sbjct: 307 WRLTPVYMSQWSAWLDDQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV 366

Query: 363 KGLWSGLLDGMDGRVL 372
           KGLWSGLLDGMDGRVL
Sbjct: 367 KGLWSGLLDGMDGRVL 380

BLAST of HG10012085 vs. NCBI nr
Match: XP_023554665.1 (protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita pepo subsp. pepo] >XP_023554666.1 protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 603.6 bits (1555), Expect = 1.2e-168
Identity = 311/376 (82.71%), Positives = 330/376 (87.77%), Query Frame = 0

Query: 3   AANSGYRGGIETIPRNLLPLTNCETTSRVNAQFLESLTMVVSLSMPNISSCNNIARP--- 62
           AANSGYRG IET  RNLLPL   ETT  +N ++LE   MVVS+++  I SCNN  RP   
Sbjct: 7   AANSGYRGAIETFLRNLLPLIIWETTLYINCKYLEWSIMVVSMAI--IPSCNNFGRPPHC 66

Query: 63  ----LHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSLRFLHYARRNLANFTCSAADKPE 122
                HN  + + CRR  CHVHR   SFN  PS+ DSLR+L Y RRNLA+ TCSA+DKPE
Sbjct: 67  SRTRSHNARNLKFCRRICCHVHRNRVSFNFGPSVADSLRYLQYRRRNLASVTCSASDKPE 126

Query: 123 ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILKG 182
           ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDIL G
Sbjct: 127 ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILNG 186

Query: 183 LGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISTLRGIARLV 242
           LGIDFGAVA FAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVI ISTLRGIARLV
Sbjct: 187 LGIDFGAVAFFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVITISTLRGIARLV 246

Query: 243 ICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTLKFEFDEREEVKDITTKRKRL 302
           ICAGPESFIMEAFK+SEPFTERLLERGVLV+P ATD S+L FEFDEREE+KDITTKRKRL
Sbjct: 247 ICAGPESFIMEAFKTSEPFTERLLERGVLVIPFATDASSLNFEFDEREEMKDITTKRKRL 306

Query: 303 WRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV 362
           WRLTPVYM++WSAWLD+QKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV
Sbjct: 307 WRLTPVYMSQWSAWLDDQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV 366

Query: 363 KGLWSGLLDGMDGRVL 372
           KGLWSGLLDGMDGRVL
Sbjct: 367 KGLWSGLLDGMDGRVL 380

BLAST of HG10012085 vs. NCBI nr
Match: KAG7011306.1 (Protein LOW PSII ACCUMULATION 1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 585.9 bits (1509), Expect = 2.5e-163
Identity = 311/396 (78.54%), Positives = 330/396 (83.33%), Query Frame = 0

Query: 3   AANSGYRGGIETIPRNLLPLTNCETTSRVNAQFLESLTMVVSLSMPNISSCNNIARP--- 62
           AANSGYRG IET  RNLLPL   ETT  +N Q+LE   MVVS+++  I SCNN  RP   
Sbjct: 19  AANSGYRGAIETFLRNLLPLIIWETTLYINCQYLEWSIMVVSMAI--IPSCNNFGRPPHC 78

Query: 63  ----LHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSLRFLHYARRNLANFTCSAADKPE 122
                HN  + + CRR  CHVHR   SFN  PS+ DSLR+L Y  R+LA+ TCSA+DKPE
Sbjct: 79  SRTRSHNARNLKFCRRICCHVHRNRVSFNFGPSVADSLRYLKY--RDLASVTCSASDKPE 138

Query: 123 I--------------------SSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQ 182
           I                    SSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQ
Sbjct: 139 ISTPVIQYEMRLCVTVSVFFLSSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQ 198

Query: 183 LLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKL 242
           LLAALANSSRAEEVPDIL GLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKL
Sbjct: 199 LLAALANSSRAEEVPDILNGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKL 258

Query: 243 RVDQNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTL 302
           RVDQNKVI ISTLRGIARLVICAGPESFIMEAFK+SEPFTERLLERGVLV+P ATD S+L
Sbjct: 259 RVDQNKVITISTLRGIARLVICAGPESFIMEAFKTSEPFTERLLERGVLVIPFATDASSL 318

Query: 303 KFEFDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRV 362
            FEFDEREE+KDITTKRKRLWRLTPVYM++WSAWLD+QKKLAGVSSDSPVYLSLRMDGRV
Sbjct: 319 NFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDSPVYLSLRMDGRV 378

Query: 363 RGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 372
           RGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 379 RGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 410

BLAST of HG10012085 vs. NCBI nr
Match: XP_004152010.1 (protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis sativus] >KGN58323.1 hypothetical protein Csa_017553 [Cucumis sativus])

HSP 1 Score: 576.2 bits (1484), Expect = 2.0e-160
Identity = 298/338 (88.17%), Positives = 308/338 (91.12%), Query Frame = 0

Query: 41  MVVSLSMPNISSCNNIA-------RPLHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSL 100
           M VSL  P+ISS NNIA        P HN  DF+  RR  C VHRK  SF+SSP LP SL
Sbjct: 1   MAVSLFTPSISSFNNIAPPPHFSPTPSHNASDFKFRRRSYCRVHRKTVSFSSSPRLPVSL 60

Query: 101 RFLHYARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 160
           RFL Y RRNLANF CSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT
Sbjct: 61  RFLVYGRRNLANFICSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 120

Query: 161 TQLLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 220
           TQLL ALANSSRA+EVPDIL+GLG+DFGAVALFAFLYFRENNAKNAQLARLSREESLSNL
Sbjct: 121 TQLLGALANSSRADEVPDILEGLGVDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 180

Query: 221 KLRVDQNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVS 280
           KLRVDQNKVIPIS LRGIARLVICAGPESFI+EAFKSSEPFTERLLERGVLVVPLATDV+
Sbjct: 181 KLRVDQNKVIPISILRGIARLVICAGPESFIIEAFKSSEPFTERLLERGVLVVPLATDVT 240

Query: 281 TLKFEFDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG 340
           TL FEFD+REEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGV+SDSPVYLSLRMDG
Sbjct: 241 TLNFEFDDREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVTSDSPVYLSLRMDG 300

Query: 341 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 372
           RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 301 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 338

BLAST of HG10012085 vs. ExPASy Swiss-Prot
Match: Q9SRY4 (Protein LOW PSII ACCUMULATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LPA1 PE=1 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 2.6e-30
Identity = 82/265 (30.94%), Positives = 131/265 (49.43%), Query Frame = 0

Query: 121 KIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILKGLGIDF 180
           K+ SEV +PFR VR FFY  F A+  +       +L+ A+     A  + +      I+ 
Sbjct: 187 KLISEVRAPFRGVRKFFYFAFAAAAGISMFFTVPRLVQAIRGGDGAPNLLETTGNAAINI 246

Query: 181 GAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISTLRGIARLVICAGP 240
           G + +   L+  EN  +  Q+ +++R+E+LS L LR+  N+V+ +  LR   R VI AG 
Sbjct: 247 GGIVVMVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGK 306

Query: 241 ESFIMEAFKSSEPFTERLLERGVLVVPL---------------------ATDVSTLKFEF 300
           +  +  A + ++ F   LL RGVL+VP+                     AT + ++  +F
Sbjct: 307 KETVTLAMQKADRFRTELLRRGVLLVPVVWGERKTPEIEKKGFGASSKAATSLPSIGEDF 366

Query: 301 DEREE--VKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRG 360
           D R +  V     K +  ++   V   EW  W+ +Q+   GV+    VY+ LR+DGRVR 
Sbjct: 367 DTRAQSVVAQSKLKGEIRFKAETVSPGEWERWIRDQQISEGVNPGDDVYIILRLDGRVRR 426

Query: 361 SGVGYPPWNALVAQLPPVKGLWSGL 363
           SG G P W  +  +LPP+  + S L
Sbjct: 427 SGRGMPDWAEISKELPPMDDVLSKL 451

BLAST of HG10012085 vs. ExPASy TrEMBL
Match: A0A6J1HGA1 (protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111463810 PE=4 SV=1)

HSP 1 Score: 606.7 bits (1563), Expect = 6.6e-170
Identity = 313/376 (83.24%), Positives = 331/376 (88.03%), Query Frame = 0

Query: 3   AANSGYRGGIETIPRNLLPLTNCETTSRVNAQFLESLTMVVSLSMPNISSCNNIARP--- 62
           AANSGYRG IET  RNLLPL   ETT  +N Q+LE   MVVS+++  I SCNN  RP   
Sbjct: 7   AANSGYRGAIETFLRNLLPLIIWETTLYINCQYLEWSIMVVSMAI--IPSCNNFGRPPHC 66

Query: 63  ----LHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSLRFLHYARRNLANFTCSAADKPE 122
                HN  + + CRR  CHVHR   SFN  PS+ DSLR+L Y RRNLA+ TCSA+DKPE
Sbjct: 67  SRTRSHNARNLKFCRRICCHVHRNPVSFNFGPSVADSLRYLQYRRRNLASVTCSASDKPE 126

Query: 123 ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILKG 182
           ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDIL G
Sbjct: 127 ISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILNG 186

Query: 183 LGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISTLRGIARLV 242
           LGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVI ISTLRGIARLV
Sbjct: 187 LGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVITISTLRGIARLV 246

Query: 243 ICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVSTLKFEFDEREEVKDITTKRKRL 302
           ICAGPESFIMEAFK+SEPFTERLLERGVLV+P ATD S+L FEFDEREE+KDITTKRKRL
Sbjct: 247 ICAGPESFIMEAFKTSEPFTERLLERGVLVIPFATDASSLNFEFDEREEMKDITTKRKRL 306

Query: 303 WRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV 362
           WRLTPVYM++WSAWLD+QKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV
Sbjct: 307 WRLTPVYMSQWSAWLDDQKKLAGVSSDSPVYLSLRMDGRVRGSGVGYPPWNALVAQLPPV 366

Query: 363 KGLWSGLLDGMDGRVL 372
           KGLWSGLLDGMDGRVL
Sbjct: 367 KGLWSGLLDGMDGRVL 380

BLAST of HG10012085 vs. ExPASy TrEMBL
Match: A0A0A0L902 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G621430 PE=4 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 9.5e-161
Identity = 298/338 (88.17%), Positives = 308/338 (91.12%), Query Frame = 0

Query: 41  MVVSLSMPNISSCNNIA-------RPLHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSL 100
           M VSL  P+ISS NNIA        P HN  DF+  RR  C VHRK  SF+SSP LP SL
Sbjct: 1   MAVSLFTPSISSFNNIAPPPHFSPTPSHNASDFKFRRRSYCRVHRKTVSFSSSPRLPVSL 60

Query: 101 RFLHYARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 160
           RFL Y RRNLANF CSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT
Sbjct: 61  RFLVYGRRNLANFICSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 120

Query: 161 TQLLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 220
           TQLL ALANSSRA+EVPDIL+GLG+DFGAVALFAFLYFRENNAKNAQLARLSREESLSNL
Sbjct: 121 TQLLGALANSSRADEVPDILEGLGVDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 180

Query: 221 KLRVDQNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVS 280
           KLRVDQNKVIPIS LRGIARLVICAGPESFI+EAFKSSEPFTERLLERGVLVVPLATDV+
Sbjct: 181 KLRVDQNKVIPISILRGIARLVICAGPESFIIEAFKSSEPFTERLLERGVLVVPLATDVT 240

Query: 281 TLKFEFDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG 340
           TL FEFD+REEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGV+SDSPVYLSLRMDG
Sbjct: 241 TLNFEFDDREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVTSDSPVYLSLRMDG 300

Query: 341 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 372
           RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 301 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 338

BLAST of HG10012085 vs. ExPASy TrEMBL
Match: A0A5D3BG35 (Protein LOW PSII ACCUMULATION 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold487G00670 PE=4 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.1e-159
Identity = 297/338 (87.87%), Positives = 308/338 (91.12%), Query Frame = 0

Query: 41  MVVSLSMPNISSCNNIA-------RPLHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSL 100
           M VSLS   +SS NNIA        P HN PDF+ CRR   HVHRK  SF+SSP LP SL
Sbjct: 1   MAVSLSTLIVSSFNNIAPPPHFSPTPSHNAPDFKFCRRSCFHVHRKTVSFSSSPRLPVSL 60

Query: 101 RFLHYARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 160
           RF  Y RRNLAN+  SAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT
Sbjct: 61  RFHVYGRRNLANYIYSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 120

Query: 161 TQLLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 220
           TQLL ALANSSRA+EVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL
Sbjct: 121 TQLLGALANSSRADEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 180

Query: 221 KLRVDQNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVS 280
           KLRVDQNKVIPISTLRGIARLVICAGPESF++EAFKSSEPFTE+LLERGVLVVPLATDV+
Sbjct: 181 KLRVDQNKVIPISTLRGIARLVICAGPESFVIEAFKSSEPFTEQLLERGVLVVPLATDVT 240

Query: 281 TLKFEFDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG 340
           TL FEFDEREEVKDIT+KRK+LWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG
Sbjct: 241 TLNFEFDEREEVKDITSKRKKLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG 300

Query: 341 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 372
           RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 301 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 338

BLAST of HG10012085 vs. ExPASy TrEMBL
Match: A0A1S3BHU5 (protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103489822 PE=4 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 1.1e-159
Identity = 297/338 (87.87%), Positives = 308/338 (91.12%), Query Frame = 0

Query: 41  MVVSLSMPNISSCNNIA-------RPLHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSL 100
           M VSLS   +SS NNIA        P HN PDF+ CRR   HVHRK  SF+SSP LP SL
Sbjct: 1   MAVSLSTLIVSSFNNIAPPPHFSPTPSHNAPDFKFCRRSCFHVHRKTVSFSSSPRLPVSL 60

Query: 101 RFLHYARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 160
           RF  Y RRNLAN+  SAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT
Sbjct: 61  RFHVYGRRNLANYIYSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 120

Query: 161 TQLLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 220
           TQLL ALANSSRA+EVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL
Sbjct: 121 TQLLGALANSSRADEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 180

Query: 221 KLRVDQNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVS 280
           KLRVDQNKVIPISTLRGIARLVICAGPESF++EAFKSSEPFTE+LLERGVLVVPLATDV+
Sbjct: 181 KLRVDQNKVIPISTLRGIARLVICAGPESFVIEAFKSSEPFTEQLLERGVLVVPLATDVT 240

Query: 281 TLKFEFDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG 340
           TL FEFDEREEVKDIT+KRK+LWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG
Sbjct: 241 TLNFEFDEREEVKDITSKRKKLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG 300

Query: 341 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 372
           RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 301 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 338

BLAST of HG10012085 vs. ExPASy TrEMBL
Match: A0A6J1HWD0 (protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111466857 PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 9.2e-156
Identity = 286/338 (84.62%), Positives = 303/338 (89.64%), Query Frame = 0

Query: 41  MVVSLSMPNISSCNNIARP-------LHNTPDFRSCRRFRCHVHRKAASFNSSPSLPDSL 100
           MVVS+++  I SCNN  RP        HN  + + CRR  CHVHR   SFN  PS+ DSL
Sbjct: 1   MVVSMAI--IPSCNNFGRPPHCSRTRSHNAHNLKFCRRICCHVHRNPVSFNFGPSVADSL 60

Query: 101 RFLHYARRNLANFTCSAADKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 160
           R+L Y RRNLA+ TCSA+DKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT
Sbjct: 61  RYLQYRRRNLASVTCSASDKPEISSTAKIRSEVLSPFRSVRMFFYLTFIASGTLGGLIAT 120

Query: 161 TQLLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSREESLSNL 220
           TQLLAALAN+SRAEEVPDIL GLGIDFGAVA FAFLYFRENNAKNAQLARLSREESLSNL
Sbjct: 121 TQLLAALANTSRAEEVPDILNGLGIDFGAVAFFAFLYFRENNAKNAQLARLSREESLSNL 180

Query: 221 KLRVDQNKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLVVPLATDVS 280
           KLRVDQNKVI ISTLRGIARLVICAGPES IMEAFK+SEPFTERLLERGVLV+P ATD S
Sbjct: 181 KLRVDQNKVITISTLRGIARLVICAGPESLIMEAFKTSEPFTERLLERGVLVIPFATDAS 240

Query: 281 TLKFEFDEREEVKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDG 340
           +L FEFDEREE+KDITTKRKRLWRLTPVYM++WSAWLD+QKKLAGVSSDSPVYLSLRMDG
Sbjct: 241 SLNFEFDEREEMKDITTKRKRLWRLTPVYMSQWSAWLDDQKKLAGVSSDSPVYLSLRMDG 300

Query: 341 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 372
           RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL
Sbjct: 301 RVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRVL 336

BLAST of HG10012085 vs. TAIR 10
Match: AT4G28740.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF3493 (InterPro:IPR021883); BEST Arabidopsis thaliana protein match is: tetratricopeptide repeat (TPR)-containing protein (TAIR:AT1G02910.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 367.5 bits (942), Expect = 1.3e-101
Identity = 199/346 (57.51%), Positives = 249/346 (71.97%), Query Frame = 0

Query: 28  TSRVNAQFLESLTMVVSLSMPNISSCNNIARPLHNTPDFRSCRRFRCHVHRKAASFNSSP 87
           T  V       +  +VS S   I  C+   + L+   +  S RR   H HR+   F+   
Sbjct: 6   TGEVAENLARPMATLVS-SQTYIYHCHISKQALYQAKESYSHRRISRHNHRERLDFSHR- 65

Query: 88  SLPDSLRFLHYARRNLANFTCSAADKP-EISSTAKIRSEVLSPFRSVRMFFYLTFIASGT 147
                L      +    N  C AAD+P EIS+ A+IRSEVLSPFRSVRMFFYL FIASG+
Sbjct: 66  --NHRLTITRKQQPLSFNTVCFAADEPSEISADARIRSEVLSPFRSVRMFFYLAFIASGS 125

Query: 148 LGGLIATTQLLAALANSSRAEEVPDILKGLGIDFGAVALFAFLYFRENNAKNAQLARLSR 207
           LGGLIAT++L+ ALAN +R+ EV +I+KGLG+D GA +LFAFLYF EN  KNAQ+ARLSR
Sbjct: 126 LGGLIATSRLIGALANPARSGEVLEIVKGLGVDIGAASLFAFLYFNENKTKNAQMARLSR 185

Query: 208 EESLSNLKLRVDQ-NKVIPISTLRGIARLVICAGPESFIMEAFKSSEPFTERLLERGVLV 267
           EE+L  LK+RV++ NKVI +  LRG+ARLVICAGP  FI EAFK S+ +T+ L+ERGV+V
Sbjct: 186 EENLGKLKMRVEENNKVISVGDLRGVARLVICAGPAEFIEEAFKRSKEYTQGLVERGVVV 245

Query: 268 VPLATDVSTLKFEFDEREEV-KDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSP 327
           V  ATD ++   EFDE +   ++++ +RK+LWR+TPV++ EW  WL+EQKKLA VSSDSP
Sbjct: 246 VAYATDGNSPVLEFDETDIADEEMSQRRKKLWRVTPVFVPEWEKWLNEQKKLANVSSDSP 305

Query: 328 VYLSLRMDGRVRGSGVGYPPWNALVAQLPPVKGLWSGLLDGMDGRV 371
           VYLSLR+DGRVR SGVGYPPW A VAQLPPVKG+W+GLLDGMDGRV
Sbjct: 306 VYLSLRLDGRVRASGVGYPPWQAFVAQLPPVKGMWTGLLDGMDGRV 347

BLAST of HG10012085 vs. TAIR 10
Match: AT1G02910.1 (tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 134.4 bits (337), Expect = 1.8e-31
Identity = 82/265 (30.94%), Positives = 131/265 (49.43%), Query Frame = 0

Query: 121 KIRSEVLSPFRSVRMFFYLTFIASGTLGGLIATTQLLAALANSSRAEEVPDILKGLGIDF 180
           K+ SEV +PFR VR FFY  F A+  +       +L+ A+     A  + +      I+ 
Sbjct: 187 KLISEVRAPFRGVRKFFYFAFAAAAGISMFFTVPRLVQAIRGGDGAPNLLETTGNAAINI 246

Query: 181 GAVALFAFLYFRENNAKNAQLARLSREESLSNLKLRVDQNKVIPISTLRGIARLVICAGP 240
           G + +   L+  EN  +  Q+ +++R+E+LS L LR+  N+V+ +  LR   R VI AG 
Sbjct: 247 GGIVVMVSLFLWENKKEEEQMVQITRDETLSRLPLRLSTNRVVELVQLRDTVRPVILAGK 306

Query: 241 ESFIMEAFKSSEPFTERLLERGVLVVPL---------------------ATDVSTLKFEF 300
           +  +  A + ++ F   LL RGVL+VP+                     AT + ++  +F
Sbjct: 307 KETVTLAMQKADRFRTELLRRGVLLVPVVWGERKTPEIEKKGFGASSKAATSLPSIGEDF 366

Query: 301 DEREE--VKDITTKRKRLWRLTPVYMTEWSAWLDEQKKLAGVSSDSPVYLSLRMDGRVRG 360
           D R +  V     K +  ++   V   EW  W+ +Q+   GV+    VY+ LR+DGRVR 
Sbjct: 367 DTRAQSVVAQSKLKGEIRFKAETVSPGEWERWIRDQQISEGVNPGDDVYIILRLDGRVRR 426

Query: 361 SGVGYPPWNALVAQLPPVKGLWSGL 363
           SG G P W  +  +LPP+  + S L
Sbjct: 427 SGRGMPDWAEISKELPPMDDVLSKL 451

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038888898.1  2.7e-170  93.99         protein LOW PSII ACCUMULATION 1, chloroplastic [Benincasa hispida] >XP_038888899...
XP_022963501.1  1.4e-169  83.24         protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita moschata] >XP_02296350...
XP_023554665.1  1.2e-168  82.71         protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita pepo subsp. pepo] >XP_...
KAG7011306.1    2.5e-163  78.54         Protein LOW PSII ACCUMULATION 1, chloroplastic, partial [Cucurbita argyrosperma ...
XP_004152010.1  2.0e-160  88.17         protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis sativus] >KGN58323.1 hyp...

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q9SRY4          2.6e-30   30.94         Protein LOW PSII ACCUMULATION 1, chloroplastic OS=Arabidopsis thaliana OX=3702 G...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1HGA1      6.6e-170  83.24         protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucurbita moschata OX=3662 GN=...
A0A0A0L902      9.5e-161  88.17         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G621430 PE=4 SV=1
A0A5D3BG35      1.1e-159  87.87         Protein LOW PSII ACCUMULATION 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3BHU5      1.1e-159  87.87         protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103...
A0A6J1HWD0      9.2e-156  84.62         protein LOW PSII ACCUMULATION 1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LO...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT4G28740.1     1.3e-101  57.51         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT1G02910.1     1.8e-31   30.94         tetratricopeptide repeat (TPR)-containing protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                       Source   Source Term    Source Description                              Alignment
IPR021883  Protein LOW PSII ACCUMULATION 1-like  PFAM     PF11998        DUF3493                                         coord: 117..194; e-value: 3.5E-27; score: 94.4
None       No IPR available                      PANTHER  PTHR35498:SF1  LOW PSII ACCUMULATION-LIKE PROTEIN              coord: 72..369
None       No IPR available                      PANTHER  PTHR35498      PROTEIN LOW PSII ACCUMULATION 1, CHLOROPLASTIC  coord: 72..369

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10012085.1  HG10012085.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane