CmaCh10G011670 (gene) Cucurbita maxima (Rimu)

Name: CmaCh10G011670
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: Chloroplast oxygen-evolving complex/thylakoid lumenal 25.6 kDa protein
Location: Cma_Chr10: 8377302..8380178 (+)
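
The span of the gene on the chromosome can be derived from the location coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), and the helper name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = feature_length(8377302, 8380178)
print(gene_length)  # 2877
```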
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGTAATCCTCGATTCCTTCTTACCTCCAATTCACACACTCAGTTCTTCTTCTCGGAAATGGATTCCAACCGCTTCCCCGTCTCCGCGATGGCCGCCAATCGATTCATCCCCCAAATCTCGCTTCTCACTCACCTCTGAATCTAAACCGGTAATTCTTCCAAAATTACTACACAAACTGTAATCTGGAACTTGTTGCTTGCATTTATCGGCATCGCTTTCTAATATGTAACCAGATTAAAGCCCTTGCAGTTCCGAGGAGGAGTGCAATGGCGTTAATCTTGTAATCTGGAACTTGTTTCTTGCATTTATCGCTATCGCTTTCGCTTTATAATTTGTAATCAGATTAAAGCCGTTGCAGTTCAGAGGAGGAGTGCAATGGCGTTGATCTTGTAGTCTGGAACTTGTTTCTTGCATCTATCGCTATCGCTTTCTAATATGTAACCAGATTAAAGCCGTTGCAGTTCCGAGGAGGAGTGCAATGGCGTTGATCTTGTAATCTGGAACTTGTTACTTGCATATATCGCTATCGCTATCGCTTTCTGATATGTAATCAGATTAAAGCCGTTGCAGTTCCGAGGAGGAGTGCAATGGCGTTGATCTTGTAATCTGGAACTTGTTTCTTGCATTTATCGCTATCGCTTTCTAATGTGTAATCAGATTAAAGCCCTTGCAGTTCCGAGGAGGAGTGCGATGGCGTTGATCTTGTCCTCTTGTATTTTCTCGAATTCGGCAATGGCTGAGCAATCTGTTGGATTATTGGAATACATTGACACATTTGATGGCTATTCGTTCAATTACCCTAAGAATTGGATTCAAGTTCGGGGGGCGGGGGCTGACATATTCTTTAGGGATCCGTTTGTTCTTGATGAGAATATTTCCGTGGAGTTCTCGTCGCCGTCTTCATCTAGGTATAAGAGCGTTGAGGATTTGGGACCTCCTGAAGAAGCGGGGAAGAAAGTGCTGAAACAGTATTTGACAGAGTTCATGTCTACAAGGCTTGGGGTTAGAAGGGAATCCAACATTCTTTCTACTTCTTCAAGAATGGCGGATGATGGCAGGACTTACTACCAAGTTGAGGTAATGTGAATATTGAGAAGTTTTATATAATTATTGTCCCATCTTTTCCTGCTTGAATTTTATCCATTTTGGAAGGTGGTTTACACAAAATTTTGAAAGTTCACGAACATATTTAAGATCTAATACAAATTCAAATTTGTTTGATAAAACTTGTAGACCCCATCTCAATTGGAGAGGGAAACAAAAACATTTTTTATAAGGGTATAGACACGATCTCCCTAGTAGATGCGTTTTAAAAATTTTGTAGGGAAGCCTGAGGACAACCCAAAGAGGACAATATTTGCTAGCGGTGGACTTGGGCTGTTACAAATAGTATCAGGACACAGCGAGGACACTTAGCCCTGAAGGGGTGGACACTGGGAGATGTGCTAGCAAGGTCGTTGGATCTCGAAGGGGGTGGATTGTGAGATTCCACATTGATTGGAGAGAGGAACGAGTGTCAGCAAGGAGGCTAGGTCTCGAAGGGGGGTGGATTGTGAGATCCCATATCTGTTAGAATCGAAGCATTGTTTTATGAAAACCTCTCTAGCAGACGCGTTTTAAAAACGCATGAAAGAGGACAATATTTGCTAGCAGTGGACTTGGGCTGTTATAAATAGTATCAGGACACAGCGAGGACGCTTAGCCCCGAAGGGGGTGGACACCGGAGGTGCGTCAGCAAGGACGCTGGGTCCCGAAAGGGGGTAGATTGTGAGATCCCACATCGGTTAGAGAGAGGAACGAGGAGGCTAGGCCCCAAAGGGGGGTAGATTGTGAGATCTCACATCGGTTAGAAACAAAACATTCTTTATAAGGTTGTGAAAACCTCTCTCAGGCAGACACGTTTTAAAAACACGTGAAAGGAAAAACCCTAAAATGACAATATTTTCTACCTTTTGAGATTTCATAGTTCATTCTTTGACATTAAATAATGTAGCG
TCTCTAAAGTAGTTAGTTTACTATTTTTTTTTTCACGATAGCTTAACTTTTAACAAATGTTAGACTCGAGTTTATGATGTCGTTTAGCTTCTTTGATCTACATTTCTTCAAATGCATCATAACTATCTCCAATAATTGTATTATGTGCATATCAGGTAAACATAAAGTCATATGCAAACAACAATGAACTGGCAGTAATGCCACAGGATCGGGTGGTTCGTCTGGAATGGGACCGAAGATATCTCTCCGTTCTCGGAGTCGAAAACAATCGGCTATATGAACTGAGATTACAAACCCCAGAAAATGTGTTTGTAGAAGAAGAAAATGAGCTGCGTCAAGTTATGGATTCTTTCAGAGTCAACAAAGTGAATGCATGAAATGAAACACAAGAGGAGGCTTGTTTTCAGATACAATAGCTGAAATATTGTACTTTCTGTCATCAAATCACTACTGGATTTTACAGTTTGATATTGGAATCCGCTCTCACTAATTTAACATTGAACCAAAGATTGACACACTATCAATTTCAATGAAACAGGCATCATTTAAGGATTGTGCTCACTTCACACTTGGGTAGTACCGTTATAGCTTAGGACCGAACTTCGAACTTCCCAGCTCGTGTTAGAGCTCGAAAGCCTTTGTAAGGATGATTCGGCAAGTTCTTCACGTCGAGTTTATCTACTTGCAGGCCAGAGAGAACTACACCCATGATCTTAAAACGCACTTGAAATGTTGGAAATATATGAAGCTGTTGTAATCCTGTCTCGAGTGTTAATGTTCCAGACATTGAAGGGGTTTTATCTTTTGGCATCTTTCCAATTGTCCAAGAACACATCTGTAGAAATAAAACGGTATTGAATTTTTAGTATATCATGAG

mRNA sequence

ATGGCTGTAATCCTCGATTCCTTCTTACCTCCAATTCACACACTCAGTTCTTCTTCTCGGAAATGGATTCCAACCGCTTCCCCGTCTCCGCGATGGCCGCCAATCGATTCATCCCCCAAATCTCGCTTCTCACTCACCTCTGAATCTAAACCGATTAAAGCCCTTGCAGTTCCGAGGAGGAGTGCGATGGCGTTGATCTTGTCCTCTTGTATTTTCTCGAATTCGGCAATGGCTGAGCAATCTGTTGGATTATTGGAATACATTGACACATTTGATGGCTATTCGTTCAATTACCCTAAGAATTGGATTCAAGTTCGGGGGGCGGGGGCTGACATATTCTTTAGGGATCCGTTTGTTCTTGATGAGAATATTTCCGTGGAGTTCTCGTCGCCGTCTTCATCTAGGTATAAGAGCGTTGAGGATTTGGGACCTCCTGAAGAAGCGGGGAAGAAAGTGCTGAAACAGTATTTGACAGAGTTCATGTCTACAAGGCTTGGGGTTAGAAGGGAATCCAACATTCTTTCTACTTCTTCAAGAATGGCGGATGATGGCAGGACTTACTACCAAGTTGAGGTAAACATAAAGTCATATGCAAACAACAATGAACTGGCAGTAATGCCACAGGATCGGGTGGTTCGTCTGGAATGGGACCGAAGATATCTCTCCGTTCTCGGAGTCGAAAACAATCGGCTATATGAACTGAGATTACAAACCCCAGAAAATGTGTTTGTAGAAGAAGAAAATGAGCTGCGTCAAGTTATGGATTCTTTCAGAGTCAACAAAGTGAATGCATGAAATGAAACACAAGAGGAGGCTTGTTTTCAGATACAATAGCTGAAATATTGTACTTTCTGTCATCAAATCACTACTGGATTTTACAGTTTGATATTGGAATCCGCTCTCACTAATTTAACATTGAACCAAAGATTGACACACTATCAATTTCAATGAAACAGGCATCATTTAAGGATTGTGCTCACTTCACACTTGGGTAGTACCGTTATAGCTTAGGACCGAACTTCGAACTTCCCAGCTCGTGTTAGAGCTCGAAAGCCTTTGTAAGGATGATTCGGCAAGTTCTTCACGTCGAGTTTATCTACTTGCAGGCCAGAGAGAACTACACCCATGATCTTAAAACGCACTTGAAATGTTGGAAATATATGAAGCTGTTGTAATCCTGTCTCGAGTGTTAATGTTCCAGACATTGAAGGGGTTTTATCTTTTGGCATCTTTCCAATTGTCCAAGAACACATCTGTAGAAATAAAACGGTATTGAATTTTTAGTATATCATGAG

Coding sequence (CDS)

ATGGCTGTAATCCTCGATTCCTTCTTACCTCCAATTCACACACTCAGTTCTTCTTCTCGGAAATGGATTCCAACCGCTTCCCCGTCTCCGCGATGGCCGCCAATCGATTCATCCCCCAAATCTCGCTTCTCACTCACCTCTGAATCTAAACCGATTAAAGCCCTTGCAGTTCCGAGGAGGAGTGCGATGGCGTTGATCTTGTCCTCTTGTATTTTCTCGAATTCGGCAATGGCTGAGCAATCTGTTGGATTATTGGAATACATTGACACATTTGATGGCTATTCGTTCAATTACCCTAAGAATTGGATTCAAGTTCGGGGGGCGGGGGCTGACATATTCTTTAGGGATCCGTTTGTTCTTGATGAGAATATTTCCGTGGAGTTCTCGTCGCCGTCTTCATCTAGGTATAAGAGCGTTGAGGATTTGGGACCTCCTGAAGAAGCGGGGAAGAAAGTGCTGAAACAGTATTTGACAGAGTTCATGTCTACAAGGCTTGGGGTTAGAAGGGAATCCAACATTCTTTCTACTTCTTCAAGAATGGCGGATGATGGCAGGACTTACTACCAAGTTGAGGTAAACATAAAGTCATATGCAAACAACAATGAACTGGCAGTAATGCCACAGGATCGGGTGGTTCGTCTGGAATGGGACCGAAGATATCTCTCCGTTCTCGGAGTCGAAAACAATCGGCTATATGAACTGAGATTACAAACCCCAGAAAATGTGTTTGTAGAAGAAGAAAATGAGCTGCGTCAAGTTATGGATTCTTTCAGAGTCAACAAAGTGAATGCATGA

Protein sequence

MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWPPIDSSPKSRFSLTSESKPIKALAVPRRSAMALILSSCIFSNSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFRDPFVLDENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELRLQTPENVFVEEENELRQVMDSFRVNKVNA
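
The protein sequence above should follow from the CDS by translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
# Standard genetic code (assumed: NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nt and last 18 nt of the CDS listed in this record.
print(translate("ATGGCTGTAATCCTCGATTCC"))  # MAVILDS
print(translate("AACAAAGTGAATGCATGA"))     # NKVNA
```

Both fragments agree with the start (MAVILDS...) and end (...NKVNA) of the protein sequence listed above.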
BLAST of CmaCh10G011670 vs. Swiss-Prot
Match: PPD1_ARATH (PsbP domain-containing protein 1, chloroplastic OS=Arabidopsis thaliana GN=PPD1 PE=1 SV=1)

HSP 1 Score: 305.4 bits (781), Expect = 5.8e-82
Identity = 164/243 (67.49%), Positives = 194/243 (79.84%), Query Frame = 1

Query: 27  SPSPRWPPIDSS-PKSRFSLTSESKPIKALAVPRRSAM--ALILSSCIFSNS----AMAE 86
           S  P++    S+ P+S  ++   +   +  AV RR +M   L++S  I S +    A A 
Sbjct: 46  SSGPKYQSAKSAKPESPVAINCLTDAKQVCAVGRRKSMMMGLLMSGLIVSQANLPTAFAS 105

Query: 87  QSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFRDPFVLDENISVEFSSPSSSRYKSV 146
             V   EYIDTFDGYSF YP+NWIQVRGAGADIFFRDP VLDEN+SVEFSSPSSS Y S+
Sbjct: 106 TPV-FREYIDTFDGYSFKYPQNWIQVRGAGADIFFRDPVVLDENLSVEFSSPSSSNYTSL 165

Query: 147 EDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMADDGRTYYQVEVNIKSYAN 206
           EDLG PEE GK+VL+QYLTEFMSTRLGV+R++NILSTSSR+ADDG+ YYQVEVNIKSYAN
Sbjct: 166 EDLGSPEEVGKRVLRQYLTEFMSTRLGVKRQANILSTSSRVADDGKLYYQVEVNIKSYAN 225

Query: 207 NNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELRLQTPENVFVEEENELRQVMDSFRV 263
           NNELAVMPQDRV RLEW+RRYL+VLGVEN+RLY +RLQTPE VF+EEE +LR+VMDSFRV
Sbjct: 226 NNELAVMPQDRVARLEWNRRYLAVLGVENDRLYSIRLQTPEKVFLEEEKDLRRVMDSFRV 285
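
The Identity and Positives percentages in each HSP header are simply the match counts divided by the alignment length. A small sketch, using the counts reported for the PPD1_ARATH hit (the helper name `percent` is illustrative):

```python
def percent(count: int, aligned_length: int) -> float:
    """Share of aligned columns, as a percentage rounded to two decimals."""
    return round(100 * count / aligned_length, 2)


# Counts from the Swiss-Prot hit PPD1_ARATH: 164 identical and
# 194 positive columns over an alignment length of 243.
print(percent(164, 243))  # 67.49
print(percent(194, 243))  # 79.84
```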

BLAST of CmaCh10G011670 vs. TrEMBL
Match: B9RFL8_RICCO (Thylakoid lumenal 21.5 kDa protein, chloroplast, putative OS=Ricinus communis GN=RCOM_1435730 PE=4 SV=1)

HSP 1 Score: 382.9 bits (982), Expect = 3.2e-103
Identity = 201/267 (75.28%), Positives = 226/267 (84.64%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWPPIDSSPKSRFSLTSESKPIKALAVPRR 60
           MA ILDSFLPP+H  S +   ++ +        PI +      S++ +++P KA AVPRR
Sbjct: 1   MATILDSFLPPVHLTSPTRPAFLSSWFSCSL--PISADSTRCTSISCKNQPTKAFAVPRR 60

Query: 61  SAMALILSSCI-----FSNSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFR 120
           S MALI SSCI     F +SA+A+ SVG  EYIDTFDGYSF YPKNWIQVRGAGADIFFR
Sbjct: 61  STMALIFSSCILSEVGFHSSALAQSSVGFREYIDTFDGYSFKYPKNWIQVRGAGADIFFR 120

Query: 121 DPFVLDENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILS 180
           DP+VLDEN+SVE SSPSSS+Y SVEDLGPP+EAGKKVLKQYLTEFMSTRLGVRRES+ILS
Sbjct: 121 DPYVLDENLSVEMSSPSSSKYTSVEDLGPPQEAGKKVLKQYLTEFMSTRLGVRRESDILS 180

Query: 181 TSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELR 240
           TSSR+ADDG+ YYQVEVNIKSYANNNE+AVMPQDRVVRLEW+RRYLSVLGVENNRLYELR
Sbjct: 181 TSSRVADDGKLYYQVEVNIKSYANNNEMAVMPQDRVVRLEWNRRYLSVLGVENNRLYELR 240

Query: 241 LQTPENVFVEEENELRQVMDSFRVNKV 263
           LQTPENVFVEEEN+LRQVM+SFRVNKV
Sbjct: 241 LQTPENVFVEEENDLRQVMESFRVNKV 265

BLAST of CmaCh10G011670 vs. TrEMBL
Match: A0A0A0LTD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G181310 PE=4 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 2.1e-102
Identity = 191/202 (94.55%), Positives = 198/202 (98.02%), Query Frame = 1

Query: 63  MALILSSCIFSNSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFRDPFVLDE 122
           MAL+LS+CIFSNSA+A  SVGLLEYIDTFDGYSF YPKNWIQVRGAGADIFFRDPFVLDE
Sbjct: 1   MALMLSTCIFSNSALAVSSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDPFVLDE 60

Query: 123 NISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMAD 182
           N+SVEFSSPSSSRY SV+DLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMAD
Sbjct: 61  NLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMAD 120

Query: 183 DGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELRLQTPENV 242
           DGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVEN+RLYELRLQTPENV
Sbjct: 121 DGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQTPENV 180

Query: 243 FVEEENELRQVMDSFRVNKVNA 265
           FVEEEN+LRQVMDSFRVNKVNA
Sbjct: 181 FVEEENDLRQVMDSFRVNKVNA 202

BLAST of CmaCh10G011670 vs. TrEMBL
Match: A0A061F332_THECC (Photosystem II reaction center PsbP family protein isoform 1 OS=Theobroma cacao GN=TCM_026794 PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 5.1e-101
Identity = 206/269 (76.58%), Positives = 225/269 (83.64%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWPPIDSSPKSRFSLTSESKPIKALAVPRR 60
           MA ILDS LPP       SR  +PT   +P   P  SS  S    T +++  KA A+PRR
Sbjct: 30  MATILDSLLPP-------SRPTLPTRLSTPF--PSSSSCIS----TRKTQKTKAFALPRR 89

Query: 61  SAMALILSSCIFS-----NSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFR 120
           +AMALILSSCIFS     + A A+ SVGL EYIDTFDGYSF YP+NWIQVRGAGADIFFR
Sbjct: 90  NAMALILSSCIFSEVGLHDFAFAQPSVGLREYIDTFDGYSFKYPQNWIQVRGAGADIFFR 149

Query: 121 DPFVLDENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILS 180
           DP+VLDEN+SVE SSPSSSRYK+VEDLGPP+EAGKKVLKQYLTEFMSTRLGVRRESNILS
Sbjct: 150 DPYVLDENLSVEMSSPSSSRYKTVEDLGPPQEAGKKVLKQYLTEFMSTRLGVRRESNILS 209

Query: 181 TSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELR 240
           TSSR+ADDG+ YYQVEVNIKSYAN NELAVMPQDRV RLEW+RRYLSVLGVENNRLYELR
Sbjct: 210 TSSRVADDGKLYYQVEVNIKSYANTNELAVMPQDRVPRLEWNRRYLSVLGVENNRLYELR 269

Query: 241 LQTPENVFVEEENELRQVMDSFRVNKVNA 265
           LQTPENVFVEEEN+LRQVMDSFRVNKV +
Sbjct: 270 LQTPENVFVEEENDLRQVMDSFRVNKVTS 285

BLAST of CmaCh10G011670 vs. TrEMBL
Match: U5GD37_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s01430g PE=4 SV=1)

HSP 1 Score: 375.2 bits (962), Expect = 6.7e-101
Identity = 205/271 (75.65%), Positives = 225/271 (83.03%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWP----PIDSSPKSRFSLTSESKPIKALA 60
           MA ILDSF PP+           PT S S  WP    PI  +P    S++  ++  KA A
Sbjct: 1   MARILDSFPPPLQLTH-------PTLSRST-WPRCSMPISPNPTFCSSISHYNQLTKAFA 60

Query: 61  VPRRSAMALILSSCIFS-----NSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGAD 120
           VPRR+AMALILSS +FS     N A A+QSVG  EYID FDGYSF YP+NWIQVRGAGAD
Sbjct: 61  VPRRNAMALILSSYMFSEFGFDNLAFAQQSVGFREYIDQFDGYSFKYPQNWIQVRGAGAD 120

Query: 121 IFFRDPFVLDENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRES 180
           IFFRDPFVLDEN+SVE SSPSSSRYKSVEDLGPP+EAGKKVLKQYLTEFMSTRLGVRRES
Sbjct: 121 IFFRDPFVLDENLSVELSSPSSSRYKSVEDLGPPQEAGKKVLKQYLTEFMSTRLGVRRES 180

Query: 181 NILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRL 240
           NILSTSSR+ADDG+ YYQVEVNIKSYANNNELAVMP++RVVRLEWDRRYLSVLGVENN+L
Sbjct: 181 NILSTSSRVADDGKLYYQVEVNIKSYANNNELAVMPRERVVRLEWDRRYLSVLGVENNQL 240

Query: 241 YELRLQTPENVFVEEENELRQVMDSFRVNKV 263
           YELRLQTPENVFVEEEN+LR+VMDSFRVNK+
Sbjct: 241 YELRLQTPENVFVEEENDLRKVMDSFRVNKI 263

BLAST of CmaCh10G011670 vs. TrEMBL
Match: A9PDI9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s00790g PE=2 SV=1)

HSP 1 Score: 365.9 bits (938), Expect = 4.1e-98
Identity = 203/270 (75.19%), Positives = 221/270 (81.85%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTL--SSSSRKWIPTASP-SPRWPPIDSSPKSRFSLTSESKPIKALAV 60
           MA ILDSF PP      + S   W+  + P SP      S P ++  LT      KA AV
Sbjct: 1   MARILDSFPPPTQLTHPTRSRPTWLSCSMPISPNSTCFSSIPHNK-QLT------KAFAV 60

Query: 61  PRRSAMALILSSCIFS-----NSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADI 120
           PRR+AMALILSS IFS     N A A++SVG  EYID FDGYS  +P+NWIQVRGAGADI
Sbjct: 61  PRRNAMALILSSYIFSEVGFNNIAFAQRSVGFREYIDQFDGYSLKHPQNWIQVRGAGADI 120

Query: 121 FFRDPFVLDENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESN 180
           FFRDPFVLDEN+SVE SSPSSS YKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESN
Sbjct: 121 FFRDPFVLDENLSVELSSPSSSNYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESN 180

Query: 181 ILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLY 240
           I+STSSR+ADDG+ YYQVEVNIKSYANNNELAVMPQ+RVVRLEW+RRY+SVLGVENNRLY
Sbjct: 181 IISTSSRVADDGKLYYQVEVNIKSYANNNELAVMPQERVVRLEWNRRYMSVLGVENNRLY 240

Query: 241 ELRLQTPENVFVEEENELRQVMDSFRVNKV 263
           ELRLQTPENVFVEEEN+LRQVMDSFRVNKV
Sbjct: 241 ELRLQTPENVFVEEENDLRQVMDSFRVNKV 263

BLAST of CmaCh10G011670 vs. TAIR10
Match: AT4G15510.1 (AT4G15510.1 Photosystem II reaction center PsbP family protein)

HSP 1 Score: 305.4 bits (781), Expect = 3.3e-83
Identity = 164/243 (67.49%), Positives = 194/243 (79.84%), Query Frame = 1

Query: 27  SPSPRWPPIDSS-PKSRFSLTSESKPIKALAVPRRSAM--ALILSSCIFSNS----AMAE 86
           S  P++    S+ P+S  ++   +   +  AV RR +M   L++S  I S +    A A 
Sbjct: 46  SSGPKYQSAKSAKPESPVAINCLTDAKQVCAVGRRKSMMMGLLMSGLIVSQANLPTAFAS 105

Query: 87  QSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFRDPFVLDENISVEFSSPSSSRYKSV 146
             V   EYIDTFDGYSF YP+NWIQVRGAGADIFFRDP VLDEN+SVEFSSPSSS Y S+
Sbjct: 106 TPV-FREYIDTFDGYSFKYPQNWIQVRGAGADIFFRDPVVLDENLSVEFSSPSSSNYTSL 165

Query: 147 EDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMADDGRTYYQVEVNIKSYAN 206
           EDLG PEE GK+VL+QYLTEFMSTRLGV+R++NILSTSSR+ADDG+ YYQVEVNIKSYAN
Sbjct: 166 EDLGSPEEVGKRVLRQYLTEFMSTRLGVKRQANILSTSSRVADDGKLYYQVEVNIKSYAN 225

Query: 207 NNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELRLQTPENVFVEEENELRQVMDSFRV 263
           NNELAVMPQDRV RLEW+RRYL+VLGVEN+RLY +RLQTPE VF+EEE +LR+VMDSFRV
Sbjct: 226 NNELAVMPQDRVARLEWNRRYLAVLGVENDRLYSIRLQTPEKVFLEEEKDLRRVMDSFRV 285

BLAST of CmaCh10G011670 vs. NCBI nr
Match: gi|449443516|ref|XP_004139523.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 432.2 bits (1110), Expect = 6.6e-118
Identity = 230/264 (87.12%), Positives = 243/264 (92.05%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWPPIDSSPKSRFSLTSESKPIKALAVPRR 60
           MAVILDSFLP I TLSSS R+ IP+ S S RWP ++S PK      SES+ IK +AVPRR
Sbjct: 1   MAVILDSFLPSIQTLSSSFRQRIPSTS-STRWP-MNSFPKC----CSESQSIKGVAVPRR 60

Query: 61  SAMALILSSCIFSNSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFRDPFVL 120
           SAMAL+LS+CIFSNSA+A  SVGLLEYIDTFDGYSF YPKNWIQVRGAGADIFFRDPFVL
Sbjct: 61  SAMALMLSTCIFSNSALAVSSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDPFVL 120

Query: 121 DENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRM 180
           DEN+SVEFSSPSSSRY SV+DLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRM
Sbjct: 121 DENLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRM 180

Query: 181 ADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELRLQTPE 240
           ADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVEN+RLYELRLQTPE
Sbjct: 181 ADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQTPE 240

Query: 241 NVFVEEENELRQVMDSFRVNKVNA 265
           NVFVEEEN+LRQVMDSFRVNKVNA
Sbjct: 241 NVFVEEENDLRQVMDSFRVNKVNA 258

BLAST of CmaCh10G011670 vs. NCBI nr
Match: gi|659128677|ref|XP_008464319.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 419.1 bits (1076), Expect = 5.8e-114
Identity = 225/264 (85.23%), Positives = 237/264 (89.77%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWPPIDSSPKSRFSLTSESKPIKALAVPRR 60
           MAVIL SFLP I TL+S  R+ IP+ S +    PI S PK      S+S+ IK +AVPRR
Sbjct: 1   MAVILHSFLPSIQTLTSPIRQRIPSTSSTRS--PIISFPKC----CSQSQSIKDVAVPRR 60

Query: 61  SAMALILSSCIFSNSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFRDPFVL 120
           +AMALILS+CIFSNSA A  SVGLLEYIDTFDGYSF YPKNWIQVRGAGADIFFRDPFVL
Sbjct: 61  NAMALILSTCIFSNSAFAVPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDPFVL 120

Query: 121 DENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRM 180
           DEN+SVEFSSPSSSRY SV+DLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRM
Sbjct: 121 DENLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRM 180

Query: 181 ADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELRLQTPE 240
           ADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVEN+RLYELRLQTPE
Sbjct: 181 ADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQTPE 240

Query: 241 NVFVEEENELRQVMDSFRVNKVNA 265
           NVFVEEENELRQVMDSFRVNKVNA
Sbjct: 241 NVFVEEENELRQVMDSFRVNKVNA 258

BLAST of CmaCh10G011670 vs. NCBI nr
Match: gi|255542948|ref|XP_002512537.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Ricinus communis])

HSP 1 Score: 382.9 bits (982), Expect = 4.6e-103
Identity = 201/267 (75.28%), Positives = 226/267 (84.64%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWPPIDSSPKSRFSLTSESKPIKALAVPRR 60
           MA ILDSFLPP+H  S +   ++ +        PI +      S++ +++P KA AVPRR
Sbjct: 1   MATILDSFLPPVHLTSPTRPAFLSSWFSCSL--PISADSTRCTSISCKNQPTKAFAVPRR 60

Query: 61  SAMALILSSCI-----FSNSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFR 120
           S MALI SSCI     F +SA+A+ SVG  EYIDTFDGYSF YPKNWIQVRGAGADIFFR
Sbjct: 61  STMALIFSSCILSEVGFHSSALAQSSVGFREYIDTFDGYSFKYPKNWIQVRGAGADIFFR 120

Query: 121 DPFVLDENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILS 180
           DP+VLDEN+SVE SSPSSS+Y SVEDLGPP+EAGKKVLKQYLTEFMSTRLGVRRES+ILS
Sbjct: 121 DPYVLDENLSVEMSSPSSSKYTSVEDLGPPQEAGKKVLKQYLTEFMSTRLGVRRESDILS 180

Query: 181 TSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELR 240
           TSSR+ADDG+ YYQVEVNIKSYANNNE+AVMPQDRVVRLEW+RRYLSVLGVENNRLYELR
Sbjct: 181 TSSRVADDGKLYYQVEVNIKSYANNNEMAVMPQDRVVRLEWNRRYLSVLGVENNRLYELR 240

Query: 241 LQTPENVFVEEENELRQVMDSFRVNKV 263
           LQTPENVFVEEEN+LRQVM+SFRVNKV
Sbjct: 241 LQTPENVFVEEENDLRQVMESFRVNKV 265

BLAST of CmaCh10G011670 vs. NCBI nr
Match: gi|778659702|ref|XP_011654885.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 380.2 bits (975), Expect = 3.0e-102
Identity = 191/202 (94.55%), Positives = 198/202 (98.02%), Query Frame = 1

Query: 63  MALILSSCIFSNSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGADIFFRDPFVLDE 122
           MAL+LS+CIFSNSA+A  SVGLLEYIDTFDGYSF YPKNWIQVRGAGADIFFRDPFVLDE
Sbjct: 1   MALMLSTCIFSNSALAVSSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDPFVLDE 60

Query: 123 NISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMAD 182
           N+SVEFSSPSSSRY SV+DLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMAD
Sbjct: 61  NLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMAD 120

Query: 183 DGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRLYELRLQTPENV 242
           DGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVEN+RLYELRLQTPENV
Sbjct: 121 DGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQTPENV 180

Query: 243 FVEEENELRQVMDSFRVNKVNA 265
           FVEEEN+LRQVMDSFRVNKVNA
Sbjct: 181 FVEEENDLRQVMDSFRVNKVNA 202

BLAST of CmaCh10G011670 vs. NCBI nr
Match: gi|743890154|ref|XP_011038959.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic-like [Populus euphratica])

HSP 1 Score: 375.9 bits (964), Expect = 5.6e-101
Identity = 207/271 (76.38%), Positives = 224/271 (82.66%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSSRKWIPTASPSPRWP----PIDSSPKSRFSLTSESKPIKALA 60
           MA ILDSF PP+           PT S S  WP    PI  +P    S++  ++  KA A
Sbjct: 1   MARILDSFPPPLQLTH-------PTLSRST-WPSCSMPISPNPTFCSSISHYNQLTKAFA 60

Query: 61  VPRRSAMALILSSCIFS-----NSAMAEQSVGLLEYIDTFDGYSFNYPKNWIQVRGAGAD 120
           VPRR+AMALILSS IFS     N A A+QSVG  EYID FDGYSF YP+NWIQVRGAGAD
Sbjct: 61  VPRRNAMALILSSYIFSEVGFNNIAFAQQSVGFREYIDQFDGYSFKYPQNWIQVRGAGAD 120

Query: 121 IFFRDPFVLDENISVEFSSPSSSRYKSVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRES 180
           IFFRDPFVLDEN+SVE SSPSSSRYKSVEDLGPP+EA KKVLKQYLTEFMSTRLGVRRES
Sbjct: 121 IFFRDPFVLDENLSVELSSPSSSRYKSVEDLGPPQEAAKKVLKQYLTEFMSTRLGVRRES 180

Query: 181 NILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENNRL 240
           NILSTSSR+ADDG+ YYQVEVNIKSYANNNELAVMPQ+RVVRLEWDRRYLSVLGVENN+L
Sbjct: 181 NILSTSSRVADDGKLYYQVEVNIKSYANNNELAVMPQERVVRLEWDRRYLSVLGVENNQL 240

Query: 241 YELRLQTPENVFVEEENELRQVMDSFRVNKV 263
           YELRLQTPENVFVEEEN+LR+VMDSFRVNKV
Sbjct: 241 YELRLQTPENVFVEEENDLRKVMDSFRVNKV 263

The following BLAST results are available for this feature:

Swiss-Prot
Match Name  E-value  Identity (%)  Description
PPD1_ARATH  5.8e-82  67.49         PsbP domain-containing protein 1, chloroplastic OS=Arabidopsis thaliana GN=PPD1 ...

TrEMBL
Match Name        E-value   Identity (%)  Description
B9RFL8_RICCO      3.2e-103  75.28         Thylakoid lumenal 21.5 kDa protein, chloroplast, putative OS=Ricinus communis GN...
A0A0A0LTD2_CUCSA  2.1e-102  94.55         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G181310 PE=4 SV=1
A0A061F332_THECC  5.1e-101  76.58         Photosystem II reaction center PsbP family protein isoform 1 OS=Theobroma cacao ...
U5GD37_POPTR      6.7e-101  75.65         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s01430g PE=4 SV=1
A9PDI9_POPTR      4.1e-98   75.19         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s00790g PE=2 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT4G15510.1  3.3e-83  67.49         Photosystem II reaction center PsbP family protein

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449443516|ref|XP_004139523.1|  6.6e-118  87.12         PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis s...
gi|659128677|ref|XP_008464319.1|  5.8e-114  85.23         PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis m...
gi|255542948|ref|XP_002512537.1|  4.6e-103  75.28         PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Ricinus c...
gi|778659702|ref|XP_011654885.1|  3.0e-102  94.55         PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X2 [Cucumis s...
gi|743890154|ref|XP_011038959.1|  5.6e-101  76.38         PREDICTED: psbP domain-containing protein 1, chloroplastic-like [Populus euphrat...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002683  PsbP
IPR016123  Mog1/PsbP_a/b/a-sand
Vocabulary: Molecular Function
Term        Definition
GO:0005509  calcium ion binding
Vocabulary: Cellular Component
Term        Definition
GO:0009523  photosystem II
GO:0009654  photosystem II oxygen evolving complex
GO:0019898  extrinsic component of membrane
Vocabulary: Biological Process
Term        Definition
GO:0015979  photosynthesis
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0048564 photosystem I assembly
biological_process GO:0015979 photosynthesis
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0009543 chloroplast thylakoid lumen
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0019898 extrinsic component of membrane
cellular_component GO:0009654 photosystem II oxygen evolving complex
cellular_component GO:0009523 photosystem II
molecular_function GO:0005509 calcium ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh10G011670.1  CmaCh10G011670.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                       Source   Source Term         Source Description   Alignment
IPR002683  PsbP family                           PFAM     PF01789             PsbP                 coord: 68..259; score: 6.5
IPR016123  Mog1/PsbP, alpha/beta/alpha sandwich  GENE3D   G3DSA:3.40.1000.10  -                    coord: 83..259; score: 8.9
IPR016123  Mog1/PsbP, alpha/beta/alpha sandwich  unknown  SSF55724            Mog1p/PsbP-like      coord: 87..259; score: 8.37
None       No IPR available                      PANTHER  PTHR31407           FAMILY NOT NAMED     coord: 24..264; score: 3.2E
None       No IPR available                      PANTHER  PTHR31407:SF15      SUBFAMILY NOT NAMED  coord: 24..264; score: 3.2E

The following gene(s) are paralogous to this gene:

None