ClCG01G010870 (gene) Watermelon (Charleston Gray)

Name: ClCG01G010870
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: PsbP domain-containing protein 1, chloroplastic
Location: CG_Chr01 : 17035530 .. 17038502 (-)
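The location entry can be read programmatically to recover the chromosome, strand, and the span the gene covers. The following is a minimal sketch, assuming the `chromosome : start .. end (strand)` layout shown in this record and 1-based inclusive coordinates; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'CG_Chr01 : 17035530 .. 17038502 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr01 : 17035530 .. 17038502 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 2973 bp on the minus strand.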
The following sequences are available for this feature:

Gene sequence (with intron)

GTTTGGAGAGATTCCTGAAAATGGCTGTGATCCTCGACTCCTTCTTACCTCCAATTCACACACTGAGTTCTTCAATTCGCCAAAGGATTCCCATCACTTCTCCTTCTCCGCGATGGCCTATGGATTCATCCCCCAAATGTCGCTCTGAATCTCAATCGGTAATTCTTCCAAAATCACTTCATAAATTGTAATCAATCTGGAACTTGTTACTTGCATTTATCGCTATAGCTTTCTAATATATAATCAGATTAAAGCCGTTGCAGTTCCAAGGAGGAATGCAATGGCGTTGATCTTCTCCACTTGTATTTTCTCGAATTCTGCGTTGGCTGAGCCATCTCCATCTCCATCTCCATCTCCATCTGTTGGATTATTGGAATACATTGACACTTTTGATGGCTATTCGTTCAAATACCCTAAGAATTGGATTCAAGTTCGAGGGGCGGGGGCTGACATATTCTTCAGGGATCCGTTTGTTCTTGATGAGAATCTTTCGGTGGAGTTCTCGTCGCCGTCTTCTTCCAGCTATAAGAGCGTTCAGGATTTAGGACCTCCTGAAGAAGCGGGGAAGAAAGTGCTGAAGCAGTATTTGACTGAGTTCATGTCTACCAGGCTTGGGGTTAGAAGGGAATCCAACATTCTTTCTACTTCTTCCAGAATGGCAGATGATGGAAGAACTTACTACCAAGTTGAGGTAATTTGTGACATACATACACAACTTTGATTCAATTCTACTTTATCTTTTTCTGCATCCATTATGGTGAACAACTTGTTCTTCATCTGCTCATTTGGTTATGAATGTTTAGGTTCATTTTGATTTTGATTTTTGTTTTTGAAAATTAAGTCTAGACTCCATTTTCATTTCATTAGTTGGCCTACATCTCAAGACTGTTTTCAAAATTAAAACTAAAATTTGAAAACTAAAAGAAATTTTTAAAAACTTGTTTTTTTTTAGGAAAAAAAATTGACTAAGAATTCAAATGCTTATATAGAAAATACAAAAACTATGAGTAGACAAAATTTCAAAAACCAAAAATAAAAAACCGAATGGTTACTAACCAAAATCTTAACATCTTAACTTCAACTTTCACCTTTGTGTCAAATAAGTTTCTCGACTTTGACAGGTAACAATAGGTTCTCAAACGTCAATTTTGTATCTAGTAGGTTATTAAAAGATCAAAAACCTAGTTAACATTTTTAAAATATAGGAACTTAATAACTAATCATTGAAAGTTTAAGGTACTAATAAATATGTTTTAAAATGAAAAGATTTATTGAAACTTGTAGTTCACTCAAAAAAGTTGCCTTTTGGTCAGTGTATAATGGCTCTTCCCTGAATCTGTTTAAAACAACCCTTTTGGGTGTTTTTAACCACAGAAATCATTGGCCATGAACCTTAGTAATAAGCAATGGAAAACCATAATGGTGTAAGAGGAACATTCCAAAGATTGTTGTTTGTTGTGAGTTGTGACATTCTAGTTTTCTTTATAGGTACCTATGTGATGTATGATGTTTAAACCTGAAGCATTTCATATTTTTCTCTGCAACTCTGAATTGAACTAACCCATAAGTTATAAGTTCAAGTTAATCTTAATGTCACTTTAGCTTCTTTGATCTCCATTTCTTCAAATGCATCATAACAATCTGCAATCTGCAATCTTACGATTATGTGGGTATCAGGTAAACATAAAGTCATATGCAAACAACAATGAACTGGCAGTAATGCCACAGGATCGGGTGGTTCGTTTGGAATGGGACCGAAGATATCTCTCGGTTCTTGGAGTCGAAAACAGTCGGCTATATGAATTGAGATTACAAACTCCAGAAAACGTGTTCGTAGAAGAAGAAAATGAGTTGCGTCAAGTTATGGATTCTTTCAGAGTCAACAAAGTGAATGCATGAAATGAATTGTTGCAGCCGAGGGTTTTCAGATGTAGTAGCTGAAACATTGTACTCTTCATCATCGAAGCAAATTGATTCTGCATTTTACAGTTTGATATTGG
AGTCCTGTCTCACTAATCTAATATTTCTCGATTCCTTGATTAAGTAAAAGTGCACCAAAGATTGACAAAGTATCAATTTCACTGAAACAGGAATCACTTCTGAGCTGCCAAAAGGCATCATTTAAACAAACTTTACAAATGGAAAGATTTTTGGTCTGTTTTCAACTATCATGAGATTGTGCATTGGTTCACAGGATTTCACAGAAGATCGTGCACTCAAGATCATCACAAATTCGCAAATGGGAGTTGCCGCTATTGCTTATGACCGAACTTCGAACTTCCCTGCTCGTGTTAGAGCTCGAAAGCCTTTGTAAGGATGATTTGGCAAGTTCTTCACATCAAGTTTATCTACTTGCAGGCCAGAGAGGACTACACCCATGATCTTAAAACGCACTTGAAATGTGGGAAATACATGAAGCTGTTGTAATCCTGTCTCAAGTGTCCATGTTCCAGACATTGAAGGGGTTTTATCTTTTGGCATCTTTCCAATTGTCCAAGAGCAGATCTGTAGAAACAAATTCTATCCTTTTTAGCTCATCAGAAATAAAACACTATGGGAACTTTAGTATCCTTAAACAAACAAACATACAATTCGGGAACCAACTCCTTTCACTAGACACTAACATAGAAATAAAAGGCAGGAATTTGGGAGAGGTACCAAACACTACAAAGAAACAGCTGCTCATGCTAATGAAGGTACCGAATGCTTTCAGTGTTGCACATCCCAGACCAAAAACAATGCTTTTCTTTTTTAGAACCAAAATCTGAGATGCTTTTTGGGGGCGTGGTCGATAACAAGGAAATCCCAACAAAGAAACTTACATATTCATCATATAAACACCAGATCCTTAACTAGTCAGAAATTGCCAAATACCAATTTTAAAAGCCACTGCAACGAAAATCAACTTAAAAACTCATTTAGAAGAACTCTTCTGGTTGATTAGCAAGTAGGGATGGAATATAACTCCTTGTC

mRNA sequence

GTTTGGAGAGATTCCTGAAAATGGCTGTGATCCTCGACTCCTTCTTACCTCCAATTCACACACTGAGTTCTTCAATTCGCCAAAGGATTCCCATCACTTCTCCTTCTCCGCGATGGCCTATGGATTCATCCCCCAAATGTCGCTCTGAATCTCAATCGATTAAAGCCGTTGCAGTTCCAAGGAGGAATGCAATGGCGTTGATCTTCTCCACTTGTATTTTCTCGAATTCTGCGTTGGCTGAGCCATCTCCATCTCCATCTCCATCTCCATCTGTTGGATTATTGGAATACATTGACACTTTTGATGGCTATTCGTTCAAATACCCTAAGAATTGGATTCAAGTTCGAGGGGCGGGGGCTGACATATTCTTCAGGGATCCGTTTGTTCTTGATGAGAATCTTTCGGTGGAGTTCTCGTCGCCGTCTTCTTCCAGCTATAAGAGCGTTCAGGATTTAGGACCTCCTGAAGAAGCGGGGAAGAAAGTGCTGAAGCAGTATTTGACTGAGTTCATGTCTACCAGGCTTGGGGTTAGAAGGGAATCCAACATTCTTTCTACTTCTTCCAGAATGGCAGATGATGGAAGAACTTACTACCAAGTTGAGGTAAACATAAAGTCATATGCAAACAACAATGAACTGGCAGTAATGCCACAGGATCGGGTGGTTCGTTTGGAATGGGACCGAAGATATCTCTCGGTTCTTGGAGTCGAAAACAGTCGGCTATATGAATTGAGATTACAAACTCCAGAAAACGTGTTCGTAGAAGAAGAAAATGAGTTGCGTCAAGTTATGGATTCTTTCAGAGTCAACAAAGTGAATGCATGAAATGAATTGTTGCAGCCGAGGGTTTTCAGATGTAGTAGCTGAAACATTGTACTCTTCATCATCGAAGCAAATTGATTCTGCATTTTACAGTTTGATATTGGAGTCCTGTCTCACTAATCTAATATTTCTCGATTCCTTGATTAAGTAAAAGTGCACCAAAGATTGACAAAGTATCAATTTCACTGAAACAGGAATCACTTCTGAGCTGCCAAAAGGCATCATTTAAACAAACTTTACAAATGGAAAGATTTTTGGTCTGTTTTCAACTATCATGAGATTGTGCATTGGTTCACAGGATTTCACAGAAGATCGTGCACTCAAGATCATCACAAATTCGCAAATGGGAGTTGCCGCTATTGCTTATGACCGAACTTCGAACTTCCCTGCTCGTGTTAGAGCTCGAAAGCCTTTGTAAGGATGATTTGGCAAGTTCTTCACATCAAGTTTATCTACTTGCAGGCCAGAGAGGACTACACCCATGATCTTAAAACGCACTTGAAATGTGGGAAATACATGAAGCTGTTGTAATCCTGTCTCAAGTGTCCATGTTCCAGACATTGAAGGGGTTTTATCTTTTGGCATCTTTCCAATTGTCCAAGAGCAGATCTGTAGAAACAAATTCTATCCTTTTTAGCTCATCAGAAATAAAACACTATGGGAACTTTAGTATCCTTAAACAAACAAACATACAATTCGGGAACCAACTCCTTTCACTAGACACTAACATAGAAATAAAAGGCAGGAATTTGGGAGAGGTACCAAACACTACAAAGAAACAGCTGCTCATGCTAATGAAGGTACCGAATGCTTTCAGTGTTGCACATCCCAGACCAAAAACAATGCTTTTCTTTTTTAGAACCAAAATCTGAGATGCTTTTTGGGGGCGTGGTCGATAACAAGGAAATCCCAACAAAGAAACTTACATATTCATCATATAAACACCAGATCCTTAACTAGTCAGAAATTGCCAAATACCAATTTTAAAAGCCACTGCAACGAAAATCAACTTAAAAACTCATTTAGAAGAACTCTTCTGGTTGATTAGCAAGTAGGGATGGAATATAACTCCTTGTC

Coding sequence (CDS)

ATGGCTGTGATCCTCGACTCCTTCTTACCTCCAATTCACACACTGAGTTCTTCAATTCGCCAAAGGATTCCCATCACTTCTCCTTCTCCGCGATGGCCTATGGATTCATCCCCCAAATGTCGCTCTGAATCTCAATCGATTAAAGCCGTTGCAGTTCCAAGGAGGAATGCAATGGCGTTGATCTTCTCCACTTGTATTTTCTCGAATTCTGCGTTGGCTGAGCCATCTCCATCTCCATCTCCATCTCCATCTGTTGGATTATTGGAATACATTGACACTTTTGATGGCTATTCGTTCAAATACCCTAAGAATTGGATTCAAGTTCGAGGGGCGGGGGCTGACATATTCTTCAGGGATCCGTTTGTTCTTGATGAGAATCTTTCGGTGGAGTTCTCGTCGCCGTCTTCTTCCAGCTATAAGAGCGTTCAGGATTTAGGACCTCCTGAAGAAGCGGGGAAGAAAGTGCTGAAGCAGTATTTGACTGAGTTCATGTCTACCAGGCTTGGGGTTAGAAGGGAATCCAACATTCTTTCTACTTCTTCCAGAATGGCAGATGATGGAAGAACTTACTACCAAGTTGAGGTAAACATAAAGTCATATGCAAACAACAATGAACTGGCAGTAATGCCACAGGATCGGGTGGTTCGTTTGGAATGGGACCGAAGATATCTCTCGGTTCTTGGAGTCGAAAACAGTCGGCTATATGAATTGAGATTACAAACTCCAGAAAACGTGTTCGTAGAAGAAGAAAATGAGTTGCGTCAAGTTATGGATTCTTTCAGAGTCAACAAAGTGAATGCATGA
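The CDS is contained in the mRNA, so the offset at which it starts gives the length of the 5' UTR. A minimal sketch using only the leading bases of the two records above (truncated for brevity); the helper name `five_prime_utr_length` is hypothetical.

```python
# Leading bases of the mRNA and CDS records in this entry (truncated).
mrna_prefix = "GTTTGGAGAGATTCCTGAAAATGGCTGTGATCCTCGAC"
cds_prefix = "ATGGCTGTGATCCTCGAC"

def five_prime_utr_length(mrna, cds):
    """Return the number of mRNA bases that precede the first CDS base."""
    offset = mrna.find(cds)
    if offset < 0:
        raise ValueError("CDS not found in mRNA")
    return offset

utr_len = five_prime_utr_length(mrna_prefix, cds_prefix)
print(utr_len)  # 20
```

With the full sequences in place of the prefixes, the same call would locate the complete CDS inside the mRNA, provided the CDS occurs there as one contiguous stretch.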

Protein sequence

MAVILDSFLPPIHTLSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAVPRRNAMALIFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDPFVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQTPENVFVEEENELRQVMDSFRVNKVNA
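The protein record can be checked against the CDS by translating it with the standard genetic code. The sketch below builds the codon table by hand rather than relying on a bioinformatics library; the function name `translate` is illustrative, and only the first six and last four codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCTGTGATCCTCGAC"))  # first six codons -> MAVILD
print(translate("GTGAATGCATGA"))        # last four codons -> VNA
```

Both fragments agree with the start (MAVILD) and end (VNA) of the protein sequence listed above.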
BLAST of ClCG01G010870 vs. Swiss-Prot
Match: PPD1_ARATH (PsbP domain-containing protein 1, chloroplastic OS=Arabidopsis thaliana GN=PPD1 PE=1 SV=1)

HSP 1 Score: 315.1 bits (806), Expect = 7.5e-85
Identity = 160/226 (70.80%), Positives = 185/226 (81.86%), Query Frame = 1

Query: 40  CRSESQSIKAVAVPRRNAMALIFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSF 99
           C ++++ + AV   +   M L+ S  I S + L    P+   S  V   EYIDTFDGYSF
Sbjct: 67  CLTDAKQVCAVGRRKSMMMGLLMSGLIVSQANL----PTAFASTPV-FREYIDTFDGYSF 126

Query: 100 KYPKNWIQVRGAGADIFFRDPFVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQY 159
           KYP+NWIQVRGAGADIFFRDP VLDENLSVEFSSPSSS+Y S++DLG PEE GK+VL+QY
Sbjct: 127 KYPQNWIQVRGAGADIFFRDPVVLDENLSVEFSSPSSSNYTSLEDLGSPEEVGKRVLRQY 186

Query: 160 LTEFMSTRLGVRRESNILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEW 219
           LTEFMSTRLGV+R++NILSTSSR+ADDG+ YYQVEVNIKSYANNNELAVMPQDRV RLEW
Sbjct: 187 LTEFMSTRLGVKRQANILSTSSRVADDGKLYYQVEVNIKSYANNNELAVMPQDRVARLEW 246

Query: 220 DRRYLSVLGVENSRLYELRLQTPENVFVEEENELRQVMDSFRVNKV 266
           +RRYL+VLGVEN RLY +RLQTPE VF+EEE +LR+VMDSFRV K+
Sbjct: 247 NRRYLAVLGVENDRLYSIRLQTPEKVFLEEEKDLRRVMDSFRVEKI 287
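The percentages in each HSP header follow directly from the match counts and the alignment length. The sketch below parses the two header lines of the alignment above and recomputes them; the regular expressions and the function name `parse_hsp` are assumptions made for illustration and do not come from any BLAST toolkit.

```python
import re

HSP_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# Some exports of this page misspell the second label as "Postives"; accept both.
FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def parse_hsp(score_line, stats_line):
    """Extract scores and recompute percent identity/positives for one HSP."""
    m = HSP_RE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line")
    result = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, matched, aligned in FRACTION_RE.findall(stats_line):
        key = "identity" if label == "Identity" else "positives"
        # Percentage = matching positions / alignment length.
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result

hsp = parse_hsp(
    "HSP 1 Score: 315.1 bits (806), Expect = 7.5e-85",
    "Identity = 160/226 (70.80%), Positives = 185/226 (81.86%), Query Frame = 1",
)
print(hsp)
```

The recomputed values (70.8 and 81.86) match the percentages printed in the header.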

BLAST of ClCG01G010870 vs. TrEMBL
Match: B9RFL8_RICCO (Thylakoid lumenal 21.5 kDa protein, chloroplast, putative OS=Ricinus communis GN=RCOM_1435730 PE=4 SV=1)

HSP 1 Score: 381.7 bits (979), Expect = 7.2e-103
Identity = 206/278 (74.10%), Positives = 228/278 (82.01%), Query Frame = 1

Query: 1   MAVILDSFLPPIHT--------LSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAV 60
           MA ILDSFLPP+H         LSS     +PI++ S R    +S  C++  Q  KA AV
Sbjct: 1   MATILDSFLPPVHLTSPTRPAFLSSWFSCSLPISADSTRC---TSISCKN--QPTKAFAV 60

Query: 61  PRRNAMALIFSTCI-----FSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQ 120
           PRR+ MALIFS+CI     F +SALA+ S        VG  EYIDTFDGYSFKYPKNWIQ
Sbjct: 61  PRRSTMALIFSSCILSEVGFHSSALAQSS--------VGFREYIDTFDGYSFKYPKNWIQ 120

Query: 121 VRGAGADIFFRDPFVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTR 180
           VRGAGADIFFRDP+VLDENLSVE SSPSSS Y SV+DLGPP+EAGKKVLKQYLTEFMSTR
Sbjct: 121 VRGAGADIFFRDPYVLDENLSVEMSSPSSSKYTSVEDLGPPQEAGKKVLKQYLTEFMSTR 180

Query: 181 LGVRRESNILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVL 240
           LGVRRES+ILSTSSR+ADDG+ YYQVEVNIKSYANNNE+AVMPQDRVVRLEW+RRYLSVL
Sbjct: 181 LGVRRESDILSTSSRVADDGKLYYQVEVNIKSYANNNEMAVMPQDRVVRLEWNRRYLSVL 240

Query: 241 GVENSRLYELRLQTPENVFVEEENELRQVMDSFRVNKV 266
           GVEN+RLYELRLQTPENVFVEEEN+LRQVM+SFRVNKV
Sbjct: 241 GVENNRLYELRLQTPENVFVEEENDLRQVMESFRVNKV 265

BLAST of ClCG01G010870 vs. TrEMBL
Match: A0A061F332_THECC (Photosystem II reaction center PsbP family protein isoform 1 OS=Theobroma cacao GN=TCM_026794 PE=4 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 1.6e-102
Identity = 201/267 (75.28%), Positives = 222/267 (83.15%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAVPRRNAMAL 60
           MA ILDS LPP        R  +P    +P +P  SS     ++Q  KA A+PRRNAMAL
Sbjct: 30  MATILDSLLPPS-------RPTLPTRLSTP-FPSSSSCISTRKTQKTKAFALPRRNAMAL 89

Query: 61  IFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120
           I S+CIFS   L + + +    PSVGL EYIDTFDGYSFKYP+NWIQVRGAGADIFFRDP
Sbjct: 90  ILSSCIFSEVGLHDFAFA---QPSVGLREYIDTFDGYSFKYPQNWIQVRGAGADIFFRDP 149

Query: 121 FVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180
           +VLDENLSVE SSPSSS YK+V+DLGPP+EAGKKVLKQYLTEFMSTRLGVRRESNILSTS
Sbjct: 150 YVLDENLSVEMSSPSSSRYKTVEDLGPPQEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 209

Query: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240
           SR+ADDG+ YYQVEVNIKSYAN NELAVMPQDRV RLEW+RRYLSVLGVEN+RLYELRLQ
Sbjct: 210 SRVADDGKLYYQVEVNIKSYANTNELAVMPQDRVPRLEWNRRYLSVLGVENNRLYELRLQ 269

Query: 241 TPENVFVEEENELRQVMDSFRVNKVNA 268
           TPENVFVEEEN+LRQVMDSFRVNKV +
Sbjct: 270 TPENVFVEEENDLRQVMDSFRVNKVTS 285

BLAST of ClCG01G010870 vs. TrEMBL
Match: A0A0A0LTD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G181310 PE=4 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 3.6e-102
Identity = 195/210 (92.86%), Positives = 197/210 (93.81%), Query Frame = 1

Query: 58  MALIFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFF 117
           MAL+ STCIFSNSALA          SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFF
Sbjct: 1   MALMLSTCIFSNSALAVS--------SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFF 60

Query: 118 RDPFVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNIL 177
           RDPFVLDENLSVEFSSPSSS Y SVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNIL
Sbjct: 61  RDPFVLDENLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNIL 120

Query: 178 STSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYEL 237
           STSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYEL
Sbjct: 121 STSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYEL 180

Query: 238 RLQTPENVFVEEENELRQVMDSFRVNKVNA 268
           RLQTPENVFVEEEN+LRQVMDSFRVNKVNA
Sbjct: 181 RLQTPENVFVEEENDLRQVMDSFRVNKVNA 202

BLAST of ClCG01G010870 vs. TrEMBL
Match: A0A0D2W809_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G208300 PE=4 SV=1)

HSP 1 Score: 372.9 bits (956), Expect = 3.4e-100
Identity = 199/265 (75.09%), Positives = 219/265 (82.64%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAVPRRNAMAL 60
           MA+ILDS LP    LS   R  +P    +P +P  +S  C   +QS +A ++PRRNAMAL
Sbjct: 1   MAIILDSLLP----LS---RPTLPARLSTP-FPPPASRLCTRRNQSFQAFSIPRRNAMAL 60

Query: 61  IFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120
           I ST IFS   L +      PS  VG  EYIDTFDGYS KYP+NWIQVRGAGADIFFRDP
Sbjct: 61  ILSTYIFSEVGLHDNIAFAEPS--VGFREYIDTFDGYSLKYPQNWIQVRGAGADIFFRDP 120

Query: 121 FVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180
           +VLDENLSVE SSPSSS YK+V+DLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS
Sbjct: 121 YVLDENLSVELSSPSSSRYKTVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180

Query: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240
           SR+ADDG+ YYQVEVNIKSYAN NELAVMPQDRV RLEW+RRYLSVLGVEN+RLYELRLQ
Sbjct: 181 SRVADDGKLYYQVEVNIKSYANTNELAVMPQDRVPRLEWNRRYLSVLGVENNRLYELRLQ 240

Query: 241 TPENVFVEEENELRQVMDSFRVNKV 266
           TPE+VFVEEEN+LRQVMDSFRVNKV
Sbjct: 241 TPESVFVEEENDLRQVMDSFRVNKV 255

BLAST of ClCG01G010870 vs. TrEMBL
Match: E5LBM4_GOSHI (PsbP domain protein 1 OS=Gossypium hirsutum GN=PPD1 PE=4 SV=1)

HSP 1 Score: 372.5 bits (955), Expect = 4.4e-100
Identity = 199/265 (75.09%), Positives = 219/265 (82.64%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAVPRRNAMAL 60
           MA+ILDS LP    LS   R  +P    +P +P  +S  C   +QS +A ++PRRNAMAL
Sbjct: 1   MAIILDSLLP----LS---RPTLPARLSTP-FPPSASCLCTRRNQSFQASSIPRRNAMAL 60

Query: 61  IFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120
           I ST IFS   L +      PS  VG  EYIDTFDGYS KYP+NWIQVRGAGADIFFRDP
Sbjct: 61  ILSTYIFSEVGLHDNIAFAEPS--VGFREYIDTFDGYSLKYPQNWIQVRGAGADIFFRDP 120

Query: 121 FVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180
           +VLDENLSVE SSPSSS YK+V+DLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS
Sbjct: 121 YVLDENLSVELSSPSSSRYKTVEDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180

Query: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240
           SR+ADDG+ YYQVEVNIKSYAN NELAVMPQDRV RLEW+RRYLSVLGVEN+RLYELRLQ
Sbjct: 181 SRVADDGKLYYQVEVNIKSYANTNELAVMPQDRVPRLEWNRRYLSVLGVENNRLYELRLQ 240

Query: 241 TPENVFVEEENELRQVMDSFRVNKV 266
           TPE+VFVEEEN+LRQVMDSFRVNKV
Sbjct: 241 TPESVFVEEENDLRQVMDSFRVNKV 255

BLAST of ClCG01G010870 vs. TAIR10
Match: AT4G15510.1 (AT4G15510.1 Photosystem II reaction center PsbP family protein)

HSP 1 Score: 315.1 bits (806), Expect = 4.2e-86
Identity = 160/226 (70.80%), Positives = 185/226 (81.86%), Query Frame = 1

Query: 40  CRSESQSIKAVAVPRRNAMALIFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSF 99
           C ++++ + AV   +   M L+ S  I S + L    P+   S  V   EYIDTFDGYSF
Sbjct: 67  CLTDAKQVCAVGRRKSMMMGLLMSGLIVSQANL----PTAFASTPV-FREYIDTFDGYSF 126

Query: 100 KYPKNWIQVRGAGADIFFRDPFVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQY 159
           KYP+NWIQVRGAGADIFFRDP VLDENLSVEFSSPSSS+Y S++DLG PEE GK+VL+QY
Sbjct: 127 KYPQNWIQVRGAGADIFFRDPVVLDENLSVEFSSPSSSNYTSLEDLGSPEEVGKRVLRQY 186

Query: 160 LTEFMSTRLGVRRESNILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEW 219
           LTEFMSTRLGV+R++NILSTSSR+ADDG+ YYQVEVNIKSYANNNELAVMPQDRV RLEW
Sbjct: 187 LTEFMSTRLGVKRQANILSTSSRVADDGKLYYQVEVNIKSYANNNELAVMPQDRVARLEW 246

Query: 220 DRRYLSVLGVENSRLYELRLQTPENVFVEEENELRQVMDSFRVNKV 266
           +RRYL+VLGVEN RLY +RLQTPE VF+EEE +LR+VMDSFRV K+
Sbjct: 247 NRRYLAVLGVENDRLYSIRLQTPEKVFLEEEKDLRRVMDSFRVEKI 287

BLAST of ClCG01G010870 vs. NCBI nr
Match: gi|449443516|ref|XP_004139523.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 460.3 bits (1183), Expect = 2.3e-126
Identity = 241/267 (90.26%), Positives = 245/267 (91.76%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAVPRRNAMAL 60
           MAVILDSFLP I TLSSS RQRIP TS S RWPM+S PKC SESQSIK VAVPRR+AMAL
Sbjct: 1   MAVILDSFLPSIQTLSSSFRQRIPSTS-STRWPMNSFPKCCSESQSIKGVAVPRRSAMAL 60

Query: 61  IFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120
           + STCIFSNSALA          SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP
Sbjct: 61  MLSTCIFSNSALAVS--------SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120

Query: 121 FVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180
           FVLDENLSVEFSSPSSS Y SVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS
Sbjct: 121 FVLDENLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180

Query: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240
           SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ
Sbjct: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240

Query: 241 TPENVFVEEENELRQVMDSFRVNKVNA 268
           TPENVFVEEEN+LRQVMDSFRVNKVNA
Sbjct: 241 TPENVFVEEENDLRQVMDSFRVNKVNA 258

BLAST of ClCG01G010870 vs. NCBI nr
Match: gi|659128677|ref|XP_008464319.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 451.4 bits (1160), Expect = 1.1e-123
Identity = 239/267 (89.51%), Positives = 242/267 (90.64%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAVPRRNAMAL 60
           MAVIL SFLP I TL+S IRQRIP TS S R P+ S PKC S+SQSIK VAVPRRNAMAL
Sbjct: 1   MAVILHSFLPSIQTLTSPIRQRIPSTS-STRSPIISFPKCCSQSQSIKDVAVPRRNAMAL 60

Query: 61  IFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120
           I STCIFSNSA A P        SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP
Sbjct: 61  ILSTCIFSNSAFAVP--------SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120

Query: 121 FVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180
           FVLDENLSVEFSSPSSS Y SVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS
Sbjct: 121 FVLDENLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180

Query: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240
           SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ
Sbjct: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240

Query: 241 TPENVFVEEENELRQVMDSFRVNKVNA 268
           TPENVFVEEENELRQVMDSFRVNKVNA
Sbjct: 241 TPENVFVEEENELRQVMDSFRVNKVNA 258

BLAST of ClCG01G010870 vs. NCBI nr
Match: gi|255542948|ref|XP_002512537.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Ricinus communis])

HSP 1 Score: 381.7 bits (979), Expect = 1.0e-102
Identity = 206/278 (74.10%), Positives = 228/278 (82.01%), Query Frame = 1

Query: 1   MAVILDSFLPPIHT--------LSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAV 60
           MA ILDSFLPP+H         LSS     +PI++ S R    +S  C++  Q  KA AV
Sbjct: 1   MATILDSFLPPVHLTSPTRPAFLSSWFSCSLPISADSTRC---TSISCKN--QPTKAFAV 60

Query: 61  PRRNAMALIFSTCI-----FSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQ 120
           PRR+ MALIFS+CI     F +SALA+ S        VG  EYIDTFDGYSFKYPKNWIQ
Sbjct: 61  PRRSTMALIFSSCILSEVGFHSSALAQSS--------VGFREYIDTFDGYSFKYPKNWIQ 120

Query: 121 VRGAGADIFFRDPFVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTR 180
           VRGAGADIFFRDP+VLDENLSVE SSPSSS Y SV+DLGPP+EAGKKVLKQYLTEFMSTR
Sbjct: 121 VRGAGADIFFRDPYVLDENLSVEMSSPSSSKYTSVEDLGPPQEAGKKVLKQYLTEFMSTR 180

Query: 181 LGVRRESNILSTSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVL 240
           LGVRRES+ILSTSSR+ADDG+ YYQVEVNIKSYANNNE+AVMPQDRVVRLEW+RRYLSVL
Sbjct: 181 LGVRRESDILSTSSRVADDGKLYYQVEVNIKSYANNNEMAVMPQDRVVRLEWNRRYLSVL 240

Query: 241 GVENSRLYELRLQTPENVFVEEENELRQVMDSFRVNKV 266
           GVEN+RLYELRLQTPENVFVEEEN+LRQVM+SFRVNKV
Sbjct: 241 GVENNRLYELRLQTPENVFVEEENDLRQVMESFRVNKV 265

BLAST of ClCG01G010870 vs. NCBI nr
Match: gi|590644876|ref|XP_007031203.1| (Photosystem II reaction center PsbP family protein isoform 1 [Theobroma cacao])

HSP 1 Score: 380.6 bits (976), Expect = 2.3e-102
Identity = 201/267 (75.28%), Positives = 222/267 (83.15%), Query Frame = 1

Query: 1   MAVILDSFLPPIHTLSSSIRQRIPITSPSPRWPMDSSPKCRSESQSIKAVAVPRRNAMAL 60
           MA ILDS LPP        R  +P    +P +P  SS     ++Q  KA A+PRRNAMAL
Sbjct: 30  MATILDSLLPPS-------RPTLPTRLSTP-FPSSSSCISTRKTQKTKAFALPRRNAMAL 89

Query: 61  IFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFFRDP 120
           I S+CIFS   L + + +    PSVGL EYIDTFDGYSFKYP+NWIQVRGAGADIFFRDP
Sbjct: 90  ILSSCIFSEVGLHDFAFA---QPSVGLREYIDTFDGYSFKYPQNWIQVRGAGADIFFRDP 149

Query: 121 FVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 180
           +VLDENLSVE SSPSSS YK+V+DLGPP+EAGKKVLKQYLTEFMSTRLGVRRESNILSTS
Sbjct: 150 YVLDENLSVEMSSPSSSRYKTVEDLGPPQEAGKKVLKQYLTEFMSTRLGVRRESNILSTS 209

Query: 181 SRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYELRLQ 240
           SR+ADDG+ YYQVEVNIKSYAN NELAVMPQDRV RLEW+RRYLSVLGVEN+RLYELRLQ
Sbjct: 210 SRVADDGKLYYQVEVNIKSYANTNELAVMPQDRVPRLEWNRRYLSVLGVENNRLYELRLQ 269

Query: 241 TPENVFVEEENELRQVMDSFRVNKVNA 268
           TPENVFVEEEN+LRQVMDSFRVNKV +
Sbjct: 270 TPENVFVEEENDLRQVMDSFRVNKVTS 285

BLAST of ClCG01G010870 vs. NCBI nr
Match: gi|778659702|ref|XP_011654885.1| (PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 379.4 bits (973), Expect = 5.1e-102
Identity = 195/210 (92.86%), Positives = 197/210 (93.81%), Query Frame = 1

Query: 58  MALIFSTCIFSNSALAEPSPSPSPSPSVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFF 117
           MAL+ STCIFSNSALA          SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFF
Sbjct: 1   MALMLSTCIFSNSALAVS--------SVGLLEYIDTFDGYSFKYPKNWIQVRGAGADIFF 60

Query: 118 RDPFVLDENLSVEFSSPSSSSYKSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNIL 177
           RDPFVLDENLSVEFSSPSSS Y SVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNIL
Sbjct: 61  RDPFVLDENLSVEFSSPSSSRYNSVQDLGPPEEAGKKVLKQYLTEFMSTRLGVRRESNIL 120

Query: 178 STSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYEL 237
           STSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYEL
Sbjct: 121 STSSRMADDGRTYYQVEVNIKSYANNNELAVMPQDRVVRLEWDRRYLSVLGVENSRLYEL 180

Query: 238 RLQTPENVFVEEENELRQVMDSFRVNKVNA 268
           RLQTPENVFVEEEN+LRQVMDSFRVNKVNA
Sbjct: 181 RLQTPENVFVEEENDLRQVMDSFRVNKVNA 202

The following BLAST results are available for this feature:

Swiss-Prot
Match Name                         E-value    Identity  Description
PPD1_ARATH                         7.5e-85    70.80     PsbP domain-containing protein 1, chloroplastic OS=Arabidopsis thaliana GN=PPD1 ...

TrEMBL
Match Name                         E-value    Identity  Description
B9RFL8_RICCO                       7.2e-103   74.10     Thylakoid lumenal 21.5 kDa protein, chloroplast, putative OS=Ricinus communis GN...
A0A061F332_THECC                   1.6e-102   75.28     Photosystem II reaction center PsbP family protein isoform 1 OS=Theobroma cacao ...
A0A0A0LTD2_CUCSA                   3.6e-102   92.86     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G181310 PE=4 SV=1
A0A0D2W809_GOSRA                   3.4e-100   75.09     Uncharacterized protein OS=Gossypium raimondii GN=B456_013G208300 PE=4 SV=1
E5LBM4_GOSHI                       4.4e-100   75.09     PsbP domain protein 1 OS=Gossypium hirsutum GN=PPD1 PE=4 SV=1

TAIR10
Match Name                         E-value    Identity  Description
AT4G15510.1                        4.2e-86    70.80     Photosystem II reaction center PsbP family protein

NCBI nr
Match Name                         E-value    Identity  Description
gi|449443516|ref|XP_004139523.1|   2.3e-126   90.26     PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis s...
gi|659128677|ref|XP_008464319.1|   1.1e-123   89.51     PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Cucumis m...
gi|255542948|ref|XP_002512537.1|   1.0e-102   74.10     PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X1 [Ricinus c...
gi|590644876|ref|XP_007031203.1|   2.3e-102   75.28     Photosystem II reaction center PsbP family protein isoform 1 [Theobroma cacao]
gi|778659702|ref|XP_011654885.1|   5.1e-102   92.86     PREDICTED: psbP domain-containing protein 1, chloroplastic isoform X2 [Cucumis s...

The following terms have been associated with this gene:

Vocabulary: INTERPRO
Term        Definition
IPR002683   PsbP
IPR016123   Mog1/PsbP_a/b/a-sand

Vocabulary: Molecular Function
Term        Definition
GO:0005509  calcium ion binding

Vocabulary: Cellular Component
Term        Definition
GO:0009523  photosystem II
GO:0009654  photosystem II oxygen evolving complex
GO:0019898  extrinsic component of membrane

Vocabulary: Biological Process
Term        Definition
GO:0015979  photosynthesis

GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015979 photosynthesis
biological_process GO:0048564 photosystem I assembly
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0009543 chloroplast thylakoid lumen
cellular_component GO:0009535 chloroplast thylakoid membrane
cellular_component GO:0019898 extrinsic component of membrane
cellular_component GO:0009654 photosystem II oxygen evolving complex
cellular_component GO:0031977 thylakoid lumen
cellular_component GO:0009523 photosystem II
cellular_component GO:0044434 chloroplast part
molecular_function GO:0005509 calcium ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG01G010870.1  ClCG01G010870.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description                       Source   Source Term         Source Description   Alignment
IPR002683  PsbP family                           PFAM     PF01789             PsbP                 coord: 82..262, score: 8.0
IPR016123  Mog1/PsbP, alpha/beta/alpha sandwich  GENE3D   G3DSA:3.40.1000.10  -                    coord: 86..262, score: 3.6
IPR016123  Mog1/PsbP, alpha/beta/alpha sandwich  unknown  SSF55724            Mog1p/PsbP-like      coord: 90..262, score: 6.28
None       No IPR available                      PANTHER  PTHR31407           FAMILY NOT NAMED     coord: 4..267, score: 1.1E
None       No IPR available                      PANTHER  PTHR31407:SF15      SUBFAMILY NOT NAMED  coord: 4..267, score: 1.1E