Clc09G19590 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc09G19590
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic
Location: ClcChr09: 33077813 .. 33082016 (-)
RNA-Seq Expression: Clc09G19590
Synteny: Clc09G19590
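
The Location value can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
import re

# Pattern for a location string such as "ClcChr09: 33077813 .. 33082016 (-)".
LOCATION_RE = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)


def parse_location(text):
    """Split a location string into sequence ID, start, end, strand and span length."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


if __name__ == "__main__":
    print(parse_location("ClcChr09: 33077813 .. 33082016 (-)"))
```

Under that assumption the gene spans 4204 bp on the minus strand of ClcChr09.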
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTAAGAAGAAAGCCCTAATTGAGAGAAAACCTGCAACGAGGAGAGCTGGGTCTCCAAGGACTCGAGGGCATCGCAGATATTCAAAAGTCGTTCCACTTCTTCATCGTCCTCCGTATCGCCGACCAGAGTTTGGGAGCCAGAACCGCGGTCGTTGGCGGCCCCATTTCCCGTTTCGGCAGATGGGTCTTGCCCGAGTGAGGAGGTGATTGTTGCTTTGTGAATGCAGTTGCTGATCTTCAATCGGAGCTCCGTGGCTCTGCCCAGAACCATGTCAATGCCGTCTTCTTCCATGGACGCTGCTTGCTCCGGCTCCGGCTCCACCTAGGGTTTTGAGGGAAAGGAAGAAAATGGAAGAATAGGGAAGTGGGGTTTTGAAAAGCACTAGACTATGAATCAGAAGCCGCCAAGATTAGAAGATTACAAGTGATTCTTCTTAGTAGAAGTAGTTTCTGGTGGCCAAATTTAGATTAAAAGCGATTGTGCTCTCTCTTTGATTGGCTATTGCAAGTTGGACTCACCCTTTGAGTTTAAACTCATTTTAGCTATACAATAAATGGAATCAATGAATGGTAATAATAATATGGGGAAGCATCCACCATTTAAAAATGTTTCCAGAAAATTAGTTCCACTTACGGTCCATCGATTACTCAATATCGAAATCACATACATAACCTATGATTTTCCGTTGCTTTTTCGCCAACCGTCTAGACCAAAGTAGCTATTGAAATAAATTTCTAAGAATATTTCCAATTTATTTATTTACTTTGTCTAGCTTGAATTTTGACCCAACATTTTCAATTTTTTACATTTCCAACCCAAAACAACCAAATCGACCAATTTAATTTCAAACACGAGTCCACAACCGAATTCGGTCGGTTGGGAGATTTATCCTCATTATATGTCGTCTAAGCTAAATAAAATTATGAAACTCACACCGTAATAAATCCTTCTAAGTAAATTAGTTGGTGCATACTTATATTATATTTTGAATTTTGAGACCATGAAATTGGAGTAATCAAAATGTATGGAAACAGAAGTTTTAGAAGCTTTACGTAAAATATTAAAGTTGATAGAATGAATATAAACAAAATAATGTTGTAAAAATTGATAATACACATGCAACCCACCCACGTAGTAATTAATAAACTAGTTATATATAAATGAAATTATAGAGGAGATACTCATTTGTCCCAACATTTTTCCTTTAATATTTCGGACATGCCTCTTCGAGTTTAAGTGTCCTTTAAAATATGTTCGAGGTGTAATTTTTGTGCCCACAATATGACTTTTGGGTGCATCTATTACACATGGAATACGTGCATCTATTACATTCATTCATGTACAGATGTGAAAAGAGTTTATTTGTACTATGTTGATGTTTTTCAATGTTTGGGACATGACTCATCGAGTTTAGACATATGTTGGGAGATGTCACGAAATTATAGGTGCAATTTATGCTGACGGGAAACGACTATTTGGTTGTCTAGTTTACACATCAAATTGGGTGCATTTATTCATGTATGCATTTGACCTAGGTATAAAAATACTGCTTTTTGAGTTTAGGTGTTCATTGAGAGATGGTGGAACAATGGAGACGATTCTTGAAATATTCACTTTACAATATTGTTGATACCAAATTCTGAAAGGGGGAAAACACACCGAAACTTCAGATGATAACAACGATAAATTTATAATTCAGGGATGACATGAGCTGCAAAATAAGAAGAATCTGGGCACTAGTGTGGTACTTGCCACAACAACTCTAATGCCTAAGTCAGGAATAGGCTTTTGGTTGTAACAAGAGTTTTTAAGGATTGGAGTAAATTTTGCTTCTTCTAAAGTCATAATAATGTATTTATAGTGAATTTCTGACCAAGAGCATCCATATTAGAGTTTTTTAGTCATATTAGGTTAGTTCATCAAATCCTTGGCCGAGGTCAAGGCTGAGGAGGCCATTTGGGCTTTTAAACCAAACTATTTTTCTTCCTAAGGCCCA
AATAAGTCACAATATGAGGGTTGACCTCTAGCTCATCCTTTGTCAAGGTCAACGTCGAGCAGGTCAGTTTGGGCTCTCTTCCCATGTTGAATTGGGTCCTAGGGTGTGGTTTTTTTGCTCGGGTTTTGCAATTTTCTTGATTATATAGAATGCAAACTTCATAAAGTTGATTGTGCCCTCTTGTCTACATATACCCATCACAAAATGAATATGTATATATATATATATATATATGAAAGTTAAACAATCATCAATGACCTAGTGGTAGTGGGAATATAAAAAAAAAAAAAAAACCAAAGGGCCCGATTCGTGGTGGCCATCTACTTAGGATTTAATATCCTATGGATTTCCTTGACCCCCAAATATTGTAGGAATCAAATGGGTTATCTCTTAAAATTAGTCGAAGTGCACGTAGTACACTACTATTTGGGTATTGGTAGTATATTACACATTGAATTGGTGCATCTCTTAACAATTTTATTTTATATATACAAACTTTAATGGATAATAATTTAGTCTCTATCTTTTACAATTTGTAACATTCTAATCTTTATAAACAATGAGGATAAGGCAATTAATAAAATAGTGGTGCAGATTGAGTTAAGTGTGGATAATGTTGAGAGACAGAGAATGGGATGAATTGATGAAAATAGGAACATCAAACCCTGAAAAGAATAATTGGAAAAGAGGGATGGGTAGCTCTGCCTCTAACAAAATCTTCCCATTTCAAGTCCTCAAATGGCCACCATATTATAATACAACTATAACTAAACCATCCATCATTTCTTTTGCTGCACCCACAGCCTCTTTGCTGCCCAAAAGTTGCCGCAACTGTGGAGGTAAAGGAGCCATTGATTGTCCTGGATGTAAGGTTAATTATCATCCTCCCTCCACTTCTTACTTCATTAATTCATATGTTGTTTATTCTAAATTAAAATTCATGATCATAAAAGGAAAACAAATGTGAATTGGTGATTCAATTTGGTGGGTTACTTATGAGATCAATCACATTGCTATTCCTTTCCTTTTTAAACGTATTCATCATTTTACCCCCTAAAACCATAACTTCATATTGTTTTTAAAGAATTTACTAAGGGAGCGTTTGGGGCATGCAACGAGTTGAGTTGAGTTGAGTTGAAGTTTAGAAAGTTGTGAGCTCTTATAGAGAGTGTTTGGCCCAAGGAATTGGAGTTGTGAAGTCCACTCTATGGAGTTCATGAGCCCCATGACTAATAAATATCAATTTTAGGCTTCATTAAAAGGGGCAATTGCAAATATAGTAGTAAGGCTTAAGATATTATCAGATATAGCACAATGCAAAAAAAGGAATTGAAAATATAGCAAAATTTAGATCCCACTCTTAGACTCTATTAGACTATCTCACTAATAGAAGTATATCATCGATAGAGTCTATCACTCCTACATTTTGCTATATTTGCAATTCTTTTAAAATATTGCTGTACAATTATTTATTAGCCTAAAAGTACTACCCATTACAATTACCTTTAGTAAAAATCTTACACTGTGGGCCTCATGATTTCATGACTCTTTCACAACTCCACTGTTTGCTCCACACACCCTCGTAAGGAGTTAATAAAATATAAAATTGATATCTTTAATTGCAAGACAAAGCCCATAAACTCTTTGGGTCAAACACTCCTTAATTATTTTAGGAAACTAAAAAGAGATTCTTTAACCATTTTCGAATACAAGATACCTATATATATATTCTATACTAATCATATTATGTGATTGTTTTTTAGGGCACGGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAGTAAGTACTCTCACTCGATATATTTTTTACTTGTTTTTTTTTTCTCTCTCTCTCTCCCTTTTAACATAAACTCAAAGGTAGCTTAATTGAATTCATCCATTCATTCTCTTAAAATTGTTTGTTTTAGATGTTTTGATTGTCAAGGATTTGGATTAAAGAGTTGTCCGGAATGTGGAAAAGGAGGACTCAC
CCCCGAACAAAGGGGAGAAAGATAATGCATATTTTCCCCAATGCAATATATATTTTGTATTTATATGTCCTGATTTTTCTAATTTAAGGTTTTATTATGTTTTTTTTTTCTTTCTTTTTAGAATTTTAACAAACATTATTAAGAATGGTGAAATTAAACGTCTTATTCAAATTTTTATGCGTGCTAAAATAAATTATTCGAATT

mRNA sequence

ATGTGTAAGAAGAAAGCCCTAATTGAGAGAAAACCTGCAACGAGGAGAGCTGGGTCTCCAAGGACTCGAGGGCATCGCAGATATTCAAAAGTCGTTCCACTTCTTCATCGTCCTCCGTATCGCCGACCAGAGTTTGGGAGCCAGAACCGCGGTCGTTGGCGGCCCCATTTCCCGTTTCGGCAGATGGGTCTTGCCCGAGTGAGGAGAGAATGGGATGAATTGATGAAAATAGGAACATCAAACCCTGAAAAGAATAATTGGAAAAGAGGGATGGGTAGCTCTGCCTCTAACAAAATCTTCCCATTTCAAGTCCTCAAATGGCCACCATATTATAATACAACTATAACTAAACCATCCATCATTTCTTTTGCTGCACCCACAGCCTCTTTGCTGCCCAAAAGTTGCCGCAACTGTGGAGGTAAAGGAGCCATTGATTGTCCTGGATGTAAGGGCACGGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTTGATTGTCAAGGATTTGGATTAAAGAGTTGTCCGGAATGTGGAAAAGGAGGACTCACCCCCGAACAAAGGGGAGAAAGATAATGCATATTTTCCCCAATGCAATATATATTTTGTATTTATATGTCCTGATTTTTCTAATTTAAGGTTTTATTATGTTTTTTTTTTCTTTCTTTTTAGAATTTTAACAAACATTATTAAGAATGGTGAAATTAAACGTCTTATTCAAATTTTTATGCGTGCTAAAATAAATTATTCGAATT

Coding sequence (CDS)

ATGTGTAAGAAGAAAGCCCTAATTGAGAGAAAACCTGCAACGAGGAGAGCTGGGTCTCCAAGGACTCGAGGGCATCGCAGATATTCAAAAGTCGTTCCACTTCTTCATCGTCCTCCGTATCGCCGACCAGAGTTTGGGAGCCAGAACCGCGGTCGTTGGCGGCCCCATTTCCCGTTTCGGCAGATGGGTCTTGCCCGAGTGAGGAGAGAATGGGATGAATTGATGAAAATAGGAACATCAAACCCTGAAAAGAATAATTGGAAAAGAGGGATGGGTAGCTCTGCCTCTAACAAAATCTTCCCATTTCAAGTCCTCAAATGGCCACCATATTATAATACAACTATAACTAAACCATCCATCATTTCTTTTGCTGCACCCACAGCCTCTTTGCTGCCCAAAAGTTGCCGCAACTGTGGAGGTAAAGGAGCCATTGATTGTCCTGGATGTAAGGGCACGGGAAAGAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTTGATTGTCAAGGATTTGGATTAAAGAGTTGTCCGGAATGTGGAAAAGGAGGACTCACCCCCGAACAAAGGGGAGAAAGATAA

Protein sequence

MCKKKALIERKPATRRAGSPRTRGHRRYSKVVPLLHRPPYRRPEFGSQNRGRWRPHFPFRQMGLARVRREWDELMKIGTSNPEKNNWKRGMGSSASNKIFPFQVLKWPPYYNTTITKPSIISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER
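
The protein sequence corresponds to a translation of the CDS above. As a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1) applies, the snippet below translates a CDS codon by codon and stops at the first stop codon. The function name `translate` is illustrative only; only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of the CDS listed above.
    print(translate("ATGTGTAAGAAGAAAGCC"))  # MCKKKA
    # Last four codons of the CDS, including the TAA stop.
    print(translate("GGAGAAAGATAA"))        # GER
```

Both fragments agree with the start (MCKKKA) and end (GER) of the protein sequence shown above.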
Homology
BLAST of Clc09G19590 vs. NCBI nr
Match: XP_038901776.1 (protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X1 [Benincasa hispida])

HSP 1 Score: 166.4 bits (420), Expect = 2.5e-37
Identity = 83/110 (75.45%), Positives = 87/110 (79.09%), Query Frame = 0

Query: 91  MGSSASNKIFPFQ----VLKWPPYYNTTITKPSIISFAAPTA--SLLPKSCRNCGGKGAI 150
           M  ++SN IFPF     V   PPY  T+ TK S IS AA  A  SLLPK C+ CGGKGAI
Sbjct: 9   MAMASSNNIFPFSPYRLVRSSPPY--TSKTKSSNISLAAAAAARSLLPKRCQKCGGKGAI 68

Query: 151 DCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           DCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER
Sbjct: 69  DCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 116
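
The percentages in each HSP summary line are the match counts divided by the alignment length, gaps included. A minimal sketch of that arithmetic, using a hypothetical helper `hsp_percentages` that pulls the two `matches/length` fractions out of a summary line:

```python
import re


def hsp_percentages(summary_line):
    """Return the percentage for every 'matches/length' fraction in an HSP summary line."""
    fractions = re.findall(r"(\d+)/(\d+)", summary_line)
    return [round(100.0 * int(matches) / int(length), 2) for matches, length in fractions]


if __name__ == "__main__":
    # Identity and positive counts taken from the first NCBI nr hit above.
    print(hsp_percentages("Identity = 83/110, Positives = 87/110"))  # [75.45, 79.09]
```

The same calculation reproduces the values reported for the other hits, for example 55/62 = 88.71% identity for AT1G22630.1.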

BLAST of Clc09G19590 vs. NCBI nr
Match: XP_023521952.1 (protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023553311.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023553319.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 156.0 bits (393), Expect = 3.3e-34
Identity = 74/114 (64.91%), Positives = 84/114 (73.68%), Query Frame = 0

Query: 87  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGG 146
           W     S ++N  FPF    +PP+       N + +KPS I FAA     LPK C+ CGG
Sbjct: 3   WCGSSVSKSNNSTFPF---PFPPHGFAILPQNKSRSKPSNIPFAAAPKFSLPKRCQTCGG 62

Query: 147 KGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           KGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQRGER
Sbjct: 63  KGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQRGER 113

BLAST of Clc09G19590 vs. NCBI nr
Match: XP_022152486.1 (uncharacterized protein LOC111020203 [Momordica charantia])

HSP 1 Score: 155.2 bits (391), Expect = 5.7e-34
Identity = 74/108 (68.52%), Positives = 78/108 (72.22%), Query Frame = 0

Query: 93  SSASNKIFPFQVLKWPPYYNTTITKP------SIISFAAPTASLLPKSCRNCGGKGAIDC 152
           S   N IFPF     PP +   I+ P      S ISF A     LPK C+ CGGKGAIDC
Sbjct: 13  SHTINTIFPFPFPPPPPPHRLLISPPNKFKRRSNISFGAAATFSLPKRCQKCGGKGAIDC 72

Query: 153 PGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           PGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCPECG GGLTPEQRGER
Sbjct: 73  PGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPECGNGGLTPEQRGER 120

BLAST of Clc09G19590 vs. NCBI nr
Match: KAG7033428.1 (hypothetical protein SDJN02_07484 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 155.2 bits (391), Expect = 5.7e-34
Identity = 74/114 (64.91%), Positives = 83/114 (72.81%), Query Frame = 0

Query: 87  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGG 146
           W     S ++N  FPF    +PP+       N   +KPS I FAA     LPK C+ CGG
Sbjct: 3   WCGSSVSKSNNSTFPF---PFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGG 62

Query: 147 KGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           KGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQRGER
Sbjct: 63  KGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQRGER 113

BLAST of Clc09G19590 vs. NCBI nr
Match: XP_008459263.1 (PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic [Cucumis melo])

HSP 1 Score: 154.8 bits (390), Expect = 7.4e-34
Identity = 76/107 (71.03%), Positives = 86/107 (80.37%), Query Frame = 0

Query: 93  SSASNKIFPFQVLKWPPYY---NTTITKPSI-ISFAA-PTASLLPKSCRNCGGKGAIDCP 152
           +S S  IFPF     P Y    + T + P+I + FAA PT++LLPK C+ CGGKGAIDCP
Sbjct: 6   TSTSINIFPF-----PSYQIVRSKTKSSPNITVPFAALPTSNLLPKRCQKCGGKGAIDCP 65

Query: 153 GCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           GCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQRGER
Sbjct: 66  GCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER 107

BLAST of Clc09G19590 vs. ExPASy Swiss-Prot
Match: O64750 (Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PSA2 PE=1 SV=1)

HSP 1 Score: 49.3 bits (116), Expect = 5.7e-05
Identity = 27/64 (42.19%), Positives = 35/64 (54.69%), Query Frame = 0

Query: 134 SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTP 193
           SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P
Sbjct: 101 SCRNCQGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGL-P 160

BLAST of Clc09G19590 vs. ExPASy TrEMBL
Match: A0A6J1DHW0 (uncharacterized protein LOC111020203 OS=Momordica charantia OX=3673 GN=LOC111020203 PE=4 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 2.7e-34
Identity = 74/108 (68.52%), Positives = 78/108 (72.22%), Query Frame = 0

Query: 93  SSASNKIFPFQVLKWPPYYNTTITKP------SIISFAAPTASLLPKSCRNCGGKGAIDC 152
           S   N IFPF     PP +   I+ P      S ISF A     LPK C+ CGGKGAIDC
Sbjct: 13  SHTINTIFPFPFPPPPPPHRLLISPPNKFKRRSNISFGAAATFSLPKRCQKCGGKGAIDC 72

Query: 153 PGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           PGCKGTGKNKKNGNIFERWKCF+CQGFGLKSCPECG GGLTPEQRGER
Sbjct: 73  PGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPECGNGGLTPEQRGER 120

BLAST of Clc09G19590 vs. ExPASy TrEMBL
Match: A0A1S3C9W1 (protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103498437 PE=4 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 3.6e-34
Identity = 76/107 (71.03%), Positives = 86/107 (80.37%), Query Frame = 0

Query: 93  SSASNKIFPFQVLKWPPYY---NTTITKPSI-ISFAA-PTASLLPKSCRNCGGKGAIDCP 152
           +S S  IFPF     P Y    + T + P+I + FAA PT++LLPK C+ CGGKGAIDCP
Sbjct: 6   TSTSINIFPF-----PSYQIVRSKTKSSPNITVPFAALPTSNLLPKRCQKCGGKGAIDCP 65

Query: 153 GCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           GCKGTGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQRGER
Sbjct: 66  GCKGTGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER 107

BLAST of Clc09G19590 vs. ExPASy TrEMBL
Match: A0A0A0LIM5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G238860 PE=4 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 6.1e-34
Identity = 72/103 (69.90%), Positives = 81/103 (78.64%), Query Frame = 0

Query: 93  SSASNKIFPFQVLKWPPYYNTTITKPSIISFAA-PTASLLPKSCRNCGGKGAIDCPGCKG 152
           +S S  IFPF   +       +     I+ FAA PT++LLPK C+ CGGKGAIDCPGCKG
Sbjct: 4   TSTSINIFPFPSYQIFRSKRKSSPNDIIVPFAALPTSNLLPKRCQKCGGKGAIDCPGCKG 63

Query: 153 TGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           TGKNKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQRGER
Sbjct: 64  TGKNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER 106

BLAST of Clc09G19590 vs. ExPASy TrEMBL
Match: A0A6J1JSI3 (protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111487154 PE=4 SV=1)

HSP 1 Score: 152.9 bits (385), Expect = 1.4e-33
Identity = 73/114 (64.04%), Positives = 83/114 (72.81%), Query Frame = 0

Query: 87  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGG 146
           W     S ++N  FPF    +PP+       N + +KPS I FAA     LPK C+ CGG
Sbjct: 3   WCGSSVSKSNNSTFPF---PFPPHGFAILPQNKSRSKPSNIPFAAAPKFSLPKRCQTCGG 62

Query: 147 KGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           KGAIDC GCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQRGER
Sbjct: 63  KGAIDCTGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQRGER 113

BLAST of Clc09G19590 vs. ExPASy TrEMBL
Match: A0A6J1HLG5 (protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111464051 PE=4 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 2.3e-33
Identity = 73/114 (64.04%), Positives = 82/114 (71.93%), Query Frame = 0

Query: 87  WKRGMGSSASNKIFPFQVLKWPPY------YNTTITKPSIISFAAPTASLLPKSCRNCGG 146
           W     S ++N   PF    +PP+       N   +KPS I FAA     LPK C+ CGG
Sbjct: 3   WCGSSVSKSNNSTSPF---PFPPHGFAILPQNKCRSKPSNIPFAAAPKFSLPKRCQTCGG 62

Query: 147 KGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRGER 195
           KGAIDCPGCKGTG+NKKNGNIFERWKCF+CQGFGLKSCP+CGKGGLTPEQRGER
Sbjct: 63  KGAIDCPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPDCGKGGLTPEQRGER 113

BLAST of Clc09G19590 vs. TAIR 10
Match: AT1G22630.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; Has 87 Blast hits to 86 proteins in 34 species: Archae - 0; Bacteria - 13; Metazoa - 27; Fungi - 0; Plants - 40; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 134.0 bits (336), Expect = 1.3e-31
Identity = 55/62 (88.71%), Positives = 59/62 (95.16%), Query Frame = 0

Query: 133 KSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTPEQRG 192
           KSC  CG KGAI+CPGCKGTGKNKKNGN+FERWKCFDCQGFG+KSCP+CGKGGLTPEQRG
Sbjct: 49  KSCETCGAKGAIECPGCKGTGKNKKNGNMFERWKCFDCQGFGMKSCPKCGKGGLTPEQRG 108

Query: 193 ER 195
           ER
Sbjct: 109 ER 110

BLAST of Clc09G19590 vs. TAIR 10
Match: AT2G34860.1 (DnaJ/Hsp40 cysteine-rich domain superfamily protein )

HSP 1 Score: 49.3 bits (116), Expect = 4.1e-06
Identity = 27/64 (42.19%), Positives = 35/64 (54.69%), Query Frame = 0

Query: 134 SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTP 193
           SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P
Sbjct: 101 SCRNCQGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGL-P 160

BLAST of Clc09G19590 vs. TAIR 10
Match: AT2G34860.2 (DnaJ/Hsp40 cysteine-rich domain superfamily protein )

HSP 1 Score: 49.3 bits (116), Expect = 4.1e-06
Identity = 27/64 (42.19%), Positives = 35/64 (54.69%), Query Frame = 0

Query: 134 SCRNCGGKGAIDCPGCKGTGK-----NKKNGNIFERWKCFDCQGFGLKSCPECGKGGLTP 193
           SCRNC G GA+ C  C GTGK      K+  +++E  +C +C G G   CP C   GL P
Sbjct: 101 SCRNCQGSGAVLCDMCGGTGKWKALNRKRAKDVYEFTECPNCYGRGKLVCPVCLGTGL-P 160

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value   Identity   Description
XP_038901776.1    2.5e-37   75.45      protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X1 [Benincasa hispida]
XP_023521952.1    3.3e-34   64.91      protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like [Cucurbita pepo subsp. pepo...
XP_022152486.1    5.7e-34   68.52      uncharacterized protein LOC111020203 [Momordica charantia]
KAG7033428.1      5.7e-34   64.91      hypothetical protein SDJN02_07484 [Cucurbita argyrosperma subsp. argyrosperma]
XP_008459263.1    7.4e-34   71.03      PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic [Cucumis melo]

ExPASy Swiss-Prot
Match Name        E-value   Identity   Description
O64750            5.7e-05   42.19      Protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Arabidopsis thaliana OX=3702 ...

ExPASy TrEMBL
Match Name        E-value   Identity   Description
A0A6J1DHW0        2.7e-34   68.52      uncharacterized protein LOC111020203 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A1S3C9W1        3.6e-34   71.03      protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic OS=Cucumis melo OX=3656 G...
A0A0A0LIM5        6.1e-34   69.90      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G238860 PE=4 SV=1
A0A6J1JSI3        1.4e-33   64.04      protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HLG5        2.3e-33   64.04      protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic OS=Cucurbita moschata OX=3662 GN...

TAIR 10
Match Name        E-value   Identity   Description
AT1G22630.1       1.3e-31   88.71      unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXP...
AT2G34860.1       4.1e-06   42.19      DnaJ/Hsp40 cysteine-rich domain superfamily protein
AT2G34860.2       4.1e-06   42.19      DnaJ/Hsp40 cysteine-rich domain superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term    IPR Description                                            Source        Source Term      Source Description                                                 Alignment
None        No IPR available                                           MOBIDB_LITE   mobidb-lite      disorder_prediction                                                coord: 1..16
None        No IPR available                                           MOBIDB_LITE   mobidb-lite      disorder_prediction                                                coord: 1..25
None        No IPR available                                           PANTHER       PTHR15852        PLASTID TRANSCRIPTIONALLY ACTIVE PROTEIN                           coord: 130..194
None        No IPR available                                           PANTHER       PTHR15852:SF55   PROTEIN EMBRYO SAC DEVELOPMENT ARREST 3, CHLOROPLASTIC ISOFORM X1  coord: 130..194
IPR036410   Heat shock protein DnaJ, cysteine-rich domain superfamily  SUPERFAMILY   57938            DnaJ/Hsp40 cysteine-rich domain                                    coord: 133..187
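
The `coord` values give the position of each annotated region on the protein sequence. As a minimal sketch, assuming these coordinates are 1-based and inclusive (the page does not say so explicitly), the corresponding residues can be extracted as follows. The helper name `slice_feature` is hypothetical; the protein string is the one listed under "Protein sequence".

```python
# Protein sequence of Clc09G19590 as listed on this page.
PROTEIN = (
    "MCKKKALIERKPATRRAGSPRTRGHRRYSKVVPLLHRPPYRRPEFGSQNRGRWRPHFPFR"
    "QMGLARVRREWDELMKIGTSNPEKNNWKRGMGSSASNKIFPFQVLKWPPYYNTTITKPSI"
    "ISFAAPTASLLPKSCRNCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPE"
    "CGKGGLTPEQRGER"
)


def slice_feature(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return sequence[start - 1:end]


if __name__ == "__main__":
    # SUPERFAMILY 57938 (DnaJ/Hsp40 cysteine-rich domain), coord: 133..187.
    domain = slice_feature(PROTEIN, 133, 187)
    print(len(domain), domain)
```

The same helper applies to the PANTHER and MobiDB-lite rows by substituting their coordinates.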

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Clc09G19590.2    Clc09G19590.2    mRNA