Cp4.1LG04g07370 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG04g07370
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPhotosystem I reaction centre subunit N
LocationCp4.1LG04: 4075090 .. 4075934 (-)
RNA-Seq ExpressionCp4.1LG04g07370
SyntenyCp4.1LG04g07370
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGAGGGGATATGGACCAAGAAAGCCTCATCTGTTCATCAACCTCATCCACAACCCCAACACTCTCTCATTCCCATGAGCTCCATCGGCCAAAACATCCTGATGGCTCTCGCCCTCACTCTAAACCAATTCGCTTCCTCAAACGTTCAATCCGTCCAGAGAAACAAACCCAAAACGCCACCCACCACCACCGCCACCACCTCCGCCTCCACCTTCGCCAGTTCTGACATCCAACGAAGAGGCCTCCTCTTATCTGCCGCCGTTGCCGCCGCCGTGGACTCCAGAACCGAGCTCCTAAAAAGTGTGCTGCATTAATTCTTCCCTCTGTGTTCCATTTTTGAATGCTCGACTTTCGTTTCAATCCAAACATTTTATGTGTTGTTAGGGTACCTCAAGAAATCCGAAGAAAACAAAGAAAAGAACGACAAGGAGGTAAGTGCTTTTGTAATTTGTAATTTGGATTTGATTAATGTTTTGAAATTTGGGTTATTGGGTTTGTAATTAACAGAGATTGGAGAGTTTCTACAAGCGAAATTACAAAGATTATTTTGAGTTTGTTGAAGGATCTTTGAAGAACAAGGATGAACTTTCTGAAGCTGAGAAAGGTATCATTGAATGGCTTAAGAGAAACAAATAAATGAATTTCCATTCATTCTATCCTTTTCTTTTCATTTCTTCTAATCTCGTCTATCAATTATCATTCTAACGAGGATCGTTCAACGTTACAAATATTTAAAATTTTCTATCTATTAAAATTATTCGTGATATTATTCAATTTGAGCCTAGGGTTCATCAAGAACGCTTTTGTACCAATGAAGTTCCTGTCCTAATTTATACACGGTAGACT

mRNA sequence

GGAGGGGATATGGACCAAGAAAGCCTCATCTGTTCATCAACCTCATCCACAACCCCAACACTCTCTCATTCCCATGAGCTCCATCGGCCAAAACATCCTGATGGCTCTCGCCCTCACTCTAAACCAATTCGCTTCCTCAAACGTTCAATCCGTCCAGAGAAACAAACCCAAAACGCCACCCACCACCACCGCCACCACCTCCGCCTCCACCTTCGCCAGTTCTGACATCCAACGAAGAGGCCTCCTCTTATCTGCCGCCGTTGCCGCCGCCGTGGACTCCAGAACCGAGCTCCTAAAAAGGTACCTCAAGAAATCCGAAGAAAACAAAGAAAAGAACGACAAGGAGAGATTGGAGAGTTTCTACAAGCGAAATTACAAAGATTATTTTGAGTTTGTTGAAGGATCTTTGAAGAACAAGGATGAACTTTCTGAAGCTGAGAAAGGTATCATTGAATGGCTTAAGAGAAACAAATAAATGAATTTCCATTCATTCTATCCTTTTCTTTTCATTTCTTCTAATCTCGTCTATCAATTATCATTCTAACGAGGATCGTTCAACGTTACAAATATTTAAAATTTTCTATCTATTAAAATTATTCGTGATATTATTCAATTTGAGCCTAGGGTTCATCAAGAACGCTTTTGTACCAATGAAGTTCCTGTCCTAATTTATACACGGTAGACT

Coding sequence (CDS)

ATGAGCTCCATCGGCCAAAACATCCTGATGGCTCTCGCCCTCACTCTAAACCAATTCGCTTCCTCAAACGTTCAATCCGTCCAGAGAAACAAACCCAAAACGCCACCCACCACCACCGCCACCACCTCCGCCTCCACCTTCGCCAGTTCTGACATCCAACGAAGAGGCCTCCTCTTATCTGCCGCCGTTGCCGCCGCCGTGGACTCCAGAACCGAGCTCCTAAAAAGGTACCTCAAGAAATCCGAAGAAAACAAAGAAAAGAACGACAAGGAGAGATTGGAGAGTTTCTACAAGCGAAATTACAAAGATTATTTTGAGTTTGTTGAAGGATCTTTGAAGAACAAGGATGAACTTTCTGAAGCTGAGAAAGGTATCATTGAATGGCTTAAGAGAAACAAATAA

Protein sequence

MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLSAAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSEAEKGIIEWLKRNK
Homology
BLAST of Cp4.1LG04g07370 vs. NCBI nr
Match: XP_023531124.1 (uncharacterized protein LOC111793462 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 241 bits (615), Expect = 7.81e-80
Identity = 133/133 (100.00%), Postives = 133/133 (100.00%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS
Sbjct: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60

Query: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120
           AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE
Sbjct: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120

Query: 121 AEKGIIEWLKRNK 133
           AEKGIIEWLKRNK
Sbjct: 121 AEKGIIEWLKRNK 133

BLAST of Cp4.1LG04g07370 vs. NCBI nr
Match: XP_022927848.1 (uncharacterized protein LOC111434615 [Cucurbita moschata])

HSP 1 Score: 238 bits (607), Expect = 1.30e-78
Identity = 132/133 (99.25%), Postives = 132/133 (99.25%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSAST ASSDIQRRGLLLS
Sbjct: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTSASSDIQRRGLLLS 60

Query: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120
           AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE
Sbjct: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120

Query: 121 AEKGIIEWLKRNK 133
           AEKGIIEWLKRNK
Sbjct: 121 AEKGIIEWLKRNK 133

BLAST of Cp4.1LG04g07370 vs. NCBI nr
Match: KAG7022506.1 (hypothetical protein SDJN02_16238, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 237 bits (605), Expect = 2.62e-78
Identity = 131/133 (98.50%), Postives = 132/133 (99.25%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSAST ASSDIQRRGL+LS
Sbjct: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTSASSDIQRRGLILS 60

Query: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120
           AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE
Sbjct: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120

Query: 121 AEKGIIEWLKRNK 133
           AEKGIIEWLKRNK
Sbjct: 121 AEKGIIEWLKRNK 133

BLAST of Cp4.1LG04g07370 vs. NCBI nr
Match: XP_022989008.1 (uncharacterized protein LOC111486201 [Cucurbita maxima])

HSP 1 Score: 223 bits (569), Expect = 6.84e-73
Identity = 126/133 (94.74%), Postives = 127/133 (95.49%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTT     A+T ASSDIQRRGLLLS
Sbjct: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTT-----ATTSASSDIQRRGLLLS 60

Query: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120
           AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE
Sbjct: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120

Query: 121 AEKGIIEWLKRNK 133
           AEKGIIEWLKRNK
Sbjct: 121 AEKGIIEWLKRNK 128

BLAST of Cp4.1LG04g07370 vs. NCBI nr
Match: KAG6588719.1 (hypothetical protein SDJN03_17284, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 205 bits (522), Expect = 7.79e-65
Identity = 117/123 (95.12%), Postives = 119/123 (96.75%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSAS    SDIQRRGL+LS
Sbjct: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSAS----SDIQRRGLILS 60

Query: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120
           AAVAAAVDSRT+LLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE
Sbjct: 61  AAVAAAVDSRTDLLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 119

Query: 121 AEK 123
           AEK
Sbjct: 121 AEK 119

BLAST of Cp4.1LG04g07370 vs. ExPASy TrEMBL
Match: A0A6J1EM63 (uncharacterized protein LOC111434615 OS=Cucurbita moschata OX=3662 GN=LOC111434615 PE=3 SV=1)

HSP 1 Score: 238 bits (607), Expect = 6.27e-79
Identity = 132/133 (99.25%), Postives = 132/133 (99.25%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSAST ASSDIQRRGLLLS
Sbjct: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTSASSDIQRRGLLLS 60

Query: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120
           AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE
Sbjct: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120

Query: 121 AEKGIIEWLKRNK 133
           AEKGIIEWLKRNK
Sbjct: 121 AEKGIIEWLKRNK 133

BLAST of Cp4.1LG04g07370 vs. ExPASy TrEMBL
Match: A0A6J1JNZ7 (uncharacterized protein LOC111486201 OS=Cucurbita maxima OX=3661 GN=LOC111486201 PE=3 SV=1)

HSP 1 Score: 223 bits (569), Expect = 3.31e-73
Identity = 126/133 (94.74%), Postives = 127/133 (95.49%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTT     A+T ASSDIQRRGLLLS
Sbjct: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTT-----ATTSASSDIQRRGLLLS 60

Query: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120
           AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE
Sbjct: 61  AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDELSE 120

Query: 121 AEKGIIEWLKRNK 133
           AEKGIIEWLKRNK
Sbjct: 121 AEKGIIEWLKRNK 128

BLAST of Cp4.1LG04g07370 vs. ExPASy TrEMBL
Match: A0A1S3C176 (uncharacterized protein LOC103495319 OS=Cucumis melo OX=3656 GN=LOC103495319 PE=3 SV=1)

HSP 1 Score: 177 bits (448), Expect = 9.09e-55
Identity = 106/140 (75.71%), Postives = 114/140 (81.43%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQ+ILMALA+TLN+FASSNVQSVQRNK            A+   SS I RR LLLS
Sbjct: 1   MSSIGQSILMALAVTLNKFASSNVQSVQRNK------------ATATVSSPIGRRDLLLS 60

Query: 61  -------AAVAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLK 120
                  AA AAAVDSRTELLKRYLKKSEENKEKNDKERLES+YKRNYKDYFEFVEGS+K
Sbjct: 61  TVAPASTAAAAAAVDSRTELLKRYLKKSEENKEKNDKERLESYYKRNYKDYFEFVEGSVK 120

Query: 121 NKDELSEAEKGIIEWLKRNK 133
           NK+ELSEAEKGI+EWLKRNK
Sbjct: 121 NKNELSEAEKGIVEWLKRNK 128

BLAST of Cp4.1LG04g07370 vs. ExPASy TrEMBL
Match: A0A6J1D574 (uncharacterized protein LOC111017388 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111017388 PE=3 SV=1)

HSP 1 Score: 165 bits (418), Expect = 2.87e-50
Identity = 96/135 (71.11%), Postives = 110/135 (81.48%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQ+ILMALA+T+N+FASSNVQSV RN+            ++  A+SDI RRGLL S
Sbjct: 1   MSSIGQSILMALAVTVNKFASSNVQSVHRNQ------------SAAAAASDIGRRGLLFS 60

Query: 61  AAVAAA--VDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKDEL 120
           A  AA   VDSRTELLKRYLKKSE+NKEKNDKERL+S+YKRNYKDYFEFVEGS++NK EL
Sbjct: 61  AVAAAVAPVDSRTELLKRYLKKSEQNKEKNDKERLDSYYKRNYKDYFEFVEGSVRNKSEL 120

Query: 121 SEAEKGIIEWLKRNK 133
           SE EK IIEWL+RNK
Sbjct: 121 SETEKDIIEWLRRNK 123

BLAST of Cp4.1LG04g07370 vs. ExPASy TrEMBL
Match: A0A2I4HLI8 (uncharacterized protein LOC109019213 OS=Juglans regia OX=51240 GN=LOC109019213 PE=3 SV=1)

HSP 1 Score: 161 bits (407), Expect = 1.69e-48
Identity = 95/136 (69.85%), Postives = 113/136 (83.09%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSIGQ+ILMAL +T+N+FASSNVQ+V R + KTP +TT TT+      SDI RR LLLS
Sbjct: 1   MSSIGQSILMALTVTVNRFASSNVQAVHRRERKTPSSTTTTTT------SDIGRRCLLLS 60

Query: 61  AAVAA--AVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLK-NKDE 120
             VAA    DSRT+LLK+YLKKSEENK KNDKERL+S+YKRNYKDYFEFVEG+ K N+++
Sbjct: 61  TLVAAPQVADSRTDLLKQYLKKSEENKSKNDKERLDSYYKRNYKDYFEFVEGASKGNQEQ 120

Query: 121 LSEAEKGIIEWLKRNK 133
           LSEAEKGII+WL+RNK
Sbjct: 121 LSEAEKGIIDWLQRNK 130

BLAST of Cp4.1LG04g07370 vs. TAIR 10
Match: AT1G49975.1 (INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid membrane; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Photosystem I reaction centre subunit N (InterPro:IPR008796); Has 34 Blast hits to 34 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 133.7 bits (335), Expect = 1.1e-31
Identity = 81/137 (59.12%), Postives = 103/137 (75.18%), Query Frame = 0

Query: 1   MSSIGQNILMALALTLNQFASSNVQSVQRNKPKTPPTTTATTSASTFASSDIQRRGLLLS 60
           MSSI Q+ILMAL +T+N++ASSNVQ+V+RN  K          + T   +D+ RR +L S
Sbjct: 1   MSSISQSILMALTVTVNKYASSNVQAVRRNDTK--------RHSLTAPPADLGRRNILFS 60

Query: 61  AA--VAAAVDSRTELLKRYLKKSEENKEKNDKERLESFYKRNYKDYFEFVEGSLKNKD-- 120
           +   +AAA+ S  +LL++YLKK+EENK KNDKERL+SFYKRNYKDYFEFVEGS+K K   
Sbjct: 61  STSFIAAALTSSDQLLQKYLKKTEENKAKNDKERLDSFYKRNYKDYFEFVEGSIKGKTEA 120

Query: 121 ELSEAEKGIIEWLKRNK 134
           ELSE+EK I+EWLK NK
Sbjct: 121 ELSESEKRILEWLKANK 129

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023531124.17.81e-80100.00uncharacterized protein LOC111793462 [Cucurbita pepo subsp. pepo][more]
XP_022927848.11.30e-7899.25uncharacterized protein LOC111434615 [Cucurbita moschata][more]
KAG7022506.12.62e-7898.50hypothetical protein SDJN02_16238, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022989008.16.84e-7394.74uncharacterized protein LOC111486201 [Cucurbita maxima][more]
KAG6588719.17.79e-6595.12hypothetical protein SDJN03_17284, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
A0A6J1EM636.27e-7999.25uncharacterized protein LOC111434615 OS=Cucurbita moschata OX=3662 GN=LOC1114346... [more]
A0A6J1JNZ73.31e-7394.74uncharacterized protein LOC111486201 OS=Cucurbita maxima OX=3661 GN=LOC111486201... [more]
A0A1S3C1769.09e-5575.71uncharacterized protein LOC103495319 OS=Cucumis melo OX=3656 GN=LOC103495319 PE=... [more]
A0A6J1D5742.87e-5071.11uncharacterized protein LOC111017388 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A2I4HLI81.69e-4869.85uncharacterized protein LOC109019213 OS=Juglans regia OX=51240 GN=LOC109019213 P... [more]
Match NameE-valueIdentityDescription
AT1G49975.11.1e-3159.12INVOLVED IN: photosynthesis; LOCATED IN: photosystem I, chloroplast, thylakoid m... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 74..94
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 25..46
NoneNo IPR availablePANTHERPTHR36327UNNAMED PRODUCTcoord: 15..133
IPR008796Photosystem I reaction centre subunit N, chloroplasticPFAMPF05479PsaNcoord: 49..95
e-value: 1.1E-5
score: 25.6

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g07370.1Cp4.1LG04g07370.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015979 photosynthesis
cellular_component GO:0009507 chloroplast
cellular_component GO:0009522 photosystem I