CmaCh05G013170 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh05G013170
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat
Location: Cma_Chr05: 9974684 .. 9974975 (-)
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTTGTGGGTGTTCGTCTTCAGCTCAGAAAGTGTTCGACAAAATGCAAGACAGAAATGTGGTTTCTTGGGCAACCATGGTTGGTGCTTTTGCGCAGAGAAGGCCACTGAACCTTTTGAGATGGATGGAGCTTGGAGAACGTGAAAGCCAATGAGGTTGCTTTGATTCACGTTTTGACTGCTTGTGCCATGGCAAGGGATTTGGAAATGGTGAAATGGGTGCACGAGTGCATTGATGGCGATGACCATGGGTATCATAGCGTGTTGATGACAACACTATTGGAAGATTAA

mRNA sequence

ATGTGTTGTGGGTGTTCGTCTTCAGCTCAGAAAGTGTTCGACAAAATGCAAGACAGAAATGTGGTTTCTTGGGCAACCATGGTTGGTGCTTTTGCGCAGAGAAGGCCACTGAACCTTTTGAGATGGATGGAGCTTGGAGAACCCAATGAGGTTGCTTTGATTCACGTTTTGACTGCTTGTGCCATGGCAAGGGATTTGGAAATGGTGAAATGGGTGCACGAGTGCATTGATGGCGATGACCATGGGTATCATAGCGTGTTGATGACAACACTATTGGAAGATTAA
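
The gene sequence and the mRNA sequence listed above differ by a short internal segment. The sketch below is one way to recover such a segment, assuming the only difference is a single contiguous stretch that was removed from the genomic sequence. The function name `single_intron` is hypothetical, and the two strings are short excerpts copied from around the point where the listed sequences diverge, not the full records.

```python
from typing import Optional


def single_intron(genomic: str, spliced: str) -> Optional[str]:
    """Return the one contiguous segment present in genomic but absent from spliced.

    Returns None when the difference cannot be explained by a single
    removed segment (assumption: at most one intron, no substitutions).
    """
    if len(genomic) < len(spliced):
        return None
    # Length of the prefix shared by both sequences.
    p = 0
    while p < len(spliced) and genomic[p] == spliced[p]:
        p += 1
    # Length of the shared suffix, limited so it cannot overlap the prefix.
    s = 0
    while s < len(spliced) - p and genomic[-1 - s] == spliced[-1 - s]:
        s += 1
    # Prefix and suffix together must account for the whole spliced sequence.
    if p + s != len(spliced):
        return None
    return genomic[p:len(genomic) - s]


# Excerpts around the divergence point (not the full sequences).
gene_excerpt = "GGAGAACGTGAAAGCCAATGAGG"
mrna_excerpt = "GGAGAACCCAATGAGG"
print(single_intron(gene_excerpt, mrna_excerpt))  # GTGAAAG
```

When the bases flanking the removed segment repeat, the exact boundaries are ambiguous. This sketch resolves that by extending the shared prefix first.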

Coding sequence (CDS)

ATGTGTTGTGGGTGTTCGTCTTCAGCTCAGAAAGTGTTCGACAAAATGCAAGACAGAAATGTGGTTTCTTGGGCAACCATGGTTGGTGCTTTTGCGCAGAGAAGGCCACTGAACCTTTTGAGATGGATGGAGCTTGGAGAACCCAATGAGGTTGCTTTGATTCACGTTTTGACTGCTTGTGCCATGGCAAGGGATTTGGAAATGGTGAAATGGGTGCACGAGTGCATTGATGGCGATGACCATGGGTATCATAGCGTGTTGATGACAACACTATTGGAAGATTAA

Protein sequence

MCCGCSSSAQKVFDKMQDRNVVSWATMVGAFAQRRPLNLLRWMELGEPNEVALIHVLTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLED
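
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and an in-frame CDS. The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGTGTTGTGGGTGTTCGTCTTCAGCTCAG"))  # MCCGCSSSAQ
```

Passing the full CDS string to `translate` allows a direct comparison with the listed protein sequence.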
Homology
BLAST of CmaCh05G013170 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 1.2e-08
Identity = 35/101 (34.65%), Positives = 52/101 (51.49%), Query Frame = 0

Query: 4   GCSSSAQKVFDKMQDRNVVSWATMVGAFAQ----RRPLNLLRWMELGEPNEV-------A 63
           G    A+K+FD+M +RNV+SW+ ++  +      +  L+L R M+L +PNE         
Sbjct: 142 GLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFT 201

Query: 64  LIHVLTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLE 94
           +  VL+AC     LE  KWVH  ID        VL T L++
Sbjct: 202 MSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALID 242

BLAST of CmaCh05G013170 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 4.6e-08
Identity = 33/98 (33.67%), Positives = 56/98 (57.14%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQRRP----LNLLRWM--ELGEPNEVALIHV 62
           CG  + AQ+VFD+M DRNVVSW +++  F Q  P    L++ + M     EP+EV L  V
Sbjct: 200 CGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASV 259

Query: 63  LTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLED 95
           ++ACA    +++ + VH  +  +D   + ++++    D
Sbjct: 260 ISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVD 297

BLAST of CmaCh05G013170 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 58.2 bits (139), Expect = 6.0e-08
Identity = 34/98 (34.69%), Positives = 56/98 (57.14%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQR----RPLNLLRWMELGE---PNEVALIH 62
           CG   SA+KVFD+M +RNVVSW +M+  +A+R      ++L   M   E   PN V ++ 
Sbjct: 182 CGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVC 241

Query: 63  VLTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLE 94
           V++ACA   DLE  + V+  I       + ++++ L++
Sbjct: 242 VISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVD 279

BLAST of CmaCh05G013170 vs. ExPASy Swiss-Prot
Match: Q9SJG6 (Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E75 PE=2 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 7.8e-08
Identity = 30/97 (30.93%), Positives = 53/97 (54.64%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQ----RRPLNLLRWMELGE--PNEVALIHV 62
           CG    AQ +FD+M  RN VSW +M+  F +    +  L++ R M+  +  P+   ++ +
Sbjct: 205 CGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSL 264

Query: 63  LTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLE 94
           L ACA     E  +W+HE I  +    +S+++T L++
Sbjct: 265 LNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALID 301

BLAST of CmaCh05G013170 vs. ExPASy Swiss-Prot
Match: Q9SJZ3 (Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E28 PE=2 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 1.7e-07
Identity = 29/97 (29.90%), Positives = 54/97 (55.67%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQRR----PLNLLRWMELG--EPNEVALIHV 62
           CG    ++K+FD M++++VV W  M+G   Q +     L L + M+    +P+E+ +IH 
Sbjct: 336 CGLLDVSRKLFDDMEEKDVVLWNAMIGGSVQAKRGQDALALFQEMQTSNTKPDEITMIHC 395

Query: 63  LTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLE 94
           L+AC+    L++  W+H  I+      +  L T+L++
Sbjct: 396 LSACSQLGALDVGIWIHRYIEKYSLSLNVALGTSLVD 432

BLAST of CmaCh05G013170 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 60.5 bits (145), Expect = 8.6e-10
Identity = 35/101 (34.65%), Positives = 52/101 (51.49%), Query Frame = 0

Query: 4   GCSSSAQKVFDKMQDRNVVSWATMVGAFAQ----RRPLNLLRWMELGEPNEV-------A 63
           G    A+K+FD+M +RNV+SW+ ++  +      +  L+L R M+L +PNE         
Sbjct: 142 GLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFT 201

Query: 64  LIHVLTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLE 94
           +  VL+AC     LE  KWVH  ID        VL T L++
Sbjct: 202 MSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALID 242

BLAST of CmaCh05G013170 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 58.5 bits (140), Expect = 3.3e-09
Identity = 33/98 (33.67%), Positives = 56/98 (57.14%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQRRP----LNLLRWM--ELGEPNEVALIHV 62
           CG  + AQ+VFD+M DRNVVSW +++  F Q  P    L++ + M     EP+EV L  V
Sbjct: 200 CGNVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASV 259

Query: 63  LTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLED 95
           ++ACA    +++ + VH  +  +D   + ++++    D
Sbjct: 260 ISACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVD 297

BLAST of CmaCh05G013170 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 58.2 bits (139), Expect = 4.3e-09
Identity = 34/98 (34.69%), Positives = 56/98 (57.14%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQR----RPLNLLRWMELGE---PNEVALIH 62
           CG   SA+KVFD+M +RNVVSW +M+  +A+R      ++L   M   E   PN V ++ 
Sbjct: 182 CGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVC 241

Query: 63  VLTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLE 94
           V++ACA   DLE  + V+  I       + ++++ L++
Sbjct: 242 VISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVD 279

BLAST of CmaCh05G013170 vs. TAIR 10
Match: AT3G22690.2 (INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification; LOCATED IN: chloroplast; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1). )

HSP 1 Score: 58.2 bits (139), Expect = 4.3e-09
Identity = 34/98 (34.69%), Positives = 56/98 (57.14%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQR----RPLNLLRWMELGE---PNEVALIH 62
           CG   SA+KVFD+M +RNVVSW +M+  +A+R      ++L   M   E   PN V ++ 
Sbjct: 182 CGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVC 241

Query: 63  VLTACAMARDLEMVKWVHECIDGDDHGYHSVLMTTLLE 94
           V++ACA   DLE  + V+  I       + ++++ L++
Sbjct: 242 VISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVD 279

BLAST of CmaCh05G013170 vs. TAIR 10
Match: AT1G19720.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 55.8 bits (133), Expect = 2.1e-08
Identity = 26/85 (30.59%), Positives = 45/85 (52.94%), Query Frame = 0

Query: 3   CGCSSSAQKVFDKMQDRNVVSWATMVGAFAQRRPLNLLRWMELGE-----------PNEV 62
           CGC + A+KVFD M++RN+ +W+ M+GA+++       RW E+ +           P++ 
Sbjct: 128 CGCIADARKVFDSMRERNLFTWSAMIGAYSREN-----RWREVAKLFRLMMKDGVLPDDF 187

Query: 63  ALIHVLTACAMARDLEMVKWVHECI 77
               +L  CA   D+E  K +H  +
Sbjct: 188 LFPKILQGCANCGDVEAGKVIHSVV 207
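
The Identity and Positives percentages in the HSP headers above are the reported counts divided by the alignment length, which includes gap columns. A small sketch of that arithmetic, using a hypothetical helper named `percent` and the counts from the first Swiss-Prot hit:

```python
def percent(count: int, alignment_length: int) -> float:
    """Express a column count as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * count / alignment_length, 2)


# Counts reported for Q683I9: 35 identical and 52 positive columns out of 101.
print(percent(35, 101))  # 34.65
print(percent(52, 101))  # 51.49
```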

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name  E-value  Identity (%)  Description
Q683I9      1.2e-08  34.65         Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX...
Q9SIT7      4.6e-08  33.67         Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9LUJ2      6.0e-08  34.69         Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...
Q9SJG6      7.8e-08  30.93         Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidop...
Q9SJZ3      1.7e-07  29.90         Pentatricopeptide repeat-containing protein At2g22410, mitochondrial OS=Arabidop...

TAIR 10
Match Name   E-value  Identity (%)  Description
AT3G62890.1  8.6e-10  34.65         Pentatricopeptide repeat (PPR) superfamily protein
AT2G13600.1  3.3e-09  33.67         Pentatricopeptide repeat (PPR) superfamily protein
AT3G22690.1  4.3e-09  34.69         CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...
AT3G22690.2  4.3e-09  34.69         INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic pro...
AT1G19720.1  2.1e-08  30.59         Pentatricopeptide repeat (PPR-like) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                    Source   Source Term     Source Description                           Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10      Tetratricopeptide repeat domain              coord: 1..93, e-value: 6.7E-8, score: 34.3
None       No IPR available                                   PANTHER  PTHR47928       REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED  coord: 3..92
None       No IPR available                                   PANTHER  PTHR47928:SF46  SUBFAMILY NOT NAMED                          coord: 3..92

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh05G013170.1  CmaCh05G013170.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding