Cp4.1LG02g11030 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g11030
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG02: 8948748 .. 8951273 (-)
RNA-Seq Expression: Cp4.1LG02g11030
Synteny: Cp4.1LG02g11030
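The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the original record: the `Location` type and `parse_location` function are hypothetical names, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed form of a 'seqid: start .. end (strand)' string (hypothetical helper)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches e.g. "Cp4.1LG02: 8948748 .. 8951273 (-)"
_LOCATION_RE = re.compile(r"^\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end, and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG02: 8948748 .. 8951273 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the feature spans 2526 bases on the minus strand of Cp4.1LG02.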
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GCTGCGATGAGCTGCTTCAACGCTGCCCCCACCGCCGCCGCCGCTGCTGCTGCTGCTAGCCCTCTATTGATGAATGGCTTTTCTATCTCACATGGCGCCGCTGTAAAATGCCTTCTTTCTCTTCTCAACCCCTCAGAAAACCTTGTTTGGTTCGCCCCAAATCCAAGCAATGCCCAGATTCCACTCGCCCATGTGTCGTTCAGTCCCTTTTCAGCCTCTCTTCTCGAGGGAACCTCTCGGAGGCCCTTTCTTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTGCCCTCTAGCGCTTTCGTCCACCTCTTGCGACTCTGTGCCAAAGCCAAGTCTCTCAAAGGTGGTAAATGTATTCATCTGCATTTGAAACATACGGGGTTTAAACGCCCCACGACTATTGTAGCCAACCATTTGATTGGTATGTATTTCAAATGTGGCCATGAGATTGATGCACGTAAGGTGTTTGATAAAATGTCTGTGAGGAATTTGTACTCTTGGAATCATATGCTTGCTGGGTATGCTAAGTTGGGGAATGTATATCAAGCTAGGAAGTTGTTTGATGGAATGACGGAGAAGGATGTTGTTTCTTGGAATACCATGGTTCTTGCTTATGCTAAGAAAGGGTGTTTCGATGAAGTTGTTGGGTTATACAGAGACTTCAGGAGACTGGAGATGGGTCTCAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTGAGGGAATTGCAGCTCACGAAGCAGGTTCATGGGCAGGTATTGGTTGTTGGATTTTTGTCTAATGTAGTGCTTTCTAGTTCAATAGTTGATGCATATGCAAAATGTGGAGAGATGGGATGTGCGCGGAAATTGTTTGATGAAATGCTTGTGAAAGATATCCTCGCGTGGACTACTATGGTATCTGGATATGCTAAATGGGGTGATATGAATTTGGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTTTAATATCAGGCTATGCAAGGAACGGTTTGGGGCACGACGCACTCGATTACTTCACAAAAATGATGATGTTTCGAATTCATCCCGACCAATATACATTTAGTAGTTGTCTCTGCGCATGTGCCGGTATTGCTGCACTGAAGCATGGTAAACAAGTACATGCGTATTTGATAAGAACGAACTTCAGATGCAACACAATAGTCGTCAGCTCTCTCATTGACATGTATTCAAAGTGTGGCATGTTAGAAGCAGGCTACCGTGTTTTTCACCTTATGGGAAATAAGCAGGATGTTGTATTGTGGAATACAATGATATCTTCCCTAGCTCAACATGGTCATGGGGAGAAAGCAATGCAGATGTTCAATGACATGGTTAAATCAGGATTGAAGCCTGATAGGATCACGTTCATCGTGATCCTTAGCGTGTGTAGTCATTCAGGGCTCGTGCGAGAGGGACATCGGTTTTTCAAGGCCATGACGTATGATCACAGCGTTCTCCCTGATCAAGAACACTATGCATGGTTAATTGATCTCTTGGGTCGAGCGGGATGTTTTAACGAGTTGGTAAACGAGCTAGAGAAGATGTCGTGTAAGCCGGATGATCGAGTATGGAATGCGTTACTTGGTGTGTGTAGAATGCATGGTAGTATGGTGCTAGGAAGAAAAGTGGGTGAGCATGTAATGGAGGTGGATCATCAATCTTGTGCAGCTCGTGAATCTCTTGCAAGTTTGTATGCTTTTCTAGGGAAATGGGAGTCAGTAGAAAAGGTGAGGGAAGAACTGGAAGAGAGATTGGTGAAGAAGGAGCGTGCAATGAGTTGGAGTGACATTGAAAATAAGGTACATTCTTTCATTGCAGATCACATTCATTGAAGGAAGATAGGCACATCCCATACAGAAGAACAATTTTCGATCATTGAAAAGAGGGTGTTAAGGTATTTGTTATTTTTGTTGGGATATTTCACACAAGTGATCTTACATATAGGATGATAAAATGCTCCATAT
TGAACCTTTGGGTCTTAATCCATATAATTTGTTTATTATCTCGCTATTAATCAATAATTATGGCCCGAGTTAGTTTCTGAGTTCATTTTTCGTGTTTAGTTTGTGCTATAAAATTTGGCCTTTGACCTATATTTCCATGATTGAAGCAAGAGGGAAAAGTCGCTAAGGGTGTCAGTGCCTCAAAATGCTTTTGCTCGTACCTTCGTCAGCAACGTAAAGTAGAGATGCCAATTTAACCTAAACAAAGAAGTAGAGGACGGAAGAGATTTTTCTATATTAACTTAGCAGAATCCTTATATATATATATATATATGGGAGTTGAACTGCTACGGAATAGGTCAACTTTTTAAACTGCTACTGAATCCATGGGCAGACAAGAGACAATACTGAATCCATGGGCAAACAAGAGACAATGTAGTAAACTGCTACTGAATTCATGGGCAGACAAGAGACAATGTAGTAAACTGCTACTGAATCTATGGGCAGACAAGAGACAATGTAAATCTCTCTATTACCTCTACTTCTA

mRNA sequence

GCTGCGATGAGCTGCTTCAACGCTGCCCCCACCGCCGCCGCCGCTGCTGCTGCTGCTAGCCCTCTATTGATGAATGGCTTTTCTATCTCACATGGCGCCGCTGTAAAATGCCTTCTTTCTCTTCTCAACCCCTCAGAAAACCTTGTTTGGTTCGCCCCAAATCCAAGCAATGCCCAGATTCCACTCGCCCATGTGTCGTTCAGTCCCTTTTCAGCCTCTCTTCTCGAGGGAACCTCTCGGAGGCCCTTTCTTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTGCCCTCTAGCGCTTTCGTCCACCTCTTGCGACTCTGTGCCAAAGCCAAGTCTCTCAAAGGTGATGGGTCTCAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTGAGGGAATTGCAGCTCACGAAGCAGAATCCTGTCTCCTGGACAGCTTTAATATCAGGCTATGCAAGGAACGGTTTGGGGCACGACGCACTCGATTACTTCACAAAAATGATGATGTTTCGAATTCATCCCGACCAATATACATTTAGTAGTTGTCTCTGCGCATGTGCCGGTATTGCTGCACTGAAGCATGGGAAATGGGAGTCAGTAGAAAAGGTGAGGGAAGAACTGGAAGAGAGATTGGTGAAGAAGGAGCGTGCAATGAGTTGGAGTGACATTGAAAATAAGATAGGCACATCCCATACAGAAGAACAATTTTCGATCATTGAAAAGAGGGTGTTAAGCAAGAGGGAAAAGTCGCTAAGGGTGTCAGTGCCTCAAAATGCTTTTGCTCGTACCTTCGTCAGCAACGTAAAGTAGAGATGCCAATTTAACCTAAACAAAGAAGTAGAGGACGGAAGAGATTTTTCTATATTAACTTAGCAGAATCCTTATATATATATATATATATGGGAGTTGAACTGCTACGGAATAGGTCAACTTTTTAAACTGCTACTGAATCCATGGGCAGACAAGAGACAATACTGAATCCATGGGCAAACAAGAGACAATGTAGTAAACTGCTACTGAATTCATGGGCAGACAAGAGACAATGTAGTAAACTGCTACTGAATCTATGGGCAGACAAGAGACAATGTAAATCTCTCTATTACCTCTACTTCTA

Coding sequence (CDS)

GCTGCGATGAGCTGCTTCAACGCTGCCCCCACCGCCGCCGCCGCTGCTGCTGCTGCTAGCCCTCTATTGATGAATGGCTTTTCTATCTCACATGGCGCCGCTGTAAAATGCCTTCTTTCTCTTCTCAACCCCTCAGAAAACCTTGTTTGGTTCGCCCCAAATCCAAGCAATGCCCAGATTCCACTCGCCCATGTGTCGTTCAGTCCCTTTTCAGCCTCTCTTCTCGAGGGAACCTCTCGGAGGCCCTTTCTTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTGCCCTCTAGCGCTTTCGTCCACCTCTTGCGACTCTGTGCCAAAGCCAAGTCTCTCAAAGGTGATGGGTCTCAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTGAGGGAATTGCAGCTCACGAAGCAGAATCCTGTCTCCTGGACAGCTTTAATATCAGGCTATGCAAGGAACGGTTTGGGGCACGACGCACTCGATTACTTCACAAAAATGATGATGTTTCGAATTCATCCCGACCAATATACATTTAGTAGTTGTCTCTGCGCATGTGCCGGTATTGCTGCACTGAAGCATGGGAAATGGGAGTCAGTAGAAAAGGTGAGGGAAGAACTGGAAGAGAGATTGGTGAAGAAGGAGCGTGCAATGAGTTGGAGTGACATTGAAAATAAGATAGGCACATCCCATACAGAAGAACAATTTTCGATCATTGAAAAGAGGGTGTTAAGCAAGAGGGAAAAGTCGCTAAGGGTGTCAGTGCCTCAAAATGCTTTTGCTCGTACCTTCGTCAGCAACGTAAAGTAG

Protein sequence

AAMSCFNAAPTAAAAAAAASPLLMNGFSISHGAAVKCLLSLLNPSENLVWFAPNPSNAQIPLAHVSFSPFSASLLEGTSRRPFLTSTHWPKEAYACPLALSSTSCDSVPKPSLSKVMGLNEFSFAGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKHGKWESVEKVREELEERLVKKERAMSWSDIENKIGTSHTEEQFSIIEKRVLSKREKSLRVSVPQNAFARTFVSNVK
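The protein sequence can be reproduced from the coding sequence by translating it codon by codon. The sketch below assumes the standard nuclear genetic code and reading frame 0; `translate` is a hypothetical helper written for illustration, not a tool used by the database.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the CDS listed above.
print(translate("GCTGCGATGAGCTGCTTCAAC"))
```

The first seven codons of the listed CDS give `AAMSCFN`, which matches the start of the protein sequence. The final codons `GTA AAG TAG` give `VK` followed by a stop.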
Homology
BLAST of Cp4.1LG02g11030 vs. ExPASy Swiss-Prot
Match: Q9SKQ4 (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 2.7e-16
Identity = 37/62 (59.68%), Positives = 49/62 (79.03%), Query Frame = 0

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++NPVSWTALI+GY R G G+ ALD F KM+   + P+Q+TFSSCLCA A IA+L+H
Sbjct: 270 EMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRH 329

Query: 198 GK 200
           GK
Sbjct: 330 GK 331


HSP 2 Score: 44.3 bits (103), Expect = 2.6e-03
Identity = 19/34 (55.88%), Positives = 27/34 (79.41%), Query Frame = 0

Query: 197 HGKWESVEKVREELEERLVKKERAMSWSDIENKI 231
           HGKWE VEK+R  +++R V KE+A+SW +IE K+
Sbjct: 528 HGKWELVEKLRGVMKKRRVNKEKAVSWIEIEKKV 561
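The percentages in each HSP header follow directly from the match counts and the alignment length. As a rough sketch, the hypothetical helper `hsp_percentages` below recomputes them from a header line. It is an illustration only and not part of any BLAST tool.

```python
import re

# Captures "<label> = <count>/<alignment length>" pairs in an HSP header line.
_FRACTION_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(header: str) -> dict[str, float]:
    """Return each count/length pair in the header as a percentage (2 decimals)."""
    return {
        label: round(100 * int(count) / int(length), 2)
        for label, count, length in _FRACTION_RE.findall(header)
    }


header = "Identity = 37/62 (59.68%), Positives = 49/62 (79.03%), Query Frame = 0"
print(hsp_percentages(header))
```

For the first HSP against Q9SKQ4, 37 identical residues over 62 aligned positions gives 59.68%, and 49 positive-scoring positions gives 79.03%.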

BLAST of Cp4.1LG02g11030 vs. ExPASy Swiss-Prot
Match: Q9SJK9 (Pentatricopeptide repeat-containing protein At2g36980, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E73 PE=2 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 3.7e-10
Identity = 32/95 (33.68%), Positives = 53/95 (55.79%), Query Frame = 0

Query: 113 LSKVMGLNEFSFAGVLILCVKLRELQ--------LTKQNPVSWTALISGYARNGLGHDAL 172
           L  +  L + S+  ++  C+K+ E +          ++N V+WT +I+GY RNG G  AL
Sbjct: 263 LESIEVLTQVSWNSIIDACMKIGETEKALEVFHLAPEKNIVTWTTMITGYGRNGDGEQAL 322

Query: 173 DYFTKMMMFRIHPDQYTFSSCLCACAGIAALKHGK 200
            +F +MM   +  D + + + L AC+G+A L HGK
Sbjct: 323 RFFVEMMKSGVDSDHFAYGAVLHACSGLALLGHGK 357

BLAST of Cp4.1LG02g11030 vs. ExPASy Swiss-Prot
Match: Q9LJI9 (Pentatricopeptide repeat-containing protein At3g28660 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E80 PE=2 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 9.2e-09
Identity = 28/72 (38.89%), Positives = 46/72 (63.89%), Query Frame = 0

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ + + V W  L++GY R GLG + L+ F +M++  I PD+++ ++ L ACA + AL  
Sbjct: 177 EIPQPDVVKWDVLMNGYVRCGLGSEGLEVFKEMLVRGIEPDEFSVTTALTACAQVGALAQ 236

Query: 198 GKW--ESVEKVR 208
           GKW  E V+K R
Sbjct: 237 GKWIHEFVKKKR 248

BLAST of Cp4.1LG02g11030 vs. ExPASy Swiss-Prot
Match: Q9SJG6 (Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E75 PE=2 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 1.2e-08
Identity = 30/74 (40.54%), Positives = 43/74 (58.11%), Query Frame = 0

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++N VSW ++ISG+ RNG   DALD F +M    + PD +T  S L ACA + A + 
Sbjct: 217 EMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLNACAYLGASEQ 276

Query: 198 GKWESVEKVREELE 212
           G+W     VR   E
Sbjct: 277 GRWIHEYIVRNRFE 290

BLAST of Cp4.1LG02g11030 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 2.0e-08
Identity = 31/76 (40.79%), Positives = 47/76 (61.84%), Query Frame = 0

Query: 124 FAGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFS 183
           F   + L  K++E ++ K + V+W+A ISGYA+ GLG++AL    +M+   I P++ T  
Sbjct: 311 FEDAVRLFEKMQEEKI-KMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLI 370

Query: 184 SCLCACAGIAALKHGK 200
           S L  CA + AL HGK
Sbjct: 371 SVLSGCASVGALMHGK 385

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: XP_023523821.1 (pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 143 bits (360), Expect = 6.21e-35
Identity = 83/157 (52.87%), Positives = 83/157 (52.87%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MGLNEFSFAGVLILCVKLRELQLTKQ                                  
Sbjct: 191 MGLNEFSFAGVLILCVKLRELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCAR 250

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARNGLGHD
Sbjct: 251 KLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNGLGHD 310

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: XP_022973379.1 (pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima] >XP_022973380.1 pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima])

HSP 1 Score: 140 bits (353), Expect = 5.89e-34
Identity = 81/157 (51.59%), Positives = 83/157 (52.87%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MGLNEFSF+GVLILCVKLRELQLTKQ                                  
Sbjct: 191 MGLNEFSFSGVLILCVKLRELQLTKQVHGQVLAVGFLSNVVLSSSIIDAYAKCGEMGCAR 250

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARNGLGH+
Sbjct: 251 KLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNGLGHE 310

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: XP_022932441.1 (pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita moschata] >XP_022932442.1 pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita moschata])

HSP 1 Score: 140 bits (353), Expect = 6.44e-34
Identity = 82/157 (52.23%), Positives = 83/157 (52.87%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MGLNEFSFAGVLILCVKLRELQLTKQ                                  
Sbjct: 202 MGLNEFSFAGVLILCVKLRELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCAR 261

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARNGLGHD
Sbjct: 262 RLFDEMLVKDILAWTTMVSGYAKWGDINFASELFHQMPEKNPVSWTALISGYARNGLGHD 321

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: KAG6607537.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 138 bits (347), Expect = 1.10e-32
Identity = 81/157 (51.59%), Positives = 82/157 (52.23%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MGLNEFSFAGVLILCVKLRELQLTKQ                                  
Sbjct: 191 MGLNEFSFAGVLILCVKLRELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCAR 250

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARNGLGHD
Sbjct: 251 RLFDEMLVKDILAWTTMVSGYAKWGDINFASELFHQMPEKNPVSWTALISGYARNGLGHD 310

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: KAG7037178.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 136 bits (342), Expect = 2.00e-32
Identity = 80/157 (50.96%), Positives = 82/157 (52.23%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MGLNEFSFAGVLILCVKLRELQLTKQ                                  
Sbjct: 191 MGLNEFSFAGVLILCVKLRELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCAR 250

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARNGLGHD
Sbjct: 251 RLFDEMLVKDIFAWTTMVSGYAKWGDINFASELFHQMPEKNPVSWTALISGYARNGLGHD 310

BLAST of Cp4.1LG02g11030 vs. ExPASy TrEMBL
Match: A0A6J1IEE4 (pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima OX=3661 GN=LOC111471937 PE=4 SV=1)

HSP 1 Score: 140 bits (353), Expect = 2.85e-34
Identity = 81/157 (51.59%), Positives = 83/157 (52.87%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MGLNEFSF+GVLILCVKLRELQLTKQ                                  
Sbjct: 191 MGLNEFSFSGVLILCVKLRELQLTKQVHGQVLAVGFLSNVVLSSSIIDAYAKCGEMGCAR 250

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARNGLGH+
Sbjct: 251 KLFDEMLVKDILAWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNGLGHE 310

BLAST of Cp4.1LG02g11030 vs. ExPASy TrEMBL
Match: A0A6J1EWD5 (pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita moschata OX=3662 GN=LOC111438861 PE=4 SV=1)

HSP 1 Score: 140 bits (353), Expect = 3.12e-34
Identity = 82/157 (52.23%), Positives = 83/157 (52.87%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MGLNEFSFAGVLILCVKLRELQLTKQ                                  
Sbjct: 202 MGLNEFSFAGVLILCVKLRELQLTKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCAR 261

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARNGLGHD
Sbjct: 262 RLFDEMLVKDILAWTTMVSGYAKWGDINFASELFHQMPEKNPVSWTALISGYARNGLGHD 321

BLAST of Cp4.1LG02g11030 vs. ExPASy TrEMBL
Match: A0A6J1C3S4 (pentatricopeptide repeat-containing protein At2g21090 OS=Momordica charantia OX=3673 GN=LOC111007990 PE=4 SV=1)

HSP 1 Score: 125 bits (314), Expect = 7.76e-29
Identity = 73/157 (46.50%), Positives = 78/157 (49.68%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MG NEFSFAG+LILCVK++ELQL KQ                                  
Sbjct: 187 MGFNEFSFAGLLILCVKIKELQLAKQVHGQVLVVGFLSNVVLSSSIVDAYAKCGEMGCAR 246

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARN LGH+
Sbjct: 247 RLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHE 306

BLAST of Cp4.1LG02g11030 vs. ExPASy TrEMBL
Match: A0A6J1HUW8 (pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima OX=3661 GN=LOC111467037 PE=4 SV=1)

HSP 1 Score: 124 bits (310), Expect = 2.77e-28
Identity = 72/157 (45.86%), Positives = 78/157 (49.68%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MG NEFSFAGVLILCVKL+ELQL KQ                                  
Sbjct: 191 MGFNEFSFAGVLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAYAKCGEMECAK 250

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARN LGH+
Sbjct: 251 RLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALISGYARNSLGHE 310

BLAST of Cp4.1LG02g11030 vs. ExPASy TrEMBL
Match: A0A6J1HLP5 (pentatricopeptide repeat-containing protein At2g21090 OS=Cucurbita moschata OX=3662 GN=LOC111464099 PE=4 SV=1)

HSP 1 Score: 122 bits (307), Expect = 7.07e-28
Identity = 71/157 (45.22%), Positives = 78/157 (49.68%), Query Frame = 0

Query: 117 MGLNEFSFAGVLILCVKLRELQLTKQ---------------------------------- 176
           MG NEFSFAG+LILCVKL+ELQL KQ                                  
Sbjct: 191 MGFNEFSFAGLLILCVKLKELQLAKQVHTQVLVVGFLSNIVLSSSIVDAYAKCGEMECAK 250

Query: 177 ----------------------------------------NPVSWTALISGYARNGLGHD 199
                                                   NPVSWTALISGYARN LGH+
Sbjct: 251 RLFDEMPVKDILAWTTMVSGYAKWGDMNLASGLFHQMPEKNPVSWTALISGYARNSLGHE 310

BLAST of Cp4.1LG02g11030 vs. TAIR 10
Match: AT2G21090.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 87.4 bits (215), Expect = 1.9e-17
Identity = 37/62 (59.68%), Positives = 49/62 (79.03%), Query Frame = 0

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++NPVSWTALI+GY R G G+ ALD F KM+   + P+Q+TFSSCLCA A IA+L+H
Sbjct: 270 EMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRH 329

Query: 198 GK 200
           GK
Sbjct: 330 GK 331


HSP 2 Score: 44.3 bits (103), Expect = 1.8e-04
Identity = 19/34 (55.88%), Positives = 27/34 (79.41%), Query Frame = 0

Query: 197 HGKWESVEKVREELEERLVKKERAMSWSDIENKI 231
           HGKWE VEK+R  +++R V KE+A+SW +IE K+
Sbjct: 528 HGKWELVEKLRGVMKKRRVNKEKAVSWIEIEKKV 561

BLAST of Cp4.1LG02g11030 vs. TAIR 10
Match: AT2G36980.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 67.0 bits (162), Expect = 2.6e-11
Identity = 32/95 (33.68%), Positives = 53/95 (55.79%), Query Frame = 0

Query: 113 LSKVMGLNEFSFAGVLILCVKLRELQ--------LTKQNPVSWTALISGYARNGLGHDAL 172
           L  +  L + S+  ++  C+K+ E +          ++N V+WT +I+GY RNG G  AL
Sbjct: 263 LESIEVLTQVSWNSIIDACMKIGETEKALEVFHLAPEKNIVTWTTMITGYGRNGDGEQAL 322

Query: 173 DYFTKMMMFRIHPDQYTFSSCLCACAGIAALKHGK 200
            +F +MM   +  D + + + L AC+G+A L HGK
Sbjct: 323 RFFVEMMKSGVDSDHFAYGAVLHACSGLALLGHGK 357

BLAST of Cp4.1LG02g11030 vs. TAIR 10
Match: AT3G28660.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 62.4 bits (150), Expect = 6.5e-10
Identity = 28/72 (38.89%), Positives = 46/72 (63.89%), Query Frame = 0

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ + + V W  L++GY R GLG + L+ F +M++  I PD+++ ++ L ACA + AL  
Sbjct: 177 EIPQPDVVKWDVLMNGYVRCGLGSEGLEVFKEMLVRGIEPDEFSVTTALTACAQVGALAQ 236

Query: 198 GKW--ESVEKVR 208
           GKW  E V+K R
Sbjct: 237 GKWIHEFVKKKR 248

BLAST of Cp4.1LG02g11030 vs. TAIR 10
Match: AT2G42920.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 62.0 bits (149), Expect = 8.5e-10
Identity = 30/74 (40.54%), Positives = 43/74 (58.11%), Query Frame = 0

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++N VSW ++ISG+ RNG   DALD F +M    + PD +T  S L ACA + A + 
Sbjct: 217 EMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLNACAYLGASEQ 276

Query: 198 GKWESVEKVREELE 212
           G+W     VR   E
Sbjct: 277 GRWIHEYIVRNRFE 290

BLAST of Cp4.1LG02g11030 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 61.2 bits (147), Expect = 1.5e-09
Identity = 31/76 (40.79%), Positives = 47/76 (61.84%), Query Frame = 0

Query: 124 FAGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFS 183
           F   + L  K++E ++ K + V+W+A ISGYA+ GLG++AL    +M+   I P++ T  
Sbjct: 311 FEDAVRLFEKMQEEKI-KMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLI 370

Query: 184 SCLCACAGIAALKHGK 200
           S L  CA + AL HGK
Sbjct: 371 SVLSGCASVGALMHGK 385

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9SKQ4 | 2.7e-16 | 59.68 | Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana OX...
Q9SJK9 | 3.7e-10 | 33.68 | Pentatricopeptide repeat-containing protein At2g36980, mitochondrial OS=Arabidop...
Q9LJI9 | 9.2e-09 | 38.89 | Pentatricopeptide repeat-containing protein At3g28660 OS=Arabidopsis thaliana OX...
Q9SJG6 | 1.2e-08 | 40.54 | Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidop...
Q9LFL5 | 2.0e-08 | 40.79 | Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX...

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023523821.1 | 6.21e-35 | 52.87 | pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita pepo subsp...
XP_022973379.1 | 5.89e-34 | 51.59 | pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima] >X...
XP_022932441.1 | 6.44e-34 | 52.23 | pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita moschata] ...
KAG6607537.1 | 1.10e-32 | 51.59 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG7037178.1 | 2.00e-32 | 50.96 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1IEE4 | 2.85e-34 | 51.59 | pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima O...
A0A6J1EWD5 | 3.12e-34 | 52.23 | pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita moschata...
A0A6J1C3S4 | 7.76e-29 | 46.50 | pentatricopeptide repeat-containing protein At2g21090 OS=Momordica charantia OX=...
A0A6J1HUW8 | 2.77e-28 | 45.86 | pentatricopeptide repeat-containing protein At2g21090-like OS=Cucurbita maxima O...
A0A6J1HLP5 | 7.07e-28 | 45.22 | pentatricopeptide repeat-containing protein At2g21090 OS=Cucurbita moschata OX=3...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G21090.1 | 1.9e-17 | 59.68 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G36980.1 | 2.6e-11 | 33.68 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G28660.1 | 6.5e-10 | 38.89 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G42920.1 | 8.5e-10 | 40.54 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G16860.1 | 1.5e-09 | 40.79 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 200..220
None | No IPR available | PANTHER | PTHR47926:SF136 | BNAA04G11950D PROTEIN | coord: 135..199
None | No IPR available | PANTHER | PTHR47926 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN | coord: 135..199
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 143..190; e-value: 3.5E-9; score: 36.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 145..178; e-value: 1.1E-7; score: 29.5
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 143..177; score: 10.906551
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 117..261; e-value: 9.4E-14; score: 53.5

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG02g11030.1 | Cp4.1LG02g11030.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding