Cp4.1LG02g11030 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g11030
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG02 : 8948748 .. 8951273 (-)
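
The location value packs a scaffold identifier, a start and end coordinate, and a strand sign into one string. A minimal Python sketch of splitting it into fields follows. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'SEQID : START .. END (STRAND)' string into its fields."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG02 : 8948748 .. 8951273 (-)")
print(loc)
```

Under that assumption the feature spans 2526 bases on the minus strand of Cp4.1LG02.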
The following sequences are available for this feature:

Gene sequence (with intron)

GCTGCGATGAGCTGCTTCAACGCTGCCCCCACCGCCGCCGCCGCTGCTGCTGCTGCTAGCCCTCTATTGATGAATGGCTTTTCTATCTCACATGGCGCCGCTGTAAAATGCCTTCTTTCTCTTCTCAACCCCTCAGAAAACCTTGTTTGGTTCGCCCCAAATCCAAGCAATGCCCAGATTCCACTCGCCCATGTGTCGTTCAGTCCCTTTTCAGCCTCTCTTCTCGAGGGAACCTCTCGGAGGCCCTTTCTTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTGCCCTCTAGCGCTTTCGTCCACCTCTTGCGACTCTGTGCCAAAGCCAAGTCTCTCAAAGGTGGTAAATGTATTCATCTGCATTTGAAACATACGGGGTTTAAACGCCCCACGACTATTGTAGCCAACCATTTGATTGGTATGTATTTCAAATGTGGCCATGAGATTGATGCACGTAAGGTGTTTGATAAAATGTCTGTGAGGAATTTGTACTCTTGGAATCATATGCTTGCTGGGTATGCTAAGTTGGGGAATGTATATCAAGCTAGGAAGTTGTTTGATGGAATGACGGAGAAGGATGTTGTTTCTTGGAATACCATGGTTCTTGCTTATGCTAAGAAAGGGTGTTTCGATGAAGTTGTTGGGTTATACAGAGACTTCAGGAGACTGGAGATGGGTCTCAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTGAGGGAATTGCAGCTCACGAAGCAGGTTCATGGGCAGGTATTGGTTGTTGGATTTTTGTCTAATGTAGTGCTTTCTAGTTCAATAGTTGATGCATATGCAAAATGTGGAGAGATGGGATGTGCGCGGAAATTGTTTGATGAAATGCTTGTGAAAGATATCCTCGCGTGGACTACTATGGTATCTGGATATGCTAAATGGGGTGATATGAATTTGGCTAGCGAATTGTTTCACCAAATGCCTGAAAAGAATCCTGTCTCCTGGACAGCTTTAATATCAGGCTATGCAAGGAACGGTTTGGGGCACGACGCACTCGATTACTTCACAAAAATGATGATGTTTCGAATTCATCCCGACCAATATACATTTAGTAGTTGTCTCTGCGCATGTGCCGGTATTGCTGCACTGAAGCATGGTAAACAAGTACATGCGTATTTGATAAGAACGAACTTCAGATGCAACACAATAGTCGTCAGCTCTCTCATTGACATGTATTCAAAGTGTGGCATGTTAGAAGCAGGCTACCGTGTTTTTCACCTTATGGGAAATAAGCAGGATGTTGTATTGTGGAATACAATGATATCTTCCCTAGCTCAACATGGTCATGGGGAGAAAGCAATGCAGATGTTCAATGACATGGTTAAATCAGGATTGAAGCCTGATAGGATCACGTTCATCGTGATCCTTAGCGTGTGTAGTCATTCAGGGCTCGTGCGAGAGGGACATCGGTTTTTCAAGGCCATGACGTATGATCACAGCGTTCTCCCTGATCAAGAACACTATGCATGGTTAATTGATCTCTTGGGTCGAGCGGGATGTTTTAACGAGTTGGTAAACGAGCTAGAGAAGATGTCGTGTAAGCCGGATGATCGAGTATGGAATGCGTTACTTGGTGTGTGTAGAATGCATGGTAGTATGGTGCTAGGAAGAAAAGTGGGTGAGCATGTAATGGAGGTGGATCATCAATCTTGTGCAGCTCGTGAATCTCTTGCAAGTTTGTATGCTTTTCTAGGGAAATGGGAGTCAGTAGAAAAGGTGAGGGAAGAACTGGAAGAGAGATTGGTGAAGAAGGAGCGTGCAATGAGTTGGAGTGACATTGAAAATAAGGTACATTCTTTCATTGCAGATCACATTCATTGAAGGAAGATAGGCACATCCCATACAGAAGAACAATTTTCGATCATTGAAAAGAGGGTGTTAAGGTATTTGTTATTTTTGTTGGGATATTTCACACAAGTGATCTTACATATAGGATGATAAAATGCTCCATAT
TGAACCTTTGGGTCTTAATCCATATAATTTGTTTATTATCTCGCTATTAATCAATAATTATGGCCCGAGTTAGTTTCTGAGTTCATTTTTCGTGTTTAGTTTGTGCTATAAAATTTGGCCTTTGACCTATATTTCCATGATTGAAGCAAGAGGGAAAAGTCGCTAAGGGTGTCAGTGCCTCAAAATGCTTTTGCTCGTACCTTCGTCAGCAACGTAAAGTAGAGATGCCAATTTAACCTAAACAAAGAAGTAGAGGACGGAAGAGATTTTTCTATATTAACTTAGCAGAATCCTTATATATATATATATATATGGGAGTTGAACTGCTACGGAATAGGTCAACTTTTTAAACTGCTACTGAATCCATGGGCAGACAAGAGACAATACTGAATCCATGGGCAAACAAGAGACAATGTAGTAAACTGCTACTGAATTCATGGGCAGACAAGAGACAATGTAGTAAACTGCTACTGAATCTATGGGCAGACAAGAGACAATGTAAATCTCTCTATTACCTCTACTTCTA

mRNA sequence

GCTGCGATGAGCTGCTTCAACGCTGCCCCCACCGCCGCCGCCGCTGCTGCTGCTGCTAGCCCTCTATTGATGAATGGCTTTTCTATCTCACATGGCGCCGCTGTAAAATGCCTTCTTTCTCTTCTCAACCCCTCAGAAAACCTTGTTTGGTTCGCCCCAAATCCAAGCAATGCCCAGATTCCACTCGCCCATGTGTCGTTCAGTCCCTTTTCAGCCTCTCTTCTCGAGGGAACCTCTCGGAGGCCCTTTCTTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTGCCCTCTAGCGCTTTCGTCCACCTCTTGCGACTCTGTGCCAAAGCCAAGTCTCTCAAAGGTGATGGGTCTCAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTGAGGGAATTGCAGCTCACGAAGCAGAATCCTGTCTCCTGGACAGCTTTAATATCAGGCTATGCAAGGAACGGTTTGGGGCACGACGCACTCGATTACTTCACAAAAATGATGATGTTTCGAATTCATCCCGACCAATATACATTTAGTAGTTGTCTCTGCGCATGTGCCGGTATTGCTGCACTGAAGCATGGGAAATGGGAGTCAGTAGAAAAGGTGAGGGAAGAACTGGAAGAGAGATTGGTGAAGAAGGAGCGTGCAATGAGTTGGAGTGACATTGAAAATAAGATAGGCACATCCCATACAGAAGAACAATTTTCGATCATTGAAAAGAGGGTGTTAAGCAAGAGGGAAAAGTCGCTAAGGGTGTCAGTGCCTCAAAATGCTTTTGCTCGTACCTTCGTCAGCAACGTAAAGTAGAGATGCCAATTTAACCTAAACAAAGAAGTAGAGGACGGAAGAGATTTTTCTATATTAACTTAGCAGAATCCTTATATATATATATATATATGGGAGTTGAACTGCTACGGAATAGGTCAACTTTTTAAACTGCTACTGAATCCATGGGCAGACAAGAGACAATACTGAATCCATGGGCAAACAAGAGACAATGTAGTAAACTGCTACTGAATTCATGGGCAGACAAGAGACAATGTAGTAAACTGCTACTGAATCTATGGGCAGACAAGAGACAATGTAAATCTCTCTATTACCTCTACTTCTA

Coding sequence (CDS)

GCTGCGATGAGCTGCTTCAACGCTGCCCCCACCGCCGCCGCCGCTGCTGCTGCTGCTAGCCCTCTATTGATGAATGGCTTTTCTATCTCACATGGCGCCGCTGTAAAATGCCTTCTTTCTCTTCTCAACCCCTCAGAAAACCTTGTTTGGTTCGCCCCAAATCCAAGCAATGCCCAGATTCCACTCGCCCATGTGTCGTTCAGTCCCTTTTCAGCCTCTCTTCTCGAGGGAACCTCTCGGAGGCCCTTTCTTACCTCGACCCATTGGCCCAAAGAGGCATACGCTTGCCCTCTAGCGCTTTCGTCCACCTCTTGCGACTCTGTGCCAAAGCCAAGTCTCTCAAAGGTGATGGGTCTCAATGAGTTTAGTTTTGCTGGTGTTTTGATTCTTTGTGTGAAGCTGAGGGAATTGCAGCTCACGAAGCAGAATCCTGTCTCCTGGACAGCTTTAATATCAGGCTATGCAAGGAACGGTTTGGGGCACGACGCACTCGATTACTTCACAAAAATGATGATGTTTCGAATTCATCCCGACCAATATACATTTAGTAGTTGTCTCTGCGCATGTGCCGGTATTGCTGCACTGAAGCATGGGAAATGGGAGTCAGTAGAAAAGGTGAGGGAAGAACTGGAAGAGAGATTGGTGAAGAAGGAGCGTGCAATGAGTTGGAGTGACATTGAAAATAAGATAGGCACATCCCATACAGAAGAACAATTTTCGATCATTGAAAAGAGGGTGTTAAGCAAGAGGGAAAAGTCGCTAAGGGTGTCAGTGCCTCAAAATGCTTTTGCTCGTACCTTCGTCAGCAACGTAAAGTAG

Protein sequence

AAMSCFNAAPTAAAAAAAASPLLMNGFSISHGAAVKCLLSLLNPSENLVWFAPNPSNAQIPLAHVSFSPFSASLLEGTSRRPFLTSTHWPKEAYACPLALSSTSCDSVPKPSLSKVMGLNEFSFAGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKHGKWESVEKVREELEERLVKKERAMSWSDIENKIGTSHTEEQFSIIEKRVLSKREKSLRVSVPQNAFARTFVSNVK
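
The protein sequence above corresponds to reading the CDS in frame 1 from its first base. The following Python sketch shows that relationship on short excerpts of the CDS. It assumes the standard genetic code (the record does not name the translation table), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases and last 12 bases of the CDS listed above.
print(translate("GCTGCGATGAGCTGCTTCAACGCTGCCCCC"))
print(translate("AACGTAAAGTAG"))
```

The two excerpts translate to the first ten residues and the last three residues of the listed protein, with the terminal TAG read as the stop codon.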
BLAST of Cp4.1LG02g11030 vs. Swiss-Prot
Match: PP167_ARATH (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 2.6e-16
Identity = 37/62 (59.68%), Positives = 49/62 (79.03%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++NPVSWTALI+GY R G G+ ALD F KM+   + P+Q+TFSSCLCA A IA+L+H
Sbjct: 270 EMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRH 329

Query: 198 GK 200
           GK
Sbjct: 330 GK 331
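
Each HSP summary line reports its match counts as a fraction followed by the rounded percentage. The Python sketch below recomputes those percentages from the fractions; the helper name `hsp_fractions` is hypothetical, and rounding to two decimals is an assumption inferred from the values shown in this record.

```python
import re

def hsp_fractions(line):
    """Return {label: percent} for every 'Label = hits/total' pair in a line."""
    result = {}
    for label, hits, total in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        # Assumption: percentages are rounded to two decimal places.
        result[label] = round(100 * int(hits) / int(total), 2)
    return result

summary = "Identity = 37/62 (59.68%), Positives = 49/62 (79.03%), Query Frame = 1"
print(hsp_fractions(summary))
```

Fields without a fraction, such as the query frame, are skipped by the pattern.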

BLAST of Cp4.1LG02g11030 vs. Swiss-Prot
Match: PP189_ARATH (Pentatricopeptide repeat-containing protein At2g36980, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E73 PE=2 SV=1)

HSP 1 Score: 67.0 bits (162), Expect = 3.6e-10
Identity = 32/95 (33.68%), Positives = 53/95 (55.79%), Query Frame = 1

Query: 113 LSKVMGLNEFSFAGVLILCVKLRELQLT--------KQNPVSWTALISGYARNGLGHDAL 172
           L  +  L + S+  ++  C+K+ E +          ++N V+WT +I+GY RNG G  AL
Sbjct: 263 LESIEVLTQVSWNSIIDACMKIGETEKALEVFHLAPEKNIVTWTTMITGYGRNGDGEQAL 322

Query: 173 DYFTKMMMFRIHPDQYTFSSCLCACAGIAALKHGK 200
            +F +MM   +  D + + + L AC+G+A L HGK
Sbjct: 323 RFFVEMMKSGVDSDHFAYGAVLHACSGLALLGHGK 357

BLAST of Cp4.1LG02g11030 vs. Swiss-Prot
Match: PP200_ARATH (Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidopsis thaliana GN=PCMP-E75 PE=2 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 1.2e-08
Identity = 30/74 (40.54%), Positives = 43/74 (58.11%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++N VSW ++ISG+ RNG   DALD F +M    + PD +T  S L ACA + A + 
Sbjct: 217 EMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLNACAYLGASEQ 276

Query: 198 GKWESVEKVREELE 212
           G+W     VR   E
Sbjct: 277 GRWIHEYIVRNRFE 290

BLAST of Cp4.1LG02g11030 vs. Swiss-Prot
Match: PP315_ARATH (Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana GN=PCMP-E12 PE=2 SV=2)

HSP 1 Score: 61.6 bits (148), Expect = 1.5e-08
Identity = 31/91 (34.07%), Positives = 50/91 (54.95%), Query Frame = 1

Query: 125 AGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSS 184
           AG+L   +K+R+L       + W A+ISGY + GL  + L  +  M   RI PDQYTF+S
Sbjct: 162 AGILFRSLKIRDL-------IPWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFAS 221

Query: 185 CLCACAGIAALKHGKWESVEKVREELEERLV 216
              AC+ +  L+HGK      ++  ++  ++
Sbjct: 222 VFRACSALDRLEHGKRAHAVMIKRCIKSNII 245

BLAST of Cp4.1LG02g11030 vs. Swiss-Prot
Match: PP390_ARATH (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 2.0e-08
Identity = 31/76 (40.79%), Positives = 47/76 (61.84%), Query Frame = 1

Query: 124 FAGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFS 183
           F   + L  K++E ++ K + V+W+A ISGYA+ GLG++AL    +M+   I P++ T  
Sbjct: 311 FEDAVRLFEKMQEEKI-KMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLI 370

Query: 184 SCLCACAGIAALKHGK 200
           S L  CA + AL HGK
Sbjct: 371 SVLSGCASVGALMHGK 385

BLAST of Cp4.1LG02g11030 vs. TrEMBL
Match: A0A0A0K215_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G062860 PE=4 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 2.2e-22
Identity = 50/62 (80.65%), Positives = 57/62 (91.94%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           Q+ ++NPVSW+ALISGYARN LGH+ALDYFTKMM F I+P+QYTFSSCLCACA IAALKH
Sbjct: 286 QMPEKNPVSWSALISGYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLCACASIAALKH 345

Query: 198 GK 200
           GK
Sbjct: 346 GK 347

BLAST of Cp4.1LG02g11030 vs. TrEMBL
Match: F6I116_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g02080 PE=4 SV=1)

HSP 1 Score: 110.5 bits (275), Expect = 3.2e-21
Identity = 46/63 (73.02%), Positives = 57/63 (90.48%), Query Frame = 1

Query: 137 LQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALK 196
           +++ ++NPVSWTALISGYARNG+GH AL+ FTKMM+F + PDQ+TFSSCLCACA IA+LK
Sbjct: 282 VEMPEKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACASIASLK 341

Query: 197 HGK 200
           HGK
Sbjct: 342 HGK 344

BLAST of Cp4.1LG02g11030 vs. TrEMBL
Match: W9QSY8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_024972 PE=4 SV=1)

HSP 1 Score: 105.9 bits (263), Expect = 7.8e-20
Identity = 45/61 (73.77%), Positives = 55/61 (90.16%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           Q+ ++NPVSWTALI+GYARNG+G++AL  F KMMMF+I PDQ+TFSSCLCACA IA+LKH
Sbjct: 306 QMPEKNPVSWTALIAGYARNGMGYEALTLFRKMMMFQIRPDQFTFSSCLCACASIASLKH 365

Query: 198 G 199
           G
Sbjct: 366 G 366

BLAST of Cp4.1LG02g11030 vs. TrEMBL
Match: A0A067J9H6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06370 PE=4 SV=1)

HSP 1 Score: 104.4 bits (259), Expect = 2.3e-19
Identity = 45/61 (73.77%), Positives = 53/61 (86.89%), Query Frame = 1

Query: 139 LTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKHG 198
           + ++NPVSWTALISGYAR+GLGH AL+ FTKM+M  I PDQ+TFSSCLCACA IA+L HG
Sbjct: 122 MPEKNPVSWTALISGYARHGLGHKALELFTKMLMLHIRPDQFTFSSCLCACASIASLNHG 181

Query: 199 K 200
           K
Sbjct: 182 K 182

BLAST of Cp4.1LG02g11030 vs. TrEMBL
Match: M5W7K4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020837mg PE=4 SV=1)

HSP 1 Score: 103.2 bits (256), Expect = 5.1e-19
Identity = 45/78 (57.69%), Positives = 59/78 (75.64%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           Q+ ++NPVSWTALISGYARNGLG++AL  FT+MM++++ PDQ+TFSSCLCA A IA+LKH
Sbjct: 281 QMPEKNPVSWTALISGYARNGLGYEALALFTEMMLYQVRPDQFTFSSCLCASASIASLKH 340

Query: 198 GKWESVEKVREELEERLV 216
           GK      +R       +
Sbjct: 341 GKQVHASLIRSNFRPNTI 358

BLAST of Cp4.1LG02g11030 vs. TAIR10
Match: AT2G21090.1 (AT2G21090.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 87.4 bits (215), Expect = 1.5e-17
Identity = 37/62 (59.68%), Positives = 49/62 (79.03%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++NPVSWTALI+GY R G G+ ALD F KM+   + P+Q+TFSSCLCA A IA+L+H
Sbjct: 270 EMPEKNPVSWTALIAGYVRQGSGNRALDLFRKMIALGVKPEQFTFSSCLCASASIASLRH 329

Query: 198 GK 200
           GK
Sbjct: 330 GK 331

BLAST of Cp4.1LG02g11030 vs. TAIR10
Match: AT2G36980.1 (AT2G36980.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 67.0 bits (162), Expect = 2.0e-11
Identity = 32/95 (33.68%), Positives = 53/95 (55.79%), Query Frame = 1

Query: 113 LSKVMGLNEFSFAGVLILCVKLRELQLT--------KQNPVSWTALISGYARNGLGHDAL 172
           L  +  L + S+  ++  C+K+ E +          ++N V+WT +I+GY RNG G  AL
Sbjct: 263 LESIEVLTQVSWNSIIDACMKIGETEKALEVFHLAPEKNIVTWTTMITGYGRNGDGEQAL 322

Query: 173 DYFTKMMMFRIHPDQYTFSSCLCACAGIAALKHGK 200
            +F +MM   +  D + + + L AC+G+A L HGK
Sbjct: 323 RFFVEMMKSGVDSDHFAYGAVLHACSGLALLGHGK 357

BLAST of Cp4.1LG02g11030 vs. TAIR10
Match: AT2G42920.1 (AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 62.0 bits (149), Expect = 6.5e-10
Identity = 30/74 (40.54%), Positives = 43/74 (58.11%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++N VSW ++ISG+ RNG   DALD F +M    + PD +T  S L ACA + A + 
Sbjct: 217 EMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVKPDGFTMVSLLNACAYLGASEQ 276

Query: 198 GKWESVEKVREELE 212
           G+W     VR   E
Sbjct: 277 GRWIHEYIVRNRFE 290

BLAST of Cp4.1LG02g11030 vs. TAIR10
Match: AT4G16470.1 (AT4G16470.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 61.6 bits (148), Expect = 8.5e-10
Identity = 31/91 (34.07%), Positives = 50/91 (54.95%), Query Frame = 1

Query: 125 AGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSS 184
           AG+L   +K+R+L       + W A+ISGY + GL  + L  +  M   RI PDQYTF+S
Sbjct: 162 AGILFRSLKIRDL-------IPWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFAS 221

Query: 185 CLCACAGIAALKHGKWESVEKVREELEERLV 216
              AC+ +  L+HGK      ++  ++  ++
Sbjct: 222 VFRACSALDRLEHGKRAHAVMIKRCIKSNII 245

BLAST of Cp4.1LG02g11030 vs. TAIR10
Match: AT5G16860.1 (AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 61.2 bits (147), Expect = 1.1e-09
Identity = 31/76 (40.79%), Positives = 47/76 (61.84%), Query Frame = 1

Query: 124 FAGVLILCVKLRELQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFS 183
           F   + L  K++E ++ K + V+W+A ISGYA+ GLG++AL    +M+   I P++ T  
Sbjct: 311 FEDAVRLFEKMQEEKI-KMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLI 370

Query: 184 SCLCACAGIAALKHGK 200
           S L  CA + AL HGK
Sbjct: 371 SVLSGCASVGALMHGK 385

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: gi|449438554|ref|XP_004137053.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis sativus])

HSP 1 Score: 114.4 bits (285), Expect = 3.1e-22
Identity = 50/62 (80.65%), Positives = 57/62 (91.94%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           Q+ ++NPVSW+ALISGYARN LGH+ALDYFTKMM F I+P+QYTFSSCLCACA IAALKH
Sbjct: 286 QMPEKNPVSWSALISGYARNSLGHEALDYFTKMMKFGINPEQYTFSSCLCACASIAALKH 345

Query: 198 GK 200
           GK
Sbjct: 346 GK 347

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: gi|659110392|ref|XP_008455202.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis melo])

HSP 1 Score: 113.6 bits (283), Expect = 5.4e-22
Identity = 50/62 (80.65%), Positives = 56/62 (90.32%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           Q+ ++NPVSWTALISGYARN LGH+ALDYFTKMM   I+P+QYTFSSCLCACA IAALKH
Sbjct: 286 QMPEKNPVSWTALISGYARNSLGHEALDYFTKMMKLGINPEQYTFSSCLCACASIAALKH 345

Query: 198 GK 200
           GK
Sbjct: 346 GK 347

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: gi|297744641|emb|CBI37903.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 110.5 bits (275), Expect = 4.5e-21
Identity = 46/63 (73.02%), Positives = 57/63 (90.48%), Query Frame = 1

Query: 137 LQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALK 196
           +++ ++NPVSWTALISGYARNG+GH AL+ FTKMM+F + PDQ+TFSSCLCACA IA+LK
Sbjct: 282 VEMPEKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACASIASLK 341

Query: 197 HGK 200
           HGK
Sbjct: 342 HGK 344

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: gi|225427963|ref|XP_002277549.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Vitis vinifera])

HSP 1 Score: 110.5 bits (275), Expect = 4.5e-21
Identity = 46/63 (73.02%), Positives = 57/63 (90.48%), Query Frame = 1

Query: 137 LQLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALK 196
           +++ ++NPVSWTALISGYARNG+GH AL+ FTKMM+F + PDQ+TFSSCLCACA IA+LK
Sbjct: 282 VEMPEKNPVSWTALISGYARNGMGHKALELFTKMMLFHVRPDQFTFSSCLCACASIASLK 341

Query: 197 HGK 200
           HGK
Sbjct: 342 HGK 344

BLAST of Cp4.1LG02g11030 vs. NCBI nr
Match: gi|1009151207|ref|XP_015893431.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Ziziphus jujuba])

HSP 1 Score: 109.0 bits (271), Expect = 1.3e-20
Identity = 47/62 (75.81%), Positives = 57/62 (91.94%), Query Frame = 1

Query: 138 QLTKQNPVSWTALISGYARNGLGHDALDYFTKMMMFRIHPDQYTFSSCLCACAGIAALKH 197
           ++ ++NPVSWTALI+GYARNGLGH+AL  F+KM+MFRI PDQ+TFSSCLCACA IA+LKH
Sbjct: 288 KMPEKNPVSWTALIAGYARNGLGHEALTLFSKMVMFRIIPDQFTFSSCLCACASIASLKH 347

Query: 198 GK 200
           GK
Sbjct: 348 GK 349

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value   Identity (%)  Description
PP167_ARATH   2.6e-16   59.68         Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana GN...
PP189_ARATH   3.6e-10   33.68         Pentatricopeptide repeat-containing protein At2g36980, mitochondrial OS=Arabidop...
PP200_ARATH   1.2e-08   40.54         Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidop...
PP315_ARATH   1.5e-08   34.07         Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana GN...
PP390_ARATH   2.0e-08   40.79         Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana GN...

vs. TrEMBL
Match Name          E-value   Identity (%)  Description
A0A0A0K215_CUCSA    2.2e-22   80.65         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G062860 PE=4 SV=1
F6I116_VITVI        3.2e-21   73.02         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0038g02080 PE=4 SV=...
W9QSY8_9ROSA        7.8e-20   73.77         Uncharacterized protein OS=Morus notabilis GN=L484_024972 PE=4 SV=1
A0A067J9H6_JATCU    2.3e-19   73.77         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06370 PE=4 SV=1
M5W7K4_PRUPE        5.1e-19   57.69         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020837mg PE=4 SV=1

vs. TAIR10
Match Name    E-value   Identity (%)  Description
AT2G21090.1   1.5e-17   59.68         Pentatricopeptide repeat (PPR-like) superfamily protein
AT2G36980.1   2.0e-11   33.68         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G42920.1   6.5e-10   40.54         Pentatricopeptide repeat (PPR-like) superfamily protein
AT4G16470.1   8.5e-10   34.07         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G16860.1   1.1e-09   40.79         Tetratricopeptide repeat (TPR)-like superfamily protein

vs. NCBI nr
Match Name                          E-value   Identity (%)  Description
gi|449438554|ref|XP_004137053.1|    3.1e-22   80.65         PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis sativu...
gi|659110392|ref|XP_008455202.1|    5.4e-22   80.65         PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Cucumis melo]
gi|297744641|emb|CBI37903.3|        4.5e-21   73.02         unnamed protein product [Vitis vinifera]
gi|225427963|ref|XP_002277549.1|    4.5e-21   73.02         PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Vitis vinifera...
gi|1009151207|ref|XP_015893431.1|   1.3e-20   75.81         PREDICTED: pentatricopeptide repeat-containing protein At2g21090 [Ziziphus jujub...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0016787       hydrolase activity
molecular_function   GO:0005515       protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG02g11030.1   Cp4.1LG02g11030.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description            Source     Source Term       Source Description    Alignment
IPR002885   Pentatricopeptide repeat   PFAM       PF13041           PPR_2                 coord: 143..190, score: 5.
IPR002885   Pentatricopeptide repeat   TIGRFAMs   TIGR00756         TIGR00756             coord: 145..178, score: 1.
IPR002885   Pentatricopeptide repeat   PROFILE    PS51375           PPR                   coord: 143..177, score: 10.907; coord: 178..217, score: 5
None        No IPR available           unknown    Coil              Coil                  coord: 200..220, score:
None        No IPR available           PANTHER    PTHR24015         FAMILY NOT NAMED      coord: 117..199, score: 5.4
None        No IPR available           PANTHER    PTHR24015:SF554   SUBFAMILY NOT NAMED   coord: 117..199, score: 5.4

The following gene(s) are orthologous to this gene:
Gene              Orthologue   Organism            Block
Cp4.1LG02g11030   Carg26280    Silver-seed gourd   carcpeB0235
The following gene(s) are paralogous to this gene:

None