Cp4.1LG01g20930 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g20930
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG01 : 17698405 .. 17700430 (+)
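
The location string above can be split into its parts programmatically. The following is a minimal sketch, not the database's own tooling: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Pattern for strings such as "Cp4.1LG01 : 17698405 .. 17700430 (+)".
LOCATION = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into sequence ID, start, end, strand and length.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

location = parse_location("Cp4.1LG01 : 17698405 .. 17700430 (+)")
```

Under the inclusive-coordinate assumption, `location["length"]` gives the span of the gene on the forward strand of Cp4.1LG01.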
The following sequences are available for this feature:

Gene sequence (with intron)

CTACAAATCATGATGGAAGAAGATGAAGCAGCAGCAGCTGAACCAGCACCAACCAAGACACACCTGATACTGCTATTGCTCGAATCCAAGTTCCGTTTTTCATAGCGGCAACCGTACTTATATGCACTTCCTGCTAAAGAATCTTCGTGGTGCGCACAGAGCTTCGTTTTCCCATTACTCCTATCTGATTGACCAGTGTTTAACGTCTAGATCTGTTCATATTGCGAAAACAATCCACGCTCAATTGCTAAAGCTTGGCCTTAACAACAATACTTTTCTGGGTAATCGCTGTCTTCAGCTATACGCGTTATTTGGTCCCGTTCATGAATTTTTTAGAGTATTTGGTGATATCAAAGAGAAGAATATTGTATCTTGGAACATATGCTTGAAGGGGTTGTTTAGATTTGGTTATGTCAATGGTGCACGCAATCTGTTCGATGAAATGCCTGAGAGGGACATTGTTTCTTGGAATTGCATGATGTCTGGCTGCGTTTCTTCTGGGTTTGCTAATAAGGCTATGGATGTGTTTCTGGAGATGCAGGATGCTGGTTTTAGACCAAGTGAATATACGTTTTCCATTATGCTTTCGGTCGTGTCGTGTCCTATTCATGGCAAGCAAATTCATGGCAGTATGATTCGAAGTGGCATTGATGTGTCAAATGTGGTTCTTGGGAATTCATTGATTGATATGTATGGAAAATTTGGCCTTGTTGATTATATGTTTGTCATGTTTTATAGTATGGAAAAGGTGGATATTATCTCTTGGAACTCTTTGATTTTGGGCTGCCACAGATCAGGCTTTAGAGTATTGGCACTAAATCAGTTCTATCTTATGAGAGCCACTGGGCACTCTCCTGATCAGTTCACTGTATCAATAATGATAAGTTTATGTTCTTTCCTCCAAGAATTGGAACTGGGTAAGCAATTTCTTGCTTTCTGTTTCAAGATGGGATTTACTTCTAACAGCATTGTACTTAGTGCTGCTATTGACTTGTTTTCCAAATGCAACAGCTTGAGGGATGCAATGCAGCTTTTTGAAGAAGCCAATCAATGGGATCAAGCTCTTTGTGATGCCATGATCTCAAGTTTGGCATGGCATGGTCATTGGAGGGATTCCATGTGGCTTTTTGTGTATGCCTTAAGGGAGAACCTCAGGCCAACAGGGATTACACTCAGCAGTGTCCTGAGCTCCATTTCAGTCTTCACACCTCTGGACTTAGGTAGTCAAATTCATAATTTGGTTCTTAAGTTGGGTTTTGAGTCTGATACCGTCGTCACTAGTTCGCTCGTCGACATGTATGCTAAAATTGGATTAATTGATAACGCCATGAAAGTCTTCACGGATATGCCTTCGAGAGATTTAATATCTTGGAACACTATGATTATGGGTCTGGTTAACAATGGTAAATACTTTGAGGCCTTGGGCACACTTGAAAATTTGGTTAGGGAAGGTGTAGTGGCAGATAGGATAACACTAGCTGGAGTTTTATTAGCTTGCAGCCATGCTGGTTTTGTTGATGAAGGGCTAAACATCTTCTGTACAATGGAAAATGAACATGGAGTCGTACCGACGAACGAGCATTATACTTGTGTGGTGGACTTACTGAGTCGGGCTGGTAAATTCAAAGAAGCAGTTAATATCATCGAAACAACATCGTGCCAACCTACTTCTACGTTTTGGATATCACTACTAGATGCCTGTGCTATTCATGGAGACATGAACAGCATTGAAAGAGTTGCGGAGAGGGTGATGAAGCTGGAACCTCAATCATCCTTACCGTATTCGGTGCTGGCTCGAGTATACGCAGCGAGAGGCCGATGGGAAAGCACTGTTCGTGTCAGGAAGGCCATGGAGAATATAGCTGCACAGAAGGTGAAGGCTTGCAGCTGGGTTGTGATCAAAGATCATGTGTATGCTTTCCAGGATGACCGGTTGCAGCATCTGCGAGGAGAAAGTTTGATTTCTGCGTTGGAGCTGATTGTTTGGGAGGTGGAATATG
GGAATGAACACAAACAGTATGTTTAA

mRNA sequence

CTACAAATCATGATGGAAGAAGATGAAGCAGCAGCAGCTGAACCAGCACCAACCAAGACACACCTGATACTGCTATTGCTCGAATCCAAAGTATTTGGTGATATCAAAGAGAAGAATATTGTATCTTGGAACATATGCTTGAAGGGGTTGTTTAGATTTGGTTATGTCAATGGTGCACGCAATCTGTTCGATGAAATGCCTGAGAGGGACATTGTTTCTTGGAATTGCATGATGTCTGGCTGCGTTTCTTCTGGGTTTGCTAATAAGGCTATGGATGTGTTTCTGGAGATGCAGGATGCTGGTTTTAGACCAAGTGAATATACGTTTTCCATTATGCTTTCGGTCGTGTCGTGTCCTATTCATGGCAAGCAAATTCATGGCAGTATGATTCGAAGTGGCATTGATGTGTCAAATGTGGTTCTTGGGAATTCATTGATTGATATCTTGAGGGATGCAATGCAGCTTTTTGAAGAAGCCAATCAATGGGATCAAGCTCTTTGTGATGCCATGATCTCAAGTTTGGCATGGCATGGTCATTGGAGGGATTCCATGAAGGCCATGGAGAATATAGCTGCACAGAAGGTGAAGGCTTGCAGCTGGGTTGTGATCAAAGATCATGTGTATGCTTTCCAGGATGACCGGTTGCAGCATCTGCGAGGAGAAAGTTTGATTTCTGCGTTGGAGCTGATTGTTTGGGAGGTGGAATATGGGAATGAACACAAACAGTATGTTTAA

Coding sequence (CDS)

CTACAAATCATGATGGAAGAAGATGAAGCAGCAGCAGCTGAACCAGCACCAACCAAGACACACCTGATACTGCTATTGCTCGAATCCAAAGTATTTGGTGATATCAAAGAGAAGAATATTGTATCTTGGAACATATGCTTGAAGGGGTTGTTTAGATTTGGTTATGTCAATGGTGCACGCAATCTGTTCGATGAAATGCCTGAGAGGGACATTGTTTCTTGGAATTGCATGATGTCTGGCTGCGTTTCTTCTGGGTTTGCTAATAAGGCTATGGATGTGTTTCTGGAGATGCAGGATGCTGGTTTTAGACCAAGTGAATATACGTTTTCCATTATGCTTTCGGTCGTGTCGTGTCCTATTCATGGCAAGCAAATTCATGGCAGTATGATTCGAAGTGGCATTGATGTGTCAAATGTGGTTCTTGGGAATTCATTGATTGATATCTTGAGGGATGCAATGCAGCTTTTTGAAGAAGCCAATCAATGGGATCAAGCTCTTTGTGATGCCATGATCTCAAGTTTGGCATGGCATGGTCATTGGAGGGATTCCATGAAGGCCATGGAGAATATAGCTGCACAGAAGGTGAAGGCTTGCAGCTGGGTTGTGATCAAAGATCATGTGTATGCTTTCCAGGATGACCGGTTGCAGCATCTGCGAGGAGAAAGTTTGATTTCTGCGTTGGAGCTGATTGTTTGGGAGGTGGAATATGGGAATGAACACAAACAGTATGTTTAA

Protein sequence

LQIMMEEDEAAAAEPAPTKTHLILLLLESKVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANKAMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDILRDAMQLFEEANQWDQALCDAMISSLAWHGHWRDSMKAMENIAAQKVKACSWVVIKDHVYAFQDDRLQHLRGESLISALELIVWEVEYGNEHKQYV
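
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a sketch under stated assumptions: it uses the standard genetic code and reading frame 1, and the helper name `translate` is hypothetical. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First nine codons and last five codons of the CDS listed above.
cds_start = "CTACAAATCATGATGGAAGAAGATGAA"
cds_end = "AAACAGTATGTTTAA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments correspond to the beginning (`LQIMMEEDE`) and the end (`KQYV`, followed by the `TAA` stop codon) of the protein sequence shown above.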
BLAST of Cp4.1LG01g20930 vs. Swiss-Prot
Match: PPR73_ARATH (Pentatricopeptide repeat-containing protein At1g43980, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E58 PE=3 SV=1)

HSP 1 Score: 148.3 bits (373), Expect = 1.1e-34
Identity = 66/121 (54.55%), Positives = 92/121 (76.03%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           ++F DI +KN ++WN+CLKGLF+ GY+N A +LFDEMPERD+VSWN M+SG VS GF   
Sbjct: 72  QLFDDIPDKNTITWNVCLKGLFKNGYLNNALDLFDEMPERDVVSWNTMISGLVSCGFHEY 131

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDIL 149
            + VF +MQ    RP+E+TFSI+ S+V+C  HG+QIHG+ I SG+   N+V+ NS++D+ 
Sbjct: 132 GIRVFFDMQRWEIRPTEFTFSILASLVTCVRHGEQIHGNAICSGVSRYNLVVWNSVMDMY 191

Query: 150 R 151
           R
Sbjct: 192 R 192
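
The percentages on each HSP statistics line follow directly from the match counts. A minimal sketch for re-deriving them is shown here; the function name `check_hsp_stats` and the tolerance are assumptions made for illustration, and the pattern accepts both the "Positives" spelling and the "Postives" variant that some pages print.

```python
import re

# Matches e.g. "Identity = 66/121 (54.55%)" and "Positives = 92/121 (76.03%)".
STATS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")

def check_hsp_stats(line, tolerance=0.01):
    """Recompute each percentage and report whether it agrees with the printed one."""
    results = {}
    for label, matched, aligned, printed in STATS.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        computed = 100.0 * int(matched) / int(aligned)
        agrees = abs(computed - float(printed)) <= tolerance
        results[key] = (round(computed, 2), agrees)
    return results

stats = check_hsp_stats(
    "Identity = 66/121 (54.55%), Positives = 92/121 (76.03%), Query Frame = 1"
)
```

For the first Swiss-Prot hit, 66 identical and 92 similar positions over 121 aligned columns reproduce the printed 54.55% and 76.03%.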

BLAST of Cp4.1LG01g20930 vs. Swiss-Prot
Match: PPR57_ARATH (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 100.9 bits (250), Expect = 2.0e-20
Identity = 52/175 (29.71%), Positives = 96/175 (54.86%), Query Frame = 1

Query: 31  VFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANKA 90
           +F  +  K++VSWN  L G    G++  A+ +F EM E++I+SW  M+SG   +GF  + 
Sbjct: 342 IFEKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEG 401

Query: 91  MDVFLEMQDAGFRPSEYTFSIML---SVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLI- 150
           + +F  M+  GF P +Y FS  +   +V+    +G+Q H  +++ G D S++  GN+LI 
Sbjct: 402 LKLFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQLLKIGFD-SSLSAGNALIT 461

Query: 151 -----DILRDAMQLFEEANQWDQALCDAMISSLAWHGHWRDSMKAMENIAAQKVK 197
                 ++ +A Q+F      D    +A+I++L  HGH  +++   E +  + ++
Sbjct: 462 MYAKCGVVEEARQVFRTMPCLDSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIR 515

BLAST of Cp4.1LG01g20930 vs. Swiss-Prot
Match: PP235_ARATH (Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis thaliana GN=PCMP-E51 PE=3 SV=2)

HSP 1 Score: 99.0 bits (245), Expect = 7.7e-20
Identity = 56/167 (33.53%), Positives = 97/167 (58.08%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           ++F  +K ++++SW   +KG    G +  AR  FD+MP RD +SW  M+ G + +G  N+
Sbjct: 292 RIFRSMKARDVISWTSIVKGYVERGNLKLARTYFDQMPVRDRISWTIMIDGYLRAGCFNE 351

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIH-GKQIHGSMIRSGID----VSNVVLGNS 149
           ++++F EMQ AG  P E+T   M+SV++   H G    G  I++ ID     ++VV+GN+
Sbjct: 352 SLEIFREMQSAGMIPDEFT---MVSVLTACAHLGSLEIGEWIKTYIDKNKIKNDVVVGNA 411

Query: 150 LIDIL------RDAMQLFEEANQWDQALCDAMISSLAWHGHWRDSMK 186
           LID+         A ++F + +Q D+    AM+  LA +G  ++++K
Sbjct: 412 LIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVVGLANNGQGQEAIK 455

BLAST of Cp4.1LG01g20930 vs. Swiss-Prot
Match: PP168_ARATH (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 98.6 bits (244), Expect = 1.0e-19
Identity = 54/171 (31.58%), Positives = 93/171 (54.39%), Query Frame = 1

Query: 36  KEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANKAMDVFL 95
           K+  I  +   L G  + G +N A+N+F  + +RD+V+W  M+ G    G   +A+++F 
Sbjct: 343 KDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFR 402

Query: 96  EMQDAGFRPSEYTFSIMLSVVSCPI---HGKQIHGSMIRSGIDVSNVVLGNSLIDILRDA 155
            M   G RP+ YT + MLSV S      HGKQIHGS ++SG ++ +V + N+LI +   A
Sbjct: 403 SMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSG-EIYSVSVSNALITMYAKA 462

Query: 156 MQLFEEANQWDQALCD-------AMISSLAWHGHWRDSMKAMENIAAQKVK 197
             +   +  +D   C+       +MI +LA HGH  ++++  E +  + ++
Sbjct: 463 GNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLR 512

BLAST of Cp4.1LG01g20930 vs. Swiss-Prot
Match: PP167_ARATH (Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana GN=PCMP-E48 PE=2 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 1.3e-19
Identity = 57/179 (31.84%), Positives = 101/179 (56.42%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           + F ++  K+I  W   + G  + G +  A  LF EMPE++ VSW  +++G V  G  N+
Sbjct: 235 RCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNR 294

Query: 90  AMDVFLEMQDAGFRPSEYTFSIML---SVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLI 149
           A+D+F +M   G +P ++TFS  L   + ++   HGK+IHG MIR+ +  + +V+ +SLI
Sbjct: 295 ALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVRPNAIVI-SSLI 354

Query: 150 DILRDAMQLFEEANQWDQALCD---------AMISSLAWHGHWRDSMKAMENIAAQKVK 197
           D+   +  L  EA++    +CD          MIS+LA HG    +++ ++++   +V+
Sbjct: 355 DMYSKSGSL--EASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDDMIKFRVQ 410

BLAST of Cp4.1LG01g20930 vs. TrEMBL
Match: M5W3X4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa014901mg PE=4 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 3.0e-39
Identity = 77/119 (64.71%), Positives = 97/119 (81.51%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           KVF +I +KNIVSWNICLKG  RFG +  A N+F EMP RD+VSWN M+SGCVS GF + 
Sbjct: 69  KVFDEITDKNIVSWNICLKGFCRFGELQRAHNMFAEMPVRDVVSWNSMISGCVSGGFFDN 128

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDI 149
           A  +F +MQ AG RPSEYTFSI++S+V+C  HGKQIHG MIR+G+++SN++LGNSLID+
Sbjct: 129 AFCLFSKMQIAGMRPSEYTFSIVMSLVTCSCHGKQIHGGMIRNGMNLSNLILGNSLIDM 187

BLAST of Cp4.1LG01g20930 vs. TrEMBL
Match: B9H7L3_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0005s20740g PE=4 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 1.0e-34
Identity = 79/156 (50.64%), Positives = 107/156 (68.59%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           KVF DI  KNIVSWNICLKGL +F  ++ A ++FD+MPERD+VSWN M+SG  S G+ + 
Sbjct: 72  KVFDDISSKNIVSWNICLKGLLKFDNLSLACSVFDDMPERDVVSWNSMISGYASRGYFDC 131

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDI- 149
           A++ F EMQ  G RPSE+T+SI++SVV    HGK+IHGS++RSG+   NVVLGNSLID+ 
Sbjct: 132 ALETFWEMQKLGVRPSEFTYSILMSVVFGVRHGKEIHGSIVRSGLGALNVVLGNSLIDMY 191

Query: 150 -----LRDAMQLFEEANQWDQALCDAMISSLAWHGH 180
                L  A+ +F    + D    +++IS     G+
Sbjct: 192 GKFSSLDYALGVFLTMEELDVISWNSLISVCCQSGY 227

BLAST of Cp4.1LG01g20930 vs. TrEMBL
Match: D7KNQ4_ARALL (F9C16.15 (Fragment) OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_473794 PE=4 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 1.7e-34
Identity = 70/121 (57.85%), Positives = 94/121 (77.69%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           ++F DI +KN +SWN+CLKGLF+ G++N A +LFDEMPERD+VSWN M+SG VS GF   
Sbjct: 60  RLFDDIPDKNTISWNVCLKGLFKNGFLNNALDLFDEMPERDVVSWNTMISGFVSCGFPEY 119

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDIL 149
           A+ VF +MQ    RP+E+TFSI+ S+VSC  HG+QIHG+ I SG+  SN+V+ NSL+D+ 
Sbjct: 120 AIRVFFDMQRWVIRPTEFTFSILASLVSCVRHGEQIHGNAICSGVSKSNLVVWNSLMDMY 179

Query: 150 R 151
           R
Sbjct: 180 R 180

BLAST of Cp4.1LG01g20930 vs. TrEMBL
Match: A0A068UUJ0_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00035317001 PE=4 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 1.7e-34
Identity = 76/156 (48.72%), Positives = 106/156 (67.95%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           KVF DI  +N+ SWNICLK   + G    AR +FD+MPERD+VSWN M+SG VS GF+ +
Sbjct: 74  KVFDDIAYRNVYSWNICLKAYVQHGDFEKARLIFDKMPERDVVSWNSMISGYVSCGFSEQ 133

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLID-- 149
           A+++FL+MQ  G RPS +TFSI++S V C   GKQIH SM+R+G+D SNVV+GNSLID  
Sbjct: 134 ALELFLDMQKNGVRPSGFTFSILISSVECVFVGKQIHCSMLRNGVDFSNVVVGNSLIDMY 193

Query: 150 ----ILRDAMQLFEEANQWDQALCDAMISSLAWHGH 180
               ++  A+ +F    + D    +++IS+    G+
Sbjct: 194 GKVGVVEYALSVFWSMKEVDVMSWNSLISACCKSGY 229

BLAST of Cp4.1LG01g20930 vs. TrEMBL
Match: A0A072V224_MEDTR (PPR containing plant-like protein OS=Medicago truncatula GN=MTR_3g088835 PE=4 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 6.5e-34
Identity = 72/117 (61.54%), Positives = 89/117 (76.07%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           KVF DI  KN  SWNICLKGLF+ G V  A  +FDEMP RD+VSWN M+SG  S GF++ 
Sbjct: 71  KVFDDISYKNSTSWNICLKGLFKSGQVGKACYMFDEMPVRDVVSWNTMISGYASCGFSSH 130

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLI 147
           A+ VF+EMQ AG RPS +TFSI+ S+VS     K++HG MIRSG+++SNVV+GNSLI
Sbjct: 131 ALGVFVEMQGAGVRPSGFTFSILTSLVSSSCRAKEVHGMMIRSGMELSNVVIGNSLI 187

BLAST of Cp4.1LG01g20930 vs. TAIR10
Match: AT1G43980.1 (AT1G43980.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 148.3 bits (373), Expect = 6.2e-36
Identity = 66/121 (54.55%), Positives = 92/121 (76.03%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           ++F DI +KN ++WN+CLKGLF+ GY+N A +LFDEMPERD+VSWN M+SG VS GF   
Sbjct: 60  QLFDDIPDKNTITWNVCLKGLFKNGYLNNALDLFDEMPERDVVSWNTMISGLVSCGFHEY 119

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDIL 149
            + VF +MQ    RP+E+TFSI+ S+V+C  HG+QIHG+ I SG+   N+V+ NS++D+ 
Sbjct: 120 GIRVFFDMQRWEIRPTEFTFSILASLVTCVRHGEQIHGNAICSGVSRYNLVVWNSVMDMY 179

Query: 150 R 151
           R
Sbjct: 180 R 180

BLAST of Cp4.1LG01g20930 vs. TAIR10
Match: AT1G25360.1 (AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 100.9 bits (250), Expect = 1.1e-21
Identity = 52/175 (29.71%), Positives = 96/175 (54.86%), Query Frame = 1

Query: 31  VFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANKA 90
           +F  +  K++VSWN  L G    G++  A+ +F EM E++I+SW  M+SG   +GF  + 
Sbjct: 342 IFEKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEG 401

Query: 91  MDVFLEMQDAGFRPSEYTFSIML---SVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLI- 150
           + +F  M+  GF P +Y FS  +   +V+    +G+Q H  +++ G D S++  GN+LI 
Sbjct: 402 LKLFSCMKREGFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQLLKIGFD-SSLSAGNALIT 461

Query: 151 -----DILRDAMQLFEEANQWDQALCDAMISSLAWHGHWRDSMKAMENIAAQKVK 197
                 ++ +A Q+F      D    +A+I++L  HGH  +++   E +  + ++
Sbjct: 462 MYAKCGVVEEARQVFRTMPCLDSVSWNALIAALGQHGHGAEAVDVYEEMLKKGIR 515

BLAST of Cp4.1LG01g20930 vs. TAIR10
Match: AT3G15930.1 (AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 99.0 bits (245), Expect = 4.3e-21
Identity = 56/167 (33.53%), Positives = 97/167 (58.08%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           ++F  +K ++++SW   +KG    G +  AR  FD+MP RD +SW  M+ G + +G  N+
Sbjct: 292 RIFRSMKARDVISWTSIVKGYVERGNLKLARTYFDQMPVRDRISWTIMIDGYLRAGCFNE 351

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIH-GKQIHGSMIRSGID----VSNVVLGNS 149
           ++++F EMQ AG  P E+T   M+SV++   H G    G  I++ ID     ++VV+GN+
Sbjct: 352 SLEIFREMQSAGMIPDEFT---MVSVLTACAHLGSLEIGEWIKTYIDKNKIKNDVVVGNA 411

Query: 150 LIDIL------RDAMQLFEEANQWDQALCDAMISSLAWHGHWRDSMK 186
           LID+         A ++F + +Q D+    AM+  LA +G  ++++K
Sbjct: 412 LIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVVGLANNGQGQEAIK 455

BLAST of Cp4.1LG01g20930 vs. TAIR10
Match: AT2G22070.1 (AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 98.6 bits (244), Expect = 5.7e-21
Identity = 54/171 (31.58%), Positives = 93/171 (54.39%), Query Frame = 1

Query: 36  KEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANKAMDVFL 95
           K+  I  +   L G  + G +N A+N+F  + +RD+V+W  M+ G    G   +A+++F 
Sbjct: 343 KDLKIEGFTALLDGYIKLGDMNQAKNIFVSLKDRDVVAWTAMIVGYEQHGSYGEAINLFR 402

Query: 96  EMQDAGFRPSEYTFSIMLSVVSCPI---HGKQIHGSMIRSGIDVSNVVLGNSLIDILRDA 155
            M   G RP+ YT + MLSV S      HGKQIHGS ++SG ++ +V + N+LI +   A
Sbjct: 403 SMVGGGQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSG-EIYSVSVSNALITMYAKA 462

Query: 156 MQLFEEANQWDQALCD-------AMISSLAWHGHWRDSMKAMENIAAQKVK 197
             +   +  +D   C+       +MI +LA HGH  ++++  E +  + ++
Sbjct: 463 GNITSASRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLR 512

BLAST of Cp4.1LG01g20930 vs. TAIR10
Match: AT2G21090.1 (AT2G21090.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 98.2 bits (243), Expect = 7.4e-21
Identity = 57/179 (31.84%), Positives = 101/179 (56.42%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           + F ++  K+I  W   + G  + G +  A  LF EMPE++ VSW  +++G V  G  N+
Sbjct: 235 RCFDEMTVKDIHIWTTLISGYAKLGDMEAAEKLFCEMPEKNPVSWTALIAGYVRQGSGNR 294

Query: 90  AMDVFLEMQDAGFRPSEYTFSIML---SVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLI 149
           A+D+F +M   G +P ++TFS  L   + ++   HGK+IHG MIR+ +  + +V+ +SLI
Sbjct: 295 ALDLFRKMIALGVKPEQFTFSSCLCASASIASLRHGKEIHGYMIRTNVRPNAIVI-SSLI 354

Query: 150 DILRDAMQLFEEANQWDQALCD---------AMISSLAWHGHWRDSMKAMENIAAQKVK 197
           D+   +  L  EA++    +CD          MIS+LA HG    +++ ++++   +V+
Sbjct: 355 DMYSKSGSL--EASERVFRICDDKHDCVFWNTMISALAQHGLGHKALRMLDDMIKFRVQ 410

BLAST of Cp4.1LG01g20930 vs. NCBI nr
Match: gi|449453105|ref|XP_004144299.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial isoform X1 [Cucumis sativus])

HSP 1 Score: 195.3 bits (495), Expect = 1.3e-46
Identity = 92/119 (77.31%), Positives = 106/119 (89.08%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           KVF  IK+KNI+SWNICLKG+FRFG V+GAR+LFD MPERDIVSWN M+SG VSSGF N 
Sbjct: 72  KVFDGIKDKNIISWNICLKGMFRFGDVDGARHLFDVMPERDIVSWNTMISGYVSSGFGNS 131

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDI 149
           AM V LEMQ+AGFRPSEYTFSI+LS+VS   HGKQ+HGSMIRSG+DVS++VLGNSLID+
Sbjct: 132 AMGVSLEMQNAGFRPSEYTFSILLSLVSSAFHGKQVHGSMIRSGVDVSSMVLGNSLIDM 190

BLAST of Cp4.1LG01g20930 vs. NCBI nr
Match: gi|659081917|ref|XP_008441575.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial isoform X1 [Cucumis melo])

HSP 1 Score: 188.3 bits (477), Expect = 1.5e-44
Identity = 88/119 (73.95%), Positives = 103/119 (86.55%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           K F  IK+KNI+SWNICLKG+FRFG V+  R+LFD MPERDIVSWN M+SG VSSGFA  
Sbjct: 72  KAFDGIKDKNIISWNICLKGMFRFGDVDAPRHLFDVMPERDIVSWNTMISGYVSSGFAKS 131

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDI 149
           AM + LEMQ+AGFRPSEYTFSI+LS+VS   HGKQ+HGSMIRSG+DVS++VLGNSLID+
Sbjct: 132 AMGISLEMQNAGFRPSEYTFSILLSLVSSAFHGKQVHGSMIRSGVDVSSMVLGNSLIDM 190

BLAST of Cp4.1LG01g20930 vs. NCBI nr
Match: gi|778714984|ref|XP_011657323.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial isoform X2 [Cucumis sativus])

HSP 1 Score: 187.2 bits (474), Expect = 3.4e-44
Identity = 97/188 (51.60%), Positives = 120/188 (63.83%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           KVF  IK+KNI+SWNICLKG+FRFG V+GAR+LFD MPERDIVSWN M+SG VSSGF N 
Sbjct: 72  KVFDGIKDKNIISWNICLKGMFRFGDVDGARHLFDVMPERDIVSWNTMISGYVSSGFGNS 131

Query: 90  AMDVFLEMQDAGFRPS-------------------------EYTFSIMLSVVSCPIH--- 149
           AM V LEMQ+AGFRPS                         ++T S+++S+  C      
Sbjct: 132 AMGVSLEMQNAGFRPSCHRSGFRVLALNQFFLMRTTEHSPDQFTISMVISICCCLQELEL 191

Query: 150 GKQIHGSMIRSGIDVSNVVLGNSL-----IDILRDAMQLFEEANQWDQALCDAMISSLAW 185
           GKQI     + G   +++VL  ++      D L+ A+QLFE+ NQWD  LC+ MISS AW
Sbjct: 192 GKQIFAFCFKMGFTSNSIVLSATIDLFSKCDSLKVAVQLFEDTNQWDPVLCNVMISSFAW 251

BLAST of Cp4.1LG01g20930 vs. NCBI nr
Match: gi|659081929|ref|XP_008441582.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial isoform X2 [Cucumis melo])

HSP 1 Score: 177.6 bits (449), Expect = 2.7e-41
Identity = 93/188 (49.47%), Positives = 118/188 (62.77%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           K F  IK+KNI+SWNICLKG+FRFG V+  R+LFD MPERDIVSWN M+SG VSSGFA  
Sbjct: 72  KAFDGIKDKNIISWNICLKGMFRFGDVDAPRHLFDVMPERDIVSWNTMISGYVSSGFAKS 131

Query: 90  AMDVFLEMQDAGFRPS-------------------------EYTFSIMLSVVSCPIH--- 149
           AM + LEMQ+AGFRPS                         ++T S+++S+  C      
Sbjct: 132 AMGISLEMQNAGFRPSCHRSGFRVLALNQFFLMRTTEHSPDQFTISMVISICCCLQELEL 191

Query: 150 GKQIHGSMIRSGIDVSNVVLGNSL-----IDILRDAMQLFEEANQWDQALCDAMISSLAW 185
           GKQI    I+ G   +++VL  ++      D L+ A+QLFE+ NQ D+ LC+ MISS AW
Sbjct: 192 GKQIFAFCIKMGFTSNSIVLSATIDLFSKCDSLKVAVQLFEDTNQLDRVLCNVMISSFAW 251

BLAST of Cp4.1LG01g20930 vs. NCBI nr
Match: gi|694430780|ref|XP_009342851.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial-like [Pyrus x bretschneideri])

HSP 1 Score: 170.6 bits (431), Expect = 3.3e-39
Identity = 78/119 (65.55%), Positives = 100/119 (84.03%), Query Frame = 1

Query: 30  KVFGDIKEKNIVSWNICLKGLFRFGYVNGARNLFDEMPERDIVSWNCMMSGCVSSGFANK 89
           KV+ +I EKN+VSWNICLKGL RFG +  AR++F EMPERD+VSWN M+SG VS GF + 
Sbjct: 73  KVYDEITEKNLVSWNICLKGLSRFGEIQKARHMFAEMPERDVVSWNSMISGYVSCGFIDN 132

Query: 90  AMDVFLEMQDAGFRPSEYTFSIMLSVVSCPIHGKQIHGSMIRSGIDVSNVVLGNSLIDI 149
           A+++F +MQ AG RPSEYTFSIM+++V+   +GKQIHG MIRSG+++SNVVLGNSLID+
Sbjct: 133 ALEIFSQMQIAGVRPSEYTFSIMMTLVTSACYGKQIHGDMIRSGMNLSNVVLGNSLIDM 191

The following BLAST results are available for this feature:

BLAST of Cp4.1LG01g20930 vs. Swiss-Prot
Match Name    E-value   Identity (%)   Description
PPR73_ARATH   1.1e-34   54.55          Pentatricopeptide repeat-containing protein At1g43980, mitochondrial OS=Arabidop...
PPR57_ARATH   2.0e-20   29.71          Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana GN...
PP235_ARATH   7.7e-20   33.53          Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis th...
PP168_ARATH   1.0e-19   31.58          Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana GN...
PP167_ARATH   1.3e-19   31.84          Pentatricopeptide repeat-containing protein At2g21090 OS=Arabidopsis thaliana GN...

BLAST of Cp4.1LG01g20930 vs. TrEMBL
Match Name         E-value   Identity (%)   Description
M5W3X4_PRUPE       3.0e-39   64.71          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa014901mg PE=4 SV=1
B9H7L3_POPTR       1.0e-34   50.64          Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP...
D7KNQ4_ARALL       1.7e-34   57.85          F9C16.15 (Fragment) OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_473794 PE=...
A0A068UUJ0_COFCA   1.7e-34   48.72          Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00035317001 PE=4 SV=1
A0A072V224_MEDTR   6.5e-34   61.54          PPR containing plant-like protein OS=Medicago truncatula GN=MTR_3g088835 PE=4 SV...

BLAST of Cp4.1LG01g20930 vs. TAIR10
Match Name    E-value   Identity (%)   Description
AT1G43980.1   6.2e-36   54.55          Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G25360.1   1.1e-21   29.71          Pentatricopeptide repeat (PPR) superfamily protein
AT3G15930.1   4.3e-21   33.53          Pentatricopeptide repeat (PPR) superfamily protein
AT2G22070.1   5.7e-21   31.58          pentatricopeptide (PPR) repeat-containing protein
AT2G21090.1   7.4e-21   31.84          Pentatricopeptide repeat (PPR-like) superfamily protein

BLAST of Cp4.1LG01g20930 vs. NCBI nr
Match Name                         E-value   Identity (%)   Description
gi|449453105|ref|XP_004144299.1|   1.3e-46   77.31          PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial ...
gi|659081917|ref|XP_008441575.1|   1.5e-44   73.95          PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial ...
gi|778714984|ref|XP_011657323.1|   3.4e-44   51.60          PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial ...
gi|659081929|ref|XP_008441582.1|   2.7e-41   49.47          PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial ...
gi|694430780|ref|XP_009342851.1|   3.3e-39   65.55          PREDICTED: pentatricopeptide repeat-containing protein At1g43980, mitochondrial-...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g20930.1   Cp4.1LG01g20930.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description            Source     Source Term       Source Description    Alignment
IPR002885   Pentatricopeptide repeat   PFAM       PF13041           PPR_2                 coord: 70..114, score: 7.9
IPR002885   Pentatricopeptide repeat   TIGRFAMs   TIGR00756         TIGR00756             coord: 72..106, score: 3.3E-9; coord: 41..72, score: 2.
IPR002885   Pentatricopeptide repeat   PROFILE    PS51375           PPR                   coord: 163..197, score: 6.818; coord: 39..69, score: 8.868; coord: 70..104, score: 12
None        No IPR available           PANTHER    PTHR24015         FAMILY NOT NAMED      coord: 31..178, score: 1.3
None        No IPR available           PANTHER    PTHR24015:SF613   SUBFAMILY NOT NAMED   coord: 31..178, score: 1.3

The following gene(s) are orthologous to this gene:
Gene              Orthologue       Organism                    Block
Cp4.1LG01g20930   CmaCh04G024110   Cucurbita maxima (Rimu)     cmacpeB721
Cp4.1LG01g20930   CmoCh15G006470   Cucurbita moschata (Rifu)   cmocpeB280
Cp4.1LG01g20930   CmoCh04G025240   Cucurbita moschata (Rifu)   cmocpeB673
Cp4.1LG01g20930   Carg15758        Silver-seed gourd           carcpeB0096
Cp4.1LG01g20930   Carg01568        Silver-seed gourd           carcpeB1158

The following gene(s) are paralogous to this gene:

None