Cp4.1LG07g07710 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG07g07710
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG07 : 6821547 .. 6825462 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTTGTTAGTGTCATAATCACACTTTTCTAATGACGGGACAACGTTGGTGTGTGACAGGTGTTCTCACCTCACGAGCAAGTCAATTGTGCAAAACTTGGGCTTTATGGACAATCTCAATGTTGGATTTATAAAACGTGAGAGAGAAAATGAGTTGAATTTCGCCGGTTGCATTAAAACATATAGGCAAGAGAGTTTACGAACTCTTTGTTGGTGGAGGTTTGACAATTACTTAGTCTTCATGGTATGATCGAGGATGGCTTGAGTATGCGGCCTTAAGCGACATTGCTGCGGACAAGCATACCCGAGGCACTTAGTACATGTAGGAGGCTGGTTGTGCACTCGTCTCCAAAGTCCACACTCCTTTCTTCTGCACCCACATTATCTGTGAGATCCCGAGAGGGGAGCGAAACATTCTTTAAAAGGGCGTGGAAACCTCTCTCTATCAAAGTGGATAACATATGCTAGCGATAGACTTGAACTGTTACATCATGTCTTGATAATTGAATTTGCAAGGATTAAATTTTTTTTTAAAATTTTTTATTAACTCATTTCAAATATTTATAATTTAATTTTAGTTAAATTCATAAATATATCCTTAAATTTTGTATTTTATCTTAAAAATATTTTAATCTTTTTTGAAAATTTCAATAATATTCATAAAACTTCAAAAATCGTTATGAATATTCATGAGATGGATTTTTCGTCGGAGATGTGAGCGGTGGTTCGTCTGAATTCCGACGATCTTTACGGCCATCTGAACCGAGAATCGTTTCATGATGGAATTGAACTCTCTTGTTGCAGCGGTAGCAAATCCGCACTGCTTTCCGACAAATTTGAAACAGAAACGCCCAATTTTGTTCCAGCAGAGACGCCCCGCAGGTCGAGCGTCGGCTCGTAAGGTACGCCTCAATCACGGAAGTAATTTAGCAGTTTGAATTCAGATTCTGATGATTAATTTTCAGTTTCTTCCGCTTTTATCGTCATTTGTTGGTTTGTGGTTAGATAAAATTTGGGGGGAAATGGAAAATTGTGATAAATACGCGGATGTTATATTATCATGTTATCCAGAACTCGAATGATAATTTGTGTGTATGAACTGCTGTGATGATTGATTGTTAATTTAGATTAAGCCTAAGTTCTGAAGAATCGGTTCTGAATGTGTTGTCTTGAAACTTGGAGATGATTAATTGTATGAACTGTTAAGTAGTCTCATCAGGGACTTTGATGTTACTAGGGATTGTTCTTGTAAATAAAATTGAATGCAAAAGGTCCTCTTTGAAGTGGCTTGCTCATGATAAGATGAGGGTTTGAGGCTGTGGGTGGATGCCAGGTTAAATCTCCTTTAGGACCATGTTAGGCTAGTGGCTTGGACCTTCGTTCCAATGTTGAGTTAGAGTGAATCTAATCCTTTGATTTATTCTTGACGTTTCAAGATTGATGATAACTCTCTGCAAGAACTTAGCAGGGCATAGTTTAGGCATTATCAATACTTGAAATTAAAATCACGTTCTTTCATTTTCAAGTAAGATTTAGATCAGTTTGGGTTTAGTTCTTGATCATGTTTAGAGGTAAGAGCCCTAGCCCTTGGAAAGACCTCACTAATTGGCTTTCGTTGTTATCTGATAATTATTTGAAGTTGGGGATCGGTCAAGTCACAGATCTTTTTCTGGAAGAACAATGTGTGGGGACTGTGTTCCTTGTGGAGTTTGTCTAGAATGTACTTGTTGATATTACTCTTCAGTGACCTATGGCTATGGCTATGGCTATAGCTATGGCTTTTACGGGGGAAATTATTGCATGAAATACATTATTCAAGAGAGATGAGATTTATGCAACTGCTAGCCGATATTGTCCTCTTTGAGCTTTCCCTTGAAGTTTTTAAAACCCGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTTGTTCTCCTCCCCAACCAATGTGGGATCTCACAATCCATCCCCTTCGGGGCCCAGCATCCTTGCTGGCACACCGCCTCGTGTCCACCCCCCTTCGGGACTCAGTCTCCTCGCTGGCATGTTGCCCAGTGTCTGGCTCTGATACCATTTGGAACGGCCCAAGCCTACTGCTAGCAGATATTGTCCTCTTTGGGCTTTCCTTTTTGGGTTTCTTCTCAAGGTTGTTAAAACTCGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTTTGTTCTCCTCCCCAACCAATGTGGGATCTCACAATCCATCCCCTTCGGGGCCCAGCATCCTTGCTGGCACACCGCCTCGTGTCCACCCCCCTTCGGGACTCAGTCTCCTCGCTGGCATGTTGCCCAGTGTCTGGCTCTGATACCATTTGGAACGGCCCAAGCCTACTGCTAGCAGATATTGTCCTCTTTGGGCTTTCCTTTTTGGGTTTCTTCTCAAGGTTGTTAAAACTCGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGAATGTTCCGTTCTCCTCTCCAACTGACATGGGATCTCACAATTTATGAGTGGACAGATAACGCATTCGATTCAAGGAAAATACTCTGAGATGAAGGCATATTAACAAAAGGACTTACATATCACAATTATTTCTTGGGGAAATCCGTACAGCTAATGGAGATCTGTACAATTTGGAGCTTTAAAGTAACAAAACAGAAATAGAGTGATTTATCAACTGATGAACATGTTATAGATATTCACACTTCCCTAATTTTGTTTATTTTATACGCAATGTACTTAGGATAGATACTCCGGCAGAACAAAGAAAAAGGAAGCTTTGCTGGCGATGATTTTTCTATCAACCATTCCCAAGTGATTGAGAAATTGAGCCAAAGAAGAACGCCTGTTTTAGCTCAAGAGATTTTCTTGGAGCTGAAACCTGAAGAATTTCCATTAAACAACTCGACGTTGTCTTCTCTTATGGTACGCTACATAGATGGTGGTCTTCTCCTCCAAGCAGAAGCAATTTGGGAAGAAATGTTAAACAGTTGTTTCGTGCCTTCCGTTTTAGTAATTTCAAAGTTACTTAACACATACGGAAAGATGAGACGCTTCGATGATATAATTAAAGTTCTGGATCAGGTAAAGATAAGGTATTTACACTTACTCCCTGAGGCATACTCATTAGCCATATCATGTTTTGGGAAGCATGGTCAATTAGAATTGATGGAAAACACTTTGAGGGAAATGGTTTCCAGTGGTGTTCCAGTTGATTCCCGTACAGGAAATTCCTTTATTGTATACTACAGCATTTTTGGTTCTTTGATGGAGATGGAAACCGCCTATGGCCGCCTTAAAAGGTCTCGATTTCTAATCGAGAAGGAAGGCATCCTGGCAATGGCGTTCGCCTACATAAGGGAAAGAAAATTTTACAGATTAGGGGAATTTCTTAGGGATGTTGGTCTTAGAAGGAAAAACGTGGGGAATCTGTTATGGAATCTTCTACTTTTATCTTATGCTGCAAATTTCAAAATGAAAAGCTTGCAGCGTGAGTTTCTGGCAATGGTTGAAGCTGGATTTAATCCGGATATTACAACGTTTAATATTAGAGCTGTGGCATTTTCAAGAATGGATTTGTTATGGGATCTTCATCTTAGCCTTGAACATATGAAACACATGAAGATCGAACCCGATCTCGTGTCGTACGGTTGTGTTGTTGATGCATATGTAGATAGAAGACTTGGAAGGAATTTGGAATTCGTTTTGAGCAAAATGAATCCAGATCAACCACCAATATCATTAACTGATCCGTTTGTTTTTGAGGCATTGGGTAAAGGAGATTTCCACATGAGCTCTGAGGCGTTCATGCAGTTCCAGAAGCAGAAGAAATGGACTTACAGAGAGTTGATATCATTGTATCTGAAAAAGCAACACAGAAGAGATCAAGTCTTTTGGAATTACTAG

mRNA sequence

ATGGCGGTAGCAAATCCGCACTGCTTTCCGACAAATTTGAAACAGAAACGCCCAATTTTGTTCCAGCAGAGACGCCCCGCAGGTCGAGCGTCGGCTCGTAAGATGGAAACCGCCTATGGCCGCCTTAAAAGGTCTCGATTTCTAATCGAGAAGGAAGGCATCCTGGCAATGGCGTTCGCCTACATAAGGGAAAGAAAATTTTACAGATTAGGGGAATTTCTTAGGGATGTTGGTCTTAGAAGGAAAAACGTGGGGAATCTGTTATGGAATCTTCTACTTTTATCTTATGCTGCAAATTTCAAAATGAAAAGCTTGCAGCGTGAGTTTCTGGCAATGGTTGAAGCTGGATTTAATCCGGATATTACAACGTTTAATATTAGAGCTGTGGCATTTTCAAGAATGGATTTGTTATGGGATCTTCATCTTAGCCTTGAACATATGAAACACATGAAGATCGAACCCGATCTCGTGTCGTACGGTTGTGTTGTTGATGCATATGTAGATAGAAGACTTGGAAGGAATTTGGAATTCGTTTTGAGCAAAATGAATCCAGATCAACCACCAATATCATTAACTGATCCGTTTGTTTTTGAGGCATTGGGTAAAGGAGATTTCCACATGAGCTCTGAGGCGTTCATGCAGTTCCAGAAGCAGAAGAAATGGACTTACAGAGAGTTGATATCATTGTATCTGAAAAAGCAACACAGAAGAGATCAAGTCTTTTGGAATTACTAG

Coding sequence (CDS)

ATGGCGGTAGCAAATCCGCACTGCTTTCCGACAAATTTGAAACAGAAACGCCCAATTTTGTTCCAGCAGAGACGCCCCGCAGGTCGAGCGTCGGCTCGTAAGATGGAAACCGCCTATGGCCGCCTTAAAAGGTCTCGATTTCTAATCGAGAAGGAAGGCATCCTGGCAATGGCGTTCGCCTACATAAGGGAAAGAAAATTTTACAGATTAGGGGAATTTCTTAGGGATGTTGGTCTTAGAAGGAAAAACGTGGGGAATCTGTTATGGAATCTTCTACTTTTATCTTATGCTGCAAATTTCAAAATGAAAAGCTTGCAGCGTGAGTTTCTGGCAATGGTTGAAGCTGGATTTAATCCGGATATTACAACGTTTAATATTAGAGCTGTGGCATTTTCAAGAATGGATTTGTTATGGGATCTTCATCTTAGCCTTGAACATATGAAACACATGAAGATCGAACCCGATCTCGTGTCGTACGGTTGTGTTGTTGATGCATATGTAGATAGAAGACTTGGAAGGAATTTGGAATTCGTTTTGAGCAAAATGAATCCAGATCAACCACCAATATCATTAACTGATCCGTTTGTTTTTGAGGCATTGGGTAAAGGAGATTTCCACATGAGCTCTGAGGCGTTCATGCAGTTCCAGAAGCAGAAGAAATGGACTTACAGAGAGTTGATATCATTGTATCTGAAAAAGCAACACAGAAGAGATCAAGTCTTTTGGAATTACTAG

Protein sequence

MAVANPHCFPTNLKQKRPILFQQRRPAGRASARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWNLLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHMKIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSEAFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY
BLAST of Cp4.1LG07g07710 vs. Swiss-Prot
Match: PP263_ARATH (Pentatricopeptide repeat-containing protein At3g42630 OS=Arabidopsis thaliana GN=At3g42630 PE=2 SV=2)

HSP 1 Score: 304.7 bits (779), Expect = 9.2e-82
Identity = 142/214 (66.36%), Postives = 179/214 (83.64%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  KME AYGR+K+   +IE+E I A+  AY+++RKFYRL EFL DVGL R+N+GN+LWN
Sbjct: 202 SLDKMEKAYGRVKKFGIVIEEEEIRAVVLAYLKQRKFYRLREFLSDVGLGRRNLGNMLWN 261

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
            +LLSYAA+FKMKSLQREF+ M++AGF+PD+TTFNIRA+AFSRM L WDLHL+LEHM+ +
Sbjct: 262 SVLLSYAADFKMKSLQREFIGMLDAGFSPDLTTFNIRALAFSRMALFWDLHLTLEHMRRL 321

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
            I PDLV++GCVVDAY+D+RL RNLEFV ++MN D  P+ LTDP  FE LGKGDFH+SSE
Sbjct: 322 NIVPDLVTFGCVVDAYMDKRLARNLEFVYNRMNLDDSPLVLTDPLAFEVLGKGDFHLSSE 381

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           A ++F  +K WTYR+LI +YLKK+ RRDQ+FWNY
Sbjct: 382 AVLEFSPRKNWTYRKLIGVYLKKKLRRDQIFWNY 415

BLAST of Cp4.1LG07g07710 vs. Swiss-Prot
Match: PP310_ARATH (Pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Arabidopsis thaliana GN=At4g14190 PE=2 SV=2)

HSP 1 Score: 72.8 bits (177), Expect = 5.9e-12
Identity = 61/208 (29.33%), Postives = 88/208 (42.31%), Query Frame = 1

Query: 34  KMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWNLLL 93
           KME    ++ R    +++  +  +A  YI    F RL +  R +   R     L W L L
Sbjct: 290 KMEETCNKIIRFGISLDEGLVRKLANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRL 349

Query: 94  LSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHMKIE 153
           L +A     K L      M EA    + T  NI  +A+S+M     + L L  ++   ++
Sbjct: 350 LCHARLVSRKGLDYVVKEMEEARVPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVK 409

Query: 154 PDLVSYGCVVDAYVDRRLGRNLEFVLSKMN-PDQPPISLTDPFVFEALGKGDFHMSSEAF 213
            DLV+ G V D    R  G  +     K+   D+P    TDP V  A GKG F  S E  
Sbjct: 410 LDLVTVGIVFDLSEARFDGTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEV 469

Query: 214 MQFQ------KQKKWTYRELISLYLKKQ 235
                     + K WTY+ L+ L +K Q
Sbjct: 470 KNQSLGTRDGESKSWTYQYLMELVVKNQ 497

BLAST of Cp4.1LG07g07710 vs. TrEMBL
Match: A0A0A0LMJ8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G070240 PE=4 SV=1)

HSP 1 Score: 363.2 bits (931), Expect = 2.4e-97
Identity = 174/188 (92.55%), Postives = 183/188 (97.34%), Query Frame = 1

Query: 57  MAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWNLLLLSYAANFKMKSLQREFLAMVEAG 116
           MAFAYIR+RKFYRLGEFLRDVGL RKNVGNLLWNLLLLSYAANFKMKSLQREFL MV+AG
Sbjct: 1   MAFAYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLQMVDAG 60

Query: 117 FNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHMKIEPDLVSYGCVVDAYVDRRLGRNLE 176
           FNPD+TTFNIRA+AFSRMDLLWDLHLSLEHMKHM IEPDLV+YGCVVDAYVDRRLGRNLE
Sbjct: 61  FNPDLTTFNIRALAFSRMDLLWDLHLSLEHMKHMNIEPDLVTYGCVVDAYVDRRLGRNLE 120

Query: 177 FVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSEAFMQFQKQKKWTYRELISLYLKKQHR 236
           F+LSKMNPDQPP+SLTD FVFEALGKGDFHMSSEAFMQF+KQKKWTYRELISLYLKK HR
Sbjct: 121 FILSKMNPDQPPVSLTDSFVFEALGKGDFHMSSEAFMQFRKQKKWTYRELISLYLKKHHR 180

Query: 237 RDQVFWNY 245
           R+QVFWNY
Sbjct: 181 RNQVFWNY 188

BLAST of Cp4.1LG07g07710 vs. TrEMBL
Match: B9RDM7_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1614490 PE=4 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 9.9e-91
Identity = 155/214 (72.43%), Postives = 192/214 (89.72%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S   ME+AY RLKRSR L+++EGI A++ AY++ERKFYRLGEFLRDVGL RK+VGNL+WN
Sbjct: 214 SLTDMESAYSRLKRSRHLVDREGIRAVSLAYVKERKFYRLGEFLRDVGLGRKDVGNLIWN 273

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
            LLLS+AANFKMKSLQREFL M+EAGF+PD+TTFNIRA+AFSRM LLWDLHL+LEHMKH 
Sbjct: 274 FLLLSFAANFKMKSLQREFLRMLEAGFHPDVTTFNIRALAFSRMSLLWDLHLTLEHMKHE 333

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
           K+ PD+V+YGC+VDAY+DRRLG+NL+F + KMN D  P+ LTDPFVFE LGKGDFH S+E
Sbjct: 334 KVSPDIVTYGCIVDAYLDRRLGKNLDFAIKKMNLDGSPVLLTDPFVFEVLGKGDFHSSAE 393

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           AF++F++Q+KWTYREL+S+YL+KQ+R +Q+FWNY
Sbjct: 394 AFLEFKRQRKWTYRELVSIYLRKQYRSNQIFWNY 427

BLAST of Cp4.1LG07g07710 vs. TrEMBL
Match: M5XQ40_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006191mg PE=4 SV=1)

HSP 1 Score: 341.3 bits (874), Expect = 9.9e-91
Identity = 162/214 (75.70%), Postives = 192/214 (89.72%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  +METAYGRLKRSRFLIE+EGI AM+FAY+++RKFYRL E L++VGL R+N+GNL WN
Sbjct: 210 SLTEMETAYGRLKRSRFLIEEEGIRAMSFAYLKKRKFYRLAELLKNVGLGRRNLGNLSWN 269

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
           LLLLSYAA+FKMKSLQREFL MVEAGF+PD+TTFNIRA+AFSRM LLWDLHLSLEHMKH 
Sbjct: 270 LLLLSYAADFKMKSLQREFLRMVEAGFHPDLTTFNIRALAFSRMSLLWDLHLSLEHMKHE 329

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
           K+ PDLV+ GCVVDAY++RRLG+N+ F L+KMN D  P+ LTDPFVFE LGKGDFH SSE
Sbjct: 330 KVFPDLVTCGCVVDAYLERRLGKNMYFALNKMNLDDSPLILTDPFVFEVLGKGDFHASSE 389

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           AF++FQ Q++WTYR LIS+YLKKQ+RR+Q+FWNY
Sbjct: 390 AFLEFQSQREWTYRRLISVYLKKQYRRNQIFWNY 423

BLAST of Cp4.1LG07g07710 vs. TrEMBL
Match: A0A067K0T9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14439 PE=4 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 1.3e-90
Identity = 161/214 (75.23%), Postives = 191/214 (89.25%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  +ME+A  RLKRSR LI++EGI AMAFA+IRERKFYRLGEFLRDVGL R++VGNLLWN
Sbjct: 216 SLTEMESACNRLKRSRHLIDREGIRAMAFAFIRERKFYRLGEFLRDVGLGRRDVGNLLWN 275

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
            LLLSYAANFKMKSLQREFL M EAGF PD+TTFNIRA+AFSRM L WDLHLSLEHMK+ 
Sbjct: 276 FLLLSYAANFKMKSLQREFLRMSEAGFRPDLTTFNIRALAFSRMSLFWDLHLSLEHMKYE 335

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
           K+ PDLV+YGC VDAY+DRRLG+NL+FVL+KMN D+ P+  TDPFVFE LGKGDFH S+E
Sbjct: 336 KVSPDLVTYGCFVDAYLDRRLGKNLDFVLNKMNLDESPVVSTDPFVFEVLGKGDFHSSAE 395

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           AF++F+++KKWTY+EL+SLYL+KQ+R +Q+FWNY
Sbjct: 396 AFLEFKRKKKWTYKELVSLYLRKQYRSNQIFWNY 429

BLAST of Cp4.1LG07g07710 vs. TrEMBL
Match: B9N840_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0018s14360g PE=4 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 2.2e-90
Identity = 161/214 (75.23%), Postives = 186/214 (86.92%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  +ME AY RLKRSR LIE+EGI AM+FAYI+ERKFY L EFLRDVGL RKN+GNL+WN
Sbjct: 215 SLAEMEAAYDRLKRSRLLIEREGIRAMSFAYIKERKFYGLSEFLRDVGLGRKNLGNLIWN 274

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
           LLLLSY+ANFKMK+LQREFL M+EAGF+PD+TTFNIRA+AFSRM LLWDLHL LEHMKH 
Sbjct: 275 LLLLSYSANFKMKTLQREFLNMLEAGFHPDLTTFNIRALAFSRMSLLWDLHLGLEHMKHD 334

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
           K+ PDLV+YGC+VDAY+DRRL RNLEF LSKM+ D  P+  TDPFVFE  GKGDFH SSE
Sbjct: 335 KVAPDLVTYGCIVDAYLDRRLVRNLEFALSKMHVDNSPVLSTDPFVFEVFGKGDFHSSSE 394

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           AFM+F++Q+KWTYRELI +YL+KQHR   +FWNY
Sbjct: 395 AFMEFKRQRKWTYRELIKIYLRKQHRSKHIFWNY 428

BLAST of Cp4.1LG07g07710 vs. TAIR10
Match: AT3G42630.1 (AT3G42630.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 304.7 bits (779), Expect = 5.2e-83
Identity = 142/214 (66.36%), Postives = 179/214 (83.64%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  KME AYGR+K+   +IE+E I A+  AY+++RKFYRL EFL DVGL R+N+GN+LWN
Sbjct: 202 SLDKMEKAYGRVKKFGIVIEEEEIRAVVLAYLKQRKFYRLREFLSDVGLGRRNLGNMLWN 261

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
            +LLSYAA+FKMKSLQREF+ M++AGF+PD+TTFNIRA+AFSRM L WDLHL+LEHM+ +
Sbjct: 262 SVLLSYAADFKMKSLQREFIGMLDAGFSPDLTTFNIRALAFSRMALFWDLHLTLEHMRRL 321

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
            I PDLV++GCVVDAY+D+RL RNLEFV ++MN D  P+ LTDP  FE LGKGDFH+SSE
Sbjct: 322 NIVPDLVTFGCVVDAYMDKRLARNLEFVYNRMNLDDSPLVLTDPLAFEVLGKGDFHLSSE 381

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           A ++F  +K WTYR+LI +YLKK+ RRDQ+FWNY
Sbjct: 382 AVLEFSPRKNWTYRKLIGVYLKKKLRRDQIFWNY 415

BLAST of Cp4.1LG07g07710 vs. TAIR10
Match: AT4G14190.1 (AT4G14190.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 72.8 bits (177), Expect = 3.3e-13
Identity = 61/208 (29.33%), Postives = 88/208 (42.31%), Query Frame = 1

Query: 34  KMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWNLLL 93
           KME    ++ R    +++  +  +A  YI    F RL +  R +   R     L W L L
Sbjct: 290 KMEETCNKIIRFGISLDEGLVRKLANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRL 349

Query: 94  LSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHMKIE 153
           L +A     K L      M EA    + T  NI  +A+S+M     + L L  ++   ++
Sbjct: 350 LCHARLVSRKGLDYVVKEMEEARVPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVK 409

Query: 154 PDLVSYGCVVDAYVDRRLGRNLEFVLSKMN-PDQPPISLTDPFVFEALGKGDFHMSSEAF 213
            DLV+ G V D    R  G  +     K+   D+P    TDP V  A GKG F  S E  
Sbjct: 410 LDLVTVGIVFDLSEARFDGTGVFMTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEV 469

Query: 214 MQFQ------KQKKWTYRELISLYLKKQ 235
                     + K WTY+ L+ L +K Q
Sbjct: 470 KNQSLGTRDGESKSWTYQYLMELVVKNQ 497

BLAST of Cp4.1LG07g07710 vs. TAIR10
Match: AT2G18940.1 (AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 52.0 bits (123), Expect = 6.1e-07
Identity = 31/101 (30.69%), Postives = 48/101 (47.52%), Query Frame = 1

Query: 89  WNLLLLSYAANFKMKSL---QREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLE 148
           W LL     ANFK ++L   +R F    + G+ PD+  FN     F+R ++       LE
Sbjct: 596 WMLLRTLLLANFKCRALAGSERAFTLFKKHGYKPDMVIFNSMLSIFTRNNMYDQAEGILE 655

Query: 149 HMKHMKIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQ 187
            ++   + PDLV+Y  ++D YV R      E +L  +   Q
Sbjct: 656 SIREDGLSPDLVTYNSLMDMYVRRGECWKAEEILKTLEKSQ 696

BLAST of Cp4.1LG07g07710 vs. TAIR10
Match: AT2G26790.1 (AT2G26790.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 49.3 bits (116), Expect = 3.9e-06
Identity = 39/169 (23.08%), Postives = 72/169 (42.60%), Query Frame = 1

Query: 8   CFPTNLKQKRPIL--FQQRRPAGRASARK-------METAYGRLKRSRFLIEKEGILAMA 67
           CF   +K+        +Q+ P  +AS  K        + AY    R  + + K   + + 
Sbjct: 504 CFARKVKEAEDFFSSLEQKCPENKASFVKGYCEAGLSKKAYKAFVRLEYPLRKSVYIKLF 563

Query: 68  FAYIRERKFYRLGEFLRDVGLRRKNVGNLLWNLLLLSYAANFKMKSLQREFLAMVEAGFN 127
           F+   E    +  + L+ +   R   G  +   ++ ++     ++  Q  F  MVE G  
Sbjct: 564 FSLCIEGYLEKAHDVLKKMSAYRVEPGRSMCGKMIGAFCKLNNVREAQVLFDTMVERGLI 623

Query: 128 PDITTFNIRAVAFSRMDLLWDLHLSLEHMKHMKIEPDLVSYGCVVDAYV 168
           PD+ T+ I    + R++ L       E MK   I+PD+V+Y  ++D Y+
Sbjct: 624 PDLFTYTIMIHTYCRLNELQKAESLFEDMKQRGIKPDVVTYTVLLDRYL 672

BLAST of Cp4.1LG07g07710 vs. NCBI nr
Match: gi|778667690|ref|XP_011648974.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cucumis sativus])

HSP 1 Score: 401.7 bits (1031), Expect = 8.9e-109
Identity = 194/211 (91.94%), Postives = 206/211 (97.63%), Query Frame = 1

Query: 34  KMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWNLLL 93
           +METAYGRLKRSRFLIEK+GI+AMAFAYIR+RKFYRLGEFLRDVGL RKNVGNLLWNLLL
Sbjct: 213 EMETAYGRLKRSRFLIEKKGIMAMAFAYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLL 272

Query: 94  LSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHMKIE 153
           LSYAANFKMKSLQREFL MV+AGFNPD+TTFNIRA+AFSRMDLLWDLHLSLEHMKHM IE
Sbjct: 273 LSYAANFKMKSLQREFLQMVDAGFNPDLTTFNIRALAFSRMDLLWDLHLSLEHMKHMNIE 332

Query: 154 PDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSEAFM 213
           PDLV+YGCVVDAYVDRRLGRNLEF+LSKMNPDQPP+SLTD FVFEALGKGDFHMSSEAFM
Sbjct: 333 PDLVTYGCVVDAYVDRRLGRNLEFILSKMNPDQPPVSLTDSFVFEALGKGDFHMSSEAFM 392

Query: 214 QFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           QF+KQKKWTYRELISLYLKK HRR+QVFWNY
Sbjct: 393 QFRKQKKWTYRELISLYLKKHHRRNQVFWNY 423

BLAST of Cp4.1LG07g07710 vs. NCBI nr
Match: gi|659069563|ref|XP_008450656.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cucumis melo])

HSP 1 Score: 401.4 bits (1030), Expect = 1.2e-108
Identity = 196/214 (91.59%), Postives = 207/214 (96.73%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  +METAYGRLKRSRFLIEK+GI+AMAFAYIR+RKFYRLGEFLRDVGL RKNVGNLLWN
Sbjct: 212 SLLEMETAYGRLKRSRFLIEKKGIMAMAFAYIRKRKFYRLGEFLRDVGLGRKNVGNLLWN 271

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
           LLLLSYAANFKMKSLQREFL MVEAGFNPD+TTFNIRA+AFSRMDLLWDLHLSLEHMKHM
Sbjct: 272 LLLLSYAANFKMKSLQREFLQMVEAGFNPDLTTFNIRALAFSRMDLLWDLHLSLEHMKHM 331

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
            IEPDLV+YGCVVDAYVDRRLGRNLEF+LSKMNP QPP+SLTD FVFEALGKGDFHMSSE
Sbjct: 332 NIEPDLVTYGCVVDAYVDRRLGRNLEFILSKMNPVQPPVSLTDSFVFEALGKGDFHMSSE 391

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           AFMQF+KQKKWTYRELISLYLKKQHRR+QVFWNY
Sbjct: 392 AFMQFRKQKKWTYRELISLYLKKQHRRNQVFWNY 425

BLAST of Cp4.1LG07g07710 vs. NCBI nr
Match: gi|700206087|gb|KGN61206.1| (hypothetical protein Csa_2G070240 [Cucumis sativus])

HSP 1 Score: 363.2 bits (931), Expect = 3.5e-97
Identity = 174/188 (92.55%), Postives = 183/188 (97.34%), Query Frame = 1

Query: 57  MAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWNLLLLSYAANFKMKSLQREFLAMVEAG 116
           MAFAYIR+RKFYRLGEFLRDVGL RKNVGNLLWNLLLLSYAANFKMKSLQREFL MV+AG
Sbjct: 1   MAFAYIRKRKFYRLGEFLRDVGLGRKNVGNLLWNLLLLSYAANFKMKSLQREFLQMVDAG 60

Query: 117 FNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHMKIEPDLVSYGCVVDAYVDRRLGRNLE 176
           FNPD+TTFNIRA+AFSRMDLLWDLHLSLEHMKHM IEPDLV+YGCVVDAYVDRRLGRNLE
Sbjct: 61  FNPDLTTFNIRALAFSRMDLLWDLHLSLEHMKHMNIEPDLVTYGCVVDAYVDRRLGRNLE 120

Query: 177 FVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSEAFMQFQKQKKWTYRELISLYLKKQHR 236
           F+LSKMNPDQPP+SLTD FVFEALGKGDFHMSSEAFMQF+KQKKWTYRELISLYLKK HR
Sbjct: 121 FILSKMNPDQPPVSLTDSFVFEALGKGDFHMSSEAFMQFRKQKKWTYRELISLYLKKHHR 180

Query: 237 RDQVFWNY 245
           R+QVFWNY
Sbjct: 181 RNQVFWNY 188

BLAST of Cp4.1LG07g07710 vs. NCBI nr
Match: gi|470131016|ref|XP_004301396.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g42630 [Fragaria vesca subsp. vesca])

HSP 1 Score: 345.1 bits (884), Expect = 9.8e-92
Identity = 163/214 (76.17%), Postives = 193/214 (90.19%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  +METAY RLKRSRFLIE+EGI AM+ AY+++RKFY L EFL+ VGL R+N+GNLLWN
Sbjct: 211 SLTEMETAYDRLKRSRFLIEEEGIRAMSLAYLKKRKFYSLAEFLKSVGLGRRNLGNLLWN 270

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
           LLLLSYAANFKMK+LQREFL MVEAGF+PD+TTFNIRA+AFSRM LLWDLHL+LEHMKH+
Sbjct: 271 LLLLSYAANFKMKTLQREFLRMVEAGFHPDLTTFNIRALAFSRMSLLWDLHLTLEHMKHV 330

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
           K+ PDLV+ GC+VDAY+DRRLGRNL F L+KMN D  P+ LTDPFVFE LGKGDFH SSE
Sbjct: 331 KVVPDLVTCGCIVDAYLDRRLGRNLYFALNKMNLDDSPVVLTDPFVFEVLGKGDFHASSE 390

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           AF++F+KQK+WTY++LIS+YLKKQ+RRDQ+FWNY
Sbjct: 391 AFLEFRKQKEWTYQKLISVYLKKQYRRDQIFWNY 424

BLAST of Cp4.1LG07g07710 vs. NCBI nr
Match: gi|694366990|ref|XP_009362023.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g42630 isoform X1 [Pyrus x bretschneideri])

HSP 1 Score: 345.1 bits (884), Expect = 9.8e-92
Identity = 164/214 (76.64%), Postives = 191/214 (89.25%), Query Frame = 1

Query: 31  SARKMETAYGRLKRSRFLIEKEGILAMAFAYIRERKFYRLGEFLRDVGLRRKNVGNLLWN 90
           S  +METAYGRLKRSRFLIE+EGI AM+FAY++  KFY L EFL+DVGL R+N+GNLLWN
Sbjct: 199 SLTEMETAYGRLKRSRFLIEEEGIRAMSFAYLKRSKFYELAEFLKDVGLGRRNLGNLLWN 258

Query: 91  LLLLSYAANFKMKSLQREFLAMVEAGFNPDITTFNIRAVAFSRMDLLWDLHLSLEHMKHM 150
           LLLLSYAANFKMKSLQREFL MVEAGF+PD+TTFNIRA+ FS+M LLWDLHLSLEHMKH 
Sbjct: 259 LLLLSYAANFKMKSLQREFLRMVEAGFHPDLTTFNIRALTFSKMSLLWDLHLSLEHMKHE 318

Query: 151 KIEPDLVSYGCVVDAYVDRRLGRNLEFVLSKMNPDQPPISLTDPFVFEALGKGDFHMSSE 210
           K+ PDLV+ GCVVDAY+DRRLGRNL F L+KMN D  P+ LTDPFVFE LGKGDFH SSE
Sbjct: 319 KVVPDLVTCGCVVDAYLDRRLGRNLYFALNKMNLDDSPLVLTDPFVFEVLGKGDFHASSE 378

Query: 211 AFMQFQKQKKWTYRELISLYLKKQHRRDQVFWNY 245
           AF++FQ+Q++WTYR+LIS+YLKK +RR+Q+FWNY
Sbjct: 379 AFLEFQRQREWTYRKLISVYLKKTYRRNQIFWNY 412

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP263_ARATH9.2e-8266.36Pentatricopeptide repeat-containing protein At3g42630 OS=Arabidopsis thaliana GN... [more]
PP310_ARATH5.9e-1229.33Pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LMJ8_CUCSA2.4e-9792.55Uncharacterized protein OS=Cucumis sativus GN=Csa_2G070240 PE=4 SV=1[more]
B9RDM7_RICCO9.9e-9172.43Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
M5XQ40_PRUPE9.9e-9175.70Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006191mg PE=4 SV=1[more]
A0A067K0T9_JATCU1.3e-9075.23Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14439 PE=4 SV=1[more]
B9N840_POPTR2.2e-9075.23Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
AT3G42630.15.2e-8366.36 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G14190.13.3e-1329.33 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT2G18940.16.1e-0730.69 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G26790.13.9e-0623.08 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778667690|ref|XP_011648974.1|8.9e-10991.94PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cucumis s... [more]
gi|659069563|ref|XP_008450656.1|1.2e-10891.59PREDICTED: pentatricopeptide repeat-containing protein At3g42630-like [Cucumis m... [more]
gi|700206087|gb|KGN61206.1|3.5e-9792.55hypothetical protein Csa_2G070240 [Cucumis sativus][more]
gi|470131016|ref|XP_004301396.1|9.8e-9276.17PREDICTED: pentatricopeptide repeat-containing protein At3g42630 [Fragaria vesca... [more]
gi|694366990|ref|XP_009362023.1|9.8e-9276.64PREDICTED: pentatricopeptide repeat-containing protein At3g42630 isoform X1 [Pyr... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0009507 chloroplast
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG07g07710.1Cp4.1LG07g07710.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 31..244
score: 3.2E
NoneNo IPR availablePANTHERPTHR24015:SF649SUBFAMILY NOT NAMEDcoord: 31..244
score: 3.2E