Cp4.1LG10g04050 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG10g04050
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG10 : 1694590 .. 1697155 (+)
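
The location string above can be split into its sequence ID, coordinates, and strand. The following is a minimal sketch, not part of the original record: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG10 : 1694590 .. 1697155 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene span is 2566 bp on the plus strand.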
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACCGCTCTCTCAACGCCATGGAACACTCAGCTCAGAGAATTAGCAAAACGATGCCAATTCCTTCAAGCTCTAAGTCTCTACGCCCAAATGCTTCGCCATGGCGATCACCCCAATGCCTTCACTTTCCCGTTTGCCCTCAAATCCTGCGCGGCCCTCTCGCTCCCCATACTCGGCGGCCAATTTCATGGTCAAATTATCAAAGTTGGCTGTGAATCTGAACCTTTTGTGCAAACTGGTTTGATTTCTATGTACTGCAGAGGTTCATTACTCGGAAATGCCCGTAAAGTGTTTGATGAAAAATCCCAGTCCAGAAAGCTTACCGTTTGCTACAATGCTTTGATTTCCGGCTATGTTTCGAATTCTAAAAGTTCTGACGCGGTTCTTTTGTTTCGCCAAATGAATGAAGAGGGTGTCCCAGTTAATTCAGTTACATTGCTGGGTTTGATCCCAGTGTGTGTATCTCCGATTAATTTGGAGCTTGGATTGTCTCTGCATTGCTCAACATTGAAGTATGGATTGGATTCAGATGTCTCCGTTGTTAACTGTTTCATTACTATGTACATGAAATGTGGCTCGGTTAATCATGCACAGAACCTGTTTGATAAAATGCCTGAGAAGGGTTTGATTTCTTGGAACGCTATGGTTTCTGGGTACGCACAAAATGGGCTGGCAACTAATGTTTTGGAGCTCTATCATAACATGGAGTTACATGGGATTCACCCGGATCCCTTCACTCTTGTTGGGGTTTTATCATCTTGCGCCAACCTTGGGGCTCAGAGTGTTGGTCGTGAGGTAGAGCTTAAGATCCAAGCAAGTGGGTTTACCAATAATCAGTTTCTGAATAATGCTTTGATTAATATGTACGCAAGGTGTGGAAATTTAACCAAGGCACAAGCATTGTTCGATGAAATGCCAGAAAGAACATTAGTTTCATGGACAGCAATAATAGGTGGCTATGGAATGCATGGACATGGAGAAATTGCAGTGCAACTATTCGAAGACATGATAAGGAGTGGCATTGTACCTGATGGAACTGCGTTTGTGAGTGTCTTGTCTGCCTGTAGCCATGCAGGGCTCACTTCTCAGGGCATGGAATATTTCAAGATGATGGGAAGAAACTATCAATTGGAACCAGGTCCAGAGCATTATTCGTGCATGGTGGATCTTCTGGGGCGAGCAGGGCGGCTAAATGAAGCTCGGAATCTCATTGAATCCATGCCAATAGAGCCTGATGGTGCTGTCTGGGGAGCTCTTCTGGGTGCTTGTAAGATCCACCAGAATGTCAAGTTGGCAGAGTTGGCTTTTGAACGTGTGGTCGAGCTTGAACCTGCAAACATAGGATACTATGTGTTATTATCAAACATTTATAATGATACCAAGAACTCAAAAGGGGTTTTGAGGATCCGGATTATGATGAAGGAGAGGAAGCTGAAGAAGGATCCTGGGTGTAGCTATGTTGAATTGAAGGGCAGAGTTCATCCATTTGTAGTTGGGGATAGAAGCCATCCCCAGGCTGAAGAGATATATAGAGTGTTGGAGGAGTTAGAAGCGTTGGTGCATGAATTTGGAGAGGCTAAAAGAGCTGATAGAGAAGAAAGCAACAAAGATCTGTTTACTGGGGCTGGAGTTCATAGTGAAAAATTGGCTGTTGCCTTTGGACTCTTGAATACCACAGCTGGGACTGAAGTTGTGGTCATCAAAAACCTTAGGATATGTGAAGATTGTCACTTGTTTTTCAAGATTGTTAGTAAAATTGTTCATCGTCAACTAACTGTTCGAGACGCTACTCGCTTCCATCATTTTAGAAATGGGAGCTGTTCTTGTAAGGATTATTGGTAAAATCATGGACTTATTTCCTTTCCGGTTAAATTATAAATTCAGTACTTCTTGTGTGCCTTTTTATTCAAAATCATGGACCTACTTCTTGTGTGCCTTTTTATTCAAAATCATGGACCTATTAGGATTATAAATTTTATTCAAGCGATCATTTTTTTCAC
TTTCAAAATCACTTAAGAATGTGCATGCAAAACTGATTTTGAATAAACCATTACATGGTGAATTATAAACTTATAGCCAAATCCATCACTAACCGGTTGGAATCAACTTTGAATTTTCGATAAGCAGACTTTATACCTTCATGTAATACTTTATACCTTCATGTAATACTTTATGCACATCGCTAACAAATATTGTCCATTTTGACCCATTACATATCGTCAGCCACACACAAATTTTGTCCATTTTAACCCATTTACGTATTGCCGTCAGCCACACAATTTACCCATTACGTATCATCAGCGCATCACCTAGGAAGAGATTTCACACCCTTATAAAAACTGTTTCATTCCACTCTCCAACCGATATGAAATTTCACAAAACCTTACAATGGAGCCAGGAGACACGCAAGAAGCAAAGTTGTTTAGCAAATGATCGGTCCTGACACTATTTCTTTTTGAATTCTTTATCTGCAGAATCTAAATTGATAAGGGATTCATTCATTGCAATGTTCATGGACATTGATGCTGAAGCAGAGAAGAACGAGAGGTTTTGTGAGAAGATCTAA

mRNA sequence

ATGACCGCTCTCTCAACGCCATGGAACACTCAGCTCAGAGAATTAGCAAAACGATGCCAATTCCTTCAAGCTCTAAGTCTCTACGCCCAAATGCTTCGCCATGGCGATCACCCCAATGCCTTCACTTTCCCGTTTGCCCTCAAATCCTGCGCGGCCCTCTCGCTCCCCATACTCGGCGGCCAATTTCATGGTCAAATTATCAAAGTTGGCTGTGAATCTGAACCTTTTGTGCAAACTGGTTTGATTTCTATGTACTGCAGAGGTTCATTACTCGGAAATGCCCGTAAAGTGTTTGATGAAAAATCCCAGTCCAGAAAGCTTACCGTTTGCTACAATGCTTTGATTTCCGGCTATGTTTCGAATTCTAAAAGTTCTGACGCGGTTCTTTTGTTTCGCCAAATGAATGAAGAGGGTGTCCCAGTTAATTCAGTTACATTGCTGGAATCTAAATTGATAAGGGATTCATTCATTGCAATGTTCATGGACATTGATGCTGAAGCAGAGAAGAACGAGAGGTTTTGTGAGAAGATCTAA

Coding sequence (CDS)

ATGACCGCTCTCTCAACGCCATGGAACACTCAGCTCAGAGAATTAGCAAAACGATGCCAATTCCTTCAAGCTCTAAGTCTCTACGCCCAAATGCTTCGCCATGGCGATCACCCCAATGCCTTCACTTTCCCGTTTGCCCTCAAATCCTGCGCGGCCCTCTCGCTCCCCATACTCGGCGGCCAATTTCATGGTCAAATTATCAAAGTTGGCTGTGAATCTGAACCTTTTGTGCAAACTGGTTTGATTTCTATGTACTGCAGAGGTTCATTACTCGGAAATGCCCGTAAAGTGTTTGATGAAAAATCCCAGTCCAGAAAGCTTACCGTTTGCTACAATGCTTTGATTTCCGGCTATGTTTCGAATTCTAAAAGTTCTGACGCGGTTCTTTTGTTTCGCCAAATGAATGAAGAGGGTGTCCCAGTTAATTCAGTTACATTGCTGGAATCTAAATTGATAAGGGATTCATTCATTGCAATGTTCATGGACATTGATGCTGAAGCAGAGAAGAACGAGAGGTTTTGTGAGAAGATCTAA

Protein sequence

MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDAVLLFRQMNEEGVPVNSVTLLESKLIRDSFIAMFMDIDAEAEKNERFCEKI
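
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), which the record does not name explicitly, and the function name `translate` is hypothetical.

```python
# Standard genetic code (assumed: NCBI translation table 1), built from the
# conventional T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons and last four codons of the CDS listed above.
print(translate("ATGACCGCTCTCTCAACGCCATGG"))
print(translate("GAGAAGATCTAA"))
```

The two fragments translate to `MTALSTPW` and `EKI`, matching the start and end of the protein sequence shown in the record.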
BLAST of Cp4.1LG10g04050 vs. Swiss-Prot
Match: PP223_ARATH (Putative pentatricopeptide repeat-containing protein At3g11460 OS=Arabidopsis thaliana GN=PCMP-H52 PE=3 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 1.7e-45
Identity = 87/143 (60.84%), Positives = 110/143 (76.92%), Query Frame = 1

Query: 5   STPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHG 64
           STPWN +LRELA +  F +++SLY  MLR G  P+AF+FPF LKSCA+LSLP+ G Q H 
Sbjct: 18  STPWNVRLRELAYQSLFSESISLYRSMLRSGSSPDAFSFPFILKSCASLSLPVSGQQLHC 77

Query: 65  QIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKS 124
            + K GCE+EPFV T LISMYC+  L+ +ARKVF+E  QS +L+VCYNALISGY +NSK 
Sbjct: 78  HVTKGGCETEPFVLTALISMYCKCGLVADARKVFEENPQSSQLSVCYNALISGYTANSKV 137

Query: 125 SDAVLLFRQMNEEGVPVNSVTLL 148
           +DA  +FR+M E GV V+SVT+L
Sbjct: 138 TDAAYMFRRMKETGVSVDSVTML 160
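
The percentages in an HSP header line are the match counts divided by the alignment length. The sketch below recomputes them from the raw fractions; the regular expression and the helper name `hsp_fractions` are illustrative assumptions, not part of any BLAST tooling.

```python
import re

# Matches fields such as "Identity = 87/143 (60.84%)".
HSP_FIELD = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_fractions(line):
    """Return {field: (matches, length, recomputed_pct, reported_pct)}."""
    fields = {}
    for name, matches, length, reported in HSP_FIELD.findall(line):
        recomputed = round(100 * int(matches) / int(length), 2)
        fields[name] = (int(matches), int(length), recomputed, float(reported))
    return fields

line = "Identity = 87/143 (60.84%), Positives = 110/143 (76.92%), Query Frame = 1"
fields = hsp_fractions(line)
for name, (matches, length, recomputed, reported) in fields.items():
    print(name, matches, length, recomputed, reported)
```

For the first Swiss-Prot hit, 87 identical positions over a 143-residue alignment give 60.84%, and 110 conservative-or-identical positions give 76.92%, in agreement with the reported values.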

BLAST of Cp4.1LG10g04050 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 98.2 bits (243), Expect = 9.5e-20
Identity = 59/137 (43.07%), Positives = 75/137 (54.74%), Query Frame = 1

Query: 8   WNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQII 67
           WNT  R  A     + AL LY  M+  G  PN++TFPF LKSCA       G Q HG ++
Sbjct: 102 WNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVL 161

Query: 68  KVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDA 127
           K+GC+ + +V T LISMY +   L +A KVFD KS  R + V Y ALI GY S     +A
Sbjct: 162 KLGCDLDLYVHTSLISMYVQNGRLEDAHKVFD-KSPHRDV-VSYTALIKGYASRGYIENA 221

Query: 128 VLLFRQMNEEGVPVNSV 145
             LF     + +PV  V
Sbjct: 222 QKLF-----DEIPVKDV 231

BLAST of Cp4.1LG10g04050 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 96.3 bits (238), Expect = 3.6e-19
Identity = 48/141 (34.04%), Positives = 80/141 (56.74%), Query Frame = 1

Query: 7   PWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQI 66
           PWN  +R  ++   F  AL +Y+ M      P++FTFP  LK+C+ LS   +G   H Q+
Sbjct: 86  PWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQV 145

Query: 67  IKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSD 126
            ++G +++ FVQ GLI++Y +   LG+AR VF+      +  V + A++S Y  N +  +
Sbjct: 146 FRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPME 205

Query: 127 AVLLFRQMNEEGVPVNSVTLL 148
           A+ +F QM +  V  + V L+
Sbjct: 206 ALEIFSQMRKMDVKPDWVALV 226

BLAST of Cp4.1LG10g04050 vs. Swiss-Prot
Match: PP214_ARATH (Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis thaliana GN=PCMP-E82 PE=3 SV=2)

HSP 1 Score: 92.0 bits (227), Expect = 6.8e-18
Identity = 49/141 (34.75%), Positives = 77/141 (54.61%), Query Frame = 1

Query: 8   WNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQII 67
           WN+ +R  +      +AL  Y +MLR G  P+ FTFP+ LK+C+ L     G   HG ++
Sbjct: 75  WNSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGSCVHGFVV 134

Query: 68  KVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDA 127
           K G E   +V T L+ MY     +    +VF++  Q     V + +LISG+V+N++ SDA
Sbjct: 135 KTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWN--VVAWGSLISGFVNNNRFSDA 194

Query: 128 VLLFRQMNEEGVPVNSVTLLE 149
           +  FR+M   GV  N   +++
Sbjct: 195 IEAFREMQSNGVKANETIMVD 213

BLAST of Cp4.1LG10g04050 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 1.5e-17
Identity = 52/141 (36.88%), Positives = 78/141 (55.32%), Query Frame = 1

Query: 8   WNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQII 67
           +N+ +R  A      +A+ L+ +M+  G  P+ +TFPF L +CA       G Q HG I+
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGLIV 161

Query: 68  KVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDA 127
           K+G   + FVQ  L+  Y     L +ARKVFDE S+  +  V + ++I GY     + DA
Sbjct: 162 KMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSE--RNVVSWTSMICGYARRDFAKDA 221

Query: 128 V-LLFRQMNEEGVPVNSVTLL 148
           V L FR + +E V  NSVT++
Sbjct: 222 VDLFFRMVRDEEVTPNSVTMV 240

BLAST of Cp4.1LG10g04050 vs. TrEMBL
Match: A0A0A0KFC9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G452690 PE=4 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 2.2e-68
Identity = 130/147 (88.44%), Positives = 135/147 (91.84%), Query Frame = 1

Query: 1   MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60
           M ALSTPWNTQLRELAKRCQFLQALSLY QMLRHGD PNAFTFPFALKSCAALSLPILG 
Sbjct: 10  MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSLPILGS 69

Query: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120
           QFHGQI KVGC  EPFVQTGLISMYC+GSL+ NARKVF+E   SRKLTVCYNAL+SGYVS
Sbjct: 70  QFHGQITKVGCVFEPFVQTGLISMYCKGSLVDNARKVFEENFHSRKLTVCYNALVSGYVS 129

Query: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLL 148
           NSK S+AVLLFRQMNEEGVPVNSVTLL
Sbjct: 130 NSKCSEAVLLFRQMNEEGVPVNSVTLL 156

BLAST of Cp4.1LG10g04050 vs. TrEMBL
Match: W9R4V5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_019144 PE=4 SV=1)

HSP 1 Score: 203.0 bits (515), Expect = 3.1e-49
Identity = 97/147 (65.99%), Positives = 112/147 (76.19%), Query Frame = 1

Query: 1   MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60
           +T  + PWNT LRELAK+C F +AL+LY +MLR G  PNAFTFPF LKSCA+LSL   G 
Sbjct: 23  LTTTAIPWNTHLRELAKQCLFSEALNLYRRMLRSGQSPNAFTFPFVLKSCASLSLSTAGK 82

Query: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120
             HG +IK+GCE EPFVQT LISMYC+  L+ NARKVFDE  QSR LTVCYNALISGY  
Sbjct: 83  LLHGHVIKIGCEPEPFVQTSLISMYCKCCLVDNARKVFDENPQSRNLTVCYNALISGYTL 142

Query: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLL 148
           NSK  D ++LF +M E GV VNSVT+L
Sbjct: 143 NSKFLDGIVLFSKMRETGVAVNSVTML 169

BLAST of Cp4.1LG10g04050 vs. TrEMBL
Match: F6HKM1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g03150 PE=4 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 1.5e-48
Identity = 95/140 (67.86%), Positives = 111/140 (79.29%), Query Frame = 1

Query: 8   WNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQII 67
           WN +LRELA++  F +AL+LY QML  GD PNAFTFPFA KSCA+LSLP+ G Q HG +I
Sbjct: 24  WNARLRELARQRHFQEALNLYCQMLASGDSPNAFTFPFAFKSCASLSLPLAGSQLHGHVI 83

Query: 68  KVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDA 127
           K GCE EPFVQT LISMYC+ S + +ARKVFDE   SR L VCYNALI+GY  NS+ SDA
Sbjct: 84  KTGCEPEPFVQTSLISMYCKCSTIASARKVFDENHHSRNLAVCYNALIAGYSLNSRFSDA 143

Query: 128 VLLFRQMNEEGVPVNSVTLL 148
           VLLFRQM +EGV VN+VT+L
Sbjct: 144 VLLFRQMRKEGVSVNAVTML 163

BLAST of Cp4.1LG10g04050 vs. TrEMBL
Match: A0A0D2RWW9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G154800 PE=4 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 3.7e-47
Identity = 96/141 (68.09%), Positives = 110/141 (78.01%), Query Frame = 1

Query: 7   PWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQI 66
           PWNTQLRELAK+CQ+L+AL+LY QMLR G  PNAF+FPFALKS A+L LP+ G Q H Q+
Sbjct: 5   PWNTQLRELAKQCQYLEALTLYRQMLRCGSSPNAFSFPFALKSSASLPLPLSGQQLHCQV 64

Query: 67  IKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSD 126
           IK GC  EPFV T LISMYC+ S LGNARKVFDE   S +LTVCYNALISGY  NS+  D
Sbjct: 65  IKSGCSQEPFVLTSLISMYCKFSSLGNARKVFDENPISNQLTVCYNALISGYALNSRVFD 124

Query: 127 AVLLFRQMNEEGVPVNSVTLL 148
            + LF +M E GV VNSVT+L
Sbjct: 125 VIALFCRMREMGVSVNSVTML 145

BLAST of Cp4.1LG10g04050 vs. TrEMBL
Match: V4M924_EUTSA (Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10022435mg PE=4 SV=1)

HSP 1 Score: 193.0 bits (489), Expect = 3.2e-46
Identity = 92/143 (64.34%), Positives = 112/143 (78.32%), Query Frame = 1

Query: 5   STPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHG 64
           STPWN +LRELA +  F +A+SLY  MLR G  P+AF++PF LKSCAALSLP+ G Q H 
Sbjct: 20  STPWNVRLRELAYQSLFAEAISLYRSMLRSGSSPDAFSYPFTLKSCAALSLPVSGQQLHC 79

Query: 65  QIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKS 124
            +I+ GCE+EPFV T LISMYC+  L+G ARKVFDE  QSR L VCYNALI+GY +NSK 
Sbjct: 80  HVIRGGCEAEPFVLTALISMYCKCGLVGEARKVFDENPQSRHLGVCYNALIAGYKANSKV 139

Query: 125 SDAVLLFRQMNEEGVPVNSVTLL 148
           +DAV +FR+M E GVPV+SVT+L
Sbjct: 140 TDAVCMFRKMKETGVPVDSVTML 162

BLAST of Cp4.1LG10g04050 vs. TAIR10
Match: AT3G11460.1 (AT3G11460.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 183.7 bits (465), Expect = 9.7e-47
Identity = 87/143 (60.84%), Positives = 110/143 (76.92%), Query Frame = 1

Query: 5   STPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHG 64
           STPWN +LRELA +  F +++SLY  MLR G  P+AF+FPF LKSCA+LSLP+ G Q H 
Sbjct: 18  STPWNVRLRELAYQSLFSESISLYRSMLRSGSSPDAFSFPFILKSCASLSLPVSGQQLHC 77

Query: 65  QIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKS 124
            + K GCE+EPFV T LISMYC+  L+ +ARKVF+E  QS +L+VCYNALISGY +NSK 
Sbjct: 78  HVTKGGCETEPFVLTALISMYCKCGLVADARKVFEENPQSSQLSVCYNALISGYTANSKV 137

Query: 125 SDAVLLFRQMNEEGVPVNSVTLL 148
           +DA  +FR+M E GV V+SVT+L
Sbjct: 138 TDAAYMFRRMKETGVSVDSVTML 160

BLAST of Cp4.1LG10g04050 vs. TAIR10
Match: AT1G08070.1 (AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 98.2 bits (243), Expect = 5.4e-21
Identity = 59/137 (43.07%), Positives = 75/137 (54.74%), Query Frame = 1

Query: 8   WNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQII 67
           WNT  R  A     + AL LY  M+  G  PN++TFPF LKSCA       G Q HG ++
Sbjct: 102 WNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQIHGHVL 161

Query: 68  KVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDA 127
           K+GC+ + +V T LISMY +   L +A KVFD KS  R + V Y ALI GY S     +A
Sbjct: 162 KLGCDLDLYVHTSLISMYVQNGRLEDAHKVFD-KSPHRDV-VSYTALIKGYASRGYIENA 221

Query: 128 VLLFRQMNEEGVPVNSV 145
             LF     + +PV  V
Sbjct: 222 QKLF-----DEIPVKDV 231

BLAST of Cp4.1LG10g04050 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 96.3 bits (238), Expect = 2.0e-20
Identity = 48/141 (34.04%), Positives = 80/141 (56.74%), Query Frame = 1

Query: 7   PWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQI 66
           PWN  +R  ++   F  AL +Y+ M      P++FTFP  LK+C+ LS   +G   H Q+
Sbjct: 86  PWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQV 145

Query: 67  IKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSD 126
            ++G +++ FVQ GLI++Y +   LG+AR VF+      +  V + A++S Y  N +  +
Sbjct: 146 FRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPME 205

Query: 127 AVLLFRQMNEEGVPVNSVTLL 148
           A+ +F QM +  V  + V L+
Sbjct: 206 ALEIFSQMRKMDVKPDWVALV 226

BLAST of Cp4.1LG10g04050 vs. TAIR10
Match: AT3G05240.1 (AT3G05240.1 mitochondrial editing factor 19)

HSP 1 Score: 92.0 bits (227), Expect = 3.8e-19
Identity = 49/141 (34.75%), Positives = 77/141 (54.61%), Query Frame = 1

Query: 8   WNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQII 67
           WN+ +R  +      +AL  Y +MLR G  P+ FTFP+ LK+C+ L     G   HG ++
Sbjct: 75  WNSMIRGYSNSPNPDKALIFYQEMLRKGYSPDYFTFPYVLKACSGLRDIQFGSCVHGFVV 134

Query: 68  KVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDA 127
           K G E   +V T L+ MY     +    +VF++  Q     V + +LISG+V+N++ SDA
Sbjct: 135 KTGFEVNMYVSTCLLHMYMCCGEVNYGLRVFEDIPQWN--VVAWGSLISGFVNNNRFSDA 194

Query: 128 VLLFRQMNEEGVPVNSVTLLE 149
           +  FR+M   GV  N   +++
Sbjct: 195 IEAFREMQSNGVKANETIMVD 213

BLAST of Cp4.1LG10g04050 vs. TAIR10
Match: AT3G22690.1 (AT3G22690.1 Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885))

HSP 1 Score: 90.9 bits (224), Expect = 8.6e-19
Identity = 52/141 (36.88%), Positives = 78/141 (55.32%), Query Frame = 1

Query: 8   WNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQII 67
           +N+ +R  A      +A+ L+ +M+  G  P+ +TFPF L +CA       G Q HG I+
Sbjct: 102 YNSLIRGYASSGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGLIV 161

Query: 68  KVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSDA 127
           K+G   + FVQ  L+  Y     L +ARKVFDE S+  +  V + ++I GY     + DA
Sbjct: 162 KMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSE--RNVVSWTSMICGYARRDFAKDA 221

Query: 128 V-LLFRQMNEEGVPVNSVTLL 148
           V L FR + +E V  NSVT++
Sbjct: 222 VDLFFRMVRDEEVTPNSVTMV 240

BLAST of Cp4.1LG10g04050 vs. NCBI nr
Match: gi|659125232|ref|XP_008462579.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Cucumis melo])

HSP 1 Score: 268.9 bits (686), Expect = 6.5e-69
Identity = 132/147 (89.80%), Positives = 135/147 (91.84%), Query Frame = 1

Query: 1   MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60
           M ALSTPWNTQLRELAKRCQFLQALSLY QMLRHGD PNAFTFPFALKSCAALS PILGG
Sbjct: 10  MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSHPILGG 69

Query: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120
           QFHGQIIKVGC  EPFVQTGLISMYC+GSL+ NARKVFDE   SRKLTVCYNALISGY S
Sbjct: 70  QFHGQIIKVGCIFEPFVQTGLISMYCKGSLVENARKVFDENFHSRKLTVCYNALISGYAS 129

Query: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLL 148
           NSK SDAVLLFRQMNEEG+PVNSVTLL
Sbjct: 130 NSKCSDAVLLFRQMNEEGIPVNSVTLL 156

BLAST of Cp4.1LG10g04050 vs. NCBI nr
Match: gi|449451271|ref|XP_004143385.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Cucumis sativus])

HSP 1 Score: 266.5 bits (680), Expect = 3.2e-68
Identity = 130/147 (88.44%), Positives = 135/147 (91.84%), Query Frame = 1

Query: 1   MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60
           M ALSTPWNTQLRELAKRCQFLQALSLY QMLRHGD PNAFTFPFALKSCAALSLPILG 
Sbjct: 10  MNALSTPWNTQLRELAKRCQFLQALSLYPQMLRHGDRPNAFTFPFALKSCAALSLPILGS 69

Query: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120
           QFHGQI KVGC  EPFVQTGLISMYC+GSL+ NARKVF+E   SRKLTVCYNAL+SGYVS
Sbjct: 70  QFHGQITKVGCVFEPFVQTGLISMYCKGSLVDNARKVFEENFHSRKLTVCYNALVSGYVS 129

Query: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLL 148
           NSK S+AVLLFRQMNEEGVPVNSVTLL
Sbjct: 130 NSKCSEAVLLFRQMNEEGVPVNSVTLL 156

BLAST of Cp4.1LG10g04050 vs. NCBI nr
Match: gi|645276950|ref|XP_008243533.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Prunus mume])

HSP 1 Score: 210.7 bits (535), Expect = 2.1e-51
Identity = 101/147 (68.71%), Positives = 116/147 (78.91%), Query Frame = 1

Query: 1   MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60
           M   STPWNT+LREL+K+C F +AL++Y QML HG  PNAFTFPFALKSCAALSLP+ G 
Sbjct: 1   MAQPSTPWNTRLRELSKQCLFFEALTVYRQMLHHGHSPNAFTFPFALKSCAALSLPLAGS 60

Query: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120
             H  ++K GCE EPFVQT LISMYC+  L+ +AR+VFDE   SRKLTVCYNALISG+ S
Sbjct: 61  LLHCHVVKTGCEPEPFVQTSLISMYCKCCLVDDARRVFDENPHSRKLTVCYNALISGHTS 120

Query: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLL 148
           NSK SDAV LFRQM   GV VNSVT+L
Sbjct: 121 NSKFSDAVSLFRQMRAAGVEVNSVTML 147

BLAST of Cp4.1LG10g04050 vs. NCBI nr
Match: gi|1009160965|ref|XP_015898638.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Ziziphus jujuba])

HSP 1 Score: 206.8 bits (525), Expect = 3.0e-50
Identity = 99/141 (70.21%), Positives = 114/141 (80.85%), Query Frame = 1

Query: 7   PWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGGQFHGQI 66
           PWN++LRELA +C F +AL+LY QMLR G  PNAFTFP ALKSCAALSL + G   HG +
Sbjct: 29  PWNSRLRELANQCHFAEALTLYRQMLRSGHPPNAFTFPSALKSCAALSLLVTGKLLHGHV 88

Query: 67  IKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVSNSKSSD 126
           +K GCE EPFVQT LISMYCR SL+ NAR+VF++   SRKLTVCYNALISGY SNSK SD
Sbjct: 89  VKTGCEPEPFVQTSLISMYCRCSLIKNARRVFEDNDSSRKLTVCYNALISGYTSNSKISD 148

Query: 127 AVLLFRQMNEEGVPVNSVTLL 148
           AVLLFR+M E G+ VNSVT+L
Sbjct: 149 AVLLFRRMREAGLSVNSVTML 169

BLAST of Cp4.1LG10g04050 vs. NCBI nr
Match: gi|703089210|ref|XP_010093742.1| (hypothetical protein L484_019144 [Morus notabilis])

HSP 1 Score: 203.0 bits (515), Expect = 4.4e-49
Identity = 97/147 (65.99%), Positives = 112/147 (76.19%), Query Frame = 1

Query: 1   MTALSTPWNTQLRELAKRCQFLQALSLYAQMLRHGDHPNAFTFPFALKSCAALSLPILGG 60
           +T  + PWNT LRELAK+C F +AL+LY +MLR G  PNAFTFPF LKSCA+LSL   G 
Sbjct: 23  LTTTAIPWNTHLRELAKQCLFSEALNLYRRMLRSGQSPNAFTFPFVLKSCASLSLSTAGK 82

Query: 61  QFHGQIIKVGCESEPFVQTGLISMYCRGSLLGNARKVFDEKSQSRKLTVCYNALISGYVS 120
             HG +IK+GCE EPFVQT LISMYC+  L+ NARKVFDE  QSR LTVCYNALISGY  
Sbjct: 83  LLHGHVIKIGCEPEPFVQTSLISMYCKCCLVDNARKVFDENPQSRNLTVCYNALISGYTL 142

Query: 121 NSKSSDAVLLFRQMNEEGVPVNSVTLL 148
           NSK  D ++LF +M E GV VNSVT+L
Sbjct: 143 NSKFLDGIVLFSKMRETGVAVNSVTML 169

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value   Identity (%)  Description
PP223_ARATH   1.7e-45   60.84         Putative pentatricopeptide repeat-containing protein At3g11460 OS=Arabidopsis th...
PPR21_ARATH   9.5e-20   43.07         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
PP224_ARATH   3.6e-19   34.04         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP214_ARATH   6.8e-18   34.75         Putative pentatricopeptide repeat-containing protein At3g05240 OS=Arabidopsis th...
PP249_ARATH   1.5e-17   36.88         Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN...

TrEMBL
Match Name         E-value   Identity (%)  Description
A0A0A0KFC9_CUCSA   2.2e-68   88.44         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G452690 PE=4 SV=1
W9R4V5_9ROSA       3.1e-49   65.99         Uncharacterized protein OS=Morus notabilis GN=L484_019144 PE=4 SV=1
F6HKM1_VITVI       1.5e-48   67.86         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g03150 PE=4 SV=...
A0A0D2RWW9_GOSRA   3.7e-47   68.09         Uncharacterized protein OS=Gossypium raimondii GN=B456_006G154800 PE=4 SV=1
V4M924_EUTSA       3.2e-46   64.34         Uncharacterized protein OS=Eutrema salsugineum GN=EUTSA_v10022435mg PE=4 SV=1

TAIR10
Match Name    E-value   Identity (%)  Description
AT3G11460.1   9.7e-47   60.84         Pentatricopeptide repeat (PPR) superfamily protein
AT1G08070.1   5.4e-21   43.07         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1   2.0e-20   34.04         mitochondrial editing factor 22
AT3G05240.1   3.8e-19   34.75         mitochondrial editing factor 19
AT3G22690.1   8.6e-19   36.88         Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatrico...

NCBI nr
Match Name                          E-value   Identity (%)  Description
gi|659125232|ref|XP_008462579.1|    6.5e-69   89.80         PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Cucum...
gi|449451271|ref|XP_004143385.1|    3.2e-68   88.44         PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Cucum...
gi|645276950|ref|XP_008243533.1|    2.1e-51   68.71         PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Prunu...
gi|1009160965|ref|XP_015898638.1|   3.0e-50   70.21         PREDICTED: putative pentatricopeptide repeat-containing protein At3g11460 [Zizip...
gi|703089210|ref|XP_010093742.1|    4.4e-49   65.99         hypothetical protein L484_019144 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016554      cytidine to uridine editing
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG10g04050.1  Cp4.1LG10g04050.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 80..100, score:
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 108..145, score: 1.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 109..142, score: 2.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 107..141, score: 9.898
                                                                                     coord: 4..38, score: 7.651
                                                                                     coord: 74..104, score: 5
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 6..147, score: 3.1
None       No IPR available          PANTHER   PTHR24015:SF505  SUBFAMILY NOT NAMED  coord: 6..147, score: 3.1

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                    Block
Cp4.1LG10g04050  MELO3C024551.2  Melon (DHL92) v3.6.1        cpemedB090
Cp4.1LG10g04050  CsaV3_6G040730  Cucumber (Chinese Long) v3  cpecucB0093
Cp4.1LG10g04050  Bhi03G001415    Wax gourd                   cpewgoB0101
Cp4.1LG10g04050  CsGy6G024990    Cucumber (Gy14) v2          cgybcpeB710
Cp4.1LG10g04050  Carg10486       Silver-seed gourd           carcpeB0337

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                      Block
Cp4.1LG10g04050  Cucurbita pepo (Zucchini)     cpecpeB076
Cp4.1LG10g04050  Cucurbita pepo (Zucchini)     cpecpeB092
Cp4.1LG10g04050  Cucurbita pepo (Zucchini)     cpecpeB106
Cp4.1LG10g04050  Cucumber (Gy14) v1            cgycpeB0145
Cp4.1LG10g04050  Cucumber (Gy14) v1            cgycpeB0638
Cp4.1LG10g04050  Cucurbita maxima (Rimu)       cmacpeB254
Cp4.1LG10g04050  Cucurbita maxima (Rimu)       cmacpeB656
Cp4.1LG10g04050  Cucurbita maxima (Rimu)       cmacpeB807
Cp4.1LG10g04050  Cucurbita maxima (Rimu)       cmacpeB866
Cp4.1LG10g04050  Cucurbita moschata (Rifu)     cmocpeB218
Cp4.1LG10g04050  Cucurbita moschata (Rifu)     cmocpeB606
Cp4.1LG10g04050  Cucurbita moschata (Rifu)     cmocpeB761
Cp4.1LG10g04050  Cucurbita moschata (Rifu)     cmocpeB806
Cp4.1LG10g04050  Wild cucumber (PI 183967)     cpecpiB059
Cp4.1LG10g04050  Wild cucumber (PI 183967)     cpecpiB081
Cp4.1LG10g04050  Cucumber (Chinese Long) v2    cpecuB063
Cp4.1LG10g04050  Cucumber (Chinese Long) v2    cpecuB085
Cp4.1LG10g04050  Bottle gourd (USVL1VR-Ls)     cpelsiB047
Cp4.1LG10g04050  Bottle gourd (USVL1VR-Ls)     cpelsiB054
Cp4.1LG10g04050  Bottle gourd (USVL1VR-Ls)     cpelsiB062
Cp4.1LG10g04050  Watermelon (Charleston Gray)  cpewcgB057
Cp4.1LG10g04050  Watermelon (Charleston Gray)  cpewcgB067
Cp4.1LG10g04050  Watermelon (Charleston Gray)  cpewcgB071
Cp4.1LG10g04050  Watermelon (97103) v1         cpewmB064
Cp4.1LG10g04050  Watermelon (97103) v1         cpewmB078
Cp4.1LG10g04050  Watermelon (97103) v1         cpewmB092
Cp4.1LG10g04050  Melon (DHL92) v3.5.1          cpemeB054
Cp4.1LG10g04050  Melon (DHL92) v3.5.1          cpemeB069
Cp4.1LG10g04050  Cucumber (Gy14) v2            cgybcpeB290
Cp4.1LG10g04050  Melon (DHL92) v3.6.1          cpemedB071
Cp4.1LG10g04050  Silver-seed gourd             carcpeB0348
Cp4.1LG10g04050  Silver-seed gourd             carcpeB1128
Cp4.1LG10g04050  Cucumber (Chinese Long) v3    cpecucB0069
Cp4.1LG10g04050  Wax gourd                     cpewgoB0066