Cp4.1LG08g13030 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g13030
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG08 : 9382184 .. 9384843 (+)
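The span covered by the location above can be worked out from its coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the page does not state the convention, so this is an assumption); `feature_length` is a hypothetical helper name, not part of any database API.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = feature_length(9382184, 9384843)
print(gene_length)
```

Under a 0-based, half-open convention the result would be one base shorter, so the convention should be confirmed before comparing against sequence lengths.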
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATGGGCTCCACTGGCGATCTTCGCGAGCTCTGGTGTCGAGAAATCGACCATTCTTCAGATTTCTAAGATCCATCTTCAACGATTGGAGTCGTAGGGAGTATCATCGCGATTCATATGATTACACGAAGTTATTACAACATTGCAGATCTATCAGAAGCGTTCAAGAACTTCATGCCCAGATCGTCGTTGAAGGTCATGACCAAAATGGATTCTTAGCCACGAAGCTAATCGGCAAATATGCCGAGTATGGCGAGGAAAAAATGGGAATTGCACGGAAGGTGTTCGATAGATTGCTTGAAAAAGATGTGTTCTTATGGAACGTGGTTATTCAAGGGTATGCGAATTGGGGTCCGTTTGCTGAAGCTCTAAACCTGTATGATGAAATGCGGGTCGGTGGCGAACCCACCAATCGCTACACATTTCCTTTTGTGTTGAAGGCATGTGGCGCCATGAAGAACGGTGACAAGGGGAAGATTGTTCATGGGCACGTTTTGAAATGTGGGTTGGACTTGGATTTGTTCGTGGGCAATGCTCTGATTGCGTTTTATTCCAAGTGCCAGGATGTTGAAACTGCTCGGAAGGTGTTTGATGAAATGTCTCTGAGAGACATTGTTAGTTGGAACTCCATGATTGCTGGGTATACTTTGAATGGGAAAGTGGATGAAGCTATTATGATTTTCCACGCCATGCTGCATAATCAAACTGCTTGCTCACCTGATAATGCAACTCTTGTTGGTATTCTCCCTGCTTGTGTTACAAAATCAGCTTCCCAAGTTGGCTTCTGGGTTCATTCCTACATTATAAAGACAGGAATGGAAGTTGGGGCCCCGTTGGGCAGTTGCCTTATCTCAATGTATGCTAACTGTGGTCATGTGAACATTGCGAGAGACGTTTTCGATCGAATCAAGGACAAAAACGTCATCGTATGGAGCGCGATCATAAGGTCTTACGGAATGCATGGTTTTGCAGATGAGGCATTAAACATGTTCACAAGTTTGGAAGAAGCTGGTCTAAAACCAGATGGCGTGATCTTCCTGAATTTGTTGTCGACGTGTAGTCACGCCGGGCTTGTAGAAAAAGGGCGCGAGATATACGAAAGGATGGAGGCTTATGGTGCAGAGAGGAAAGAGGAACATTATGCGTGCATGGTGGATCTCTTAGGGAGGGCTGGTTTCTTAGAACAAGCAGTAGAGTTCATTGAAGGCATGCCAGTGCAGGCAGGGAAAGATGTGTATGGTGCATTGCTTGGTGCTTGTAGGATACACAACAACATAGAGTTAGCTAAAGAAGTTGGGGATAAGTTGTTTGTTTTGGATCCCGAAAACGCAGGACGATACGTGATCTTAGCTAGTATGTATGAAGATGCAGGGCAGTGGGAAGATGCTGCTAAACTAAGGAAGTTGCTGAGAGATAGGAATATTAGGAAGCCGGTTGGTTGCAGTTCAATAGAGATAGATAGGATTAATCATGTGTTTGGGAAGGAGGATGAATCTCACCCCTTCACAGAACAAATTTTTGACACATTGGAGAAGCTTGAAAGGGTAATGGATGAAAATTTTGAACCTATTTAATGGAATCCACTTTGCAATTTTCCCTTCAATTGTTCATGTTTCTCATTTTTCTCCCTTGGAATTTGGATGTTCCACTTTAATTCACAACATTCTTGTTAAAAAAAACCTACCTACCACTAAACTCGACTGAGTTGTCTGTCCGAAATTCTAGGAAAACTCTAGGTTTAGCTTAAAAAACCTACTTACGACTCAACTCCGCTCTTTTCTTAAGGAAAGACTTAGTCTTACTGGGTCGATGCAGAATTCTAGGTTAGCTTTCAATTTCACTTGACAAAAGAATAGATTAAGTCTTCCTTCCTTTTTTTTATCGAAATTTTTTTCTTCATTCAAGGGTAAAATAAGTCAATTTCTTTCGTCGATTCAAACGCTGAGTTTGTTTTAAAGTCTTTCGACACTAAACGGACCTTTTGTTTAGCATTGATCAT
TTCTGCCAAATTGGACCCCATATGTCGTGCTAGTGACCACATTACGATTTAGAGCCAAGTAAAAAGACGAAGATCAAACTAAAACTAAGCCAAAACCATAAAACCAAATTGAAACCTTCAAACCATTGGAAGCATGGAAGTAAATTATAATCTTCAAAAACTTGGGTCCAAACCATATGGACAAAGTTTGTAATTTAACCTAACAATTTTAAGACTGATACTGAAAGTTCATTTGATACCAGTACAATTTACAAATAAAAACTAATACATAGTTGAGAAGCCTATTTCTTAGTAGAGTTTCTGAGTCAAACAAATTACAGTTTTCTTTACATAATCATTTGCTATTTGGTAAGGTACAAAGATACAGGGGAACGGACGGGCTAATTGTAAGATAATCAGTTTTTGCTTGTAATGTGTTCACAGATTCTTCATAGCAAAATAAACAAAGCTTCAGACAAGTTCTGTTCTCTGCCAAATCTAAATCAAATAATGTATTCTCATTCCAAGCAGCATGCCTTGATGAGTTGGGTCGTTTCTTTGCCTCTGCTCGTTCTCGGTTTTCGAGGGCCTGACATTTACAAATTACCTGCACAATCAGTACAATTATACAATGAAGGATTAACTACAACATGTTCTAAGAAATCACTTGCAAAAGGTTAA

mRNA sequence

ATGAATGGGCTCCACTGGCGATCTTCGCGAGCTCTGGTGTCGAGAAATCGACCATTCTTCAGATTTCTAAGATCCATCTTCAACGATTGGAGTCGTAGGGAGTATCATCGCGATTCATATGATTACACGAAGTTATTACAACATTGCAGATCTATCAGAAGCGTTCAAGAACTTCATGCCCAGATCGTCGTTGAAGGTCATGACCAAAATGGATTCTTAGCCACGAAGCTAATCGGCAAATATGCCGAGTATGGCGAGGAAAAAATGGGAATTGCACGGAAGGTGTTCGATAGATTGCTTGAAAAAGATGTGTTCTTATGGAACGTGGTTATTCAAGGGTATGCGAATTGGGGTCCGTTTGCTGAAGCTCTAAACCTGTATGATGAAATGCGGATTCTTCATAGCAAAATAAACAAAGCTTCAGACAAGTTCTGTTCTCTGCCAAATCTAAATCAAATAATGTATTCTCATTCCAAGCAGCATGCCTTGATGAGTTGGGTCGTTTCTTTGCCTCTGCTCGTTCTCGGTTTTCGAGGGCCTGACATTTACAAATTACCTGCACAATCAGTACAATTATACAATGAAGGATTAACTACAACATGTTCTAAGAAATCACTTGCAAAAGGTTAA

Coding sequence (CDS)

ATGAATGGGCTCCACTGGCGATCTTCGCGAGCTCTGGTGTCGAGAAATCGACCATTCTTCAGATTTCTAAGATCCATCTTCAACGATTGGAGTCGTAGGGAGTATCATCGCGATTCATATGATTACACGAAGTTATTACAACATTGCAGATCTATCAGAAGCGTTCAAGAACTTCATGCCCAGATCGTCGTTGAAGGTCATGACCAAAATGGATTCTTAGCCACGAAGCTAATCGGCAAATATGCCGAGTATGGCGAGGAAAAAATGGGAATTGCACGGAAGGTGTTCGATAGATTGCTTGAAAAAGATGTGTTCTTATGGAACGTGGTTATTCAAGGGTATGCGAATTGGGGTCCGTTTGCTGAAGCTCTAAACCTGTATGATGAAATGCGGATTCTTCATAGCAAAATAAACAAAGCTTCAGACAAGTTCTGTTCTCTGCCAAATCTAAATCAAATAATGTATTCTCATTCCAAGCAGCATGCCTTGATGAGTTGGGTCGTTTCTTTGCCTCTGCTCGTTCTCGGTTTTCGAGGGCCTGACATTTACAAATTACCTGCACAATCAGTACAATTATACAATGAAGGATTAACTACAACATGTTCTAAGAAATCACTTGCAAAAGGTTAA

Protein sequence

MNGLHWRSSRALVSRNRPFFRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMRILHSKINKASDKFCSLPNLNQIMYSHSKQHALMSWVVSLPLLVLGFRGPDIYKLPAQSVQLYNEGLTTTCSKKSLAKG
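
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translation routine. This is a sketch only, assuming the standard nuclear genetic code; `CODON_TABLE` and `translate` are hypothetical names introduced for illustration, and the inputs are short excerpts from the start and end of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nt and last 12 nt of the CDS shown above.
print(translate("ATGAATGGGCTCCACTGGCGATCTTCG"))
print(translate("GCAAAAGGTTAA"))
```

The two excerpts correspond to the first nine residues and the last three residues of the protein sequence listed in this record.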
BLAST of Cp4.1LG08g13030 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 3.6e-10
Identity = 34/97 (35.05%), Positives = 57/97 (58.76%), Query Frame = 1

Query: 36  HRDSYDYTKLLQHCRSIRSVQELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKV 95
           H DS+ Y  L+        ++++HA+++V G   +GFL TKLI   + +G+  +  AR+V
Sbjct: 19  HSDSF-YASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHASSSFGD--ITFARQV 78

Query: 96  FDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMRI 133
           FD L    +F WN +I+GY+    F +AL +Y  M++
Sbjct: 79  FDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQL 112
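
The percentages reported in each HSP header are the match counts divided by the alignment length. A minimal Python sketch of that arithmetic, using the counts from the HSP above; `percent` is a hypothetical helper, and rounding to two decimals is an assumption based on how the values are displayed.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts from the PP224_ARATH HSP: 34 identical and 57 positive columns out of 97.
identity_pct = percent(34, 97)
positives_pct = percent(57, 97)
print(identity_pct, positives_pct)
```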

BLAST of Cp4.1LG08g13030 vs. Swiss-Prot
Match: PP116_ARATH (Pentatricopeptide repeat-containing protein At1g71490 OS=Arabidopsis thaliana GN=PCMP-E67 PE=2 SV=1)

HSP 1 Score: 65.5 bits (158), Expect = 8.1e-10
Identity = 33/96 (34.38%), Positives = 53/96 (55.21%), Query Frame = 1

Query: 38  DSYDYTKLLQHCRSIRSV---QELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARK 97
           D++ Y  +L+ C     V   + +H  I V  +  + ++   LI  Y  +    MGIAR+
Sbjct: 143 DAFTYPSVLKACGETLDVAFGRVVHGSIEVSSYKSSLYVCNALISMYKRF--RNMGIARR 202

Query: 98  VFDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEM 131
           +FDR+ E+D   WN VI  YA+ G ++EA  L+D+M
Sbjct: 203 LFDRMFERDAVSWNAVINCYASEGMWSEAFELFDKM 236

BLAST of Cp4.1LG08g13030 vs. Swiss-Prot
Match: PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana GN=CRR2 PE=2 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 1.8e-09
Identity = 35/97 (36.08%), Positives = 54/97 (55.67%), Query Frame = 1

Query: 48  HCRSIRSVQELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLW 107
           H  S+     +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  ++ +++W
Sbjct: 89  HRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLG--SVDYARKVFDKTRKRTIYVW 148

Query: 108 NVVIQGYANWGPFAEALNLYDEMRILHSKINKASDKF 145
           N + +     G   E L LY +M    ++I   SD+F
Sbjct: 149 NALFRALTLAGHGEEVLGLYWKM----NRIGVESDRF 179

BLAST of Cp4.1LG08g13030 vs. Swiss-Prot
Match: PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 63.9 bits (154), Expect = 2.3e-09
Identity = 41/161 (25.47%), Positives = 77/161 (47.83%), Query Frame = 1

Query: 3   GLHWRSSRALVSRNRPFFRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQELHAQI 62
           G   R  +++ S  R F     ++ + W      ++  D   L ++C +++S + LHA++
Sbjct: 18  GRFTRVLQSIGSVIREFSASANALQDCWKNGNESKEIDDVHTLFRYCTNLQSAKCLHARL 77

Query: 63  VVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWGPFAE 122
           VV    QN  ++ KL+  Y   G   + +AR  FD +  +DV+ WN++I GY   G  +E
Sbjct: 78  VVSKQIQNVCISAKLVNLYCYLG--NVALARHTFDHIQNRDVYAWNLMISGYGRAGNSSE 137

Query: 123 ALNLYDEMRILHSKINKASDKFCSLPNLNQIMYSHSKQHAL 164
            +  +  + +L S +      F S+    + +   +K H L
Sbjct: 138 VIRCF-SLFMLSSGLTPDYRTFPSVLKACRTVIDGNKIHCL 175

BLAST of Cp4.1LG08g13030 vs. Swiss-Prot
Match: PP303_ARATH (Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana GN=PCMP-E99 PE=3 SV=1)

HSP 1 Score: 62.4 bits (150), Expect = 6.8e-09
Identity = 33/110 (30.00%), Positives = 61/110 (55.45%), Query Frame = 1

Query: 25  SIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQ---ELHAQIVVEGHDQNGFLATKLIGKY 84
           S F+     +   D++ +  LL+ C S++ +     +H Q++V G   + ++++ L+  Y
Sbjct: 32  STFSSMLANKLLPDTFTFPSLLKACASLQRLSFGLSIHQQVLVNGFSSDFYISSSLVNLY 91

Query: 85  AEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMR 132
           A++G   +  ARKVF+ + E+DV  W  +I  Y+  G   EA +L +EMR
Sbjct: 92  AKFG--LLAHARKVFEEMRERDVVHWTAMIGCYSRAGIVGEACSLVNEMR 139

BLAST of Cp4.1LG08g13030 vs. TrEMBL
Match: A0A0A0L101_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G002500 PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 4.3e-42
Identity = 92/146 (63.01%), Positives = 110/146 (75.34%), Query Frame = 1

Query: 1   MNGLHWRSSRALVSRNRPFFRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQELHA 60
           MNGL  R+ +AL  RN+PFF       ++WS  +YHRDSY + KLL HCRSIRSVQELHA
Sbjct: 1   MNGLCRRTWQALTFRNQPFF-------SEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHA 60

Query: 61  QIVVEGHDQNGFLATKLIGKYAEY--GEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWG 120
           QI+VEG DQNGF+A KLIGKY E+  GE KMG ARKVFD L+ +DVF+WNVVIQGYA+ G
Sbjct: 61  QILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLG 120

Query: 121 PFAEALNLYDEMRILHSKINKASDKF 145
           PF EALNL+DEMR+     N+ +  F
Sbjct: 121 PFVEALNLFDEMRVSGEPTNRYTFPF 139

BLAST of Cp4.1LG08g13030 vs. TrEMBL
Match: M5WJX7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021315mg PE=4 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 2.2e-30
Identity = 69/134 (51.49%), Positives = 91/134 (67.91%), Query Frame = 1

Query: 1   MNGLHWRSSRALVSRNRPF--FRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQEL 60
           M+GL  ++   LVS   PF    F       +    Y+RDSY YT LL+HCRS +S+++L
Sbjct: 9   MSGLARQAELLLVSWREPFRLLGFFIGGVRTYMTPHYYRDSYHYTYLLKHCRSTKSIKKL 68

Query: 61  HAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWG 120
           HAQI++ G +QN F+  K++GKY E  E  M  ARKVFDRLLE+DVF+WN+VIQGYAN  
Sbjct: 69  HAQIIIGGFEQNPFVVAKIVGKYVECSEPSMETARKVFDRLLERDVFVWNMVIQGYANVE 128

Query: 121 PFAEALNLYDEMRI 133
           PF EAL +Y+ MR+
Sbjct: 129 PFVEALKMYNRMRL 142

BLAST of Cp4.1LG08g13030 vs. TrEMBL
Match: A0A059AGP0_EUCGR (Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_J02493 PE=4 SV=1)

HSP 1 Score: 134.4 bits (337), Expect = 1.6e-28
Identity = 63/107 (58.88%), Positives = 79/107 (73.83%), Query Frame = 1

Query: 38  DSYDYTKLLQHCRSIRSVQELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFD 97
           DSYDYT LL+ CRS   ++++HAQIV+ G +QN FLA KL+G Y +YG   MG ARKVFD
Sbjct: 1   DSYDYTHLLERCRSSSCLKKIHAQIVIGGFEQNPFLAAKLVGAYGDYGAPDMGDARKVFD 60

Query: 98  RLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMRILHSKINKASDKF 145
           RL ++DVFLWNV+I+GYAN GPF EALN+Y  MR+     N+ S  F
Sbjct: 61  RLSDRDVFLWNVMIRGYANLGPFEEALNVYRHMRLSCVSANRYSFPF 107

BLAST of Cp4.1LG08g13030 vs. TrEMBL
Match: W9SCS3_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_026611 PE=4 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 5.1e-27
Identity = 67/133 (50.38%), Positives = 90/133 (67.67%), Query Frame = 1

Query: 12  LVSRNRPFFRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQELHAQIVVEGHDQNG 71
           LVSR   F      +    SR   +RD+YD+T LL+ C++ ++V++LHAQI++EGH+QN 
Sbjct: 15  LVSRREIFRSSRFYVSGITSRGPCYRDAYDFTYLLEQCKTTKAVKKLHAQIIMEGHEQNP 74

Query: 72  FLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMR 131
            +A+KLIGKY       M IA+KVFD L ++DVFLWN+VIQGYAN GPF EA++LY  MR
Sbjct: 75  KVASKLIGKYIGCNNTSMEIAQKVFDSLSKRDVFLWNMVIQGYANLGPFIEAVDLYRRMR 134

Query: 132 ILHSKINKASDKF 145
           I     NK +  F
Sbjct: 135 ISGLAANKYTYPF 147

BLAST of Cp4.1LG08g13030 vs. TrEMBL
Match: D7T1S2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0264g00110 PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 8.1e-25
Identity = 53/95 (55.79%), Positives = 73/95 (76.84%), Query Frame = 1

Query: 37  RDSYDYTKLLQHCRSIRSVQELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVF 96
           R+SYDYT LLQ C+  ++++ +HAQI++ G ++N FL  KL+GKYA+  E  +  ARKVF
Sbjct: 5   RNSYDYTYLLQRCKGTKTIKSIHAQIIIGGFEENPFLGAKLVGKYAQCYESNIEDARKVF 64

Query: 97  DRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMR 132
           D L ++DVF+WN +IQGYAN GPF EALN+Y+ MR
Sbjct: 65  DCLPDRDVFVWNTIIQGYANLGPFMEALNIYEYMR 99

BLAST of Cp4.1LG08g13030 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 66.6 bits (161), Expect = 2.0e-11
Identity = 34/97 (35.05%), Positives = 57/97 (58.76%), Query Frame = 1

Query: 36  HRDSYDYTKLLQHCRSIRSVQELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKV 95
           H DS+ Y  L+        ++++HA+++V G   +GFL TKLI   + +G+  +  AR+V
Sbjct: 19  HSDSF-YASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHASSSFGD--ITFARQV 78

Query: 96  FDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMRI 133
           FD L    +F WN +I+GY+    F +AL +Y  M++
Sbjct: 79  FDDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQL 112

BLAST of Cp4.1LG08g13030 vs. TAIR10
Match: AT1G71490.1 (AT1G71490.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 65.5 bits (158), Expect = 4.5e-11
Identity = 33/96 (34.38%), Positives = 53/96 (55.21%), Query Frame = 1

Query: 38  DSYDYTKLLQHCRSIRSV---QELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARK 97
           D++ Y  +L+ C     V   + +H  I V  +  + ++   LI  Y  +    MGIAR+
Sbjct: 143 DAFTYPSVLKACGETLDVAFGRVVHGSIEVSSYKSSLYVCNALISMYKRF--RNMGIARR 202

Query: 98  VFDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEM 131
           +FDR+ E+D   WN VI  YA+ G ++EA  L+D+M
Sbjct: 203 LFDRMFERDAVSWNAVINCYASEGMWSEAFELFDKM 236

BLAST of Cp4.1LG08g13030 vs. TAIR10
Match: AT3G46790.1 (AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 64.3 bits (155), Expect = 1.0e-10
Identity = 35/97 (36.08%), Positives = 54/97 (55.67%), Query Frame = 1

Query: 48  HCRSIRSVQELHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLW 107
           H  S+     +H  I+  G DQ+ FLATKLIG Y++ G   +  ARKVFD+  ++ +++W
Sbjct: 89  HRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLG--SVDYARKVFDKTRKRTIYVW 148

Query: 108 NVVIQGYANWGPFAEALNLYDEMRILHSKINKASDKF 145
           N + +     G   E L LY +M    ++I   SD+F
Sbjct: 149 NALFRALTLAGHGEEVLGLYWKM----NRIGVESDRF 179

BLAST of Cp4.1LG08g13030 vs. TAIR10
Match: AT4G33990.1 (AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 63.9 bits (154), Expect = 1.3e-10
Identity = 41/161 (25.47%), Positives = 77/161 (47.83%), Query Frame = 1

Query: 3   GLHWRSSRALVSRNRPFFRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQELHAQI 62
           G   R  +++ S  R F     ++ + W      ++  D   L ++C +++S + LHA++
Sbjct: 18  GRFTRVLQSIGSVIREFSASANALQDCWKNGNESKEIDDVHTLFRYCTNLQSAKCLHARL 77

Query: 63  VVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWGPFAE 122
           VV    QN  ++ KL+  Y   G   + +AR  FD +  +DV+ WN++I GY   G  +E
Sbjct: 78  VVSKQIQNVCISAKLVNLYCYLG--NVALARHTFDHIQNRDVYAWNLMISGYGRAGNSSE 137

Query: 123 ALNLYDEMRILHSKINKASDKFCSLPNLNQIMYSHSKQHAL 164
            +  +  + +L S +      F S+    + +   +K H L
Sbjct: 138 VIRCF-SLFMLSSGLTPDYRTFPSVLKACRTVIDGNKIHCL 175

BLAST of Cp4.1LG08g13030 vs. TAIR10
Match: AT4G04370.1 (AT4G04370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 62.4 bits (150), Expect = 3.8e-10
Identity = 33/110 (30.00%), Positives = 61/110 (55.45%), Query Frame = 1

Query: 25  SIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQ---ELHAQIVVEGHDQNGFLATKLIGKY 84
           S F+     +   D++ +  LL+ C S++ +     +H Q++V G   + ++++ L+  Y
Sbjct: 32  STFSSMLANKLLPDTFTFPSLLKACASLQRLSFGLSIHQQVLVNGFSSDFYISSSLVNLY 91

Query: 85  AEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWGPFAEALNLYDEMR 132
           A++G   +  ARKVF+ + E+DV  W  +I  Y+  G   EA +L +EMR
Sbjct: 92  AKFG--LLAHARKVFEEMRERDVVHWTAMIGCYSRAGIVGEACSLVNEMR 139

BLAST of Cp4.1LG08g13030 vs. NCBI nr
Match: gi|659096558|ref|XP_008449159.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Cucumis melo])

HSP 1 Score: 185.3 bits (469), Expect = 1.1e-43
Identity = 96/146 (65.75%), Postives = 113/146 (77.40%), Query Frame = 1

Query: 1   MNGLHWRSSRALVSRNRPFFRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQELHA 60
           MNGL+ R+ +AL  R++PFF   RS        +YHRDSYD+ KLL HCR+IRSVQELHA
Sbjct: 1   MNGLYRRTWQALTFRSQPFFSERRS-------GKYHRDSYDFMKLLGHCRTIRSVQELHA 60

Query: 61  QIVVEGHDQNGFLATKLIGKYAEY--GEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWG 120
           QI+VEG DQNGFLATKLIGKY E   GE KMG ARKVFDRLL++DVFLWNVVIQGYA++G
Sbjct: 61  QILVEGLDQNGFLATKLIGKYVELGEGESKMGTARKVFDRLLQRDVFLWNVVIQGYASFG 120

Query: 121 PFAEALNLYDEMRILHSKINKASDKF 145
           PF EALNL+DEMR+     N+ +  F
Sbjct: 121 PFVEALNLFDEMRVSGEPTNRYTFPF 139

BLAST of Cp4.1LG08g13030 vs. NCBI nr
Match: gi|778674677|ref|XP_011650274.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Cucumis sativus])

HSP 1 Score: 179.5 bits (454), Expect = 6.1e-42
Identity = 92/146 (63.01%), Positives = 110/146 (75.34%), Query Frame = 1

Query: 1   MNGLHWRSSRALVSRNRPFFRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQELHA 60
           MNGL  R+ +AL  RN+PFF       ++WS  +YHRDSY + KLL HCRSIRSVQELHA
Sbjct: 1   MNGLCRRTWQALTFRNQPFF-------SEWSFGKYHRDSYGFIKLLGHCRSIRSVQELHA 60

Query: 61  QIVVEGHDQNGFLATKLIGKYAEY--GEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWG 120
           QI+VEG DQNGF+A KLIGKY E+  GE KMG ARKVFD L+ +DVF+WNVVIQGYA+ G
Sbjct: 61  QILVEGLDQNGFVAAKLIGKYVEHDEGEVKMGTARKVFDTLVRRDVFVWNVVIQGYASLG 120

Query: 121 PFAEALNLYDEMRILHSKINKASDKF 145
           PF EALNL+DEMR+     N+ +  F
Sbjct: 121 PFVEALNLFDEMRVSGEPTNRYTFPF 139

BLAST of Cp4.1LG08g13030 vs. NCBI nr
Match: gi|645239109|ref|XP_008225987.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Prunus mume])

HSP 1 Score: 143.3 bits (360), Expect = 4.9e-31
Identity = 74/135 (54.81%), Positives = 92/135 (68.15%), Query Frame = 1

Query: 1   MNGLHWRSSRALVSRNRPFFRFLRSIFNDWSRR---EYHRDSYDYTKLLQHCRSIRSVQE 60
           MNG   ++   LVS   PF   L   F    R     ++RDSY YT LLQHCRS +S+++
Sbjct: 1   MNGSARQAELLLVSWREPFR--LLGFFIGGVRTYMTPHYRDSYHYTYLLQHCRSTKSIKK 60

Query: 61  LHAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANW 120
           LHAQI++ G +QN F+  KL+GKY E  E  M  ARKVFDRLLE+DVF+WN+VIQGYAN 
Sbjct: 61  LHAQIIIGGFEQNPFVVAKLVGKYVECSEPSMETARKVFDRLLERDVFVWNMVIQGYANA 120

Query: 121 GPFAEALNLYDEMRI 133
           GPF EAL +YD MR+
Sbjct: 121 GPFVEALKMYDRMRL 133

BLAST of Cp4.1LG08g13030 vs. NCBI nr
Match: gi|694371571|ref|XP_009363319.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-like [Pyrus x bretschneideri])

HSP 1 Score: 141.0 bits (354), Expect = 2.4e-30
Identity = 75/145 (51.72%), Positives = 92/145 (63.45%), Query Frame = 1

Query: 1   MNGLHWRSSRALVSRNRPFFRFLRSIFNDWSRREYH-RDSYDYTKLLQHCRSIRSVQELH 60
           +NGL  R +  L+S   PF R L       +    H RDSY YT LLQHC+S  S+++LH
Sbjct: 9   INGLA-RQAEVLISLKEPF-RLLGFFIGGRTYTTLHCRDSYHYTYLLQHCKSTNSIKKLH 68

Query: 61  AQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWGP 120
           AQI   G +QN F+  KL+GKY E  E  M  ARKVFDRLLE+D F+WN+VIQGYAN GP
Sbjct: 69  AQITTGGFEQNPFVVAKLVGKYVECSESSMEAARKVFDRLLERDAFVWNMVIQGYANVGP 128

Query: 121 FAEALNLYDEMRILHSKINKASDKF 145
           F EA N+YD MR+     NK +  F
Sbjct: 129 FVEAFNVYDRMRLSGVPANKYTYPF 151

BLAST of Cp4.1LG08g13030 vs. NCBI nr
Match: gi|595883572|ref|XP_007212815.1| (hypothetical protein PRUPE_ppa021315mg [Prunus persica])

HSP 1 Score: 140.6 bits (353), Expect = 3.2e-30
Identity = 69/134 (51.49%), Positives = 91/134 (67.91%), Query Frame = 1

Query: 1   MNGLHWRSSRALVSRNRPF--FRFLRSIFNDWSRREYHRDSYDYTKLLQHCRSIRSVQEL 60
           M+GL  ++   LVS   PF    F       +    Y+RDSY YT LL+HCRS +S+++L
Sbjct: 9   MSGLARQAELLLVSWREPFRLLGFFIGGVRTYMTPHYYRDSYHYTYLLKHCRSTKSIKKL 68

Query: 61  HAQIVVEGHDQNGFLATKLIGKYAEYGEEKMGIARKVFDRLLEKDVFLWNVVIQGYANWG 120
           HAQI++ G +QN F+  K++GKY E  E  M  ARKVFDRLLE+DVF+WN+VIQGYAN  
Sbjct: 69  HAQIIIGGFEQNPFVVAKIVGKYVECSEPSMETARKVFDRLLERDVFVWNMVIQGYANVE 128

Query: 121 PFAEALNLYDEMRI 133
           PF EAL +Y+ MR+
Sbjct: 129 PFVEALKMYNRMRL 142

The following BLAST results are available for this feature:
Match Name        E-value  Identity (%)  Description
PP224_ARATH       3.6e-10  35.05         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP116_ARATH       8.1e-10  34.38         Pentatricopeptide repeat-containing protein At1g71490 OS=Arabidopsis thaliana GN...
PP265_ARATH       1.8e-09  36.08         Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
PP348_ARATH       2.3e-09  25.47         Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN...
PP303_ARATH       6.8e-09  30.00         Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana GN...

Match Name        E-value  Identity (%)  Description
A0A0A0L101_CUCSA  4.3e-42  63.01         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G002500 PE=4 SV=1
M5WJX7_PRUPE      2.2e-30  51.49         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021315mg PE=4 SV=1
A0A059AGP0_EUCGR  1.6e-28  58.88         Uncharacterized protein (Fragment) OS=Eucalyptus grandis GN=EUGRSUZ_J02493 PE=4 ...
W9SCS3_9ROSA      5.1e-27  50.38         Uncharacterized protein OS=Morus notabilis GN=L484_026611 PE=4 SV=1
D7T1S2_VITVI      8.1e-25  55.79         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0264g00110 PE=4 SV=...

Match Name        E-value  Identity (%)  Description
AT3G12770.1       2.0e-11  35.05         mitochondrial editing factor 22
AT1G71490.1       4.5e-11  34.38         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G46790.1       1.0e-10  36.08         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G33990.1       1.3e-10  25.47         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G04370.1       3.8e-10  30.00         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value  Identity (%)  Description
gi|659096558|ref|XP_008449159.1|  1.1e-43  65.75         PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-...
gi|778674677|ref|XP_011650274.1|  6.1e-42  63.01         PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-...
gi|645239109|ref|XP_008225987.1|  4.9e-31  54.81         PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-...
gi|694371571|ref|XP_009363319.1|  2.4e-30  51.72         PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic-...
gi|595883572|ref|XP_007212815.1|  3.2e-30  51.49         hypothetical protein PRUPE_ppa021315mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g13030.1  Cp4.1LG08g13030.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 106..131; score: 2.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 106..131; score: 3.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 103..137; score: 9
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 7..132; score: 3.5
None       No IPR available          PANTHER   PTHR24015:SF883  SUBFAMILY NOT NAMED  coord: 7..132; score: 3.5

The following gene(s) are paralogous to this gene:

None