Cp4.1LG03g08090 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g08090
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG03 : 2998829 .. 3000091 (+)
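The Location field packs the sequence ID, start, end and strand into one string. The sketch below shows one way to split it and derive the genomic span. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG03 : 2998829 .. 3000091 (+)")
gene_length = span_length(start, end)
print(seqid, strand, gene_length)
```

Under that coordinate assumption the feature spans 1263 positions on the plus strand of Cp4.1LG03.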
The following sequences are available for this feature:

Gene sequence (with intron)

TTCGCACCATATTTCATTATCAGGGGATATGCTTCGAGCGAATCTCCACAAGATGCCATTTCGGTATTTGGGGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTACCCCTTCCTTCTCAAGGCCTGTGCGACTCTCGCGGCGCTACAAGAAGGTAAGCAGTTTCATGCTGTTGCCATAAAGTGTGGTTTGGATTTAGATGTTTATGTTCGGAACACTCTGATTAATTTCTATGGGTCCTGCAAAAGAATGTCTGGTGCACGGAAGGTGTTCGACGAAATGTCTGAAAGAACCTTAGTTTCGTGGAATGCGATTATTACAGCGTGTGTTGAGAATTTATGCTTTGATGAAGCGATTGACTACTTTTTGAAAATGGGAAACCATGGTTTTGAGCCGGATGAAACCACAATGGTGGTTATATTATCTGCTTGTGCAGAGCTTGGTAACTTGAGCTTAGGGAGATGGGTTCATTCTCAAGTGGTGGGAAGAGGGATGGTTTGGAATGTTCAATTGGGCACTGCCTTCGTCGACATGTATGCTAAATCTGGTGATGTGGGATGTGCTAGGCTTGTATTCAATTCTTTGAAACAGAGAAGTGTATGGCCATGGAGTGCAATGATTTTGGGGCTAGCCCAACATGGATTTGCCAACGAAGCCATTGAACTTTTCACAAATATGATGAGCTCCTCTGTTGTCCCTAACTATGTCACTTTCATTGGTGTCCTATGTGCCTGCAGCCATGCCGGATTGGTGGATGAAGGCTACCACTACTTCAACATTATGGAGAGAGTTTATGGGATTAAGCCAATGATGATACATTACGGATCAATGGTGGACATCTTAGGACGTGCAGGTCGTGTCAAAGAGGCTTATGAGTTCATCATGAGCATGCATGTGCAGCCTGATCCGATCGTGTGGAGGACATTGCTAAGTGCATGCAGTGCTCGTGATGTTGATGGTGGGGCTCAGGTTGCAGAGGAGGCAAGAAAGAGACTCCTTGAGCTGGAGCCGAAGAGGGGCGGAAATGTGGTGATGGTTGCGAACATGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGCCGGAGGACGATGAAAGATAGAGGAATCAAAAAGATGGCTGGGGAGAGTTGCATCGAAGTGGGTGGCTCCTTGTGTAAATTCTTCTCGGGTTTTAATGCTCGAGCTGATTCTCATGGCATTTACGATTTGCTTGATGGATTGGACCTGCATATGCAAATCATAAACTTCTAA

mRNA sequence

TTCGCACCATATTTCATTATCAGGGGATATGCTTCGAGCGAATCTCCACAAGATGCCATTTCGGTATTTGGGGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTACCCCTTCCTTCTCAAGGCCTGTGCGACTCTCGCGGCGCTACAAGAAGAGCTTGGTAACTTGAGCTTAGGGAGATGGGTTCATTCTCAAGTGGTGGGAAGAGGGATGGTTTGGAATGTTCAATTGGGCACTGCCTTCGTCGACATGTATGCTAAATCTGGTGATGTGGGATGTGCTAGGCTTGTATTCAATTCTTTGAAACAGAGAAGTGTATGGCCATGGAGTGCAATGATTTTGGGGCTAGCCCAACATGGATTTGCCAACGAAGCCATTGAACTTTTCACAAATATGATGAGCTCCTCTGTTGTCCCTAACTATGTCACTTTCATTGGTGTCCTATGTGCCTGCAGCCATGCCGGATTGGTGGATGAAGGCTACCACTACTTCAACATTATGGAGAGAGTTTATGGGATTAAGCCAATGATGATACATTACGGATCAATGGTGGACATCTTAGGACGTGCAGGTCGTGTCAAAGAGGCTTATGAGTTCATCATGAGCATGCATGTGCAGCCTGATCCGATCGTGTGGAGGACATTGCTAAGTGCATGCAGTGCTCGTGATGTTGATGGTGGGGCTCAGGTTGCAGAGGAGGCAAGAAAGAGACTCCTTGAGCTGGAGCCGAAGAGGGGCGGAAATGTGGTGATGGTTGCGAACATGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGCCGGAGGACGATGAAAGATAGAGGAATCAAAAAGATGGCTGGGGAGAGTTGCATCGAAGTGGGTGGCTCCTTGTGTAAATTCTTCTCGGGTTTTAATGCTCGAGCTGATTCTCATGGCATTTACGATTTGCTTGATGGATTGGACCTGCATATGCAAATCATAAACTTCTAA

Coding sequence (CDS)

TTCGCACCATATTTCATTATCAGGGGATATGCTTCGAGCGAATCTCCACAAGATGCCATTTCGGTATTTGGGGAAATGCGAAGACGAGGAATCAGACCCAATAACCTCACTTACCCCTTCCTTCTCAAGGCCTGTGCGACTCTCGCGGCGCTACAAGAAGAGCTTGGTAACTTGAGCTTAGGGAGATGGGTTCATTCTCAAGTGGTGGGAAGAGGGATGGTTTGGAATGTTCAATTGGGCACTGCCTTCGTCGACATGTATGCTAAATCTGGTGATGTGGGATGTGCTAGGCTTGTATTCAATTCTTTGAAACAGAGAAGTGTATGGCCATGGAGTGCAATGATTTTGGGGCTAGCCCAACATGGATTTGCCAACGAAGCCATTGAACTTTTCACAAATATGATGAGCTCCTCTGTTGTCCCTAACTATGTCACTTTCATTGGTGTCCTATGTGCCTGCAGCCATGCCGGATTGGTGGATGAAGGCTACCACTACTTCAACATTATGGAGAGAGTTTATGGGATTAAGCCAATGATGATACATTACGGATCAATGGTGGACATCTTAGGACGTGCAGGTCGTGTCAAAGAGGCTTATGAGTTCATCATGAGCATGCATGTGCAGCCTGATCCGATCGTGTGGAGGACATTGCTAAGTGCATGCAGTGCTCGTGATGTTGATGGTGGGGCTCAGGTTGCAGAGGAGGCAAGAAAGAGACTCCTTGAGCTGGAGCCGAAGAGGGGCGGAAATGTGGTGATGGTTGCGAACATGTTTGCTGAAGTTGGGATGTGGAAGCAGGCAGCAGATTGCCGGAGGACGATGAAAGATAGAGGAATCAAAAAGATGGCTGGGGAGAGTTGCATCGAAGTGGGTGGCTCCTTGTGTAAATTCTTCTCGGGTTTTAATGCTCGAGCTGATTCTCATGGCATTTACGATTTGCTTGATGGATTGGACCTGCATATGCAAATCATAAACTTCTAA

Protein sequence

FAPYFIIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLDGLDLHMQIINF
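The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code and reading frame 1. The function name `translate` is illustrative, and only the first 30 and last 18 bases of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "TTCGCACCATATTTCATTATCAGGGGATAT"  # first 30 bases of the CDS
cds_end = "CAAATCATAAACTTCTAA"                # last 18 bases of the CDS
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to FAPYFIIRGY and QIINF, which match the first ten and last five residues of the protein sequence listed above.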
BLAST of Cp4.1LG03g08090 vs. Swiss-Prot
Match: PP188_ARATH (Pentatricopeptide repeat-containing protein At2g36730 OS=Arabidopsis thaliana GN=PCMP-E44 PE=3 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 1.8e-93
Identity = 171/294 (58.16%), Positives = 217/294 (73.81%), Query Frame = 1

Query: 23  FGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNVQLGTA 82
           F EM  +   P+  T   LL AC          GNLSLG+ VHSQV+ R +  N +LGTA
Sbjct: 202 FCEMIGKRFCPDETTMVVLLSACG---------GNLSLGKLVHSQVMVRELELNCRLGTA 261

Query: 83  FVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMS-SSVVP 142
            VDMYAKSG +  ARLVF  +  ++VW WSAMI+GLAQ+GFA EA++LF+ MM  SSV P
Sbjct: 262 LVDMYAKSGGLEYARLVFERMVDKNVWTWSAMIVGLAQYGFAEEALQLFSKMMKESSVRP 321

Query: 143 NYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKEAYEF 202
           NYVTF+GVLCACSH GLVD+GY YF+ ME+++ IKPMMIHYG+MVDILGRAGR+ EAY+F
Sbjct: 322 NYVTFLGVLCACSHTGLVDDGYKYFHEMEKIHKIKPMMIHYGAMVDILGRAGRLNEAYDF 381

Query: 203 IMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEV 262
           I  M  +PD +VWRTLLSACS    +    + E+ +KRL+ELEPKR GN+V+VAN FAE 
Sbjct: 382 IKKMPFEPDAVVWRTLLSACSIHHDEDDEGIGEKVKKRLIELEPKRSGNLVIVANRFAEA 441

Query: 263 GMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLD 316
            MW +AA+ RR MK+  +KK+AGESC+E+GGS  +FFSG++ R++   IY+LLD
Sbjct: 442 RMWAEAAEVRRVMKETKMKKIAGESCLELGGSFHRFFSGYDPRSEYVSIYELLD 486

BLAST of Cp4.1LG03g08090 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 247.3 bits (630), Expect = 2.3e-64
Identity = 122/307 (39.74%), Positives = 193/307 (62.87%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           +I G+A +  P++A++++ EM  +GI+P+  T   LL ACA       ++G L+LG+ VH
Sbjct: 193 VINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACA-------KIGALTLGKRVH 252

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
             ++  G+  N+      +D+YA+ G V  A+ +F+ +  ++   W+++I+GLA +GF  
Sbjct: 253 VYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGK 312

Query: 126 EAIELFTNMMSSS-VVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGS 185
           EAIELF  M S+  ++P  +TF+G+L ACSH G+V EG+ YF  M   Y I+P + H+G 
Sbjct: 313 EAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGC 372

Query: 186 MVDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELE 245
           MVD+L RAG+VK+AYE+I SM +QP+ ++WRTLL AC+   V G + +AE AR ++L+LE
Sbjct: 373 MVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACT---VHGDSDLAEFARIQILQLE 432

Query: 246 PKRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNAR 305
           P   G+ V+++NM+A    W      R+ M   G+KK+ G S +EVG  + +F  G  + 
Sbjct: 433 PNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSH 489

Query: 306 ADSHGIY 312
             S  IY
Sbjct: 493 PQSDAIY 489

BLAST of Cp4.1LG03g08090 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 5.2e-64
Identity = 128/310 (41.29%), Positives = 189/310 (60.97%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMR-RRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWV 65
           +I  Y  +  P +A+ VF E++ ++ ++ N +T    L ACA       ++G L LGRW+
Sbjct: 335 LISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACA-------QVGALELGRWI 394

Query: 66  HSQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFA 125
           HS +   G+  N  + +A + MY+K GD+  +R VFNS+++R V+ WSAMI GLA HG  
Sbjct: 395 HSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCG 454

Query: 126 NEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGS 185
           NEA+++F  M  ++V PN VTF  V CACSH GLVDE    F+ ME  YGI P   HY  
Sbjct: 455 NEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYAC 514

Query: 186 MVDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELE 245
           +VD+LGR+G +++A +FI +M + P   VW  LL AC    +     +AE A  RLLELE
Sbjct: 515 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACK---IHANLNLAEMACTRLLELE 574

Query: 246 PKRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNAR 305
           P+  G  V+++N++A++G W+  ++ R+ M+  G+KK  G S IE+ G + +F SG NA 
Sbjct: 575 PRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAH 634

Query: 306 ADSHGIYDLL 315
             S  +Y  L
Sbjct: 635 PMSEKVYGKL 634

BLAST of Cp4.1LG03g08090 vs. Swiss-Prot
Match: PP433_ARATH (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 242.3 bits (617), Expect = 7.5e-63
Identity = 131/317 (41.32%), Positives = 185/317 (58.36%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           +I GYA S    +AI VF  M    + P+ +T   +L ACA       +LG+L LG  + 
Sbjct: 221 VISGYAKSGRASEAIEVFQRMLMENVEPDEVTLLAVLSACA-------DLGSLELGERIC 280

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
           S V  RGM   V L  A +DMYAKSG++  A  VF  + +R+V  W+ +I GLA HG   
Sbjct: 281 SYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIAGLATHGHGA 340

Query: 126 EAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSM 185
           EA+ +F  M+ + V PN VTFI +L ACSH G VD G   FN M   YGI P + HYG M
Sbjct: 341 EALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSKYGIHPNIEHYGCM 400

Query: 186 VDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEP 245
           +D+LGRAG+++EA E I SM  + +  +W +LL   +A +V    ++ E A   L++LEP
Sbjct: 401 IDLLGRAGKLREADEVIKSMPFKANAAIWGSLL---AASNVHHDLELGERALSELIKLEP 460

Query: 246 KRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARA 305
              GN +++AN+++ +G W ++   R  MK  G+KKMAGES IEV   + KF SG     
Sbjct: 461 NNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESSIEVENRVYKFISGDLTHP 520

Query: 306 DSHGIYDLLDGLDLHMQ 323
               I+++L  +DL +Q
Sbjct: 521 QVERIHEILQEMDLQIQ 527

BLAST of Cp4.1LG03g08090 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 241.1 bits (614), Expect = 1.7e-62
Identity = 126/317 (39.75%), Positives = 186/317 (58.68%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           +I G+A +  P DA++ F +MR R ++P+  TY  ++ A A L+            +W+H
Sbjct: 409 MILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH-------AKWIH 468

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
             V+   +  NV + TA VDMYAK G +  ARL+F+ + +R V  W+AMI G   HGF  
Sbjct: 469 GVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGK 528

Query: 126 EAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSM 185
            A+ELF  M   ++ PN VTF+ V+ ACSH+GLV+ G   F +M+  Y I+  M HYG+M
Sbjct: 529 AALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAM 588

Query: 186 VDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEP 245
           VD+LGRAGR+ EA++FIM M V+P   V+  +L AC    +      AE+A +RL EL P
Sbjct: 589 VDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGAC---QIHKNVNFAEKAAERLFELNP 648

Query: 246 KRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARA 305
             GG  V++AN++    MW++    R +M  +G++K  G S +E+   +  FFSG  A  
Sbjct: 649 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 708

Query: 306 DSHGIYDLLDGLDLHMQ 323
           DS  IY  L+ L  H++
Sbjct: 709 DSKKIYAFLEKLICHIK 715

BLAST of Cp4.1LG03g08090 vs. TrEMBL
Match: A0A0A0K153_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G007910 PE=4 SV=1)

HSP 1 Score: 511.5 bits (1316), Expect = 7.4e-142
Identity = 252/309 (81.55%), Positives = 272/309 (88.03%), Query Frame = 1

Query: 18  DAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNV 77
           +AI  F +M   G  P+  T   +L ACA       ELGNLSLGRWVHSQVVGRGMV NV
Sbjct: 215 EAIDYFLKMGNHGFEPDETTMVVILSACA-------ELGNLSLGRWVHSQVVGRGMVLNV 274

Query: 78  QLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMSS 137
           QLGTAFVDMYAKSGDVGCAR VFN LKQ+SVW WSAMILGLAQHGFANEAIELFTNMMSS
Sbjct: 275 QLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGFANEAIELFTNMMSS 334

Query: 138 SVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKE 197
            +VPN+VTFIGVLCACSHAGLVD+ YHYFN+MERVYGIKPMMIHYGSMVD+LGRAG+VKE
Sbjct: 335 PIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKE 394

Query: 198 AYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANM 257
           AYE IMSM V+PDPIVWRTLLSACS RDV+GGA+VAEEARKRLLELEPKRGGNVVMVAN 
Sbjct: 395 AYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANK 454

Query: 258 FAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLDGL 317
           FAE+GMWKQAAD RRTMKDRGIKKMAGESCIE+GGSL KFFSGF++RA   GIYDLLDGL
Sbjct: 455 FAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLDGL 514

Query: 318 DLHMQIINF 327
           +LHMQ+ NF
Sbjct: 515 NLHMQLTNF 516

BLAST of Cp4.1LG03g08090 vs. TrEMBL
Match: M5WZ10_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004522mg PE=4 SV=1)

HSP 1 Score: 406.0 bits (1042), Expect = 4.4e-110
Identity = 195/308 (63.31%), Positives = 241/308 (78.25%), Query Frame = 1

Query: 18  DAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNV 77
           + I  F +MR  G  P+  T   +L A +       ELGNLSLG+WVHSQV+ +G++ N 
Sbjct: 204 EGIGYFVKMRDCGFEPDETTMVVMLNASS-------ELGNLSLGKWVHSQVIEKGLILNC 263

Query: 78  QLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMSS 137
           QLGTA VDMYAKSG +  ARLVF+ ++ R+VW WSAMILGLAQHGFA EA+ELF  M++ 
Sbjct: 264 QLGTALVDMYAKSGALVYARLVFDRMELRNVWTWSAMILGLAQHGFAKEALELFPKMLNF 323

Query: 138 SVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKE 197
           SV PNYVTF+GVLCACSHAG VD+GY YF+ ME V+GIKPMMIHYG+MVDILGRAGR+ E
Sbjct: 324 SVRPNYVTFLGVLCACSHAGQVDDGYQYFHDMEHVHGIKPMMIHYGAMVDILGRAGRLNE 383

Query: 198 AYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANM 257
           AY FIMSM   PDPIVWRTLLSAC+ RD +    V  +  ++LLELEP RGGN+VMVANM
Sbjct: 384 AYSFIMSMPFDPDPIVWRTLLSACNTRDANDDEGVGNKVSEKLLELEPSRGGNLVMVANM 443

Query: 258 FAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLDGL 317
           +AEVGMW++AA+ R+ MK+R +KK AGESC+E+GGS+ KFFSG+++RAD  GIY LLD L
Sbjct: 444 YAEVGMWEKAANLRKVMKERRVKKTAGESCVELGGSIHKFFSGYDSRADYEGIYQLLDVL 503

Query: 318 DLHMQIIN 326
            LHM+++N
Sbjct: 504 SLHMELVN 504

BLAST of Cp4.1LG03g08090 vs. TrEMBL
Match: B9T0U0_RICCO (Cell division protein ftsH, putative OS=Ricinus communis GN=RCOM_0340700 PE=3 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.1e-108
Identity = 193/308 (62.66%), Positives = 238/308 (77.27%), Query Frame = 1

Query: 18   DAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNV 77
            +AI  F +MR  G  P+  T   +L  CA       E+GNL LGRW+HSQV+ RG+V N 
Sbjct: 848  EAIRYFLKMRDFGFEPDGTTMVLMLVICA-------EMGNLGLGRWIHSQVIERGLVLNY 907

Query: 78   QLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMSS 137
            QLGTA VDMYAKSG VG A+LVF+ +K+++VW WSAMILGLAQHGFA E +ELF +MM S
Sbjct: 908  QLGTALVDMYAKSGAVGYAKLVFDRMKEKNVWTWSAMILGLAQHGFAKEGLELFLDMMRS 967

Query: 138  SVV-PNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVK 197
            S++ PNYVTF+GVLCACSHAGLV +G+ YF+ M   YGIKPMM+HYG+MVDILGRAG +K
Sbjct: 968  SLIHPNYVTFLGVLCACSHAGLVSDGFRYFHEMGHTYGIKPMMVHYGAMVDILGRAGLLK 1027

Query: 198  EAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVAN 257
            EAY FI  M  QPDPIVWRTLLSACS  DV     VA + RKRLLELEP+R GN VMVAN
Sbjct: 1028 EAYNFITKMPFQPDPIVWRTLLSACSIHDVKDSTGVAYKVRKRLLELEPRRSGNFVMVAN 1087

Query: 258  MFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLDG 317
            M+A+ GMW++AA  RR M+D G+KK AGESC+E+ GS+ +FFSG++++ D  G+Y LLDG
Sbjct: 1088 MYADAGMWEKAAKVRRVMRDGGLKKKAGESCVELSGSIHRFFSGYDSQDDKEGMYQLLDG 1147

Query: 318  LDLHMQII 325
            L+LHMQ++
Sbjct: 1148 LNLHMQMM 1148

BLAST of Cp4.1LG03g08090 vs. TrEMBL
Match: A5AIR6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032528 PE=4 SV=1)

HSP 1 Score: 397.1 bits (1019), Expect = 2.0e-107
Identity = 193/320 (60.31%), Positives = 241/320 (75.31%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           ++     +E   D+  +F +MR  G  P+  T   LL AC+       ELGNLS GRWVH
Sbjct: 191 VLSACVDNEWLNDSFGLFVKMRGSGFDPDETTMVILLSACS-------ELGNLSFGRWVH 250

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
           SQV+ +GMV N +LGTA VDMYAK G V  A LVF+ + +R+VW WSAMILGLAQHGFA 
Sbjct: 251 SQVIEKGMVVNCRLGTALVDMYAKCGAVCEASLVFHRMLERNVWTWSAMILGLAQHGFAK 310

Query: 126 EAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSM 185
           EA+ELF  M  SS+ PNYVTF+GVLCACSHAGLVD+GY +F+ ME V+GI+PMMIHYG+M
Sbjct: 311 EALELFPKMKQSSISPNYVTFLGVLCACSHAGLVDDGYRFFHDMEYVHGIEPMMIHYGAM 370

Query: 186 VDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEP 245
           VDIL RAGR+KEAY FI++M V+ DP+VWRTLLSAC+   ++    V ++ RKRLLELEP
Sbjct: 371 VDILSRAGRLKEAYNFILNMPVEADPVVWRTLLSACTIHGINDNDGVGDKVRKRLLELEP 430

Query: 246 KRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARA 305
           +R GN VMVANM+AEVG W +AA+ RRTMKD G+KKMAGESCIEVGGS+ +FFSG + + 
Sbjct: 431 RRSGNFVMVANMYAEVGKWDKAANVRRTMKDTGLKKMAGESCIEVGGSIHRFFSGDDPQL 490

Query: 306 DSHGIYDLLDGLDLHMQIIN 326
           D   +  LLDGL+LHM+ ++
Sbjct: 491 DCEDVLQLLDGLNLHMKTVS 503

BLAST of Cp4.1LG03g08090 vs. TrEMBL
Match: F6H5S7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0108g00600 PE=4 SV=1)

HSP 1 Score: 397.1 bits (1019), Expect = 2.0e-107
Identity = 193/320 (60.31%), Positives = 241/320 (75.31%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           ++     +E   D+  +F +MR  G  P+  T   LL AC+       ELGNLS GRWVH
Sbjct: 195 VLSACVDNEWLNDSFGLFVKMRGSGFDPDETTMVILLSACS-------ELGNLSFGRWVH 254

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
           SQV+ +GMV N +LGTA VDMYAK G V  A LVF+ + +R+VW WSAMILGLAQHGFA 
Sbjct: 255 SQVIEKGMVVNCRLGTALVDMYAKCGAVCEASLVFHRMLERNVWTWSAMILGLAQHGFAK 314

Query: 126 EAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSM 185
           EA+ELF  M  SS+ PNYVTF+GVLCACSHAGLVD+GY +F+ ME V+GI+PMMIHYG+M
Sbjct: 315 EALELFPKMKQSSISPNYVTFLGVLCACSHAGLVDDGYRFFHDMEYVHGIEPMMIHYGAM 374

Query: 186 VDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEP 245
           VDIL RAGR+KEAY FI++M V+ DP+VWRTLLSAC+   ++    V ++ RKRLLELEP
Sbjct: 375 VDILSRAGRLKEAYNFILNMPVEADPVVWRTLLSACTIHGINDNDGVGDKVRKRLLELEP 434

Query: 246 KRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARA 305
           +R GN VMVANM+AEVG W +AA+ RRTMKD G+KKMAGESCIEVGGS+ +FFSG + + 
Sbjct: 435 RRSGNFVMVANMYAEVGKWDKAANVRRTMKDTGLKKMAGESCIEVGGSIHRFFSGDDPQL 494

Query: 306 DSHGIYDLLDGLDLHMQIIN 326
           D   +  LLDGL+LHM+ ++
Sbjct: 495 DCEDVLQLLDGLNLHMKTVS 507

BLAST of Cp4.1LG03g08090 vs. TAIR10
Match: AT2G36730.1 (AT2G36730.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 344.0 bits (881), Expect = 1.0e-94
Identity = 171/294 (58.16%), Positives = 217/294 (73.81%), Query Frame = 1

Query: 23  FGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNVQLGTA 82
           F EM  +   P+  T   LL AC          GNLSLG+ VHSQV+ R +  N +LGTA
Sbjct: 202 FCEMIGKRFCPDETTMVVLLSACG---------GNLSLGKLVHSQVMVRELELNCRLGTA 261

Query: 83  FVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMS-SSVVP 142
            VDMYAKSG +  ARLVF  +  ++VW WSAMI+GLAQ+GFA EA++LF+ MM  SSV P
Sbjct: 262 LVDMYAKSGGLEYARLVFERMVDKNVWTWSAMIVGLAQYGFAEEALQLFSKMMKESSVRP 321

Query: 143 NYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKEAYEF 202
           NYVTF+GVLCACSH GLVD+GY YF+ ME+++ IKPMMIHYG+MVDILGRAGR+ EAY+F
Sbjct: 322 NYVTFLGVLCACSHTGLVDDGYKYFHEMEKIHKIKPMMIHYGAMVDILGRAGRLNEAYDF 381

Query: 203 IMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANMFAEV 262
           I  M  +PD +VWRTLLSACS    +    + E+ +KRL+ELEPKR GN+V+VAN FAE 
Sbjct: 382 IKKMPFEPDAVVWRTLLSACSIHHDEDDEGIGEKVKKRLIELEPKRSGNLVIVANRFAEA 441

Query: 263 GMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLD 316
            MW +AA+ RR MK+  +KK+AGESC+E+GGS  +FFSG++ R++   IY+LLD
Sbjct: 442 RMWAEAAEVRRVMKETKMKKIAGESCLELGGSFHRFFSGYDPRSEYVSIYELLD 486

BLAST of Cp4.1LG03g08090 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 247.3 bits (630), Expect = 1.3e-65
Identity = 122/307 (39.74%), Positives = 193/307 (62.87%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           +I G+A +  P++A++++ EM  +GI+P+  T   LL ACA       ++G L+LG+ VH
Sbjct: 193 VINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACA-------KIGALTLGKRVH 252

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
             ++  G+  N+      +D+YA+ G V  A+ +F+ +  ++   W+++I+GLA +GF  
Sbjct: 253 VYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGK 312

Query: 126 EAIELFTNMMSSS-VVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGS 185
           EAIELF  M S+  ++P  +TF+G+L ACSH G+V EG+ YF  M   Y I+P + H+G 
Sbjct: 313 EAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGC 372

Query: 186 MVDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELE 245
           MVD+L RAG+VK+AYE+I SM +QP+ ++WRTLL AC+   V G + +AE AR ++L+LE
Sbjct: 373 MVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACT---VHGDSDLAEFARIQILQLE 432

Query: 246 PKRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNAR 305
           P   G+ V+++NM+A    W      R+ M   G+KK+ G S +EVG  + +F  G  + 
Sbjct: 433 PNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSH 489

Query: 306 ADSHGIY 312
             S  IY
Sbjct: 493 PQSDAIY 489

BLAST of Cp4.1LG03g08090 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 246.1 bits (627), Expect = 2.9e-65
Identity = 128/310 (41.29%), Positives = 189/310 (60.97%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMR-RRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWV 65
           +I  Y  +  P +A+ VF E++ ++ ++ N +T    L ACA       ++G L LGRW+
Sbjct: 335 LISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACA-------QVGALELGRWI 394

Query: 66  HSQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFA 125
           HS +   G+  N  + +A + MY+K GD+  +R VFNS+++R V+ WSAMI GLA HG  
Sbjct: 395 HSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCG 454

Query: 126 NEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGS 185
           NEA+++F  M  ++V PN VTF  V CACSH GLVDE    F+ ME  YGI P   HY  
Sbjct: 455 NEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYAC 514

Query: 186 MVDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELE 245
           +VD+LGR+G +++A +FI +M + P   VW  LL AC    +     +AE A  RLLELE
Sbjct: 515 IVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACK---IHANLNLAEMACTRLLELE 574

Query: 246 PKRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNAR 305
           P+  G  V+++N++A++G W+  ++ R+ M+  G+KK  G S IE+ G + +F SG NA 
Sbjct: 575 PRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAH 634

Query: 306 ADSHGIYDLL 315
             S  +Y  L
Sbjct: 635 PMSEKVYGKL 634

BLAST of Cp4.1LG03g08090 vs. TAIR10
Match: AT5G56310.1 (AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 242.3 bits (617), Expect = 4.2e-64
Identity = 131/317 (41.32%), Positives = 185/317 (58.36%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           +I GYA S    +AI VF  M    + P+ +T   +L ACA       +LG+L LG  + 
Sbjct: 221 VISGYAKSGRASEAIEVFQRMLMENVEPDEVTLLAVLSACA-------DLGSLELGERIC 280

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
           S V  RGM   V L  A +DMYAKSG++  A  VF  + +R+V  W+ +I GLA HG   
Sbjct: 281 SYVDHRGMNRAVSLNNAVIDMYAKSGNITKALDVFECVNERNVVTWTTIIAGLATHGHGA 340

Query: 126 EAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSM 185
           EA+ +F  M+ + V PN VTFI +L ACSH G VD G   FN M   YGI P + HYG M
Sbjct: 341 EALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKRLFNSMRSKYGIHPNIEHYGCM 400

Query: 186 VDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEP 245
           +D+LGRAG+++EA E I SM  + +  +W +LL   +A +V    ++ E A   L++LEP
Sbjct: 401 IDLLGRAGKLREADEVIKSMPFKANAAIWGSLL---AASNVHHDLELGERALSELIKLEP 460

Query: 246 KRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARA 305
              GN +++AN+++ +G W ++   R  MK  G+KKMAGES IEV   + KF SG     
Sbjct: 461 NNSGNYMLLANLYSNLGRWDESRMMRNMMKGIGVKKMAGESSIEVENRVYKFISGDLTHP 520

Query: 306 DSHGIYDLLDGLDLHMQ 323
               I+++L  +DL +Q
Sbjct: 521 QVERIHEILQEMDLQIQ 527

BLAST of Cp4.1LG03g08090 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 241.1 bits (614), Expect = 9.4e-64
Identity = 126/317 (39.75%), Positives = 186/317 (58.68%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVH 65
           +I G+A +  P DA++ F +MR R ++P+  TY  ++ A A L+            +W+H
Sbjct: 409 MILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHH-------AKWIH 468

Query: 66  SQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFAN 125
             V+   +  NV + TA VDMYAK G +  ARL+F+ + +R V  W+AMI G   HGF  
Sbjct: 469 GVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGK 528

Query: 126 EAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSM 185
            A+ELF  M   ++ PN VTF+ V+ ACSH+GLV+ G   F +M+  Y I+  M HYG+M
Sbjct: 529 AALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAM 588

Query: 186 VDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEP 245
           VD+LGRAGR+ EA++FIM M V+P   V+  +L AC    +      AE+A +RL EL P
Sbjct: 589 VDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGAC---QIHKNVNFAEKAAERLFELNP 648

Query: 246 KRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARA 305
             GG  V++AN++    MW++    R +M  +G++K  G S +E+   +  FFSG  A  
Sbjct: 649 DDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHP 708

Query: 306 DSHGIYDLLDGLDLHMQ 323
           DS  IY  L+ L  H++
Sbjct: 709 DSKKIYAFLEKLICHIK 715

BLAST of Cp4.1LG03g08090 vs. NCBI nr
Match: gi|778722801|ref|XP_011658571.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X2 [Cucumis sativus])

HSP 1 Score: 577.0 bits (1486), Expect = 2.1e-161
Identity = 283/335 (84.48%), Positives = 305/335 (91.04%), Query Frame = 1

Query: 5   FIIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQE----------- 64
           FIIRGY+SS+SPQ+AIS+FGEMRRRG+RPNNLT+PFLLKACATLA LQE           
Sbjct: 101 FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEDETTMVVILSA 160

Query: 65  --ELGNLSLGRWVHSQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPW 124
             ELGNLSLGRWVHSQVVGRGMV NVQLGTAFVDMYAKSGDVGCAR VFN LKQ+SVW W
Sbjct: 161 CAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTW 220

Query: 125 SAMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMER 184
           SAMILGLAQHGFANEAIELFTNMMSS +VPN+VTFIGVLCACSHAGLVD+ YHYFN+MER
Sbjct: 221 SAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMER 280

Query: 185 VYGIKPMMIHYGSMVDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQ 244
           VYGIKPMMIHYGSMVD+LGRAG+VKEAYE IMSM V+PDPIVWRTLLSACS RDV+GGA+
Sbjct: 281 VYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAE 340

Query: 245 VAEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVG 304
           VAEEARKRLLELEPKRGGNVVMVAN FAE+GMWKQAAD RRTMKDRGIKKMAGESCIE+G
Sbjct: 341 VAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELG 400

Query: 305 GSLCKFFSGFNARADSHGIYDLLDGLDLHMQIINF 327
           GSL KFFSGF++RA   GIYDLLDGL+LHMQ+ NF
Sbjct: 401 GSLRKFFSGFDSRAAPDGIYDLLDGLNLHMQLTNF 435

BLAST of Cp4.1LG03g08090 vs. NCBI nr
Match: gi|659094371|ref|XP_008448024.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X2 [Cucumis melo])

HSP 1 Score: 572.0 bits (1473), Expect = 6.6e-160
Identity = 279/334 (83.53%), Positives = 302/334 (90.42%), Query Frame = 1

Query: 6   IIRGYASSESPQDAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQE------------ 65
           IIRGY+SS+SP++AIS+FGEMRRRG+ PNNLT+PFLLKACATLA LQE            
Sbjct: 102 IIRGYSSSDSPREAISLFGEMRRRGVIPNNLTFPFLLKACATLATLQEDETTMVVILSAC 161

Query: 66  -ELGNLSLGRWVHSQVVGRGMVWNVQLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWS 125
            ELGNLSLGRWVHSQVVGRGMV N+QLGTAFVDMYAKSGDVGCAR VFN LKQ+SVW WS
Sbjct: 162 AELGNLSLGRWVHSQVVGRGMVLNIQLGTAFVDMYAKSGDVGCARRVFNCLKQKSVWTWS 221

Query: 126 AMILGLAQHGFANEAIELFTNMMSSSVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERV 185
           AMILGLAQHGFANEAIELFTNM SS +VPNYVTF+GVLCACSHAGLVD+ YHYFN+MERV
Sbjct: 222 AMILGLAQHGFANEAIELFTNMKSSPIVPNYVTFVGVLCACSHAGLVDKSYHYFNVMERV 281

Query: 186 YGIKPMMIHYGSMVDILGRAGRVKEAYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQV 245
           YGIKPMMIHYG MVD+LGRAG+VKEAYE IMSM V+PDP+VWRTLLSACS RDV+GGA+V
Sbjct: 282 YGIKPMMIHYGLMVDVLGRAGQVKEAYELIMSMPVEPDPVVWRTLLSACSGRDVNGGAEV 341

Query: 246 AEEARKRLLELEPKRGGNVVMVANMFAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGG 305
           AEEARKRLLELEPKRGGNVVMVAN FAEVGMWKQAAD RRTMKDRGIKKMAGESCIE+GG
Sbjct: 342 AEEARKRLLELEPKRGGNVVMVANKFAEVGMWKQAADYRRTMKDRGIKKMAGESCIELGG 401

Query: 306 SLCKFFSGFNARADSHGIYDLLDGLDLHMQIINF 327
           SL KFFSGFN+RA S GIYDLLDGL+LHMQ+ NF
Sbjct: 402 SLRKFFSGFNSRAASDGIYDLLDGLNLHMQLTNF 435

BLAST of Cp4.1LG03g08090 vs. NCBI nr
Match: gi|659094369|ref|XP_008448023.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cucumis melo])

HSP 1 Score: 513.5 bits (1321), Expect = 2.8e-142
Identity = 251/309 (81.23%), Positives = 271/309 (87.70%), Query Frame = 1

Query: 18  DAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNV 77
           +AI  F +M   G  P+  T   +L ACA       ELGNLSLGRWVHSQVVGRGMV N+
Sbjct: 215 EAIDYFLKMGNHGFEPDETTMVVILSACA-------ELGNLSLGRWVHSQVVGRGMVLNI 274

Query: 78  QLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMSS 137
           QLGTAFVDMYAKSGDVGCAR VFN LKQ+SVW WSAMILGLAQHGFANEAIELFTNM SS
Sbjct: 275 QLGTAFVDMYAKSGDVGCARRVFNCLKQKSVWTWSAMILGLAQHGFANEAIELFTNMKSS 334

Query: 138 SVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKE 197
            +VPNYVTF+GVLCACSHAGLVD+ YHYFN+MERVYGIKPMMIHYG MVD+LGRAG+VKE
Sbjct: 335 PIVPNYVTFVGVLCACSHAGLVDKSYHYFNVMERVYGIKPMMIHYGLMVDVLGRAGQVKE 394

Query: 198 AYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANM 257
           AYE IMSM V+PDP+VWRTLLSACS RDV+GGA+VAEEARKRLLELEPKRGGNVVMVAN 
Sbjct: 395 AYELIMSMPVEPDPVVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANK 454

Query: 258 FAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLDGL 317
           FAEVGMWKQAAD RRTMKDRGIKKMAGESCIE+GGSL KFFSGFN+RA S GIYDLLDGL
Sbjct: 455 FAEVGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFNSRAASDGIYDLLDGL 514

Query: 318 DLHMQIINF 327
           +LHMQ+ NF
Sbjct: 515 NLHMQLTNF 516

BLAST of Cp4.1LG03g08090 vs. NCBI nr
Match: gi|449461643|ref|XP_004148551.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cucumis sativus])

HSP 1 Score: 511.5 bits (1316), Expect = 1.1e-141
Identity = 252/309 (81.55%), Positives = 272/309 (88.03%), Query Frame = 1

Query: 18  DAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNV 77
           +AI  F +M   G  P+  T   +L ACA       ELGNLSLGRWVHSQVVGRGMV NV
Sbjct: 215 EAIDYFLKMGNHGFEPDETTMVVILSACA-------ELGNLSLGRWVHSQVVGRGMVLNV 274

Query: 78  QLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMSS 137
           QLGTAFVDMYAKSGDVGCAR VFN LKQ+SVW WSAMILGLAQHGFANEAIELFTNMMSS
Sbjct: 275 QLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGFANEAIELFTNMMSS 334

Query: 138 SVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKE 197
            +VPN+VTFIGVLCACSHAGLVD+ YHYFN+MERVYGIKPMMIHYGSMVD+LGRAG+VKE
Sbjct: 335 PIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKE 394

Query: 198 AYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANM 257
           AYE IMSM V+PDPIVWRTLLSACS RDV+GGA+VAEEARKRLLELEPKRGGNVVMVAN 
Sbjct: 395 AYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANK 454

Query: 258 FAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLDGL 317
           FAE+GMWKQAAD RRTMKDRGIKKMAGESCIE+GGSL KFFSGF++RA   GIYDLLDGL
Sbjct: 455 FAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLDGL 514

Query: 318 DLHMQIINF 327
           +LHMQ+ NF
Sbjct: 515 NLHMQLTNF 516
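
The percentages on each HSP summary line follow directly from the reported fractions (for example, 252/309 for identity in the hit above). As a minimal sketch, assuming the summary line keeps the "Label = a/b (p%)" layout shown on this page, the values can be re-derived as below. The helper names parse_hsp_stats and recompute_percent are hypothetical and not part of any BLAST tool or of this database.

```python
import re

# Hypothetical helper pattern: matches "Label = a/b (p%)" pairs on an HSP summary line.
# It does not depend on how the label is spelled.
PAIR = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    stats = {}
    for label, num, den, pct in PAIR.findall(line):
        stats[label] = (int(num), int(den), float(pct))
    return stats


def recompute_percent(num, den):
    """Recompute a percentage from its fraction, rounded to two decimals."""
    return round(100.0 * num / den, 2)


# Summary line copied from the Cucumis sativus hit above.
line = "Identity = 252/309 (81.55%), Positives = 272/309 (88.03%), Query Frame = 1"
stats = parse_hsp_stats(line)

for label, (num, den, reported) in stats.items():
    # Print the recomputed value next to the reported one for comparison.
    print(label, recompute_percent(num, den), reported)
```

The same check can be run on the Prunus persica hit (195/308 and 241/308) to confirm that the reported percentages are simple ratios over the alignment length, gaps included.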

BLAST of Cp4.1LG03g08090 vs. NCBI nr
Match: gi|595973476|ref|XP_007217715.1| (hypothetical protein PRUPE_ppa004522mg [Prunus persica])

HSP 1 Score: 406.0 bits (1042), Expect = 6.3e-110
Identity = 195/308 (63.31%), Positives = 241/308 (78.25%), Query Frame = 1

Query: 18  DAISVFGEMRRRGIRPNNLTYPFLLKACATLAALQEELGNLSLGRWVHSQVVGRGMVWNV 77
           + I  F +MR  G  P+  T   +L A +       ELGNLSLG+WVHSQV+ +G++ N 
Sbjct: 204 EGIGYFVKMRDCGFEPDETTMVVMLNASS-------ELGNLSLGKWVHSQVIEKGLILNC 263

Query: 78  QLGTAFVDMYAKSGDVGCARLVFNSLKQRSVWPWSAMILGLAQHGFANEAIELFTNMMSS 137
           QLGTA VDMYAKSG +  ARLVF+ ++ R+VW WSAMILGLAQHGFA EA+ELF  M++ 
Sbjct: 264 QLGTALVDMYAKSGALVYARLVFDRMELRNVWTWSAMILGLAQHGFAKEALELFPKMLNF 323

Query: 138 SVVPNYVTFIGVLCACSHAGLVDEGYHYFNIMERVYGIKPMMIHYGSMVDILGRAGRVKE 197
           SV PNYVTF+GVLCACSHAG VD+GY YF+ ME V+GIKPMMIHYG+MVDILGRAGR+ E
Sbjct: 324 SVRPNYVTFLGVLCACSHAGQVDDGYQYFHDMEHVHGIKPMMIHYGAMVDILGRAGRLNE 383

Query: 198 AYEFIMSMHVQPDPIVWRTLLSACSARDVDGGAQVAEEARKRLLELEPKRGGNVVMVANM 257
           AY FIMSM   PDPIVWRTLLSAC+ RD +    V  +  ++LLELEP RGGN+VMVANM
Sbjct: 384 AYSFIMSMPFDPDPIVWRTLLSACNTRDANDDEGVGNKVSEKLLELEPSRGGNLVMVANM 443

Query: 258 FAEVGMWKQAADCRRTMKDRGIKKMAGESCIEVGGSLCKFFSGFNARADSHGIYDLLDGL 317
           +AEVGMW++AA+ R+ MK+R +KK AGESC+E+GGS+ KFFSG+++RAD  GIY LLD L
Sbjct: 444 YAEVGMWEKAANLRKVMKERRVKKTAGESCVELGGSIHKFFSGYDSRADYEGIYQLLDVL 503

Query: 318 DLHMQIIN 326
            LHM+++N
Sbjct: 504 SLHMELVN 504

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
PP188_ARATH	1.8e-93	58.16	Pentatricopeptide repeat-containing protein At2g36730 OS=Arabidopsis thaliana GN...
PP330_ARATH	2.3e-64	39.74	Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP175_ARATH	5.2e-64	41.29	Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP433_ARATH	7.5e-63	41.32	Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana GN...
PPR32_ARATH	1.7e-62	39.75	Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...

Match Name	E-value	Identity (%)	Description
A0A0A0K153_CUCSA	7.4e-142	81.55	Uncharacterized protein OS=Cucumis sativus GN=Csa_7G007910 PE=4 SV=1
M5WZ10_PRUPE	4.4e-110	63.31	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004522mg PE=4 SV=1
B9T0U0_RICCO	1.1e-108	62.66	Cell division protein ftsH, putative OS=Ricinus communis GN=RCOM_0340700 PE=3 SV...
A5AIR6_VITVI	2.0e-107	60.31	Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_032528 PE=4 SV=1
F6H5S7_VITVI	2.0e-107	60.31	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0108g00600 PE=4 SV=...

Match Name	E-value	Identity (%)	Description
AT2G36730.1	1.0e-94	58.16	Pentatricopeptide repeat (PPR) superfamily protein
AT4G21065.1	1.3e-65	39.74	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1	2.9e-65	41.29	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G56310.1	4.2e-64	41.32	Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1	9.4e-64	39.75	Pentatricopeptide repeat (PPR) superfamily protein

Match Name	E-value	Identity (%)	Description
gi|778722801|ref|XP_011658571.1|	2.1e-161	84.48	PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X2 [Cuc...
gi|659094371|ref|XP_008448024.1|	6.6e-160	83.53	PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X2 [Cuc...
gi|659094369|ref|XP_008448023.1|	2.8e-142	81.23	PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cuc...
gi|449461643|ref|XP_004148551.1|	1.1e-141	81.55	PREDICTED: pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cuc...
gi|595973476|ref|XP_007217715.1|	6.3e-110	63.31	hypothetical protein PRUPE_ppa004522mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
biological_process	GO:0051301	cell division
biological_process	GO:0006508	proteolysis
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0016020	membrane
molecular_function	GO:0003674	molecular_function
molecular_function	GO:0005524	ATP binding
molecular_function	GO:0004222	metalloendopeptidase activity
molecular_function	GO:0008568	microtubule-severing ATPase activity
molecular_function	GO:0016787	hydrolase activity
molecular_function	GO:0005515	protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG03g08090.1	Cp4.1LG03g08090.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 144..171, score: 0.56; coord: 111..137, score: 4.2E-4; coord: 181..206, score: 0.
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 6..46, score: 5.
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 6..34, score: 4.1E-6; coord: 111..142, score: 8.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 178..208, score: 6.774; coord: 247..281, score: 5.601; coord: 76..106, score: 5.93; coord: 1..33, score: 9.142; coord: 142..177, score: 6.719; coord: 107..141, score: 10
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 5..288, score: 3.7E
None	No IPR available	PANTHER	PTHR24015:SF36	SUBFAMILY NOT NAMED	coord: 5..288, score: 3.7E

The following gene(s) are paralogous to this gene:
Gene	Paralogue	Organism	Block
Cp4.1LG03g08090	Cp4.1LG04g01560	Cucurbita pepo (Zucchini)	cpecpeB475