Cp4.1LG04g03350 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g03350
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG04 : 7344636 .. 7346706 (+)
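
The location string can be unpacked into its parts, and the span length computed, with a short script. This is a minimal sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG04 : 7344636 .. 7346706 (+)")
# Assumption: 1-based inclusive coordinates, so the span covers end - start + 1 bases.
span_length = end - start + 1
print(seqid, start, end, strand, span_length)
```

Under that assumption the feature spans 2,071 bp on the plus strand of Cp4.1LG04.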
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGACTCTTCATCAAACGCCATTGCCGTAGTTTCACCTCTCAAAATCTAAAGAATTCGACACACCCACCAACAAAAGAGTCTCAAATTCTCCAATTTTGTGGCTCAGACCTACTCCACGATGCTCTACACACCCTAAACTCCCTCGATTCCTTCGATTCCACCACCAACAAATCCATTCTCTACGCTTCTCTCTTGCAAACCTGCACCAAGGTCGCTTCCTTCACCCATGGCCGTCAAATTCATGCCCATGTTCTTAAATCTGGCCTTGAAGCTGACCGGTTTGTTGGGAATAGCTTACTTTCTCTTTACTTTAAATTGGGTTCAGATTTCCGGCTGACTCGAAGAGTATTCGATGGTCTTTTTGTGAAGGATGTGGTGTCCTGGACGTCCATGATTACGAGCTATGTTCGAGAAGGTAAACCTGGTAATGCGATCGAATTGTTTTGGGATATGTTGGATTTAGGGATTGAGCCTAATGGGTTTACTTTATCGGCTGTGATCAAGGCGTGTTCTGAGATTGGGAATTTGGTTCTTGGTCGATGCTTTCATGGGTTAGTTGTAAGGCATGGATTCAATTCAAATCATGTCATTGTGAGTTCTTTGATAGACATGTACGGGAGAAACTTTGCTTCGAGCGATGCGCGCCAACTGTTCGATGAAATGCCTGAACCAGATGCAATATGTTGGACCTCTGTTATTTCAGCTCTTACAAGGAATGATTTGTATGAGGATGCATTGGGTTTCTTTTATTTGATGCTAAGAACTTATAGTTTGTCTCCTGATGGTTTTACATTTGGTTCTGTATTGACTGCCTGTGCTAATTTGGGGAGGTTGAGGCAAGGTGAAGAAGTTCATGCTAAGGTGATTGCGCATGGACTTGGTGGGAATGTGGTGGTTGAGAGTAGTCTGGTGGATATGTATGGGAAATGTGGAGCGGTTGAGAAGTCCCAACGCGTGTTCGATAGGATGTCGAAGAGGAACTCGGTTTCGTGGTCTGCATTGCTTGGAGTATATTGCCAAAATGGAGACTTTGAAAAGGTTATAAATATCTTTAGGGGAATGGAGAAGATCGACCTTTACAGTTTCGGAACGGTTATTCGTGCGTGTGCTGGGTTAGCAGCTGTAACTCAAGGGAAGGAGGTTCACTGTCAGTATGTAAGAAAGGGTGGATGGAGAGATGTCATTGTGGAATCAGCTCTAGTCGACTTATACGCGAAATGTGGTTGTATTGATTTTGCATATCGAATCTTTGAGCAGATGCCAACAAGAAACTTGATCACGTGGAACTCGATGATTCGTGGGTTCGCTCAGAACGGAAGAAGTGGGATTTCTATTGAGATATTTGAAGAGATGATTAAGGAAGGGATTAAGCCTGATTATATCAGTTTTATTGGTGTTCTTTTTGCTTGTAGTCATACAGGTTTGGTCGATCAAGGGCGACATTACTTCGTTCGAATGACCGAGGAATATGGAATCAAACCAGGAATCGAGCATTATAATTGTATGGTTGATCTTCTCGGCCGAGCTGGGCTGCTAGAAGAAGCTGAAAATTTGATCGAAAACGCAGACTTCAGAAACGATTCGTCTCTTTGGCAGGTTCTTCTAGGGGCTTGTACTACTTCTACAAACTCTGGTACTGCAGAACGCATAGCCAAGAAAATGATGGAGCTTGAACCTCAACACCATTTAAGCTATGTTCTCCTGGCTAACGTGTACAGAGCAGTAGGCCGATGGGACGACGCGTTGACGATCAGGAAGTTGATGAAAAGCCGACAGGTGAAGAAGGTGCCTGGTCAGAGCTGGATGTAGGGGGCAAAGCATCAAGGTTATGAGCCTCATATCAAACCCGAATTCAAGGCTGATGATGGCTTGGCATTGATTATCAAAATATGGGGTCAGCTATTGTTTTCATGCTCGTCCCAAGATGAAGAACAGCCATGAGAAGCCAGCCACTAAACATATCGAGAACGTACCGACCTGGTCGAAAAGGTTATTAG
ATGCTATTAACCGGGATTATCGAAAGTTCGTGAATTTTCCATGGAGACTATCAGTTGCAATTTTCAGTTAA

mRNA sequence

ATGAGACTCTTCATCAAACGCCATTGCCGTAGTTTCACCTCTCAAAATCTAAAGAATTCGACACACCCACCAACAAAAGAGTCTCAAATTCTCCAATTTTGTGGCTCAGACCTACTCCACGATGCTCTACACACCCTAAACTCCCTCGATTCCTTCGATTCCACCACCAACAAATCCATTCTCTACGCTTCTCTCTTGCAAACCTGCACCAAGGTCGCTTCCTTCACCCATGGCCGTCAAATTCATGCCCATGTTCTTAAATCTGGCCTTGAAGCTGACCGGTTTGTTGGGAATAGCTTACTTTCTCTTTACTTTAAATTGGGTTCAGATTTCCGGCTGACTCGAAGAGTATTCGATGGTCTTTTTGTGAAGGATGTGGTGTCCTGGACGTCCATGATTACGAGCTATGTTCGAGAAGGTAAACCTGGTAATGCGATCGAATTGTTTTGGGATATGTTGGATTTAGGGATTGAGCCTAATGGGTTTACTTTATCGGCTGTGATCAAGGCGTGTTCTGAGATTGGGAATTTGGTTCTTGGTCGATGCTTTCATGGGTTAGTTGTAAGGCATGGATTCAATTCAAATCATGTCATTGTGAGTTCTTTGATAGACATGTACGGGAGAAACTTTGCTTCGAGCGATGCGCGCCAACTGTTCGATGAAATGCCTGAACCAGATGCAATATGTTGGACCTCTGTTATTTCAGCTCTTACAAGGAATGATTTGTATGAGGATGCATTGGGTTTCTTTTATTTGATGCTAAGAACTTATAGTTTGTCTCCTGATGGTTTTACATTTGGTTCTGTATTGACTGCCTGTGCTAATTTGGGGAGGTTGAGGCAAGGTGAAGAAGTTCATGCTAAGGTGATTGCGCATGGACTTGGTGGGAATGTGGTGGTTGAGAGTAGTCTGGTGGATATGTATGGGAAATGTGGAGCGGTTGAGAAGTCCCAACGCGTGTTCGATAGGATGTCGAAGAGGAACTCGGTTTCGTGGTCTGCATTGCTTGGAGTATATTGCCAAAATGGAGACTTTGAAAAGGTTATAAATATCTTTAGGGGAATGGAGAAGATCGACCTTTACAGTTTCGGAACGGTTATTCGTGCGTGTGCTGGCTATTGTTTTCATGCTCGTCCCAAGATGAAGAACAGCCATGAGAAGCCAGCCACTAAACATATCGAGAACGTACCGACCTGGTCGAAAAGGTTATTAGATGCTATTAACCGGGATTATCGAAAGTTCGTGAATTTTCCATGGAGACTATCAGTTGCAATTTTCAGTTAA

Coding sequence (CDS)

ATGAGACTCTTCATCAAACGCCATTGCCGTAGTTTCACCTCTCAAAATCTAAAGAATTCGACACACCCACCAACAAAAGAGTCTCAAATTCTCCAATTTTGTGGCTCAGACCTACTCCACGATGCTCTACACACCCTAAACTCCCTCGATTCCTTCGATTCCACCACCAACAAATCCATTCTCTACGCTTCTCTCTTGCAAACCTGCACCAAGGTCGCTTCCTTCACCCATGGCCGTCAAATTCATGCCCATGTTCTTAAATCTGGCCTTGAAGCTGACCGGTTTGTTGGGAATAGCTTACTTTCTCTTTACTTTAAATTGGGTTCAGATTTCCGGCTGACTCGAAGAGTATTCGATGGTCTTTTTGTGAAGGATGTGGTGTCCTGGACGTCCATGATTACGAGCTATGTTCGAGAAGGTAAACCTGGTAATGCGATCGAATTGTTTTGGGATATGTTGGATTTAGGGATTGAGCCTAATGGGTTTACTTTATCGGCTGTGATCAAGGCGTGTTCTGAGATTGGGAATTTGGTTCTTGGTCGATGCTTTCATGGGTTAGTTGTAAGGCATGGATTCAATTCAAATCATGTCATTGTGAGTTCTTTGATAGACATGTACGGGAGAAACTTTGCTTCGAGCGATGCGCGCCAACTGTTCGATGAAATGCCTGAACCAGATGCAATATGTTGGACCTCTGTTATTTCAGCTCTTACAAGGAATGATTTGTATGAGGATGCATTGGGTTTCTTTTATTTGATGCTAAGAACTTATAGTTTGTCTCCTGATGGTTTTACATTTGGTTCTGTATTGACTGCCTGTGCTAATTTGGGGAGGTTGAGGCAAGGTGAAGAAGTTCATGCTAAGGTGATTGCGCATGGACTTGGTGGGAATGTGGTGGTTGAGAGTAGTCTGGTGGATATGTATGGGAAATGTGGAGCGGTTGAGAAGTCCCAACGCGTGTTCGATAGGATGTCGAAGAGGAACTCGGTTTCGTGGTCTGCATTGCTTGGAGTATATTGCCAAAATGGAGACTTTGAAAAGGTTATAAATATCTTTAGGGGAATGGAGAAGATCGACCTTTACAGTTTCGGAACGGTTATTCGTGCGTGTGCTGGCTATTGTTTTCATGCTCGTCCCAAGATGAAGAACAGCCATGAGAAGCCAGCCACTAAACATATCGAGAACGTACCGACCTGGTCGAAAAGGTTATTAGATGCTATTAACCGGGATTATCGAAAGTTCGTGAATTTTCCATGGAGACTATCAGTTGCAATTTTCAGTTAA

Protein sequence

MRLFIKRHCRSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSILYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRNDLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVVESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDLYSFGTVIRACAGYCFHARPKMKNSHEKPATKHIENVPTWSKRLLDAINRDYRKFVNFPWRLSVAIFS
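
The relationship between the coding sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below assumes the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first ten and last four codons of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a nucleotide string codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

first_codons = "ATGAGACTCTTCATCAAACGCCATTGCCGT"  # start of the CDS
last_codons = "ATTTTCAGTTAA"                     # end of the CDS, including the stop codon
print(translate(first_codons))  # MRLFIKRHCR
print(translate(last_codons))   # IFS*
```

The two fragments agree with the start (`MRLFIKRHCR`) and end (`IFS`) of the protein sequence, with `*` marking the stop codon.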
BLAST of Cp4.1LG04g03350 vs. Swiss-Prot
Match: PPR8_ARATH (Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana GN=PCMP-E4 PE=2 SV=1)

HSP 1 Score: 447.2 bits (1149), Expect = 2.0e-124
Identity = 222/372 (59.68%), Positives = 276/372 (74.19%), Query Frame = 1

Query: 3   LFIKRHCRSFTSQNLKNS--THPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSI 62
           + +KRH     S  L  S  +  PTK+S+IL+ C    L +A+  LNS  S +       
Sbjct: 4   IILKRHFSQHASLCLTPSISSSAPTKQSRILELCKLGQLTEAIRILNSTHSSEIPATPK- 63

Query: 63  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 122
           LYASLLQTC KV SF HG Q HAHV+KSGLE DR VGNSLLSLYFKLG   R TRRVFDG
Sbjct: 64  LYASLLQTCNKVFSFIHGIQFHAHVVKSGLETDRNVGNSLLSLYFKLGPGMRETRRVFDG 123

Query: 123 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 182
            FVKD +SWTSM++ YV   +   A+E+F +M+  G++ N FTLS+ +KACSE+G + LG
Sbjct: 124 RFVKDAISWTSMMSGYVTGKEHVKALEVFVEMVSFGLDANEFTLSSAVKACSELGEVRLG 183

Query: 183 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 242
           RCFHG+V+ HGF  NH I S+L  +YG N    DAR++FDEMPEPD ICWT+V+SA ++N
Sbjct: 184 RCFHGVVITHGFEWNHFISSTLAYLYGVNREPVDARRVFDEMPEPDVICWTAVLSAFSKN 243

Query: 243 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 302
           DLYE+ALG FY M R   L PDG TFG+VLTAC NL RL+QG+E+H K+I +G+G NVVV
Sbjct: 244 DLYEEALGLFYAMHRGKGLVPDGSTFGTVLTACGNLRRLKQGKEIHGKLITNGIGSNVVV 303

Query: 303 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL 362
           ESSL+DMYGKCG+V ++++VF+ MSK+NSVSWSALLG YCQNG+ EK I IFR ME+ DL
Sbjct: 304 ESSLLDMYGKCGSVREARQVFNGMSKKNSVSWSALLGGYCQNGEHEKAIEIFREMEEKDL 363

Query: 363 YSFGTVIRACAG 373
           Y FGTV++ACAG
Sbjct: 364 YCFGTVLKACAG 374
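
The percentages on an HSP statistics line are simply the match counts divided by the alignment length. A minimal sketch that recomputes them for the hit above follows; the function name `parse_hsp_stats` is hypothetical, and the pattern matches any `label = n/m (p%)` field, so it does not depend on how the labels are spelled.

```python
import re

def parse_hsp_stats(line):
    """Extract (matches, alignment length, reported percent) for each 'label = n/m (p%)' field."""
    stats = {}
    for label, num, den, pct in re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line):
        stats[label] = (int(num), int(den), float(pct))
    return stats

line = "Identity = 222/372 (59.68%), Positives = 276/372 (74.19%), Query Frame = 1"
stats = parse_hsp_stats(line)
for label, (num, den, reported) in stats.items():
    recomputed = round(100 * num / den, 2)
    print(label, recomputed, reported)
```

Both recomputed values match the reported ones to two decimal places.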

BLAST of Cp4.1LG04g03350 vs. Swiss-Prot
Match: PP252_ARATH (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 205.7 bits (522), Expect = 1.0e-51
Identity = 135/409 (33.01%), Positives = 214/409 (52.32%), Query Frame = 1

Query: 27  ESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSILYASLLQTCTKVASFTHGRQIHAHVL 86
           E + L+F  +DLL     + N L+      ++   Y +LL+ CT       GR +HAH+L
Sbjct: 31  EDESLKFPSNDLLLRT--SSNDLEGSYIPADRRF-YNTLLKKCTVFKLLIQGRIVHAHIL 90

Query: 87  KSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFVKDVVSWTSMITSYVREGKPGNAI 146
           +S    D  +GN+LL++Y K GS     R+VF+ +  +D V+WT++I+ Y +  +P +A+
Sbjct: 91  QSIFRHDIVMGNTLLNMYAKCGS-LEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDAL 150

Query: 147 ELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCFHGLVVRHGFNSNHVIVSSLIDMY 206
             F  ML  G  PN FTLS+VIKA +       G   HG  V+ GF+SN  + S+L+D+Y
Sbjct: 151 LFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLY 210

Query: 207 GRNFASSDARQLFDEMPEPDAICWTSVISALTRNDLYEDALGFFYLMLRTYSLSPDGFTF 266
            R     DA+ +FD +   + + W ++I+   R    E AL  F  MLR     P  F++
Sbjct: 211 TRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRD-GFRPSHFSY 270

Query: 267 GSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVVESSLVDMYGKCGAVEKSQRVFDRMSK 326
            S+  AC++ G L QG+ VHA +I  G        ++L+DMY K G++  ++++FDR++K
Sbjct: 271 ASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAK 330

Query: 327 RNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL----YSFGTVIRACA-----GYCFHA 386
           R+ VSW++LL  Y Q+G  ++ +  F  M ++ +     SF +V+ AC+        +H 
Sbjct: 331 RDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHY 390

Query: 387 RPKMKNSHEKPATKHIENVPTWSKRLLDAINRDYRKFVNFPWRLSVAIF 427
              MK     P   H   V     R  D +NR  R     P   + AI+
Sbjct: 391 YELMKKDGIVPEAWHYVTVVDLLGRAGD-LNRALRFIEEMPIEPTAAIW 433

BLAST of Cp4.1LG04g03350 vs. Swiss-Prot
Match: PP146_ARATH (Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E47 PE=3 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.7e-51
Identity = 113/349 (32.38%), Positives = 192/349 (55.01%), Query Frame = 1

Query: 28  SQILQFCGSDLLHDALHTLNSLDSFDSTTNKSILYASLLQTCTKVASFTHGRQIHAHVLK 87
           S I  +  +DL  + L   N +   ++       Y +L+  CTK+++   G+  H  ++K
Sbjct: 212 SMIAGYVKNDLCEEGLVLFNRMRE-NNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVK 271

Query: 88  SGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFVKDVVSWTSMITSYVREGKPGNAIE 147
           SG+E    +  SLL +Y K G D    RRVF+     D+V WT+MI  Y   G    A+ 
Sbjct: 272 SGIELSSCLVTSLLDMYVKCG-DISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALS 331

Query: 148 LFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCFHGLVVRHGFNSNHVIVSSLIDMYG 207
           LF  M  + I+PN  T+++V+  C  I NL LGR  HGL ++ G    +V  ++L+ MY 
Sbjct: 332 LFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNV-ANALVHMYA 391

Query: 208 RNFASSDARQLFDEMPEPDAICWTSVISALTRNDLYEDALGFFYLMLRTYSLSPDGFTFG 267
           + + + DA+ +F+   E D + W S+IS  ++N    +AL  F+ M  + S++P+G T  
Sbjct: 392 KCYQNRDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRM-NSESVTPNGVTVA 451

Query: 268 SVLTACANLGRLRQGEEVHAKVIAHGL--GGNVVVESSLVDMYGKCGAVEKSQRVFDRMS 327
           S+ +ACA+LG L  G  +HA  +  G     +V V ++L+D Y KCG  + ++ +FD + 
Sbjct: 452 SLFSACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIE 511

Query: 328 KRNSVSWSALLGVYCQNGDFEKVINIFRGM----EKIDLYSFGTVIRAC 371
           ++N+++WSA++G Y + GD    + +F  M    +K +  +F +++ AC
Sbjct: 512 EKNTITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSAC 556

BLAST of Cp4.1LG04g03350 vs. Swiss-Prot
Match: PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 202.6 bits (514), Expect = 8.6e-51
Identity = 116/349 (33.24%), Positives = 194/349 (55.59%), Query Frame = 1

Query: 28  SQILQFCGSDLLHDALHTLNSLDSFDSTTNKSILYASLLQTCTKVASFTHGRQIHAHVLK 87
           + I  +C S    +AL   N L + DS T       SLL  CT+   F  G  IH++ +K
Sbjct: 221 AMISGYCQSGNAKEALTLSNGLRAMDSVT-----VVSLLSACTEAGDFNRGVTIHSYSIK 280

Query: 88  SGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFVKDVVSWTSMITSYVREGKPGNAIE 147
            GLE++ FV N L+ LY + G   R  ++VFD ++V+D++SW S+I +Y    +P  AI 
Sbjct: 281 HGLESELFVSNKLIDLYAEFGR-LRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAIS 340

Query: 148 LFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCFHGLVVRHG-FNSNHVIVSSLIDMY 207
           LF +M    I+P+  TL ++    S++G++   R   G  +R G F  +  I ++++ MY
Sbjct: 341 LFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMY 400

Query: 208 GRNFASSDARQLFDEMPEPDAICWTSVISALTRNDLYEDALGFFYLMLRTYSLSPDGFTF 267
            +      AR +F+ +P  D I W ++IS   +N    +A+  + +M     ++ +  T+
Sbjct: 401 AKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTW 460

Query: 268 GSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVVESSLVDMYGKCGAVEKSQRVFDRMSK 327
            SVL AC+  G LRQG ++H +++ +GL  +V V +SL DMYGKCG +E +  +F ++ +
Sbjct: 461 VSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR 520

Query: 328 RNSVSWSALLGVYCQNGDFEKVINIFRGM----EKIDLYSFGTVIRACA 372
            NSV W+ L+  +  +G  EK + +F+ M     K D  +F T++ AC+
Sbjct: 521 VNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACS 563

BLAST of Cp4.1LG04g03350 vs. Swiss-Prot
Match: PP319_ARATH (Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN=PCMP-A2 PE=2 SV=1)

HSP 1 Score: 199.5 bits (506), Expect = 7.3e-50
Identity = 106/311 (34.08%), Positives = 176/311 (56.59%), Query Frame = 1

Query: 64  SLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFV 123
           S+L+ C++  +   GRQ+H+ V+K  ++ D FVG SL+ +Y K G +    R+VFDG+  
Sbjct: 289 SILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSLMDMYAKCG-EISDCRKVFDGMSN 348

Query: 124 KDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCF 183
           ++ V+WTS+I ++ REG    AI LF  M    +  N  T+ ++++AC  +G L+LG+  
Sbjct: 349 RNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANNLTVVSILRACGSVGALLLGKEL 408

Query: 184 HGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRNDLY 243
           H  ++++    N  I S+L+ +Y +   S DA  +  ++P  D + WT++IS  +     
Sbjct: 409 HAQIIKNSIEKNVYIGSTLVWLYCKCGESRDAFNVLQQLPSRDVVSWTAMISGCSSLGHE 468

Query: 244 EDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVVESS 303
            +AL F   M++   + P+ FT+ S L ACAN   L  G  +H+    +    NV V S+
Sbjct: 469 SEALDFLKEMIQE-GVEPNPFTYSSALKACANSESLLIGRSIHSIAKKNHALSNVFVGSA 528

Query: 304 LVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGME----KID 363
           L+ MY KCG V ++ RVFD M ++N VSW A++  Y +NG   + + +   ME    ++D
Sbjct: 529 LIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARNGFCREALKLMYRMEAEGFEVD 588

Query: 364 LYSFGTVIRAC 371
            Y F T++  C
Sbjct: 589 DYIFATILSTC 597

BLAST of Cp4.1LG04g03350 vs. TrEMBL
Match: A0A0A0K5P0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G047230 PE=4 SV=1)

HSP 1 Score: 609.0 bits (1569), Expect = 4.5e-171
Identity = 299/373 (80.16%), Positives = 324/373 (86.86%), Query Frame = 1

Query: 1   MRLFIKRHCRS-FTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKS 60
           M LF+KRHC S FTSQN K STHP  K SQILQFC S LL+DALH LNS+D +DS  NK 
Sbjct: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60

Query: 61  ILYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFD 120
           +LYASLLQTC KV SFT GRQ HAHV+KSGLE DRFVGNSLLSLYFKLGSD  LTRRVFD
Sbjct: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120

Query: 121 GLFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVL 180
           GLFVKDVVSW SMIT YVREGK G AIELFWDMLD GIEPNGFTLSAVIKACSEIGNLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIGNLVL 180

Query: 181 GRCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTR 240
           G+CFHG+VVR GF+SN VI+SSLIDMYGRN  SSDARQLFDE+ EPD +CWT+VISA TR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNSVSSDARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVV 300
           NDLYE+ALGFFYL  R + L PD +TFGSVLTAC NLGRLRQGEE+HAKVIA+G  GNVV
Sbjct: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300

Query: 301 VESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKID 360
            ESSLVDMYGKCGAVEKSQR+FDRMS RNSVSWSALL VYC NGD+EK +N+FR M+++D
Sbjct: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360

Query: 361 LYSFGTVIRACAG 373
           LYSFGTVIRACAG
Sbjct: 361 LYSFGTVIRACAG 373

BLAST of Cp4.1LG04g03350 vs. TrEMBL
Match: M5XE21_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa019108mg PE=4 SV=1)

HSP 1 Score: 556.6 bits (1433), Expect = 2.6e-155
Identity = 274/372 (73.66%), Positives = 313/372 (84.14%), Query Frame = 1

Query: 1   MRLFIKRHCRSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSI 60
           M+L +KRHC S TS NL+++  P  K+SQIL+ C   LL DA+  LNS+DS + T  K I
Sbjct: 1   MKLVLKRHCSSLTSLNLQSTKIPSPKQSQILRLCKLGLLSDAIRVLNSIDSGEITL-KPI 60

Query: 61  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 120
           LYASLLQTCTK  SF HG QIHAHV+KSGLE DRFVGNSLLSLYFKL  +   TRRVFDG
Sbjct: 61  LYASLLQTCTKAVSFNHGLQIHAHVVKSGLETDRFVGNSLLSLYFKLVPNMSETRRVFDG 120

Query: 121 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 180
           LFVKDV+SWTSMIT YVR GKPGN+IE+F+DML  GIEPN FTLSAV+KACSEIG+L LG
Sbjct: 121 LFVKDVISWTSMITGYVRAGKPGNSIEVFYDMLKFGIEPNAFTLSAVVKACSEIGDLRLG 180

Query: 181 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 240
            CFHG+VVR GF SNHVI+S+LI+MYGRN+ S DAR LFDE+ EP AICWTS+ISALTR+
Sbjct: 181 LCFHGVVVRRGFVSNHVIISALINMYGRNYRSEDARLLFDELTEPGAICWTSIISALTRS 240

Query: 241 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 300
           DL+E+ALGFFYLM R + LSPDGFTFG+VL AC NLGRLRQG E+HAKVI +GL GNVVV
Sbjct: 241 DLFEEALGFFYLMHRYHGLSPDGFTFGTVLAACGNLGRLRQGREMHAKVITYGLCGNVVV 300

Query: 301 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL 360
           ESSLVDMYGKCG+VE ++RVFDR+ K+NSVSWSALLGVYCQ GDFE VIN FR ME+ DL
Sbjct: 301 ESSLVDMYGKCGSVECARRVFDRIPKKNSVSWSALLGVYCQTGDFESVINHFREMEEADL 360

Query: 361 YSFGTVIRACAG 373
           YSFGTV+RACAG
Sbjct: 361 YSFGTVLRACAG 371

BLAST of Cp4.1LG04g03350 vs. TrEMBL
Match: A0A061E2T1_THECC (Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao GN=TCM_007980 PE=4 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 2.9e-146
Identity = 260/372 (69.89%), Positives = 301/372 (80.91%), Query Frame = 1

Query: 4   FIKRHCRSFTSQNLK-NSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTN--KSI 63
           F KRH  SF S N    S  P  K SQIL FC S  L  A+H LN+L     TT+  K +
Sbjct: 7   FFKRHRCSFASFNPTFPSKTPSDKHSQILHFCKSAQLFPAIHLLNTLHFPSETTSSKKPL 66

Query: 64  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 123
           LYASLLQTCT V SF+HG Q HAHV+KSGL+ DRFVGNSLL+LYFKLG DF  TRRVFDG
Sbjct: 67  LYASLLQTCTNVQSFSHGLQFHAHVIKSGLQTDRFVGNSLLALYFKLGPDFTETRRVFDG 126

Query: 124 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 183
           LFVKDV+SWTSM++ Y++ GKP ++++LFW+ML  G+EPNGFTLS VIKACSE+G L LG
Sbjct: 127 LFVKDVISWTSMVSGYIKAGKPESSLQLFWEMLGFGVEPNGFTLSTVIKACSELGKLRLG 186

Query: 184 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 243
            CFHG+V++ GF SN VI S+LID YGRN+   +A ++FDE+PEPDAICWTSVISALTRN
Sbjct: 187 WCFHGVVIKRGFVSNRVISSALIDFYGRNWQLKEACEIFDELPEPDAICWTSVISALTRN 246

Query: 244 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 303
           DLYE+AL FFYLM R + LSPDGFTFG+VLTAC NLGRLRQG++VHAKVI  GL GNVVV
Sbjct: 247 DLYEEALRFFYLMHRNHGLSPDGFTFGTVLTACGNLGRLRQGKQVHAKVITCGLCGNVVV 306

Query: 304 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL 363
           ESSL+DMYGKCG V++SQ VFDRMSK+NSVSWSALLGVYCQN D+E VI IFR M+K DL
Sbjct: 307 ESSLLDMYGKCGLVDESQCVFDRMSKKNSVSWSALLGVYCQNKDYESVIRIFREMDKTDL 366

Query: 364 YSFGTVIRACAG 373
           Y FGTV+RACAG
Sbjct: 367 YCFGTVLRACAG 378

BLAST of Cp4.1LG04g03350 vs. TrEMBL
Match: A0A067JXD0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25836 PE=4 SV=1)

HSP 1 Score: 524.6 bits (1350), Expect = 1.1e-145
Identity = 258/374 (68.98%), Positives = 302/374 (80.75%), Query Frame = 1

Query: 1   MRLF--IKRHCRSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNK 60
           M+LF  IKRHC S  S N   S +P TKE +I+Q+C S  L DALH LNSLDS     NK
Sbjct: 1   MKLFLSIKRHCSSLASLNHNTSQNPQTKEFRIIQYCKSGALFDALHILNSLDSA-KLCNK 60

Query: 61  SILYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVF 120
             LYASLLQTCTKV SF HG QIHAH++KSGLE DRF+GNSLL+LYFKLGS+F  TRR+F
Sbjct: 61  PFLYASLLQTCTKVVSFNHGLQIHAHLIKSGLETDRFIGNSLLALYFKLGSNFFETRRLF 120

Query: 121 DGLFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLV 180
           DGL+ +DV+SWTSM+T Y++ GKP NAIELF +MLD GI+PNGFTLSA IKA S++GNL 
Sbjct: 121 DGLYFRDVISWTSMVTGYIKVGKPKNAIELFLEMLDFGIDPNGFTLSAAIKASSDLGNLR 180

Query: 181 LGRCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALT 240
           LG+C HG+V+  GF+SN+VI S+LIDMYGRN+   DA +LFD++ EPDAI WTSVISA T
Sbjct: 181 LGKCIHGVVISQGFDSNYVISSALIDMYGRNYGLEDACRLFDDLLEPDAISWTSVISAFT 240

Query: 241 RNDLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNV 300
           RND+Y+ ALGFFYLM R   L+PD FTFG+VLTAC NL RL++G+EVHA+VI  G  GNV
Sbjct: 241 RNDMYDKALGFFYLMQRKLGLAPDEFTFGTVLTACGNLRRLKRGKEVHARVITSGFSGNV 300

Query: 301 VVESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKI 360
           VVESSLVDMYGKCG V +SQ VFDRMS +NSVSWSALLG YCQNGDFE VI IFR +E  
Sbjct: 301 VVESSLVDMYGKCGLVMESQHVFDRMSIKNSVSWSALLGGYCQNGDFESVIRIFREVESH 360

Query: 361 DLYSFGTVIRACAG 373
           DLYSFGTV+RAC+G
Sbjct: 361 DLYSFGTVLRACSG 373

BLAST of Cp4.1LG04g03350 vs. TrEMBL
Match: B9RW53_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1176200 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 1.3e-143
Identity = 258/373 (69.17%), Positives = 303/373 (81.23%), Query Frame = 1

Query: 1   MRLFIKRHCRSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSI 60
           M LF KRHC S  S NLKN      KE +I+ FC S  L  AL  LNS+DS +  +NK  
Sbjct: 1   MNLF-KRHCSSLPSFNLKN------KEIKIIGFCKSGALLHALDILNSIDSRE-ISNKPF 60

Query: 61  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 120
           +YASLLQTCTKV SF HG QIHAHV+KSGLE DRFVGNSLL+LYFKL +DF  TRRVFDG
Sbjct: 61  IYASLLQTCTKVVSFNHGLQIHAHVVKSGLETDRFVGNSLLALYFKLSTDFFETRRVFDG 120

Query: 121 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 180
           L+ +DV+SWTSMIT YV+  KP  A++LFW+MLD+G++PN FTLSAVIKAC+++G L+LG
Sbjct: 121 LYFRDVISWTSMITGYVKGEKPKKALDLFWEMLDVGVDPNAFTLSAVIKACTDLGTLMLG 180

Query: 181 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 240
           +CFH +++  GF+SNHVI S+LID+YGRN+   DAR+LFDE+ EPDAICWTSVISA TRN
Sbjct: 181 KCFHCVIMIRGFHSNHVIGSALIDLYGRNYELDDARRLFDELLEPDAICWTSVISAYTRN 240

Query: 241 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 300
           D+Y+ ALGFFYLM R   L+PDGFTFG+VLTAC NL RL+QG+EVHAK+I  G  GNVVV
Sbjct: 241 DMYDKALGFFYLMQRKLGLAPDGFTFGTVLTACGNLRRLKQGKEVHAKLITSGFSGNVVV 300

Query: 301 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGM-EKID 360
           ESSLVDMYGKCG V++SQRVFDRMS +NSVSWSALLG +CQNGDFE VI IFR M E  D
Sbjct: 301 ESSLVDMYGKCGLVDESQRVFDRMSVKNSVSWSALLGGFCQNGDFESVIRIFREMGEADD 360

Query: 361 LYSFGTVIRACAG 373
           LYSFGTV+RACAG
Sbjct: 361 LYSFGTVLRACAG 365

BLAST of Cp4.1LG04g03350 vs. TAIR10
Match: AT1G03540.1 (AT1G03540.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 447.2 bits (1149), Expect = 1.1e-125
Identity = 222/372 (59.68%), Positives = 276/372 (74.19%), Query Frame = 1

Query: 3   LFIKRHCRSFTSQNLKNS--THPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSI 62
           + +KRH     S  L  S  +  PTK+S+IL+ C    L +A+  LNS  S +       
Sbjct: 4   IILKRHFSQHASLCLTPSISSSAPTKQSRILELCKLGQLTEAIRILNSTHSSEIPATPK- 63

Query: 63  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 122
           LYASLLQTC KV SF HG Q HAHV+KSGLE DR VGNSLLSLYFKLG   R TRRVFDG
Sbjct: 64  LYASLLQTCNKVFSFIHGIQFHAHVVKSGLETDRNVGNSLLSLYFKLGPGMRETRRVFDG 123

Query: 123 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 182
            FVKD +SWTSM++ YV   +   A+E+F +M+  G++ N FTLS+ +KACSE+G + LG
Sbjct: 124 RFVKDAISWTSMMSGYVTGKEHVKALEVFVEMVSFGLDANEFTLSSAVKACSELGEVRLG 183

Query: 183 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 242
           RCFHG+V+ HGF  NH I S+L  +YG N    DAR++FDEMPEPD ICWT+V+SA ++N
Sbjct: 184 RCFHGVVITHGFEWNHFISSTLAYLYGVNREPVDARRVFDEMPEPDVICWTAVLSAFSKN 243

Query: 243 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 302
           DLYE+ALG FY M R   L PDG TFG+VLTAC NL RL+QG+E+H K+I +G+G NVVV
Sbjct: 244 DLYEEALGLFYAMHRGKGLVPDGSTFGTVLTACGNLRRLKQGKEIHGKLITNGIGSNVVV 303

Query: 303 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL 362
           ESSL+DMYGKCG+V ++++VF+ MSK+NSVSWSALLG YCQNG+ EK I IFR ME+ DL
Sbjct: 304 ESSLLDMYGKCGSVREARQVFNGMSKKNSVSWSALLGGYCQNGEHEKAIEIFREMEEKDL 363

Query: 363 YSFGTVIRACAG 373
           Y FGTV++ACAG
Sbjct: 364 YCFGTVLKACAG 374

BLAST of Cp4.1LG04g03350 vs. TAIR10
Match: AT3G24000.1 (AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 205.7 bits (522), Expect = 5.7e-53
Identity = 135/409 (33.01%), Positives = 214/409 (52.32%), Query Frame = 1

Query: 27  ESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSILYASLLQTCTKVASFTHGRQIHAHVL 86
           E + L+F  +DLL     + N L+      ++   Y +LL+ CT       GR +HAH+L
Sbjct: 31  EDESLKFPSNDLLLRT--SSNDLEGSYIPADRRF-YNTLLKKCTVFKLLIQGRIVHAHIL 90

Query: 87  KSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFVKDVVSWTSMITSYVREGKPGNAI 146
           +S    D  +GN+LL++Y K GS     R+VF+ +  +D V+WT++I+ Y +  +P +A+
Sbjct: 91  QSIFRHDIVMGNTLLNMYAKCGS-LEEARKVFEKMPQRDFVTWTTLISGYSQHDRPCDAL 150

Query: 147 ELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCFHGLVVRHGFNSNHVIVSSLIDMY 206
             F  ML  G  PN FTLS+VIKA +       G   HG  V+ GF+SN  + S+L+D+Y
Sbjct: 151 LFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGHQLHGFCVKCGFDSNVHVGSALLDLY 210

Query: 207 GRNFASSDARQLFDEMPEPDAICWTSVISALTRNDLYEDALGFFYLMLRTYSLSPDGFTF 266
            R     DA+ +FD +   + + W ++I+   R    E AL  F  MLR     P  F++
Sbjct: 211 TRYGLMDDAQLVFDALESRNDVSWNALIAGHARRSGTEKALELFQGMLRD-GFRPSHFSY 270

Query: 267 GSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVVESSLVDMYGKCGAVEKSQRVFDRMSK 326
            S+  AC++ G L QG+ VHA +I  G        ++L+DMY K G++  ++++FDR++K
Sbjct: 271 ASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGNTLLDMYAKSGSIHDARKIFDRLAK 330

Query: 327 RNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL----YSFGTVIRACA-----GYCFHA 386
           R+ VSW++LL  Y Q+G  ++ +  F  M ++ +     SF +V+ AC+        +H 
Sbjct: 331 RDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRPNEISFLSVLTACSHSGLLDEGWHY 390

Query: 387 RPKMKNSHEKPATKHIENVPTWSKRLLDAINRDYRKFVNFPWRLSVAIF 427
              MK     P   H   V     R  D +NR  R     P   + AI+
Sbjct: 391 YELMKKDGIVPEAWHYVTVVDLLGRAGD-LNRALRFIEEMPIEPTAAIW 433

BLAST of Cp4.1LG04g03350 vs. TAIR10
Match: AT2G03380.1 (AT2G03380.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 204.9 bits (520), Expect = 9.8e-53
Identity = 113/349 (32.38%), Positives = 192/349 (55.01%), Query Frame = 1

Query: 28  SQILQFCGSDLLHDALHTLNSLDSFDSTTNKSILYASLLQTCTKVASFTHGRQIHAHVLK 87
           S I  +  +DL  + L   N +   ++       Y +L+  CTK+++   G+  H  ++K
Sbjct: 212 SMIAGYVKNDLCEEGLVLFNRMRE-NNVLGNEYTYGTLIMACTKLSALHQGKWFHGCLVK 271

Query: 88  SGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFVKDVVSWTSMITSYVREGKPGNAIE 147
           SG+E    +  SLL +Y K G D    RRVF+     D+V WT+MI  Y   G    A+ 
Sbjct: 272 SGIELSSCLVTSLLDMYVKCG-DISNARRVFNEHSHVDLVMWTAMIVGYTHNGSVNEALS 331

Query: 148 LFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCFHGLVVRHGFNSNHVIVSSLIDMYG 207
           LF  M  + I+PN  T+++V+  C  I NL LGR  HGL ++ G    +V  ++L+ MY 
Sbjct: 332 LFQKMKGVEIKPNCVTIASVLSGCGLIENLELGRSVHGLSIKVGIWDTNV-ANALVHMYA 391

Query: 208 RNFASSDARQLFDEMPEPDAICWTSVISALTRNDLYEDALGFFYLMLRTYSLSPDGFTFG 267
           + + + DA+ +F+   E D + W S+IS  ++N    +AL  F+ M  + S++P+G T  
Sbjct: 392 KCYQNRDAKYVFEMESEKDIVAWNSIISGFSQNGSIHEALFLFHRM-NSESVTPNGVTVA 451

Query: 268 SVLTACANLGRLRQGEEVHAKVIAHGL--GGNVVVESSLVDMYGKCGAVEKSQRVFDRMS 327
           S+ +ACA+LG L  G  +HA  +  G     +V V ++L+D Y KCG  + ++ +FD + 
Sbjct: 452 SLFSACASLGSLAVGSSLHAYSVKLGFLASSSVHVGTALLDFYAKCGDPQSARLIFDTIE 511

Query: 328 KRNSVSWSALLGVYCQNGDFEKVINIFRGM----EKIDLYSFGTVIRAC 371
           ++N+++WSA++G Y + GD    + +F  M    +K +  +F +++ AC
Sbjct: 512 EKNTITWSAMIGGYGKQGDTIGSLELFEEMLKKQQKPNESTFTSILSAC 556

BLAST of Cp4.1LG04g03350 vs. TAIR10
Match: AT4G33990.1 (AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 202.6 bits (514), Expect = 4.9e-52
Identity = 116/349 (33.24%), Positives = 194/349 (55.59%), Query Frame = 1

Query: 28  SQILQFCGSDLLHDALHTLNSLDSFDSTTNKSILYASLLQTCTKVASFTHGRQIHAHVLK 87
           + I  +C S    +AL   N L + DS T       SLL  CT+   F  G  IH++ +K
Sbjct: 221 AMISGYCQSGNAKEALTLSNGLRAMDSVT-----VVSLLSACTEAGDFNRGVTIHSYSIK 280

Query: 88  SGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFVKDVVSWTSMITSYVREGKPGNAIE 147
            GLE++ FV N L+ LY + G   R  ++VFD ++V+D++SW S+I +Y    +P  AI 
Sbjct: 281 HGLESELFVSNKLIDLYAEFGR-LRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAIS 340

Query: 148 LFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCFHGLVVRHG-FNSNHVIVSSLIDMY 207
           LF +M    I+P+  TL ++    S++G++   R   G  +R G F  +  I ++++ MY
Sbjct: 341 LFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMY 400

Query: 208 GRNFASSDARQLFDEMPEPDAICWTSVISALTRNDLYEDALGFFYLMLRTYSLSPDGFTF 267
            +      AR +F+ +P  D I W ++IS   +N    +A+  + +M     ++ +  T+
Sbjct: 401 AKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTW 460

Query: 268 GSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVVESSLVDMYGKCGAVEKSQRVFDRMSK 327
            SVL AC+  G LRQG ++H +++ +GL  +V V +SL DMYGKCG +E +  +F ++ +
Sbjct: 461 VSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR 520

Query: 328 RNSVSWSALLGVYCQNGDFEKVINIFRGM----EKIDLYSFGTVIRACA 372
            NSV W+ L+  +  +G  EK + +F+ M     K D  +F T++ AC+
Sbjct: 521 VNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACS 563

BLAST of Cp4.1LG04g03350 vs. TAIR10
Match: AT4G18520.1 (AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 199.5 bits (506), Expect = 4.1e-51
Identity = 106/311 (34.08%), Positives = 176/311 (56.59%), Query Frame = 1

Query: 64  SLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDGLFV 123
           S+L+ C++  +   GRQ+H+ V+K  ++ D FVG SL+ +Y K G +    R+VFDG+  
Sbjct: 289 SILKACSEEKALRFGRQVHSLVVKRMIKTDVFVGTSLMDMYAKCG-EISDCRKVFDGMSN 348

Query: 124 KDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLGRCF 183
           ++ V+WTS+I ++ REG    AI LF  M    +  N  T+ ++++AC  +G L+LG+  
Sbjct: 349 RNTVTWTSIIAAHAREGFGEEAISLFRIMKRRHLIANNLTVVSILRACGSVGALLLGKEL 408

Query: 184 HGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRNDLY 243
           H  ++++    N  I S+L+ +Y +   S DA  +  ++P  D + WT++IS  +     
Sbjct: 409 HAQIIKNSIEKNVYIGSTLVWLYCKCGESRDAFNVLQQLPSRDVVSWTAMISGCSSLGHE 468

Query: 244 EDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVVESS 303
            +AL F   M++   + P+ FT+ S L ACAN   L  G  +H+    +    NV V S+
Sbjct: 469 SEALDFLKEMIQE-GVEPNPFTYSSALKACANSESLLIGRSIHSIAKKNHALSNVFVGSA 528

Query: 304 LVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGME----KID 363
           L+ MY KCG V ++ RVFD M ++N VSW A++  Y +NG   + + +   ME    ++D
Sbjct: 529 LIHMYAKCGFVSEAFRVFDSMPEKNLVSWKAMIMGYARNGFCREALKLMYRMEAEGFEVD 588

Query: 364 LYSFGTVIRAC 371
            Y F T++  C
Sbjct: 589 DYIFATILSTC 597

BLAST of Cp4.1LG04g03350 vs. NCBI nr
Match: gi|449438472|ref|XP_004137012.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Cucumis sativus])

HSP 1 Score: 609.0 bits (1569), Expect = 6.4e-171
Identity = 299/373 (80.16%), Positives = 324/373 (86.86%), Query Frame = 1

Query: 1   MRLFIKRHCRS-FTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKS 60
           M LF+KRHC S FTSQN K STHP  K SQILQFC S LL+DALH LNS+D +DS  NK 
Sbjct: 1   MMLFLKRHCSSSFTSQNFKYSTHPSIKLSQILQFCKSGLLNDALHLLNSIDLYDSRINKP 60

Query: 61  ILYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFD 120
           +LYASLLQTC KV SFT GRQ HAHV+KSGLE DRFVGNSLLSLYFKLGSD  LTRRVFD
Sbjct: 61  LLYASLLQTCIKVDSFTRGRQFHAHVVKSGLETDRFVGNSLLSLYFKLGSDSLLTRRVFD 120

Query: 121 GLFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVL 180
           GLFVKDVVSW SMIT YVREGK G AIELFWDMLD GIEPNGFTLSAVIKACSEIGNLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGIAIELFWDMLDSGIEPNGFTLSAVIKACSEIGNLVL 180

Query: 181 GRCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTR 240
           G+CFHG+VVR GF+SN VI+SSLIDMYGRN  SSDARQLFDE+ EPD +CWT+VISA TR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNSVSSDARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVV 300
           NDLYE+ALGFFYL  R + L PD +TFGSVLTAC NLGRLRQGEE+HAKVIA+G  GNVV
Sbjct: 241 NDLYEEALGFFYLKHRAHRLCPDNYTFGSVLTACGNLGRLRQGEEIHAKVIAYGFSGNVV 300

Query: 301 VESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKID 360
            ESSLVDMYGKCGAVEKSQR+FDRMS RNSVSWSALL VYC NGD+EK +N+FR M+++D
Sbjct: 301 TESSLVDMYGKCGAVEKSQRLFDRMSNRNSVSWSALLAVYCHNGDYEKAVNLFREMKEVD 360

Query: 361 LYSFGTVIRACAG 373
           LYSFGTVIRACAG
Sbjct: 361 LYSFGTVIRACAG 373

BLAST of Cp4.1LG04g03350 vs. NCBI nr
Match: gi|659110665|ref|XP_008455346.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Cucumis melo])

HSP 1 Score: 605.5 bits (1560), Expect = 7.1e-170
Identity = 298/373 (79.89%), Positives = 325/373 (87.13%), Query Frame = 1

Query: 1   MRLFIKRHCRS-FTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKS 60
           M LF KRHC S FTSQN K STH   K  QILQFC S LL+DALH LNS+D +DS  NK 
Sbjct: 1   MMLFFKRHCTSSFTSQNFKYSTHLSNKLFQILQFCKSGLLNDALHILNSVDLYDSRINKP 60

Query: 61  ILYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFD 120
           +LYASLLQTCTKV SF+ G Q HAHV+KSGLE DRFVGNSLLSLYFKLGS+  LTRRVFD
Sbjct: 61  LLYASLLQTCTKVDSFSSGCQFHAHVVKSGLETDRFVGNSLLSLYFKLGSNCLLTRRVFD 120

Query: 121 GLFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVL 180
           GLFVKDVVSW SMIT YVREGK G AIELFWDMLD GIEPN FTLS VIKACSEIGNLVL
Sbjct: 121 GLFVKDVVSWASMITGYVREGKSGMAIELFWDMLDSGIEPNDFTLSTVIKACSEIGNLVL 180

Query: 181 GRCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTR 240
           G+CFHG+VVR GF+SN VI+SSLIDMYGRN+ SS+ARQLFDE+ EPD +CWT+VISA TR
Sbjct: 181 GKCFHGVVVRRGFDSNPVILSSLIDMYGRNYLSSEARQLFDELLEPDPVCWTTVISAFTR 240

Query: 241 NDLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVV 300
           ND YE+ALGFFYLM R Y LSPD +TFGSVLTAC NLGRL+QGEE+HAKVIA+G GGNVV
Sbjct: 241 NDFYEEALGFFYLMHRAYRLSPDNYTFGSVLTACGNLGRLKQGEEIHAKVIAYGFGGNVV 300

Query: 301 VESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKID 360
           VESSLVDMYGKCGAVEKSQRVFDRMS RNSVSWSALL VYCQNGDFEKV+++FR M+K+D
Sbjct: 301 VESSLVDMYGKCGAVEKSQRVFDRMSNRNSVSWSALLAVYCQNGDFEKVVSLFREMKKVD 360

Query: 361 LYSFGTVIRACAG 373
           LYSFGTV+RACAG
Sbjct: 361 LYSFGTVLRACAG 373

BLAST of Cp4.1LG04g03350 vs. NCBI nr
Match: gi|694332952|ref|XP_009357088.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Pyrus x bretschneideri])

HSP 1 Score: 562.0 bits (1447), Expect = 9.0e-157
Identity = 275/372 (73.92%), Positives = 313/372 (84.14%), Query Frame = 1

Query: 1   MRLFIKRHCRSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSI 60
           M+L  KRHC S TS NL+N+  P +K+S+ILQ C   LL DA+  LNS+DS + T  K I
Sbjct: 1   MKLVFKRHCSSLTSFNLQNTKTPSSKQSKILQLCKLGLLSDAIRVLNSIDSGEITL-KPI 60

Query: 61  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 120
           LYASLLQTCTKV SF HG Q+HAHV+KSGLE DRFVGNSLL+LYFKL  +   TRRVFDG
Sbjct: 61  LYASLLQTCTKVVSFNHGLQVHAHVVKSGLETDRFVGNSLLALYFKLVPNMSETRRVFDG 120

Query: 121 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 180
           LFVKDV+SWTSMIT YVR GKPGN+IE+FW+ML  GIEPN FTLSAV+KACSE+G+L LG
Sbjct: 121 LFVKDVISWTSMITGYVRAGKPGNSIEMFWEMLKFGIEPNAFTLSAVVKACSEVGDLRLG 180

Query: 181 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 240
            CFHG+V R GF+SN VIVS+LI MYGRN+ S DAR+LFDEM EP AICWTSVISA TR+
Sbjct: 181 LCFHGVVFRRGFDSNDVIVSALIYMYGRNYRSEDARRLFDEMSEPGAICWTSVISAFTRS 240

Query: 241 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 300
           DL+E+ALGFFYLM R + LSPDGFTFG+VLTAC NLGRLRQG EVHAKVI +GL GNVVV
Sbjct: 241 DLFEEALGFFYLMQRNHGLSPDGFTFGTVLTACGNLGRLRQGREVHAKVITYGLYGNVVV 300

Query: 301 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL 360
           ESSLVDMYGKCG+VE ++RVFDRMSK+NSVSWSALLGVYCQ GDFE V+  FR ME+ DL
Sbjct: 301 ESSLVDMYGKCGSVENARRVFDRMSKKNSVSWSALLGVYCQAGDFESVVKHFREMEETDL 360

Query: 361 YSFGTVIRACAG 373
           YSFGTV+RACAG
Sbjct: 361 YSFGTVLRACAG 371

BLAST of Cp4.1LG04g03350 vs. NCBI nr
Match: gi|595946318|ref|XP_007216136.1| (hypothetical protein PRUPE_ppa019108mg, partial [Prunus persica])

HSP 1 Score: 556.6 bits (1433), Expect = 3.8e-155
Identity = 274/372 (73.66%), Positives = 313/372 (84.14%), Query Frame = 1

Query: 1   MRLFIKRHCRSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSI 60
           M+L +KRHC S TS NL+++  P  K+SQIL+ C   LL DA+  LNS+DS + T  K I
Sbjct: 1   MKLVLKRHCSSLTSLNLQSTKIPSPKQSQILRLCKLGLLSDAIRVLNSIDSGEITL-KPI 60

Query: 61  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 120
           LYASLLQTCTK  SF HG QIHAHV+KSGLE DRFVGNSLLSLYFKL  +   TRRVFDG
Sbjct: 61  LYASLLQTCTKAVSFNHGLQIHAHVVKSGLETDRFVGNSLLSLYFKLVPNMSETRRVFDG 120

Query: 121 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 180
           LFVKDV+SWTSMIT YVR GKPGN+IE+F+DML  GIEPN FTLSAV+KACSEIG+L LG
Sbjct: 121 LFVKDVISWTSMITGYVRAGKPGNSIEVFYDMLKFGIEPNAFTLSAVVKACSEIGDLRLG 180

Query: 181 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 240
            CFHG+VVR GF SNHVI+S+LI+MYGRN+ S DAR LFDE+ EP AICWTS+ISALTR+
Sbjct: 181 LCFHGVVVRRGFVSNHVIISALINMYGRNYRSEDARLLFDELTEPGAICWTSIISALTRS 240

Query: 241 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 300
           DL+E+ALGFFYLM R + LSPDGFTFG+VL AC NLGRLRQG E+HAKVI +GL GNVVV
Sbjct: 241 DLFEEALGFFYLMHRYHGLSPDGFTFGTVLAACGNLGRLRQGREMHAKVITYGLCGNVVV 300

Query: 301 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL 360
           ESSLVDMYGKCG+VE ++RVFDR+ K+NSVSWSALLGVYCQ GDFE VIN FR ME+ DL
Sbjct: 301 ESSLVDMYGKCGSVECARRVFDRIPKKNSVSWSALLGVYCQTGDFESVINHFREMEEADL 360

Query: 361 YSFGTVIRACAG 373
           YSFGTV+RACAG
Sbjct: 361 YSFGTVLRACAG 371

BLAST of Cp4.1LG04g03350 vs. NCBI nr
Match: gi|658041113|ref|XP_008356170.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g03540-like [Malus domestica])

HSP 1 Score: 549.3 bits (1414), Expect = 6.0e-153
Identity = 273/372 (73.39%), Positives = 306/372 (82.26%), Query Frame = 1

Query: 1   MRLFIKRHCRSFTSQNLKNSTHPPTKESQILQFCGSDLLHDALHTLNSLDSFDSTTNKSI 60
           M+L  KRHC S TS NL+N+  P  K+S+ILQ C   LL DA+  LNS+DS + T  K I
Sbjct: 25  MKLVFKRHCSSLTSXNLQNTKTPSPKQSKILQLCKLGLLPDAIRVLNSIDSGEITL-KPI 84

Query: 61  LYASLLQTCTKVASFTHGRQIHAHVLKSGLEADRFVGNSLLSLYFKLGSDFRLTRRVFDG 120
           LYASLLQTCTK  SF HG Q+HAHV+KSGLE DRFVGNSLL+LYFKL  +   TRRVFDG
Sbjct: 85  LYASLLQTCTKAVSFNHGLQVHAHVVKSGLETDRFVGNSLLALYFKLVPNMSETRRVFDG 144

Query: 121 LFVKDVVSWTSMITSYVREGKPGNAIELFWDMLDLGIEPNGFTLSAVIKACSEIGNLVLG 180
           LFVKDV+SWTSMIT YVR GKPGN+IE FW+ML  GIEPN FTLSAV+KACSEIG+L LG
Sbjct: 145 LFVKDVISWTSMITGYVRAGKPGNSIETFWEMLKFGIEPNAFTLSAVVKACSEIGDLRLG 204

Query: 181 RCFHGLVVRHGFNSNHVIVSSLIDMYGRNFASSDARQLFDEMPEPDAICWTSVISALTRN 240
            CFHG+V R GF+SN VIVS+LI MYGRN  S DAR+LFDEM EP  ICWTSVISA TR+
Sbjct: 205 LCFHGVVFRRGFDSNDVIVSALIYMYGRNCRSEDARRLFDEMSEPGPICWTSVISAFTRS 264

Query: 241 DLYEDALGFFYLMLRTYSLSPDGFTFGSVLTACANLGRLRQGEEVHAKVIAHGLGGNVVV 300
           DL+E+ALGFFYLM R + LSPDGFTFG+VLTAC NLGRLRQG EVHAKVI +GL GNVVV
Sbjct: 265 DLFEEALGFFYLMQRNHGLSPDGFTFGTVLTACGNLGRLRQGREVHAKVITYGLYGNVVV 324

Query: 301 ESSLVDMYGKCGAVEKSQRVFDRMSKRNSVSWSALLGVYCQNGDFEKVINIFRGMEKIDL 360
           ESSLVDMYGKCG+VE + RVFDRMSK+NSVS SALLGVYCQ GDFE V+  FR ME+ DL
Sbjct: 325 ESSLVDMYGKCGSVENAXRVFDRMSKKNSVSRSALLGVYCQAGDFESVVKHFREMEEADL 384

Query: 361 YSFGTVIRACAG 373
           YSFGTV+RACAG
Sbjct: 385 YSFGTVLRACAG 395

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PPR8_ARATH	2.0e-124	59.68	Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana GN...
PP252_ARATH	1.0e-51	33.01	Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
PP146_ARATH	1.7e-51	32.38	Pentatricopeptide repeat-containing protein At2g03380, mitochondrial OS=Arabidop...
PP348_ARATH	8.6e-51	33.24	Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN...
PP319_ARATH	7.3e-50	34.08	Pentatricopeptide repeat-containing protein At4g18520 OS=Arabidopsis thaliana GN...
Match Name	E-value	Identity	Description
A0A0A0K5P0_CUCSA	4.5e-171	80.16	Uncharacterized protein OS=Cucumis sativus GN=Csa_7G047230 PE=4 SV=1
M5XE21_PRUPE	2.6e-155	73.66	Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa019108mg PE=4 S...
A0A061E2T1_THECC	2.9e-146	69.89	Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao GN=TC...
A0A067JXD0_JATCU	1.1e-145	68.98	Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25836 PE=4 SV=1
B9RW53_RICCO	1.3e-143	69.17	Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
Match Name	E-value	Identity	Description
AT1G03540.1	1.1e-125	59.68	Pentatricopeptide repeat (PPR-like) superfamily protein
AT3G24000.1	5.7e-53	33.01	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G03380.1	9.8e-53	32.38	Pentatricopeptide repeat (PPR) superfamily protein
AT4G33990.1	4.9e-52	33.24	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18520.1	4.1e-51	34.08	Pentatricopeptide repeat (PPR) superfamily protein
Match Name	E-value	Identity	Description
gi|449438472|ref|XP_004137012.1|	6.4e-171	80.16	PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Cucumis sativu...
gi|659110665|ref|XP_008455346.1|	7.1e-170	79.89	PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Cucumis melo]
gi|694332952|ref|XP_009357088.1|	9.0e-157	73.92	PREDICTED: pentatricopeptide repeat-containing protein At1g03540 [Pyrus x bretsc...
gi|595946318|ref|XP_007216136.1|	3.8e-155	73.66	hypothetical protein PRUPE_ppa019108mg, partial [Prunus persica]
gi|658041113|ref|XP_008356170.1|	6.0e-153	73.39	PREDICTED: pentatricopeptide repeat-containing protein At1g03540-like [Malus dom...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG04g03350.1	Cp4.1LG04g03350.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 229..255, score: 2.5E-4; coord: 198..223, score: 0.026; coord: 330..358, score: 4.2E-6; coord: 302..328, score: 0.
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 124..171, score: 3.0
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756	coord: 330..357, score: 3.0E-5; coord: 127..160, score: 5.8E-6; coord: 228..262, score: 1.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR	coord: 125..159, score: 12.441; coord: 297..327, score: 8.418; coord: 160..194, score: 6.38; coord: 93..124, score: 5.251; coord: 58..92, score: 6.588; coord: 226..261, score: 8.221; coord: 262..296, score: 8.057; coord: 328..362, score: 10.26; coord: 195..225, score: 7
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10		coord: 224..354, score: 3.0E-4; coord: 125..156, score: 3.
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 45..372, score: 2.3E
None	No IPR available	PANTHER	PTHR24015:SF328	SUBFAMILY NOT NAMED	coord: 45..372, score: 2.3E
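
The nine PROFILE (PS51375) hits are listed out of positional order. A minimal sketch, assuming only the coordinates printed above, that sorts them and checks whether neighbouring repeats leave gaps; the helper name `tile_report` is hypothetical and used only for illustration:

```python
# PS51375 hit coordinates (start, end) copied from the InterPro table above
ppr_hits = [(125, 159), (297, 327), (160, 194), (93, 124), (58, 92),
            (226, 261), (262, 296), (328, 362), (195, 225)]

def tile_report(hits):
    """Sort hits by start; return them with the gap before each later hit."""
    ordered = sorted(hits)
    # Gap = residues between one hit's end and the next hit's start
    # (0 means directly adjacent, negative would mean overlap).
    gaps = [s2 - e1 - 1 for (_, e1), (s2, _) in zip(ordered, ordered[1:])]
    return ordered, gaps

ordered, gaps = tile_report(ppr_hits)
lengths = [end - start + 1 for start, end in ordered]
span = (ordered[0][0], ordered[-1][1])

print(span)     # (58, 362)
print(gaps)     # [0, 0, 0, 0, 0, 0, 0, 0]
print(lengths)  # [35, 32, 35, 35, 31, 36, 35, 31, 35]
```

On these coordinates the hits sit end to end from residue 58 to 362, each 31 to 36 residues long.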

The following gene(s) are paralogous to this gene:

None