Cp4.1LG02g11170.1 (mRNA) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG02g11170.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG02: 10160778 .. 10163424 (-)
Sequence length2439
RNA-Seq ExpressionCp4.1LG02g11170.1
SyntenyCp4.1LG02g11170.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAACCTTGTAAACCCTAAGCCCAAGGTTTCATCATCGACAGTTCTTCTGAACTCTCCTTCGAGTTCCTCCATGTCCATTCGAACCTCTGCCTTCGCCACCGTCACCCTTCTCCGCTCTCTCACTCTTTCCTTCCCTCTATGCCACCACCACTTCCGTTGCCGGAACTACGTCATCCGTTCTCTCTCTATCCCAACATATTCAGCGAAAGGACGACGACAACTTACGAGGATTCCTGCCTTTGCTTCCAGTTCTTCCGTTGAAGCGTTGGTGCATGACCGGGATTCCCCGGCCGAATCTGAAGAGCCTTTGTGTTCTCCATACAGTACTGGCGCTGAGGGGTTTGCGTCGGCGGATTTGAAACACTTGGGAGCGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGTAGATCCAAATTGGCTTGGCTTTGTAAAGAATTGCCGGCACATAAGCCGGGAACATTGATACGACTGCTTAATGCTCAGAGGAAATGGATGAAGCAGGATGATGCGGCCTATGTCATCGTGCATTGTTTGCGTATTCGCGAAAATGAGACTGCTTTTAGGGTTAGTTTTCGTTTTGATTCTATTATGTTCTACTATAACACCCGCAAAATACACGTTCGTATAAGTATAGCTACTTGGAATGCAATTGATAGGAAAGTTATTTGGAAGTGTGCAATTTCTGGAATATTGTTGTGCTTTGTTCTTTAGAATCTATGTTTGTGATATTGTTTTCTTCCCATTTCTTAATTTTCTTCATTTTTGAGATTAGGTGTACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCCTACCTTAGTGCACCTATCCAAGGATGCATAGAGGAAGCAAGTGCCATTTACAATCGTATGATTCAGTTAGGAGGTTACGAACCACGTCTTAGCTTGCACAATTCTCTCTTTAAAGCTCTCTTGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTCATATATCACAATCTGGTAACAACTGGACTTGAGTTGCATAAAGATATATATGCTGGTCTAATTTGGCTACATAGTTATCAGGATACTGTAGACAAAGAAAGGATAATGTCACTAAGGAAAGAAATGCAACAAGCAGGAATTGAGGAAGAAAGAGAAGTCCTTGTATCCATCTTGAGAGCGAGCTCGAAATTGGGGGATGTGATGGAAGCAGAAAGATCGTGGCTTAAACTTAAGTCTTTTGATGGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTATATGCAAAGGTGGGTAATCCGATGAAAGCTTTCGAGATATTTAGGGAGATGGAGCAGTTGAACTCTGTAAGTGCTGCAGCATATCAGACAATTATTGGGATTTTATGTAAAGTTGAAGAGGTAACACTTGCAGAATCCGTCATGGAAGGCTTCATAAAGAGTAATTTAAAGCCCCTCAAGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATAAGTTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAACCAAATCGTACTATTTACAGCATATATTTGAACTCTTTGGTAAAAGTTGGTAATCTCGACAGGGCTGAAGAAATATTTAGCCAGATGCAAACAAATGGAGAAATTGGTGTAAGTGCTCGTTCATGCAACATTATTTTAAGTGGGTACCTGTTAAGTGGGGATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAAGTACGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGATTAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTATTAGGTGGCCTGGAGATCGAATCTGATGAAGGGAGGAAGAATCATAGGATCCAATTTGAATTCCACGAAGATCGTAGCACCCACTCTCGTTTGAGGAGACACATATATGAGCAATATCATGAGTGGTTACATCCTGCTTCAAAGTCAAGCGATAGTGATACAGATATACCATATAAATTCTGCACCGTTTCGCATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTGCAATCCCTAATCTAATTCACCGGTGGCTTTCACCTCGTGTTCTTGCTTACTGGTACATGTATGGAGGCTGCAGGATATCGTCAGGGGATTTCGTACTGAAGCTAAAGGGAAGTCGTGAGGGTGTTGCGAAGATTGTTAAATCTCTGGGAGAAAAGTCCATGTCTTGCAAGGTGAAAAGGAAGGGCAGGGTGTATTGGATAGGCTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATGACTTGAAAGATAGGTTACAGGCAGACAGCCTTAACATGGAGAAGGCTGTAAATGAAACTTACAATATCAACTTTGATAGTCAATCTGATTCCGATGAGGAGGCGTCTAGTTAG

mRNA sequence

ATGAACCTTGTAAACCCTAAGCCCAAGGTTTCATCATCGACAGTTCTTCTGAACTCTCCTTCGAGTTCCTCCATGTCCATTCGAACCTCTGCCTTCGCCACCGTCACCCTTCTCCGCTCTCTCACTCTTTCCTTCCCTCTATGCCACCACCACTTCCGTTGCCGGAACTACGTCATCCGTTCTCTCTCTATCCCAACATATTCAGCGAAAGGACGACGACAACTTACGAGGATTCCTGCCTTTGCTTCCAGTTCTTCCGTTGAAGCGTTGGTGCATGACCGGGATTCCCCGGCCGAATCTGAAGAGCCTTTGTGTTCTCCATACAGTACTGGCGCTGAGGGGTTTGCGTCGGCGGATTTGAAACACTTGGGAGCGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGTAGATCCAAATTGGCTTGGCTTTGTAAAGAATTGCCGGCACATAAGCCGGGAACATTGATACGACTGCTTAATGCTCAGAGGAAATGGATGAAGCAGGATGATGCGGCCTATGTCATCGTGCATTGTTTGCGTATTCGCGAAAATGAGACTGCTTTTAGGGTGTACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCCTACCTTAGTGCACCTATCCAAGGATGCATAGAGGAAGCAAGTGCCATTTACAATCGTATGATTCAGTTAGGAGGTTACGAACCACGTCTTAGCTTGCACAATTCTCTCTTTAAAGCTCTCTTGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTCATATATCACAATCTGGTAACAACTGGACTTGAGTTGCATAAAGATATATATGCTGGTCTAATTTGGCTACATAGTTATCAGGATACTGTAGACAAAGAAAGGATAATGTCACTAAGGAAAGAAATGCAACAAGCAGGAATTGAGGAAGAAAGAGAAGTCCTTGTATCCATCTTGAGAGCGAGCTCGAAATTGGGGGATGTGATGGAAGCAGAAAGATCGTGGCTTAAACTTAAGTCTTTTGATGGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTATATGCAAAGGTGGGTAATCCGATGAAAGCTTTCGAGATATTTAGGGAGATGGAGCAGTTGAACTCTGTAAGTGCTGCAGCATATCAGACAATTATTGGGATTTTATGTAAAGTTGAAGAGGTAACACTTGCAGAATCCGTCATGGAAGGCTTCATAAAGAGTAATTTAAAGCCCCTCAAGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATAAGTTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAACCAAATCGTACTATTTACAGCATATATTTGAACTCTTTGGTAAAAGTTGGTAATCTCGACAGGGCTGAAGAAATATTTAGCCAGATGCAAACAAATGGAGAAATTGGTGTAAGTGCTCGTTCATGCAACATTATTTTAAGTGGGTACCTGTTAAGTGGGGATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAAGTACGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGATTAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTATTAGGTGGCCTGGAGATCGAATCTGATGAAGGGAGGAAGAATCATAGGATCCAATTTGAATTCCACGAAGATCGTAGCACCCACTCTCGTTTGAGGAGACACATATATGAGCAATATCATGAGTGGTTACATCCTGCTTCAAAGTCAAGCGATAGTGATACAGATATACCATATAAATTCTGCACCGTTTCGCATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTGCAATCCCTAATCTAATTCACCGGTGGCTTTCACCTCGTGTTCTTGCTTACTGGTACATGTATGGAGGCTGCAGGATATCGTCAGGGGATTTCGTACTGAAGCTAAAGGGAAGTCGTGAGGGTGTTGCGAAGATTGTTAAATCTCTGGGAGAAAAGTCCATGTCTTGCAAGGTGAAAAGGAAGGGCAGGGTGTATTGGATAGGCTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATGACTTGAAAGATAGGTTACAGGCAGACAGCCTTAACATGGAGAAGGCTGTAAATGAAACTTACAATATCAACTTTGATAGTCAATCTGATTCCGATGAGGAGGCGTCTAGTTAG

Coding sequence (CDS)

ATGAACCTTGTAAACCCTAAGCCCAAGGTTTCATCATCGACAGTTCTTCTGAACTCTCCTTCGAGTTCCTCCATGTCCATTCGAACCTCTGCCTTCGCCACCGTCACCCTTCTCCGCTCTCTCACTCTTTCCTTCCCTCTATGCCACCACCACTTCCGTTGCCGGAACTACGTCATCCGTTCTCTCTCTATCCCAACATATTCAGCGAAAGGACGACGACAACTTACGAGGATTCCTGCCTTTGCTTCCAGTTCTTCCGTTGAAGCGTTGGTGCATGACCGGGATTCCCCGGCCGAATCTGAAGAGCCTTTGTGTTCTCCATACAGTACTGGCGCTGAGGGGTTTGCGTCGGCGGATTTGAAACACTTGGGAGCGCCTGCGCTTGAAGTCAAGGAGCTGGATGAGTTGCCGGAGCAATGGCGTAGATCCAAATTGGCTTGGCTTTGTAAAGAATTGCCGGCACATAAGCCGGGAACATTGATACGACTGCTTAATGCTCAGAGGAAATGGATGAAGCAGGATGATGCGGCCTATGTCATCGTGCATTGTTTGCGTATTCGCGAAAATGAGACTGCTTTTAGGGTGTACAAGTGGATGATGCAACAACATTGGTACCGATTTGATTATGCTTTAGCTACTAAGCTTGCTGATTACATGGGCAAGGAACGGAAGTTCTCAAAGTGTCGGGAAGTATTTGATGATATAATTAATCAGGGATGTGTGCCAAGTGAATCCACATTTCATATATTGATTGTTGCCTACCTTAGTGCACCTATCCAAGGATGCATAGAGGAAGCAAGTGCCATTTACAATCGTATGATTCAGTTAGGAGGTTACGAACCACGTCTTAGCTTGCACAATTCTCTCTTTAAAGCTCTCTTGAGCAAACCAGGGGATTTGTCAAAGCATCATCTTAAACAGGCTGAGTTCATATATCACAATCTGGTAACAACTGGACTTGAGTTGCATAAAGATATATATGCTGGTCTAATTTGGCTACATAGTTATCAGGATACTGTAGACAAAGAAAGGATAATGTCACTAAGGAAAGAAATGCAACAAGCAGGAATTGAGGAAGAAAGAGAAGTCCTTGTATCCATCTTGAGAGCGAGCTCGAAATTGGGGGATGTGATGGAAGCAGAAAGATCGTGGCTTAAACTTAAGTCTTTTGATGGTAGCATGCCATCTCAGGCTTTTGTTTACAAAATGGAAGTATATGCAAAGGTGGGTAATCCGATGAAAGCTTTCGAGATATTTAGGGAGATGGAGCAGTTGAACTCTGTAAGTGCTGCAGCATATCAGACAATTATTGGGATTTTATGTAAAGTTGAAGAGGTAACACTTGCAGAATCCGTCATGGAAGGCTTCATAAAGAGTAATTTAAAGCCCCTCAAGCCAGCTTATGTTGATTTGATGAATATGTTTTTCAATTTAAGCTTACATGATAAGTTAGAGTTAACCTTCTCCCAGTGCCTTGAGAAGTGTAAACCAAATCGTACTATTTACAGCATATATTTGAACTCTTTGGTAAAAGTTGGTAATCTCGACAGGGCTGAAGAAATATTTAGCCAGATGCAAACAAATGGAGAAATTGGTGTAAGTGCTCGTTCATGCAACATTATTTTAAGTGGGTACCTGTTAAGTGGGGATTATTTGAAGGCTGAAAAAATATATGATTTGATGTGTCAGAAAAAGTACGACATTGATCCTCCATTAATGGAGAAACTTGATTATGTCCTAAGCTTGAGTAGGAAGGAGATTAAGAAGCCAGTAAGCTTGAAGTTGAGTAAAGAACAAAGGGAGATTTTAGTAGGGTTGTTATTAGGTGGCCTGGAGATCGAATCTGATGAAGGGAGGAAGAATCATAGGATCCAATTTGAATTCCACGAAGATCGTAGCACCCACTCTCGTTTGAGGAGACACATATATGAGCAATATCATGAGTGGTTACATCCTGCTTCAAAGTCAAGCGATAGTGATACAGATATACCATATAAATTCTGCACCGTTTCGCATTCATATTTTGGTTTCTATGCCGATCAGTTTTGGCCACGAGGCCATCCTGCAATCCCTAATCTAATTCACCGGTGGCTTTCACCTCGTGTTCTTGCTTACTGGTACATGTATGGAGGCTGCAGGATATCGTCAGGGGATTTCGTACTGAAGCTAAAGGGAAGTCGTGAGGGTGTTGCGAAGATTGTTAAATCTCTGGGAGAAAAGTCCATGTCTTGCAAGGTGAAAAGGAAGGGCAGGGTGTATTGGATAGGCTTACTTGGAAGCAACGCCACATGGTTCTGGAAACTAATTGAACCTTTCATTCTGGATGACTTGAAAGATAGGTTACAGGCAGACAGCCTTAACATGGAGAAGGCTGTAAATGAAACTTACAATATCAACTTTGATAGTCAATCTGATTCCGATGAGGAGGCGTCTAGTTAG

Protein sequence

MNLVNPKPKVSSSTVLLNSPSSSSMSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASSSSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQSDSDEEASS
Homology
BLAST of Cp4.1LG02g11170.1 vs. ExPASy Swiss-Prot
Match: Q9XIL5 (Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=OTP51 PE=2 SV=3)

HSP 1 Score: 886.7 bits (2290), Expect = 2.0e-256
Identity = 454/816 (55.64%), Postives = 607/816 (74.39%), Query Frame = 0

Query: 11  SSSTVLLNSPSSSSMSIRTSAF-ATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSA 70
           SSSTV + + + SS+S   +   ++ TL RS  LSF L  H        +R LSI T   
Sbjct: 29  SSSTVSVTTFNISSLSSNPNIINSSSTLFRS--LSFSLIRHRSSYSRRSLRRLSIHTVHG 88

Query: 71  KGRR----QLTRI-PAFASSSSVE---ALVHDRDSPAESEEPLCSPYSTGAEGFASADLK 130
              +      TR  P F ++S+ +     V       ESEE +      G    A  D++
Sbjct: 89  NKTQFFSHSSTRTPPLFTANSTAQRSGTFVEHLTGITESEEGISEANGFGDVESARNDIR 148

Query: 131 HLGAPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAA 190
           ++    +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW++Q+DA 
Sbjct: 149 NVATRRIETEFEVRELEELPEEWRRSKLAWLCKEVPTHKAVTLVRLLNAQKKWVRQEDAT 208

Query: 191 YVIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIIN 250
           Y+ VHC+RIRENET FRVY+WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++N
Sbjct: 209 YISVHCMRIRENETGFRVYRWMTQQNWYRFDFGLTTKLAEYLGKERKFTKCREVFDDVLN 268

Query: 251 QGCVPSESTFHILIVAYLSA-PIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSK 310
           QG VPSESTFHIL+VAYLS+  ++GC+EEA ++YNRMIQLGGY+PRLSLHNSLF+AL+SK
Sbjct: 269 QGRVPSESTFHILVVAYLSSLSVEGCLEEACSVYNRMIQLGGYKPRLSLHNSLFRALVSK 328

Query: 311 PGDLSKHHLKQAEFIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAG 370
            G +    LKQAEFI+HN+VTTGLE+ KDIY+GLIWLHS QD VD  RI SLR+EM++AG
Sbjct: 329 QGGILNDQLKQAEFIFHNVVTTGLEVQKDIYSGLIWLHSCQDEVDIGRINSLREEMKKAG 388

Query: 371 IEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAF 430
            +E +EV+VS+LRA +K G V E ER+WL+L   D  +PSQAFVYK+E Y+KVG+  KA 
Sbjct: 389 FQESKEVVVSLLRAYAKEGGVEEVERTWLELLDLDCGIPSQAFVYKIEAYSKVGDFAKAM 448

Query: 431 EIFREMEQ-LNSVSAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMF 490
           EIFREME+ +   + + Y  II +LCKV++V L E++M+ F +S  KPL P+++++  M+
Sbjct: 449 EIFREMEKHIGGATMSGYHKIIEVLCKVQQVELVETLMKEFEESGKKPLLPSFIEIAKMY 508

Query: 491 FNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSA 550
           F+L LH+KLE+ F QCLEKC+P++ IY+IYL+SL K+GNL++A ++F++M+ NG I VSA
Sbjct: 509 FDLGLHEKLEMAFVQCLEKCQPSQPIYNIYLDSLTKIGNLEKAGDVFNEMKNNGTINVSA 568

Query: 551 RSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKK-PVSLK 610
           RSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KE+KK P S+K
Sbjct: 569 RSCNSLLKGYLDCGKQVQAERIYDLMRMKKYEIEPPLMEKLDYILSLKKKEVKKRPFSMK 628

Query: 611 LSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDRSTHSRLRRHIYEQYHEWLHPAS 670
           LSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF E+   H  L+++I++Q+ EWLHP S
Sbjct: 629 LSKDQREVLVGLLLGGLQIESDKEKKSHMIKFEFRENSQAHLVLKQNIHDQFREWLHPLS 688

Query: 671 KSSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRI 730
              +    IP++F +V HSYFGFYA+ +WP+G P IP LIHRWLSP  LAYWYMY G + 
Sbjct: 689 NFQED--IIPFEFYSVPHSYFGFYAEHYWPKGQPEIPKLIHRWLSPHSLAYWYMYSGVKT 748

Query: 731 SSGDFVLKLKGSREGVAKIVKSLGEKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFIL 790
           SSGD +L+LKGS EGV K+VK+L  KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L
Sbjct: 749 SSGDIILRLKGSLEGVEKVVKALQAKSMECRVKKKGKVFWIGLQGTNSALFWKLIEPHVL 808

Query: 791 DDLKDRLQADSLNMEKAVN-ETYNINFDSQSDSDEE 810
           ++LK+ L+  S +++     E  +INF S SD  ++
Sbjct: 809 ENLKEHLKPASESLDNVKEAEEQSINFKSNSDHSDD 840

BLAST of Cp4.1LG02g11170.1 vs. ExPASy Swiss-Prot
Match: Q6ZHJ5 (Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=OTP51 PE=3 SV=1)

HSP 1 Score: 802.4 bits (2071), Expect = 4.8e-231
Identity = 398/738 (53.93%), Postives = 542/738 (73.44%), Query Frame = 0

Query: 78  IPAFASSSSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKH-LGAPALEVKELDEL 137
           IPA A  S++E+L+ D D   E E+          E +A+AD +  + +P L V EL+EL
Sbjct: 53  IPAVA--SALESLILDLDDDEEDEDEETEFGLFQGEAWAAADEREAVRSPELVVPELEEL 112

Query: 138 PEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVY 197
           PEQWRRS++AWLCKELPA+K  T  R+LNAQRKW+ QDDA YV VHCLRIR N+ AFRVY
Sbjct: 113 PEQWRRSRIAWLCKELPAYKHSTFTRILNAQRKWITQDDATYVAVHCLRIRNNDAAFRVY 172

Query: 198 KWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLS 257
            WM++QHW+RF++ALAT++AD +G++ K  KCREVF+ ++ QG VP+ESTFHILIVAYLS
Sbjct: 173 SWMVRQHWFRFNFALATRVADCLGRDGKVEKCREVFEAMVKQGRVPAESTFHILIVAYLS 232

Query: 258 APIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLV 317
            P   C+EEA  IYN+MIQ+GGY+PRLSLHNSLF+AL+SK G  +K++LKQAEF+YHN+V
Sbjct: 233 VPKGRCLEEACTIYNQMIQMGGYKPRLSLHNSLFRALVSKTGGTAKYNLKQAEFVYHNVV 292

Query: 318 TTGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGD 377
           TT L++HKD+YAGLIWLHSYQD +D+ERI++LRKEM+QAG +E  +VLVS++RA SK G+
Sbjct: 293 TTNLDVHKDVYAGLIWLHSYQDVIDRERIIALRKEMKQAGFDEGIDVLVSVMRAFSKEGN 352

Query: 378 VMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLN-SVSAAAYQT 437
           V E E +W  +      +P QA+V +ME YA+ G PMK+ ++F+EM+  N   + A+Y  
Sbjct: 353 VAETEATWHNILQSGSDLPVQAYVCRMEAYARTGEPMKSLDMFKEMKDKNIPPNVASYHK 412

Query: 438 IIGILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKC 497
           II I+ K  EV + E +M  FI+S++K L PA++DLM M+ +L +H+KLELTF +C+ +C
Sbjct: 413 IIEIMTKALEVDIVEQLMNEFIESDMKHLMPAFLDLMYMYMDLDMHEKLELTFLKCIARC 472

Query: 498 KPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAE 557
           +PNR +Y+IYL SLVKVGN+++AEE+F +M  NG IG + +SCNI+L GYL + DY KAE
Sbjct: 473 RPNRILYTIYLESLVKVGNIEKAEEVFGEMHNNGMIGTNTKSCNIMLRGYLSAEDYQKAE 532

Query: 558 KIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIK-KPVSLKLSKEQREILVGLLLGGLEIE 617
           K+YD+M +KKYD+    +EKL   L L++K IK K VS+KL +EQREIL+GLLLGG  +E
Sbjct: 533 KVYDMMSKKKYDVQADSLEKLQSGLLLNKKVIKPKTVSMKLDQEQREILIGLLLGGTRME 592

Query: 618 SDEGRKNHRIQFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSY 677
           S   R  H + F+F ED + HS LR HI+E++ EWL  AS+S D  + IPY+F T+ H +
Sbjct: 593 SYAQRGVHIVHFQFQEDSNAHSVLRVHIHERFFEWLSSASRSFDDGSKIPYQFSTIPHQH 652

Query: 678 FGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLK-GSREGVAKI 737
           F F+ DQF+ +G P +P LIHRWL+PRVLAYW+M+GG ++ SGD VLKL  G+ EGV +I
Sbjct: 653 FSFFVDQFFLKGQPVLPKLIHRWLTPRVLAYWFMFGGSKLPSGDIVLKLSGGNSEGVERI 712

Query: 738 VKSLGEKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVN 797
           V SL  +S++ KVKRKGR +WIG  GSNA  FW++IEP +L++    +  +      ++ 
Sbjct: 713 VNSLHTQSLTSKVKRKGRFFWIGFQGSNAESFWRIIEPHVLNNFASLVTQEG----SSIG 772

Query: 798 ETYNINFDSQSDSDEEAS 812
                + D+ SD D + S
Sbjct: 773 SDGTQDTDTDSDDDMQMS 784

BLAST of Cp4.1LG02g11170.1 vs. ExPASy Swiss-Prot
Match: O04491 (Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis thaliana OX=3702 GN=At1g09680 PE=3 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 4.8e-13
Identity = 71/284 (25.00%), Postives = 127/284 (44.72%), Query Frame = 0

Query: 190 ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHI 249
           +  FR+ K  M++   R D    + L + + KE K      +FD++  +G +P++  F  
Sbjct: 292 DEGFRL-KHQMEKSRTRPDVFTYSALINALCKENKMDGAHGLFDEMCKRGLIPNDVIFTT 351

Query: 250 LIVAYLSAPIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAE 309
           LI  +      G I+     Y +M+   G +P + L+N+L      K GD     L  A 
Sbjct: 352 LIHGHSR---NGEIDLMKESYQKMLS-KGLQPDIVLYNTLVNG-FCKNGD-----LVAAR 411

Query: 310 FIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILR 369
            I   ++  GL   K  Y  LI    +    D E  + +RKEM Q GIE +R    +++ 
Sbjct: 412 NIVDGMIRRGLRPDKITYTTLI--DGFCRGGDVETALEIRKEMDQNGIELDRVGFSALVC 471

Query: 370 ASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSV- 429
              K G V++AER+  ++           +   M+ + K G+    F++ +EM+    V 
Sbjct: 472 GMCKEGRVIDAERALREMLRAGIKPDDVTYTMMMDAFCKKGDAQTGFKLLKEMQSDGHVP 531

Query: 430 SAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLM 473
           S   Y  ++  LCK+ ++  A+ +++  +   + P    Y  L+
Sbjct: 532 SVVTYNVLLNGLCKLGQMKNADMLLDAMLNIGVVPDDITYNTLL 562

BLAST of Cp4.1LG02g11170.1 vs. ExPASy Swiss-Prot
Match: Q0WPZ6 (Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX=3702 GN=At2g17140 PE=2 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 5.3e-12
Identity = 78/334 (23.35%), Postives = 153/334 (45.81%), Query Frame = 0

Query: 229 REVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEEASAIYNRMIQLGGYEPRLSLHNS 288
           RE+FD++  +GC P+E TF IL+  Y  A   G  ++   + N M +  G  P   ++N+
Sbjct: 167 RELFDEMPEKGCKPNEFTFGILVRGYCKA---GLTDKGLELLNAM-ESFGVLPNKVIYNT 226

Query: 289 LFKALL--SKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIM 348
           +  +     +  D  K   K  E     LV   +  +  I A    L      +D  RI 
Sbjct: 227 IVSSFCREGRNDDSEKMVEKMRE---EGLVPDIVTFNSRISA----LCKEGKVLDASRIF 286

Query: 349 SLRKEMQQAGIEEEREVLVSI-LRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEV 408
           S  +  +  G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++ 
Sbjct: 287 SDMELDEYLGLPRPNSITYNLMLKGFCKVGLLEDAKTLFESIRENDDLASLQSYNIWLQG 346

Query: 409 YAKVGNPMKAFEIFREMEQLN-SVSAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPL 468
             + G  ++A  + ++M       S  +Y  ++  LCK+  ++ A++++    ++ + P 
Sbjct: 347 LVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPD 406

Query: 469 KPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLNSLVKVGNLDRAEEIFS 528
              Y  L++ + ++   D  +    + +   C PN    +I L+SL K+G +  AEE+  
Sbjct: 407 AVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLR 466

Query: 529 QMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI 558
           +M   G  G+   +CNII+ G   SG+  KA +I
Sbjct: 467 KMNEKG-YGLDTVTCNIIVDGLCGSGELDKAIEI 488

BLAST of Cp4.1LG02g11170.1 vs. ExPASy Swiss-Prot
Match: O82178 (Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX=3702 GN=At2g35130 PE=3 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 1.3e-10
Identity = 91/451 (20.18%), Postives = 201/451 (44.57%), Query Frame = 0

Query: 145 LAWLCKELPAHKPGTLIRLL-NAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQH 204
           L+++ KE    K   ++  L +    W   DD   V V     ++ ++   V +W++++ 
Sbjct: 93  LSFIQKETDPDKVADVLGALPSTHASW---DDLINVSVQLRLNKKWDSIILVCEWILRKS 152

Query: 205 WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCI 264
            ++ D      L D  G++ ++ +   ++  ++    VP+E T+ +LI AY  A   G I
Sbjct: 153 SFQPDVICFNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMA---GLI 212

Query: 265 EEASAIYNRMIQLGGYEPR---LSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGL 324
           E A  +   M Q     P+   ++++N+  + L+ + G     + ++A  ++  +     
Sbjct: 213 ERAEVVLVEM-QNHHVSPKTIGVTVYNAYIEGLMKRKG-----NTEEAIDVFQRMKRDRC 272

Query: 325 ELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEA 384
           +   + Y   + ++ Y           L  EM+    +       +++ A ++ G   +A
Sbjct: 273 KPTTETYN--LMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKA 332

Query: 385 ERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SVSAAAYQTII 444
           E  + +L+  DG  P   +VY   ME Y++ G P  A EIF  M+ +      A+Y  ++
Sbjct: 333 EEIFEQLQE-DGLEP-DVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 392

Query: 445 GILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CK 504
               +    + AE+V E   +  + P   +++ L++ +       K E    +  E   +
Sbjct: 393 DAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVE 452

Query: 505 PNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEK 564
           P+  + +  LN   ++G   + E+I ++M+ NG       + NI+++ Y  +G   + E+
Sbjct: 453 PDTFVLNSMLNLYGRLGQFTKMEKILAEME-NGPCTADISTYNILINIYGKAGFLERIEE 512

Query: 565 IYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI 588
           ++  + +K +   P ++     + + SRK++
Sbjct: 513 LFVELKEKNF--RPDVVTWTSRIGAYSRKKL 524

BLAST of Cp4.1LG02g11170.1 vs. NCBI nr
Match: XP_023525582.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1573 bits (4073), Expect = 0.0
Identity = 788/788 (100.00%), Postives = 788/788 (100.00%), Query Frame = 0

Query: 25  MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS 84
           MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS
Sbjct: 1   MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS 60

Query: 85  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 144
           SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK
Sbjct: 61  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 120

Query: 145 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 204
           LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW
Sbjct: 121 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 180

Query: 205 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 264
           YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE
Sbjct: 181 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 240

Query: 265 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 324
           EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK
Sbjct: 241 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 300

Query: 325 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 384
           DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW
Sbjct: 301 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 360

Query: 385 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 444
           LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE
Sbjct: 361 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 420

Query: 445 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 504
           EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI
Sbjct: 421 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 480

Query: 505 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 564
           YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK
Sbjct: 481 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 540

Query: 565 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 624
           KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Sbjct: 541 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 600

Query: 625 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 684
           QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP
Sbjct: 601 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 660

Query: 685 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 744
           RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC
Sbjct: 661 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 720

Query: 745 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 804
           KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS
Sbjct: 721 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 780

Query: 805 DSDEEASS 812
           DSDEEASS
Sbjct: 781 DSDEEASS 788

BLAST of Cp4.1LG02g11170.1 vs. NCBI nr
Match: XP_023521219.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1568 bits (4061), Expect = 0.0
Identity = 786/788 (99.75%), Postives = 786/788 (99.75%), Query Frame = 0

Query: 25  MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS 84
           MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQL RIPAFASS
Sbjct: 1   MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLPRIPAFASS 60

Query: 85  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 144
           SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK
Sbjct: 61  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 120

Query: 145 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 204
           LAWLCKELPAH PGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW
Sbjct: 121 LAWLCKELPAHTPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 180

Query: 205 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 264
           YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE
Sbjct: 181 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 240

Query: 265 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 324
           EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK
Sbjct: 241 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 300

Query: 325 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 384
           DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW
Sbjct: 301 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 360

Query: 385 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 444
           LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE
Sbjct: 361 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 420

Query: 445 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 504
           EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI
Sbjct: 421 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 480

Query: 505 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 564
           YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK
Sbjct: 481 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 540

Query: 565 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 624
           KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Sbjct: 541 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 600

Query: 625 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 684
           QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP
Sbjct: 601 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 660

Query: 685 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 744
           RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC
Sbjct: 661 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 720

Query: 745 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 804
           KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS
Sbjct: 721 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 780

Query: 805 DSDEEASS 812
           DSDEEASS
Sbjct: 781 DSDEEASS 788

BLAST of Cp4.1LG02g11170.1 vs. NCBI nr
Match: KAG6607381.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1564 bits (4050), Expect = 0.0
Identity = 788/812 (97.04%), Postives = 794/812 (97.78%), Query Frame = 0

Query: 1   MNLVNPKPKVSSSTVLLNSPSSSSMSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIR 60
           MNLVNPKPKVSSSTVLLNS SSSSMSIRTSAFATVTLLRSLTL F  CHHHFRCRNYVIR
Sbjct: 1   MNLVNPKPKVSSSTVLLNSTSSSSMSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIR 60

Query: 61  SLSIPTYSAKGRRQLTRIPAFASSSSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADL 120
           SL IPTYSAKGRRQL RIPAFASSSSVE LV+DRDSPAESEEPLCSPYSTGAEGFASADL
Sbjct: 61  SLCIPTYSAKGRRQLPRIPAFASSSSVEVLVYDRDSPAESEEPLCSPYSTGAEGFASADL 120

Query: 121 KHLGAPALEVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVI 180
           KHLGAPALEVKELDELPEQWRRSKLAWLCKELPA KPGTLIRLLNAQRKWMKQDDAAYVI
Sbjct: 121 KHLGAPALEVKELDELPEQWRRSKLAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYVI 180

Query: 181 VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGC 240
           VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGC
Sbjct: 181 VHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGC 240

Query: 241 VPSESTFHILIVAYLSAPIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDL 300
           VPSESTFHILIVAYLSAPIQGCIEEASAIYNRMIQLGGY+PRLSLHNSLFKAL+SKPGDL
Sbjct: 241 VPSESTFHILIVAYLSAPIQGCIEEASAIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDL 300

Query: 301 SKHHLKQAEFIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEE 360
           SKHHLKQAEFIYHN+ TTGLELHKDIY GLIWLHSYQDTVDKERIMSLRKEMQQAGIEEE
Sbjct: 301 SKHHLKQAEFIYHNMATTGLELHKDIYGGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEE 360

Query: 361 REVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFR 420
           REVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFR
Sbjct: 361 REVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFR 420

Query: 421 EMEQLNSVSAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSL 480
           EMEQLN +SAAAYQTIIGILCK+EEVTLAESVME FIKSNLKPLKPAYVDLMNMFFNLSL
Sbjct: 421 EMEQLNYISAAAYQTIIGILCKLEEVTLAESVMESFIKSNLKPLKPAYVDLMNMFFNLSL 480

Query: 481 HDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNI 540
           HDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNI
Sbjct: 481 HDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNI 540

Query: 541 ILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQR 600
           ILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQR
Sbjct: 541 ILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQR 600

Query: 601 EILVGLLLGGLEIESDEGRKNHRIQFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSD 660
           EILVGLLLGGLEIESDEGRKNHRIQFEFHED STHSRLRRHIYEQYHEWLH ASK SDSD
Sbjct: 601 EILVGLLLGGLEIESDEGRKNHRIQFEFHEDCSTHSRLRRHIYEQYHEWLHHASKLSDSD 660

Query: 661 TDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFV 720
           TDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFV
Sbjct: 661 TDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFV 720

Query: 721 LKLKGSREGVAKIVKSLGEKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDR 780
           LKLKGSREGVAKIVKSL EKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD 
Sbjct: 721 LKLKGSREGVAKIVKSLREKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDS 780

Query: 781 LQADSLNMEKAVNETYNINFDSQSDSDEEASS 812
           LQADSLNMEKA NETYNINFDSQSDSDEEASS
Sbjct: 781 LQADSLNMEKAANETYNINFDSQSDSDEEASS 812

BLAST of Cp4.1LG02g11170.1 vs. NCBI nr
Match: XP_022949171.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1527 bits (3953), Expect = 0.0
Identity = 764/788 (96.95%), Postives = 771/788 (97.84%), Query Frame = 0

Query: 25  MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS 84
           MSIRTSAFATVTLLRSLTL F  CHHHFRCRNYVIRSL IPTYSAKGRRQL RIPAFASS
Sbjct: 1   MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASS 60

Query: 85  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 144
           SSVEALV+DRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK
Sbjct: 61  SSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 120

Query: 145 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 204
           LAWLCKELPA KPGTLIRLLNAQRKWMKQDDAAY+IVHCLRIRENETAFRVYKWMMQQHW
Sbjct: 121 LAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHW 180

Query: 205 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 264
           YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE
Sbjct: 181 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 240

Query: 265 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 324
           E+S IYNRMIQLGGY+PRLSLHNSLFKAL+SKPGDLSKHHLKQAEFIYHNL TTGLELHK
Sbjct: 241 ESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK 300

Query: 325 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 384
           DIY GLIWLHSYQDTVDKERIMSLRKEM QAGIEEEREVLVSILRASSKLGDVMEAERSW
Sbjct: 301 DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSW 360

Query: 385 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 444
           LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNS+SAAAYQTIIGILCK E
Sbjct: 361 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIGILCKFE 420

Query: 445 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 504
           EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI
Sbjct: 421 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 480

Query: 505 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 564
           YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK
Sbjct: 481 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 540

Query: 565 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 624
           KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Sbjct: 541 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 600

Query: 625 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 684
           QFEFHED STHSRLRRHI+EQYHEWLHPASK SDSDTDIPYKFCTVSHSYFGFYADQFWP
Sbjct: 601 QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWP 660

Query: 685 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 744
           RGHP IPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSL EKSMSC
Sbjct: 661 RGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSC 720

Query: 745 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 804
           KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQADSLNMEKA NETYNINFDSQS
Sbjct: 721 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQS 780

Query: 805 DSDEEASS 812
           DSDEEASS
Sbjct: 781 DSDEEASS 788

BLAST of Cp4.1LG02g11170.1 vs. NCBI nr
Match: XP_022998786.1 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1515 bits (3923), Expect = 0.0
Identity = 758/788 (96.19%), Postives = 769/788 (97.59%), Query Frame = 0

Query: 25  MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS 84
           MSIRTSAFATVTLLRSLTL F  CH+HFRC NYVIRSLSIPTYSAKGRRQL RIPAFASS
Sbjct: 1   MSIRTSAFATVTLLRSLTLPFSQCHNHFRCWNYVIRSLSIPTYSAKGRRQLPRIPAFASS 60

Query: 85  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 144
           SSVEALV+DRDSPAESEEPLCSPYS GAE FASADLKHLGAPALEVKELDELPEQWRRSK
Sbjct: 61  SSVEALVYDRDSPAESEEPLCSPYSNGAEEFASADLKHLGAPALEVKELDELPEQWRRSK 120

Query: 145 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 204
           LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAY+IVHCLRIRENETAFRVYKWMMQQHW
Sbjct: 121 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHW 180

Query: 205 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 264
           YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIE
Sbjct: 181 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIE 240

Query: 265 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 324
           EAS IYNRMIQLGGY PRLSLHNSLFKAL+SKPGDLSKHHLKQAEFIYHNLVTTGLELHK
Sbjct: 241 EASTIYNRMIQLGGYPPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 300

Query: 325 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 384
           DIY GLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW
Sbjct: 301 DIYGGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 360

Query: 385 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 444
           LK+KSFDGSMPSQAFVYKMEVYAKVGNPMKA EIFREMEQLNS+S+AAYQTIIGILCK E
Sbjct: 361 LKIKSFDGSMPSQAFVYKMEVYAKVGNPMKALEIFREMEQLNSISSAAYQTIIGILCKFE 420

Query: 445 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 504
           EVTLAESVM GFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI
Sbjct: 421 EVTLAESVMAGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 480

Query: 505 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 564
           YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK
Sbjct: 481 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 540

Query: 565 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 624
           KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Sbjct: 541 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 600

Query: 625 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 684
           QFEFHED STHS LRRH+YEQYHEWLHPASK SDSDTDIPYKFCTVSHSYFGFYADQFWP
Sbjct: 601 QFEFHEDCSTHSCLRRHVYEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWP 660

Query: 685 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 744
           RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGV KIVKSL EKSMSC
Sbjct: 661 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVVKIVKSLREKSMSC 720

Query: 745 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 804
           KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQAD+LN+EKAVNETYNINFDSQS
Sbjct: 721 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADNLNLEKAVNETYNINFDSQS 780

Query: 805 DSDEEASS 812
           DSDEEASS
Sbjct: 781 DSDEEASS 788

BLAST of Cp4.1LG02g11170.1 vs. ExPASy TrEMBL
Match: A0A6J1GB98 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111452602 PE=4 SV=1)

HSP 1 Score: 1527 bits (3953), Expect = 0.0
Identity = 764/788 (96.95%), Postives = 771/788 (97.84%), Query Frame = 0

Query: 25  MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS 84
           MSIRTSAFATVTLLRSLTL F  CHHHFRCRNYVIRSL IPTYSAKGRRQL RIPAFASS
Sbjct: 1   MSIRTSAFATVTLLRSLTLPFSQCHHHFRCRNYVIRSLCIPTYSAKGRRQLPRIPAFASS 60

Query: 85  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 144
           SSVEALV+DRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK
Sbjct: 61  SSVEALVYDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 120

Query: 145 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 204
           LAWLCKELPA KPGTLIRLLNAQRKWMKQDDAAY+IVHCLRIRENETAFRVYKWMMQQHW
Sbjct: 121 LAWLCKELPAQKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHW 180

Query: 205 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 264
           YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE
Sbjct: 181 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 240

Query: 265 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 324
           E+S IYNRMIQLGGY+PRLSLHNSLFKAL+SKPGDLSKHHLKQAEFIYHNL TTGLELHK
Sbjct: 241 ESSTIYNRMIQLGGYQPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLATTGLELHK 300

Query: 325 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 384
           DIY GLIWLHSYQDTVDKERIMSLRKEM QAGIEEEREVLVSILRASSKLGDVMEAERSW
Sbjct: 301 DIYGGLIWLHSYQDTVDKERIMSLRKEMHQAGIEEEREVLVSILRASSKLGDVMEAERSW 360

Query: 385 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 444
           LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNS+SAAAYQTIIGILCK E
Sbjct: 361 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSISAAAYQTIIGILCKFE 420

Query: 445 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 504
           EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI
Sbjct: 421 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 480

Query: 505 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 564
           YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK
Sbjct: 481 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 540

Query: 565 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 624
           KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Sbjct: 541 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 600

Query: 625 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 684
           QFEFHED STHSRLRRHI+EQYHEWLHPASK SDSDTDIPYKFCTVSHSYFGFYADQFWP
Sbjct: 601 QFEFHEDCSTHSRLRRHIHEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWP 660

Query: 685 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 744
           RGHP IPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSL EKSMSC
Sbjct: 661 RGHPVIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLREKSMSC 720

Query: 745 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 804
           KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQADSLNMEKA NETYNINFDSQS
Sbjct: 721 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADSLNMEKAANETYNINFDSQS 780

Query: 805 DSDEEASS 812
           DSDEEASS
Sbjct: 781 DSDEEASS 788

BLAST of Cp4.1LG02g11170.1 vs. ExPASy TrEMBL
Match: A0A6J1KB64 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493350 PE=4 SV=1)

HSP 1 Score: 1515 bits (3923), Expect = 0.0
Identity = 758/788 (96.19%), Postives = 769/788 (97.59%), Query Frame = 0

Query: 25  MSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASS 84
           MSIRTSAFATVTLLRSLTL F  CH+HFRC NYVIRSLSIPTYSAKGRRQL RIPAFASS
Sbjct: 1   MSIRTSAFATVTLLRSLTLPFSQCHNHFRCWNYVIRSLSIPTYSAKGRRQLPRIPAFASS 60

Query: 85  SSVEALVHDRDSPAESEEPLCSPYSTGAEGFASADLKHLGAPALEVKELDELPEQWRRSK 144
           SSVEALV+DRDSPAESEEPLCSPYS GAE FASADLKHLGAPALEVKELDELPEQWRRSK
Sbjct: 61  SSVEALVYDRDSPAESEEPLCSPYSNGAEEFASADLKHLGAPALEVKELDELPEQWRRSK 120

Query: 145 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 204
           LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAY+IVHCLRIRENETAFRVYKWMMQQHW
Sbjct: 121 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYLIVHCLRIRENETAFRVYKWMMQQHW 180

Query: 205 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 264
           YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAP+QGCIE
Sbjct: 181 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIE 240

Query: 265 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 324
           EAS IYNRMIQLGGY PRLSLHNSLFKAL+SKPGDLSKHHLKQAEFIYHNLVTTGLELHK
Sbjct: 241 EASTIYNRMIQLGGYPPRLSLHNSLFKALVSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 300

Query: 325 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 384
           DIY GLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW
Sbjct: 301 DIYGGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 360

Query: 385 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTIIGILCKVE 444
           LK+KSFDGSMPSQAFVYKMEVYAKVGNPMKA EIFREMEQLNS+S+AAYQTIIGILCK E
Sbjct: 361 LKIKSFDGSMPSQAFVYKMEVYAKVGNPMKALEIFREMEQLNSISSAAYQTIIGILCKFE 420

Query: 445 EVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 504
           EVTLAESVM GFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI
Sbjct: 421 EVTLAESVMAGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYSI 480

Query: 505 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 564
           YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK
Sbjct: 481 YLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQK 540

Query: 565 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 624
           KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI
Sbjct: 541 KYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHRI 600

Query: 625 QFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFWP 684
           QFEFHED STHS LRRH+YEQYHEWLHPASK SDSDTDIPYKFCTVSHSYFGFYADQFWP
Sbjct: 601 QFEFHEDCSTHSCLRRHVYEQYHEWLHPASKLSDSDTDIPYKFCTVSHSYFGFYADQFWP 660

Query: 685 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMSC 744
           RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGV KIVKSL EKSMSC
Sbjct: 661 RGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVVKIVKSLREKSMSC 720

Query: 745 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQS 804
           KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKD LQAD+LN+EKAVNETYNINFDSQS
Sbjct: 721 KVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDSLQADNLNLEKAVNETYNINFDSQS 780

Query: 805 DSDEEASS 812
           DSDEEASS
Sbjct: 781 DSDEEASS 788

BLAST of Cp4.1LG02g11170.1 vs. ExPASy TrEMBL
Match: A0A1S3CPK0 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502781 PE=4 SV=1)

HSP 1 Score: 1324 bits (3427), Expect = 0.0
Identity = 665/794 (83.75%), Postives = 716/794 (90.18%), Query Frame = 0

Query: 24  SMSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFAS 83
           SMSI TSAF+TVTLLRSLTLS    HH+F   N++I +L I +YS K R QL RI AFAS
Sbjct: 4   SMSIPTSAFSTVTLLRSLTLSLSPYHHYFHYPNHIIPTLFISSYSVKVR-QLPRIRAFAS 63

Query: 84  SSSVEALVHDRDSPAESEEPLCSPYSTGAEGF------ASADLKHLGAPALEVKELDELP 143
            S V+ LV+DRDSP+ESEE L SPYS G +GF      AS DLKHLG PALEVKELDELP
Sbjct: 64  GSFVKQLVYDRDSPSESEEHLSSPYSNGGDGFHFENGFASVDLKHLGTPALEVKELDELP 123

Query: 144 EQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYK 203
           EQWRRSKLAWLCKELPA KPGT+IRLLNAQRKWM QDDA Y+ VHCLRIRENETAFRVYK
Sbjct: 124 EQWRRSKLAWLCKELPAQKPGTVIRLLNAQRKWMGQDDATYLTVHCLRIRENETAFRVYK 183

Query: 204 WMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA 263
           WMMQQHWYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA
Sbjct: 184 WMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA 243

Query: 264 PIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVT 323
           P+QGCIEEAS IYNRMIQLGGY+PRLSLH+SLF+AL+SKPGDLSKHHLKQAEFIYHNLVT
Sbjct: 244 PVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALMSKPGDLSKHHLKQAEFIYHNLVT 303

Query: 324 TGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDV 383
           +GLELHKDIY GLIWLHSYQDT+DKERI+SLRKEMQQAGI+EE+EVL+SILRASSK+GDV
Sbjct: 304 SGLELHKDIYGGLIWLHSYQDTIDKERIVSLRKEMQQAGIKEEKEVLLSILRASSKMGDV 363

Query: 384 MEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTII 443
           +EAER W KLK  DG+MP QAFVYKMEVYAK+G PMKA EIFREMEQLNS +AAAYQTII
Sbjct: 364 VEAERLWQKLKYLDGNMPYQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQTII 423

Query: 444 GILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKP 503
           GILCK +E+ LAES+M GFI+SNLKPL PAYVD+MNMFFNLSLHDKLELTFSQCLEKCKP
Sbjct: 424 GILCKFQEIELAESIMAGFIESNLKPLTPAYVDMMNMFFNLSLHDKLELTFSQCLEKCKP 483

Query: 504 NRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI 563
           NRTIYSIYL+SLVKVGNLDRAEEIFSQM+TNGEIGV+ARSCN+IL GYLL G+Y+KAEKI
Sbjct: 484 NRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGVNARSCNLILCGYLLFGNYMKAEKI 543

Query: 564 YDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDE 623
           YDLMCQKKYDIDPPLMEKLDYVLSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESDE
Sbjct: 544 YDLMCQKKYDIDPPLMEKLDYVLSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDE 603

Query: 624 GRKNHRIQFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGF 683
            RKNHRIQFEFH++  THS LRRHIYEQYH+WLH ASK +D D DIPYKFCTVSHSYFGF
Sbjct: 604 ERKNHRIQFEFHKNCKTHSVLRRHIYEQYHKWLHSASKLTDGDIDIPYKFCTVSHSYFGF 663

Query: 684 YADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSL 743
           YADQFWPRG   IPNLIHRWLSPR LAYWYMYGGCR SSGD +LKLKGS EGV KIVKSL
Sbjct: 664 YADQFWPRGRQTIPNLIHRWLSPRALAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSL 723

Query: 744 GEKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYN 803
            EKSM CKVKRKG +YWIGLLGSNATWFWKLIEPFILDDLK+  QADSLN+   +NET N
Sbjct: 724 REKSMHCKVKRKGSMYWIGLLGSNATWFWKLIEPFILDDLKESTQADSLNL-GVLNETEN 783

Query: 804 INFDSQSDSDEEAS 811
           INFDSQSDS EE S
Sbjct: 784 INFDSQSDSVEETS 795

BLAST of Cp4.1LG02g11170.1 vs. ExPASy TrEMBL
Match: A0A0A0LBL0 (LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100 PE=4 SV=1)

HSP 1 Score: 1312 bits (3395), Expect = 0.0
Identity = 651/794 (81.99%), Postives = 713/794 (89.80%), Query Frame = 0

Query: 24  SMSIRTSAFATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFAS 83
           SMSI TSAF+TVT LRSLTLS    HH+F C N++I +L +P YS K RRQL RI AFAS
Sbjct: 4   SMSIPTSAFSTVTRLRSLTLSLSPYHHYFHCPNHIIPTLFLPAYSVKVRRQLPRIRAFAS 63

Query: 84  SSSVEALVHDRDSPAESEEPLCSPYSTGAEGF------ASADLKHLGAPALEVKELDELP 143
            S V+ LV+D DSP+ESEE L S +S G +GF      AS DLKHLG P LEVKELDELP
Sbjct: 64  GSFVKQLVYDHDSPSESEEHLSSSFSNGGDGFHFENGFASVDLKHLGTPVLEVKELDELP 123

Query: 144 EQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYK 203
           EQWRRSK+AWLCKELPA KPGT+IRLLNAQ+KWM QDDA Y+IVHCLRIRENETAFRVYK
Sbjct: 124 EQWRRSKVAWLCKELPAQKPGTVIRLLNAQKKWMGQDDATYLIVHCLRIRENETAFRVYK 183

Query: 204 WMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA 263
           WMMQQHWYRFDYAL+TKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA
Sbjct: 184 WMMQQHWYRFDYALSTKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSA 243

Query: 264 PIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVT 323
           P+QGCIEEAS IYNRMIQLGGY+PRLSLH+SLF+AL+SKPGDLSKHHLKQAEFIYHNLVT
Sbjct: 244 PVQGCIEEASTIYNRMIQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHNLVT 303

Query: 324 TGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDV 383
           +GLELHKD+Y GLIWLHSYQDT+D+ERI+SLRKEMQQAGI+EEREVL+SILRASSK+GDV
Sbjct: 304 SGLELHKDMYGGLIWLHSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDV 363

Query: 384 MEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSVSAAAYQTII 443
           MEAE+ W +LK  DG+MPSQAFVYKMEVYAK+G PMKA EIFREMEQLNS +AAAYQTII
Sbjct: 364 MEAEKLWQELKYLDGNMPSQAFVYKMEVYAKMGKPMKALEIFREMEQLNSTNAAAYQTII 423

Query: 444 GILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKP 503
           GILCK + + LAES+M GFI+SNLKPL PAYVDLMNMFFNL+L DKLELTFSQCLEKCKP
Sbjct: 424 GILCKFQVIELAESIMAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEKCKP 483

Query: 504 NRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI 563
           NRTIYSIYL+SLVKVGNLDRAEEIFSQM+TNGEIG++ARSCNIIL GYLL G+Y+KAEKI
Sbjct: 484 NRTIYSIYLDSLVKVGNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKAEKI 543

Query: 564 YDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDE 623
           YDLMCQK+YDIDPPLMEKL+Y+LSLSRKE+KKP+SLKLSKEQREILVGLLLGGLEIESD+
Sbjct: 544 YDLMCQKRYDIDPPLMEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDD 603

Query: 624 GRKNHRIQFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGF 683
            RKNHRIQFEFH +  THS LRRHIYEQYH+WLH ASK +D D DIPYKFCTVSHSYFGF
Sbjct: 604 ERKNHRIQFEFHRNCKTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSYFGF 663

Query: 684 YADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSL 743
           YADQFWPRG  AIPNLIHRWLSPRVLAYWYMYGGCR SSGD +LKLKGS EGV KIVKSL
Sbjct: 664 YADQFWPRGRRAIPNLIHRWLSPRVLAYWYMYGGCRTSSGDILLKLKGSHEGVEKIVKSL 723

Query: 744 GEKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYN 803
            EKS+ CKVKRKG +YWIGLLGSNATWFWKLIEPFILD LK+  QADSLN+   +N + N
Sbjct: 724 REKSIHCKVKRKGNMYWIGLLGSNATWFWKLIEPFILDYLKESTQADSLNLVGVLNGSEN 783

Query: 804 INFDSQSDSDEEAS 811
           INFDS+SDS EE S
Sbjct: 784 INFDSESDSVEETS 797

BLAST of Cp4.1LG02g11170.1 vs. ExPASy TrEMBL
Match: A0A6P5T3C0 (pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Prunus avium OX=42229 GN=LOC110762945 PE=4 SV=1)

HSP 1 Score: 1083 bits (2801), Expect = 0.0
Identity = 543/788 (68.91%), Postives = 639/788 (81.09%), Query Frame = 0

Query: 33  ATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSAKGRRQLTRIPAFASSSSVEALVH 92
           ++++LLRSLTLS  L HHH     +       P   A   R++  +P+  SS+ VE L  
Sbjct: 41  SSLSLLRSLTLS--LSHHHHPTHRFPRPISGFPLAVAAKSRRVLALPS--SSTFVEHLSG 100

Query: 93  DRDSPAESEEPLCSPYSTGAEG--------FASADLKHLGAPALEVKELDELPEQWRRSK 152
           +   P E+ +      S  A+G        F+S DLKHL  P LEV EL++LPEQWRRSK
Sbjct: 101 EASQPGENWD-----LSNVAQGEAFDLEKCFSSTDLKHLAVPELEVPELEDLPEQWRRSK 160

Query: 153 LAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQHW 212
           LAWLCKELPAHK GTL R+LNAQ+KWM+Q+DA YV VHC+RIREN+  FRVYKWMMQQHW
Sbjct: 161 LAWLCKELPAHKAGTLSRILNAQKKWMRQEDATYVAVHCMRIRENDVGFRVYKWMMQQHW 220

Query: 213 YRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIE 272
           YRFD+ALATKLADYMGKERKFSKCR++FDDIINQG VPSESTFHIL+VAYLSA +QGC+E
Sbjct: 221 YRFDFALATKLADYMGKERKFSKCRDIFDDIINQGRVPSESTFHILVVAYLSASVQGCLE 280

Query: 273 EASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGLELHK 332
           EA  IYNRMIQLGGY+PRLSLHNSLFKAL+SKPG  SKH+LKQAEFI+HNLVTTGLE+HK
Sbjct: 281 EACGIYNRMIQLGGYQPRLSLHNSLFKALVSKPGTSSKHYLKQAEFIFHNLVTTGLEIHK 340

Query: 333 DIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEAERSW 392
           DIY+GLIWLHS QDT+DKER+ SLRKEMQQAGIE  R+VLVSILRA SK GDV EAE +W
Sbjct: 341 DIYSGLIWLHSCQDTIDKERMTSLRKEMQQAGIEVGRDVLVSILRACSKEGDVEEAESTW 400

Query: 393 LKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREM-EQLNSVSAAAYQTIIGILCKV 452
           LKL   D  +PSQA+VYKME Y+K G P ++ EIFREM EQL S +A AY  +I +LCK 
Sbjct: 401 LKLLHLDIGLPSQAYVYKMEAYSKAGEPRRSLEIFREMQEQLGSANAVAYHKVIQVLCKA 460

Query: 453 EEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEKCKPNRTIYS 512
           +EV LAES+M  FI + LK   P+Y+DLMNM+FNL  HDKLE  F QCLE+C+P+RTIYS
Sbjct: 461 QEVELAESLMTDFINTGLKTFMPSYIDLMNMYFNLGSHDKLESAFFQCLERCRPSRTIYS 520

Query: 513 IYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKIYDLMCQ 572
           IYL+SLVKVGNLD+AEEIF QMQ+NG IG++ARSCN ILSGYL SGDY+KAEKI+DLMCQ
Sbjct: 521 IYLDSLVKVGNLDKAEEIFDQMQSNGAIGINARSCNTILSGYLSSGDYVKAEKIFDLMCQ 580

Query: 573 KKYDIDPPLMEKLDYVLSLSRKEIKKPVSLKLSKEQREILVGLLLGGLEIESDEGRKNHR 632
           KKYD+D PLMEK+DYVLSLSRK +K+PVSLKLSKEQRE+LVG+LLGGL+IESDE RKNH 
Sbjct: 581 KKYDVDSPLMEKIDYVLSLSRKVVKRPVSLKLSKEQREVLVGMLLGGLQIESDEDRKNHM 640

Query: 633 IQFEFHEDRSTHSRLRRHIYEQYHEWLHPASKSSDSDTDIPYKFCTVSHSYFGFYADQFW 692
           I+FEF E+ STHS LRRH+Y+QYHEWLHP+ K+S+S  DI YKF T+SHSYFGFYADQFW
Sbjct: 641 IRFEFSENSSTHSLLRRHMYDQYHEWLHPSCKTSESTDDILYKFSTISHSYFGFYADQFW 700

Query: 693 PRGHPAIPNLIHRWLSPRVLAYWYMYGGCRISSGDFVLKLKGSREGVAKIVKSLGEKSMS 752
           P+G   IP LIHRWLSP  LAYWYMYGG R S+GD +LK+KG+ EGV KIV++L  KS+ 
Sbjct: 701 PKGRLVIPKLIHRWLSPCALAYWYMYGGHRTSTGDILLKIKGNEEGVEKIVRALKAKSLD 760

Query: 753 CKVKRKGRVYWIGLLGSNATWFWKLIEPFILDDLKDRLQADSLNMEKAVNETYNINFDSQ 811
           CKVKRKGRV+WIG LGSN+TWFWKL+EP+ILDDLK  L+   ++   AV ET N+NF S 
Sbjct: 761 CKVKRKGRVFWIGFLGSNSTWFWKLVEPYILDDLKHLLKGGQISDNSAV-ETENVNFGSG 818

BLAST of Cp4.1LG02g11170.1 vs. TAIR 10
Match: AT2G15820.1 (endonucleases )

HSP 1 Score: 886.7 bits (2290), Expect = 1.4e-257
Identity = 454/816 (55.64%), Postives = 607/816 (74.39%), Query Frame = 0

Query: 11  SSSTVLLNSPSSSSMSIRTSAF-ATVTLLRSLTLSFPLCHHHFRCRNYVIRSLSIPTYSA 70
           SSSTV + + + SS+S   +   ++ TL RS  LSF L  H        +R LSI T   
Sbjct: 29  SSSTVSVTTFNISSLSSNPNIINSSSTLFRS--LSFSLIRHRSSYSRRSLRRLSIHTVHG 88

Query: 71  KGRR----QLTRI-PAFASSSSVE---ALVHDRDSPAESEEPLCSPYSTGAEGFASADLK 130
              +      TR  P F ++S+ +     V       ESEE +      G    A  D++
Sbjct: 89  NKTQFFSHSSTRTPPLFTANSTAQRSGTFVEHLTGITESEEGISEANGFGDVESARNDIR 148

Query: 131 HLGAPAL----EVKELDELPEQWRRSKLAWLCKELPAHKPGTLIRLLNAQRKWMKQDDAA 190
           ++    +    EV+EL+ELPE+WRRSKLAWLCKE+P HK  TL+RLLNAQ+KW++Q+DA 
Sbjct: 149 NVATRRIETEFEVRELEELPEEWRRSKLAWLCKEVPTHKAVTLVRLLNAQKKWVRQEDAT 208

Query: 191 YVIVHCLRIRENETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIIN 250
           Y+ VHC+RIRENET FRVY+WM QQ+WYRFD+ L TKLA+Y+GKERKF+KCREVFDD++N
Sbjct: 209 YISVHCMRIRENETGFRVYRWMTQQNWYRFDFGLTTKLAEYLGKERKFTKCREVFDDVLN 268

Query: 251 QGCVPSESTFHILIVAYLSA-PIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSK 310
           QG VPSESTFHIL+VAYLS+  ++GC+EEA ++YNRMIQLGGY+PRLSLHNSLF+AL+SK
Sbjct: 269 QGRVPSESTFHILVVAYLSSLSVEGCLEEACSVYNRMIQLGGYKPRLSLHNSLFRALVSK 328

Query: 311 PGDLSKHHLKQAEFIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAG 370
            G +    LKQAEFI+HN+VTTGLE+ KDIY+GLIWLHS QD VD  RI SLR+EM++AG
Sbjct: 329 QGGILNDQLKQAEFIFHNVVTTGLEVQKDIYSGLIWLHSCQDEVDIGRINSLREEMKKAG 388

Query: 371 IEEEREVLVSILRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAF 430
            +E +EV+VS+LRA +K G V E ER+WL+L   D  +PSQAFVYK+E Y+KVG+  KA 
Sbjct: 389 FQESKEVVVSLLRAYAKEGGVEEVERTWLELLDLDCGIPSQAFVYKIEAYSKVGDFAKAM 448

Query: 431 EIFREMEQ-LNSVSAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMF 490
           EIFREME+ +   + + Y  II +LCKV++V L E++M+ F +S  KPL P+++++  M+
Sbjct: 449 EIFREMEKHIGGATMSGYHKIIEVLCKVQQVELVETLMKEFEESGKKPLLPSFIEIAKMY 508

Query: 491 FNLSLHDKLELTFSQCLEKCKPNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSA 550
           F+L LH+KLE+ F QCLEKC+P++ IY+IYL+SL K+GNL++A ++F++M+ NG I VSA
Sbjct: 509 FDLGLHEKLEMAFVQCLEKCQPSQPIYNIYLDSLTKIGNLEKAGDVFNEMKNNGTINVSA 568

Query: 551 RSCNIILSGYLLSGDYLKAEKIYDLMCQKKYDIDPPLMEKLDYVLSLSRKEIKK-PVSLK 610
           RSCN +L GYL  G  ++AE+IYDLM  KKY+I+PPLMEKLDY+LSL +KE+KK P S+K
Sbjct: 569 RSCNSLLKGYLDCGKQVQAERIYDLMRMKKYEIEPPLMEKLDYILSLKKKEVKKRPFSMK 628

Query: 611 LSKEQREILVGLLLGGLEIESDEGRKNHRIQFEFHEDRSTHSRLRRHIYEQYHEWLHPAS 670
           LSK+QRE+LVGLLLGGL+IESD+ +K+H I+FEF E+   H  L+++I++Q+ EWLHP S
Sbjct: 629 LSKDQREVLVGLLLGGLQIESDKEKKSHMIKFEFRENSQAHLVLKQNIHDQFREWLHPLS 688

Query: 671 KSSDSDTDIPYKFCTVSHSYFGFYADQFWPRGHPAIPNLIHRWLSPRVLAYWYMYGGCRI 730
              +    IP++F +V HSYFGFYA+ +WP+G P IP LIHRWLSP  LAYWYMY G + 
Sbjct: 689 NFQED--IIPFEFYSVPHSYFGFYAEHYWPKGQPEIPKLIHRWLSPHSLAYWYMYSGVKT 748

Query: 731 SSGDFVLKLKGSREGVAKIVKSLGEKSMSCKVKRKGRVYWIGLLGSNATWFWKLIEPFIL 790
           SSGD +L+LKGS EGV K+VK+L  KSM C+VK+KG+V+WIGL G+N+  FWKLIEP +L
Sbjct: 749 SSGDIILRLKGSLEGVEKVVKALQAKSMECRVKKKGKVFWIGLQGTNSALFWKLIEPHVL 808

Query: 791 DDLKDRLQADSLNMEKAVN-ETYNINFDSQSDSDEE 810
           ++LK+ L+  S +++     E  +INF S SD  ++
Sbjct: 809 ENLKEHLKPASESLDNVKEAEEQSINFKSNSDHSDD 840

BLAST of Cp4.1LG02g11170.1 vs. TAIR 10
Match: AT1G09680.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 78.2 bits (191), Expect = 3.4e-14
Identity = 71/284 (25.00%), Postives = 127/284 (44.72%), Query Frame = 0

Query: 190 ETAFRVYKWMMQQHWYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHI 249
           +  FR+ K  M++   R D    + L + + KE K      +FD++  +G +P++  F  
Sbjct: 292 DEGFRL-KHQMEKSRTRPDVFTYSALINALCKENKMDGAHGLFDEMCKRGLIPNDVIFTT 351

Query: 250 LIVAYLSAPIQGCIEEASAIYNRMIQLGGYEPRLSLHNSLFKALLSKPGDLSKHHLKQAE 309
           LI  +      G I+     Y +M+   G +P + L+N+L      K GD     L  A 
Sbjct: 352 LIHGHSR---NGEIDLMKESYQKMLS-KGLQPDIVLYNTLVNG-FCKNGD-----LVAAR 411

Query: 310 FIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILR 369
            I   ++  GL   K  Y  LI    +    D E  + +RKEM Q GIE +R    +++ 
Sbjct: 412 NIVDGMIRRGLRPDKITYTTLI--DGFCRGGDVETALEIRKEMDQNGIELDRVGFSALVC 471

Query: 370 ASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEVYAKVGNPMKAFEIFREMEQLNSV- 429
              K G V++AER+  ++           +   M+ + K G+    F++ +EM+    V 
Sbjct: 472 GMCKEGRVIDAERALREMLRAGIKPDDVTYTMMMDAFCKKGDAQTGFKLLKEMQSDGHVP 531

Query: 430 SAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLM 473
           S   Y  ++  LCK+ ++  A+ +++  +   + P    Y  L+
Sbjct: 532 SVVTYNVLLNGLCKLGQMKNADMLLDAMLNIGVVPDDITYNTLL 562

BLAST of Cp4.1LG02g11170.1 vs. TAIR 10
Match: AT2G17140.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 74.7 bits (182), Expect = 3.8e-13
Identity = 78/334 (23.35%), Postives = 153/334 (45.81%), Query Frame = 0

Query: 229 REVFDDIINQGCVPSESTFHILIVAYLSAPIQGCIEEASAIYNRMIQLGGYEPRLSLHNS 288
           RE+FD++  +GC P+E TF IL+  Y  A   G  ++   + N M +  G  P   ++N+
Sbjct: 167 RELFDEMPEKGCKPNEFTFGILVRGYCKA---GLTDKGLELLNAM-ESFGVLPNKVIYNT 226

Query: 289 LFKALL--SKPGDLSKHHLKQAEFIYHNLVTTGLELHKDIYAGLIWLHSYQDTVDKERIM 348
           +  +     +  D  K   K  E     LV   +  +  I A    L      +D  RI 
Sbjct: 227 IVSSFCREGRNDDSEKMVEKMRE---EGLVPDIVTFNSRISA----LCKEGKVLDASRIF 286

Query: 349 SLRKEMQQAGIEEEREVLVSI-LRASSKLGDVMEAERSWLKLKSFDGSMPSQAFVYKMEV 408
           S  +  +  G+     +  ++ L+   K+G + +A+  +  ++  D     Q++   ++ 
Sbjct: 287 SDMELDEYLGLPRPNSITYNLMLKGFCKVGLLEDAKTLFESIRENDDLASLQSYNIWLQG 346

Query: 409 YAKVGNPMKAFEIFREMEQLN-SVSAAAYQTIIGILCKVEEVTLAESVMEGFIKSNLKPL 468
             + G  ++A  + ++M       S  +Y  ++  LCK+  ++ A++++    ++ + P 
Sbjct: 347 LVRHGKFIEAETVLKQMTDKGIGPSIYSYNILMDGLCKLGMLSDAKTIVGLMKRNGVCPD 406

Query: 469 KPAYVDLMNMFFNLSLHDKLELTFSQCL-EKCKPNRTIYSIYLNSLVKVGNLDRAEEIFS 528
              Y  L++ + ++   D  +    + +   C PN    +I L+SL K+G +  AEE+  
Sbjct: 407 AVTYGCLLHGYCSVGKVDAAKSLLQEMMRNNCLPNAYTCNILLHSLWKMGRISEAEELLR 466

Query: 529 QMQTNGEIGVSARSCNIILSGYLLSGDYLKAEKI 558
           +M   G  G+   +CNII+ G   SG+  KA +I
Sbjct: 467 KMNEKG-YGLDTVTCNIIVDGLCGSGELDKAIEI 488

BLAST of Cp4.1LG02g11170.1 vs. TAIR 10
Match: AT2G35130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 70.1 bits (170), Expect = 9.3e-12
Identity = 91/451 (20.18%), Postives = 201/451 (44.57%), Query Frame = 0

Query: 145 LAWLCKELPAHKPGTLIRLL-NAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQH 204
           L+++ KE    K   ++  L +    W   DD   V V     ++ ++   V +W++++ 
Sbjct: 93  LSFIQKETDPDKVADVLGALPSTHASW---DDLINVSVQLRLNKKWDSIILVCEWILRKS 152

Query: 205 WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCI 264
            ++ D      L D  G++ ++ +   ++  ++    VP+E T+ +LI AY  A   G I
Sbjct: 153 SFQPDVICFNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMA---GLI 212

Query: 265 EEASAIYNRMIQLGGYEPR---LSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGL 324
           E A  +   M Q     P+   ++++N+  + L+ + G     + ++A  ++  +     
Sbjct: 213 ERAEVVLVEM-QNHHVSPKTIGVTVYNAYIEGLMKRKG-----NTEEAIDVFQRMKRDRC 272

Query: 325 ELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEA 384
           +   + Y   + ++ Y           L  EM+    +       +++ A ++ G   +A
Sbjct: 273 KPTTETYN--LMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKA 332

Query: 385 ERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SVSAAAYQTII 444
           E  + +L+  DG  P   +VY   ME Y++ G P  A EIF  M+ +      A+Y  ++
Sbjct: 333 EEIFEQLQE-DGLEP-DVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 392

Query: 445 GILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CK 504
               +    + AE+V E   +  + P   +++ L++ +       K E    +  E   +
Sbjct: 393 DAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVE 452

Query: 505 PNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEK 564
           P+  + +  LN   ++G   + E+I ++M+ NG       + NI+++ Y  +G   + E+
Sbjct: 453 PDTFVLNSMLNLYGRLGQFTKMEKILAEME-NGPCTADISTYNILINIYGKAGFLERIEE 512

Query: 565 IYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI 588
           ++  + +K +   P ++     + + SRK++
Sbjct: 513 LFVELKEKNF--RPDVVTWTSRIGAYSRKKL 524

BLAST of Cp4.1LG02g11170.1 vs. TAIR 10
Match: AT2G35130.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 70.1 bits (170), Expect = 9.3e-12
Identity = 91/451 (20.18%), Postives = 201/451 (44.57%), Query Frame = 0

Query: 145 LAWLCKELPAHKPGTLIRLL-NAQRKWMKQDDAAYVIVHCLRIRENETAFRVYKWMMQQH 204
           L+++ KE    K   ++  L +    W   DD   V V     ++ ++   V +W++++ 
Sbjct: 115 LSFIQKETDPDKVADVLGALPSTHASW---DDLINVSVQLRLNKKWDSIILVCEWILRKS 174

Query: 205 WYRFDYALATKLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPIQGCI 264
            ++ D      L D  G++ ++ +   ++  ++    VP+E T+ +LI AY  A   G I
Sbjct: 175 SFQPDVICFNLLIDAYGQKFQYKEAESLYVQLLESRYVPTEDTYALLIKAYCMA---GLI 234

Query: 265 EEASAIYNRMIQLGGYEPR---LSLHNSLFKALLSKPGDLSKHHLKQAEFIYHNLVTTGL 324
           E A  +   M Q     P+   ++++N+  + L+ + G     + ++A  ++  +     
Sbjct: 235 ERAEVVLVEM-QNHHVSPKTIGVTVYNAYIEGLMKRKG-----NTEEAIDVFQRMKRDRC 294

Query: 325 ELHKDIYAGLIWLHSYQDTVDKERIMSLRKEMQQAGIEEEREVLVSILRASSKLGDVMEA 384
           +   + Y   + ++ Y           L  EM+    +       +++ A ++ G   +A
Sbjct: 295 KPTTETYN--LMINLYGKASKSYMSWKLYCEMRSHQCKPNICTYTALVNAFAREGLCEKA 354

Query: 385 ERSWLKLKSFDGSMPSQAFVYK--MEVYAKVGNPMKAFEIFREMEQLN-SVSAAAYQTII 444
           E  + +L+  DG  P   +VY   ME Y++ G P  A EIF  M+ +      A+Y  ++
Sbjct: 355 EEIFEQLQE-DGLEP-DVYVYNALMESYSRAGYPYGAAEIFSLMQHMGCEPDRASYNIMV 414

Query: 445 GILCKVEEVTLAESVMEGFIKSNLKPLKPAYVDLMNMFFNLSLHDKLELTFSQCLEK-CK 504
               +    + AE+V E   +  + P   +++ L++ +       K E    +  E   +
Sbjct: 415 DAYGRAGLHSDAEAVFEEMKRLGIAPTMKSHMLLLSAYSKARDVTKCEAIVKEMSENGVE 474

Query: 505 PNRTIYSIYLNSLVKVGNLDRAEEIFSQMQTNGEIGVSARSCNIILSGYLLSGDYLKAEK 564
           P+  + +  LN   ++G   + E+I ++M+ NG       + NI+++ Y  +G   + E+
Sbjct: 475 PDTFVLNSMLNLYGRLGQFTKMEKILAEME-NGPCTADISTYNILINIYGKAGFLERIEE 534

Query: 565 IYDLMCQKKYDIDPPLMEKLDYVLSLSRKEI 588
           ++  + +K +   P ++     + + SRK++
Sbjct: 535 LFVELKEKNF--RPDVVTWTSRIGAYSRKKL 546

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9XIL52.0e-25655.64Pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Arabidop... [more]
Q6ZHJ54.8e-23153.93Pentatricopeptide repeat-containing protein OTP51, chloroplastic OS=Oryza sativa... [more]
O044914.8e-1325.00Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis th... [more]
Q0WPZ65.3e-1223.35Pentatricopeptide repeat-containing protein At2g17140 OS=Arabidopsis thaliana OX... [more]
O821781.3e-1020.18Pentatricopeptide repeat-containing protein At2g35130 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_023525582.10.0100.00pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucur... [more]
XP_023521219.10.099.75pentatricopeptide repeat-containing protein At2g15820, chloroplastic-like [Cucur... [more]
KAG6607381.10.097.04Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
XP_022949171.10.096.95pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita ... [more]
XP_022998786.10.096.19pentatricopeptide repeat-containing protein At2g15820, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
A0A6J1GB980.096.95pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit... [more]
A0A6J1KB640.096.19pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucurbit... [more]
A0A1S3CPK00.083.75pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Cucumis ... [more]
A0A0A0LBL00.081.99LAGLIDADG_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G625100... [more]
A0A6P5T3C00.068.91pentatricopeptide repeat-containing protein At2g15820, chloroplastic OS=Prunus a... [more]
Match NameE-valueIdentityDescription
AT2G15820.11.4e-25755.64endonucleases [more]
AT1G09680.13.4e-1425.00Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G17140.13.8e-1323.35Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G35130.19.3e-1220.18Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G35130.29.3e-1220.18Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 774..794
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..23
NoneNo IPR availablePANTHERPTHR47539PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN OTP51, CHLOROPLASTICcoord: 35..810
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 186..436
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 373..567
IPR027434Homing endonucleaseGENE3D3.10.28.10Homing endonucleasescoord: 577..687
e-value: 1.8E-18
score: 68.6
IPR027434Homing endonucleaseGENE3D3.10.28.10Homing endonucleasescoord: 689..788
e-value: 4.3E-10
score: 41.9
IPR027434Homing endonucleaseSUPERFAMILY55608Homing endonucleasescoord: 591..781
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 537..568
e-value: 3.8E-5
score: 21.6
coord: 501..529
e-value: 1.1E-4
score: 20.2
coord: 216..244
e-value: 4.6E-4
score: 18.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 537..564
e-value: 0.0069
score: 16.6
coord: 501..529
e-value: 0.0017
score: 18.5
coord: 404..425
e-value: 0.011
score: 15.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 498..532
score: 9.174665
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 492..576
e-value: 1.1E-13
score: 53.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 149..302
e-value: 2.3E-18
score: 68.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 303..491
e-value: 9.3E-14
score: 53.5
IPR004860Homing endonuclease, LAGLIDADGPFAMPF03161LAGLIDADG_2coord: 600..765
e-value: 5.8E-42
score: 143.5

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG02g11170Cp4.1LG02g11170gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g11170.1:exon:002Cp4.1LG02g11170.1:exon:002exon
Cp4.1LG02g11170.1:exon:001Cp4.1LG02g11170.1:exon:001exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG02g11170.1:cds:002Cp4.1LG02g11170.1:cds:002CDS
Cp4.1LG02g11170.1:cds:001Cp4.1LG02g11170.1:cds:001CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG02g11170.1Cp4.1LG02g11170.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010239 chloroplast mRNA processing
biological_process GO:0000373 Group II intron splicing
biological_process GO:0045292 mRNA cis splicing, via spliceosome
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0048564 photosystem I assembly
biological_process GO:0006388 tRNA splicing, via endonucleolytic cleavage and ligation
cellular_component GO:0009507 chloroplast
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0005515 protein binding