Cp4.1LG18g05820 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG18g05820
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG18: 6250917 .. 6254930 (-)
RNA-Seq ExpressionCp4.1LG18g05820
SyntenyCp4.1LG18g05820
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTCCCTGAAACTCTCCTTCTCTCTAGATTCTTTCCATTCTAAGAAATTCGATTTTCCGGTTAATTCATCTCTGCTCTCTGATTGCTGCTCTGTTTTCTCTATCACTGGCTATATTCATCTCAATAAGTCCTGCGTACTTTACTCTCTGGTTAGGGCTCACAAGCCTTCTAAGGTCGAGCCGGAGACATCCGGCGGTTACGAATCGAAATGTGCCGTTGATGAAATTGACACCAGGAAGAAGTATTTTGGCGGCAAGAAGCCATCAAAGAGAGCGCCAGGTTCGTATTTTAGTTTCAGTAAGAATTGTAGTGAGAAAGTTTTCGATAGTATTGTTTTTCATGGTGGCGAATTGGATGTCAATTACTCCACTATATCGTCCGATTTGAGTTTAGAGGATTGCAATGCCATTTTAAAAAGGTTAGAGAAGTGTAATGATCGAAAAGCACTAGGTTTCTTTGAGTGGATGAGAATCAACCGGAAATTAGAACACAATGTGAGTGCGTATAATTTGATTCTTCGAGTGTTGGGCAGGCAACAAGATTGGGATGCTGCCGATAAGCTAATTAGAGAAGTTAGAGCTGAGTTGAGTGATCAATTGGATTTTCAGGTCTTTAACACCCTTATTTATGCTTGTTATAAGTCGGGGCTTGTAGAGCAGGGTGCTAAATGGTTTCAAATGATGTTGGAATGGCAAGTGCTGCCCAATGTTGCAACGTTTGGAATGCTTATGGGCCTCTATCAGAAGAGTTGTAACCTCAAGGAGGCAGAGTTCGCTTTTAATCAGATGAGAAACTTTGGGATTGTCTGCGAAACGGCATATGCATCTATGATTACTATATACACGCGTTTGAGTTTGTACGATAAAGCAGAAGAGGTGATTCGATTAATGCAAGAAGATAAGGTAATACCGAATGTAGAGAACTGGTTAGTCATGCTTAATGCTTATTGTCAGCAAGGTAAAATGGAGGACGCTGAACTTGTGTTTGCCTCGATGGAAGAACATGGGTTTTCGTCTAATATCATTGCGTATAATACGTTGATTACTGGGTATGGAAAAGCATCGAATATGGATGCTGCTCAACGCCTGTTCTTGAGCATCAAGAACTCTGGTGTAGAACCTGATGAAACGACTTACCGCTCCATGATTGAAGGTTGGGGACGAGCTGGTAATTACAAAATGGCAGAATGGTACTTTAAGGAACTCAAGCGAAAAGGATATATGCCGAATGCCTCTAACTTGTTCACCCTCATGAATCTTCAAGCCAAACATGAGGATGACGCAGGTGCACTTAAAACTCTTAATGATATGCTGAAGATTGGATGCCGGCTTTCTTCCATTGTTGGAAATGTTTTACAAGCTTATGAAAAGGCTAGAAGAATAAAAAGTGTGCCTCTCCTCTTGACAGGATCGTTCTATCGGAAAGTTCTGGCCAGCCAGACATCTTGCTCGATTCTGGTAATGGCTTATGTGAAGCACGGTTTAGTGGATGATGCTTTGAAAGTGTTGAGGGAAAAAGAGTGGAATGATCTTCGTTTTGAGGAGAATTTATATCATTTGCTAATTTGTTCATGTAAAGAGTTAGACCATCTCGAGAACGCAATCAAGATATACACTCAACTACCCAAACGTAAAAACAAACCGAACTTGCATATCACGAGCACAATGATTGATATCTACAGCATCATGGGTAGGTTCTCTGACGGGGAGAAACTTTATCTAAGCCTGAAATCTTCAGGCATTCGTTTGGATTTGATTGCCTTCAGTGTTGTTGTGAGAATGTATGTCAAAGCTGGATCATTGGAAGATGCATGCTCAGTTCTTGACTTCATGGATAAACAGCAGGACATTGTTCCAGACATATATCTGTTCCGGGACATGCTGCGTATTTATCAACGTTGTGGCATGGTGGATAAGCTACAAGATGTGTACTATAGGATACTGAATAGTGACGTCTCTTGGGATCAGGAAATGTATAATTGTGTCATAAATTGCTGTTCCCGTGCTCTGCTTGTTGATGAGCTTTCCAGCCTTTTTGATGAAATGCTTCAACGTGGGTTTGCTCCAAATACCGTGACCTTGAATGTCATGCTTGACGTTTATGGGAAGTCCAAGCTTTTTTCCAAGGCCAGAAAACTGTTATTGCTGGCTCAGAAAAAAGGTTTGGTTGATGTAATCTCTTATAATACTATGATATCTGCGTTTGGAAAGAGCAAGGACTTCGCAAACATGTCGTCCACAGTTAGAACAATGGAATTTAATGGCTTTTCGCTTTCCCTTGAAGCATACAATTCTCTGTTGGATGCTTATGGCAAAGAAGGCCGAATGGATAATTTCAGACAAGTCTTACAGCAATTAAAGGACTCGAATTCTGAACGTGACCAATACACTTATAACATCATGATCAACATCTATGGAAAACAAGGATGGATTGACGATGTCGAGGAAGTGCTGACAGAACTGAAAGCATGTGGACTCGAACCCGATCTGTATAGCTACAACGCATTGATCAAGGCATATGGAATAGCAGGGATGGTTGAAGAAGCCGCTCAGTTGGTGAAAGAAATGAGAGAAAAGAGGATAGAACCGGATAAGGTTACTTTTCTTAGCATGATCACTGCACTACAAAGAAACGATCAATACTTGGAGGCAATCAAGTGGTCATTGTGGATGAAGCAGATGAACTATTGAAGTTCAGAAATGCCAAAACAATGAAACAGGTGTTTGCCCTCCCTCGACGTAGCCTCCGGATATATCCGATCTTCCTGTCCCAACTCTTGCTCAAAACATATACTGGTAAGCGTTTGTTATTCCCTGCCACTTATCAAATGTTGTTTCTTTACCTTTATATTTCATATCTGTTAAAATATCATCAGTAGCCCAACCTTGTATCTCAACAGGGTCAAGCCCAACTATTGTATCTCAATAGAGTCGAGCCCAACCATTGTATGAGTGACTATATTTATGTTAAGAAAATGATTTAGGATTTTAGGAATAAACTTAGAAAAATGATTTAGGAATTTTAAGGAAAAACGTTGAGAAATTTTAGGAATAGATTGGCTACTATATAGTATAAATACCCTCCACCCTATTTAGTTCATCGTCCCAAGCAATACAAAGTAGCAGTATTCTATAAAGTATCTTGTATTCTATTCTTGATTGGTTTTAATAAAGAGTGCGAGTGTTTCCCGCATAGTTGTTCAACAATATCTCAACAATATCTGCCTCTTTTTGTTTTGATTCGCATAACAAAATATAATTACTGAACTTCCCCAAGCTTTCCCACTAACAAAGGTGTGATAAGTAATAGTCTTCATGCACTGTTTAGCTCATAATATGTAATATTCTTATTATTGATATTTTCTGTCTTAAACAAGTGTCATATCATATGGTAGAATCATAAACTTCTTCAAGGTTAGCTCAATCCTTGTTAGTTTAACAATTGTAAGGAAGAAGCTAAAGAAAATGCTGAATATACCATTGGTTTAATGTTGCTTTCCTGATCTTGTTTCTCATTGCTTTGGCAGGTTGGGATTTGGCAGACAAAATTGCCTTCAAAGCCGCAAAAAACTGCTCTCTCCGCCGTTCACACTTGTTTCTAAGGTTATATATAATCAATAAAGGTACAGTACAATTTATATGAGCTCACATGCTTGCACAATTATTTAGAAATATCAAAACTTGTTGGAATTTAATTACGAAAGTACAGAGATTAAAGCAAAACACTCAATTATATGCACCAAAAGAGTCTAAAAATCACAAAATTGACCAATAGAGTTGACATTATTAGTTGGTGTATTAGGGAGGTTTGAAAAAGAAGGTACAAATTATTATTGTTTAATTATGTTGAAAAGAACAGGTGTGAATTATGATTGCTTTTCAAAATGGAAGCTGTATTTGATGAACCCCATCCTTCCAAGCTTACCTTTTTCCTTTTCAATTTCTTATGTAAATGTGAATGATCATTTGGGACGACAATAAGGACTGAGTTTTTCTTCATGCTTATTATTT

mRNA sequence

ATGGCCTCCCTGAAACTCTCCTTCTCTCTAGATTCTTTCCATTCTAAGAAATTCGATTTTCCGGTTAATTCATCTCTGCTCTCTGATTGCTGCTCTGTTTTCTCTATCACTGGCTATATTCATCTCAATAAGTCCTGCGTACTTTACTCTCTGGTTAGGGCTCACAAGCCTTCTAAGGTCGAGCCGGAGACATCCGGCGGTTACGAATCGAAATGTGCCGTTGATGAAATTGACACCAGGAAGAAGTATTTTGGCGGCAAGAAGCCATCAAAGAGAGCGCCAGGTTCGTATTTTAGTTTCAGTAAGAATTGTAGTGAGAAAGTTTTCGATAGTATTGTTTTTCATGGTGGCGAATTGGATGTCAATTACTCCACTATATCGTCCGATTTGAGTTTAGAGGATTGCAATGCCATTTTAAAAAGGTTAGAGAAGTGTAATGATCGAAAAGCACTAGGTTTCTTTGAGTGGATGAGAATCAACCGGAAATTAGAACACAATGTGAGTGCGTATAATTTGATTCTTCGAGTGTTGGGCAGGCAACAAGATTGGGATGCTGCCGATAAGCTAATTAGAGAAGTTAGAGCTGAGTTGAGTGATCAATTGGATTTTCAGGTCTTTAACACCCTTATTTATGCTTGTTATAAGTCGGGGCTTGTAGAGCAGGGTGCTAAATGGTTTCAAATGATGTTGGAATGGCAAGTGCTGCCCAATGTTGCAACGTTTGGAATGCTTATGGGCCTCTATCAGAAGAGTTGTAACCTCAAGGAGGCAGAGTTCGCTTTTAATCAGATGAGAAACTTTGGGATTGTCTGCGAAACGGCATATGCATCTATGATTACTATATACACGCGTTTGAGTTTGTACGATAAAGCAGAAGAGGTGATTCGATTAATGCAAGAAGATAAGGTAATACCGAATGTAGAGAACTGGTTAGTCATGCTTAATGCTTATTGTCAGCAAGGTAAAATGGAGGACGCTGAACTTGTGTTTGCCTCGATGGAAGAACATGGGTTTTCGTCTAATATCATTGCGTATAATACGTTGATTACTGGGTATGGAAAAGCATCGAATATGGATGCTGCTCAACGCCTGTTCTTGAGCATCAAGAACTCTGGTGTAGAACCTGATGAAACGACTTACCGCTCCATGATTGAAGGTTGGGGACGAGCTGGTAATTACAAAATGGCAGAATGGTACTTTAAGGAACTCAAGCGAAAAGGATATATGCCGAATGCCTCTAACTTGTTCACCCTCATGAATCTTCAAGCCAAACATGAGGATGACGCAGGTGCACTTAAAACTCTTAATGATATGCTGAAGATTGGATGCCGGCTTTCTTCCATTGTTGGAAATGTTTTACAAGCTTATGAAAAGGCTAGAAGAATAAAAAGTGTGCCTCTCCTCTTGACAGGATCGTTCTATCGGAAAGTTCTGGCCAGCCAGACATCTTGCTCGATTCTGGTAATGGCTTATGTGAAGCACGGTTTAGTGGATGATGCTTTGAAAGTGTTGAGGGAAAAAGAGTGGAATGATCTTCGTTTTGAGGAGAATTTATATCATTTGCTAATTTGTTCATGTAAAGAGTTAGACCATCTCGAGAACGCAATCAAGATATACACTCAACTACCCAAACGTAAAAACAAACCGAACTTGCATATCACGAGCACAATGATTGATATCTACAGCATCATGGGTAGGTTCTCTGACGGGGAGAAACTTTATCTAAGCCTGAAATCTTCAGGCATTCGTTTGGATTTGATTGCCTTCAGTGTTGTTGTGAGAATGTATGTCAAAGCTGGATCATTGGAAGATGCATGCTCAGTTCTTGACTTCATGGATAAACAGCAGGACATTGTTCCAGACATATATCTGTTCCGGGACATGCTGCGTATTTATCAACGTTGTGGCATGGTGGATAAGCTACAAGATGTGTACTATAGGATACTGAATAGTGACGTCTCTTGGGATCAGGAAATGTATAATTGTGTCATAAATTGCTGTTCCCGTGCTCTGCTTGTTGATGAGCTTTCCAGCCTTTTTGATGAAATGCTTCAACGTGGGTTTGCTCCAAATACCGTGACCTTGAATGTCATGCTTGACGTTTATGGGAAGTCCAAGCTTTTTTCCAAGGCCAGAAAACTGTTATTGCTGGCTCAGAAAAAAGGTTTGGTTGATGTAATCTCTTATAATACTATGATATCTGCGTTTGGAAAGAGCAAGGACTTCGCAAACATGTCGTCCACAGTTAGAACAATGGAATTTAATGGCTTTTCGCTTTCCCTTGAAGCATACAATTCTCTGTTGGATGCTTATGGCAAAGAAGGCCGAATGGATAATTTCAGACAAGTCTTACAGCAATTAAAGGACTCGAATTCTGAACGTGACCAATACACTTATAACATCATGATCAACATCTATGGAAAACAAGGATGGATTGACGATGTCGAGGAAGTGCTGACAGAACTGAAAGCATGTGGACTCGAACCCGATCTGTATAGCTACAACGCATTGATCAAGGCATATGGAATAGCAGGGATGGTTGAAGAAGCCGCTCAGTTGGTGAAAGAAATGAGAGAAAAGAGGATAGAACCGGATAAGGTGTTTGCCCTCCCTCGACGTAGCCTCCGGATATATCCGATCTTCCTGTCCCAACTCTTGCTCAAAACATATACTGGTTGGGATTTGGCAGACAAAATTGCCTTCAAAGCCGCAAAAAACTGCTCTCTCCGCCGTTCACACTTGTTTCTAAGGTTATATATAATCAATAAAGGTACAGTACAATTTATATGAGCTCACATGCTTGCACAATTATTTAGAAATATCAAAACTTGTTGGAATTTAATTACGAAAGTACAGAGATTAAAGCAAAACACTCAATTATATGCACCAAAAGAGTCTAAAAATCACAAAATTGACCAATAGAGTTGACATTATTAGTTGGTGTATTAGGGAGGTTTGAAAAAGAAGGTACAAATTATTATTGTTTAATTATGTTGAAAAGAACAGGTGTGAATTATGATTGCTTTTCAAAATGGAAGCTGTATTTGATGAACCCCATCCTTCCAAGCTTACCTTTTTCCTTTTCAATTTCTTATGTAAATGTGAATGATCATTTGGGACGACAATAAGGACTGAGTTTTTCTTCATGCTTATTATTT

Coding sequence (CDS)

ATGGCCTCCCTGAAACTCTCCTTCTCTCTAGATTCTTTCCATTCTAAGAAATTCGATTTTCCGGTTAATTCATCTCTGCTCTCTGATTGCTGCTCTGTTTTCTCTATCACTGGCTATATTCATCTCAATAAGTCCTGCGTACTTTACTCTCTGGTTAGGGCTCACAAGCCTTCTAAGGTCGAGCCGGAGACATCCGGCGGTTACGAATCGAAATGTGCCGTTGATGAAATTGACACCAGGAAGAAGTATTTTGGCGGCAAGAAGCCATCAAAGAGAGCGCCAGGTTCGTATTTTAGTTTCAGTAAGAATTGTAGTGAGAAAGTTTTCGATAGTATTGTTTTTCATGGTGGCGAATTGGATGTCAATTACTCCACTATATCGTCCGATTTGAGTTTAGAGGATTGCAATGCCATTTTAAAAAGGTTAGAGAAGTGTAATGATCGAAAAGCACTAGGTTTCTTTGAGTGGATGAGAATCAACCGGAAATTAGAACACAATGTGAGTGCGTATAATTTGATTCTTCGAGTGTTGGGCAGGCAACAAGATTGGGATGCTGCCGATAAGCTAATTAGAGAAGTTAGAGCTGAGTTGAGTGATCAATTGGATTTTCAGGTCTTTAACACCCTTATTTATGCTTGTTATAAGTCGGGGCTTGTAGAGCAGGGTGCTAAATGGTTTCAAATGATGTTGGAATGGCAAGTGCTGCCCAATGTTGCAACGTTTGGAATGCTTATGGGCCTCTATCAGAAGAGTTGTAACCTCAAGGAGGCAGAGTTCGCTTTTAATCAGATGAGAAACTTTGGGATTGTCTGCGAAACGGCATATGCATCTATGATTACTATATACACGCGTTTGAGTTTGTACGATAAAGCAGAAGAGGTGATTCGATTAATGCAAGAAGATAAGGTAATACCGAATGTAGAGAACTGGTTAGTCATGCTTAATGCTTATTGTCAGCAAGGTAAAATGGAGGACGCTGAACTTGTGTTTGCCTCGATGGAAGAACATGGGTTTTCGTCTAATATCATTGCGTATAATACGTTGATTACTGGGTATGGAAAAGCATCGAATATGGATGCTGCTCAACGCCTGTTCTTGAGCATCAAGAACTCTGGTGTAGAACCTGATGAAACGACTTACCGCTCCATGATTGAAGGTTGGGGACGAGCTGGTAATTACAAAATGGCAGAATGGTACTTTAAGGAACTCAAGCGAAAAGGATATATGCCGAATGCCTCTAACTTGTTCACCCTCATGAATCTTCAAGCCAAACATGAGGATGACGCAGGTGCACTTAAAACTCTTAATGATATGCTGAAGATTGGATGCCGGCTTTCTTCCATTGTTGGAAATGTTTTACAAGCTTATGAAAAGGCTAGAAGAATAAAAAGTGTGCCTCTCCTCTTGACAGGATCGTTCTATCGGAAAGTTCTGGCCAGCCAGACATCTTGCTCGATTCTGGTAATGGCTTATGTGAAGCACGGTTTAGTGGATGATGCTTTGAAAGTGTTGAGGGAAAAAGAGTGGAATGATCTTCGTTTTGAGGAGAATTTATATCATTTGCTAATTTGTTCATGTAAAGAGTTAGACCATCTCGAGAACGCAATCAAGATATACACTCAACTACCCAAACGTAAAAACAAACCGAACTTGCATATCACGAGCACAATGATTGATATCTACAGCATCATGGGTAGGTTCTCTGACGGGGAGAAACTTTATCTAAGCCTGAAATCTTCAGGCATTCGTTTGGATTTGATTGCCTTCAGTGTTGTTGTGAGAATGTATGTCAAAGCTGGATCATTGGAAGATGCATGCTCAGTTCTTGACTTCATGGATAAACAGCAGGACATTGTTCCAGACATATATCTGTTCCGGGACATGCTGCGTATTTATCAACGTTGTGGCATGGTGGATAAGCTACAAGATGTGTACTATAGGATACTGAATAGTGACGTCTCTTGGGATCAGGAAATGTATAATTGTGTCATAAATTGCTGTTCCCGTGCTCTGCTTGTTGATGAGCTTTCCAGCCTTTTTGATGAAATGCTTCAACGTGGGTTTGCTCCAAATACCGTGACCTTGAATGTCATGCTTGACGTTTATGGGAAGTCCAAGCTTTTTTCCAAGGCCAGAAAACTGTTATTGCTGGCTCAGAAAAAAGGTTTGGTTGATGTAATCTCTTATAATACTATGATATCTGCGTTTGGAAAGAGCAAGGACTTCGCAAACATGTCGTCCACAGTTAGAACAATGGAATTTAATGGCTTTTCGCTTTCCCTTGAAGCATACAATTCTCTGTTGGATGCTTATGGCAAAGAAGGCCGAATGGATAATTTCAGACAAGTCTTACAGCAATTAAAGGACTCGAATTCTGAACGTGACCAATACACTTATAACATCATGATCAACATCTATGGAAAACAAGGATGGATTGACGATGTCGAGGAAGTGCTGACAGAACTGAAAGCATGTGGACTCGAACCCGATCTGTATAGCTACAACGCATTGATCAAGGCATATGGAATAGCAGGGATGGTTGAAGAAGCCGCTCAGTTGGTGAAAGAAATGAGAGAAAAGAGGATAGAACCGGATAAGGTGTTTGCCCTCCCTCGACGTAGCCTCCGGATATATCCGATCTTCCTGTCCCAACTCTTGCTCAAAACATATACTGGTTGGGATTTGGCAGACAAAATTGCCTTCAAAGCCGCAAAAAACTGCTCTCTCCGCCGTTCACACTTGTTTCTAAGGTTATATATAATCAATAAAGGTACAGTACAATTTATATGA

Protein sequence

MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKVEPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELDVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKAYGIAGMVEEAAQLVKEMREKRIEPDKVFALPRRSLRIYPIFLSQLLLKTYTGWDLADKIAFKAAKNCSLRRSHLFLRLYIINKGTVQFI
Homology
BLAST of Cp4.1LG18g05820 vs. ExPASy Swiss-Prot
Match: O65567 (Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g30825 PE=2 SV=2)

HSP 1 Score: 1018.5 bits (2632), Expect = 4.9e-296
Identity = 523/874 (59.84%), Postives = 649/874 (74.26%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHS--KKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPS 60
           M SL+ S  LD F S  K+F F  N S   D   +  +T  IH  ++  + S  R     
Sbjct: 1   MGSLRFSIPLDPFDSKRKRFHFSANPSQFPDQFPIHFVTSSIHATRASSIGSSTRVLDKI 60

Query: 61  KV-----EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIV 120
           +V     E   +    +  A  E     K  G ++ +K+     FSF +  ++   +++ 
Sbjct: 61  RVSSLGTEANENAINSASAAPVERSRSSKLSGDQRGTKKYVARKFSFRRGSNDLELENLF 120

Query: 121 FHGGELDVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLI 180
            + GE+DVNYS I    SLE CN ILKRLE C+D  A+ FF+WMR N KL  N  AY+LI
Sbjct: 121 VNNGEIDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLI 180

Query: 181 LRVLGRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQ 240
           LRVLGR+++WD A+ LI+E+      Q  +QVFNT+IYAC K G V+  +KWF MMLE+ 
Sbjct: 181 LRVLGRREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFG 240

Query: 241 VLPNVATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEE 300
           V PNVAT GMLMGLYQK+ N++EAEFAF+ MR FGIVCE+AY+SMITIYTRL LYDKAEE
Sbjct: 241 VRPNVATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTRLRLYDKAEE 300

Query: 301 VIRLMQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYG 360
           VI LM++D+V   +ENWLVMLNAY QQGKME AE +  SME  GFS NIIAYNTLITGYG
Sbjct: 301 VIDLMKQDRVRLKLENWLVMLNAYSQQGKMELAESILVSMEAAGFSPNIIAYNTLITGYG 360

Query: 361 KASNMDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNAS 420
           K   M+AAQ LF  + N G+EPDET+YRSMIEGWGRA NY+ A+ Y++ELKR GY PN+ 
Sbjct: 361 KIFKMEAAQGLFHRLCNIGLEPDETSYRSMIEGWGRADNYEEAKHYYQELKRCGYKPNSF 420

Query: 421 NLFTLMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSF 480
           NLFTL+NLQAK+ D  GA+KT+ DM  IGC+ SSI+G +LQAYEK  +I  VP +L GSF
Sbjct: 421 NLFTLINLQAKYGDRDGAIKTIEDMTGIGCQYSSILGIILQAYEKVGKIDVVPCVLKGSF 480

Query: 481 YRKVLASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLE 540
           +  +  +QTS S LVMAYVKHG+VDD L +LREK+W D  FE +LYHLLICSCKE   L 
Sbjct: 481 HNHIRLNQTSFSSLVMAYVKHGMVDDCLGLLREKKWRDSAFESHLYHLLICSCKESGQLT 540

Query: 541 NAIKIYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVV 600
           +A+KIY    +   + NLHITSTMIDIY++MG FS+ EKLYL+LKSSG+ LD I FS+VV
Sbjct: 541 DAVKIYNHKMESDEEINLHITSTMIDIYTVMGEFSEAEKLYLNLKSSGVVLDRIGFSIVV 600

Query: 601 RMYVKAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDV 660
           RMYVKAGSLE+ACSVL+ MD+Q+DIVPD+YLFRDMLRIYQ+C + DKLQ +YYRI  S +
Sbjct: 601 RMYVKAGSLEEACSVLEIMDEQKDIVPDVYLFRDMLRIYQKCDLQDKLQHLYYRIRKSGI 660

Query: 661 SWDQEMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARK 720
            W+QEMYNCVINCC+RAL +DELS  F+EM++ GF PNTVT NV+LDVYGK+KLF K  +
Sbjct: 661 HWNQEMYNCVINCCARALPLDELSGTFEEMIRYGFTPNTVTFNVLLDVYGKAKLFKKVNE 720

Query: 721 LLLLAQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGK 780
           L LLA++ G+VDVISYNT+I+A+GK+KD+ NMSS ++ M+F+GFS+SLEAYN+LLDAYGK
Sbjct: 721 LFLLAKRHGVVDVISYNTIIAAYGKNKDYTNMSSAIKNMQFDGFSVSLEAYNTLLDAYGK 780

Query: 781 EGRMDNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYS 840
           + +M+ FR +L+++K S S  D YTYNIMINIYG+QGWID+V +VL ELK  GL PDL S
Sbjct: 781 DKQMEKFRSILKRMKKSTSGPDHYTYNIMINIYGEQGWIDEVADVLKELKESGLGPDLCS 840

Query: 841 YNALIKAYGIAGMVEEAAQLVKEMREKRIEPDKV 868
           YN LIKAYGI GMVEEA  LVKEMR + I PDKV
Sbjct: 841 YNTLIKAYGIGGMVEEAVGLVKEMRGRNIIPDKV 874

BLAST of Cp4.1LG18g05820 vs. ExPASy Swiss-Prot
Match: Q9SZ52 (Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PGR3 PE=1 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 8.2e-49
Identity = 168/678 (24.78%), Postives = 314/678 (46.31%), Query Frame = 0

Query: 202  DFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVATFGMLMGLYQKSCNLKEAEFAF 261
            +   +NTLI    +   ++   + F  M    V P   T+ + +  Y KS +   A   F
Sbjct: 397  NLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTYIVFIDYYGKSGDSVSALETF 456

Query: 262  NQMRNFGIVCETAYASMITIYT--RLSLYDKAEEVIRLMQEDKVIPNVENWLVMLNAYCQ 321
             +M+  GI      A   ++Y+  +     +A+++   +++  ++P+   + +M+  Y +
Sbjct: 457  EKMKTKGI-APNIVACNASLYSLAKAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYSK 516

Query: 322  QGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDAAQRLFLSIKNSGVEPDETT 381
             G++++A  + + M E+G   ++I  N+LI    KA  +D A ++F+ +K   ++P   T
Sbjct: 517  VGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVVT 576

Query: 382  YRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMNLQAKHEDDAGALKTLNDML 441
            Y +++ G G+ G  + A   F+ + +KG  PN     TL +   K+++   ALK L  M+
Sbjct: 577  YNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKMM 636

Query: 442  KIGCRLSSIVGN-VLQAYEKARRIKSVPLLLTGSFYRKVLASQTSCSILVMAYVKHGLVD 501
             +GC       N ++    K  ++K   +       + V     +   L+   VK  L++
Sbjct: 637  DMGCVPDVFTYNTIIFGLVKNGQVKEA-MCFFHQMKKLVYPDFVTLCTLLPGVVKASLIE 696

Query: 502  DALKVLREKEWNDLRFEENLY-HLLICSCKELDHLENAIKIYTQLPKRKNKPNLHITSTM 561
            DA K++    +N      NL+   LI S      ++NA+    +L       +       
Sbjct: 697  DAYKIITNFLYNCADQPANLFWEDLIGSILAEAGIDNAVSFSERLVANGICRDGDSILVP 756

Query: 562  IDIYSIMGRFSDGEKLYLS--LKSSGIRLDLIAFSVVVRMYVKAGSLEDACSVLDFMDKQ 621
            I  YS       G +       K  G++  L  +++++   ++A  +E A  V     K 
Sbjct: 757  IIRYSCKHNNVSGARTLFEKFTKDLGVQPKLPTYNLLIGGLLEADMIEIAQDVF-LQVKS 816

Query: 622  QDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMYNCVINCCSRALLVDE 681
               +PD+  +  +L  Y + G +D+L ++Y  +   +   +   +N VI+   +A  VD+
Sbjct: 817  TGCIPDVATYNFLLDAYGKSGKIDELFELYKEMSTHECEANTITHNIVISGLVKAGNVDD 876

Query: 682  -LSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQKKGLVD------VIS 741
             L   +D M  R F+P   T   ++D   KS    +A++L      +G++D         
Sbjct: 877  ALDLYYDLMSDRDFSPTACTYGPLIDGLSKSGRLYEAKQLF-----EGMLDYGCRPNCAI 936

Query: 742  YNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNFRQVLQQLK 801
            YN +I+ FGK+ +     +  + M   G    L+ Y+ L+D     GR+D      ++LK
Sbjct: 937  YNILINGFGKAGEADAACALFKRMVKEGVRPDLKTYSVLVDCLCMVGRVDEGLHYFKELK 996

Query: 802  DSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKAC-GLEPDLYSYNALIKAYGIAGMV 861
            +S    D   YN++IN  GK   +++   +  E+K   G+ PDLY+YN+LI   GIAGMV
Sbjct: 997  ESGLNPDVVCYNLIINGLGKSHRLEEALVLFNEMKTSRGITPDLYTYNSLILNLGIAGMV 1056

Query: 862  EEAAQLVKEMREKRIEPD 866
            EEA ++  E++   +EP+
Sbjct: 1057 EEAGKIYNEIQRAGLEPN 1066

BLAST of Cp4.1LG18g05820 vs. ExPASy Swiss-Prot
Match: Q9LYZ9 (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX=3702 GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 2.7e-44
Identity = 137/627 (21.85%), Postives = 274/627 (43.70%), Query Frame = 0

Query: 245 MGLYQK-SCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQEDKV 304
           +G ++K    L+  ++   Q     ++  +  A +I++  +      A  +   +QED  
Sbjct: 145 LGFHKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGF 204

Query: 305 IPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGK-ASNMDAAQ 364
             +V ++  +++A+   G+  +A  VF  MEE G    +I YN ++  +GK  +  +   
Sbjct: 205 SLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKIT 264

Query: 365 RLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMNLQ 424
            L   +K+ G+ PD  TY ++I    R   ++ A   F+E+K  G+  +      L+++ 
Sbjct: 265 SLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVY 324

Query: 425 AKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLASQT 484
            K      A+K LN+M+  G   S +  N                               
Sbjct: 325 GKSHRPKEAMKVLNEMVLNGFSPSIVTYN------------------------------- 384

Query: 485 SCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYTQL 544
               L+ AY + G++D+A+++  +      + +   Y  L+   +    +E+A+ I+ ++
Sbjct: 385 ---SLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEM 444

Query: 545 PKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAGSL 604
                KPN+   +  I +Y   G+F++  K++  +   G+  D++ ++ ++ ++ + G  
Sbjct: 445 RNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMD 504

Query: 605 EDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMYNC 664
            +   V   M K+   VP+   F  ++  Y RCG  ++   VY R+L++ V+ D   YN 
Sbjct: 505 SEVSGVFKEM-KRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNT 564

Query: 665 VINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQKKG 724
           V+   +R  + ++   +  EM      PN +T   +L  Y   K       L       G
Sbjct: 565 VLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVY-SG 624

Query: 725 LVD--VISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 784
           +++   +   T++    K             ++  GFS  +   NS++  YG+   +   
Sbjct: 625 VIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLNSMVSIYGRRQMVAKA 684

Query: 785 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 844
             VL  +K+        TYN ++ ++ +       EE+L E+ A G++PD+ SYN +I A
Sbjct: 685 NGVLDYMKERGFTPSMATYNSLMYMHSRSADFGKSEEILREILAKGIKPDIISYNTVIYA 735

Query: 845 YGIAGMVEEAAQLVKEMREKRIEPDKV 868
           Y     + +A+++  EMR   I PD +
Sbjct: 745 YCRNTRMRDASRIFSEMRNSGIVPDVI 735

BLAST of Cp4.1LG18g05820 vs. ExPASy Swiss-Prot
Match: B8Y6I0 (Pentatricopeptide repeat-containing protein 10, chloroplastic OS=Zea mays OX=4577 GN=PPR10 PE=1 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 3.9e-43
Identity = 142/716 (19.83%), Postives = 308/716 (43.02%), Query Frame = 0

Query: 129 DLSLEDCNAILKRLEKCNDRK-ALGFFEWMRINRKLEHNVSAYNLILRVLGRQQDWDAAD 188
           +L   D  ++LK LE     + AL    W    ++   + SA  +++R LGR+   DA  
Sbjct: 101 ELLRADITSLLKALELSGHWEWALALLRW--AGKEGAADASALEMVVRALGREGQHDAVC 160

Query: 189 KLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVATFGMLMGL 248
            L+ E       +LD + + T+++A  ++G  E+  + F  +    V P + T+ +++ +
Sbjct: 161 ALLDETPLPPGSRLDVRAYTTVLHALSRAGRYERALELFAELRRQGVAPTLVTYNVVLDV 220

Query: 249 Y-QKSCNLKEAEFAFNQMRNFGIVCETAYAS-MITIYTRLSLYDKAEEVIRLMQEDKVIP 308
           Y +   +        ++MR  G+  +   AS +I    R  L D+A      ++     P
Sbjct: 221 YGRMGRSWPRIVALLDEMRAAGVEPDGFTASTVIAACCRDGLVDEAVAFFEDLKARGHAP 280

Query: 309 NVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDAAQRLF 368
            V  +  +L  + + G   +A  V   ME++G   + + YN L   Y +A   + A R  
Sbjct: 281 CVVTYNALLQVFGKAGNYTEALRVLGEMEQNGCQPDAVTYNELAGTYARAGFFEEAARCL 340

Query: 369 LSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMNLQAKH 428
            ++ + G+ P+  TY +++  +G  G    A   F ++K+ G++PN +    ++ +  K 
Sbjct: 341 DTMASKGLLPNAFTYNTVMTAYGNVGKVDEALALFDQMKKTGFVPNVNTYNLVLGMLGKK 400

Query: 429 EDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLASQTSCS 488
                 L+ L +M + GC  + +  N + A                            C 
Sbjct: 401 SRFTVMLEMLGEMSRSGCTPNRVTWNTMLAV---------------------------CG 460

Query: 489 ILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYTQLPKR 548
                  K G+ D   +VL       +    + Y+ LI +        NA K+Y ++   
Sbjct: 461 -------KRGMEDYVTRVLEGMRSCGVELSRDTYNTLIAAYGRCGSRTNAFKMYNEMTSA 520

Query: 549 KNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAGSLEDA 608
              P +   + ++++ S  G +S  + +   +++ G + +  ++S++++ Y K G++   
Sbjct: 521 GFTPCITTYNALLNVLSRQGDWSTAQSIVSKMRTKGFKPNEQSYSLLLQCYAKGGNVAGI 580

Query: 609 CSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMYNCVIN 668
            ++ + +     + P   + R ++    +C  +D ++  +  +     + D  ++N +++
Sbjct: 581 AAIENEVYGSGAVFPSWVILRTLVIANFKCRRLDGMETAFQEVKARGYNPDLVIFNSMLS 640

Query: 669 CCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLL--LLAQKKGL 728
             ++  +  + + +FD + + G +P+ +T N ++D+Y K     +A K+L  L   +   
Sbjct: 641 IYAKNGMYSKATEVFDSIKRSGLSPDLITYNSLMDMYAKCSESWEAEKILNQLKCSQTMK 700

Query: 729 VDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNFRQV 788
            DV+SYNT+I+ F K          +  M  +G +     Y++L+  Y         R+V
Sbjct: 701 PDVVSYNTVINGFCKQGLVKEAQRVLSEMVADGMAPCAVTYHTLVGGYSSLEMFSEAREV 760

Query: 789 LQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIK 840
           +  +     +  + TY  ++  Y +    ++    L+E+    L+ D  +  A I+
Sbjct: 761 IGYMVQHGLKPMELTYRRVVESYCRAKRFEEARGFLSEVSETDLDFDKKALEAYIE 780

BLAST of Cp4.1LG18g05820 vs. ExPASy Swiss-Prot
Match: Q9LER0 (Pentatricopeptide repeat-containing protein At5g14770, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g14770 PE=3 SV=2)

HSP 1 Score: 169.5 bits (428), Expect = 1.8e-40
Identity = 160/708 (22.60%), Postives = 305/708 (43.08%), Query Frame = 0

Query: 161 RKLEHNVSAYNLILR--VLGRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGL 220
           + L   +S  NLI    +L    +  A ++  R++     D  D   F+++I    K G 
Sbjct: 218 KALVDEISELNLITHTILLSSYYNLHAIEEAYRDMVMSGFDP-DVVTFSSIINRLCKGGK 277

Query: 221 VEQGAKWFQMMLEWQVLPNVATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCE-TAYAS 280
           V +G    + M E  V PN  T+  L+    K+   + A   ++QM   GI  +   Y  
Sbjct: 278 VLEGGLLLREMEEMSVYPNHVTYTTLVDSLFKANIYRHALALYSQMVVRGIPVDLVVYTV 337

Query: 281 MITIYTRLSLYDKAEEVIRLMQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHG 340
           ++    +     +AE+  +++ ED  +PNV  +  +++  C+ G +  AE +   M E  
Sbjct: 338 LMDGLFKAGDLREAEKTFKMLLEDNQVPNVVTYTALVDGLCKAGDLSSAEFIITQMLEKS 397

Query: 341 FSSNIIAYNTLITGYGKASNMDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAE 400
              N++ Y+++I GY K   ++ A  L   +++  V P+  TY ++I+G  +AG  +MA 
Sbjct: 398 VIPNVVTYSSMINGYVKKGMLEEAVSLLRKMEDQNVVPNGFTYGTVIDGLFKAGKEEMAI 457

Query: 401 WYFKELKRKGYMPNASNLFTLMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYE 460
              KE++  G   N   L  L                +N + +IG               
Sbjct: 458 ELSKEMRLIGVEENNYILDAL----------------VNHLKRIG--------------- 517

Query: 461 KARRIKSVPLLLTGSFYRKVLASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEEN 520
              RIK V  L+     + V   Q + + L+  + K G  + AL    E +   + ++  
Sbjct: 518 ---RIKEVKGLVKDMVSKGVTLDQINYTSLIDVFFKGGDEEAALAWAEEMQERGMPWDVV 577

Query: 521 LYHLLICSCKELDHLENAIKIYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSL 580
            Y++LI    +   +  A   Y  + ++  +P++   + M++     G      KL+  +
Sbjct: 578 SYNVLISGMLKFGKV-GADWAYKGMREKGIEPDIATFNIMMNSQRKQGDSEGILKLWDKM 637

Query: 581 KSSGIRLDLIAFSVVVRMYVKAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGM 640
           KS GI+  L++ ++VV M  + G +E+A  +L+ M    +I P++  +R  L    +   
Sbjct: 638 KSCGIKPSLMSCNIVVGMLCENGKMEEAIHILNQM-MLMEIHPNLTTYRIFLDTSSKHKR 697

Query: 641 VDKLQDVYYRILNSDVSWDQEMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNV 700
            D +   +  +L+  +   +++YN +I    +  +  + + +  +M  RGF P+TVT N 
Sbjct: 698 ADAIFKTHETLLSYGIKLSRQVYNTLIATLCKLGMTKKAAMVMGDMEARGFIPDTVTFNS 757

Query: 701 MLDVYGKSKLFSKARKLLLLAQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGF 760
           ++  Y       KA              + +Y+ M+ A                    G 
Sbjct: 758 LMHGYFVGSHVRKA--------------LSTYSVMMEA--------------------GI 817

Query: 761 SLSLEAYNSLLDAYGKEGRMDNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEE 820
           S ++  YN+++      G +    + L ++K      D +TYN +I+   K G +     
Sbjct: 818 SPNVATYNTIIRGLSDAGLIKEVDKWLSEMKSRGMRPDDFTYNALISGQAKIGNMKGSMT 854

Query: 821 VLTELKACGLEPDLYSYNALIKAYGIAGMVEEAAQLVKEMREKRIEPD 866
           +  E+ A GL P   +YN LI  +   G + +A +L+KEM ++ + P+
Sbjct: 878 IYCEMIADGLVPKTSTYNVLISEFANVGKMLQARELLKEMGKRGVSPN 854

BLAST of Cp4.1LG18g05820 vs. NCBI nr
Match: KAG7023427.1 (Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1813 bits (4697), Expect = 0.0
Identity = 915/929 (98.49%), Postives = 919/929 (98.92%), Query Frame = 0

Query: 1    MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
            MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSIT YIHLNKSCVLYSLVRAHK SKV
Sbjct: 530  MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITDYIHLNKSCVLYSLVRAHKSSKV 589

Query: 61   EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120
            EP TSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSI+FHGGELD
Sbjct: 590  EPGTSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIIFHGGELD 649

Query: 121  VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180
            VNYSTISSDLSLEDCNAILKRLEKCNDRKAL FFEWMRINRKLEHNVSAYNLILRVL RQ
Sbjct: 650  VNYSTISSDLSLEDCNAILKRLEKCNDRKALDFFEWMRINRKLEHNVSAYNLILRVLSRQ 709

Query: 181  QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240
            QDWDAA+KLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT
Sbjct: 710  QDWDAAEKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 769

Query: 241  FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300
            FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCET YASMITIYTRLSLYDKAEEVIRLMQE
Sbjct: 770  FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETGYASMITIYTRLSLYDKAEEVIRLMQE 829

Query: 301  DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360
            DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA
Sbjct: 830  DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 889

Query: 361  AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420
            AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN
Sbjct: 890  AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 949

Query: 421  LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480
            LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFY KVLAS
Sbjct: 950  LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYLKVLAS 1009

Query: 481  QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540
            QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT
Sbjct: 1010 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 1069

Query: 541  QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600
            QLPKR+NKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG
Sbjct: 1070 QLPKRENKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 1129

Query: 601  SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660
            SLEDACSVLD MDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY
Sbjct: 1130 SLEDACSVLDLMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 1189

Query: 661  NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720
            NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSK FSKARKLLLLAQK
Sbjct: 1190 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKHFSKARKLLLLAQK 1249

Query: 721  KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780
            KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMD+F
Sbjct: 1250 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDDF 1309

Query: 781  RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840
            RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA
Sbjct: 1310 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 1369

Query: 841  YGIAGMVEEAAQLVKEMREKRIEPDKVFALPRRSLRIYPIFLSQLLLKTYTGWDLADKIA 900
            YGIAGMVEEAAQLVKEMREKRIEPDKVFALPR SLRIYPIFLSQLLLKTYTGWDLADKIA
Sbjct: 1370 YGIAGMVEEAAQLVKEMREKRIEPDKVFALPRCSLRIYPIFLSQLLLKTYTGWDLADKIA 1429

Query: 901  FKAAKNCSLRRSHLFLRLYIINKGTVQFI 929
            FKAAKNCSLRRSHLFLRLYIINKGTVQFI
Sbjct: 1430 FKAAKNCSLRRSHLFLRLYIINKGTVQFI 1458

BLAST of Cp4.1LG18g05820 vs. NCBI nr
Match: KAG6589757.1 (putative leucine-rich repeat receptor-like serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1791 bits (4639), Expect = 0.0
Identity = 903/917 (98.47%), Postives = 907/917 (98.91%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSIT YIHLNKSCVLYSLVRAHK SKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITDYIHLNKSCVLYSLVRAHKSSKV 60

Query: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120
           EP TSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSI+FHGGELD
Sbjct: 61  EPGTSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIIFHGGELD 120

Query: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180
           VNYSTISSDLSLEDCNAILKRLEKCNDRKAL FFEWMRINRKLEHNVSAYNLILRVL RQ
Sbjct: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALDFFEWMRINRKLEHNVSAYNLILRVLSRQ 180

Query: 181 QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240
           QDWDAA+KLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT
Sbjct: 181 QDWDAAEKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240

Query: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300
           FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCET YASMITIYTRLSLYDKAEEVIRLMQE
Sbjct: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETGYASMITIYTRLSLYDKAEEVIRLMQE 300

Query: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360
           DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA
Sbjct: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360

Query: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420
           AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN
Sbjct: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420

Query: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480
           LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFY KVLAS
Sbjct: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYLKVLAS 480

Query: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540
           QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT
Sbjct: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540

Query: 541 QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600
           QLPKR+NKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG
Sbjct: 541 QLPKRENKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600

Query: 601 SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660
           SLEDACSVLD MDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY
Sbjct: 601 SLEDACSVLDLMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660

Query: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720
           NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSK FSKARKLLLLAQK
Sbjct: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKHFSKARKLLLLAQK 720

Query: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780
           KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMD+F
Sbjct: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDDF 780

Query: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840
           RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA
Sbjct: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840

Query: 841 YGIAGMVEEAAQLVKEMREKRIEPDKVFALPRRSLRIYPIFLSQLLLKTYTGWDLADKIA 900
           YGIAGMVEEAAQLVKEMREKRIEPDKVFALPRRS RIYPIFLSQLLLKTYTGWDLADKIA
Sbjct: 841 YGIAGMVEEAAQLVKEMREKRIEPDKVFALPRRSPRIYPIFLSQLLLKTYTGWDLADKIA 900

Query: 901 FKAAKNCSLRRSHLFLR 917
           FKAAKNCSLRRSHLFLR
Sbjct: 901 FKAAKNCSLRRSHLFLR 917

BLAST of Cp4.1LG18g05820 vs. NCBI nr
Match: XP_023516176.1 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1723 bits (4463), Expect = 0.0
Identity = 867/867 (100.00%), Postives = 867/867 (100.00%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60

Query: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120
           EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD
Sbjct: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120

Query: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180
           VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ
Sbjct: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180

Query: 181 QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240
           QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT
Sbjct: 181 QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240

Query: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300
           FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE
Sbjct: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300

Query: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360
           DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA
Sbjct: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360

Query: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420
           AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN
Sbjct: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420

Query: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480
           LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS
Sbjct: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480

Query: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540
           QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT
Sbjct: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540

Query: 541 QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600
           QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG
Sbjct: 541 QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600

Query: 601 SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660
           SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY
Sbjct: 601 SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660

Query: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720
           NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK
Sbjct: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720

Query: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780
           KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF
Sbjct: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780

Query: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840
           RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA
Sbjct: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840

Query: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867
           YGIAGMVEEAAQLVKEMREKRIEPDKV
Sbjct: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867

BLAST of Cp4.1LG18g05820 vs. NCBI nr
Match: XP_022922044.1 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1699 bits (4401), Expect = 0.0
Identity = 856/867 (98.73%), Postives = 861/867 (99.31%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKS VLYSLVRAHKPSKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSFVLYSLVRAHKPSKV 60

Query: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120
           EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD
Sbjct: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120

Query: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180
           VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ
Sbjct: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180

Query: 181 QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240
           QDWDAA+KLIREVRAE SDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT
Sbjct: 181 QDWDAAEKLIREVRAESSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240

Query: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300
           FGMLMGLYQKSC+LKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE
Sbjct: 241 FGMLMGLYQKSCSLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300

Query: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360
           DKV PNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA
Sbjct: 301 DKVTPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360

Query: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420
           AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNY+MAEWYFKELKRKGYMPNASNLFTLMN
Sbjct: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYQMAEWYFKELKRKGYMPNASNLFTLMN 420

Query: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480
           LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS
Sbjct: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480

Query: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540
           QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT
Sbjct: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540

Query: 541 QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600
           QLPK +NKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG
Sbjct: 541 QLPKHENKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600

Query: 601 SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660
           SLEDACSVLD MDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY
Sbjct: 601 SLEDACSVLDLMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660

Query: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720
           NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSK FSKARKLLLLAQK
Sbjct: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKHFSKARKLLLLAQK 720

Query: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780
           KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMD+F
Sbjct: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDDF 780

Query: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840
           RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA
Sbjct: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840

Query: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867
           YGIAGMVEEAAQLVKEMREKRIEPDKV
Sbjct: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867

BLAST of Cp4.1LG18g05820 vs. NCBI nr
Match: XP_022988547.1 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1688 bits (4371), Expect = 0.0
Identity = 849/867 (97.92%), Postives = 858/867 (98.96%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60

Query: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120
           EPETSG YESKCAVDEIDTRKKYFGGKKPSKRAPGSYF+FSK+ SEKVFDSI+FHGGELD
Sbjct: 61  EPETSGVYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFNFSKDWSEKVFDSIIFHGGELD 120

Query: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180
           VNYSTISSDLSLEDCNAILKRLEKCNDRKALGF+EWMRINRKLEHNVSAYNLILRV GRQ
Sbjct: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFYEWMRINRKLEHNVSAYNLILRVFGRQ 180

Query: 181 QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240
           QDWDAA+KLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT
Sbjct: 181 QDWDAAEKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240

Query: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300
           FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE
Sbjct: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300

Query: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360
           DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEE GFSSNIIAYNTLITGYGKASNMDA
Sbjct: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEERGFSSNIIAYNTLITGYGKASNMDA 360

Query: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420
           AQRLF SIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN
Sbjct: 361 AQRLFSSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420

Query: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480
           LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS
Sbjct: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480

Query: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540
           QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT
Sbjct: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540

Query: 541 QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600
           QLPKR+NKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG
Sbjct: 541 QLPKRENKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600

Query: 601 SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660
           SLEDACSVLD MDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQD+YYRILNSDVSWDQEMY
Sbjct: 601 SLEDACSVLDLMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDLYYRILNSDVSWDQEMY 660

Query: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720
           NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSK FSKARKLL LAQK
Sbjct: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKHFSKARKLLFLAQK 720

Query: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780
           KGLVDVISYNTMISA+GKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF
Sbjct: 721 KGLVDVISYNTMISAYGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780

Query: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840
           RQVLQ LKDSNSERD+YTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA
Sbjct: 781 RQVLQLLKDSNSERDRYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840

Query: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867
           YGIAGMVEEAAQLVKEMREKRIEPDKV
Sbjct: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867

BLAST of Cp4.1LG18g05820 vs. ExPASy TrEMBL
Match: A0A6J1E280 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111430111 PE=4 SV=1)

HSP 1 Score: 1699 bits (4401), Expect = 0.0
Identity = 856/867 (98.73%), Postives = 861/867 (99.31%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKS VLYSLVRAHKPSKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSFVLYSLVRAHKPSKV 60

Query: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120
           EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD
Sbjct: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120

Query: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180
           VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ
Sbjct: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180

Query: 181 QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240
           QDWDAA+KLIREVRAE SDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT
Sbjct: 181 QDWDAAEKLIREVRAESSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240

Query: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300
           FGMLMGLYQKSC+LKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE
Sbjct: 241 FGMLMGLYQKSCSLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300

Query: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360
           DKV PNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA
Sbjct: 301 DKVTPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360

Query: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420
           AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNY+MAEWYFKELKRKGYMPNASNLFTLMN
Sbjct: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYQMAEWYFKELKRKGYMPNASNLFTLMN 420

Query: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480
           LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS
Sbjct: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480

Query: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540
           QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT
Sbjct: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540

Query: 541 QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600
           QLPK +NKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG
Sbjct: 541 QLPKHENKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600

Query: 601 SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660
           SLEDACSVLD MDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY
Sbjct: 601 SLEDACSVLDLMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660

Query: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720
           NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSK FSKARKLLLLAQK
Sbjct: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKHFSKARKLLLLAQK 720

Query: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780
           KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMD+F
Sbjct: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDDF 780

Query: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840
           RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA
Sbjct: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840

Query: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867
           YGIAGMVEEAAQLVKEMREKRIEPDKV
Sbjct: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867

BLAST of Cp4.1LG18g05820 vs. ExPASy TrEMBL
Match: A0A6J1JDB7 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111485755 PE=4 SV=1)

HSP 1 Score: 1688 bits (4371), Expect = 0.0
Identity = 849/867 (97.92%), Postives = 858/867 (98.96%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60

Query: 61  EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGELD 120
           EPETSG YESKCAVDEIDTRKKYFGGKKPSKRAPGSYF+FSK+ SEKVFDSI+FHGGELD
Sbjct: 61  EPETSGVYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFNFSKDWSEKVFDSIIFHGGELD 120

Query: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGRQ 180
           VNYSTISSDLSLEDCNAILKRLEKCNDRKALGF+EWMRINRKLEHNVSAYNLILRV GRQ
Sbjct: 121 VNYSTISSDLSLEDCNAILKRLEKCNDRKALGFYEWMRINRKLEHNVSAYNLILRVFGRQ 180

Query: 181 QDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240
           QDWDAA+KLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT
Sbjct: 181 QDWDAAEKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVAT 240

Query: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300
           FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE
Sbjct: 241 FGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQE 300

Query: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDA 360
           DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEE GFSSNIIAYNTLITGYGKASNMDA
Sbjct: 301 DKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEERGFSSNIIAYNTLITGYGKASNMDA 360

Query: 361 AQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420
           AQRLF SIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN
Sbjct: 361 AQRLFSSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMN 420

Query: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480
           LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS
Sbjct: 421 LQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLAS 480

Query: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540
           QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT
Sbjct: 481 QTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYT 540

Query: 541 QLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600
           QLPKR+NKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG
Sbjct: 541 QLPKRENKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAG 600

Query: 601 SLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMY 660
           SLEDACSVLD MDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQD+YYRILNSDVSWDQEMY
Sbjct: 601 SLEDACSVLDLMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDLYYRILNSDVSWDQEMY 660

Query: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQK 720
           NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSK FSKARKLL LAQK
Sbjct: 661 NCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKHFSKARKLLFLAQK 720

Query: 721 KGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780
           KGLVDVISYNTMISA+GKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF
Sbjct: 721 KGLVDVISYNTMISAYGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 780

Query: 781 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840
           RQVLQ LKDSNSERD+YTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA
Sbjct: 781 RQVLQLLKDSNSERDRYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 840

Query: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867
           YGIAGMVEEAAQLVKEMREKRIEPDKV
Sbjct: 841 YGIAGMVEEAAQLVKEMREKRIEPDKV 867

BLAST of Cp4.1LG18g05820 vs. ExPASy TrEMBL
Match: A0A6J1BZD7 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007115 PE=3 SV=1)

HSP 1 Score: 1464 bits (3790), Expect = 0.0
Identity = 729/870 (83.79%), Postives = 797/870 (91.61%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLK+SF LDSF SKKFDFPV S+LLSD CSVFSITGYIHLNKSC+LYSL R HKPSKV
Sbjct: 1   MASLKISFPLDSFDSKKFDFPVKSALLSDICSVFSITGYIHLNKSCILYSLARVHKPSKV 60

Query: 61  ---EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGG 120
              EPE S  Y+SK   DEI  RKKY G KKPSKRAPGSYFSFS+NCSEKVFD+I+F+GG
Sbjct: 61  SQVEPEASDIYQSKFVDDEIGARKKYVGNKKPSKRAPGSYFSFSRNCSEKVFDNIIFNGG 120

Query: 121 ELDVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVL 180
           E+DVNYSTISSDLSLEDCNAIL++LEKCND K L FFEWMR N KLEHNV+AYNL+LRVL
Sbjct: 121 EMDVNYSTISSDLSLEDCNAILRKLEKCNDGKTLVFFEWMRRNGKLEHNVTAYNLVLRVL 180

Query: 181 GRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPN 240
           GRQ+DWDAA+KLIR+VRA+L  QLDFQ+FNTLIYACYKSGLV++GAKWF+MMLE +V PN
Sbjct: 181 GRQEDWDAAEKLIRQVRADLGSQLDFQIFNTLIYACYKSGLVDRGAKWFRMMLECRVQPN 240

Query: 241 VATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRL 300
           VATFGMLMGL QK CN++EAEFAF+QMR+FGIVCE  YASMITIY RLSLYDKAEEVI+L
Sbjct: 241 VATFGMLMGLCQKGCNVEEAEFAFSQMRSFGIVCEAMYASMITIYARLSLYDKAEEVIQL 300

Query: 301 MQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASN 360
           MQEDKV PN+ENWLVMLN YCQQGK+EDAELVFASMEE GFSSNIIAYNTLITGYGK SN
Sbjct: 301 MQEDKVTPNLENWLVMLNTYCQQGKLEDAELVFASMEEAGFSSNIIAYNTLITGYGKVSN 360

Query: 361 MDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFT 420
           MDAA+RLFL IKNSG EPDETTYRSMIEGWGRAGNY+MAEWY+KELKRKGYMPN SNLFT
Sbjct: 361 MDAAERLFLGIKNSGAEPDETTYRSMIEGWGRAGNYEMAEWYYKELKRKGYMPNTSNLFT 420

Query: 421 LMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKV 480
           L+NLQAKHED+AGAL+TL+DMLKIGCR SSIVGNVLQAYEKARRIKSVPLLLTGSFY KV
Sbjct: 421 LINLQAKHEDEAGALETLDDMLKIGCRPSSIVGNVLQAYEKARRIKSVPLLLTGSFYCKV 480

Query: 481 LASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIK 540
           L+SQTSCSILVMAY+KH LVDDALK+LREKEWND  FEENLYHLLICSCKEL  LENAIK
Sbjct: 481 LSSQTSCSILVMAYMKHCLVDDALKILREKEWNDHNFEENLYHLLICSCKELGQLENAIK 540

Query: 541 IYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYV 600
           IYTQLPKR+NKPNLHIT TMIDIYSIMG+FS+GEKLYLSL+SS I LDLIAF+VVVRMYV
Sbjct: 541 IYTQLPKRENKPNLHITCTMIDIYSIMGKFSEGEKLYLSLRSSDIPLDLIAFNVVVRMYV 600

Query: 601 KAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQ 660
           KAGSLEDAC VLD MD+QQDIVPD+YL RDMLRIYQRCGMVDKL D+YYRIL S VSWDQ
Sbjct: 601 KAGSLEDACLVLDLMDQQQDIVPDVYLLRDMLRIYQRCGMVDKLADLYYRILKSGVSWDQ 660

Query: 661 EMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLL 720
           EMYNCVINCCSRAL VDELS LFDEML RGFAPNTVTLNVMLDVYGKSKLF+KAR L  L
Sbjct: 661 EMYNCVINCCSRALPVDELSRLFDEMLHRGFAPNTVTLNVMLDVYGKSKLFTKARNLFGL 720

Query: 721 AQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRM 780
           AQK+GLVDVISYNTMISA+GK+KDF NMSSTV+ M+FNGFS+SLEAYN +LDAYGKE +M
Sbjct: 721 AQKRGLVDVISYNTMISAYGKNKDFKNMSSTVQKMKFNGFSVSLEAYNCMLDAYGKECQM 780

Query: 781 DNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNAL 840
           ++FR VLQ++K+S +ERD+YTYNIMINIYG+QGWID+V EVLTEL+ CGLEPDLYSYN L
Sbjct: 781 ESFRSVLQRMKESCAERDRYTYNIMINIYGEQGWIDEVAEVLTELRECGLEPDLYSYNTL 840

Query: 841 IKAYGIAGMVEEAAQLVKEMREKRIEPDKV 867
           IKAYGIAGMVEEA  LVKEMREKRIEPD++
Sbjct: 841 IKAYGIAGMVEEAVLLVKEMREKRIEPDRI 870

BLAST of Cp4.1LG18g05820 vs. ExPASy TrEMBL
Match: A0A5A7SYW6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold271G00420 PE=4 SV=1)

HSP 1 Score: 1454 bits (3764), Expect = 0.0
Identity = 730/870 (83.91%), Postives = 792/870 (91.03%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSL SF S KFDFPVNS  LSD CS+FSI GYIHLNKSC+LYSL R HKPSKV
Sbjct: 1   MASLKLSFSLHSFDSNKFDFPVNSPPLSDYCSLFSINGYIHLNKSCILYSLARVHKPSKV 60

Query: 61  ---EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGG 120
              EPE S   +S+   D+ID+RKKYF  KKPSKRA GS+FSFS+NCSEK+F++I+F GG
Sbjct: 61  SQVEPEASDVSQSR--FDDIDSRKKYFTAKKPSKRAAGSHFSFSRNCSEKIFENILFSGG 120

Query: 121 ELDVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVL 180
           ELDVNYSTISSDLSLE CNAILKRLEKCND K L FFEWMR N KL+HNVSAYNL+LRVL
Sbjct: 121 ELDVNYSTISSDLSLEGCNAILKRLEKCNDSKTLDFFEWMRSNGKLKHNVSAYNLVLRVL 180

Query: 181 GRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPN 240
           GRQ+DWDAA+KLI+EVRAEL  QLDFQVFNTLIYACYKSG VE G KWF+MMLE QV PN
Sbjct: 181 GRQEDWDAAEKLIKEVRAELGSQLDFQVFNTLIYACYKSGFVEWGTKWFRMMLECQVQPN 240

Query: 241 VATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRL 300
           VATFGMLMGLYQKSC+++E+EFAFNQMRNFGIVCETAYASMITIY R++LYDKAEEVI+L
Sbjct: 241 VATFGMLMGLYQKSCDIEESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQL 300

Query: 301 MQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASN 360
           MQ+DKVIPN+ENWLVMLNAYCQQGKME+AELVFASMEE GFSSNIIAYNTLITGYGKASN
Sbjct: 301 MQKDKVIPNLENWLVMLNAYCQQGKMEEAELVFASMEEAGFSSNIIAYNTLITGYGKASN 360

Query: 361 MDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFT 420
           MD AQRLFL IKNSGVEPDETTYRSMIEGWGRAGNYKMAEWY+KELKRKGYMPN+SNLFT
Sbjct: 361 MDTAQRLFLGIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYYKELKRKGYMPNSSNLFT 420

Query: 421 LMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKV 480
           L+NLQAKHED+AGALKTLNDMLKIGCR SSIVGNVLQAYEKARRIKSVP+LLTGSFYRKV
Sbjct: 421 LINLQAKHEDEAGALKTLNDMLKIGCRPSSIVGNVLQAYEKARRIKSVPVLLTGSFYRKV 480

Query: 481 LASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIK 540
           L+SQTSCSILVMAYVKH LVDDALKVLREKEW D  FEENLYHLLICSCKEL H E+AIK
Sbjct: 481 LSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHFESAIK 540

Query: 541 IYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYV 600
           IY Q PKR+NKPNLHIT TMIDIYSIMGRFSDGEKLYLSL+SSGI LDLIA++VVVRMYV
Sbjct: 541 IYAQRPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYV 600

Query: 601 KAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQ 660
           KAGSLEDACSVLD M +QQDIVPD+YL RDMLRIYQRCGMV KL D+YYRIL S VSWDQ
Sbjct: 601 KAGSLEDACSVLDLMAEQQDIVPDVYLLRDMLRIYQRCGMVHKLSDLYYRILKSGVSWDQ 660

Query: 661 EMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLL 720
           EMYNCVINCCSRAL VDELS LFDEMLQ GFAPNTVTLNVMLDVYGKSKLF+KAR L   
Sbjct: 661 EMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFAKARNLFGF 720

Query: 721 AQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRM 780
           AQK+GLVD ISYNTMIS +GK+KDF NMSSTV+ M+FNGFS+SLEAYN +LDAYGKE +M
Sbjct: 721 AQKRGLVDAISYNTMISVYGKNKDFKNMSSTVQQMKFNGFSVSLEAYNCMLDAYGKECQM 780

Query: 781 DNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNAL 840
           +NFR VLQ++++S SE D YTYNIMINIYG++GWID+V EVLTELKACGLEPDLYSYN L
Sbjct: 781 ENFRSVLQRMQESTSECDHYTYNIMINIYGERGWIDEVAEVLTELKACGLEPDLYSYNTL 840

Query: 841 IKAYGIAGMVEEAAQLVKEMREKRIEPDKV 867
           IKAYGIAGMVEEAA+LVKEMREK IEPD++
Sbjct: 841 IKAYGIAGMVEEAARLVKEMREKGIEPDRI 868

BLAST of Cp4.1LG18g05820 vs. ExPASy TrEMBL
Match: A0A1S4DV41 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487334 PE=4 SV=1)

HSP 1 Score: 1454 bits (3764), Expect = 0.0
Identity = 730/870 (83.91%), Postives = 792/870 (91.03%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60
           MASLKLSFSL SF S KFDFPVNS  LSD CS+FSI GYIHLNKSC+LYSL R HKPSKV
Sbjct: 1   MASLKLSFSLHSFDSNKFDFPVNSPPLSDYCSLFSINGYIHLNKSCILYSLARVHKPSKV 60

Query: 61  ---EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGG 120
              EPE S   +S+   D+ID+RKKYF  KKPSKRA GS+FSFS+NCSEK+F++I+F GG
Sbjct: 61  SQVEPEASDVSQSR--FDDIDSRKKYFTAKKPSKRAAGSHFSFSRNCSEKIFENILFSGG 120

Query: 121 ELDVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVL 180
           ELDVNYSTISSDLSLE CNAILKRLEKCND K L FFEWMR N KL+HNVSAYNL+LRVL
Sbjct: 121 ELDVNYSTISSDLSLEGCNAILKRLEKCNDSKTLDFFEWMRSNGKLKHNVSAYNLVLRVL 180

Query: 181 GRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPN 240
           GRQ+DWDAA+KLI+EVRAEL  QLDFQVFNTLIYACYKSG VE G KWF+MMLE QV PN
Sbjct: 181 GRQEDWDAAEKLIKEVRAELGSQLDFQVFNTLIYACYKSGFVEWGTKWFRMMLECQVQPN 240

Query: 241 VATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRL 300
           VATFGMLMGLYQKSC+++E+EFAFNQMRNFGIVCETAYASMITIY R++LYDKAEEVI+L
Sbjct: 241 VATFGMLMGLYQKSCDIEESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQL 300

Query: 301 MQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASN 360
           MQ+DKVIPN+ENWLVMLNAYCQQGKME+AELVFASMEE GFSSNIIAYNTLITGYGKASN
Sbjct: 301 MQKDKVIPNLENWLVMLNAYCQQGKMEEAELVFASMEEAGFSSNIIAYNTLITGYGKASN 360

Query: 361 MDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFT 420
           MD AQRLFL IKNSGVEPDETTYRSMIEGWGRAGNYKMAEWY+KELKRKGYMPN+SNLFT
Sbjct: 361 MDTAQRLFLGIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYYKELKRKGYMPNSSNLFT 420

Query: 421 LMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKV 480
           L+NLQAKHED+AGALKTLNDMLKIGCR SSIVGNVLQAYEKARRIKSVP+LLTGSFYRKV
Sbjct: 421 LINLQAKHEDEAGALKTLNDMLKIGCRPSSIVGNVLQAYEKARRIKSVPVLLTGSFYRKV 480

Query: 481 LASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIK 540
           L+SQTSCSILVMAYVKH LVDDALKVLREKEW D  FEENLYHLLICSCKEL H E+AIK
Sbjct: 481 LSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHFESAIK 540

Query: 541 IYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYV 600
           IY Q PKR+NKPNLHIT TMIDIYSIMGRFSDGEKLYLSL+SSGI LDLIA++VVVRMYV
Sbjct: 541 IYAQRPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYV 600

Query: 601 KAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQ 660
           KAGSLEDACSVLD M +QQDIVPD+YL RDMLRIYQRCGMV KL D+YYRIL S VSWDQ
Sbjct: 601 KAGSLEDACSVLDLMAEQQDIVPDVYLLRDMLRIYQRCGMVHKLSDLYYRILKSGVSWDQ 660

Query: 661 EMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLL 720
           EMYNCVINCCSRAL VDELS LFDEMLQ GFAPNTVTLNVMLDVYGKSKLF+KAR L   
Sbjct: 661 EMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFAKARNLFGF 720

Query: 721 AQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRM 780
           AQK+GLVD ISYNTMIS +GK+KDF NMSSTV+ M+FNGFS+SLEAYN +LDAYGKE +M
Sbjct: 721 AQKRGLVDAISYNTMISVYGKNKDFKNMSSTVQQMKFNGFSVSLEAYNCMLDAYGKECQM 780

Query: 781 DNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNAL 840
           +NFR VLQ++++S SE D YTYNIMINIYG++GWID+V EVLTELKACGLEPDLYSYN L
Sbjct: 781 ENFRSVLQRMQESTSECDHYTYNIMINIYGERGWIDEVAEVLTELKACGLEPDLYSYNTL 840

Query: 841 IKAYGIAGMVEEAAQLVKEMREKRIEPDKV 867
           IKAYGIAGMVEEAA+LVKEMREK IEPD++
Sbjct: 841 IKAYGIAGMVEEAARLVKEMREKGIEPDRI 868

BLAST of Cp4.1LG18g05820 vs. TAIR 10
Match: AT4G30825.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 1018.5 bits (2632), Expect = 3.5e-297
Identity = 523/874 (59.84%), Postives = 649/874 (74.26%), Query Frame = 0

Query: 1   MASLKLSFSLDSFHS--KKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPS 60
           M SL+ S  LD F S  K+F F  N S   D   +  +T  IH  ++  + S  R     
Sbjct: 1   MGSLRFSIPLDPFDSKRKRFHFSANPSQFPDQFPIHFVTSSIHATRASSIGSSTRVLDKI 60

Query: 61  KV-----EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIV 120
           +V     E   +    +  A  E     K  G ++ +K+     FSF +  ++   +++ 
Sbjct: 61  RVSSLGTEANENAINSASAAPVERSRSSKLSGDQRGTKKYVARKFSFRRGSNDLELENLF 120

Query: 121 FHGGELDVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLI 180
            + GE+DVNYS I    SLE CN ILKRLE C+D  A+ FF+WMR N KL  N  AY+LI
Sbjct: 121 VNNGEIDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLI 180

Query: 181 LRVLGRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQ 240
           LRVLGR+++WD A+ LI+E+      Q  +QVFNT+IYAC K G V+  +KWF MMLE+ 
Sbjct: 181 LRVLGRREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFG 240

Query: 241 VLPNVATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEE 300
           V PNVAT GMLMGLYQK+ N++EAEFAF+ MR FGIVCE+AY+SMITIYTRL LYDKAEE
Sbjct: 241 VRPNVATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTRLRLYDKAEE 300

Query: 301 VIRLMQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYG 360
           VI LM++D+V   +ENWLVMLNAY QQGKME AE +  SME  GFS NIIAYNTLITGYG
Sbjct: 301 VIDLMKQDRVRLKLENWLVMLNAYSQQGKMELAESILVSMEAAGFSPNIIAYNTLITGYG 360

Query: 361 KASNMDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNAS 420
           K   M+AAQ LF  + N G+EPDET+YRSMIEGWGRA NY+ A+ Y++ELKR GY PN+ 
Sbjct: 361 KIFKMEAAQGLFHRLCNIGLEPDETSYRSMIEGWGRADNYEEAKHYYQELKRCGYKPNSF 420

Query: 421 NLFTLMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSF 480
           NLFTL+NLQAK+ D  GA+KT+ DM  IGC+ SSI+G +LQAYEK  +I  VP +L GSF
Sbjct: 421 NLFTLINLQAKYGDRDGAIKTIEDMTGIGCQYSSILGIILQAYEKVGKIDVVPCVLKGSF 480

Query: 481 YRKVLASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLE 540
           +  +  +QTS S LVMAYVKHG+VDD L +LREK+W D  FE +LYHLLICSCKE   L 
Sbjct: 481 HNHIRLNQTSFSSLVMAYVKHGMVDDCLGLLREKKWRDSAFESHLYHLLICSCKESGQLT 540

Query: 541 NAIKIYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVV 600
           +A+KIY    +   + NLHITSTMIDIY++MG FS+ EKLYL+LKSSG+ LD I FS+VV
Sbjct: 541 DAVKIYNHKMESDEEINLHITSTMIDIYTVMGEFSEAEKLYLNLKSSGVVLDRIGFSIVV 600

Query: 601 RMYVKAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDV 660
           RMYVKAGSLE+ACSVL+ MD+Q+DIVPD+YLFRDMLRIYQ+C + DKLQ +YYRI  S +
Sbjct: 601 RMYVKAGSLEEACSVLEIMDEQKDIVPDVYLFRDMLRIYQKCDLQDKLQHLYYRIRKSGI 660

Query: 661 SWDQEMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARK 720
            W+QEMYNCVINCC+RAL +DELS  F+EM++ GF PNTVT NV+LDVYGK+KLF K  +
Sbjct: 661 HWNQEMYNCVINCCARALPLDELSGTFEEMIRYGFTPNTVTFNVLLDVYGKAKLFKKVNE 720

Query: 721 LLLLAQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGK 780
           L LLA++ G+VDVISYNT+I+A+GK+KD+ NMSS ++ M+F+GFS+SLEAYN+LLDAYGK
Sbjct: 721 LFLLAKRHGVVDVISYNTIIAAYGKNKDYTNMSSAIKNMQFDGFSVSLEAYNTLLDAYGK 780

Query: 781 EGRMDNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYS 840
           + +M+ FR +L+++K S S  D YTYNIMINIYG+QGWID+V +VL ELK  GL PDL S
Sbjct: 781 DKQMEKFRSILKRMKKSTSGPDHYTYNIMINIYGEQGWIDEVADVLKELKESGLGPDLCS 840

Query: 841 YNALIKAYGIAGMVEEAAQLVKEMREKRIEPDKV 868
           YN LIKAYGI GMVEEA  LVKEMR + I PDKV
Sbjct: 841 YNTLIKAYGIGGMVEEAVGLVKEMRGRNIIPDKV 874

BLAST of Cp4.1LG18g05820 vs. TAIR 10
Match: AT4G31850.1 (proton gradient regulation 3 )

HSP 1 Score: 197.2 bits (500), Expect = 5.8e-50
Identity = 168/678 (24.78%), Postives = 314/678 (46.31%), Query Frame = 0

Query: 202  DFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVATFGMLMGLYQKSCNLKEAEFAF 261
            +   +NTLI    +   ++   + F  M    V P   T+ + +  Y KS +   A   F
Sbjct: 397  NLHTYNTLICGLLRVHRLDDALELFGNMESLGVKPTAYTYIVFIDYYGKSGDSVSALETF 456

Query: 262  NQMRNFGIVCETAYASMITIYT--RLSLYDKAEEVIRLMQEDKVIPNVENWLVMLNAYCQ 321
             +M+  GI      A   ++Y+  +     +A+++   +++  ++P+   + +M+  Y +
Sbjct: 457  EKMKTKGI-APNIVACNASLYSLAKAGRDREAKQIFYGLKDIGLVPDSVTYNMMMKCYSK 516

Query: 322  QGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDAAQRLFLSIKNSGVEPDETT 381
             G++++A  + + M E+G   ++I  N+LI    KA  +D A ++F+ +K   ++P   T
Sbjct: 517  VGEIDEAIKLLSEMMENGCEPDVIVVNSLINTLYKADRVDEAWKMFMRMKEMKLKPTVVT 576

Query: 382  YRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMNLQAKHEDDAGALKTLNDML 441
            Y +++ G G+ G  + A   F+ + +KG  PN     TL +   K+++   ALK L  M+
Sbjct: 577  YNTLLAGLGKNGKIQEAIELFEGMVQKGCPPNTITFNTLFDCLCKNDEVTLALKMLFKMM 636

Query: 442  KIGCRLSSIVGN-VLQAYEKARRIKSVPLLLTGSFYRKVLASQTSCSILVMAYVKHGLVD 501
             +GC       N ++    K  ++K   +       + V     +   L+   VK  L++
Sbjct: 637  DMGCVPDVFTYNTIIFGLVKNGQVKEA-MCFFHQMKKLVYPDFVTLCTLLPGVVKASLIE 696

Query: 502  DALKVLREKEWNDLRFEENLY-HLLICSCKELDHLENAIKIYTQLPKRKNKPNLHITSTM 561
            DA K++    +N      NL+   LI S      ++NA+    +L       +       
Sbjct: 697  DAYKIITNFLYNCADQPANLFWEDLIGSILAEAGIDNAVSFSERLVANGICRDGDSILVP 756

Query: 562  IDIYSIMGRFSDGEKLYLS--LKSSGIRLDLIAFSVVVRMYVKAGSLEDACSVLDFMDKQ 621
            I  YS       G +       K  G++  L  +++++   ++A  +E A  V     K 
Sbjct: 757  IIRYSCKHNNVSGARTLFEKFTKDLGVQPKLPTYNLLIGGLLEADMIEIAQDVF-LQVKS 816

Query: 622  QDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMYNCVINCCSRALLVDE 681
               +PD+  +  +L  Y + G +D+L ++Y  +   +   +   +N VI+   +A  VD+
Sbjct: 817  TGCIPDVATYNFLLDAYGKSGKIDELFELYKEMSTHECEANTITHNIVISGLVKAGNVDD 876

Query: 682  -LSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQKKGLVD------VIS 741
             L   +D M  R F+P   T   ++D   KS    +A++L      +G++D         
Sbjct: 877  ALDLYYDLMSDRDFSPTACTYGPLIDGLSKSGRLYEAKQLF-----EGMLDYGCRPNCAI 936

Query: 742  YNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNFRQVLQQLK 801
            YN +I+ FGK+ +     +  + M   G    L+ Y+ L+D     GR+D      ++LK
Sbjct: 937  YNILINGFGKAGEADAACALFKRMVKEGVRPDLKTYSVLVDCLCMVGRVDEGLHYFKELK 996

Query: 802  DSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKAC-GLEPDLYSYNALIKAYGIAGMV 861
            +S    D   YN++IN  GK   +++   +  E+K   G+ PDLY+YN+LI   GIAGMV
Sbjct: 997  ESGLNPDVVCYNLIINGLGKSHRLEEALVLFNEMKTSRGITPDLYTYNSLILNLGIAGMV 1056

Query: 862  EEAAQLVKEMREKRIEPD 866
            EEA ++  E++   +EP+
Sbjct: 1057 EEAGKIYNEIQRAGLEPN 1066

BLAST of Cp4.1LG18g05820 vs. TAIR 10
Match: AT5G02860.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 182.2 bits (461), Expect = 1.9e-45
Identity = 137/627 (21.85%), Postives = 274/627 (43.70%), Query Frame = 0

Query: 245 MGLYQK-SCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQEDKV 304
           +G ++K    L+  ++   Q     ++  +  A +I++  +      A  +   +QED  
Sbjct: 145 LGFHKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGF 204

Query: 305 IPNVENWLVMLNAYCQQGKMEDAELVFASMEEHGFSSNIIAYNTLITGYGK-ASNMDAAQ 364
             +V ++  +++A+   G+  +A  VF  MEE G    +I YN ++  +GK  +  +   
Sbjct: 205 SLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKIT 264

Query: 365 RLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMNLQ 424
            L   +K+ G+ PD  TY ++I    R   ++ A   F+E+K  G+  +      L+++ 
Sbjct: 265 SLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVY 324

Query: 425 AKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLASQT 484
            K      A+K LN+M+  G   S +  N                               
Sbjct: 325 GKSHRPKEAMKVLNEMVLNGFSPSIVTYN------------------------------- 384

Query: 485 SCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYTQL 544
               L+ AY + G++D+A+++  +      + +   Y  L+   +    +E+A+ I+ ++
Sbjct: 385 ---SLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEM 444

Query: 545 PKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAGSL 604
                KPN+   +  I +Y   G+F++  K++  +   G+  D++ ++ ++ ++ + G  
Sbjct: 445 RNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMD 504

Query: 605 EDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMYNC 664
            +   V   M K+   VP+   F  ++  Y RCG  ++   VY R+L++ V+ D   YN 
Sbjct: 505 SEVSGVFKEM-KRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNT 564

Query: 665 VINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQKKG 724
           V+   +R  + ++   +  EM      PN +T   +L  Y   K       L       G
Sbjct: 565 VLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVY-SG 624

Query: 725 LVD--VISYNTMISAFGKSKDFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNF 784
           +++   +   T++    K             ++  GFS  +   NS++  YG+   +   
Sbjct: 625 VIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLNSMVSIYGRRQMVAKA 684

Query: 785 RQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKA 844
             VL  +K+        TYN ++ ++ +       EE+L E+ A G++PD+ SYN +I A
Sbjct: 685 NGVLDYMKERGFTPSMATYNSLMYMHSRSADFGKSEEILREILAKGIKPDIISYNTVIYA 735

Query: 845 YGIAGMVEEAAQLVKEMREKRIEPDKV 868
           Y     + +A+++  EMR   I PD +
Sbjct: 745 YCRNTRMRDASRIFSEMRNSGIVPDVI 735

BLAST of Cp4.1LG18g05820 vs. TAIR 10
Match: AT5G14770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 169.5 bits (428), Expect = 1.3e-41
Identity = 160/708 (22.60%), Postives = 305/708 (43.08%), Query Frame = 0

Query: 161 RKLEHNVSAYNLILR--VLGRQQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGL 220
           + L   +S  NLI    +L    +  A ++  R++     D  D   F+++I    K G 
Sbjct: 216 KALVDEISELNLITHTILLSSYYNLHAIEEAYRDMVMSGFDP-DVVTFSSIINRLCKGGK 275

Query: 221 VEQGAKWFQMMLEWQVLPNVATFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCE-TAYAS 280
           V +G    + M E  V PN  T+  L+    K+   + A   ++QM   GI  +   Y  
Sbjct: 276 VLEGGLLLREMEEMSVYPNHVTYTTLVDSLFKANIYRHALALYSQMVVRGIPVDLVVYTV 335

Query: 281 MITIYTRLSLYDKAEEVIRLMQEDKVIPNVENWLVMLNAYCQQGKMEDAELVFASMEEHG 340
           ++    +     +AE+  +++ ED  +PNV  +  +++  C+ G +  AE +   M E  
Sbjct: 336 LMDGLFKAGDLREAEKTFKMLLEDNQVPNVVTYTALVDGLCKAGDLSSAEFIITQMLEKS 395

Query: 341 FSSNIIAYNTLITGYGKASNMDAAQRLFLSIKNSGVEPDETTYRSMIEGWGRAGNYKMAE 400
              N++ Y+++I GY K   ++ A  L   +++  V P+  TY ++I+G  +AG  +MA 
Sbjct: 396 VIPNVVTYSSMINGYVKKGMLEEAVSLLRKMEDQNVVPNGFTYGTVIDGLFKAGKEEMAI 455

Query: 401 WYFKELKRKGYMPNASNLFTLMNLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYE 460
              KE++  G   N   L  L                +N + +IG               
Sbjct: 456 ELSKEMRLIGVEENNYILDAL----------------VNHLKRIG--------------- 515

Query: 461 KARRIKSVPLLLTGSFYRKVLASQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEEN 520
              RIK V  L+     + V   Q + + L+  + K G  + AL    E +   + ++  
Sbjct: 516 ---RIKEVKGLVKDMVSKGVTLDQINYTSLIDVFFKGGDEEAALAWAEEMQERGMPWDVV 575

Query: 521 LYHLLICSCKELDHLENAIKIYTQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSL 580
            Y++LI    +   +  A   Y  + ++  +P++   + M++     G      KL+  +
Sbjct: 576 SYNVLISGMLKFGKV-GADWAYKGMREKGIEPDIATFNIMMNSQRKQGDSEGILKLWDKM 635

Query: 581 KSSGIRLDLIAFSVVVRMYVKAGSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGM 640
           KS GI+  L++ ++VV M  + G +E+A  +L+ M    +I P++  +R  L    +   
Sbjct: 636 KSCGIKPSLMSCNIVVGMLCENGKMEEAIHILNQM-MLMEIHPNLTTYRIFLDTSSKHKR 695

Query: 641 VDKLQDVYYRILNSDVSWDQEMYNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNV 700
            D +   +  +L+  +   +++YN +I    +  +  + + +  +M  RGF P+TVT N 
Sbjct: 696 ADAIFKTHETLLSYGIKLSRQVYNTLIATLCKLGMTKKAAMVMGDMEARGFIPDTVTFNS 755

Query: 701 MLDVYGKSKLFSKARKLLLLAQKKGLVDVISYNTMISAFGKSKDFANMSSTVRTMEFNGF 760
           ++  Y       KA              + +Y+ M+ A                    G 
Sbjct: 756 LMHGYFVGSHVRKA--------------LSTYSVMMEA--------------------GI 815

Query: 761 SLSLEAYNSLLDAYGKEGRMDNFRQVLQQLKDSNSERDQYTYNIMINIYGKQGWIDDVEE 820
           S ++  YN+++      G +    + L ++K      D +TYN +I+   K G +     
Sbjct: 816 SPNVATYNTIIRGLSDAGLIKEVDKWLSEMKSRGMRPDDFTYNALISGQAKIGNMKGSMT 852

Query: 821 VLTELKACGLEPDLYSYNALIKAYGIAGMVEEAAQLVKEMREKRIEPD 866
           +  E+ A GL P   +YN LI  +   G + +A +L+KEM ++ + P+
Sbjct: 876 IYCEMIADGLVPKTSTYNVLISEFANVGKMLQARELLKEMGKRGVSPN 852

BLAST of Cp4.1LG18g05820 vs. TAIR 10
Match: AT1G63400.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 168.7 bits (426), Expect = 2.2e-41
Identity = 131/545 (24.04%), Postives = 244/545 (44.77%), Query Frame = 0

Query: 322 KMEDAELVFASMEEHGFSSNIIAYNTLITGYGKASNMDAAQRLFLSIKNSGVEPDETTYR 381
           K++DA  +F  M +     +I  +N L++   K    D    L   ++  G+  +  TY 
Sbjct: 65  KLDDAIGLFGGMVKSRPLPSIFEFNKLLSAIAKMKKFDLVISLGEKMQRLGISHNLYTYN 124

Query: 382 SMIEGWGRAGNYKMAEWYFKELKRKGYMPNASNLFTLMNLQAKHEDDAGALKTLNDMLKI 441
            +I  + R     +A     ++ + GY P+   L +L+N     +  + A+  ++ M+++
Sbjct: 125 ILINCFCRRSQISLALALLGKMMKLGYEPSIVTLSSLLNGYCHGKRISDAVALVDQMVEM 184

Query: 442 GCRLSSIV-GNVLQAYEKARRIKSVPLLLTGSFYRKVLASQTSCSILVMAYVKHGLVDDA 501
           G R  +I    ++       +      L+     R    +  +  ++V    K G +D A
Sbjct: 185 GYRPDTITFTTLIHGLFLHNKASEAVALVDRMVQRGCQPNLVTYGVVVNGLCKRGDIDLA 244

Query: 502 LKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIYTQLPKRKNKPNLHITSTMIDI 561
             +L + E   +     +Y  +I S  +  H ++A+ ++T++  +  +PN+   S++I  
Sbjct: 245 FNLLNKMEAAKIEANVVIYSTVIDSLCKYRHEDDALNLFTEMENKGVRPNVITYSSLISC 304

Query: 562 YSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKAGSLEDACSVLDFMDKQQDIVP 621
                R+SD  +L   +    I  +++ F+ ++  +VK G L +A  + D M K + I P
Sbjct: 305 LCNYERWSDASRLLSDMIERKINPNVVTFNALIDAFVKEGKLVEAEKLYDEMIK-RSIDP 364

Query: 622 DIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEMYNCVINCCSRALLVDELSSLF 681
           DI+ +  ++  +     +D+ + ++  +++ D   +   YN +IN   +A  +DE   LF
Sbjct: 365 DIFTYSSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLINGFCKAKRIDEGVELF 424

Query: 682 DEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQKKGLVDVISYNTMISAFGKSK 741
            EM QRG   NTVT                                  Y T+I  F +++
Sbjct: 425 REMSQRGLVGNTVT----------------------------------YTTLIHGFFQAR 484

Query: 742 DFANMSSTVRTMEFNGFSLSLEAYNSLLDAYGKEGRMDNFRQVLQQLKDSNSERDQYTYN 801
           D  N     + M  +G   ++  YN+LLD   K G+++    V + L+ S  E   YTYN
Sbjct: 485 DCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQRSKMEPTIYTYN 544

Query: 802 IMINIYGKQGWIDDVEEVLTELKACGLEPDLYSYNALIKAYGIAGMVEEAAQLVKEMREK 861
           IMI    K G ++D  ++   L   G++PD+  YN +I  +   G+ EEA  L ++MRE 
Sbjct: 545 IMIEGMCKAGKVEDGWDLFCSLSLKGVKPDVIIYNTMISGFCRKGLKEEADALFRKMRED 574

Query: 862 RIEPD 866
              PD
Sbjct: 605 GPLPD 574

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O655674.9e-29659.84Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidop... [more]
Q9SZ528.2e-4924.78Pentatricopeptide repeat-containing protein At4g31850, chloroplastic OS=Arabidop... [more]
Q9LYZ92.7e-4421.85Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX... [more]
B8Y6I03.9e-4319.83Pentatricopeptide repeat-containing protein 10, chloroplastic OS=Zea mays OX=457... [more]
Q9LER01.8e-4022.60Pentatricopeptide repeat-containing protein At5g14770, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
KAG7023427.10.098.49Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosper... [more]
KAG6589757.10.098.47putative leucine-rich repeat receptor-like serine/threonine-protein kinase, part... [more]
XP_023516176.10.0100.00pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita ... [more]
XP_022922044.10.098.73pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita ... [more]
XP_022988547.10.097.92pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
A0A6J1E2800.098.73pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucurbit... [more]
A0A6J1JDB70.097.92pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucurbit... [more]
A0A6J1BZD70.083.79pentatricopeptide repeat-containing protein At4g30825, chloroplastic isoform X1 ... [more]
A0A5A7SYW60.083.91Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4DV410.083.91pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
AT4G30825.13.5e-29759.84Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G31850.15.8e-5024.78proton gradient regulation 3 [more]
AT5G02860.11.9e-4521.85Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G14770.11.3e-4122.60Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G63400.12.2e-4124.04Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 843..863
NoneNo IPR availablePANTHERPTHR12683:SF10OS09G0423300 PROTEINcoord: 69..867
NoneNo IPR availablePANTHERPTHR12683CDK-ACTIVATING KINASE ASSEMBLY FACTOR MAT1coord: 69..867
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 163..343
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 315..575
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 724..774
e-value: 5.1E-4
score: 20.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 659..702
e-value: 7.0E-9
score: 35.8
coord: 795..841
e-value: 1.8E-9
score: 37.6
coord: 341..386
e-value: 1.6E-11
score: 44.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 240..269
e-value: 0.97
score: 9.8
coord: 483..506
e-value: 0.0048
score: 17.0
coord: 274..302
e-value: 0.0075
score: 16.4
coord: 588..614
e-value: 0.12
score: 12.7
coord: 310..338
e-value: 7.6E-6
score: 25.8
coord: 555..582
e-value: 0.13
score: 12.6
coord: 205..232
e-value: 0.0028
score: 17.8
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 763..795
e-value: 1.9E-4
score: 19.4
coord: 205..238
e-value: 3.1E-4
score: 18.7
coord: 344..377
e-value: 4.0E-6
score: 24.7
coord: 588..622
e-value: 5.9E-5
score: 21.0
coord: 274..307
e-value: 4.6E-5
score: 21.3
coord: 379..412
e-value: 8.4E-4
score: 17.4
coord: 832..865
e-value: 1.5E-9
score: 35.5
coord: 797..830
e-value: 3.5E-8
score: 31.1
coord: 659..692
e-value: 2.5E-7
score: 28.5
coord: 727..760
e-value: 6.9E-4
score: 17.6
coord: 311..341
e-value: 1.0E-6
score: 26.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 9.108898
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 341..375
score: 11.728648
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 760..794
score: 9.558311
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 376..410
score: 11.936913
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 725..759
score: 9.339086
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 306..340
score: 11.334042
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 202..236
score: 9.700809
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 795..829
score: 12.090371
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 656..690
score: 11.366925
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 830..864
score: 13.131695
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 118..270
e-value: 2.3E-23
score: 84.7
coord: 370..445
e-value: 4.4E-9
score: 38.1
coord: 821..868
e-value: 7.1E-10
score: 40.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 479..652
e-value: 1.6E-27
score: 98.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 271..369
e-value: 8.6E-23
score: 83.2
coord: 653..794
e-value: 1.4E-32
score: 115.4

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g05820.1Cp4.1LG18g05820.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006281 DNA repair
biological_process GO:0000079 regulation of cyclin-dependent protein serine/threonine kinase activity
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0009570 chloroplast stroma
cellular_component GO:0005675 transcription factor TFIIH holo complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0003729 mRNA binding
molecular_function GO:0005515 protein binding