Cp4.1LG20g00240 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g00240
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG20: 123242 .. 126262 (+)
RNA-Seq Expression: Cp4.1LG20g00240
Synteny: Cp4.1LG20g00240
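
The Location field above can be split into a sequence ID, a start, an end, and a strand. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state). The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'SEQID: START .. END (STRAND)' string into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cp4.1LG20: 123242 .. 126262 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span_length = end - start + 1
print(seqid, start, end, strand, span_length)
```

Under that assumption the feature spans 3021 positions on the forward strand of Cp4.1LG20.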
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTGCTCAGTGATTATGTAAAATCCAGAAAATGCTCTTTGAGGAACACCAAAGTTCTACATGCCAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTGGATTGCTACTCAAAGTCTAACTCTCTGGACCATGCACTCAAGCTGTTTGATACAATGCTCCACCCAAATGTCATTTCTTGGAATATCCTTATCTCCAGTTTCAACCACAACTTCATGTATTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGAGATAACGTATGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTCTATTCACTTGCTGTGAGAAATGGGTTTTTTGTTAATGGTTATGTTCGAGCAGGGATGATCGATTTGTTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAACGTGGTGTGCTGGAATGCTATTGTGTCCGCAGCTGTAAGGAATGGGGAGAATTTTATGGCTTTGGATCTTTACAACACAATGTGTCATGGGTTCTTGGAGCCTAATAGTTTCACGTTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAACATCCTGAATTTGGGAAAAGAGTTCAAGGGAAAGTGATTAAATGTGGCGGAGAAGATGTTTTTGTAGAGACAGCCCTCATTGATTTGTATAGCAAGTGTGGAGAAATGGATGAAGCTGTTAAAATATTCTTGCGGATGCCCATTCGCAATGTGGTCTCATGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTACTTAATGGCCCTCAAGTTTTTCAAAGATATGAGAAAATTGGGGGAGGAAATCAATAGCTATACAGTCACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGCAATCCAACTCCACTCCTGGATTTTAAGAGCTGGTTTTTCATCTCATGCGGTGGTGGGAGCTGCTTTAATTAACATGTATTCGAAAATAGGAGCTATTGATCTTTCTATGACTGTTTTCGGAGAGATGGACAATAAAAGGAATCTCAGTTCTTGGACAGCTATGATTACCTCATTTGCACAGAACAATGATAAAGAGAAAGCAAGTGAATTGTTCCAAAAAATGTTACGTGAAAGTATGGGACCAGATACATTTTGCACCTCTAGTGTCTTGAGTGTGACCGACTGTATTACCTTTGGGAGACAGATTCACTGCTTCACGCATAAAACTGGATTAATATTTGACATTTCTGTCGGCAGTGCTCTTTTCACAATGTATTCCAAATGTGGCTATCTAGAGGAAGCTTTTCATGTTTTTAAAAACATGCCAAAGAAGGACAATATTTCATGGGCATCGATGATGTCCTGTTTCTCAGAACATGGTTATGCAAAAGAGGGCATCCAATTATTTAGAGAAATGTTGTTTGAAGAATATGTTCCTGATTATATGATTTTAAGTACAGTCCTAAATGCATGTTCTGTTCTTCATTCTATTCAAATAGGCAGAGAGATTCATTGTTATTCTGTTCGTTTGGGTCTGGACAAAGATGTAGCAATTGGGGGTTCGCTTGTGACTATGTACTCAAAATGCGGCAACCTGGAGATGGCTCGGAGGGTGTTTGAAACATTGCCCGAGAAAGATAATATTGCATGCTCTTCGTTGGTTTCAGGATATGCTCAACACAAGTGCATCAAAGAGACAATTTTGCTATTCCAAGATCTACTGGAGGCTGGCTTAGCCATCGATCCCTTTTCAATCTCATCCATACTGGGAGCAATTGCGCTTTTAAATAGGCCTGGTATTGGGACTCAACTCCATGCAATCATTACGAAAGTAGGCTTGGAGAAAGATGTTTCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAAATGTGGAAGTATAGAAGACTGCTGCAAAGCATTTGAGCAGATTGGAAAGCCTGATTT
GATAGGTTGGACAGCCATGATTGTGAGTTATGCCCAGCATGGGAAAGGTGCTGAAGCTTTATGTGTCTATGAACTTATGAAGAAAGAAGGAATCAAGCCTGATCCAGTCACCTTTGTTGGGGTTTTGTCTGCTTGTAGCCATAATGGTTTGGTGGATGAAGCTTATTTCCACCTTAATTCGATGGTGAAAGACTATGGTATACAACCGGGACATCGACATTATGCTTGTATGGTTGATCTTCTTGGTCGGTGTGGGCAACTGAAAAGGGCAGAAGAACTGATTAACAATATGCCTATTGAACCTGATGCTCTCATTTGGGGAACTCTTCTCGCTGCCTGTAAAGTACATGGAGATATTGAACTTGGAAAACTAGCGGCAAGAAAGGTGATGGAGTTGAAGCCAAGTGACACTGGGGCGTATGTCTCTCTTTCCAACATCTGTGCTGATATGGGCCTGTGGGAAGAGGTCCTGAACGTTAGAAGCCTTATGAAGGGAGCTGGAGTGACGAAGGAACCTGGTTGGAGCTTGCTGTAAGAAGCTATTGTTCATAGTTGGTAAGTCCTTATAGCTAAGGATTTTCCCTTTTTCTCTTTTTTTCTTTTTTTTGTTTTGTTTTGTTTTTGTTTTTGTTTGCATGTCTTGATTTTATTTAGAACGTCCTTAGCATGAATACCAATGAACTGCAGTATGAATACTCGAAGAGTTAGCAAAACAGCAAAAGATGTTTAGCTGACCTTATTGATTCTGATTTCATCCCATGGTCTAATCCGTAAGTTTTGATAATTATGACAGATCTCAGAAAAACTTGTGGGACAAGCTGAGTCCAGAAGGAAGGCTCTTGGTCAGTTTTAGAATATTTTTTTATGAGTTTCTCGGTGATATGGTATAAGAGAGGAAGTCTTTTTGTTCTATATCCACCTCAGTACCTCATGTTTGGTCCTTAAATCAGAAGACAGGAATGCAAGCAACAGACATCGTGACTTCCTGATCAGGCTGGATACCGGAAGTGACTCAAATTGA

mRNA sequence

TTGCTCAGTGATTATGTAAAATCCAGAAAATGCTCTTTGAGGAACACCAAAGTTCTACATGCCAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTGGATTGCTACTCAAAGTCTAACTCTCTGGACCATGCACTCAAGCTGTTTGATACAATGCTCCACCCAAATGTCATTTCTTGGAATATCCTTATCTCCAGTTTCAACCACAACTTCATGTATTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGAGATAACGTATGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTCTATTCACTTGCTGTGAGAAATGGGTTTTTTGTTAATGGTTATGTTCGAGCAGGGATGATCGATTTGTTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAACGTGGTGTGCTGGAATGCTATTGTGTCCGCAGCTGTAAGGAATGGGGAGAATTTTATGGCTTTGGATCTTTACAACACAATGTGTCATGGGTTCTTGGAGCCTAATAGTTTCACGTTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAACATCCTGAATTTGGGAAAAGAGTTCAAGGGAAAGTGATTAAATGTGGCGGAGAAGATGTTTTTGTAGAGACAGCCCTCATTGATTTGTATAGCAAGTGTGGAGAAATGGATGAAGCTGTTAAAATATTCTTGCGGATGCCCATTCGCAATGTGGTCTCATGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTACTTAATGGCCCTCAAGTTTTTCAAAGATATGAGAAAATTGGGGGAGGAAATCAATAGCTATACAGTCACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGCAATCCAACTCCACTCCTGGATTTTAAGAGCTGGTTTTTCATCTCATGCGGTGGTGGGAGCTGCTTTAATTAACATGTATTCGAAAATAGGAGCTATTGATCTTTCTATGACTGTTTTCGGAGAGATGGACAATAAAAGGAATCTCAGTTCTTGGACAGCTATGATTACCTCATTTGCACAGAACAATGATAAAGAGAAAGCAAGTGAATTGTTCCAAAAAATGTTACGTGAAAGTATGGGACCAGATACATTTTGCACCTCTAGTGTCTTGAGTGTGACCGACTGTATTACCTTTGGGAGACAGATTCACTGCTTCACGCATAAAACTGGATTAATATTTGACATTTCTGTCGGCAGTGCTCTTTTCACAATGTATTCCAAATGTGGCTATCTAGAGGAAGCTTTTCATATCTCAGAAAAACTTGTGGGACAAGCTGAGTCCAGAAGGAAGGCTCTTGAAGACAGGAATGCAAGCAACAGACATCGTGACTTCCTGATCAGGCTGGATACCGGAAGTGACTCAAATTGA

Coding sequence (CDS)

TTGCTCAGTGATTATGTAAAATCCAGAAAATGCTCTTTGAGGAACACCAAAGTTCTACATGCCAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTGGATTGCTACTCAAAGTCTAACTCTCTGGACCATGCACTCAAGCTGTTTGATACAATGCTCCACCCAAATGTCATTTCTTGGAATATCCTTATCTCCAGTTTCAACCACAACTTCATGTATTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGAGATAACGTATGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTCTATTCACTTGCTGTGAGAAATGGGTTTTTTGTTAATGGTTATGTTCGAGCAGGGATGATCGATTTGTTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAACGTGGTGTGCTGGAATGCTATTGTGTCCGCAGCTGTAAGGAATGGGGAGAATTTTATGGCTTTGGATCTTTACAACACAATGTGTCATGGGTTCTTGGAGCCTAATAGTTTCACGTTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAACATCCTGAATTTGGGAAAAGAGTTCAAGGGAAAGTGATTAAATGTGGCGGAGAAGATGTTTTTGTAGAGACAGCCCTCATTGATTTGTATAGCAAGTGTGGAGAAATGGATGAAGCTGTTAAAATATTCTTGCGGATGCCCATTCGCAATGTGGTCTCATGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTACTTAATGGCCCTCAAGTTTTTCAAAGATATGAGAAAATTGGGGGAGGAAATCAATAGCTATACAGTCACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGCAATCCAACTCCACTCCTGGATTTTAAGAGCTGGTTTTTCATCTCATGCGGTGGTGGGAGCTGCTTTAATTAACATGTATTCGAAAATAGGAGCTATTGATCTTTCTATGACTGTTTTCGGAGAGATGGACAATAAAAGGAATCTCAGTTCTTGGACAGCTATGATTACCTCATTTGCACAGAACAATGATAAAGAGAAAGCAAGTGAATTGTTCCAAAAAATGTTACGTGAAAGTATGGGACCAGATACATTTTGCACCTCTAGTGTCTTGAGTGTGACCGACTGTATTACCTTTGGGAGACAGATTCACTGCTTCACGCATAAAACTGGATTAATATTTGACATTTCTGTCGGCAGTGCTCTTTTCACAATGTATTCCAAATGTGGCTATCTAGAGGAAGCTTTTCATATCTCAGAAAAACTTGTGGGACAAGCTGAGTCCAGAAGGAAGGCTCTTGAAGACAGGAATGCAAGCAACAGACATCGTGACTTCCTGATCAGGCTGGATACCGGAAGTGACTCAAATTGA

Protein sequence

LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKLVGQAESRRKALEDRNASNRHRDFLIRLDTGSDSN
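
The relationship between the coding sequence and the protein sequence above can be checked by translating codons. This is a minimal sketch using the standard genetic code; that this gene uses the standard code is an assumption, and `translate` is a hypothetical helper. It does not require a start codon, because the CDS listed here begins with TTG.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 nucleotides of the CDS listed above.
cds_prefix = "TTGCTCAGTGATTATGTAAAATCCAGAAAATGC"
print(translate(cds_prefix))
```

The printed residues can be compared with the first eleven residues of the protein sequence; pasting the full CDS into `translate` extends the same check to the whole record.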
Homology
BLAST of Cp4.1LG20g00240 vs. ExPASy Swiss-Prot
Match: Q9CA56 (Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-E69 PE=3 SV=1)

HSP 1 Score: 450.7 bits (1158), Expect = 2.2e-125
Identity = 224/454 (49.34%), Positives = 319/454 (70.26%), Query Frame = 0

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           +D   SR C+LR TK+L A LLR  LL  +++++ SLL  YS S S+  A KLFDT+  P
Sbjct: 54  NDQSNSRLCNLRTTKILQAHLLRRYLLPFDVFLTKSLLSWYSNSGSMADAAKLFDTIPQP 113

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
           +V+S NI+IS +  + ++ +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V 
Sbjct: 114 DVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSALQAPLFSELVC 173

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
              ++ G+F    V + +ID+F+K+  F DA +VF D    NV CWN I++ A+RN    
Sbjct: 174 CHTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAGALRNQNYG 233

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 242
              DL++ MC GF +P+S+T+SSVL ACA+LE   FGK VQ +VIKCG EDVFV TA++D
Sbjct: 234 AVFDLFHEMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAEDVFVCTAIVD 293

Query: 243 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 302
           LY+KCG M EA+++F R+P  +VVSWT ++SG+ + ND   AL+ FK+MR  G EIN+ T
Sbjct: 294 LYAKCGHMAEAMEVFSRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCT 353

Query: 303 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 362
           VTSV++AC  P+M  EA Q+H+W+ ++GF   + V AALI+MYSK G IDLS  VF ++D
Sbjct: 354 VTSVISACGRPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDLSEQVFEDLD 413

Query: 363 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 422
           + +  +    MITSF+Q+    KA  LF +ML+E +  D F   S+LSV DC+  G+Q+H
Sbjct: 414 DIQRQNIVNVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLNLGKQVH 473

Query: 423 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
            +T K+GL+ D++VGS+LFT+YSKCG LEE++ +
Sbjct: 474 GYTLKSGLVLDLTVGSSLFTLYSKCGSLEESYKL 507
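
The percentages in an HSP summary line are the matched-position counts divided by the alignment length. The sketch below recomputes them from the counts for the first hit; the regular expression and the helper name `hsp_percentages` are illustrative assumptions and are not part of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 224/454 (49.34%)".
HSP_STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def hsp_percentages(line):
    """Return {label: (recomputed_percent, reported_percent)} for one line."""
    stats = {}
    for label, matched, length, reported in HSP_STAT.findall(line):
        computed = round(100 * int(matched) / int(length), 2)
        stats[label] = (computed, float(reported))
    return stats

line = "Identity = 224/454 (49.34%), Positives = 319/454 (70.26%), Query Frame = 0"
stats = hsp_percentages(line)
for label, (computed, reported) in stats.items():
    print(label, computed, reported)
```

For this line the recomputed values agree with the reported ones to two decimal places.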

BLAST of Cp4.1LG20g00240 vs. ExPASy Swiss-Prot
Match: O04659 (Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E14 PE=2 SV=2)

HSP 1 Score: 219.2 bits (557), Expect = 1.1e-55
Identity = 142/464 (30.60%), Positives = 248/464 (53.45%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN---SLDHALKLFD 60
           LL +   S K SLR  K++H ++L    L  ++ +  SL++ Y       S  H  + FD
Sbjct: 9   LLRECTNSTK-SLRRIKLVHQRILTLG-LRRDVVLCKSLINVYFTCKDHCSARHVFENFD 68

Query: 61  TMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGF-EPSEITYGSVLSACAAIQAPM 120
             +  +V  WN L+S ++ N M+ D+   F R+       P   T+ +V+ A  A+    
Sbjct: 69  --IRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREF 128

Query: 121 FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAV 180
            G+ +++L V++G+  +  V + ++ ++AK + F ++L+VF ++   +V  WN ++S   
Sbjct: 129 LGRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFY 188

Query: 181 RNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGE-DVF 240
           ++GE   AL+L+  M     EPNS + +  ++AC+ L   E GK +  K +K G E D +
Sbjct: 189 QSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEY 248

Query: 241 VETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLG 300
           V +AL+D+Y KC  ++ A ++F +MP +++V+W ++I G+V K D    ++    M   G
Sbjct: 249 VNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEG 308

Query: 301 EEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSM 360
              +  T+TS+L AC+          +H +++R+  ++   V  +LI++Y K G  +L+ 
Sbjct: 309 TRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAE 368

Query: 361 TVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL---SVT 420
           TVF +   K    SW  MI+S+    +  KA E++ +M+   + PD    +SVL   S  
Sbjct: 369 TVFSK-TQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQL 428

Query: 421 DCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
             +  G+QIH    ++ L  D  + SAL  MYSKCG  +EAF I
Sbjct: 429 AALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRI 467

BLAST of Cp4.1LG20g00240 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 218.8 bits (556), Expect = 1.4e-55
Identity = 160/544 (29.41%), Positives = 260/544 (47.79%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LL   +KS K S    + +HA ++++    + I++ N L+D YSK  SL+   ++FD M 
Sbjct: 25  LLDSCIKS-KLSAIYVRYVHASVIKSG-FSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMP 84

Query: 61  HPNVISWNILISSFNHNFMYLD---------------SWRT-----------------FC 120
             N+ +WN +++       +LD               +W +                 F 
Sbjct: 85  QRNIYTWNSVVTGLT-KLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFA 144

Query: 121 RMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDS 180
            MH  GF  +E ++ SVLSAC+ +     G QV+SL  ++ F  + Y+ + ++D+++K  
Sbjct: 145 MMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCG 204

Query: 181 SFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLT 240
           +  DA RVF ++   NVV WN++++   +NG    ALD++  M    +EP+  T +SV++
Sbjct: 205 NVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVIS 264

Query: 241 ACAALEHPEFGKRVQGKVIKCG--GEDVFVETALIDLYSKCGEMDEAVKIFLRMPI---- 300
           ACA+L   + G+ V G+V+K      D+ +  A +D+Y+KC  + EA  IF  MPI    
Sbjct: 265 ACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVI 324

Query: 301 ---------------------------RNVVSWTAIISGFVQKNDYLMALKFFKDMRKLG 360
                                      RNVVSW A+I+G+ Q  +   AL  F  +++  
Sbjct: 325 AETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRES 384

Query: 361 EEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHA------VVGAALINMYSKIG 420
                Y+  ++L ACA+ A     +Q H  +L+ GF   +       VG +LI+MY K G
Sbjct: 385 VCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCG 444

Query: 421 AIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL 467
            ++    VF +M  +R+  SW AMI  FAQN    +A ELF++ML     PD      VL
Sbjct: 445 CVEEGYLVFRKM-MERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVL 504

BLAST of Cp4.1LG20g00240 vs. ExPASy Swiss-Prot
Match: Q9LR69 (Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E4 PE=2 SV=1)

HSP 1 Score: 211.8 bits (538), Expect = 1.7e-53
Identity = 140/458 (30.57%), Positives = 241/458 (52.62%), Query Frame = 0

Query: 20  HAKLLRATLLHSNIYVSNSLLDCYSK-SNSLDHALKLFDTMLHPNVISWNILISSFNHNF 79
           HA ++++  L ++  V NSLL  Y K    +    ++FD     + ISW  ++S +    
Sbjct: 84  HAHVVKSG-LETDRNVGNSLLSLYFKLGPGMRETRRVFDGRFVKDAISWTSMMSGYVTGK 143

Query: 80  MYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA 139
            ++ +   F  M   G + +E T  S + AC+ +     G+  + + + +GF  N ++ +
Sbjct: 144 EHVKALEVFVEMVSFGLDANEFTLSSAVKACSELGEVRLGRCFHGVVITHGFEWNHFISS 203

Query: 140 GMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHG-FLE 199
            +  L+  +   +DA RVF ++   +V+CW A++SA  +N     AL L+  M  G  L 
Sbjct: 204 TLAYLYGVNREPVDARRVFDEMPEPDVICWTAVLSAFSKNDLYEEALGLFYAMHRGKGLV 263

Query: 200 PNSFTFSSVLTACAALEHPEFGKRVQGKVIKCG-GEDVFVETALIDLYSKCGEMDEAVKI 259
           P+  TF +VLTAC  L   + GK + GK+I  G G +V VE++L+D+Y KCG + EA ++
Sbjct: 264 PDGSTFGTVLTACGNLRRLKQGKEIHGKLITNGIGSNVVVESSLLDMYGKCGSVREARQV 323

Query: 260 FLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACANPAMT 319
           F  M  +N VSW+A++ G+ Q  ++  A++ F++M    EE + Y   +VL ACA  A  
Sbjct: 324 FNGMSKKNSVSWSALLGGYCQNGEHEKAIEIFREM----EEKDLYCFGTVLKACAGLAAV 383

Query: 320 KEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMITS 379
           +   ++H   +R G   + +V +ALI++Y K G ID +  V+ +M + RN+ +W AM+++
Sbjct: 384 RLGKEIHGQYVRRGCFGNVIVESALIDLYGKSGCIDSASRVYSKM-SIRNMITWNAMLSA 443

Query: 380 FAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSV---TDCITFGRQIHCFTHKT-GLIF 439
            AQN   E+A   F  M+++ + PD     ++L+    T  +  GR       K+ G+  
Sbjct: 444 LAQNGRGEEAVSFFNDMVKKGIKPDYISFIAILTACGHTGMVDEGRNYFVLMAKSYGIKP 503

Query: 440 DISVGSALFTMYSKCGYLEEAFHISEKLVGQAESRRKA 471
                S +  +  + G  EEA    E L+ +AE R  A
Sbjct: 504 GTEHYSCMIDLLGRAGLFEEA----ENLLERAECRNDA 531

BLAST of Cp4.1LG20g00240 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 211.1 bits (536), Expect = 2.9e-53
Identity = 132/442 (29.86%), Positives = 235/442 (53.17%), Query Frame = 0

Query: 19  LHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVISWNILISSFNHNF 78
           +HA++L   L  S + V N L+D YS++  +D A ++FD +   +  SW  +IS  + N 
Sbjct: 209 IHARILYQGLRDSTV-VCNPLIDLYSRNGFVDLARRVFDGLRLKDHSSWVAMISGLSKNE 268

Query: 79  MYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA 138
              ++ R FC M+ LG  P+   + SVLSAC  I++   G+Q++ L ++ GF  + YV  
Sbjct: 269 CEAEAIRLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLGFSSDTYVCN 328

Query: 139 GMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEP 198
            ++ L+    + + A  +F ++   + V +N +++   + G    A++L+  M    LEP
Sbjct: 329 ALVSLYFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEP 388

Query: 199 NSFTFSSVLTACAALEHPEFGKRVQGKVIKCG-GEDVFVETALIDLYSKCGEMDEAVKIF 258
           +S T +S++ AC+A      G+++     K G   +  +E AL++LY+KC +++ A+  F
Sbjct: 389 DSNTLASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYF 448

Query: 259 LRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEI--NSYTVTSVLTACANPAM 318
           L   + NVV W  ++  +   +D   + + F+ M+   EEI  N YT  S+L  C     
Sbjct: 449 LETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQI--EEIVPNQYTYPSILKTCIRLGD 508

Query: 319 TKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMIT 378
            +   Q+HS I++  F  +A V + LI+MY+K+G +D +  +      K ++ SWT MI 
Sbjct: 509 LELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGK-DVVSWTTMIA 568

Query: 379 SFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSV---TDCITFGRQIHCFTHKTGLIF 438
            + Q N  +KA   F++ML   +  D    ++ +S       +  G+QIH     +G   
Sbjct: 569 GYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSS 628

Query: 439 DISVGSALFTMYSKCGYLEEAF 455
           D+   +AL T+YS+CG +EE++
Sbjct: 629 DLPFQNALVTLYSRCGKIEESY 646

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: XP_023519257.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 920 bits (2379), Expect = 0.0
Identity = 455/460 (98.91%), Positives = 458/460 (99.57%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML
Sbjct: 49  LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ
Sbjct: 109 HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE
Sbjct: 169 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL
Sbjct: 229 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS
Sbjct: 289 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE
Sbjct: 349 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ
Sbjct: 409 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 460
           IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFH+ + +
Sbjct: 469 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHVFKNM 508

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: XP_022923751.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita moschata] >KAG6584457.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] >KAG7020048.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 909 bits (2348), Expect = 0.0
Identity = 448/461 (97.18%), Positives = 455/461 (98.70%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML
Sbjct: 49  LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           HPNVISWNILISSFNHNF+YLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ
Sbjct: 109 HPNVISWNILISSFNHNFLYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHD+ CENVVCWNAIVSAAVRNGE
Sbjct: 169 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDIHCENVVCWNAIVSAAVRNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           NFMALDLYNTMC G LEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL
Sbjct: 229 NFMALDLYNTMCRGLLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS
Sbjct: 289 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVLTACANPAMTKEAIQLHSWILRAG+SSHAVVGAALINMYSKIGAIDLSMTVFGE
Sbjct: 349 YTVTSVLTACANPAMTKEAIQLHSWILRAGYSSHAVVGAALINMYSKIGAIDLSMTVFGE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDN+RNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ
Sbjct: 409 MDNQRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKLV 461
           IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFH+ + + 
Sbjct: 469 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHVFKNMA 509

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: XP_023001341.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita maxima])

HSP 1 Score: 905 bits (2339), Expect = 0.0
Identity = 447/460 (97.17%), Positives = 455/460 (98.91%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LLSDYVKSRKCSLR+TKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML
Sbjct: 49  LLSDYVKSRKCSLRHTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           HPNVISWNILISSFNHNF+YLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ
Sbjct: 109 HPNVISWNILISSFNHNFLYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFFVNGYVRAGMIDLFAK+SSFLDALRVF DVDCENVVCWNAIVSAAVRNGE
Sbjct: 169 VYSLAVRNGFFVNGYVRAGMIDLFAKESSFLDALRVFQDVDCENVVCWNAIVSAAVRNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           NFMALDLYNTMC GFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL
Sbjct: 229 NFMALDLYNTMCRGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS
Sbjct: 289 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE
Sbjct: 349 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDN+RNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ
Sbjct: 409 MDNQRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 460
           IHCFTHKTGL+F ISVGSALFTMYSKCGYLEEAFH+ + +
Sbjct: 469 IHCFTHKTGLVFGISVGSALFTMYSKCGYLEEAFHVFKNM 508

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: XP_038893557.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Benincasa hispida])

HSP 1 Score: 836 bits (2159), Expect = 1.08e-296
Identity = 410/460 (89.13%), Positives = 439/460 (95.43%), Query Frame = 0

Query: 2   LSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLH 61
           L+D+VKSRKCSLRNTKVLHAKLLRA LLHSNIYVSNSLLDCYSKSN++DHALKLFDTMLH
Sbjct: 64  LNDFVKSRKCSLRNTKVLHAKLLRANLLHSNIYVSNSLLDCYSKSNAMDHALKLFDTMLH 123

Query: 62  PNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQV 121
           PNVISWNI+IS FN+ F++LD+ RTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQV
Sbjct: 124 PNVISWNIIISGFNYKFLHLDTCRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQV 183

Query: 122 YSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGEN 181
           YSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGEN
Sbjct: 184 YSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGEN 243

Query: 182 FMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALI 241
            MALDL+NTMC GFLEPNSFTFSSVLTACAALE  EFGKRVQG+VIKCGGEDVFVETALI
Sbjct: 244 LMALDLFNTMCSGFLEPNSFTFSSVLTACAALEDLEFGKRVQGRVIKCGGEDVFVETALI 303

Query: 242 DLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSY 301
           D Y+KCG+ DEAVKIFLRMPIRNVVSWTAIISGFVQ NDYLMALKFF+DMRK GEEINSY
Sbjct: 304 DSYAKCGDPDEAVKIFLRMPIRNVVSWTAIISGFVQNNDYLMALKFFEDMRKAGEEINSY 363

Query: 302 TVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEM 361
           TVTSVLTACANPAMTKEA QLHSWIL+AGFSSHAVV AALINMYSKIGAIDLS+ VF EM
Sbjct: 364 TVTSVLTACANPAMTKEATQLHSWILKAGFSSHAVVAAALINMYSKIGAIDLSLMVFREM 423

Query: 362 DNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQI 421
           DN+RNLSSWTAMITSFAQNNDKE ASELF+KML+ES+GPDTFCTSSVLSVTDCITFGR+I
Sbjct: 424 DNQRNLSSWTAMITSFAQNNDKENASELFRKMLKESVGPDTFCTSSVLSVTDCITFGREI 483

Query: 422 HCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKLV 461
           HC+T KTGLIFD+SVGS+LFTMYSKCG+L+EAF + E ++
Sbjct: 484 HCYTLKTGLIFDVSVGSSLFTMYSKCGHLKEAFQVFENML 523

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: XP_022137435.1 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica charantia] >XP_022137436.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica charantia] >XP_022137437.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica charantia] >XP_022137439.1 pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica charantia])

HSP 1 Score: 799 bits (2064), Expect = 1.72e-282
Identity = 394/460 (85.65%), Positives = 429/460 (93.26%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LL+DYVKSRKCSL+NTKV+HAKLLRATLLHS+IYV+NSLLDCYSKS ++D+ALKLFD ML
Sbjct: 49  LLNDYVKSRKCSLKNTKVMHAKLLRATLLHSSIYVTNSLLDCYSKSGAMDNALKLFDKML 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           H NVISWNI+IS FN NF++L+SWRTFCRMHFLGFEPSEITYGSVLSACAA+QAPMFGKQ
Sbjct: 109 HLNVISWNIMISGFNQNFLFLESWRTFCRMHFLGFEPSEITYGSVLSACAAMQAPMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           +YSL VRNG FVNGYVRAGMIDLFAKDSSF DALRVF+DVDCENVVCWNAIVSAAVRNGE
Sbjct: 169 IYSLVVRNGSFVNGYVRAGMIDLFAKDSSFPDALRVFNDVDCENVVCWNAIVSAAVRNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           N +ALDL+NTMC GFLEPNSFTFSSVLTACAA+E  EFGKRVQG+VIKCGGEDVFVETAL
Sbjct: 229 NSVALDLFNTMCSGFLEPNSFTFSSVLTACAAVEDLEFGKRVQGRVIKCGGEDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           IDLY+KCG++DEAVK FL+MPIRNVVSWTAIISGFVQKND  MALK FKDMR LGEEINS
Sbjct: 289 IDLYAKCGDIDEAVKTFLQMPIRNVVSWTAIISGFVQKNDCFMALKVFKDMRNLGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVLTACANPAM KEAIQLHSWIL+AGF S+AVV +ALINMYSKIG IDLSM VF E
Sbjct: 349 YTVTSVLTACANPAMRKEAIQLHSWILKAGFLSYAVVVSALINMYSKIGTIDLSMMVFRE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           +D++RNLSSW AMITSFAQN DKEKA ELFQKML+ES+GPDTFCTSSVLSVTDCITFGRQ
Sbjct: 409 IDDQRNLSSWAAMITSFAQNMDKEKAIELFQKMLQESIGPDTFCTSSVLSVTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 460
           IHC+T KTGLIFD+SVGS+LFTMYSKCGYLEEAF   E +
Sbjct: 469 IHCYTLKTGLIFDVSVGSSLFTMYSKCGYLEEAFQFFENM 508

BLAST of Cp4.1LG20g00240 vs. ExPASy TrEMBL
Match: A0A6J1E7L2 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111431363 PE=4 SV=1)

HSP 1 Score: 909 bits (2348), Expect = 0.0
Identity = 448/461 (97.18%), Positives = 455/461 (98.70%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML
Sbjct: 49  LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           HPNVISWNILISSFNHNF+YLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ
Sbjct: 109 HPNVISWNILISSFNHNFLYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHD+ CENVVCWNAIVSAAVRNGE
Sbjct: 169 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDIHCENVVCWNAIVSAAVRNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           NFMALDLYNTMC G LEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL
Sbjct: 229 NFMALDLYNTMCRGLLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS
Sbjct: 289 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVLTACANPAMTKEAIQLHSWILRAG+SSHAVVGAALINMYSKIGAIDLSMTVFGE
Sbjct: 349 YTVTSVLTACANPAMTKEAIQLHSWILRAGYSSHAVVGAALINMYSKIGAIDLSMTVFGE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDN+RNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ
Sbjct: 409 MDNQRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKLV 461
           IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFH+ + + 
Sbjct: 469 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHVFKNMA 509

BLAST of Cp4.1LG20g00240 vs. ExPASy TrEMBL
Match: A0A6J1KIC5 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111495501 PE=4 SV=1)

HSP 1 Score: 905 bits (2339), Expect = 0.0
Identity = 447/460 (97.17%), Positives = 455/460 (98.91%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LLSDYVKSRKCSLR+TKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML
Sbjct: 49  LLSDYVKSRKCSLRHTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           HPNVISWNILISSFNHNF+YLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ
Sbjct: 109 HPNVISWNILISSFNHNFLYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFFVNGYVRAGMIDLFAK+SSFLDALRVF DVDCENVVCWNAIVSAAVRNGE
Sbjct: 169 VYSLAVRNGFFVNGYVRAGMIDLFAKESSFLDALRVFQDVDCENVVCWNAIVSAAVRNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           NFMALDLYNTMC GFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL
Sbjct: 229 NFMALDLYNTMCRGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS
Sbjct: 289 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE
Sbjct: 349 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDN+RNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ
Sbjct: 409 MDNQRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 460
           IHCFTHKTGL+F ISVGSALFTMYSKCGYLEEAFH+ + +
Sbjct: 469 IHCFTHKTGLVFGISVGSALFTMYSKCGYLEEAFHVFKNM 508

BLAST of Cp4.1LG20g00240 vs. ExPASy TrEMBL
Match: A0A6J1C6M8 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111008883 PE=4 SV=1)

HSP 1 Score: 799 bits (2064), Expect = 8.35e-283
Identity = 394/460 (85.65%), Positives = 429/460 (93.26%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LL+DYVKSRKCSL+NTKV+HAKLLRATLLHS+IYV+NSLLDCYSKS ++D+ALKLFD ML
Sbjct: 49  LLNDYVKSRKCSLKNTKVMHAKLLRATLLHSSIYVTNSLLDCYSKSGAMDNALKLFDKML 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           H NVISWNI+IS FN NF++L+SWRTFCRMHFLGFEPSEITYGSVLSACAA+QAPMFGKQ
Sbjct: 109 HLNVISWNIMISGFNQNFLFLESWRTFCRMHFLGFEPSEITYGSVLSACAAMQAPMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           +YSL VRNG FVNGYVRAGMIDLFAKDSSF DALRVF+DVDCENVVCWNAIVSAAVRNGE
Sbjct: 169 IYSLVVRNGSFVNGYVRAGMIDLFAKDSSFPDALRVFNDVDCENVVCWNAIVSAAVRNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           N +ALDL+NTMC GFLEPNSFTFSSVLTACAA+E  EFGKRVQG+VIKCGGEDVFVETAL
Sbjct: 229 NSVALDLFNTMCSGFLEPNSFTFSSVLTACAAVEDLEFGKRVQGRVIKCGGEDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           IDLY+KCG++DEAVK FL+MPIRNVVSWTAIISGFVQKND  MALK FKDMR LGEEINS
Sbjct: 289 IDLYAKCGDIDEAVKTFLQMPIRNVVSWTAIISGFVQKNDCFMALKVFKDMRNLGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVLTACANPAM KEAIQLHSWIL+AGF S+AVV +ALINMYSKIG IDLSM VF E
Sbjct: 349 YTVTSVLTACANPAMRKEAIQLHSWILKAGFLSYAVVVSALINMYSKIGTIDLSMMVFRE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           +D++RNLSSW AMITSFAQN DKEKA ELFQKML+ES+GPDTFCTSSVLSVTDCITFGRQ
Sbjct: 409 IDDQRNLSSWAAMITSFAQNMDKEKAIELFQKMLQESIGPDTFCTSSVLSVTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 460
           IHC+T KTGLIFD+SVGS+LFTMYSKCGYLEEAF   E +
Sbjct: 469 IHCYTLKTGLIFDVSVGSSLFTMYSKCGYLEEAFQFFENM 508
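
In these pairwise alignments the middle line encodes the comparison: a letter marks an identical residue, `+` marks a conservative (positively scoring) substitution, and a blank marks a mismatch or gap. As a rough sketch under that reading, the identity and positive counts for one block can be tallied as below; `midline_counts` is a hypothetical helper, and the sample strings are the first 60-column block of the Momordica charantia alignment above.

```python
def midline_counts(query, midline, sbjct):
    """Return (identities, positives, length) for one aligned block.

    Positives include identities, as in the BLAST summary line.
    """
    # Trailing blanks of the midline are often stripped in text exports.
    midline = midline.ljust(len(query))
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("aligned rows must have the same length")
    # Identical residues; gap characters never count as identities.
    identities = sum(
        1 for q, s in zip(query, sbjct) if q == s and q != "-"
    )
    # Every non-blank midline column is either an identity or a '+'.
    positives = sum(1 for column in midline if column != " ")
    return identities, positives, len(query)


query = "LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML"
midline = "LL+DYVKSRKCSL+NTKV+HAKLLRATLLHS+IYV+NSLLDCYSKS ++D+ALKLFD ML"
sbjct = "LLNDYVKSRKCSLKNTKVMHAKLLRATLLHSSIYVTNSLLDCYSKSGAMDNALKLFDKML"

identities, positives, length = midline_counts(query, midline, sbjct)
print(identities, positives, length)
```

Summing these per-block counts over all blocks of an HSP should reproduce the totals in its summary line.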

BLAST of Cp4.1LG20g00240 vs. ExPASy TrEMBL
Match: A0A1S3B4I2 (pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103485891 PE=4 SV=1)

HSP 1 Score: 708 bits (1827), Expect = 6.41e-247
Identity = 351/460 (76.30%), Positives = 400/460 (86.96%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LL+D+VK    SLRNTKVLHAK LR T    +IYVSNSLL CYSKSN++DHALKLFDT+L
Sbjct: 49  LLNDFVKLGNFSLRNTKVLHAKFLRVTP-RIDIYVSNSLLHCYSKSNAMDHALKLFDTIL 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           +PNVISWN +I+ FN+NF++LDS R FC MH+LGF+P+E+T GSVLSACAAIQA MFGKQ
Sbjct: 109 NPNVISWNTIITGFNNNFLHLDSLRIFCWMHYLGFKPNEVTCGSVLSACAAIQATMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVFHDVDC NVVCWNAIVSAAV NGE
Sbjct: 169 VYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
             MALDL+N MC  FLEPNSFTFSSVLTAC+AL+  EFGK VQG+VIKCGG DVFVETAL
Sbjct: 229 YLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKSVQGRVIKCGGGDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           + LY+KCG+MDEAVKIF +MPIRNVVSWT I+SGFVQ NDYLM +K F+D+RK+GEEINS
Sbjct: 289 VSLYAKCGDMDEAVKIFFQMPIRNVVSWTVIMSGFVQNNDYLMVIKIFEDLRKIGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVT++L ACANP M KEA QLHSWIL+AGFSS A V AALI MYSKIGAIDLS+ VF E
Sbjct: 349 YTVTTLLRACANPGMRKEATQLHSWILKAGFSSQAEVVAALIIMYSKIGAIDLSLMVFRE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDN RNLSSWTAMI S A+NNDKE+AS+LF+KMLRE M PD+ CTS++LS+TDCITFGRQ
Sbjct: 409 MDNHRNLSSWTAMILSLAKNNDKEEASDLFRKMLREKMEPDSVCTSTLLSLTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 460
           IHC+T KT LIF++SVGS+LFTMYSKCG+L+EAF + E +
Sbjct: 469 IHCYTLKTELIFNVSVGSSLFTMYSKCGHLKEAFQVFENM 507

BLAST of Cp4.1LG20g00240 vs. ExPASy TrEMBL
Match: A0A5D3BIJ5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G00800 PE=4 SV=1)

HSP 1 Score: 708 bits (1827), Expect = 4.82e-242
Identity = 351/460 (76.30%), Positives = 400/460 (86.96%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LL+D+VK    SLRNTKVLHAK LR T    +IYVSNSLL CYSKSN++DHALKLFDT+L
Sbjct: 88  LLNDFVKLGNFSLRNTKVLHAKFLRVTP-RIDIYVSNSLLHCYSKSNAMDHALKLFDTIL 147

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           +PNVISWN +I+ FN+NF++LDS R FC MH+LGF+P+E+T GSVLSACAAIQA MFGKQ
Sbjct: 148 NPNVISWNTIITGFNNNFLHLDSLRIFCWMHYLGFKPNEVTCGSVLSACAAIQATMFGKQ 207

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVFHDVDC NVVCWNAIVSAAV NGE
Sbjct: 208 VYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGE 267

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
             MALDL+N MC  FLEPNSFTFSSVLTAC+AL+  EFGK VQG+VIKCGG DVFVETAL
Sbjct: 268 YLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKSVQGRVIKCGGGDVFVETAL 327

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           + LY+KCG+MDEAVKIF +MPIRNVVSWT I+SGFVQ NDYLM +K F+D+RK+GEEINS
Sbjct: 328 VSLYAKCGDMDEAVKIFFQMPIRNVVSWTVIMSGFVQNNDYLMVIKIFEDLRKIGEEINS 387

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVT++L ACANP M KEA QLHSWIL+AGFSS A V AALI MYSKIGAIDLS+ VF E
Sbjct: 388 YTVTTLLRACANPGMRKEATQLHSWILKAGFSSQAEVVAALIIMYSKIGAIDLSLMVFRE 447

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDN RNLSSWTAMI S A+NNDKE+AS+LF+KMLRE M PD+ CTS++LS+TDCITFGRQ
Sbjct: 448 MDNHRNLSSWTAMILSLAKNNDKEEASDLFRKMLREKMEPDSVCTSTLLSLTDCITFGRQ 507

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 460
           IHC+T KT LIF++SVGS+LFTMYSKCG+L+EAF + E +
Sbjct: 508 IHCYTLKTELIFNVSVGSSLFTMYSKCGHLKEAFQVFENM 546

BLAST of Cp4.1LG20g00240 vs. TAIR 10
Match: AT1G74600.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 450.7 bits (1158), Expect = 1.5e-126
Identity = 224/454 (49.34%), Positives = 319/454 (70.26%), Query Frame = 0

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           +D   SR C+LR TK+L A LLR  LL  +++++ SLL  YS S S+  A KLFDT+  P
Sbjct: 54  NDQSNSRLCNLRTTKILQAHLLRRYLLPFDVFLTKSLLSWYSNSGSMADAAKLFDTIPQP 113

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
           +V+S NI+IS +  + ++ +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V 
Sbjct: 114 DVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSALQAPLFSELVC 173

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
              ++ G+F    V + +ID+F+K+  F DA +VF D    NV CWN I++ A+RN    
Sbjct: 174 CHTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAGALRNQNYG 233

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 242
              DL++ MC GF +P+S+T+SSVL ACA+LE   FGK VQ +VIKCG EDVFV TA++D
Sbjct: 234 AVFDLFHEMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAEDVFVCTAIVD 293

Query: 243 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 302
           LY+KCG M EA+++F R+P  +VVSWT ++SG+ + ND   AL+ FK+MR  G EIN+ T
Sbjct: 294 LYAKCGHMAEAMEVFSRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCT 353

Query: 303 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 362
           VTSV++AC  P+M  EA Q+H+W+ ++GF   + V AALI+MYSK G IDLS  VF ++D
Sbjct: 354 VTSVISACGRPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDLSEQVFEDLD 413

Query: 363 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 422
           + +  +    MITSF+Q+    KA  LF +ML+E +  D F   S+LSV DC+  G+Q+H
Sbjct: 414 DIQRQNIVNVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLNLGKQVH 473

Query: 423 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
            +T K+GL+ D++VGS+LFT+YSKCG LEE++ +
Sbjct: 474 GYTLKSGLVLDLTVGSSLFTLYSKCGSLEESYKL 507

BLAST of Cp4.1LG20g00240 vs. TAIR 10
Match: AT3G61170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 228.8 bits (582), Expect = 9.6e-60
Identity = 140/423 (33.10%), Positives = 242/423 (57.21%), Query Frame = 0

Query: 37  NSLLDCYSKSNSLDHALKLFDTMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFE 96
           N+++  YS S  L  A KLF +    N ISWN LIS +  +   ++++  F  M   G +
Sbjct: 63  NTMIVAYSNSRRLSDAEKLFRSNPVKNTISWNALISGYCKSGSKVEAFNLFWEMQSDGIK 122

Query: 97  PSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRV 156
           P+E T GSVL  C ++   + G+Q++   ++ GF ++  V  G++ ++A+     +A  +
Sbjct: 123 PNEYTLGSVLRMCTSLVLLLRGEQIHGHTIKTGFDLDVNVVNGLLAMYAQCKRISEAEYL 182

Query: 157 FHDVDCE-NVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEH 216
           F  ++ E N V W ++++   +NG  F A++ +  +     + N +TF SVLTACA++  
Sbjct: 183 FETMEGEKNNVTWTSMLTGYSQNGFAFKAIECFRDLRREGNQSNQYTFPSVLTACASVSA 242

Query: 217 PEFGKRVQGKVIKCGGE-DVFVETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISG 276
              G +V   ++K G + +++V++ALID+Y+KC EM+ A  +   M + +VVSW ++I G
Sbjct: 243 CRVGVQVHCCIVKSGFKTNIYVQSALIDMYAKCREMESARALLEGMEVDDVVSWNSMIVG 302

Query: 277 FVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACA-NPAMTKEAIQLHSWILRAGFSS 336
            V++     AL  F  M +   +I+ +T+ S+L   A +    K A   H  I++ G+++
Sbjct: 303 CVRQGLIGEALSMFGRMHERDMKIDDFTIPSILNCFALSRTEMKIASSAHCLIVKTGYAT 362

Query: 337 HAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKM 396
           + +V  AL++MY+K G +D ++ VF  M  K ++ SWTA++T    N   ++A +LF  M
Sbjct: 363 YKLVNNALVDMYAKRGIMDSALKVFEGMIEK-DVISWTALVTGNTHNGSYDEALKLFCNM 422

Query: 397 LRESMGPDTFCTSSVLSVTDCIT---FGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYL 454
               + PD   T+SVLS +  +T   FG+Q+H    K+G    +SV ++L TMY+KCG L
Sbjct: 423 RVGGITPDKIVTASVLSASAELTLLEFGQQVHGNYIKSGFPSSLSVNNSLVTMYTKCGSL 482

BLAST of Cp4.1LG20g00240 vs. TAIR 10
Match: AT5G27110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 219.2 bits (557), Expect = 7.6e-57
Identity = 142/464 (30.60%), Positives = 248/464 (53.45%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN---SLDHALKLFD 60
           LL +   S K SLR  K++H ++L    L  ++ +  SL++ Y       S  H  + FD
Sbjct: 9   LLRECTNSTK-SLRRIKLVHQRILTLG-LRRDVVLCKSLINVYFTCKDHCSARHVFENFD 68

Query: 61  TMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGF-EPSEITYGSVLSACAAIQAPM 120
             +  +V  WN L+S ++ N M+ D+   F R+       P   T+ +V+ A  A+    
Sbjct: 69  --IRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREF 128

Query: 121 FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAV 180
            G+ +++L V++G+  +  V + ++ ++AK + F ++L+VF ++   +V  WN ++S   
Sbjct: 129 LGRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFY 188

Query: 181 RNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGE-DVF 240
           ++GE   AL+L+  M     EPNS + +  ++AC+ L   E GK +  K +K G E D +
Sbjct: 189 QSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEY 248

Query: 241 VETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLG 300
           V +AL+D+Y KC  ++ A ++F +MP +++V+W ++I G+V K D    ++    M   G
Sbjct: 249 VNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEG 308

Query: 301 EEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSM 360
              +  T+TS+L AC+          +H +++R+  ++   V  +LI++Y K G  +L+ 
Sbjct: 309 TRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAE 368

Query: 361 TVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL---SVT 420
           TVF +   K    SW  MI+S+    +  KA E++ +M+   + PD    +SVL   S  
Sbjct: 369 TVFSK-TQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQL 428

Query: 421 DCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
             +  G+QIH    ++ L  D  + SAL  MYSKCG  +EAF I
Sbjct: 429 AALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRI 467

BLAST of Cp4.1LG20g00240 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 218.8 bits (556), Expect = 9.9e-57
Identity = 160/544 (29.41%), Positives = 260/544 (47.79%), Query Frame = 0

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LL   +KS K S    + +HA ++++    + I++ N L+D YSK  SL+   ++FD M 
Sbjct: 25  LLDSCIKS-KLSAIYVRYVHASVIKSG-FSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMP 84

Query: 61  HPNVISWNILISSFNHNFMYLD---------------SWRT-----------------FC 120
             N+ +WN +++       +LD               +W +                 F 
Sbjct: 85  QRNIYTWNSVVTGLT-KLGFLDEADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFA 144

Query: 121 RMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDS 180
            MH  GF  +E ++ SVLSAC+ +     G QV+SL  ++ F  + Y+ + ++D+++K  
Sbjct: 145 MMHKEGFVLNEYSFASVLSACSGLNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCG 204

Query: 181 SFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLT 240
           +  DA RVF ++   NVV WN++++   +NG    ALD++  M    +EP+  T +SV++
Sbjct: 205 NVNDAQRVFDEMGDRNVVSWNSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVIS 264

Query: 241 ACAALEHPEFGKRVQGKVIKCG--GEDVFVETALIDLYSKCGEMDEAVKIFLRMPI---- 300
           ACA+L   + G+ V G+V+K      D+ +  A +D+Y+KC  + EA  IF  MPI    
Sbjct: 265 ACASLSAIKVGQEVHGRVVKNDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVI 324

Query: 301 ---------------------------RNVVSWTAIISGFVQKNDYLMALKFFKDMRKLG 360
                                      RNVVSW A+I+G+ Q  +   AL  F  +++  
Sbjct: 325 AETSMISGYAMAASTKAARLMFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRES 384

Query: 361 EEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHA------VVGAALINMYSKIG 420
                Y+  ++L ACA+ A     +Q H  +L+ GF   +       VG +LI+MY K G
Sbjct: 385 VCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCG 444

Query: 421 AIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL 467
            ++    VF +M  +R+  SW AMI  FAQN    +A ELF++ML     PD      VL
Sbjct: 445 CVEEGYLVFRKM-MERDCVSWNAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVL 504

BLAST of Cp4.1LG20g00240 vs. TAIR 10
Match: AT1G03540.1 (Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 211.8 bits (538), Expect = 1.2e-54
Identity = 140/458 (30.57%), Positives = 241/458 (52.62%), Query Frame = 0

Query: 20  HAKLLRATLLHSNIYVSNSLLDCYSK-SNSLDHALKLFDTMLHPNVISWNILISSFNHNF 79
           HA ++++  L ++  V NSLL  Y K    +    ++FD     + ISW  ++S +    
Sbjct: 84  HAHVVKSG-LETDRNVGNSLLSLYFKLGPGMRETRRVFDGRFVKDAISWTSMMSGYVTGK 143

Query: 80  MYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA 139
            ++ +   F  M   G + +E T  S + AC+ +     G+  + + + +GF  N ++ +
Sbjct: 144 EHVKALEVFVEMVSFGLDANEFTLSSAVKACSELGEVRLGRCFHGVVITHGFEWNHFISS 203

Query: 140 GMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHG-FLE 199
            +  L+  +   +DA RVF ++   +V+CW A++SA  +N     AL L+  M  G  L 
Sbjct: 204 TLAYLYGVNREPVDARRVFDEMPEPDVICWTAVLSAFSKNDLYEEALGLFYAMHRGKGLV 263

Query: 200 PNSFTFSSVLTACAALEHPEFGKRVQGKVIKCG-GEDVFVETALIDLYSKCGEMDEAVKI 259
           P+  TF +VLTAC  L   + GK + GK+I  G G +V VE++L+D+Y KCG + EA ++
Sbjct: 264 PDGSTFGTVLTACGNLRRLKQGKEIHGKLITNGIGSNVVVESSLLDMYGKCGSVREARQV 323

Query: 260 FLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACANPAMT 319
           F  M  +N VSW+A++ G+ Q  ++  A++ F++M    EE + Y   +VL ACA  A  
Sbjct: 324 FNGMSKKNSVSWSALLGGYCQNGEHEKAIEIFREM----EEKDLYCFGTVLKACAGLAAV 383

Query: 320 KEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMITS 379
           +   ++H   +R G   + +V +ALI++Y K G ID +  V+ +M + RN+ +W AM+++
Sbjct: 384 RLGKEIHGQYVRRGCFGNVIVESALIDLYGKSGCIDSASRVYSKM-SIRNMITWNAMLSA 443

Query: 380 FAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSV---TDCITFGRQIHCFTHKT-GLIF 439
            AQN   E+A   F  M+++ + PD     ++L+    T  +  GR       K+ G+  
Sbjct: 444 LAQNGRGEEAVSFFNDMVKKGIKPDYISFIAILTACGHTGMVDEGRNYFVLMAKSYGIKP 503

Query: 440 DISVGSALFTMYSKCGYLEEAFHISEKLVGQAESRRKA 471
                S +  +  + G  EEA    E L+ +AE R  A
Sbjct: 504 GTEHYSCMIDLLGRAGLFEEA----ENLLERAECRNDA 531

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9CA56 | 2.2e-125 | 49.34 | Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidop...
O04659 | 1.1e-55 | 30.60 | Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana OX...
Q9SIT7 | 1.4e-55 | 29.41 | Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX...
Q9LR69 | 1.7e-53 | 30.57 | Pentatricopeptide repeat-containing protein At1g03540 OS=Arabidopsis thaliana OX...
Q9SVP7 | 2.9e-53 | 29.86 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
XP_023519257.1 | 0.0 | 98.91 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita ...
XP_022923751.1 | 0.0 | 97.18 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita ...
XP_023001341.1 | 0.0 | 97.17 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucurbita ...
XP_038893557.1 | 1.08e-296 | 89.13 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Benincasa ...
XP_022137435.1 | 1.72e-282 | 85.65 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Momordica ...

Match Name | E-value | Identity (%) | Description
A0A6J1E7L2 | 0.0 | 97.18 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucurbit...
A0A6J1KIC5 | 0.0 | 97.17 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucurbit...
A0A6J1C6M8 | 8.35e-283 | 85.65 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Momordic...
A0A1S3B4I2 | 6.41e-247 | 76.30 | pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Cucumis ...
A0A5D3BIJ5 | 4.82e-242 | 76.30 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity (%) | Description
AT1G74600.1 | 1.5e-126 | 49.34 | pentatricopeptide (PPR) repeat-containing protein
AT3G61170.1 | 9.6e-60 | 33.10 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G27110.1 | 7.6e-57 | 30.60 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G13600.1 | 9.9e-57 | 29.41 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G03540.1 | 1.2e-54 | 30.57 | Pentatricopeptide repeat (PPR-like) superfamily protein
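
When ranking hits from summaries like these, E-values are often compared on a negative log10 scale. An E-value printed as 0.0 has underflowed the report's precision and cannot be passed to a logarithm directly. The sketch below is one possible way to handle that; `evalue_strength` and its `cap` parameter are hypothetical names, and the accessions and E-values are the TAIR 10 hits reported above.

```python
import math


def evalue_strength(evalue_text, cap=math.inf):
    """Return -log10(E) for an E-value string; a reported 0.0 maps to `cap`."""
    evalue = float(evalue_text)
    if evalue == 0.0:
        # "0.0" means "smaller than the report can print", not exactly zero.
        return cap
    return -math.log10(evalue)


# (accession, E-value as printed) for the TAIR 10 hits of this gene.
hits = [
    ("AT3G61170.1", "9.6e-60"),
    ("AT1G03540.1", "1.2e-54"),
    ("AT1G74600.1", "1.5e-126"),
    ("AT2G13600.1", "9.9e-57"),
    ("AT5G27110.1", "7.6e-57"),
]

# Strongest hit first: larger -log10(E) means a smaller E-value.
ranked = sorted(hits, key=lambda hit: evalue_strength(hit[1]), reverse=True)
for accession, evalue_text in ranked:
    print(accession, evalue_text, round(evalue_strength(evalue_text), 1))
```

Keeping the E-value as text until it is needed avoids losing the distinction between a printed 0.0 and a very small but non-zero value.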
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
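IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 266..299; e-value: 6.5E-6; score: 24.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 369..402; e-value: 5.8E-6; score: 24.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 238..264; e-value: 0.0014; score: 16.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 37..60; e-value: 6.4E-4; score: 17.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 65..99; e-value: 3.1E-4; score: 18.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 369..396; e-value: 6.6E-6; score: 26.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 340..364; e-value: 0.9; score: 9.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 37..60; e-value: 7.5E-4; score: 19.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 164..211; e-value: 8.3E-10; score: 38.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 263..312; e-value: 2.4E-13; score: 50.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 62..110; e-value: 2.3E-8; score: 34.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 264..298; score: 9.97484
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 32..66; score: 9.240434
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 366..400; score: 10.818861
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 233..263; score: 8.527949
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 230..315; e-value: 2.7E-20; score: 74.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 119..223; e-value: 7.2E-13; score: 50.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 4..118; e-value: 4.7E-18; score: 67.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 316..419; e-value: 5.3E-16; score: 60.9
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 415..458
None | No IPR available | PANTHER | PTHR24015:SF878 | OS09G0413300 PROTEIN | coord: 415..458
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 23..318
None | No IPR available | PANTHER | PTHR24015:SF878 | OS09G0413300 PROTEIN | coord: 23..318
None | No IPR available | PANTHER | PTHR24015:SF878 | OS09G0413300 PROTEIN | coord: 321..412
None | No IPR available | PANTHER | PTHR24015 | OS07G0578800 PROTEIN-RELATED | coord: 321..412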
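
The `coord:` entries give 1-based, inclusive residue ranges on the protein, and ranges from one source can touch or overlap. As an illustrative sketch (not part of the InterPro analysis itself), the ranges can be extracted and merged to see how many residues a signature covers; `parse_coords` and `merge_ranges` are hypothetical helper names, and the sample values are the four PROSITE PS51375 matches listed above.

```python
import re

# Matches entries such as "coord: 264..298" anywhere in a line.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(lines):
    """Collect (start, end) residue ranges from annotation lines."""
    return [
        (int(start), int(end))
        for line in lines
        for start, end in COORD.findall(line)
    ]


def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# PROSITE PS51375 (PPR) matches reported for this protein.
prosite_lines = [
    "coord: 264..298",
    "coord: 32..66",
    "coord: 366..400",
    "coord: 233..263",
]

merged = merge_ranges(parse_coords(prosite_lines))
covered = sum(end - start + 1 for start, end in merged)
print(merged)   # [(32, 66), (233, 298), (366, 400)]
print(covered)  # 136
```

Treating adjacent ranges (263 followed by 264) as one block is a design choice of this sketch; drop the `+ 1` in `merge_ranges` to keep them separate.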

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG20g00240.1 | Cp4.1LG20g00240.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006355 | regulation of transcription, DNA-templated
cellular_component | GO:0005634 | nucleus
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding