Cp4.1LG20g00240 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG20g00240
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG20 : 123242 .. 126262 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
TTGCTCAGTGATTATGTAAAATCCAGAAAATGCTCTTTGAGGAACACCAAAGTTCTACATGCCAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTGGATTGCTACTCAAAGTCTAACTCTCTGGACCATGCACTCAAGCTGTTTGATACAATGCTCCACCCAAATGTCATTTCTTGGAATATCCTTATCTCCAGTTTCAACCACAACTTCATGTATTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGAGATAACGTATGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTCTATTCACTTGCTGTGAGAAATGGGTTTTTTGTTAATGGTTATGTTCGAGCAGGGATGATCGATTTGTTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAACGTGGTGTGCTGGAATGCTATTGTGTCCGCAGCTGTAAGGAATGGGGAGAATTTTATGGCTTTGGATCTTTACAACACAATGTGTCATGGGTTCTTGGAGCCTAATAGTTTCACGTTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAACATCCTGAATTTGGGAAAAGAGTTCAAGGGAAAGTGATTAAATGTGGCGGAGAAGATGTTTTTGTAGAGACAGCCCTCATTGATTTGTATAGCAAGTGTGGAGAAATGGATGAAGCTGTTAAAATATTCTTGCGGATGCCCATTCGCAATGTGGTCTCATGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTACTTAATGGCCCTCAAGTTTTTCAAAGATATGAGAAAATTGGGGGAGGAAATCAATAGCTATACAGTCACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGCAATCCAACTCCACTCCTGGATTTTAAGAGCTGGTTTTTCATCTCATGCGGTGGTGGGAGCTGCTTTAATTAACATGTATTCGAAAATAGGAGCTATTGATCTTTCTATGACTGTTTTCGGAGAGATGGACAATAAAAGGAATCTCAGTTCTTGGACAGCTATGATTACCTCATTTGCACAGAACAATGATAAAGAGAAAGCAAGTGAATTGTTCCAAAAAATGTTACGTGAAAGTATGGGACCAGATACATTTTGCACCTCTAGTGTCTTGAGTGTGACCGACTGTATTACCTTTGGGAGACAGATTCACTGCTTCACGCATAAAACTGGATTAATATTTGACATTTCTGTCGGCAGTGCTCTTTTCACAATGTATTCCAAATGTGGCTATCTAGAGGAAGCTTTTCATGTTTTTAAAAACATGCCAAAGAAGGACAATATTTCATGGGCATCGATGATGTCCTGTTTCTCAGAACATGGTTATGCAAAAGAGGGCATCCAATTATTTAGAGAAATGTTGTTTGAAGAATATGTTCCTGATTATATGATTTTAAGTACAGTCCTAAATGCATGTTCTGTTCTTCATTCTATTCAAATAGGCAGAGAGATTCATTGTTATTCTGTTCGTTTGGGTCTGGACAAAGATGTAGCAATTGGGGGTTCGCTTGTGACTATGTACTCAAAATGCGGCAACCTGGAGATGGCTCGGAGGGTGTTTGAAACATTGCCCGAGAAAGATAATATTGCATGCTCTTCGTTGGTTTCAGGATATGCTCAACACAAGTGCATCAAAGAGACAATTTTGCTATTCCAAGATCTACTGGAGGCTGGCTTAGCCATCGATCCCTTTTCAATCTCATCCATACTGGGAGCAATTGCGCTTTTAAATAGGCCTGGTATTGGGACTCAACTCCATGCAATCATTACGAAAGTAGGCTTGGAGAAAGATGTTTCTGTTGGGAGTTCGCTAGTAATGGTATACTCCAAATGTGGAAGTATAGAAGACTGCTGCAAAGCATTTGAGCAGATTGGAAAGCCTGATTTGATAGGTTGGACAGCCATGATTGTGAGTTATGCCCAGCATGGGAAAGGTGCTGAAGCTTTATGTGTCTATGAACTTATGAAGAAAGAAGGAATCAAGCCTGATCCAGTCACCTTTGTTGGGGTTTTGTCTGCTTGTAGCCATAATGGTTTGGTGGATGAAGCTTATTTCCACCTTAATTCGATGGTGAAAGACTATGGTATACAACCGGGACATCGACATTATGCTTGTATGGTTGATCTTCTTGGTCGGTGTGGGCAACTGAAAAGGGCAGAAGAACTGATTAACAATATGCCTATTGAACCTGATGCTCTCATTTGGGGAACTCTTCTCGCTGCCTGTAAAGTACATGGAGATATTGAACTTGGAAAACTAGCGGCAAGAAAGGTGATGGAGTTGAAGCCAAGTGACACTGGGGCGTATGTCTCTCTTTCCAACATCTGTGCTGATATGGGCCTGTGGGAAGAGGTCCTGAACGTTAGAAGCCTTATGAAGGGAGCTGGAGTGACGAAGGAACCTGGTTGGAGCTTGCTGTAAGAAGCTATTGTTCATAGTTGGTAAGTCCTTATAGCTAAGGATTTTCCCTTTTTCTCTTTTTTTCTTTTTTTTGTTTTGTTTTGTTTTTGTTTTTGTTTGCATGTCTTGATTTTATTTAGAACGTCCTTAGCATGAATACCAATGAACTGCAGTATGAATACTCGAAGAGTTAGCAAAACAGCAAAAGATGTTTAGCTGACCTTATTGATTCTGATTTCATCCCATGGTCTAATCCGTAAGTTTTGATAATTATGACAGATCTCAGAAAAACTTGTGGGACAAGCTGAGTCCAGAAGGAAGGCTCTTGGTCAGTTTTAGAATATTTTTTTATGAGTTTCTCGGTGATATGGTATAAGAGAGGAAGTCTTTTTGTTCTATATCCACCTCAGTACCTCATGTTTGGTCCTTAAATCAGAAGACAGGAATGCAAGCAACAGACATCGTGACTTCCTGATCAGGCTGGATACCGGAAGTGACTCAAATTGA

mRNA sequence

TTGCTCAGTGATTATGTAAAATCCAGAAAATGCTCTTTGAGGAACACCAAAGTTCTACATGCCAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTGGATTGCTACTCAAAGTCTAACTCTCTGGACCATGCACTCAAGCTGTTTGATACAATGCTCCACCCAAATGTCATTTCTTGGAATATCCTTATCTCCAGTTTCAACCACAACTTCATGTATTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGAGATAACGTATGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTCTATTCACTTGCTGTGAGAAATGGGTTTTTTGTTAATGGTTATGTTCGAGCAGGGATGATCGATTTGTTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAACGTGGTGTGCTGGAATGCTATTGTGTCCGCAGCTGTAAGGAATGGGGAGAATTTTATGGCTTTGGATCTTTACAACACAATGTGTCATGGGTTCTTGGAGCCTAATAGTTTCACGTTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAACATCCTGAATTTGGGAAAAGAGTTCAAGGGAAAGTGATTAAATGTGGCGGAGAAGATGTTTTTGTAGAGACAGCCCTCATTGATTTGTATAGCAAGTGTGGAGAAATGGATGAAGCTGTTAAAATATTCTTGCGGATGCCCATTCGCAATGTGGTCTCATGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTACTTAATGGCCCTCAAGTTTTTCAAAGATATGAGAAAATTGGGGGAGGAAATCAATAGCTATACAGTCACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGCAATCCAACTCCACTCCTGGATTTTAAGAGCTGGTTTTTCATCTCATGCGGTGGTGGGAGCTGCTTTAATTAACATGTATTCGAAAATAGGAGCTATTGATCTTTCTATGACTGTTTTCGGAGAGATGGACAATAAAAGGAATCTCAGTTCTTGGACAGCTATGATTACCTCATTTGCACAGAACAATGATAAAGAGAAAGCAAGTGAATTGTTCCAAAAAATGTTACGTGAAAGTATGGGACCAGATACATTTTGCACCTCTAGTGTCTTGAGTGTGACCGACTGTATTACCTTTGGGAGACAGATTCACTGCTTCACGCATAAAACTGGATTAATATTTGACATTTCTGTCGGCAGTGCTCTTTTCACAATGTATTCCAAATGTGGCTATCTAGAGGAAGCTTTTCATATCTCAGAAAAACTTGTGGGACAAGCTGAGTCCAGAAGGAAGGCTCTTGAAGACAGGAATGCAAGCAACAGACATCGTGACTTCCTGATCAGGCTGGATACCGGAAGTGACTCAAATTGA

Coding sequence (CDS)

TTGCTCAGTGATTATGTAAAATCCAGAAAATGCTCTTTGAGGAACACCAAAGTTCTACATGCCAAGTTACTCCGAGCAACTCTTCTTCATTCCAATATCTATGTTTCAAATTCTTTGCTGGATTGCTACTCAAAGTCTAACTCTCTGGACCATGCACTCAAGCTGTTTGATACAATGCTCCACCCAAATGTCATTTCTTGGAATATCCTTATCTCCAGTTTCAACCACAACTTCATGTATTTGGATTCGTGGAGAACATTTTGCAGGATGCATTTCCTGGGTTTTGAACCCAGCGAGATAACGTATGGGAGTGTTTTATCTGCTTGTGCTGCCATTCAAGCCCCAATGTTTGGTAAGCAGGTCTATTCACTTGCTGTGAGAAATGGGTTTTTTGTTAATGGTTATGTTCGAGCAGGGATGATCGATTTGTTTGCAAAAGATTCTAGTTTTCTGGATGCTCTAAGGGTGTTTCATGATGTTGATTGTGAGAACGTGGTGTGCTGGAATGCTATTGTGTCCGCAGCTGTAAGGAATGGGGAGAATTTTATGGCTTTGGATCTTTACAACACAATGTGTCATGGGTTCTTGGAGCCTAATAGTTTCACGTTTTCTAGTGTTCTAACTGCGTGTGCTGCACTTGAACATCCTGAATTTGGGAAAAGAGTTCAAGGGAAAGTGATTAAATGTGGCGGAGAAGATGTTTTTGTAGAGACAGCCCTCATTGATTTGTATAGCAAGTGTGGAGAAATGGATGAAGCTGTTAAAATATTCTTGCGGATGCCCATTCGCAATGTGGTCTCATGGACTGCCATAATATCTGGTTTTGTGCAGAAGAATGATTACTTAATGGCCCTCAAGTTTTTCAAAGATATGAGAAAATTGGGGGAGGAAATCAATAGCTATACAGTCACTAGCGTGTTAACTGCATGTGCTAATCCAGCCATGACAAAAGAAGCAATCCAACTCCACTCCTGGATTTTAAGAGCTGGTTTTTCATCTCATGCGGTGGTGGGAGCTGCTTTAATTAACATGTATTCGAAAATAGGAGCTATTGATCTTTCTATGACTGTTTTCGGAGAGATGGACAATAAAAGGAATCTCAGTTCTTGGACAGCTATGATTACCTCATTTGCACAGAACAATGATAAAGAGAAAGCAAGTGAATTGTTCCAAAAAATGTTACGTGAAAGTATGGGACCAGATACATTTTGCACCTCTAGTGTCTTGAGTGTGACCGACTGTATTACCTTTGGGAGACAGATTCACTGCTTCACGCATAAAACTGGATTAATATTTGACATTTCTGTCGGCAGTGCTCTTTTCACAATGTATTCCAAATGTGGCTATCTAGAGGAAGCTTTTCATATCTCAGAAAAACTTGTGGGACAAGCTGAGTCCAGAAGGAAGGCTCTTGAAGACAGGAATGCAAGCAACAGACATCGTGACTTCCTGATCAGGCTGGATACCGGAAGTGACTCAAATTGA

Protein sequence

LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKLVGQAESRRKALEDRNASNRHRDFLIRLDTGSDSN
BLAST of Cp4.1LG20g00240 vs. Swiss-Prot
Match: PP121_ARATH (Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidopsis thaliana GN=PCMP-E69 PE=3 SV=1)

HSP 1 Score: 450.7 bits (1158), Expect = 2.1e-125
Identity = 224/454 (49.34%), Postives = 319/454 (70.26%), Query Frame = 1

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           +D   SR C+LR TK+L A LLR  LL  +++++ SLL  YS S S+  A KLFDT+  P
Sbjct: 54  NDQSNSRLCNLRTTKILQAHLLRRYLLPFDVFLTKSLLSWYSNSGSMADAAKLFDTIPQP 113

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
           +V+S NI+IS +  + ++ +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V 
Sbjct: 114 DVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSALQAPLFSELVC 173

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
              ++ G+F    V + +ID+F+K+  F DA +VF D    NV CWN I++ A+RN    
Sbjct: 174 CHTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAGALRNQNYG 233

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 242
              DL++ MC GF +P+S+T+SSVL ACA+LE   FGK VQ +VIKCG EDVFV TA++D
Sbjct: 234 AVFDLFHEMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAEDVFVCTAIVD 293

Query: 243 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 302
           LY+KCG M EA+++F R+P  +VVSWT ++SG+ + ND   AL+ FK+MR  G EIN+ T
Sbjct: 294 LYAKCGHMAEAMEVFSRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCT 353

Query: 303 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 362
           VTSV++AC  P+M  EA Q+H+W+ ++GF   + V AALI+MYSK G IDLS  VF ++D
Sbjct: 354 VTSVISACGRPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDLSEQVFEDLD 413

Query: 363 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 422
           + +  +    MITSF+Q+    KA  LF +ML+E +  D F   S+LSV DC+  G+Q+H
Sbjct: 414 DIQRQNIVNVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLNLGKQVH 473

Query: 423 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
            +T K+GL+ D++VGS+LFT+YSKCG LEE++ +
Sbjct: 474 GYTLKSGLVLDLTVGSSLFTLYSKCGSLEESYKL 507

BLAST of Cp4.1LG20g00240 vs. Swiss-Prot
Match: PP398_ARATH (Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana GN=PCMP-E14 PE=2 SV=2)

HSP 1 Score: 219.5 bits (558), Expect = 7.9e-56
Identity = 142/464 (30.60%), Postives = 248/464 (53.45%), Query Frame = 1

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN---SLDHALKLFD 60
           LL +   S K SLR  K++H ++L   L   ++ +  SL++ Y       S  H  + FD
Sbjct: 9   LLRECTNSTK-SLRRIKLVHQRILTLGL-RRDVVLCKSLINVYFTCKDHCSARHVFENFD 68

Query: 61  TMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGF-EPSEITYGSVLSACAAIQAPM 120
             +  +V  WN L+S ++ N M+ D+   F R+       P   T+ +V+ A  A+    
Sbjct: 69  --IRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREF 128

Query: 121 FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAV 180
            G+ +++L V++G+  +  V + ++ ++AK + F ++L+VF ++   +V  WN ++S   
Sbjct: 129 LGRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFY 188

Query: 181 RNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGE-DVF 240
           ++GE   AL+L+  M     EPNS + +  ++AC+ L   E GK +  K +K G E D +
Sbjct: 189 QSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEY 248

Query: 241 VETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLG 300
           V +AL+D+Y KC  ++ A ++F +MP +++V+W ++I G+V K D    ++    M   G
Sbjct: 249 VNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEG 308

Query: 301 EEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSM 360
              +  T+TS+L AC+          +H +++R+  ++   V  +LI++Y K G  +L+ 
Sbjct: 309 TRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAE 368

Query: 361 TVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL---SVT 420
           TVF +   K    SW  MI+S+    +  KA E++ +M+   + PD    +SVL   S  
Sbjct: 369 TVFSK-TQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQL 428

Query: 421 DCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
             +  G+QIH    ++ L  D  + SAL  MYSKCG  +EAF I
Sbjct: 429 AALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRI 467

BLAST of Cp4.1LG20g00240 vs. Swiss-Prot
Match: PP172_ARATH (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 219.2 bits (557), Expect = 1.0e-55
Identity = 139/460 (30.22%), Postives = 248/460 (53.91%), Query Frame = 1

Query: 6   VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVI 65
           V +  C     + LH + ++   L  ++ V  SL+D Y K ++     K+FD M   NV+
Sbjct: 102 VSATLCDELFGRQLHCQCIKFGFL-DDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVV 161

Query: 66  SWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLA 125
           +W  LIS +  N M  +    F RM   G +P+  T+ + L   A       G QV+++ 
Sbjct: 162 TWTTLISGYARNSMNDEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVV 221

Query: 126 VRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMAL 185
           V+NG      V   +I+L+ K  +   A  +F   + ++VV WN+++S    NG +  AL
Sbjct: 222 VKNGLDKTIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEAL 281

Query: 186 DLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGE-DVFVETALIDLY 245
            ++ +M   ++  +  +F+SV+  CA L+   F +++   V+K G   D  + TAL+  Y
Sbjct: 282 GMFYSMRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAY 341

Query: 246 SKCGEMDEAVKIFLRMP-IRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTV 305
           SKC  M +A+++F  +  + NVVSWTA+ISGF+Q +    A+  F +M++ G   N +T 
Sbjct: 342 SKCTAMLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTY 401

Query: 306 TSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDN 365
           + +LTA   P ++    ++H+ +++  +   + VG AL++ Y K+G ++ +  VF  +D+
Sbjct: 402 SVILTAL--PVISPS--EVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDD 461

Query: 366 KRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITF----GR 425
           K ++ +W+AM+  +AQ  + E A ++F ++ +  + P+ F  SS+L+V          G+
Sbjct: 462 K-DIVAWSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGK 521

Query: 426 QIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEK 460
           Q H F  K+ L   + V SAL TMY+K G +E A  + ++
Sbjct: 522 QFHGFAIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKR 555

BLAST of Cp4.1LG20g00240 vs. Swiss-Prot
Match: PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 214.2 bits (544), Expect = 3.3e-54
Identity = 131/451 (29.05%), Postives = 236/451 (52.33%), Query Frame = 1

Query: 19  LHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVISWNILISSFNHNF 78
           LH+++ +        +++  L+  Y K  SLD A K+FD M      +WN +I ++  N 
Sbjct: 102 LHSRIFKTFPSFELDFLAGKLVFMYGKCGSLDDAEKVFDEMPDRTAFAWNTMIGAYVSNG 161

Query: 79  MYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA 138
               +   +  M   G      ++ ++L ACA ++    G +++SL V+ G+   G++  
Sbjct: 162 EPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIRSGSELHSLLVKLGYHSTGFIVN 221

Query: 139 GMIDLFAKDSSFLDALRVFHDVDCE-NVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLE 198
            ++ ++AK+     A R+F     + + V WN+I+S+   +G++   L+L+  M      
Sbjct: 222 ALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYSTSGKSLETLELFREMHMTGPA 281

Query: 199 PNSFTFSSVLTACAALEHPEFGKRVQGKVIKCG--GEDVFVETALIDLYSKCGEMDEAVK 258
           PNS+T  S LTAC    + + GK +   V+K      +++V  ALI +Y++CG+M +A +
Sbjct: 282 PNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELYVCNALIAMYTRCGKMPQAER 341

Query: 259 IFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACANPAM 318
           I  +M   +VV+W ++I G+VQ   Y  AL+FF DM   G + +  ++TS++ A    + 
Sbjct: 342 ILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSN 401

Query: 319 TKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMIT 378
               ++LH+++++ G+ S+  VG  LI+MYSK          F  M +K +L SWT +I 
Sbjct: 402 LLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGRAFLRMHDK-DLISWTTVIA 461

Query: 379 SFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL---SVTDCITFGRQIHCFTHKTGLIF 438
            +AQN+   +A ELF+ + ++ M  D     S+L   SV   +   ++IHC   + GL+ 
Sbjct: 462 GYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLKSMLIVKEIHCHILRKGLL- 521

Query: 439 DISVGSALFTMYSKCGYLEEAFHISEKLVGQ 464
           D  + + L  +Y KC  +  A  + E + G+
Sbjct: 522 DTVIQNELVDVYGKCRNMGYATRVFESIKGK 550

BLAST of Cp4.1LG20g00240 vs. Swiss-Prot
Match: PP181_ARATH (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 3.3e-54
Identity = 135/454 (29.74%), Postives = 244/454 (53.74%), Query Frame = 1

Query: 8   SRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVISW 67
           S++ +L   + +H +++R T   + I  +N L++ Y+K   L  A  +F+ ++  +V+SW
Sbjct: 25  SQQRNLVAGRAVHGQIIR-TGASTCIQHANVLVNFYAKCGKLAKAHSIFNAIICKDVVSW 84

Query: 68  NILISSFNHNFMYLDSW---RTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSL 127
           N LI+ ++ N     S+   + F  M      P+  T   +  A +++Q+   G+Q ++L
Sbjct: 85  NSLITGYSQNGGISSSYTVMQLFREMRAQDILPNAYTLAGIFKAESSLQSSTVGRQAHAL 144

Query: 128 AVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMA 187
            V+   F + YV   ++ ++ K     D L+VF  +   N   W+ +VS     G    A
Sbjct: 145 VVKMSSFGDIYVDTSLVGMYCKAGLVEDGLKVFAYMPERNTYTWSTMVSGYATRGRVEEA 204

Query: 188 LDLYNTMCHGFLE--PNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGED-VFVETALI 247
           + ++N       E   + + F++VL++ AA  +   G+++    IK G    V +  AL+
Sbjct: 205 IKVFNLFLREKEEGSDSDYVFTAVLSSLAATIYVGLGRQIHCITIKNGLLGFVALSNALV 264

Query: 248 DLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSY 307
            +YSKC  ++EA K+F     RN ++W+A+++G+ Q  + L A+K F  M   G + + Y
Sbjct: 265 TMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQNGESLEAVKLFSRMFSAGIKPSEY 324

Query: 308 TVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEM 367
           T+  VL AC++    +E  QLHS++L+ GF  H     AL++MY+K G +  +   F + 
Sbjct: 325 TIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLFATTALVDMYAKAGCLADARKGF-DC 384

Query: 368 DNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCIT---FG 427
             +R+++ WT++I+ + QN+D E+A  L+++M    + P+    +SVL     +     G
Sbjct: 385 LQERDVALWTSLISGYVQNSDNEEALILYRRMKTAGIIPNDPTMASVLKACSSLATLELG 444

Query: 428 RQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEE 453
           +Q+H  T K G   ++ +GSAL TMYSKCG LE+
Sbjct: 445 KQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLED 476

BLAST of Cp4.1LG20g00240 vs. TrEMBL
Match: B9R998_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1515050 PE=4 SV=1)

HSP 1 Score: 547.0 bits (1408), Expect = 2.4e-152
Identity = 258/458 (56.33%), Postives = 353/458 (77.07%), Query Frame = 1

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           ++Y+KS   ++  TKV+H  L++  L +SN  V+NSLLD Y KS +L +ALK+FDT+ + 
Sbjct: 55  TNYIKSADHTVEETKVIHTHLIKTALFNSNTVVANSLLDWYCKSGALFYALKVFDTIPNK 114

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
           NVISWN++IS +N N ++ DSWR F  MHF GF+P++ITYG VLSACAA++ P  G+QVY
Sbjct: 115 NVISWNVIISGYNRNSLFEDSWRFFSMMHFSGFDPNDITYGCVLSACAALETPNLGEQVY 174

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
           SLA +NGF+ NG+VRAGMIDL A++  F DALRVF+DV CENVVCWN+I+S AV++GE +
Sbjct: 175 SLATKNGFYSNGHVRAGMIDLLARNGRFGDALRVFYDVSCENVVCWNSIISGAVKSGEYW 234

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 242
           +ALD++  M   F+ PNSFTFSS+LTACA+LE  E GK +QG VIKC  +D+FV TA+++
Sbjct: 235 IALDIFYQMSRRFVVPNSFTFSSILTACASLEEVELGKGIQGWVIKCCAKDIFVGTAIVN 294

Query: 243 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 302
           +Y+KCG++ +AVK F RMP+RNVVSWTAI+SGF++++D + ALKFFK+MRK+ EE N +T
Sbjct: 295 MYAKCGDIVDAVKEFSRMPVRNVVSWTAIVSGFIKRDDSISALKFFKEMRKMKEETNKFT 354

Query: 303 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 362
           VT+V++ACA P   KEAIQ+H WIL+ G+    VVGAALINMY+K+ AI  S  VF EM+
Sbjct: 355 VTTVISACAKPHFIKEAIQIHCWILKTGYYLDPVVGAALINMYAKLHAISSSEMVFREME 414

Query: 363 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 422
             +N   WT MI+SFA+N D + A +L  K+L++ + PD FC SSVLSV D +  GR+IH
Sbjct: 415 GVKNPGIWTIMISSFAKNQDSQSAIDLLLKLLQQGLRPDKFCLSSVLSVIDSLYLGREIH 474

Query: 423 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
           C+  KTG + D+SVGS+LFTMYSKCG + +++ + E++
Sbjct: 475 CYILKTGFVLDLSVGSSLFTMYSKCGSIGDSYKVFEQI 512

BLAST of Cp4.1LG20g00240 vs. TrEMBL
Match: A0A061FX38_THECC (Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_013958 PE=4 SV=1)

HSP 1 Score: 543.5 bits (1399), Expect = 2.7e-151
Identity = 270/458 (58.95%), Postives = 350/458 (76.42%), Query Frame = 1

Query: 4   DYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPN 63
           DY + R+ ++++TK+LH  LL+ + L SNI+V+NSLLD Y +  S++ A+KLFD M  PN
Sbjct: 53  DYKRLRQYTIKSTKLLHTHLLKTSKLQSNIFVANSLLDGYCRCGSMEEAIKLFDQMSEPN 112

Query: 64  VISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYS 123
           +ISWN +IS +N+N++   SW  F +M F GFEP EITY SVLSAC A+++  FGKQ+YS
Sbjct: 113 IISWNTMISGYNYNYLLEGSWVWFLKMRFSGFEPDEITYRSVLSACVAMRSTSFGKQLYS 172

Query: 124 LAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDC-ENVVCWNAIVSAAVRNGENF 183
           + ++NGFF NGYVR GMIDLFAK   F DALRVF+DV C ENVVCWNAI+S AVR+ EN+
Sbjct: 173 VTMKNGFFSNGYVRTGMIDLFAKCCVFEDALRVFYDVSCCENVVCWNAIISGAVRSEENW 232

Query: 184 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 243
           +ALDL+  M   FL PNSFTFSSVL+ACAAL+  E GK VQG +IKCG  DVFV TAL D
Sbjct: 233 VALDLFVQMRKQFLMPNSFTFSSVLSACAALKELEIGKEVQGWIIKCGVVDVFVGTALTD 292

Query: 244 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 303
           LY KCG+M+EAV +F  MP R+VVSWTAIISGFVQK+D L AL+FFK+MR +  EIN+YT
Sbjct: 293 LYVKCGDMEEAVNMFSWMPTRDVVSWTAIISGFVQKDDLLNALEFFKEMRYMKVEINNYT 352

Query: 304 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 363
            TSV++ACA P M +EA Q+HSWI+++GF   +V+ AAL+NMYSKIG I L+  VF EM+
Sbjct: 353 ATSVISACAKPDMIEEAKQIHSWIIKSGFYMDSVIQAALVNMYSKIGIIGLAEIVFKEME 412

Query: 364 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 423
           + R+ ++W  +I+SFAQ    ++  EL + ML+E + PD FCTSSV SV +CI  GRQ+H
Sbjct: 413 SIRSPNTWAVLISSFAQKQSFQRVIELLRTMLKEGLRPDRFCTSSVFSVIECINLGRQMH 472

Query: 424 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
           C+T KTGLIF +SV S+LFTMYSKCG LE++  + + +
Sbjct: 473 CYTLKTGLIFYLSVESSLFTMYSKCGSLEDSLKVFQNI 510

BLAST of Cp4.1LG20g00240 vs. TrEMBL
Match: A0A0D2U5X8_GOSRA (Uncharacterized protein (Fragment) OS=Gossypium raimondii GN=B456_008G192700 PE=4 SV=1)

HSP 1 Score: 528.5 bits (1360), Expect = 8.8e-147
Identity = 266/459 (57.95%), Postives = 344/459 (74.95%), Query Frame = 1

Query: 4   DYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPN 63
           DY +SR  ++++TK+LHA LL+ + L + I+V+N+LLD Y K  S++ A+KLFD M  P 
Sbjct: 69  DYRRSRNYAIKSTKILHAHLLKTSKLQTYIFVANNLLDRYCKWGSMEEAVKLFDKMPEPT 128

Query: 64  VISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYS 123
           V SWN LIS FN+N ++  SW  F +M   GFEP EI+Y +VLSAC A+Q+  FGKQVYS
Sbjct: 129 VTSWNTLISGFNYNKLFESSWLWFSKMWISGFEPDEISYRNVLSACVAMQSISFGKQVYS 188

Query: 124 LAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVD-CENVVCWNAIVSAAVRNGENF 183
           + ++NG + NGYVR GMIDLFAK  +F DALRVF+DV  CENVVCWN I+SAAVRN EN+
Sbjct: 189 VTMKNGLYSNGYVRTGMIDLFAKCCAFWDALRVFYDVSGCENVVCWNGIISAAVRNEENW 248

Query: 184 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 243
           +ALDL+  M   FL PNSFTFSSVLTACAAL+  E GK VQG +IKCGG DVFV TALID
Sbjct: 249 IALDLFVQMGKQFLMPNSFTFSSVLTACAALKELEIGKEVQGWIIKCGGVDVFVGTALID 308

Query: 244 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 303
            Y K G+MDEAVK F  MP RNVVSWTAIISGFVQK+D + ALKFFK+MR +  E+N+YT
Sbjct: 309 FYVKSGDMDEAVKAFSWMPTRNVVSWTAIISGFVQKDDCINALKFFKEMRYMNLEVNNYT 368

Query: 304 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 363
            T+V++ACA   M +EA Q+HSW++++GF   +V+  AL+NMYSKIG I ++  VF EM+
Sbjct: 369 ATAVISACAKLNMIEEATQIHSWVIKSGFCMDSVIKVALVNMYSKIGVIGMAEIVFKEME 428

Query: 364 NKRNLSSWTA-MITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQI 423
           + R+ +   A +I+SFAQ    +   EL  +ML+E + PD FCTSSV SV +C+  GRQ+
Sbjct: 429 SIRSSADTLAVLISSFAQKQSSQYVIELLTRMLKEGVRPDRFCTSSVFSVIECLKLGRQM 488

Query: 424 HCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
           HC+T KTGLIFD+SV ++LFTMYSKCG LE++  + + +
Sbjct: 489 HCYTLKTGLIFDLSVETSLFTMYSKCGTLEDSLKVFQSM 527

BLAST of Cp4.1LG20g00240 vs. TrEMBL
Match: A0A072TYN0_MEDTR (Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_7g053240 PE=4 SV=1)

HSP 1 Score: 501.1 bits (1289), Expect = 1.5e-138
Identity = 242/461 (52.49%), Postives = 330/461 (71.58%), Query Frame = 1

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           +L DY  S + + RNTK+LHA LL+   L S I+  +SL+  Y KS+ +  A KLFDT+ 
Sbjct: 38  ILRDYKFSPQHNARNTKILHAHLLKTHYLQSGIFFMDSLIGLYCKSSDMVLAHKLFDTIT 97

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
            P+++SWN++IS +  N M+L S   FCRMH  GFEP E +YGSVLSAC A+QA MFG Q
Sbjct: 98  QPSIVSWNVMISGYVRNSMFLKSLEMFCRMHLFGFEPDEFSYGSVLSACVALQASMFGLQ 157

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           V+SL V+NGF  +GYV+  M+D+F K+ +F +ALR F+D  C+NV  WNAI+S AV+NGE
Sbjct: 158 VFSLVVKNGFLSSGYVQTQMVDMFCKNCNFSEALRFFNDASCDNVASWNAIISLAVKNGE 217

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
           N +AL+L++ MC   L PNS+TF S+LTAC AL+  + GK V G  IKCG  DVFVETA+
Sbjct: 218 NQVALNLFSEMCRASLMPNSYTFPSILTACCALKEMQIGKGVHGLAIKCGATDVFVETAI 277

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           +DLY+K G M EA + F +M ++NVVSWTAIISGFVQ++D   ALK FKDMR++G EIN+
Sbjct: 278 VDLYAKFGCMSEAYRQFSQMQVQNVVSWTAIISGFVQQDDTTFALKLFKDMRQIGHEINA 337

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVTSVL+ACA P + +EA Q+HS +L+ G   +  VGAAL+NMY+KIG + LS   F E
Sbjct: 338 YTVTSVLSACAKPELIEEAKQIHSLVLKLGLILNVKVGAALVNMYAKIGGVGLSELAFSE 397

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           M N ++   W +M++SFAQN +  +A ELF  MLRE + PD +C  S+LS+   ++ G Q
Sbjct: 398 MKNMKDPGIWASMLSSFAQNRNSGRALELFTVMLREGVKPDEYCIGSLLSIMSSLSLGSQ 457

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKLV 462
           +H +  K GL+ + +VG +LFTMYSKCG LEE++ + ++ +
Sbjct: 458 VHSYILKAGLVTNATVGCSLFTMYSKCGCLEESYEVFQQAI 498

BLAST of Cp4.1LG20g00240 vs. TrEMBL
Match: A0A067KG80_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11621 PE=4 SV=1)

HSP 1 Score: 500.0 bits (1286), Expect = 3.4e-138
Identity = 237/371 (63.88%), Postives = 301/371 (81.13%), Query Frame = 1

Query: 90  MHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSS 149
           MHF G++P++ITYGSVLSACAA+QAP+FG+QVY+LA+RNGF+ NGYVRAGMIDLFAK++ 
Sbjct: 1   MHFFGYKPNDITYGSVLSACAALQAPIFGEQVYALAIRNGFYSNGYVRAGMIDLFAKNNR 60

Query: 150 FLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLTA 209
           F  AL+VF+DV CENVVCWN+I+S AV+NGEN +ALDL+  MC  FL P+S+TFSSVLTA
Sbjct: 61  FDYALKVFYDVLCENVVCWNSIISGAVKNGENLVALDLFGQMCRRFLLPDSYTFSSVLTA 120

Query: 210 CAALEHPEFGKRVQGKVIKCGGEDVFVETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWT 269
           CA LE  E GK VQG VIKC  +DVFVETA++D+YSKCG+++EAVK F RMP+RNVVSWT
Sbjct: 121 CATLEQIEIGKGVQGWVIKCCAKDVFVETAIVDMYSKCGDINEAVKKFSRMPVRNVVSWT 180

Query: 270 AIISGFVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILRA 329
           AIISG V++ D++ ALKFFK+MRK+ EE N++T+TSV+TACA P M KEA Q+H+WIL+ 
Sbjct: 181 AIISGLVKRGDFMSALKFFKEMRKMEEETNNFTITSVITACARPNMIKEAFQIHNWILKT 240

Query: 330 GFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASEL 389
           GFS   VVGAAL+NMY+K  AIDLS  VF EM+  ++   W  MI+SFAQN   ++A  L
Sbjct: 241 GFSLDPVVGAALVNMYAKAHAIDLSEMVFREMEGLKDPRIWAIMISSFAQNQSSQRAIGL 300

Query: 390 FQKMLRESMGPDTFCTSSVLSVTDCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGY 449
            Q+ML+E + PD FC SSV S  DC+  GRQIHC+T K GL  D+SVGS+LFTMYSKCG 
Sbjct: 301 LQRMLQEGLRPDKFCFSSVFSAIDCLNLGRQIHCYTVKIGLDLDLSVGSSLFTMYSKCGN 360

Query: 450 LEEAFHISEKL 461
           +E+++ + E++
Sbjct: 361 IEDSYKVFERI 371

BLAST of Cp4.1LG20g00240 vs. TAIR10
Match: AT1G74600.1 (AT1G74600.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 450.7 bits (1158), Expect = 1.2e-126
Identity = 224/454 (49.34%), Postives = 319/454 (70.26%), Query Frame = 1

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           +D   SR C+LR TK+L A LLR  LL  +++++ SLL  YS S S+  A KLFDT+  P
Sbjct: 54  NDQSNSRLCNLRTTKILQAHLLRRYLLPFDVFLTKSLLSWYSNSGSMADAAKLFDTIPQP 113

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
           +V+S NI+IS +  + ++ +S R F +MHFLGFE +EI+YGSV+SAC+A+QAP+F + V 
Sbjct: 114 DVVSCNIMISGYKQHRLFEESLRFFSKMHFLGFEANEISYGSVISACSALQAPLFSELVC 173

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
              ++ G+F    V + +ID+F+K+  F DA +VF D    NV CWN I++ A+RN    
Sbjct: 174 CHTIKMGYFFYEVVESALIDVFSKNLRFEDAYKVFRDSLSANVYCWNTIIAGALRNQNYG 233

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 242
              DL++ MC GF +P+S+T+SSVL ACA+LE   FGK VQ +VIKCG EDVFV TA++D
Sbjct: 234 AVFDLFHEMCVGFQKPDSYTYSSVLAACASLEKLRFGKVVQARVIKCGAEDVFVCTAIVD 293

Query: 243 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 302
           LY+KCG M EA+++F R+P  +VVSWT ++SG+ + ND   AL+ FK+MR  G EIN+ T
Sbjct: 294 LYAKCGHMAEAMEVFSRIPNPSVVSWTVMLSGYTKSNDAFSALEIFKEMRHSGVEINNCT 353

Query: 303 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 362
           VTSV++AC  P+M  EA Q+H+W+ ++GF   + V AALI+MYSK G IDLS  VF ++D
Sbjct: 354 VTSVISACGRPSMVCEASQVHAWVFKSGFYLDSSVAAALISMYSKSGDIDLSEQVFEDLD 413

Query: 363 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 422
           + +  +    MITSF+Q+    KA  LF +ML+E +  D F   S+LSV DC+  G+Q+H
Sbjct: 414 DIQRQNIVNVMITSFSQSKKPGKAIRLFTRMLQEGLRTDEFSVCSLLSVLDCLNLGKQVH 473

Query: 423 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
            +T K+GL+ D++VGS+LFT+YSKCG LEE++ +
Sbjct: 474 GYTLKSGLVLDLTVGSSLFTLYSKCGSLEESYKL 507

BLAST of Cp4.1LG20g00240 vs. TAIR10
Match: AT3G61170.1 (AT3G61170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 228.8 bits (582), Expect = 7.3e-60
Identity = 140/423 (33.10%), Postives = 242/423 (57.21%), Query Frame = 1

Query: 37  NSLLDCYSKSNSLDHALKLFDTMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFE 96
           N+++  YS S  L  A KLF +    N ISWN LIS +  +   ++++  F  M   G +
Sbjct: 63  NTMIVAYSNSRRLSDAEKLFRSNPVKNTISWNALISGYCKSGSKVEAFNLFWEMQSDGIK 122

Query: 97  PSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRV 156
           P+E T GSVL  C ++   + G+Q++   ++ GF ++  V  G++ ++A+     +A  +
Sbjct: 123 PNEYTLGSVLRMCTSLVLLLRGEQIHGHTIKTGFDLDVNVVNGLLAMYAQCKRISEAEYL 182

Query: 157 FHDVDCE-NVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEH 216
           F  ++ E N V W ++++   +NG  F A++ +  +     + N +TF SVLTACA++  
Sbjct: 183 FETMEGEKNNVTWTSMLTGYSQNGFAFKAIECFRDLRREGNQSNQYTFPSVLTACASVSA 242

Query: 217 PEFGKRVQGKVIKCGGE-DVFVETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISG 276
              G +V   ++K G + +++V++ALID+Y+KC EM+ A  +   M + +VVSW ++I G
Sbjct: 243 CRVGVQVHCCIVKSGFKTNIYVQSALIDMYAKCREMESARALLEGMEVDDVVSWNSMIVG 302

Query: 277 FVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACA-NPAMTKEAIQLHSWILRAGFSS 336
            V++     AL  F  M +   +I+ +T+ S+L   A +    K A   H  I++ G+++
Sbjct: 303 CVRQGLIGEALSMFGRMHERDMKIDDFTIPSILNCFALSRTEMKIASSAHCLIVKTGYAT 362

Query: 337 HAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKM 396
           + +V  AL++MY+K G +D ++ VF  M  K ++ SWTA++T    N   ++A +LF  M
Sbjct: 363 YKLVNNALVDMYAKRGIMDSALKVFEGMIEK-DVISWTALVTGNTHNGSYDEALKLFCNM 422

Query: 397 LRESMGPDTFCTSSVLSVTDCIT---FGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYL 454
               + PD   T+SVLS +  +T   FG+Q+H    K+G    +SV ++L TMY+KCG L
Sbjct: 423 RVGGITPDKIVTASVLSASAELTLLEFGQQVHGNYIKSGFPSSLSVNNSLVTMYTKCGSL 482

BLAST of Cp4.1LG20g00240 vs. TAIR10
Match: AT5G27110.1 (AT5G27110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 219.5 bits (558), Expect = 4.4e-57
Identity = 142/464 (30.60%), Postives = 248/464 (53.45%), Query Frame = 1

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSN---SLDHALKLFD 60
           LL +   S K SLR  K++H ++L   L   ++ +  SL++ Y       S  H  + FD
Sbjct: 9   LLRECTNSTK-SLRRIKLVHQRILTLGL-RRDVVLCKSLINVYFTCKDHCSARHVFENFD 68

Query: 61  TMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGF-EPSEITYGSVLSACAAIQAPM 120
             +  +V  WN L+S ++ N M+ D+   F R+       P   T+ +V+ A  A+    
Sbjct: 69  --IRSDVYIWNSLMSGYSKNSMFHDTLEVFKRLLNCSICVPDSFTFPNVIKAYGALGREF 128

Query: 121 FGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAV 180
            G+ +++L V++G+  +  V + ++ ++AK + F ++L+VF ++   +V  WN ++S   
Sbjct: 129 LGRMIHTLVVKSGYVCDVVVASSLVGMYAKFNLFENSLQVFDEMPERDVASWNTVISCFY 188

Query: 181 RNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGE-DVF 240
           ++GE   AL+L+  M     EPNS + +  ++AC+ L   E GK +  K +K G E D +
Sbjct: 189 QSGEAEKALELFGRMESSGFEPNSVSLTVAISACSRLLWLERGKEIHRKCVKKGFELDEY 248

Query: 241 VETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLG 300
           V +AL+D+Y KC  ++ A ++F +MP +++V+W ++I G+V K D    ++    M   G
Sbjct: 249 VNSALVDMYGKCDCLEVAREVFQKMPRKSLVAWNSMIKGYVAKGDSKSCVEILNRMIIEG 308

Query: 301 EEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSM 360
              +  T+TS+L AC+          +H +++R+  ++   V  +LI++Y K G  +L+ 
Sbjct: 309 TRPSQTTLTSILMACSRSRNLLHGKFIHGYVIRSVVNADIYVNCSLIDLYFKCGEANLAE 368

Query: 361 TVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL---SVT 420
           TVF +   K    SW  MI+S+    +  KA E++ +M+   + PD    +SVL   S  
Sbjct: 369 TVFSK-TQKDVAESWNVMISSYISVGNWFKAVEVYDQMVSVGVKPDVVTFTSVLPACSQL 428

Query: 421 DCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHI 457
             +  G+QIH    ++ L  D  + SAL  MYSKCG  +EAF I
Sbjct: 429 AALEKGKQIHLSISESRLETDELLLSALLDMYSKCGNEKEAFRI 467

BLAST of Cp4.1LG20g00240 vs. TAIR10
Match: AT2G27610.1 (AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 219.2 bits (557), Expect = 5.8e-57
Identity = 139/460 (30.22%), Postives = 248/460 (53.91%), Query Frame = 1

Query: 6   VKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVI 65
           V +  C     + LH + ++   L  ++ V  SL+D Y K ++     K+FD M   NV+
Sbjct: 102 VSATLCDELFGRQLHCQCIKFGFL-DDVSVGTSLVDTYMKGSNFKDGRKVFDEMKERNVV 161

Query: 66  SWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLA 125
           +W  LIS +  N M  +    F RM   G +P+  T+ + L   A       G QV+++ 
Sbjct: 162 TWTTLISGYARNSMNDEVLTLFMRMQNEGTQPNSFTFAAALGVLAEEGVGGRGLQVHTVV 221

Query: 126 VRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENFMAL 185
           V+NG      V   +I+L+ K  +   A  +F   + ++VV WN+++S    NG +  AL
Sbjct: 222 VKNGLDKTIPVSNSLINLYLKCGNVRKARILFDKTEVKSVVTWNSMISGYAANGLDLEAL 281

Query: 186 DLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGE-DVFVETALIDLY 245
            ++ +M   ++  +  +F+SV+  CA L+   F +++   V+K G   D  + TAL+  Y
Sbjct: 282 GMFYSMRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAY 341

Query: 246 SKCGEMDEAVKIFLRMP-IRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTV 305
           SKC  M +A+++F  +  + NVVSWTA+ISGF+Q +    A+  F +M++ G   N +T 
Sbjct: 342 SKCTAMLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTY 401

Query: 306 TSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDN 365
           + +LTA   P ++    ++H+ +++  +   + VG AL++ Y K+G ++ +  VF  +D+
Sbjct: 402 SVILTAL--PVISPS--EVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDD 461

Query: 366 KRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITF----GR 425
           K ++ +W+AM+  +AQ  + E A ++F ++ +  + P+ F  SS+L+V          G+
Sbjct: 462 K-DIVAWSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGK 521

Query: 426 QIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEK 460
           Q H F  K+ L   + V SAL TMY+K G +E A  + ++
Sbjct: 522 QFHGFAIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKR 555

BLAST of Cp4.1LG20g00240 vs. TAIR10
Match: AT3G63370.1 (AT3G63370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 214.2 bits (544), Expect = 1.9e-55
Identity = 131/451 (29.05%), Postives = 236/451 (52.33%), Query Frame = 1

Query: 19  LHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHPNVISWNILISSFNHNF 78
           LH+++ +        +++  L+  Y K  SLD A K+FD M      +WN +I ++  N 
Sbjct: 102 LHSRIFKTFPSFELDFLAGKLVFMYGKCGSLDDAEKVFDEMPDRTAFAWNTMIGAYVSNG 161

Query: 79  MYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVYSLAVRNGFFVNGYVRA 138
               +   +  M   G      ++ ++L ACA ++    G +++SL V+ G+   G++  
Sbjct: 162 EPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIRSGSELHSLLVKLGYHSTGFIVN 221

Query: 139 GMIDLFAKDSSFLDALRVFHDVDCE-NVVCWNAIVSAAVRNGENFMALDLYNTMCHGFLE 198
            ++ ++AK+     A R+F     + + V WN+I+S+   +G++   L+L+  M      
Sbjct: 222 ALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYSTSGKSLETLELFREMHMTGPA 281

Query: 199 PNSFTFSSVLTACAALEHPEFGKRVQGKVIKCG--GEDVFVETALIDLYSKCGEMDEAVK 258
           PNS+T  S LTAC    + + GK +   V+K      +++V  ALI +Y++CG+M +A +
Sbjct: 282 PNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELYVCNALIAMYTRCGKMPQAER 341

Query: 259 IFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYTVTSVLTACANPAM 318
           I  +M   +VV+W ++I G+VQ   Y  AL+FF DM   G + +  ++TS++ A    + 
Sbjct: 342 ILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSN 401

Query: 319 TKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMDNKRNLSSWTAMIT 378
               ++LH+++++ G+ S+  VG  LI+MYSK          F  M +K +L SWT +I 
Sbjct: 402 LLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGRAFLRMHDK-DLISWTTVIA 461

Query: 379 SFAQNNDKEKASELFQKMLRESMGPDTFCTSSVL---SVTDCITFGRQIHCFTHKTGLIF 438
            +AQN+   +A ELF+ + ++ M  D     S+L   SV   +   ++IHC   + GL+ 
Sbjct: 462 GYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLKSMLIVKEIHCHILRKGLL- 521

Query: 439 DISVGSALFTMYSKCGYLEEAFHISEKLVGQ 464
           D  + + L  +Y KC  +  A  + E + G+
Sbjct: 522 DTVIQNELVDVYGKCRNMGYATRVFESIKGK 550

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: gi|659066381|ref|XP_008441907.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Cucumis melo])

HSP 1 Score: 706.1 bits (1821), Expect = 4.4e-200
Identity = 351/460 (76.30%), Postives = 400/460 (86.96%), Query Frame = 1

Query: 1   LLSDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTML 60
           LL+D+VK    SLRNTKVLHAK LR T    +IYVSNSLL CYSKSN++DHALKLFDT+L
Sbjct: 49  LLNDFVKLGNFSLRNTKVLHAKFLRVTP-RIDIYVSNSLLHCYSKSNAMDHALKLFDTIL 108

Query: 61  HPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQ 120
           +PNVISWN +I+ FN+NF++LDS R FC MH+LGF+P+E+T GSVLSACAAIQA MFGKQ
Sbjct: 109 NPNVISWNTIITGFNNNFLHLDSLRIFCWMHYLGFKPNEVTCGSVLSACAAIQATMFGKQ 168

Query: 121 VYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGE 180
           VYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVFHDVDC NVVCWNAIVSAAV NGE
Sbjct: 169 VYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCWNAIVSAAVTNGE 228

Query: 181 NFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETAL 240
             MALDL+N MC  FLEPNSFTFSSVLTAC+AL+  EFGK VQG+VIKCGG DVFVETAL
Sbjct: 229 YLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKSVQGRVIKCGGGDVFVETAL 288

Query: 241 IDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINS 300
           + LY+KCG+MDEAVKIF +MPIRNVVSWT I+SGFVQ NDYLM +K F+D+RK+GEEINS
Sbjct: 289 VSLYAKCGDMDEAVKIFFQMPIRNVVSWTVIMSGFVQNNDYLMVIKIFEDLRKIGEEINS 348

Query: 301 YTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGE 360
           YTVT++L ACANP M KEA QLHSWIL+AGFSS A V AALI MYSKIGAIDLS+ VF E
Sbjct: 349 YTVTTLLRACANPGMRKEATQLHSWILKAGFSSQAEVVAALIIMYSKIGAIDLSLMVFRE 408

Query: 361 MDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQ 420
           MDN RNLSSWTAMI S A+NNDKE+AS+LF+KMLRE M PD+ CTS++LS+TDCITFGRQ
Sbjct: 409 MDNHRNLSSWTAMILSLAKNNDKEEASDLFRKMLREKMEPDSVCTSTLLSLTDCITFGRQ 468

Query: 421 IHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
           IHC+T KT LIF++SVGS+LFTMYSKCG+L+EAF + E +
Sbjct: 469 IHCYTLKTELIFNVSVGSSLFTMYSKCGHLKEAFQVFENM 507

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: gi|778656821|ref|XP_011649738.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 661.8 bits (1706), Expect = 9.6e-187
Identity = 318/412 (77.18%), Postives = 367/412 (89.08%), Query Frame = 1

Query: 49  LDHALKLFDTMLHPNVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSA 108
           +DHA+KLFDT+L+PNVISWN +I+  N+NF++LDS RTFC MHFLGF+P+E+T GSVLSA
Sbjct: 1   MDHAIKLFDTILYPNVISWNTIITGLNNNFLHLDSLRTFCWMHFLGFKPNEVTCGSVLSA 60

Query: 109 CAAIQAPMFGKQVYSLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCW 168
           CAAIQA MFGKQVYSLAVRNGFF NGYVR  MIDLFAKDS FLDALRVFHDVDC NVVCW
Sbjct: 61  CAAIQASMFGKQVYSLAVRNGFFDNGYVRTVMIDLFAKDSKFLDALRVFHDVDCANVVCW 120

Query: 169 NAIVSAAVRNGENFMALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIK 228
           NAIVSAAV NGEN MALDL+N MC  FLEPNSFTFSSVLTAC+AL+  EFGK+VQG+VIK
Sbjct: 121 NAIVSAAVTNGENLMALDLFNRMCSKFLEPNSFTFSSVLTACSALQDLEFGKKVQGRVIK 180

Query: 229 CGGEDVFVETALIDLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFF 288
           CGG DVFVETAL+ LY+KCG+MDEAVK FL+MPIRNVVSWT I+SGFVQ NDYLM +KFF
Sbjct: 181 CGGGDVFVETALVSLYAKCGDMDEAVKTFLQMPIRNVVSWTVIMSGFVQSNDYLMVIKFF 240

Query: 289 KDMRKLGEEINSYTVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKI 348
           +D+RK+GEEINSYTVT++L ACANPAM KEA QLHSWIL+AGFSSH+ V AALI MYSKI
Sbjct: 241 EDLRKVGEEINSYTVTTLLRACANPAMRKEATQLHSWILKAGFSSHSEVAAALIIMYSKI 300

Query: 349 GAIDLSMTVFGEMDNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSV 408
           GA+DLS+ +F EMDN RNLSSWTAMI SFA+NNDKE+AS+LF+KMLRE MGPD+ CTS++
Sbjct: 301 GAVDLSLMIFREMDNHRNLSSWTAMILSFAKNNDKEEASDLFRKMLRERMGPDSVCTSAL 360

Query: 409 LSVTDCITFGRQIHCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
           LS+TDCITFGRQIHC+  KT LIF++ VGS+L TMYSKCG+L+EAF + E +
Sbjct: 361 LSLTDCITFGRQIHCYALKTELIFNVHVGSSLLTMYSKCGHLKEAFQVFENM 412

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: gi|225456755|ref|XP_002268980.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic isoform X1 [Vitis vinifera])

HSP 1 Score: 609.8 bits (1571), Expect = 4.3e-171
Identity = 298/459 (64.92%), Postives = 366/459 (79.74%), Query Frame = 1

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           SDY KS +C+LRNTK+LHA  L+  +L SN +++NSL+  Y KSNS+ HAL+LFD   HP
Sbjct: 51  SDYTKSGRCTLRNTKILHAHFLKTAILQSNTFMTNSLMGWYCKSNSMVHALRLFDKTPHP 110

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
           NVISWNILIS  N NF + DSWR FC+M F GF+P++ TYGSVLSAC A+ +P++G+ VY
Sbjct: 111 NVISWNILISGCNQNFSFEDSWRNFCKMRFSGFDPNQFTYGSVLSACTALGSPLYGELVY 170

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
           SLA++NGFF NGYVRAGMIDLFAK  SF DALRVF DV CENVVCWNAI+S AV+N EN+
Sbjct: 171 SLALKNGFFSNGYVRAGMIDLFAKLCSFEDALRVFQDVLCENVVCWNAIISGAVKNRENW 230

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCG-GEDVFVETALI 242
           +ALDL+  MC  F  PNSFTFSS+LTACAALE  EFG+ VQG VIKCG GEDVFV TA+I
Sbjct: 231 VALDLFCQMCCRFFMPNSFTFSSILTACAALEELEFGRGVQGWVIKCGAGEDVFVGTAII 290

Query: 243 DLYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSY 302
           DLY+KC +MD+AVK FLRMPIRNVVSWT IISGFVQK+D + A  FFK+MRK+GE+IN+Y
Sbjct: 291 DLYAKCRDMDQAVKEFLRMPIRNVVSWTTIISGFVQKDDSISAFHFFKEMRKVGEKINNY 350

Query: 303 TVTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEM 362
           T+TSVLTAC  P M KEA+QLHSWI + GF   + V +ALINMYSKIG +DLS  VF EM
Sbjct: 351 TITSVLTACTEPVMIKEAVQLHSWIFKTGFYLDSNVSSALINMYSKIGVVDLSERVFREM 410

Query: 363 DNKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQI 422
           ++ +NL+ W  MI++FAQ+    +A ELFQ+ML+E + PD FC+SSVLS+ D ++ GR I
Sbjct: 411 ESTKNLAMWAVMISAFAQSGSTGRAVELFQRMLQEGLRPDKFCSSSVLSIIDSLSLGRLI 470

Query: 423 HCFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
           HC+  K GL  DISVGS+LFTMYSKCG LEE++ + E++
Sbjct: 471 HCYILKIGLFTDISVGSSLFTMYSKCGSLEESYTVFEQM 509

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: gi|694387084|ref|XP_009369307.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Pyrus x bretschneideri])

HSP 1 Score: 607.4 bits (1565), Expect = 2.2e-170
Identity = 297/458 (64.85%), Postives = 369/458 (80.57%), Query Frame = 1

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           +DY KSR+C+ R TK++H  L    LL S+I++SNSLLD Y KS ++  ALKLFD +   
Sbjct: 51  NDYAKSRQCTARITKIVHTHLTTTGLLQSDIFLSNSLLDSYCKSAAMVDALKLFDLIADR 110

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
           NV SWNI+IS +NH  ++  +W  FCRMH  GF P E  YGSVLSAC A+QAP+FGKQVY
Sbjct: 111 NVFSWNIMISGYNHISLFEMAWEMFCRMHASGFGPDEFAYGSVLSACNALQAPIFGKQVY 170

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
           SLA++NGFF NGYV++GMIDLFAK+ SF DALRVFHDV C+NVV WNA++S AVRNGEN 
Sbjct: 171 SLAIKNGFFPNGYVQSGMIDLFAKNCSFEDALRVFHDVSCQNVVSWNAVISGAVRNGENR 230

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 242
           +AL L+  M  GFL PNSFTFSSVLTACAALE  E GK VQG VIK G EDVFV T ++D
Sbjct: 231 VALHLFQNMFRGFLLPNSFTFSSVLTACAALEEIEVGKEVQGLVIKRGAEDVFVGTTIVD 290

Query: 243 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 302
           LY+KCGEM+EAVK F RMP RNVVSWTAIISGFV K+DY+ ALKFF++MRK+GE++N YT
Sbjct: 291 LYAKCGEMNEAVKEFKRMPTRNVVSWTAIISGFVHKDDYISALKFFREMRKVGEQMNKYT 350

Query: 303 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 362
           VTSVLTACA P+MT+EA Q+HS IL++GF S AVVG+ALIN YSKIGA+DLS  VF EM+
Sbjct: 351 VTSVLTACARPSMTEEATQIHSLILKSGFFSAAVVGSALINAYSKIGAVDLSEMVFREME 410

Query: 363 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 422
           N ++L +W A+I+SFAQN +  +A ELF++ML+ES+ PD FCTSSVLS+ DC+  GRQIH
Sbjct: 411 NIKDLGTWAAIISSFAQNQNSGRAIELFRRMLQESVRPDKFCTSSVLSIVDCLNLGRQIH 470

Query: 423 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
            +T K+GL+FD+SVGS+LFTMYSKC  L+E++ + +++
Sbjct: 471 SYTLKSGLVFDVSVGSSLFTMYSKCDSLDESYKVFQQI 508

BLAST of Cp4.1LG20g00240 vs. NCBI nr
Match: gi|645269344|ref|XP_008239957.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic [Prunus mume])

HSP 1 Score: 602.4 bits (1552), Expect = 6.9e-169
Identity = 296/458 (64.63%), Postives = 362/458 (79.04%), Query Frame = 1

Query: 3   SDYVKSRKCSLRNTKVLHAKLLRATLLHSNIYVSNSLLDCYSKSNSLDHALKLFDTMLHP 62
           +DY KSR+CS RNTK+LH  LLR  LL SNI+++NSLLD Y KS+++  ALKLFD +   
Sbjct: 51  NDYTKSRQCSTRNTKILHTHLLRTDLLQSNIFIANSLLDSYCKSSAMVDALKLFDFIADR 110

Query: 63  NVISWNILISSFNHNFMYLDSWRTFCRMHFLGFEPSEITYGSVLSACAAIQAPMFGKQVY 122
            VISWN++IS +N N ++  SW  FCRMH  GFEPSE TYGS LSAC A+QAP FGKQVY
Sbjct: 111 TVISWNMMISGYNQNSLFEKSWEIFCRMHSSGFEPSEFTYGSTLSACTALQAPTFGKQVY 170

Query: 123 SLAVRNGFFVNGYVRAGMIDLFAKDSSFLDALRVFHDVDCENVVCWNAIVSAAVRNGENF 182
           SLA+++GFF NGYV+AGMIDLFAK+ SF DALRVFHDV C+NVV WN I+S AVRNGEN 
Sbjct: 171 SLAMKSGFFPNGYVQAGMIDLFAKNFSFDDALRVFHDVSCQNVVSWNTIISGAVRNGENM 230

Query: 183 MALDLYNTMCHGFLEPNSFTFSSVLTACAALEHPEFGKRVQGKVIKCGGEDVFVETALID 242
            AL L+  MC G   PNSFTFSSVLTACAALE    GK VQG VIK G EDVFV T ++D
Sbjct: 231 AALYLFRQMCRGVFLPNSFTFSSVLTACAALEEVGVGKEVQGWVIKRGAEDVFVGTTIVD 290

Query: 243 LYSKCGEMDEAVKIFLRMPIRNVVSWTAIISGFVQKNDYLMALKFFKDMRKLGEEINSYT 302
           LY+KCG+M+EAVK F RMP RNVVSWTAIISGFV K+D + ALK F++MRK+GE++N YT
Sbjct: 291 LYAKCGKMNEAVKKFSRMPTRNVVSWTAIISGFVHKDDSVSALKVFREMRKMGEQMNKYT 350

Query: 303 VTSVLTACANPAMTKEAIQLHSWILRAGFSSHAVVGAALINMYSKIGAIDLSMTVFGEMD 362
           +TS+L ACA  +M +EA Q+HS IL+AGF S AVVG+ALIN YSKIGA+DLS  VF EM+
Sbjct: 351 ITSILNACAKTSMAEEATQIHSLILKAGFYSAAVVGSALINAYSKIGAVDLSEMVFREME 410

Query: 363 NKRNLSSWTAMITSFAQNNDKEKASELFQKMLRESMGPDTFCTSSVLSVTDCITFGRQIH 422
           N ++L +W AMI+S AQN +  +A ELFQ+ML+ES+ PD FCTSSVLS+ DC+  GRQIH
Sbjct: 411 NIKDLGTWAAMISSLAQNQNSGRAIELFQRMLQESVRPDMFCTSSVLSIVDCLNLGRQIH 470

Query: 423 CFTHKTGLIFDISVGSALFTMYSKCGYLEEAFHISEKL 461
            +T K GL+ D+SVGS+LFTMYSKC  LEE++ + +++
Sbjct: 471 SYTLKIGLVSDVSVGSSLFTMYSKCDSLEESYEVFQQI 508

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP121_ARATH2.1e-12549.34Pentatricopeptide repeat-containing protein At1g74600, chloroplastic OS=Arabidop... [more]
PP398_ARATH7.9e-5630.60Pentatricopeptide repeat-containing protein At5g27110 OS=Arabidopsis thaliana GN... [more]
PP172_ARATH1.0e-5530.22Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana GN... [more]
PP296_ARATH3.3e-5429.05Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop... [more]
PP181_ARATH3.3e-5429.74Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
B9R998_RICCO2.4e-15256.33Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A061FX38_THECC2.7e-15158.95Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cac... [more]
A0A0D2U5X8_GOSRA8.8e-14757.95Uncharacterized protein (Fragment) OS=Gossypium raimondii GN=B456_008G192700 PE=... [more]
A0A072TYN0_MEDTR1.5e-13852.49Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_7g053240 PE... [more]
A0A067KG80_JATCU3.4e-13863.88Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11621 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G74600.11.2e-12649.34 pentatricopeptide (PPR) repeat-containing protein[more]
AT3G61170.17.3e-6033.10 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G27110.14.4e-5730.60 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G27610.15.8e-5730.22 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G63370.11.9e-5529.05 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659066381|ref|XP_008441907.1|4.4e-20076.30PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic ... [more]
gi|778656821|ref|XP_011649738.1|9.6e-18777.18PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic ... [more]
gi|225456755|ref|XP_002268980.1|4.3e-17164.92PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic ... [more]
gi|694387084|ref|XP_009369307.1|2.2e-17064.85PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic ... [more]
gi|645269344|ref|XP_008239957.1|6.9e-16964.63PREDICTED: pentatricopeptide repeat-containing protein At1g74600, chloroplastic ... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0044444 cytoplasmic part
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g00240.1Cp4.1LG20g00240.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 37..60
score: 6.9E-4coord: 238..263
score: 4.3E-5coord: 340..364
score: 0.78coord: 369..396
score: 6.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 264..311
score: 6.7E-13coord: 164..211
score: 1.3E-9coord: 62..110
score: 2.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 369..402
score: 5.8E-6coord: 266..299
score: 6.5E-6coord: 238..264
score: 0.0014coord: 37..60
score: 6.4E-4coord: 65..99
score: 3.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 98..132
score: 6.073coord: 264..298
score: 9.975coord: 334..364
score: 6.149coord: 199..229
score: 5.24coord: 32..66
score: 9.24coord: 233..263
score: 8.528coord: 433..463
score: 6.106coord: 299..333
score: 7.739coord: 67..97
score: 5.689coord: 164..198
score: 8.144coord: 366..400
score: 10
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 363..395
score: 1.4E-5coord: 233..294
score: 1.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 437..456
score: 5.7E-153coord: 25..404
score: 5.7E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cp4.1LG20g00240CmaCh13G011280Cucurbita maxima (Rimu)cmacpeB234
Cp4.1LG20g00240CmoCh13G011820Cucurbita moschata (Rifu)cmocpeB201
Cp4.1LG20g00240CsGy1G005280Cucumber (Gy14) v2cgybcpeB081
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG20g00240Cucurbita pepo (Zucchini)cpecpeB431
Cp4.1LG20g00240Cucurbita pepo (Zucchini)cpecpeB433
Cp4.1LG20g00240Cucurbita pepo (Zucchini)cpecpeB441
Cp4.1LG20g00240Cucumber (Gy14) v1cgycpeB0259
Cp4.1LG20g00240Cucumber (Gy14) v1cgycpeB0758
Cp4.1LG20g00240Cucurbita maxima (Rimu)cmacpeB441
Cp4.1LG20g00240Cucurbita maxima (Rimu)cmacpeB484
Cp4.1LG20g00240Cucurbita maxima (Rimu)cmacpeB485
Cp4.1LG20g00240Cucurbita moschata (Rifu)cmocpeB119
Cp4.1LG20g00240Cucurbita moschata (Rifu)cmocpeB444
Cp4.1LG20g00240Wild cucumber (PI 183967)cpecpiB519
Cp4.1LG20g00240Wild cucumber (PI 183967)cpecpiB526
Cp4.1LG20g00240Cucumber (Chinese Long) v2cpecuB517
Cp4.1LG20g00240Cucumber (Chinese Long) v2cpecuB524
Cp4.1LG20g00240Bottle gourd (USVL1VR-Ls)cpelsiB424
Cp4.1LG20g00240Bottle gourd (USVL1VR-Ls)cpelsiB430
Cp4.1LG20g00240Watermelon (Charleston Gray)cpewcgB470
Cp4.1LG20g00240Watermelon (Charleston Gray)cpewcgB473
Cp4.1LG20g00240Watermelon (97103) v1cpewmB511
Cp4.1LG20g00240Watermelon (97103) v1cpewmB522
Cp4.1LG20g00240Melon (DHL92) v3.5.1cpemeB471
Cp4.1LG20g00240Cucumber (Gy14) v2cgybcpeB671
Cp4.1LG20g00240Melon (DHL92) v3.6.1cpemedB549
Cp4.1LG20g00240Melon (DHL92) v3.6.1cpemedB553
Cp4.1LG20g00240Silver-seed gourdcarcpeB0072
Cp4.1LG20g00240Silver-seed gourdcarcpeB0149
Cp4.1LG20g00240Silver-seed gourdcarcpeB0544
Cp4.1LG20g00240Cucumber (Chinese Long) v3cpecucB0636
Cp4.1LG20g00240Cucumber (Chinese Long) v3cpecucB0643
Cp4.1LG20g00240Cucumber (Chinese Long) v3cpecucB0649
Cp4.1LG20g00240Wax gourdcpewgoB0639
Cp4.1LG20g00240Wax gourdcpewgoB0662