Cp4.1LG01g22400 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g22400
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG01 : 20292166 .. 20294491 (+)
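
As an illustration of how the location field above can be read, the following minimal sketch parses the coordinate string and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive; the page does not state the convention, so treat that as an assumption. The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG01 : 20292166 .. 20294491 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under the inclusive-coordinate assumption the span covers 2326 bases; with a half-open convention it would be one base shorter.
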
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAGGATTGCTTATAAATCCCACTCTGATTTTATCAAATGAGTTGAATTACCAACTTCACTCGTGTTACCCGGTTAGTTGTGCACACAAGCATTTCCATGTTATTCCAGAATTGAAATCATGTATAAGGCGTAGGATAACTCATGGGGGTAATGTAGCTTCAATGTCGTCGATGAGTATTCCACGATTGAATTTCGTGGTTCGGTCCACAAAAGCCTTGGAGTTTAGGACATGTGAAGAGGATGACGCTATTAGATTGGTCGTTGATGATGGAGTTGAAGAATCGTCTCGGGAGTGGAAATCGCCTCCCTGGGGAGAAGTGAAAAATCAGGATGAGCCAATCTTTCAATCTGAAGATGTAAACCAGTCCGAAGTGTTAGAAGGGGAGGGTTTGGGAAGTGACAGAAAGGTGTATTTTCTTGAGGAAACTGATGAAGTTATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTCAGAAGTGCAATGGAATTATTCAGGTCCATGCATTTAGCAGGTCTTCTGCCAAGTTTTCATGCTTCAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCCGTTTGATGATGGTTTACGGATCTTCGAGTTTATGAAGTCAAACAAGCTATCAACAGGGCACACTTATAGCCTTATACTCAAAGCAGTTGCTGATACTCATGGATTTCTTTCTGCTCTTGAGATGTTTAGGACATGGGAGCACGAATATGACTTAAAACAGTTCGATGCAATTGTTTATAACACGATGATATCGGTCTGTGGAAAAGAGAATAACTGGGTTGAAGCTGAGAGAATATGGAGACTGATGGAGGCAAATGGCTGTAGTGCAACACATCTAACTTATTCTCTATTGGTGAGCATGTACGTCCGCTGCAACCAGAACGAACTTGCGATCGACATTTATGTAAAGATGGTTCAAAATGATTTAAAACCAGCTAATGATACAATGCAAGCTATTATTGGCGCATCTTCAAGGGAAGGGAGGTGGGATTTTGCTTTAAGAGTCTTTCAAGATATGTTGAAATGTGGACTCGAACCTAATTCCGTTGCATTCAACACCTTGATCAATGCTCTAGGAAAAGCTAATGAGGTCACTTTAGCATTCAGCATATACAATAGGATGAAACCTATGGGTCATTCACCTGATGTTTATACATGGAAGGCTCTACTCGGTGCTCTTTACAAGGCAAATCGCTACAACGATGCTATTCGTCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCTCAATTGAATATACATATTTACAATACCATTCTATTGTCTTGTTCAAAGCTTGGGTTATGGGATAGGGCTCTCCAAATTTTATGGGAAATGGAGGCCGCCTCTGGTCGCTTAGTTTCGGCATCATCATATAACATTGTTATTAGTGCATGTGAGATGGCTAGGAAGCCAGAAATTGCGTTGCGAGTTTACGAACGCATGATTCATCAGAAGCTCACTCCTGATACCTTCACTCTTTTGTCGCTTATCCGAAGCTGCATTTGGGGATCTTTATGGGATGAAGTGGAACTACTTCTATCTGTAAGTGTTCACCTCTATGACTCGTCCCGTGCCTCGACGACACTCTTATACTATTTTGAATAGGGACAGATCTTCCCTGTAGAAATTTAGTTCAAAATTTTCATCGCATTAGCATCATTGAAATCCATATGGTTGAACTGGAAAATGAGGCTTTCTATTAGACCATTATGTTTTTTGTTGTGGTTTTTTCTCTCTGATGATCTTACTAATCTACAATTCCTTGTCCTGATGTTCTTTCGAACGTTTTTCGACATTTTGCATTGAGAATTATTTGGACGTATACCTGATAGTTGTTGTTCTTCAGAAGTCTGCACACGACGCATCTGTATACAATGCTGCCATCCAAGGAATGTGCTTAAGAGGCAAGACTGATTTAGCAAAAAAGCTTTACACGAAGATGCGCGAAATCGGTATCCAACCAGATGGAAAAA
CACGAGCTTTGATGCTTCAGACGTTGCCGAAGGATCGTGCTGGACTGAGGAACAGGTTGGCTTCTCGTTTCAAGAAAAGGCACAGACATTATCACCACAGGTAACATGAATGAATGTAAAAGAAGGTGTGAAAAGATTAGACTACTGTAAATGAAGTATATATTGTAGATGCATAATGCTAGCTCAAACTCATTGTTGTATAGCTTAGCAGAATCGCTGGTTGATTGCATCAGTTGGCTTATATTCCATAAAAAAGCTCAGATATCTGATGAATCGTATCCAAATTCTACGAACGATAATCATTTGATTTATGATTTTTCAAATTT

mRNA sequence

ATGAGAGGATTGCTTATAAATCCCACTCTGATTTTATCAAATGAGTTGAATTACCAACTTCACTCGTGTTACCCGGTTAGTTGTGCACACAAGCATTTCCATGTTATTCCAGAATTGAAATCATGTATAAGGCGTAGGATAACTCATGGGGGTAATGTAGCTTCAATGTCGTCGATGAGTATTCCACGATTGAATTTCGTGGTTCGGTCCACAAAAGCCTTGGAGTTTAGGACATGTGAAGAGGATGACGCTATTAGATTGGTCGTTGATGATGGAGTTGAAGAATCGTCTCGGGAGTGGAAATCGCCTCCCTGGGGAGAAGTGAAAAATCAGGATGAGCCAATCTTTCAATCTGAAGATGTAAACCAGTCCGAAGTGTTAGAAGGGGAGGGTTTGGGAAGTGACAGAAAGGTGTATTTTCTTGAGGAAACTGATGAAGTTATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTCAGAAGTGCAATGGAATTATTCAGGTCCATGCATTTAGCAGGTCTTCTGCCAAGTTTTCATGCTTCAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCCGTTTGATGATGGTTTACGGATCTTCGAGTTTATGAAGTCAAACAAGCTATCAACAGGGCACACTTATAGCCTTATACTCAAAGCAGTTGCTGATACTCATGGATTTCTTTCTGCTCTTGAGATGTTTAGGACATGGGAGCACGAATATGACTTAAAACAGTTCGATGCAATTGTTTATAACACGATGATATCGGTCTGTGGAAAAGAGAATAACTGGGTTGAAGCTGAGAGAATATGGAGACTGATGGAGGCAAATGGCTGTAGTGCAACACATCTAACTTATTCTCTATTGGTGAGCATGTACGTCCGCTGCAACCAGAACGAACTTGCGATCGACATTTATGTAAAGATGGTTCAAAATGATTTAAAACCAGCTAATGATACAATGCAAGCTATTATTGGCGCATCTTCAAGGGAAGGGAGGTGGGATTTTGCTTTAAGAGTCTTTCAAGATATGTTGAAATGTGGACTCGAACCTAATTCCGTTGCATTCAACACCTTGATCAATGCTCTAGGAAAAGCTAATGAGGTCACTTTAGCATTCAGCATATACAATAGGATGAAACCTATGGGTCATTCACCTGATGTTTATACATGGAAGGCTCTACTCGGTGCTCTTTACAAGGCAAATCGCTACAACGATGCTATTCGTCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCTCAATTGAATATACATATTTACAATACCATTCTATTGTCTTGTTCAAAGCTTGGGTTATGGGATAGGGCTCTCCAAATTTTATGGGAAATGGAGGCCGCCTCTGGTCGCTTAGTTTCGGCATCATCATATAACATTGTTATTAGTGCATGTGAGATGGCTAGGAAGCCAGAAATTGCGTTGCGAGTTTACGAACGCATGATTCATCAGAAGCTCACTCCTGATACCTTCACTCTTTTGTCGCTTATCCGAAGCTGCATTTGGGGATCTTTATGGGATGAAGTGGAACTACTTCTATCTAAGTCTGCACACGACGCATCTGTATACAATGCTGCCATCCAAGGAATGTGCTTAAGAGGCAAGACTGATTTAGCAAAAAAGCTTTACACGAAGATGCGCGAAATCGGTATCCAACCAGATGGAAAAACACGAGCTTTGATGCTTCAGACGTTGCCGAAGGATCGTGCTGGACTGAGGAACAGGTTGGCTTCTCGTTTCAAGAAAAGGCACAGACATTATCACCACAGGTAACATGAATGAATGTAAAAGAAGGTGTGAAAAGATTAGACTACTGTAAATGAAGTATATATTGTAGATGCATAATGCTAGCTCAAACTCATTGTTGTATAGCTTAGCAGAATCGCTGGTTGATTGCATCAGTTGGCTTATATTCCATAAAAAAGCTCAGATATCTGATGAATCGTATCCAAATTCTACGAACGATAATCATT
TGATTTATGATTTTTCAAATTT

Coding sequence (CDS)

ATGAGAGGATTGCTTATAAATCCCACTCTGATTTTATCAAATGAGTTGAATTACCAACTTCACTCGTGTTACCCGGTTAGTTGTGCACACAAGCATTTCCATGTTATTCCAGAATTGAAATCATGTATAAGGCGTAGGATAACTCATGGGGGTAATGTAGCTTCAATGTCGTCGATGAGTATTCCACGATTGAATTTCGTGGTTCGGTCCACAAAAGCCTTGGAGTTTAGGACATGTGAAGAGGATGACGCTATTAGATTGGTCGTTGATGATGGAGTTGAAGAATCGTCTCGGGAGTGGAAATCGCCTCCCTGGGGAGAAGTGAAAAATCAGGATGAGCCAATCTTTCAATCTGAAGATGTAAACCAGTCCGAAGTGTTAGAAGGGGAGGGTTTGGGAAGTGACAGAAAGGTGTATTTTCTTGAGGAAACTGATGAAGTTATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTCAGAAGTGCAATGGAATTATTCAGGTCCATGCATTTAGCAGGTCTTCTGCCAAGTTTTCATGCTTCAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCCGTTTGATGATGGTTTACGGATCTTCGAGTTTATGAAGTCAAACAAGCTATCAACAGGGCACACTTATAGCCTTATACTCAAAGCAGTTGCTGATACTCATGGATTTCTTTCTGCTCTTGAGATGTTTAGGACATGGGAGCACGAATATGACTTAAAACAGTTCGATGCAATTGTTTATAACACGATGATATCGGTCTGTGGAAAAGAGAATAACTGGGTTGAAGCTGAGAGAATATGGAGACTGATGGAGGCAAATGGCTGTAGTGCAACACATCTAACTTATTCTCTATTGGTGAGCATGTACGTCCGCTGCAACCAGAACGAACTTGCGATCGACATTTATGTAAAGATGGTTCAAAATGATTTAAAACCAGCTAATGATACAATGCAAGCTATTATTGGCGCATCTTCAAGGGAAGGGAGGTGGGATTTTGCTTTAAGAGTCTTTCAAGATATGTTGAAATGTGGACTCGAACCTAATTCCGTTGCATTCAACACCTTGATCAATGCTCTAGGAAAAGCTAATGAGGTCACTTTAGCATTCAGCATATACAATAGGATGAAACCTATGGGTCATTCACCTGATGTTTATACATGGAAGGCTCTACTCGGTGCTCTTTACAAGGCAAATCGCTACAACGATGCTATTCGTCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCTCAATTGAATATACATATTTACAATACCATTCTATTGTCTTGTTCAAAGCTTGGGTTATGGGATAGGGCTCTCCAAATTTTATGGGAAATGGAGGCCGCCTCTGGTCGCTTAGTTTCGGCATCATCATATAACATTGTTATTAGTGCATGTGAGATGGCTAGGAAGCCAGAAATTGCGTTGCGAGTTTACGAACGCATGATTCATCAGAAGCTCACTCCTGATACCTTCACTCTTTTGTCGCTTATCCGAAGCTGCATTTGGGGATCTTTATGGGATGAAGTGGAACTACTTCTATCTAAGTCTGCACACGACGCATCTGTATACAATGCTGCCATCCAAGGAATGTGCTTAAGAGGCAAGACTGATTTAGCAAAAAAGCTTTACACGAAGATGCGCGAAATCGGTATCCAACCAGATGGAAAAACACGAGCTTTGATGCTTCAGACGTTGCCGAAGGATCGTGCTGGACTGAGGAACAGGTTGGCTTCTCGTTTCAAGAAAAGGCACAGACATTATCACCACAGGTAA

Protein sequence

MRGLLINPTLILSNELNYQLHSCYPVSCAHKHFHVIPELKSCIRRRITHGGNVASMSSMSIPRLNFVVRSTKALEFRTCEEDDAIRLVVDDGVEESSREWKSPPWGEVKNQDEPIFQSEDVNQSEVLEGEGLGSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHEYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLRNRLASRFKKRHRHYHHR
BLAST of Cp4.1LG01g22400 vs. Swiss-Prot
Match: PP262_ARATH (Pentatricopeptide repeat-containing protein At3g29290 OS=Arabidopsis thaliana GN=EMB2076 PE=2 SV=1)

HSP 1 Score: 505.8 bits (1301), Expect = 6.7e-142
Identity = 255/471 (54.14%), Positives = 336/471 (71.34%), Query Frame = 1

Query: 107 EVKNQDEPIFQSEDVNQSEVLEGEGLGSDRKVYFLEETDEVMLSKRILILSRKNKVRSAM 166
           +V ++ +  F  E+V     LE +  G   +++FLEE +E  LSKR+  LSR +KVRSA+
Sbjct: 68  KVSSELDSSFNGENVVCGLELEEKTAGDRNRIHFLEERNEETLSKRLRKLSRLDKVRSAL 127

Query: 167 ELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVA 226
           ELF SM   GL P+ HA NS L+CLLRNG       +FEFM+  +  TGHTYSL+LKAVA
Sbjct: 128 ELFDSMRFLGLQPNAHACNSFLSCLLRNGDIQKAFTVFEFMRKKENVTGHTYSLMLKAVA 187

Query: 227 DTHGFLSALEMFRTWEHEYDLKQ-FDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSA 286
           +  G  SAL MFR  E E   +  FD ++YNT IS+CG+ NN  E ERIWR+M+ +G   
Sbjct: 188 EVKGCESALRMFRELEREPKRRSCFDVVLYNTAISLCGRINNVYETERIWRVMKGDGHIG 247

Query: 287 THLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVF 346
           T +TYSLLVS++VRC ++ELA+D+Y +MV N +    D M A+I A ++E +WD AL++F
Sbjct: 248 TEITYSLLVSIFVRCGRSELALDVYDEMVNNKISLREDAMYAMISACTKEEKWDLALKIF 307

Query: 347 QDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKALLGALYKA 406
           Q MLK G++PN VA NTLIN+LGKA +V L F +Y+ +K +GH PD YTW ALL ALYKA
Sbjct: 308 QSMLKKGMKPNLVACNTLINSLGKAGKVGLVFKVYSVLKSLGHKPDEYTWNALLTALYKA 367

Query: 407 NRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGRLVSAS 466
           NRY D ++LF+ ++ E    LN ++YNT ++SC KLG W++A+++L+EME  SG  VS S
Sbjct: 368 NRYEDVLQLFDMIRSENLCCLNEYLYNTAMVSCQKLGYWEKAVKLLYEME-GSGLTVSTS 427

Query: 467 SYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSKS 526
           SYN+VISACE +RK ++AL VYE M  +   P+TFT LSL+RSCIWGSLWDEVE +L K 
Sbjct: 428 SYNLVISACEKSRKSKVALLVYEHMAQRDCKPNTFTYLSLVRSCIWGSLWDEVEDILKKV 487

Query: 527 AHDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPK 577
             D S+YNAAI GMCLR +   AK+LY KMRE+G++PDGKTRA+MLQ L K
Sbjct: 488 EPDVSLYNAAIHGMCLRREFKFAKELYVKMREMGLEPDGKTRAMMLQNLKK 537
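
The percentages in an HSP summary line are the match counts divided by the alignment length. The minimal sketch below recomputes them from the first Swiss-Prot hit, assuming the summary format shown on this page. `parse_hsp_stats` is a hypothetical helper, not a BLAST or database function.

```python
import re


def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, percent)} for one HSP summary line."""
    stats = {}
    # Each field looks like 'Label = matches/length (xx.xx%)'.
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        matches, aln_len = int(num), int(den)
        stats[label] = (matches, aln_len, round(100.0 * matches / aln_len, 2))
    return stats


stats = parse_hsp_stats("Identity = 255/471 (54.14%), Positives = 336/471 (71.34%)")
for label, (matches, aln_len, pct) in stats.items():
    print(f"{label}: {matches}/{aln_len} = {pct:.2f}%")
```

The denominator is the alignment length including gap columns, not the length of either full protein, so the percentages describe only the aligned region.
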

BLAST of Cp4.1LG01g22400 vs. Swiss-Prot
Match: PP124_ARATH (Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidopsis thaliana GN=PTAC2 PE=2 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 7.4e-32
Identity = 116/429 (27.04%), Positives = 188/429 (43.82%), Query Frame = 1

Query: 153 ILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKL 212
           I +L R+  +   +E+F  M   G+  S  +  +L+    RNG ++  L + + MK+ K+
Sbjct: 148 ISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSLELLDRMKNEKI 207

Query: 213 STGH-TYSLILKAVA----DTHGFLSALEMFRTWEHEYDLKQFDAIVYNTMISVCGKENN 272
           S    TY+ ++ A A    D  G L    +F    HE    Q D + YNT++S C     
Sbjct: 208 SPSILTYNTVINACARGGLDWEGLLG---LFAEMRHEGI--QPDIVTYNTLLSACAIRGL 267

Query: 273 WVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQA 332
             EAE ++R M   G      TYS LV  + +  + E   D+  +M      P   +   
Sbjct: 268 GDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGSLPDITSYNV 327

Query: 333 IIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMG 392
           ++ A ++ G    A+ VF  M   G  PN+  ++ L+N  G++        ++  MK   
Sbjct: 328 LLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSSN 387

Query: 393 HSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRA 452
             PD  T+  L+    +   + + + LF  +  EE  + ++  Y  I+ +C K GL + A
Sbjct: 388 TDPDAATYNILIEVFGEGGYFKEVVTLFHDMV-EENIEPDMETYEGIIFACGKGGLHEDA 447

Query: 453 LQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIR 512
            +IL  M  A+  + S+ +Y  VI A   A   E AL  +  M      P   T  SL+ 
Sbjct: 448 RKILQYM-TANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEVGSNPSIETFHSLLY 507

Query: 513 SCIWGSLWDEVELLLSKSA-----HDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQP 572
           S   G L  E E +LS+        +   +NA I+     GK + A K Y  M +    P
Sbjct: 508 SFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEAVKTYVDMEKSRCDP 567

BLAST of Cp4.1LG01g22400 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 1.1e-30
Identity = 118/491 (24.03%), Positives = 238/491 (48.47%), Query Frame = 1

Query: 113 EPIFQSEDVNQSEVLEG-EGLGSDRKV--------YFLEETD-EVMLSKRIL-----ILS 172
           EP     +   SE+L   +GLG  +K         +F+++ D + ML   ++     +L 
Sbjct: 125 EPFKDKPESTSSELLAFLKGLGFHKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLG 184

Query: 173 RKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLS-TGH 232
           ++ +V SA  +F  +   G     ++  SL++    +G + + + +F+ M+ +    T  
Sbjct: 185 KEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLI 244

Query: 233 TYSLIL----KAVADTHGFLSALEMFRTWEHEYDLKQFDAIVYNTMISVCGKENNWVEAE 292
           TY++IL    K     +   S +E  ++     D    DA  YNT+I+ C + +   EA 
Sbjct: 245 TYNVILNVFGKMGTPWNKITSLVEKMKS-----DGIAPDAYTYNTLITCCKRGSLHQEAA 304

Query: 293 RIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGAS 352
           +++  M+A G S   +TY+ L+ +Y + ++ + A+ +  +MV N   P+  T  ++I A 
Sbjct: 305 QVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAY 364

Query: 353 SREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDV 412
           +R+G  D A+ +   M + G +P+   + TL++   +A +V  A SI+  M+  G  P++
Sbjct: 365 ARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNI 424

Query: 413 YTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILW 472
            T+ A +       ++ + +++F+ +     +  +I  +NT+L    + G+      +  
Sbjct: 425 CTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSP-DIVTWNTLLAVFGQNGMDSEVSGVFK 484

Query: 473 EMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWG 532
           EM+ A G +    ++N +ISA       E A+ VY RM+   +TPD  T  +++ +   G
Sbjct: 485 EMKRA-GFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARG 544

Query: 533 SLWDEVELLLSKSAHD---------ASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPD 575
            +W++ E +L++              S+ +A   G  +     LA+++Y+ +    I+P 
Sbjct: 545 GMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGV----IEP- 600

BLAST of Cp4.1LG01g22400 vs. Swiss-Prot
Match: PP217_ARATH (Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana GN=At3g06920 PE=2 SV=1)

HSP 1 Score: 123.6 bits (309), Expect = 7.2e-27
Identity = 77/321 (23.99%), Positives = 149/321 (46.42%), Query Frame = 1

Query: 258 MISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQND 317
           M+  C K N   E   + ++M           Y+ L+  +   N +++ + ++ +M +  
Sbjct: 139 MVLGCVKANKLREGYDVVQMMRKFKFRPAFSAYTTLIGAFSAVNHSDMMLTLFQQMQELG 198

Query: 318 LKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAF 377
            +P       +I   ++EGR D AL +  +M    L+ + V +N  I++ GK  +V +A+
Sbjct: 199 YEPTVHLFTTLIRGFAKEGRVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAW 258

Query: 378 SIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLS 437
             ++ ++  G  PD  T+ +++G L KANR ++A+ +FE +++  +     + YNT+++ 
Sbjct: 259 KFFHEIEANGLKPDEVTYTSMIGVLCKANRLDEAVEMFEHLEKNRRVPCT-YAYNTMIMG 318

Query: 438 CSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTP 497
               G +D A  +L E + A G + S  +YN +++      K + AL+V+E M       
Sbjct: 319 YGSAGKFDEAYSLL-ERQRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEEM------- 378

Query: 498 DTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKMRE 557
                                      +A + S YN  I  +C  GK D A +L   M++
Sbjct: 379 ------------------------KKDAAPNLSTYNILIDMLCRAGKLDTAFELRDSMQK 426

Query: 558 IGIQPDGKTRALMLQTLPKDR 579
            G+ P+ +T  +M+  L K +
Sbjct: 439 AGLFPNVRTVNIMVDRLCKSQ 426

BLAST of Cp4.1LG01g22400 vs. Swiss-Prot
Match: PPR37_ARATH (Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN=At1g12620 PE=2 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 1.8e-25
Identity = 108/435 (24.83%), Positives = 188/435 (43.22%), Query Frame = 1

Query: 143 ETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLR 202
           E D V  S  I  L  + +V  A+EL   M   G  P+    N+L+  L  NG   D + 
Sbjct: 139 EPDTVTFSTLINGLCLEGRVSEALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVL 198

Query: 203 IFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFRTWEHEYDLKQFDAIVYNTMISV 262
           + + M          TY  +LK +  +     A+E+ R  E E  +K  DA+ Y+ +I  
Sbjct: 199 LIDRMVETGFQPNEVTYGPVLKVMCKSGQTALAMELLRKME-ERKIK-LDAVKYSIIIDG 258

Query: 263 CGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPA 322
             K+ +   A  ++  ME  G  A  + Y+ L+  +    + +    +   M++  + P 
Sbjct: 259 LCKDGSLDNAFNLFNEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKITPD 318

Query: 323 NDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYN 382
                A+I    +EG+   A  + ++M++ G+ P++V + +LI+   K N++  A  + +
Sbjct: 319 VVAFSALIDCFVKEGKLREAEELHKEMIQRGISPDTVTYTSLIDGFCKENQLDKANHMLD 378

Query: 383 RMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKL 442
            M   G  P++ T+  L+    KAN  +D + LF  +         +  YNT++    +L
Sbjct: 379 LMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTV-TYNTLIQGFCEL 438

Query: 443 GLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFT 502
           G  + A ++  EM +   R     SY I++       +PE AL ++E++   K+      
Sbjct: 439 GKLEVAKELFQEMVSRRVR-PDIVSYKILLDGLCDNGEPEKALEIFEKIEKSKM------ 498

Query: 503 LLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQ 562
                            EL       D  +YN  I GMC   K D A  L+  +   G++
Sbjct: 499 -----------------EL-------DIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVK 539

Query: 563 PDGKTRALMLQTLPK 577
           PD KT  +M+  L K
Sbjct: 559 PDVKTYNIMIGGLCK 539

BLAST of Cp4.1LG01g22400 vs. TrEMBL
Match: A0A0A0KVV0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G642190 PE=4 SV=1)

HSP 1 Score: 891.3 bits (2302), Expect = 6.3e-256
Identity = 458/605 (75.70%), Positives = 508/605 (83.97%), Query Frame = 1

Query: 1   MRGLLIN--PTLILSNELNYQLHSCYPVSCAHKHF----HVIPELKSCIRRRITHGGNVA 60
           MRG+L N  PTL+L NE NYQ  S YP     K      +V P LKS +R  I + GN  
Sbjct: 1   MRGVLGNSSPTLVLLNEFNYQHDSHYPFRREDKRLRQCINVNPMLKSWMRCTIMYDGNAV 60

Query: 61  SMSSMSIPRLNFVVRSTKALEFRTCEEDDAIRLVVDDGVEESSREWKSPPWGEVKNQDEP 120
           S+   S PRLN VV+ST+ +    C ED+AI LV+D+GVEESSREWK PPWG++ +QDE 
Sbjct: 61  SVLPRSTPRLNLVVQSTRGVN---CGEDEAIELVIDEGVEESSREWKLPPWGDIAHQDEA 120

Query: 121 IFQSEDVNQSEVLEGEGLGSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHL 180
            FQSEDVNQ ++LEG+ L ++ K++FLEETD+VMLSKRILILSRKNKVRSA+EL RSM L
Sbjct: 121 TFQSEDVNQPKILEGKVLENESKLHFLEETDKVMLSKRILILSRKNKVRSALELLRSMQL 180

Query: 181 AGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSA 240
           AGLLPS HA NSLLACLLRN  F DGLRIFEFMK N+LSTGHTYSL+LKAVA+ HGFLSA
Sbjct: 181 AGLLPSLHALNSLLACLLRNELFADGLRIFEFMKLNELSTGHTYSLVLKAVANAHGFLSA 240

Query: 241 LEMFRTWEHEYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLV 300
           LEMF+ WEH+  L QFDAIVYNTMIS+CGK+NNWVEAER WRLME NGCSAT +TYSLLV
Sbjct: 241 LEMFKAWEHQCVLAQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCSATRITYSLLV 300

Query: 301 SMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLE 360
           S +VRCNQNELAID YVKMVQN  KP NDTMQAIIGASS+EG+WDFALRVFQDMLKCGL+
Sbjct: 301 STFVRCNQNELAIDTYVKMVQNSFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQ 360

Query: 361 PNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRL 420
           PNSV+FN LINALGKA EVTLAFS+YN MK MGHSPDVYTW ALLGALYKANRY+DAI L
Sbjct: 361 PNSVSFNALINALGKAKEVTLAFSVYNVMKSMGHSPDVYTWNALLGALYKANRYSDAIHL 420

Query: 421 FEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISAC 480
           FEFVKR EK QLNIHIYNTIL+SCSKLGLW+RA+QILWEME  SG  +S SSYNIV++AC
Sbjct: 421 FEFVKR-EKVQLNIHIYNTILMSCSKLGLWERAVQILWEME-VSGLSISTSSYNIVMTAC 480

Query: 481 EMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNA 540
           EMARKPEIAL+VYERM+HQK TPDTFT LSLIR CIWGSLWDEVELLL+KSA D SVYN 
Sbjct: 481 EMARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSAPDVSVYNV 540

Query: 541 AIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLRNRLASRFKKRHR 600
            IQGMCLRGK+DLAKKLYTKMRE GIQPDGKTRALMLQ LPKD A  +NR AS FKKR R
Sbjct: 541 VIQGMCLRGKSDLAKKLYTKMRENGIQPDGKTRALMLQNLPKDPARRKNRWASGFKKRQR 600

BLAST of Cp4.1LG01g22400 vs. TrEMBL
Match: M5WR43_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003127mg PE=4 SV=1)

HSP 1 Score: 617.5 bits (1591), Expect = 1.8e-173
Identity = 308/501 (61.48%), Positives = 391/501 (78.04%), Query Frame = 1

Query: 79  CEEDDAIRLVVDDGVEESSREWKS-PPWGEVKNQDEPIFQSEDVNQSE-VLEGEGLGSDR 138
           CEE++  ++V  +G  E+S   ++ PPWGE+   ++  F+ E   Q E  L+ +   +  
Sbjct: 82  CEEEEEDKVVQREGGYEASFVKQTLPPWGELAIDEDLDFEPEVPIQPESCLKRKASLNVN 141

Query: 139 KVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGP 198
           +V FLEE DE  LSKRIL+LSR NK RSA+ELF SM L+GLLP+ HA NSLL+CLLRN  
Sbjct: 142 RVSFLEEMDEGTLSKRILVLSRTNKTRSALELFTSMELSGLLPNLHACNSLLSCLLRNEL 201

Query: 199 FDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHEYDLKQ-FDAIVY 258
            DDGLR+FEFMK  KL+TGHTYSLILKAV+   G  SA+EMF   E E +++  FD IVY
Sbjct: 202 LDDGLRVFEFMKRKKLATGHTYSLILKAVSVAEGCSSAIEMFVAMEEESEVRDSFDTIVY 261

Query: 259 NTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQ 318
           NTMIS+CGK NNW E ER+WR ++ NG + T +TY LLVS++VRC+Q+ELA+D Y +M+Q
Sbjct: 262 NTMISICGKVNNWRETERLWRHIKENGLTGTRVTYCLLVSIFVRCSQHELALDAYNEMIQ 321

Query: 319 NDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTL 378
           N  +P NDTM AIIGA S++G+WD AL +FQ ML  GL+PN+VA N LIN+LGKA EV L
Sbjct: 322 NKFEPGNDTMHAIIGACSKDGKWDLALNIFQSMLDSGLKPNAVALNALINSLGKAGEVEL 381

Query: 379 AFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTIL 438
           AF +YN MK +GHSPD YTW ALLGALY+ANR++DA+RL+E +K  + +QLN H+YN  L
Sbjct: 382 AFRVYNIMKSLGHSPDAYTWNALLGALYRANRHDDALRLYESIKTSQGSQLNSHLYNMAL 441

Query: 439 LSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKL 498
           +SCSKLGLWD+AL++LW++E ASG+ VS +SYN+V+SACE ARKP++AL+VYE M+HQK 
Sbjct: 442 MSCSKLGLWDKALKLLWQLE-ASGQSVSTASYNLVVSACEKARKPKVALQVYEHMVHQKC 501

Query: 499 TPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKM 558
           TPD FT LSLIR CIWGSLWDEVE +L+ +A D S+YNAAIQGMCLRGK +LAKK+YTKM
Sbjct: 502 TPDIFTYLSLIRGCIWGSLWDEVEEILNWAAPDMSLYNAAIQGMCLRGKIELAKKIYTKM 561

Query: 559 REIGIQPDGKTRALMLQTLPK 577
           RE G+QPDGKTRA+MLQ L +
Sbjct: 562 RENGLQPDGKTRAMMLQNLQR 581

BLAST of Cp4.1LG01g22400 vs. TrEMBL
Match: W9R9N1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_018143 PE=4 SV=1)

HSP 1 Score: 597.0 bits (1538), Expect = 2.5e-167
Identity = 295/496 (59.48%), Positives = 380/496 (76.61%), Query Frame = 1

Query: 103 PPWGEVKNQDEPIFQSEDVNQSEVLEGEGLGSD---RKVYFLEETDEVMLSKRILILSRK 162
           PPW  ++   +  F+ + +N  +V+  E    +     V+FLEE DE  LS RIL+LSR 
Sbjct: 50  PPWRNLETSKDLDFEPDGLNPPKVVPREKAVLNLNVNSVHFLEEVDEAKLSNRILVLSRT 109

Query: 163 NKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYS 222
           NKVRSA+EL RSM L+GL P  HA NSLL+CLLRN   DDGLR+FEFMK+ K++TGHTYS
Sbjct: 110 NKVRSALELLRSMELSGLRPDLHACNSLLSCLLRNELVDDGLRVFEFMKTEKITTGHTYS 169

Query: 223 LILKAVADTHGFLSALEMFRTWEHEYDLKQ-FDAIVYNTMISVCGKENNWVEAERIWRLM 282
           L+LKAV D  G  +AL MF   E E  +K  FDA+VYNTMISVCG+ NNW+E  R+WR M
Sbjct: 170 LVLKAVTDAKGCDAALRMFSEMERECGVKNGFDAVVYNTMISVCGRVNNWLETLRLWRSM 229

Query: 283 EANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRW 342
           + N    T +TY LLVS++VRC QN+LA+D Y +MVQ+  +P  DTMQAIIGA ++EG+W
Sbjct: 230 KENCRIGTRITYCLLVSIFVRCGQNDLALDAYGEMVQSKFEPGKDTMQAIIGACAKEGKW 289

Query: 343 DFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKAL 402
           DFAL +FQ MLK GL+PN++A N +IN+LGKA E+ LAF +++ MK +GH PD YTW AL
Sbjct: 290 DFALSIFQSMLKKGLKPNAIACNAVINSLGKAGEIKLAFRVFDVMKSLGHLPDTYTWNAL 349

Query: 403 LGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAAS 462
           LGALY+AN ++DA+RLFE +K+++ ++LN+H+YN  L+SCSKLGLW+RA+Q+LW+ME A+
Sbjct: 350 LGALYRANLHDDALRLFERIKQDQDSELNLHLYNIALISCSKLGLWERAVQLLWQME-AN 409

Query: 463 GRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEV 522
           G  +SA+SYN+VI+ACE ARKP++A++VYE M+H K  PDTFT LSLIR CIWGSLW+EV
Sbjct: 410 GMSISAASYNLVINACETARKPDVAVQVYEHMVHLKCIPDTFTHLSLIRVCIWGSLWNEV 469

Query: 523 ELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDR 582
           E +L+++A DAS+YNAAIQGMCLRGK D AKKLY KMR  G+QPDGKTRA+MLQ L KD 
Sbjct: 470 EEILNQAAPDASLYNAAIQGMCLRGKIDTAKKLYAKMRNCGLQPDGKTRAMMLQNLRKDS 529

Query: 583 AGLRNRLASRFKKRHR 595
              + R  SR K+R R
Sbjct: 530 VKHKYRPPSRHKRRTR 544

BLAST of Cp4.1LG01g22400 vs. TrEMBL
Match: A0A061EF27_THECC (Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_018701 PE=4 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 3.0e-165
Identity = 304/524 (58.02%), Positives = 386/524 (73.66%), Query Frame = 1

Query: 81  EDDAIRLVVDDGVEE---SSREWKSPPWGEVKNQDEPIFQSEDVNQSEVLE-GEGLGSDR 140
           EDD   +++  G EE    S     PPWG +   +   F+   V Q  +   G+    D 
Sbjct: 108 EDDEENILIQKGKEEFGLDSLGQNLPPWGNLVVDESLDFEHTSVGQPAISSNGKDSVHDS 167

Query: 141 KVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGP 200
           KV+FLEET+E  LS+R+L+LSR NKVRSA+EL RSM L+GL PS HA NSLL+CLLRNG 
Sbjct: 168 KVHFLEETNEEELSRRVLMLSRSNKVRSALELCRSMKLSGLQPSAHACNSLLSCLLRNGL 227

Query: 201 FDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHEYDLKQ-FDAIVY 260
            DD LR FEFMK+N+L TGHTYSLILKA+ADT G  +AL+MF   E +Y+ K+ FD IVY
Sbjct: 228 VDDALRTFEFMKANELITGHTYSLILKAIADTQGCDAALDMFAELERDYEQKKGFDVIVY 287

Query: 261 NTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQ 320
           NT +S+CG+ NNWVE ER+WR +  NG S T +TYSLLVS++VRCNQNELA+D Y +M++
Sbjct: 288 NTALSICGRWNNWVETERVWRRILENGYSGTQVTYSLLVSIFVRCNQNELALDAYDEMIR 347

Query: 321 NDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTL 380
           N L+P +DTM A+I A ++E +WD AL +FQ +L  GL+PN VA N LIN+LGKA EV L
Sbjct: 348 NGLEPRDDTMHAVISACTKEEKWDLALSIFQKILNDGLKPNPVACNALINSLGKAGEVRL 407

Query: 381 AFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTIL 440
           AF IY+ MK +GH+PD +TW +LLGALY+AN+Y DA+ LFE + R++ +  N+H+YNT L
Sbjct: 408 AFKIYDIMKSLGHTPDAFTWNSLLGALYRANQYADALHLFERI-RKQSSLANVHLYNTAL 467

Query: 441 LSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKL 500
           +SC KLGLWDRALQ+LW+ME ASG LVS +SYN+VISACE ARKP++AL+VY+ MIHQK 
Sbjct: 468 MSCQKLGLWDRALQLLWQME-ASGLLVSTASYNLVISACETARKPKVALQVYDHMIHQKC 527

Query: 501 TPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKM 560
            PDTFT LSLIRSCIWGSLW EVE +L++   + S+YNA I GMCL+GK + AKKLY +M
Sbjct: 528 VPDTFTHLSLIRSCIWGSLWAEVEEILNRVPENVSLYNAVIHGMCLKGKVESAKKLYMRM 587

Query: 561 REIGIQPDGKTRALMLQTLPKDRAGLRNRLASRFKKRHRHYHHR 600
           R+ G++PDGKTRALMLQ L KD          + K +   YH R
Sbjct: 588 RKNGLKPDGKTRALMLQNLRKD----------QIKVKRSSYHSR 619

BLAST of Cp4.1LG01g22400 vs. TrEMBL
Match: D7SXI6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0108g01490 PE=4 SV=1)

HSP 1 Score: 580.9 bits (1496), Expect = 1.8e-162
Identity = 299/514 (58.17%), Positives = 383/514 (74.51%), Query Frame = 1

Query: 79  CEEDDAIRLVV--DDGVEESSREWKSPPWGEVKNQDEPIFQSEDVNQSEVLEGEGLGSD- 138
           CEED+   L     +  + S  E K PP G  +    P F+   V +   +   G+ S+ 
Sbjct: 111 CEEDEENMLNQRRKEEFDPSYFEQKFPPLGNSEIHKNPDFEHIGVAEPLTIS-TGISSEF 170

Query: 139 -RKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRN 198
             K++FLEE +E +LSKRIL+LSR NKVRS +EL+R+M  +GL PS HA NSLL+CLLRN
Sbjct: 171 EDKLHFLEERNEQILSKRILMLSRSNKVRSVLELYRTMEFSGLQPSSHACNSLLSCLLRN 230

Query: 199 GPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHEYDLKQ-FDAI 258
              DD LR+FE MK+N+ +TGH+YSL+LKA+A+  G+ SAL+MF   E E  +K+ FD I
Sbjct: 231 EMLDDALRVFESMKANESTTGHSYSLVLKAIANIQGYDSALKMFSKLEVECKVKKDFDVI 290

Query: 259 VYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKM 318
            Y TMIS+CGK NNW + ERIWR M+ NG   T +TY LLVS++VRC QNELAID Y +M
Sbjct: 291 AYTTMISICGKVNNWAQTERIWRSMKENGLVGTIVTYRLLVSVFVRCGQNELAIDAYSEM 350

Query: 319 VQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEV 378
           +QN LKP  D M+AIIGA ++EG+WD AL VFQ ML  GL+PN +A N LIN++GK+  V
Sbjct: 351 IQNGLKPGEDAMKAIIGACAKEGKWDLALSVFQSMLNVGLKPNLIACNALINSIGKSGNV 410

Query: 379 TLAFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNT 438
            LAF +Y+ MK +GH+PDVYTW ALLGALY+AN++ DA+ LFE + RE+ +Q+N+H+YNT
Sbjct: 411 KLAFRVYDVMKSLGHTPDVYTWNALLGALYRANQHADALHLFESI-REQSSQVNLHLYNT 470

Query: 439 ILLSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQ 498
            L+SC KLGLW+RALQ+LW+ME ASG  VS++SYN+VI ACE+ARKPEIAL+VYE M+ Q
Sbjct: 471 ALMSCQKLGLWNRALQLLWQME-ASGLSVSSASYNLVIGACEVARKPEIALQVYEHMVQQ 530

Query: 499 KLTPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYT 558
           + TPDTFT LSLIRSCIWGSLW EV+ +L+++  D S+YNAAIQGMCLRGK + AKKLY 
Sbjct: 531 QCTPDTFTHLSLIRSCIWGSLWAEVKEILNRAGTDVSLYNAAIQGMCLRGKIESAKKLYM 590

Query: 559 KMREIGIQPDGKTRALMLQTLPKDRAGLRNRLAS 588
           +MR+ G++PDGKTRALMLQ L KD     NR  S
Sbjct: 591 RMRKSGLKPDGKTRALMLQNLQKDAIRPSNRRIS 621

BLAST of Cp4.1LG01g22400 vs. TAIR10
Match: AT3G29290.1 (AT3G29290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 505.8 bits (1301), Expect = 3.8e-143
Identity = 255/471 (54.14%), Positives = 336/471 (71.34%), Query Frame = 1

Query: 107 EVKNQDEPIFQSEDVNQSEVLEGEGLGSDRKVYFLEETDEVMLSKRILILSRKNKVRSAM 166
           +V ++ +  F  E+V     LE +  G   +++FLEE +E  LSKR+  LSR +KVRSA+
Sbjct: 68  KVSSELDSSFNGENVVCGLELEEKTAGDRNRIHFLEERNEETLSKRLRKLSRLDKVRSAL 127

Query: 167 ELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVA 226
           ELF SM   GL P+ HA NS L+CLLRNG       +FEFM+  +  TGHTYSL+LKAVA
Sbjct: 128 ELFDSMRFLGLQPNAHACNSFLSCLLRNGDIQKAFTVFEFMRKKENVTGHTYSLMLKAVA 187

Query: 227 DTHGFLSALEMFRTWEHEYDLKQ-FDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSA 286
           +  G  SAL MFR  E E   +  FD ++YNT IS+CG+ NN  E ERIWR+M+ +G   
Sbjct: 188 EVKGCESALRMFRELEREPKRRSCFDVVLYNTAISLCGRINNVYETERIWRVMKGDGHIG 247

Query: 287 THLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVF 346
           T +TYSLLVS++VRC ++ELA+D+Y +MV N +    D M A+I A ++E +WD AL++F
Sbjct: 248 TEITYSLLVSIFVRCGRSELALDVYDEMVNNKISLREDAMYAMISACTKEEKWDLALKIF 307

Query: 347 QDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKALLGALYKA 406
           Q MLK G++PN VA NTLIN+LGKA +V L F +Y+ +K +GH PD YTW ALL ALYKA
Sbjct: 308 QSMLKKGMKPNLVACNTLINSLGKAGKVGLVFKVYSVLKSLGHKPDEYTWNALLTALYKA 367

Query: 407 NRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGRLVSAS 466
           NRY D ++LF+ ++ E    LN ++YNT ++SC KLG W++A+++L+EME  SG  VS S
Sbjct: 368 NRYEDVLQLFDMIRSENLCCLNEYLYNTAMVSCQKLGYWEKAVKLLYEME-GSGLTVSTS 427

Query: 467 SYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSKS 526
           SYN+VISACE +RK ++AL VYE M  +   P+TFT LSL+RSCIWGSLWDEVE +L K 
Sbjct: 428 SYNLVISACEKSRKSKVALLVYEHMAQRDCKPNTFTYLSLVRSCIWGSLWDEVEDILKKV 487

Query: 527 AHDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPK 577
             D S+YNAAI GMCLR +   AK+LY KMRE+G++PDGKTRA+MLQ L K
Sbjct: 488 EPDVSLYNAAIHGMCLRREFKFAKELYVKMREMGLEPDGKTRAMMLQNLKK 537

BLAST of Cp4.1LG01g22400 vs. TAIR10
Match: AT1G74850.1 (AT1G74850.1 plastid transcriptionally active 2)

HSP 1 Score: 140.2 bits (352), Expect = 4.2e-33
Identity = 116/429 (27.04%), Positives = 188/429 (43.82%), Query Frame = 1

Query: 153 ILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKL 212
           I +L R+  +   +E+F  M   G+  S  +  +L+    RNG ++  L + + MK+ K+
Sbjct: 148 ISLLGREGLLDKCLEVFDEMPSQGVSRSVFSYTALINAYGRNGRYETSLELLDRMKNEKI 207

Query: 213 STGH-TYSLILKAVA----DTHGFLSALEMFRTWEHEYDLKQFDAIVYNTMISVCGKENN 272
           S    TY+ ++ A A    D  G L    +F    HE    Q D + YNT++S C     
Sbjct: 208 SPSILTYNTVINACARGGLDWEGLLG---LFAEMRHEGI--QPDIVTYNTLLSACAIRGL 267

Query: 273 WVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQA 332
             EAE ++R M   G      TYS LV  + +  + E   D+  +M      P   +   
Sbjct: 268 GDEAEMVFRTMNDGGIVPDLTTYSHLVETFGKLRRLEKVCDLLGEMASGGSLPDITSYNV 327

Query: 333 IIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMG 392
           ++ A ++ G    A+ VF  M   G  PN+  ++ L+N  G++        ++  MK   
Sbjct: 328 LLEAYAKSGSIKEAMGVFHQMQAAGCTPNANTYSVLLNLFGQSGRYDDVRQLFLEMKSSN 387

Query: 393 HSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRA 452
             PD  T+  L+    +   + + + LF  +  EE  + ++  Y  I+ +C K GL + A
Sbjct: 388 TDPDAATYNILIEVFGEGGYFKEVVTLFHDMV-EENIEPDMETYEGIIFACGKGGLHEDA 447

Query: 453 LQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIR 512
            +IL  M  A+  + S+ +Y  VI A   A   E AL  +  M      P   T  SL+ 
Sbjct: 448 RKILQYM-TANDIVPSSKAYTGVIEAFGQAALYEEALVAFNTMHEVGSNPSIETFHSLLY 507

Query: 513 SCIWGSLWDEVELLLSKSA-----HDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQP 572
           S   G L  E E +LS+        +   +NA I+     GK + A K Y  M +    P
Sbjct: 508 SFARGGLVKESEAILSRLVDSGIPRNRDTFNAQIEAYKQGGKFEEAVKTYVDMEKSRCDP 567

BLAST of Cp4.1LG01g22400 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 136.3 bits (342), Expect = 6.0e-32
Identity = 118/491 (24.03%), Positives = 238/491 (48.47%), Query Frame = 1

Query: 113 EPIFQSEDVNQSEVLEG-EGLGSDRKV--------YFLEETD-EVMLSKRIL-----ILS 172
           EP     +   SE+L   +GLG  +K         +F+++ D + ML   ++     +L 
Sbjct: 125 EPFKDKPESTSSELLAFLKGLGFHKKFDLALRAFDWFMKQKDYQSMLDNSVVAIIISMLG 184

Query: 173 RKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLS-TGH 232
           ++ +V SA  +F  +   G     ++  SL++    +G + + + +F+ M+ +    T  
Sbjct: 185 KEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLI 244

Query: 233 TYSLIL----KAVADTHGFLSALEMFRTWEHEYDLKQFDAIVYNTMISVCGKENNWVEAE 292
           TY++IL    K     +   S +E  ++     D    DA  YNT+I+ C + +   EA 
Sbjct: 245 TYNVILNVFGKMGTPWNKITSLVEKMKS-----DGIAPDAYTYNTLITCCKRGSLHQEAA 304

Query: 293 RIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGAS 352
           +++  M+A G S   +TY+ L+ +Y + ++ + A+ +  +MV N   P+  T  ++I A 
Sbjct: 305 QVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAY 364

Query: 353 SREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDV 412
           +R+G  D A+ +   M + G +P+   + TL++   +A +V  A SI+  M+  G  P++
Sbjct: 365 ARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNI 424

Query: 413 YTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILW 472
            T+ A +       ++ + +++F+ +     +  +I  +NT+L    + G+      +  
Sbjct: 425 CTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSP-DIVTWNTLLAVFGQNGMDSEVSGVFK 484

Query: 473 EMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWG 532
           EM+ A G +    ++N +ISA       E A+ VY RM+   +TPD  T  +++ +   G
Sbjct: 485 EMKRA-GFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARG 544

Query: 533 SLWDEVELLLSKSAHD---------ASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPD 575
            +W++ E +L++              S+ +A   G  +     LA+++Y+ +    I+P 
Sbjct: 545 GMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGV----IEP- 600

BLAST of Cp4.1LG01g22400 vs. TAIR10
Match: AT3G06920.1 (AT3G06920.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 123.6 bits (309), Expect = 4.0e-28
Identity = 77/321 (23.99%), Positives = 149/321 (46.42%), Query Frame = 1

Query: 258 MISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQND 317
           M+  C K N   E   + ++M           Y+ L+  +   N +++ + ++ +M +  
Sbjct: 139 MVLGCVKANKLREGYDVVQMMRKFKFRPAFSAYTTLIGAFSAVNHSDMMLTLFQQMQELG 198

Query: 318 LKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAF 377
            +P       +I   ++EGR D AL +  +M    L+ + V +N  I++ GK  +V +A+
Sbjct: 199 YEPTVHLFTTLIRGFAKEGRVDSALSLLDEMKSSSLDADIVLYNVCIDSFGKVGKVDMAW 258

Query: 378 SIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLS 437
             ++ ++  G  PD  T+ +++G L KANR ++A+ +FE +++  +     + YNT+++ 
Sbjct: 259 KFFHEIEANGLKPDEVTYTSMIGVLCKANRLDEAVEMFEHLEKNRRVPCT-YAYNTMIMG 318

Query: 438 CSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTP 497
               G +D A  +L E + A G + S  +YN +++      K + AL+V+E M       
Sbjct: 319 YGSAGKFDEAYSLL-ERQRAKGSIPSVIAYNCILTCLRKMGKVDEALKVFEEM------- 378

Query: 498 DTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKMRE 557
                                      +A + S YN  I  +C  GK D A +L   M++
Sbjct: 379 ------------------------KKDAAPNLSTYNILIDMLCRAGKLDTAFELRDSMQK 426

Query: 558 IGIQPDGKTRALMLQTLPKDR 579
            G+ P+ +T  +M+  L K +
Sbjct: 439 AGLFPNVRTVNIMVDRLCKSQ 426

BLAST of Cp4.1LG01g22400 vs. TAIR10
Match: AT1G12620.1 (AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 119.0 bits (297), Expect = 9.9e-27
Identity = 108/435 (24.83%), Positives = 188/435 (43.22%), Query Frame = 1

Query: 143 ETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLR 202
           E D V  S  I  L  + +V  A+EL   M   G  P+    N+L+  L  NG   D + 
Sbjct: 139 EPDTVTFSTLINGLCLEGRVSEALELVDRMVEMGHKPTLITLNALVNGLCLNGKVSDAVL 198

Query: 203 IFEFMKSNKLSTGH-TYSLILKAVADTHGFLSALEMFRTWEHEYDLKQFDAIVYNTMISV 262
           + + M          TY  +LK +  +     A+E+ R  E E  +K  DA+ Y+ +I  
Sbjct: 199 LIDRMVETGFQPNEVTYGPVLKVMCKSGQTALAMELLRKME-ERKIK-LDAVKYSIIIDG 258

Query: 263 CGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPA 322
             K+ +   A  ++  ME  G  A  + Y+ L+  +    + +    +   M++  + P 
Sbjct: 259 LCKDGSLDNAFNLFNEMEIKGFKADIIIYTTLIRGFCYAGRWDDGAKLLRDMIKRKITPD 318

Query: 323 NDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTLAFSIYN 382
                A+I    +EG+   A  + ++M++ G+ P++V + +LI+   K N++  A  + +
Sbjct: 319 VVAFSALIDCFVKEGKLREAEELHKEMIQRGISPDTVTYTSLIDGFCKENQLDKANHMLD 378

Query: 383 RMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKL 442
            M   G  P++ T+  L+    KAN  +D + LF  +         +  YNT++    +L
Sbjct: 379 LMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMSLRGVVADTV-TYNTLIQGFCEL 438

Query: 443 GLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKLTPDTFT 502
           G  + A ++  EM +   R     SY I++       +PE AL ++E++   K+      
Sbjct: 439 GKLEVAKELFQEMVSRRVR-PDIVSYKILLDGLCDNGEPEKALEIFEKIEKSKM------ 498

Query: 503 LLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQ 562
                            EL       D  +YN  I GMC   K D A  L+  +   G++
Sbjct: 499 -----------------EL-------DIGIYNIIIHGMCNASKVDDAWDLFCSLPLKGVK 539

Query: 563 PDGKTRALMLQTLPK 577
           PD KT  +M+  L K
Sbjct: 559 PDVKTYNIMIGGLCK 539

BLAST of Cp4.1LG01g22400 vs. NCBI nr
Match: gi|659118986|ref|XP_008459413.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cucumis melo])

HSP 1 Score: 901.0 bits (2327), Expect = 1.1e-258
Identity = 461/612 (75.33%), Positives = 510/612 (83.33%), Query Frame = 1

Query: 1   MRGLLIN--PTLILSNELNYQLHSCYPVSCAHKHF----HVIPELKSCIRRRITHGGNVA 60
           MRG+L N  PTLIL NE NYQ  S YP     KH     +V P LKSC+R  I + GN  
Sbjct: 1   MRGVLGNSSPTLILLNEFNYQHDSYYPFRREDKHLRQCINVNPMLKSCMRCTIMYDGNAV 60

Query: 61  SMSSMSIPRLNFVVRSTKALEFRT-------CEEDDAIRLVVDDGVEESSREWKSPPWGE 120
           SM  MS PRLN VV+S + ++FRT       C ED+AI LV+D+   ESSREWK PPWG+
Sbjct: 61  SMLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGD 120

Query: 121 VKNQDEPIFQSEDVNQSEVLEGEGLGSDRKVYFLEETDEVMLSKRILILSRKNKVRSAME 180
           + +QDE  FQSEDVN  ++LEGE L ++ KV+FLEETD+V+LSKRILILSRKNKVRSA+E
Sbjct: 121 MTHQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALE 180

Query: 181 LFRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVAD 240
           LFRSM LAG+LP+ HA NSLLACLLRNG F DGLRIFEFMK N+LSTGHTYSL+LKAVA+
Sbjct: 181 LFRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVAN 240

Query: 241 THGFLSALEMFRTWEHEYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATH 300
            HGFLSALEMF+ WEH+Y L QFDAIVYNTMIS+CGK+NNWVEAER WRLME NGC+ATH
Sbjct: 241 AHGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATH 300

Query: 301 LTYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQD 360
           +TYSLLVS +VRCNQNELAID YVKMVQ+  KP NDTMQAIIGASS+EG+WDFAL VFQD
Sbjct: 301 ITYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPGNDTMQAIIGASSKEGKWDFALGVFQD 360

Query: 361 MLKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKALLGALYKANR 420
           MLKCGL+PNSV+FN LINALGKA EVTLAFSIYN MK MGHSPDVYTW ALLGALYKANR
Sbjct: 361 MLKCGLQPNSVSFNALINALGKAKEVTLAFSIYNVMKSMGHSPDVYTWNALLGALYKANR 420

Query: 421 YNDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGRLVSASSY 480
           YNDAI LF FVKREEKAQLNIHIYNTIL+ CSKLGLW+RALQILWEME  SG L+S +SY
Sbjct: 421 YNDAIHLFGFVKREEKAQLNIHIYNTILMCCSKLGLWERALQILWEME-VSGLLISTTSY 480

Query: 481 NIVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSKSAH 540
           NIV++ACE ARKPEIAL+VYERM+HQK TPDTFT LSLIR CIWGSLWDEVELLL+KS  
Sbjct: 481 NIVLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGP 540

Query: 541 DASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLRNRLAS 600
           D SVYN  IQGMCLRGKTDLAKKLYTKMRE  IQ DGKTRALMLQ LPKD A L+NR AS
Sbjct: 541 DVSVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWAS 600

BLAST of Cp4.1LG01g22400 vs. NCBI nr
Match: gi|449447683|ref|XP_004141597.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g29290 [Cucumis sativus])

HSP 1 Score: 891.3 bits (2302), Expect = 9.0e-256
Identity = 458/605 (75.70%), Positives = 508/605 (83.97%), Query Frame = 1

Query: 1   MRGLLIN--PTLILSNELNYQLHSCYPVSCAHKHF----HVIPELKSCIRRRITHGGNVA 60
           MRG+L N  PTL+L NE NYQ  S YP     K      +V P LKS +R  I + GN  
Sbjct: 1   MRGVLGNSSPTLVLLNEFNYQHDSHYPFRREDKRLRQCINVNPMLKSWMRCTIMYDGNAV 60

Query: 61  SMSSMSIPRLNFVVRSTKALEFRTCEEDDAIRLVVDDGVEESSREWKSPPWGEVKNQDEP 120
           S+   S PRLN VV+ST+ +    C ED+AI LV+D+GVEESSREWK PPWG++ +QDE 
Sbjct: 61  SVLPRSTPRLNLVVQSTRGVN---CGEDEAIELVIDEGVEESSREWKLPPWGDIAHQDEA 120

Query: 121 IFQSEDVNQSEVLEGEGLGSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHL 180
            FQSEDVNQ ++LEG+ L ++ K++FLEETD+VMLSKRILILSRKNKVRSA+EL RSM L
Sbjct: 121 TFQSEDVNQPKILEGKVLENESKLHFLEETDKVMLSKRILILSRKNKVRSALELLRSMQL 180

Query: 181 AGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSA 240
           AGLLPS HA NSLLACLLRN  F DGLRIFEFMK N+LSTGHTYSL+LKAVA+ HGFLSA
Sbjct: 181 AGLLPSLHALNSLLACLLRNELFADGLRIFEFMKLNELSTGHTYSLVLKAVANAHGFLSA 240

Query: 241 LEMFRTWEHEYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLV 300
           LEMF+ WEH+  L QFDAIVYNTMIS+CGK+NNWVEAER WRLME NGCSAT +TYSLLV
Sbjct: 241 LEMFKAWEHQCVLAQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCSATRITYSLLV 300

Query: 301 SMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLE 360
           S +VRCNQNELAID YVKMVQN  KP NDTMQAIIGASS+EG+WDFALRVFQDMLKCGL+
Sbjct: 301 STFVRCNQNELAIDTYVKMVQNSFKPGNDTMQAIIGASSKEGKWDFALRVFQDMLKCGLQ 360

Query: 361 PNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRL 420
           PNSV+FN LINALGKA EVTLAFS+YN MK MGHSPDVYTW ALLGALYKANRY+DAI L
Sbjct: 361 PNSVSFNALINALGKAKEVTLAFSVYNVMKSMGHSPDVYTWNALLGALYKANRYSDAIHL 420

Query: 421 FEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISAC 480
           FEFVKR EK QLNIHIYNTIL+SCSKLGLW+RA+QILWEME  SG  +S SSYNIV++AC
Sbjct: 421 FEFVKR-EKVQLNIHIYNTILMSCSKLGLWERAVQILWEME-VSGLSISTSSYNIVMTAC 480

Query: 481 EMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNA 540
           EMARKPEIAL+VYERM+HQK TPDTFT LSLIR CIWGSLWDEVELLL+KSA D SVYN 
Sbjct: 481 EMARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSAPDVSVYNV 540

Query: 541 AIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLRNRLASRFKKRHR 600
            IQGMCLRGK+DLAKKLYTKMRE GIQPDGKTRALMLQ LPKD A  +NR AS FKKR R
Sbjct: 541 VIQGMCLRGKSDLAKKLYTKMRENGIQPDGKTRALMLQNLPKDPARRKNRWASGFKKRQR 600

BLAST of Cp4.1LG01g22400 vs. NCBI nr
Match: gi|659118988|ref|XP_008459414.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Cucumis melo])

HSP 1 Score: 857.4 bits (2214), Expect = 1.4e-245
Identity = 430/551 (78.04%), Positives = 475/551 (86.21%), Query Frame = 1

Query: 56  MSSMSIPRLNFVVRSTKALEFRT-------CEEDDAIRLVVDDGVEESSREWKSPPWGEV 115
           M  MS PRLN VV+S + ++FRT       C ED+AI LV+D+   ESSREWK PPWG++
Sbjct: 1   MLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGDM 60

Query: 116 KNQDEPIFQSEDVNQSEVLEGEGLGSDRKVYFLEETDEVMLSKRILILSRKNKVRSAMEL 175
            +QDE  FQSEDVN  ++LEGE L ++ KV+FLEETD+V+LSKRILILSRKNKVRSA+EL
Sbjct: 61  THQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALEL 120

Query: 176 FRSMHLAGLLPSFHASNSLLACLLRNGPFDDGLRIFEFMKSNKLSTGHTYSLILKAVADT 235
           FRSM LAG+LP+ HA NSLLACLLRNG F DGLRIFEFMK N+LSTGHTYSL+LKAVA+ 
Sbjct: 121 FRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVANA 180

Query: 236 HGFLSALEMFRTWEHEYDLKQFDAIVYNTMISVCGKENNWVEAERIWRLMEANGCSATHL 295
           HGFLSALEMF+ WEH+Y L QFDAIVYNTMIS+CGK+NNWVEAER WRLME NGC+ATH+
Sbjct: 181 HGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATHI 240

Query: 296 TYSLLVSMYVRCNQNELAIDIYVKMVQNDLKPANDTMQAIIGASSREGRWDFALRVFQDM 355
           TYSLLVS +VRCNQNELAID YVKMVQ+  KP NDTMQAIIGASS+EG+WDFAL VFQDM
Sbjct: 241 TYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPGNDTMQAIIGASSKEGKWDFALGVFQDM 300

Query: 356 LKCGLEPNSVAFNTLINALGKANEVTLAFSIYNRMKPMGHSPDVYTWKALLGALYKANRY 415
           LKCGL+PNSV+FN LINALGKA EVTLAFSIYN MK MGHSPDVYTW ALLGALYKANRY
Sbjct: 301 LKCGLQPNSVSFNALINALGKAKEVTLAFSIYNVMKSMGHSPDVYTWNALLGALYKANRY 360

Query: 416 NDAIRLFEFVKREEKAQLNIHIYNTILLSCSKLGLWDRALQILWEMEAASGRLVSASSYN 475
           NDAI LF FVKREEKAQLNIHIYNTIL+ CSKLGLW+RALQILWEME  SG L+S +SYN
Sbjct: 361 NDAIHLFGFVKREEKAQLNIHIYNTILMCCSKLGLWERALQILWEME-VSGLLISTTSYN 420

Query: 476 IVISACEMARKPEIALRVYERMIHQKLTPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHD 535
           IV++ACE ARKPEIAL+VYERM+HQK TPDTFT LSLIR CIWGSLWDEVELLL+KS  D
Sbjct: 421 IVLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 480

Query: 536 ASVYNAAIQGMCLRGKTDLAKKLYTKMREIGIQPDGKTRALMLQTLPKDRAGLRNRLASR 595
            SVYN  IQGMCLRGKTDLAKKLYTKMRE  IQ DGKTRALMLQ LPKD A L+NR AS 
Sbjct: 481 VSVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWASG 540

Query: 596 FKKRHRHYHHR 600
           FKKR R YHHR
Sbjct: 541 FKKRRRRYHHR 550

BLAST of Cp4.1LG01g22400 vs. NCBI nr
Match: gi|645248486|ref|XP_008230319.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g29290 [Prunus mume])

HSP 1 Score: 619.4 bits (1596), Expect = 6.6e-174
Identity = 313/522 (59.96%), Positives = 400/522 (76.63%), Query Frame = 1

Query: 79  CEEDDAIRLVVDDGVEESSREWKS-PPWGEVK-NQDEPIFQSEDVNQSEVLEGEGLGSDR 138
           CEE++  ++V  +G  E+S   ++ PPWGE+  ++D  I     +     L+ +   ++ 
Sbjct: 82  CEEEEEDKMVQREGGYETSFVKQALPPWGELAIDEDLDIEPEVPIQPESCLKRKASLNEN 141

Query: 139 KVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGP 198
           +V FLEE DE  LSKRIL+LSR NK RSA+ELF SM L+GLLP+ HA NSLL+CLLRN  
Sbjct: 142 RVSFLEEMDEETLSKRILVLSRTNKTRSALELFTSMELSGLLPNLHACNSLLSCLLRNEL 201

Query: 199 FDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHEYDLKQ-FDAIVY 258
            DDGLR+FEFMK  KL+TGHTYSLILKAV+   G  SA+EMF   E E +++  FD IVY
Sbjct: 202 LDDGLRVFEFMKRKKLATGHTYSLILKAVSVAEGCSSAIEMFVAMEEESEVRDSFDTIVY 261

Query: 259 NTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQ 318
           NTMIS+CGK NNW E ER+WR ++ NG + T +TY LLVS++VRC+Q+ELA+D Y +M+Q
Sbjct: 262 NTMISICGKVNNWRETERLWRHIKENGLTGTRVTYCLLVSIFVRCSQHELALDAYNEMIQ 321

Query: 319 NDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTL 378
           N  +P NDTM AIIGA S++G+WD AL +FQ ML  GL+PN+VAFN LIN+LGKA EV L
Sbjct: 322 NKFEPGNDTMHAIIGACSKDGKWDLALNIFQSMLDSGLKPNAVAFNALINSLGKAGEVEL 381

Query: 379 AFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTIL 438
           AF +YN M+ +GHSPD YTW ALLGALY+ANR++DA+RL+E +K  + +QLN H+YN  L
Sbjct: 382 AFRVYNIMRSLGHSPDAYTWNALLGALYRANRHDDALRLYESIKTSQGSQLNSHLYNMAL 441

Query: 439 LSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKL 498
           +SCSKLGLWD+AL++LW++E ASG+ VS +SYN+VISACE ARKPE+AL+VYE M+HQK 
Sbjct: 442 MSCSKLGLWDKALKLLWQLE-ASGQSVSTASYNLVISACEKARKPEVALQVYEHMVHQKC 501

Query: 499 TPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKM 558
           TPD FT LSLIR CIWGSLWDEVE +L+ +A D S+YNAAIQGMCLRGK +LAKK+YTKM
Sbjct: 502 TPDIFTYLSLIRGCIWGSLWDEVEEILNWAAPDMSLYNAAIQGMCLRGKIELAKKIYTKM 561

Query: 559 REIGIQPDGKTRALMLQTLP--KDRAGLRNRLASRFKKRHRH 596
           RE G+QPDGKTRA+MLQ L   K +   R + +S+F    R+
Sbjct: 562 RENGLQPDGKTRAMMLQNLQRRKKKQPPRYKTSSKFSYYRRN 602

BLAST of Cp4.1LG01g22400 vs. NCBI nr
Match: gi|595924587|ref|XP_007214950.1| (hypothetical protein PRUPE_ppa003127mg [Prunus persica])

HSP 1 Score: 617.5 bits (1591), Expect = 2.5e-173
Identity = 308/501 (61.48%), Positives = 391/501 (78.04%), Query Frame = 1

Query: 79  CEEDDAIRLVVDDGVEESSREWKS-PPWGEVKNQDEPIFQSEDVNQSE-VLEGEGLGSDR 138
           CEE++  ++V  +G  E+S   ++ PPWGE+   ++  F+ E   Q E  L+ +   +  
Sbjct: 82  CEEEEEDKVVQREGGYEASFVKQTLPPWGELAIDEDLDFEPEVPIQPESCLKRKASLNVN 141

Query: 139 KVYFLEETDEVMLSKRILILSRKNKVRSAMELFRSMHLAGLLPSFHASNSLLACLLRNGP 198
           +V FLEE DE  LSKRIL+LSR NK RSA+ELF SM L+GLLP+ HA NSLL+CLLRN  
Sbjct: 142 RVSFLEEMDEGTLSKRILVLSRTNKTRSALELFTSMELSGLLPNLHACNSLLSCLLRNEL 201

Query: 199 FDDGLRIFEFMKSNKLSTGHTYSLILKAVADTHGFLSALEMFRTWEHEYDLKQ-FDAIVY 258
            DDGLR+FEFMK  KL+TGHTYSLILKAV+   G  SA+EMF   E E +++  FD IVY
Sbjct: 202 LDDGLRVFEFMKRKKLATGHTYSLILKAVSVAEGCSSAIEMFVAMEEESEVRDSFDTIVY 261

Query: 259 NTMISVCGKENNWVEAERIWRLMEANGCSATHLTYSLLVSMYVRCNQNELAIDIYVKMVQ 318
           NTMIS+CGK NNW E ER+WR ++ NG + T +TY LLVS++VRC+Q+ELA+D Y +M+Q
Sbjct: 262 NTMISICGKVNNWRETERLWRHIKENGLTGTRVTYCLLVSIFVRCSQHELALDAYNEMIQ 321

Query: 319 NDLKPANDTMQAIIGASSREGRWDFALRVFQDMLKCGLEPNSVAFNTLINALGKANEVTL 378
           N  +P NDTM AIIGA S++G+WD AL +FQ ML  GL+PN+VA N LIN+LGKA EV L
Sbjct: 322 NKFEPGNDTMHAIIGACSKDGKWDLALNIFQSMLDSGLKPNAVALNALINSLGKAGEVEL 381

Query: 379 AFSIYNRMKPMGHSPDVYTWKALLGALYKANRYNDAIRLFEFVKREEKAQLNIHIYNTIL 438
           AF +YN MK +GHSPD YTW ALLGALY+ANR++DA+RL+E +K  + +QLN H+YN  L
Sbjct: 382 AFRVYNIMKSLGHSPDAYTWNALLGALYRANRHDDALRLYESIKTSQGSQLNSHLYNMAL 441

Query: 439 LSCSKLGLWDRALQILWEMEAASGRLVSASSYNIVISACEMARKPEIALRVYERMIHQKL 498
           +SCSKLGLWD+AL++LW++E ASG+ VS +SYN+V+SACE ARKP++AL+VYE M+HQK 
Sbjct: 442 MSCSKLGLWDKALKLLWQLE-ASGQSVSTASYNLVVSACEKARKPKVALQVYEHMVHQKC 501

Query: 499 TPDTFTLLSLIRSCIWGSLWDEVELLLSKSAHDASVYNAAIQGMCLRGKTDLAKKLYTKM 558
           TPD FT LSLIR CIWGSLWDEVE +L+ +A D S+YNAAIQGMCLRGK +LAKK+YTKM
Sbjct: 502 TPDIFTYLSLIRGCIWGSLWDEVEEILNWAAPDMSLYNAAIQGMCLRGKIELAKKIYTKM 561

Query: 559 REIGIQPDGKTRALMLQTLPK 577
           RE G+QPDGKTRA+MLQ L +
Sbjct: 562 RENGLQPDGKTRAMMLQNLQR 581

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
PP262_ARATH   6.7e-142   54.14         Pentatricopeptide repeat-containing protein At3g29290 OS=Arabidopsis thaliana GN...
PP124_ARATH   7.4e-32    27.04         Pentatricopeptide repeat-containing protein At1g74850, chloroplastic OS=Arabidop...
PP362_ARATH   1.1e-30    24.03         Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN...
PP217_ARATH   7.2e-27    23.99         Pentatricopeptide repeat-containing protein At3g06920 OS=Arabidopsis thaliana GN...
PPR37_ARATH   1.8e-25    24.83         Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana GN...
Match Name         E-value    Identity (%)  Description
A0A0A0KVV0_CUCSA   6.3e-256   75.70         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G642190 PE=4 SV=1
M5WR43_PRUPE       1.8e-173   61.48         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003127mg PE=4 SV=1
W9R9N1_9ROSA       2.5e-167   59.48         Uncharacterized protein OS=Morus notabilis GN=L484_018143 PE=4 SV=1
A0A061EF27_THECC   3.0e-165   58.02         Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma ca...
D7SXI6_VITVI       1.8e-162   58.17         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0108g01490 PE=4 SV=...
Match Name    E-value    Identity (%)  Description
AT3G29290.1   3.8e-143   54.14         Pentatricopeptide repeat (PPR) superfamily protein
AT1G74850.1   4.2e-33    27.04         plastid transcriptionally active 2
AT5G02860.1   6.0e-32    24.03         Pentatricopeptide repeat (PPR) superfamily protein
AT3G06920.1   4.0e-28    23.99         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G12620.1   9.9e-27    24.83         Pentatricopeptide repeat (PPR) superfamily protein
Match Name                         E-value    Identity (%)  Description
gi|659118986|ref|XP_008459413.1|   1.1e-258   75.33         PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cuc...
gi|449447683|ref|XP_004141597.1|   9.0e-256   75.70         PREDICTED: pentatricopeptide repeat-containing protein At3g29290 [Cucumis sativu...
gi|659118988|ref|XP_008459414.1|   1.4e-245   78.04         PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Cuc...
gi|645248486|ref|XP_008230319.1|   6.6e-174   59.96         PREDICTED: pentatricopeptide repeat-containing protein At3g29290 [Prunus mume]
gi|595924587|ref|XP_007214950.1|   2.5e-173   61.48         hypothetical protein PRUPE_ppa003127mg [Prunus persica]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
Vocabulary: INTERPRO
Term         Definition
IPR011990    TPR-like_helical_dom_sf
IPR002885    Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006487 protein N-linked glycosylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0005886 plasma membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g22400.1   Cp4.1LG01g22400.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                         Source     Source Term        Source Description
(alignments for each entry are listed beneath it)

IPR002885   Pentatricopeptide repeat                PFAM       PF01535            PPR
    coord: 327..353, score: 0.0054
    coord: 430..455, score: 0.0026
    coord: 185..211, score: 0.026
    coord: 466..492, score: 0.

IPR002885   Pentatricopeptide repeat                PFAM       PF13041            PPR_2
    coord: 251..297, score: 1.4E-8
    coord: 355..403, score: 6.0E-12
    coord: 528..574, score: 2.

IPR002885   Pentatricopeptide repeat                TIGRFAMs   TIGR00756          TIGR00756
    coord: 358..392, score: 5.0E-7
    coord: 466..499, score: 7.8E-6
    coord: 253..285, score: 1.9E-9
    coord: 327..356, score: 1.5E-5
    coord: 531..563, score: 7.3E-5
    coord: 185..212, score: 0.

IPR002885   Pentatricopeptide repeat                PROFILE    PS51375            PPR
    coord: 391..421, score: 8.638
    coord: 528..562, score: 11.608
    coord: 145..179, score: 8.177
    coord: 463..497, score: 10.293
    coord: 427..461, score: 9.219
    coord: 321..355, score: 10.041
    coord: 251..285, score: 11.466
    coord: 356..390, score: 11.005
    coord: 180..214, score: 8.364
    coord: 286..320, score: 9

IPR011990   Tetratricopeptide-like helical domain   GENE3D     G3DSA:1.25.40.10
    coord: 289..558, score: 1.8

None        No IPR available                        PANTHER    PTHR24015          FAMILY NOT NAMED
    coord: 37..63, score: 9.9E-141
    coord: 103..130, score: 9.9E-141
    coord: 156..577, score: 9.9E

None        No IPR available                        PANTHER    PTHR24015:SF700    SUBFAMILY NOT NAMED
    coord: 156..577, score: 9.9E-141
    coord: 103..130, score: 9.9E-141
    coord: 37..63, score: 9.9E
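
The alignment entries above are not listed in sequence order, which makes it hard to see how the repeats tile the protein. The following Python sketch pulls the coord/score pairs for the PS51375 (PROSITE profile) row out of the text, sorts them by start position, and reports the gap between consecutive hits. The names parse_hits and gaps_between are illustrative assumptions. Scores are kept as strings because the last one is cut off in the source.

```python
import re

# PS51375 alignment entries as they appear in the table (last score is truncated).
RAW_ROW = """coord: 391..421
score: 8.638coord: 528..562
score: 11.608coord: 145..179
score: 8.177coord: 463..497
score: 10.293coord: 427..461
score: 9.219coord: 321..355
score: 10.041coord: 251..285
score: 11.466coord: 356..390
score: 11.005coord: 180..214
score: 8.364coord: 286..320
score: 9"""

# Tolerates whitespace, a newline, or a comma between the two fields.
PAIR_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)\s*,?\s*score:\s*([0-9.Ee+-]+)")


def parse_hits(text):
    """Return (start, end, score_text) tuples sorted by start position."""
    hits = [(int(a), int(b), s) for a, b, s in PAIR_RE.findall(text)]
    return sorted(hits)


def gaps_between(hits):
    """Residues lying between each hit and the next one (0 means adjacent)."""
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(hits, hits[1:])]


hits = parse_hits(RAW_ROW)
gaps = gaps_between(hits)
for start, end, score in hits:
    print(f"{start:>4}..{end:<4} score {score}")
print("gaps:", gaps)
```

A gap of 0 means two profile hits sit back to back, which is what tandem repeats would be expected to look like. This is only a reading aid for the table and does not replace the InterPro analysis itself.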

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene              Organism                     Block
Cp4.1LG01g22400   Cucurbita pepo (Zucchini)    cpecpeB199
Cp4.1LG01g22400   Cucumber (Gy14) v1           cgycpeB0255
Cp4.1LG01g22400   Cucurbita maxima (Rimu)      cmacpeB317
Cp4.1LG01g22400   Cucurbita moschata (Rifu)    cmocpeB279
Cp4.1LG01g22400   Wild cucumber (PI 183967)    cpecpiB380
Cp4.1LG01g22400   Bottle gourd (USVL1VR-Ls)    cpelsiB330
Cp4.1LG01g22400   Melon (DHL92) v3.6.1         cpemedB408
Cp4.1LG01g22400   Silver-seed gourd            carcpeB0374
Cp4.1LG01g22400   Cucumber (Chinese Long) v3   cpecucB0472
Cp4.1LG01g22400   Wax gourd                    cpewgoB0502