Cp4.1LG12g03500 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG12g03500
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG12 : 2344586 .. 2347731 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AAATCCTACTCGAACTCATCCACTTCACTGCTTGAAGCTCTGCGGAGCGAACTCCATCATCCATGGCTGCCTTCTCCTTCTCCTTCTCCTCTTCGCTCCTTCCTTCATCAGTTTCTTTCGATCATGGCTCCTCTTCCTTCTTCCCCATCCTTATTTCATCTTCTTTCGACTCCAATTCTCGCGTTGTTCGCTGTACATTTGCAGCCCCCGCTAGAAAATCGCCTCCTCCTCCTTCTTCTTCTTCTTCTTCCTCACCGGATAAGAAGAGGCACTGGAAGCAAGGGGAATTTCCAGGTATTACGGAGACATCCGCCCCCAGGATGACCCCCGCTAGAAAATCACCTCCTCCTTCTTCTTCGTCGCCGGTTAAGAAGAAGCATTGGAAACAAGGGGAATTTCCAGGCGTTACAGAGAGATTCGACGCCAGGAAGACCCCTCTCAAGAATGTCAAGAAGAAATTAGATCGCAAAATGAACGCCAAAGCTTGGGCTAACACCGTTACTGAAACCTTGTCGGAACATATCACGAACAAACGGTGGATTCAAGCTCTTGAGGTCGCTTCCTCTTCCTTTGATTTCACTTTTTGTTTTGGGGTTTTTTGTTAAATTTGTTCTGCGATCCCTCTTTTTGTTACATTTGTTCATTGTTTTGATTGATTTGAGTTTAAATCGATGAGGAATTGCATAATTTTGTAATGTGCTTTGGAATTAGCTTTTTGTCCATTTACGAACAAAGAGAAAAATGCAGAGATTCATTACTTGCTGTTCAAAGTTGAATTGGAGGCATGCAAATTCGTCTTCTCTTGCCTTAATTTGAAGGAATGAAATGATAATTTGAAGAGCCTGCATTAACTTGTTCGTGTATTGTTGTCGAGGATTATTGGGAGTGAGTCCCACATTGGCTAATTTAGGGAATGATCATGGGTTTATAAGTGAGGAATACTATCTCCATTGGTATGAGGTCTTTTGGGAAAACCCAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTGGACAATATCATACCATTTTGGAGAGTCATGATTCCTAATATTGTTTTCATCTTTAACAACTATGGCTGCTTCAATTCAACCGTTCAGGTGTTTGAAATGCTCCGCAAACAGCCATTTTATGAACCAAAAGAAGGAACTTACATGAAGCTGCTTGTTCTTCTTGGAAGATCTGGCCAACCACAGCGTGCCCGTCTGCTTTTTGACACAATGGTGGAAGAAAAATGTGAACCCACCACTGAACTTTACACTGCTTTGCTTGCTGCTTATTGTCGTAACAATCTCATTGATGATGCATTTTCCATTCTTAATCTCATGAAAACCCTCCCTTGCTGCCAGCCTGATATCTACACTTATAGTATATTAATCAAGGCTTGTGTCGATTCTTCTCGCTTTGAATTAGTTGAATCTTTGTACGAGGAAATGGCCGAACGACTGATAACTCCGAACACGGTTACTCAGAATATAGTCTTAAGTGGATATGGCAAGATAGGGAAGTATGACCAGATGGAGAAAGTTCTTTTAGGAATGCTTGAGAGCACAACTTGTAGACCAGATGTGTGGACAATGAACATAATTCTTAGTGTGTTTGGTAACAAGGGTCAGATTGAAATGATGGAAAGATGGTACGAGAAGTTTCGCAACTTCGGTATCGAGCCAGAAACACGTACATTCAATATTTTGATTGGTGCTTATGGGAAGAAAAGGATGTATGATAAAATGTCTTCTGTGATGGAGTATATGCGCAAACTGCAGTTTCCTTGGACGACCTCGACATACAACAACGTTATTGAAGCTTTTGCAGATGTGGGAGATGCAAAGAATATGGAGTACACCTTCACGCAGATGCGTGCTGAGGGCATGAAAGCAGACACCAAGACATTCTGTTGCCTTATAAATGGATATTCCAATGCAGGTCTCTTCCATAAGGTGATTGGCTCAGTTAAGTTGGCAGGAAAGTGTGAAATTCCAGAGAATACCTCATTTTATAATGCTGTTATATCTGCTTGTGCAAAGGCTGAGGATCTGATGGAAATGGATAGAGTGTTTAAGCGAATGAAAGATAAACATTGTCAGCCAGATAGCAAAACATATTCTATCATGATAGAAGCATATGAAAAGGAGGCCATGAATGACAGAGTTCATTACTTGGAACTGGAAAGGAAGCAGGTGATCGATCACGCTCCTACTAACGAGTGACTCGGAAATGGAATAGAGTGAAAAATTGGTGCACGTTTGAGAAGTTGGACGCTACACTATGCCTGTTCCACCAAGGGCTCTAGCTACATTAGATACTTCTCAGAAAATCAAATCCGCGCTTCCAGCCCTTCCAGCTCTGTGCCAAGTGTTTCACCATAGCGTTTTGATCATTTGAAATGATATGAAACCAGGCATCTCAATCATGCAATATTAATGCCACGTTAAGAAGAATTAAGTTTTGTTTATGCAACAAAGGTTAAGGGTTGGAATGTCTCATGGAAGGTTAAGGGTTTTCATTTTGAGGCTATATAAAGTCATGTATCTTATCTTTTGTAGGAGAGACTTGGAAAATGTAGTAAAATGAGCAATTTGTGCTTTCCTTTAGCCAATGGCTAAGTTTCTTTGCAATTCTGTGTAGTTTTGGTTGCATGTAGAATCATTCAAGCTTGACATGATCAATCTTGCTCGTGGAGTGATTCGAATCTCAAACAAGTGTTCTTGTCTTGAGATATTTGATCAACAAGGTAATCTGAATTTTACTTTCTCTTGCAGTGATTCTTTATTATTACAGCCAAGTGTTCTTACCTTTCCGTTGCATTTATGTATAGATTAGATTGCATGTTTAGAATCATTCAAGCTAGATATAATCGATCTTGCTTGTTGAATGATTTGAATCTCAAACAAATGTTTTTGTCTTGAAATATTCGATCAACAAAGTAATCTCAATCTTACTTCCCTTAGGTTGATTTCTTATCATTTGATATCAGAATTGTTTCTTAGGTTGATTCCTTATCATCCTCCTTCGTGTGGGAAGATTGCAGCTAACTGGTTTTTATGTCTAGAGCTCTTGTCACAGTGAAGTTGATCAGTATTTTGTAGTTATAATAGTTTTAGGCGAACTTGCTCGTTGTTCGGGACAAAACGGCGAACAGGGAAAAA

mRNA sequence

AAATCCTACTCGAACTCATCCACTTCACTGCTTGAAGCTCTGCGGAGCGAACTCCATCATCCATGGCTGCCTTCTCCTTCTCCTTCTCCTCTTCGCTCCTTCCTTCATCAGTTTCTTTCGATCATGGCTCCTCTTCCTTCTTCCCCATCCTTATTTCATCTTCTTTCGACTCCAATTCTCGCGTTGTTCGCTGTACATTTGCAGCCCCCGCTAGAAAATCGCCTCCTCCTCCTTCTTCTTCTTCTTCTTCCTCACCGGATAAGAAGAGGCACTGGAAGCAAGGGGAATTTCCAGGTATTACGGAGACATCCGCCCCCAGGATGACCCCCGCTAGAAAATCACCTCCTCCTTCTTCTTCGTCGCCGGTTAAGAAGAAGCATTGGAAACAAGGGGAATTTCCAGGCGTTACAGAGAGATTCGACGCCAGGAAGACCCCTCTCAAGAATGTCAAGAAGAAATTAGATCGCAAAATGAACGCCAAAGCTTGGGCTAACACCGTTACTGAAACCTTGTCGGAACATATCACGAACAAACGGTGGATTCAAGCTCTTGAGCTGCTTGTTCTTCTTGGAAGATCTGGCCAACCACAGCGTGCCCGTCTGCTTTTTGACACAATGGTGGAAGAAAAATGTGAACCCACCACTGAACTTTACACTGCTTTGCTTGCTGCTTATTGTCGTAACAATCTCATTGATGATGCATTTTCCATTCTTAATCTCATGAAAACCCTCCCTTGCTGCCAGCCTGATATCTACACTTATAGTATATTAATCAAGGCTTGTGTCGATTCTTCTCGCTTTGAATTAGTTGAATCTTTGTACGAGGAAATGGCCGAACGACTGATAACTCCGAACACGGTTACTCAGAATATAGTCTTAAGTGGATATGGCAAGATAGGGAAGTATGACCAGATGGAGAAAGTTCTTTTAGGAATGCTTGAGAGCACAACTTGTAGACCAGATGTGTGGACAATGAACATAATTCTTAGTGTGTTTGGTAACAAGGGTCAGATTGAAATGATGGAAAGATGGTACGAGAAGTTTCGCAACTTCGGTATCGAGCCAGAAACACGTACATTCAATATTTTGATTGGTGCTTATGGGAAGAAAAGGATGTATGATAAAATGTCTTCTGTGATGGAGTATATGCGCAAACTGCAGTTTCCTTGGACGACCTCGACATACAACAACGTTATTGAAGCTTTTGCAGATGTGGGAGATGCAAAGAATATGGAGTACACCTTCACGCAGATGCGTGCTGAGGGCATGAAAGCAGACACCAAGACATTCTGTTGCCTTATAAATGGATATTCCAATGCAGGTCTCTTCCATAAGGTGATTGGCTCAGTTAAGTTGGCAGGAAAGTGTGAAATTCCAGAGAATACCTCATTTTATAATGCTGTTATATCTGCTTGTGCAAAGGCTGAGGATCTGATGGAAATGGATAGAGTGTTTAAGCGAATGAAAGATAAACATTGTCAGCCAGATAGCAAAACATATTCTATCATGATAGAAGCATATGAAAAGGAGGCCATGAATGACAGAGTTCATTACTTGGAACTGGAAAGGAAGCAGGCGAACTTGCTCGTTGTTCGGGACAAAACGGCGAACAGGGAAAAA

Coding sequence (CDS)

ATGGCTGCCTTCTCCTTCTCCTTCTCCTCTTCGCTCCTTCCTTCATCAGTTTCTTTCGATCATGGCTCCTCTTCCTTCTTCCCCATCCTTATTTCATCTTCTTTCGACTCCAATTCTCGCGTTGTTCGCTGTACATTTGCAGCCCCCGCTAGAAAATCGCCTCCTCCTCCTTCTTCTTCTTCTTCTTCCTCACCGGATAAGAAGAGGCACTGGAAGCAAGGGGAATTTCCAGGTATTACGGAGACATCCGCCCCCAGGATGACCCCCGCTAGAAAATCACCTCCTCCTTCTTCTTCGTCGCCGGTTAAGAAGAAGCATTGGAAACAAGGGGAATTTCCAGGCGTTACAGAGAGATTCGACGCCAGGAAGACCCCTCTCAAGAATGTCAAGAAGAAATTAGATCGCAAAATGAACGCCAAAGCTTGGGCTAACACCGTTACTGAAACCTTGTCGGAACATATCACGAACAAACGGTGGATTCAAGCTCTTGAGCTGCTTGTTCTTCTTGGAAGATCTGGCCAACCACAGCGTGCCCGTCTGCTTTTTGACACAATGGTGGAAGAAAAATGTGAACCCACCACTGAACTTTACACTGCTTTGCTTGCTGCTTATTGTCGTAACAATCTCATTGATGATGCATTTTCCATTCTTAATCTCATGAAAACCCTCCCTTGCTGCCAGCCTGATATCTACACTTATAGTATATTAATCAAGGCTTGTGTCGATTCTTCTCGCTTTGAATTAGTTGAATCTTTGTACGAGGAAATGGCCGAACGACTGATAACTCCGAACACGGTTACTCAGAATATAGTCTTAAGTGGATATGGCAAGATAGGGAAGTATGACCAGATGGAGAAAGTTCTTTTAGGAATGCTTGAGAGCACAACTTGTAGACCAGATGTGTGGACAATGAACATAATTCTTAGTGTGTTTGGTAACAAGGGTCAGATTGAAATGATGGAAAGATGGTACGAGAAGTTTCGCAACTTCGGTATCGAGCCAGAAACACGTACATTCAATATTTTGATTGGTGCTTATGGGAAGAAAAGGATGTATGATAAAATGTCTTCTGTGATGGAGTATATGCGCAAACTGCAGTTTCCTTGGACGACCTCGACATACAACAACGTTATTGAAGCTTTTGCAGATGTGGGAGATGCAAAGAATATGGAGTACACCTTCACGCAGATGCGTGCTGAGGGCATGAAAGCAGACACCAAGACATTCTGTTGCCTTATAAATGGATATTCCAATGCAGGTCTCTTCCATAAGGTGATTGGCTCAGTTAAGTTGGCAGGAAAGTGTGAAATTCCAGAGAATACCTCATTTTATAATGCTGTTATATCTGCTTGTGCAAAGGCTGAGGATCTGATGGAAATGGATAGAGTGTTTAAGCGAATGAAAGATAAACATTGTCAGCCAGATAGCAAAACATATTCTATCATGATAGAAGCATATGAAAAGGAGGCCATGAATGACAGAGTTCATTACTTGGAACTGGAAAGGAAGCAGGCGAACTTGCTCGTTGTTCGGGACAAAACGGCGAACAGGGAAAAA

Protein sequence

MAAFSFSFSSSLLPSSVSFDHGSSSFFPILISSSFDSNSRVVRCTFAAPARKSPPPPSSSSSSSPDKKRHWKQGEFPGITETSAPRMTPARKSPPPSSSSPVKKKHWKQGEFPGVTERFDARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALELLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELERKQANLLVVRDKTANREK
BLAST of Cp4.1LG12g03500 vs. Swiss-Prot
Match: PP216_ARATH (Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidopsis thaliana GN=EMB2750 PE=2 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 1.1e-180
Identity = 321/467 (68.74%), Postives = 377/467 (80.73%), Query Frame = 1

Query: 58  SSSSSSSPDKKRHWKQGEFPGITETSAPRMTPARKSPPPSSSSPVKKKHWKQGEFPGVTE 117
           S  SS  P+ KR ++  +  GI       +  A KS P    S  KK+ WK GEFPG+TE
Sbjct: 11  SLCSSRIPEGKRRFRHRDV-GIVRC----VLAASKSSP---GSVTKKRLWKDGEFPGITE 70

Query: 118 RFDARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE------------- 177
             + R+TP+KNVKKKLDR+  A  W NTVTETLS+ I  K+W+QALE             
Sbjct: 71  PVNQRRTPIKNVKKKLDRRSKANGWVNTVTETLSDLIAKKQWLQALEVFDMLREQTFYQP 130

Query: 178 -------LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSIL 237
                  LLVLLG+SGQP RA+ LFD M+EE  EPT ELYTALLAAY R+NLIDDAFSIL
Sbjct: 131 KEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLEPTVELYTALLAAYTRSNLIDDAFSIL 190

Query: 238 NLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGK 297
           + MK+ P CQPD++TYS L+KACVD+S+F+LV+SLY+EM ERLITPNTVTQNIVLSGYG+
Sbjct: 191 DKMKSFPQCQPDVFTYSTLLKACVDASQFDLVDSLYKEMDERLITPNTVTQNIVLSGYGR 250

Query: 298 IGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETR 357
           +G++DQMEKVL  ML ST C+PDVWTMNIILSVFGN G+I+MME WYEKFRNFGIEPETR
Sbjct: 251 VGRFDQMEKVLSDMLVSTACKPDVWTMNIILSVFGNMGKIDMMESWYEKFRNFGIEPETR 310

Query: 358 TFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQM 417
           TFNILIG+YGKKRMYDKMSSVMEYMRKL+FPWTTSTYNN+IEAFADVGDAKNME TF QM
Sbjct: 311 TFNILIGSYGKKRMYDKMSSVMEYMRKLEFPWTTSTYNNIIEAFADVGDAKNMELTFDQM 370

Query: 418 RAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDL 477
           R+EGMKADTKTFCCLINGY+NAGLFHKVI SV+LA K EIPENT+FYNAVISACAKA+DL
Sbjct: 371 RSEGMKADTKTFCCLINGYANAGLFHKVISSVQLAAKFEIPENTAFYNAVISACAKADDL 430

Query: 478 MEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELERKQ 505
           +EM+RV+ RMK++ C  DS+T+ IM+EAYEKE MND+++YLE ER++
Sbjct: 431 IEMERVYIRMKERQCVCDSRTFEIMVEAYEKEGMNDKIYYLEQERQK 469

BLAST of Cp4.1LG12g03500 vs. Swiss-Prot
Match: PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 290.8 bits (743), Expect = 2.9e-77
Identity = 159/397 (40.05%), Postives = 222/397 (55.92%), Query Frame = 1

Query: 122 RKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE----------------- 181
           R+   K++ +K  +K + K    TV E+L E IT  RW  A++                 
Sbjct: 95  RREATKSIIEK--KKGSKKLLPRTVLESLHERITALRWESAIQVFELLREQLWYKPNVGI 154

Query: 182 ---LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMK 241
              L+V+LG+  QP++A  LF  M+ E C    E+YTAL++AY R+   D AF++L  MK
Sbjct: 155 YVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMK 214

Query: 242 TLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKY 301
           +   CQPD++TYSILIK+ +    F+ V+ L  +M  + I PNT+T N ++  YGK   +
Sbjct: 215 SSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMF 274

Query: 302 DQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNI 361
            +ME  L+ ML    C+PD WTMN  L  FG  GQIEMME  YEKF++ GIEP  RTFNI
Sbjct: 275 VEMESTLIQMLGEDDCKPDSWTMNSTLRAFGGNGQIEMMENCYEKFQSSGIEPNIRTFNI 334

Query: 362 LIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEG 421
           L+ +YGK   Y KMS+VMEYM+K  + WT  TYN VI+AF   GD K MEY F  M++E 
Sbjct: 335 LLDSYGKSGNYKKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGDLKQMEYLFRLMQSER 394

Query: 422 MKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEMD 481
           +     T C L+  Y  A    K+ G ++     +I  +  F+N ++ A  + E   EM 
Sbjct: 395 IFPSCVTLCSLVRAYGRASKADKIGGVLRFIENSDIRLDLVFFNCLVDAYGRMEKFAEMK 454

Query: 482 RVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYL 499
            V + M+ K  +PD  TY  M++AY    M   V  L
Sbjct: 455 GVLELMEKKGFKPDKITYRTMVKAYRISGMTTHVKEL 489

BLAST of Cp4.1LG12g03500 vs. Swiss-Prot
Match: PP279_ARATH (Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN=At3g53170 PE=3 SV=1)

HSP 1 Score: 284.6 bits (727), Expect = 2.1e-75
Identity = 145/394 (36.80%), Postives = 229/394 (58.12%), Query Frame = 1

Query: 126 LKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALELLVLL---------------- 185
           +K +++K + +     W   V E L E I   RW  AL++  LL                
Sbjct: 41  VKGIERKANSEKYLTLWPKAVLEALDEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKL 100

Query: 186 ----GRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMKTLPC 245
               G   QP +A LLF+ M+ E  +PT ++YT+L++ Y ++ L+D AFS L  MK++  
Sbjct: 101 FKVLGNCKQPDQASLLFEVMLSEGLKPTIDVYTSLISVYGKSELLDKAFSTLEYMKSVSD 160

Query: 246 CQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKYDQME 305
           C+PD++T+++LI  C    RF+LV+S+  EM+   +  +TVT N ++ GYGK G +++ME
Sbjct: 161 CKPDVFTFTVLISCCCKLGRFDLVKSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEME 220

Query: 306 KVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNILIGA 365
            VL  M+E     PDV T+N I+  +GN   +  ME WY +F+  G++P+  TFNILI +
Sbjct: 221 SVLADMIEDGDSLPDVCTLNSIIGSYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILS 280

Query: 366 YGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEGMKAD 425
           +GK  MY KM SVM++M K  F  TT TYN VIE F   G  + M+  F +M+ +G+K +
Sbjct: 281 FGKAGMYKKMCSVMDFMEKRFFSLTTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPN 340

Query: 426 TKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEMDRVFK 485
           + T+C L+N YS AGL  K+   ++     ++  +T F+N +I+A  +A DL  M  ++ 
Sbjct: 341 SITYCSLVNAYSKAGLVVKIDSVLRQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYI 400

Query: 486 RMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLE 500
           +M+++ C+PD  T++ MI+ Y    + D V  LE
Sbjct: 401 QMEERKCKPDKITFATMIKTYTAHGIFDAVQELE 434

BLAST of Cp4.1LG12g03500 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 4.7e-35
Identity = 95/332 (28.61%), Postives = 156/332 (46.99%), Query Frame = 1

Query: 165 LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMKTLP 224
           LL + G+S +P+ A  + + MV     P+   Y +L++AY R+ ++D+A  + N M    
Sbjct: 320 LLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKG 379

Query: 225 CCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKYDQM 284
             +PD++TY+ L+     + + E   S++EEM      PN  T N  +  YG  GK+ +M
Sbjct: 380 T-KPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEM 439

Query: 285 EKVLLGMLESTTC--RPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNIL 344
            K+     E   C   PD+ T N +L+VFG  G    +   +++ +  G  PE  TFN L
Sbjct: 440 MKIFD---EINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTL 499

Query: 345 IGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEGM 404
           I AY +   +++  +V   M         STYN V+ A A  G  +  E    +M     
Sbjct: 500 ISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRC 559

Query: 405 KADTKTFCCLINGYSNA---GLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLME 464
           K +  T+C L++ Y+N    GL H +   V       I         ++  C+K + L E
Sbjct: 560 KPNELTYCSLLHAYANGKEIGLMHSLAEEVYSG---VIEPRAVLLKTLVLVCSKCDLLPE 619

Query: 465 MDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAM 492
            +R F  +K++   PD  T + M+  Y +  M
Sbjct: 620 AERAFSELKERGFSPDITTLNSMVSIYGRRQM 644

BLAST of Cp4.1LG12g03500 vs. Swiss-Prot
Match: PP358_ARATH (Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidopsis thaliana GN=EMB2453 PE=2 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 4.4e-33
Identity = 85/327 (25.99%), Postives = 153/327 (46.79%), Query Frame = 1

Query: 164 ELLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNN----LIDDAFSILNL 223
           +L+ ++G+ GQ + A  LF  M    C P   +Y AL+ A+         ++     L+ 
Sbjct: 138 KLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHTRDKAKALEKVRGYLDK 197

Query: 224 MKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIG 283
           MK +  CQP++ TY+IL++A   S + + V +L++++    ++P+  T N V+  YGK G
Sbjct: 198 MKGIERCQPNVVTYNILLRAFAQSGKVDQVNALFKDLDMSPVSPDVYTFNGVMDAYGKNG 257

Query: 284 KYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTF 343
              +ME VL  M  S  C+PD+ T N+++  +G K + E ME+ ++       +P   TF
Sbjct: 258 MIKEMEAVLTRM-RSNECKPDIITFNVLIDSYGKKQEFEKMEQTFKSLMRSKEKPTLPTF 317

Query: 344 NILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRA 403
           N +I  YGK RM DK   V + M  + +  +  TY  +I  +   G        F ++  
Sbjct: 318 NSMIINYGKARMIDKAEWVFKKMNDMNYIPSFITYECMIMMYGYCGSVSRAREIFEEVGE 377

Query: 404 EGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLME 463
                   T   ++  Y   GL+ +       A    +  + S Y  +  A  KA+   +
Sbjct: 378 SDRVLKASTLNAMLEVYCRNGLYIEADKLFHNASAFRVHPDASTYKFLYKAYTKADMKEQ 437

Query: 464 MDRVFKRMKDKHCQPDSKTYSIMIEAY 487
           +  + K+M+     P+ + +   +E +
Sbjct: 438 VQILMKKMEKDGIVPNKRFFLEALEVF 463

BLAST of Cp4.1LG12g03500 vs. TrEMBL
Match: A0A0A0K7C8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G000680 PE=4 SV=1)

HSP 1 Score: 787.7 bits (2033), Expect = 8.5e-225
Identity = 418/538 (77.70%), Postives = 437/538 (81.23%), Query Frame = 1

Query: 5   SFSFSSSLLPSSVSFDHGSSSFFPILISSSFDS----NSRVVRCTFAAPARKSPPPPSSS 64
           SFSFSSSLLPSSVSFDH S+  FPI +SSS  S    NSR+VRC FAAP RKSP P   S
Sbjct: 3   SFSFSSSLLPSSVSFDHVSTFLFPIFVSSSCSSSSHPNSRIVRCAFAAPTRKSPVP---S 62

Query: 65  SSSSPDKKRHWKQGEFPGITETSAPRMTPARKSPPPSSSSPVKKKHWKQGEFPGVTERFD 124
           +SSSP KKRHWKQGEFPG TETS  R  P ++         VKKK               
Sbjct: 63  TSSSPAKKRHWKQGEFPGTTETSTRRRAPLKR---------VKKKL-------------- 122

Query: 125 ARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE---------------- 184
                        DRK NAKAWANTVTE LS+HITNKRW+QALE                
Sbjct: 123 -------------DRKNNAKAWANTVTEALSDHITNKRWLQALEVFEMLREQPFYEPKEG 182

Query: 185 ----LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLM 244
               LLVLLGRSGQP RARLLFDTMV+E+CEPT ELYTALLAAYCRNNLIDDAFS LNLM
Sbjct: 183 TYMKLLVLLGRSGQPHRARLLFDTMVQERCEPTPELYTALLAAYCRNNLIDDAFSTLNLM 242

Query: 245 KTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGK 304
           KTLP CQPD+YTYSILIKACVD SRFE+VESLYEEMAERLITPNTVTQNIVLSGYGKIGK
Sbjct: 243 KTLPRCQPDVYTYSILIKACVDDSRFEIVESLYEEMAERLITPNTVTQNIVLSGYGKIGK 302

Query: 305 YDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFN 364
           YDQMEKVL+GMLESTTCRPDVWTMNIILSVFGNKG IEMMERWYEKFRNFGIEPETRTFN
Sbjct: 303 YDQMEKVLIGMLESTTCRPDVWTMNIILSVFGNKGHIEMMERWYEKFRNFGIEPETRTFN 362

Query: 365 ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAE 424
           ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTF QMRAE
Sbjct: 363 ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFEQMRAE 422

Query: 425 GMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEM 484
           GM+ADTKTFCCLINGY+NAGLFHKVIGSVKLAGK EIPENTSFYNAVISACAKAEDLMEM
Sbjct: 423 GMRADTKTFCCLINGYANAGLFHKVIGSVKLAGKLEIPENTSFYNAVISACAKAEDLMEM 482

Query: 485 DRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELERKQANLLVVRDKTANRE 519
           DRVFKRMKDKHCQPD+KTYSIM+EAY KE MNDRVHYLELE+KQ     V D  +N E
Sbjct: 483 DRVFKRMKDKHCQPDNKTYSIMMEAYGKEGMNDRVHYLELEKKQ-----VIDNASNNE 496

BLAST of Cp4.1LG12g03500 vs. TrEMBL
Match: A0A067K848_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_19079 PE=4 SV=1)

HSP 1 Score: 684.9 bits (1766), Expect = 7.8e-194
Identity = 344/478 (71.97%), Postives = 388/478 (81.17%), Query Frame = 1

Query: 45   TFAAPARKSPPPPSSSSSSSPDKKRHWKQGEFPGITETSAPRMTPARKSPPPSSSSPVKK 104
            TFA   RK P   +S SSS   +K+HWK+GEFPGITET APR  P + + P SS    +K
Sbjct: 774  TFAP--RKQPIKNTSPSSS---EKKHWKKGEFPGITETFAPRKQPIKNTSPSSS----EK 833

Query: 105  KHWKQGEFPGVTERFDARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE 164
            KHWK+GEFPG+TE F  RK P+KN+KKKLD+K  AKAW NTVTE LS+ I  K+W+QALE
Sbjct: 834  KHWKKGEFPGITETFAPRKQPIKNIKKKLDKKSKAKAWVNTVTEALSDRIAKKQWLQALE 893

Query: 165  --------------------LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAY 224
                                LLVLLGR GQ  RA  LFD M++E+ EPT ELYTALLAAY
Sbjct: 894  VFEMLKVQPFYQPKEGTYMKLLVLLGRCGQAHRAHQLFDEMIQERLEPTPELYTALLAAY 953

Query: 225  CRNNLIDDAFSILNLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPN 284
            CRNNLID+ FS+L  M++LP C PD+YTYS L+KACVD+SRFELVE+LY EM ERLITPN
Sbjct: 954  CRNNLIDEGFSVLRQMQSLPRCLPDVYTYSTLLKACVDASRFELVETLYHEMDERLITPN 1013

Query: 285  TVTQNIVLSGYGKIGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWY 344
            TVTQNIVLSGYGK G YDQMEKVL  ML+S  C+PDVWTMNI+LSVFGNKGQI+ MERWY
Sbjct: 1014 TVTQNIVLSGYGKAGMYDQMEKVLSAMLDSKECKPDVWTMNIVLSVFGNKGQIDSMERWY 1073

Query: 345  EKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADV 404
            EKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTST+NNVIE FADV
Sbjct: 1074 EKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTFNNVIEVFADV 1133

Query: 405  GDAKNMEYTFTQMRAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFY 464
            GDAK+MEYT  QMRAEGMKADTKTFCCLINGY+NAGLFHKVI +V+LA K EIPEN +FY
Sbjct: 1134 GDAKHMEYTLDQMRAEGMKADTKTFCCLINGYANAGLFHKVISTVQLAAKFEIPENITFY 1193

Query: 465  NAVISACAKAEDLMEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELER 503
            NAVISACAKA+DL EM+RVF RMKD  CQPDS+TYSIM+EAY KE MND+++YLE E+
Sbjct: 1194 NAVISACAKADDLKEMERVFARMKDNQCQPDSRTYSIMVEAYRKEGMNDKIYYLEQEK 1242

BLAST of Cp4.1LG12g03500 vs. TrEMBL
Match: M5XJ12_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa020554mg PE=4 SV=1)

HSP 1 Score: 677.9 bits (1748), Expect = 9.5e-192
Identity = 333/428 (77.80%), Postives = 367/428 (85.75%), Query Frame = 1

Query: 97  SSSSPVKKKHWKQGEFPGVTERF---DARKTPLKNVKKKLDRKMNAKAWANTVTETLSEH 156
           SSSS  +K+HWKQGEFPGV+E       R+ PLKNVKKKLDRK NAKAWANTVTE LS+ 
Sbjct: 54  SSSSETRKRHWKQGEFPGVSETSIPGTYRRAPLKNVKKKLDRKNNAKAWANTVTEALSDA 113

Query: 157 ITNKRWIQALE--------------------LLVLLGRSGQPQRARLLFDTMVEEKCEPT 216
           I  K+W+QALE                    LL LLGR GQP RAR LFD MVEE CEPT
Sbjct: 114 IDKKQWLQALEVFDMLREQPFYQPKEGTYMKLLGLLGRCGQPHRARQLFDAMVEEGCEPT 173

Query: 217 TELYTALLAAYCRNNLIDDAFSILNLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLY 276
            ELYTALL AYCRNNLID+AFS+LN MKTLP CQPD++TYS LIK C+D+ +FELVESLY
Sbjct: 174 LELYTALLTAYCRNNLIDEAFSVLNQMKTLPHCQPDVFTYSTLIKVCIDALKFELVESLY 233

Query: 277 EEMAERLITPNTVTQNIVLSGYGKIGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGN 336
           EEMAERLITPNTVTQNIVLSGYGK GKYDQMEKVL GMLE  TC+PDVWTMN+ILSVFGN
Sbjct: 234 EEMAERLITPNTVTQNIVLSGYGKAGKYDQMEKVLSGMLEGATCKPDVWTMNVILSVFGN 293

Query: 337 KGQIEMMERWYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTST 396
           KGQI+MMERWYEKFR+FGIEPETRTFNILIGAYGKK++YDKMS+VMEYMRKLQFPWTT+T
Sbjct: 294 KGQIDMMERWYEKFRDFGIEPETRTFNILIGAYGKKKLYDKMSTVMEYMRKLQFPWTTAT 353

Query: 397 YNNVIEAFADVGDAKNMEYTFTQMRAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAG 456
           YNNVIEAFADVGDAKNMEYTF QMRAEGMKADTKTFCCLINGY+NAGLFHKV+ SV+LAG
Sbjct: 354 YNNVIEAFADVGDAKNMEYTFDQMRAEGMKADTKTFCCLINGYANAGLFHKVVSSVQLAG 413

Query: 457 KCEIPENTSFYNAVISACAKAEDLMEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMND 502
           K EIPENT+FYNAVI+ACAKAEDLMEM+RVFKRMK+K C PDS TYS+M+EAY KE MND
Sbjct: 414 KFEIPENTTFYNAVIAACAKAEDLMEMERVFKRMKEKQCPPDSTTYSLMVEAYSKEGMND 473

BLAST of Cp4.1LG12g03500 vs. TrEMBL
Match: A5BHU7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025315 PE=4 SV=1)

HSP 1 Score: 669.1 bits (1725), Expect = 4.4e-189
Identity = 344/487 (70.64%), Postives = 393/487 (80.70%), Query Frame = 1

Query: 42  VRCTFAAPARKSPPPPSSSSSSSPDKKRHWKQGEFPGITETS----APRMTPARKSPPPS 101
           +  +F++    SP     + ++S  K R    G   G+   +    A R T +  S   S
Sbjct: 4   ISLSFSSSLLPSPLLQDKNKATSATKPRD-HHGSLCGVVRCAFASPARRKTSSTPSSSSS 63

Query: 102 SSSPVKKKHWKQGEFPGVTERFDARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNK 161
           SS+  KK+ WKQGEFPG +    +RKTP+KN+KKKLDRK +AKAWANTV E LS+ +  K
Sbjct: 64  SSAVGKKRLWKQGEFPGTSAEGRSRKTPIKNIKKKLDRKNDAKAWANTVAEALSDLVLKK 123

Query: 162 RWIQALE--------------------LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELY 221
           +W+QALE                    LLVLLG+SGQP RA  LFDTMVEE CEPTTELY
Sbjct: 124 QWLQALEVFEMLREQPFYQPKEGTYMKLLVLLGKSGQPLRAHELFDTMVEEGCEPTTELY 183

Query: 222 TALLAAYCRNNLIDDAFSILNLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMA 281
           TALLA+YCR+NLID+AFSILN MKTLP CQPD++TYS L+KACVD+SRFELVESLYEEM 
Sbjct: 184 TALLASYCRSNLIDEAFSILNQMKTLPRCQPDVFTYSTLLKACVDASRFELVESLYEEMD 243

Query: 282 ERLITPNTVTQNIVLSGYGKIGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQI 341
            R ITPNTVTQNIVLSGYGK GK+D+MEKVL GMLEST+ +PDVWTMN ILS+FGNKGQI
Sbjct: 244 VRSITPNTVTQNIVLSGYGKAGKFDEMEKVLSGMLESTSSKPDVWTMNTILSLFGNKGQI 303

Query: 342 EMMERWYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNV 401
           E+ME+WYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNV
Sbjct: 304 EIMEKWYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNV 363

Query: 402 IEAFADVGDAKNMEYTFTQMRAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEI 461
           IEAF+DVGDAKNMEYTF QMRAEGMKADTKTFCCLI GY+NAGLFHKV+ SV+LAGK EI
Sbjct: 364 IEAFSDVGDAKNMEYTFDQMRAEGMKADTKTFCCLIRGYANAGLFHKVVSSVQLAGKFEI 423

Query: 462 PENTSFYNAVISACAKAEDLMEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHY 505
           PENTSFYNAVISACAKAEDL+EM+RVF RMKDKHCQPDS TYSIM+EAY+KE MND+++ 
Sbjct: 424 PENTSFYNAVISACAKAEDLIEMERVFNRMKDKHCQPDSTTYSIMVEAYKKEGMNDKIYD 483

BLAST of Cp4.1LG12g03500 vs. TrEMBL
Match: F6HE37_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g00690 PE=4 SV=1)

HSP 1 Score: 669.1 bits (1725), Expect = 4.4e-189
Identity = 344/487 (70.64%), Postives = 393/487 (80.70%), Query Frame = 1

Query: 42  VRCTFAAPARKSPPPPSSSSSSSPDKKRHWKQGEFPGITETS----APRMTPARKSPPPS 101
           +  +F++    SP     + ++S  K R    G   G+   +    A R T +  S   S
Sbjct: 4   ISLSFSSSLLPSPLLQDKNKATSATKPRD-HHGSLCGVVRCAFASPARRKTSSTPSSSSS 63

Query: 102 SSSPVKKKHWKQGEFPGVTERFDARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNK 161
           SS+  KK+ WKQGEFPG +    +RKTP+KN+KKKLDRK +AKAWANTV E LS+ +  K
Sbjct: 64  SSAVGKKRLWKQGEFPGTSAEGRSRKTPIKNIKKKLDRKNDAKAWANTVAEALSDLVLKK 123

Query: 162 RWIQALE--------------------LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELY 221
           +W+QALE                    LLVLLG+SGQP RA  LFDTMVEE CEPTTELY
Sbjct: 124 QWLQALEVFEMLREQPFYQPKEGTYMKLLVLLGKSGQPLRAHELFDTMVEEGCEPTTELY 183

Query: 222 TALLAAYCRNNLIDDAFSILNLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMA 281
           TALLA+YCR+NLID+AFSILN MKTLP CQPD++TYS L+KACVD+SRFELVESLYEEM 
Sbjct: 184 TALLASYCRSNLIDEAFSILNQMKTLPRCQPDVFTYSTLLKACVDASRFELVESLYEEMD 243

Query: 282 ERLITPNTVTQNIVLSGYGKIGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQI 341
            R ITPNTVTQNIVLSGYGK GK+D+MEKVL GMLEST+ +PDVWTMN ILS+FGNKGQI
Sbjct: 244 VRSITPNTVTQNIVLSGYGKAGKFDEMEKVLSGMLESTSSKPDVWTMNTILSLFGNKGQI 303

Query: 342 EMMERWYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNV 401
           E+ME+WYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNV
Sbjct: 304 EIMEKWYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNV 363

Query: 402 IEAFADVGDAKNMEYTFTQMRAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEI 461
           IEAF+DVGDAKNMEYTF QMRAEGMKADTKTFCCLI GY+NAGLFHKV+ SV+LAGK EI
Sbjct: 364 IEAFSDVGDAKNMEYTFDQMRAEGMKADTKTFCCLIRGYANAGLFHKVVSSVQLAGKFEI 423

Query: 462 PENTSFYNAVISACAKAEDLMEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHY 505
           PENTSFYNAVISACAKAEDL+EM+RVF RMKDKHCQPDS TYSIM+EAY+KE MND+++ 
Sbjct: 424 PENTSFYNAVISACAKAEDLIEMERVFNRMKDKHCQPDSTTYSIMVEAYKKEGMNDKIYD 483

BLAST of Cp4.1LG12g03500 vs. TAIR10
Match: AT3G06430.1 (AT3G06430.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 634.4 bits (1635), Expect = 6.1e-182
Identity = 321/467 (68.74%), Postives = 377/467 (80.73%), Query Frame = 1

Query: 58  SSSSSSSPDKKRHWKQGEFPGITETSAPRMTPARKSPPPSSSSPVKKKHWKQGEFPGVTE 117
           S  SS  P+ KR ++  +  GI       +  A KS P    S  KK+ WK GEFPG+TE
Sbjct: 11  SLCSSRIPEGKRRFRHRDV-GIVRC----VLAASKSSP---GSVTKKRLWKDGEFPGITE 70

Query: 118 RFDARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE------------- 177
             + R+TP+KNVKKKLDR+  A  W NTVTETLS+ I  K+W+QALE             
Sbjct: 71  PVNQRRTPIKNVKKKLDRRSKANGWVNTVTETLSDLIAKKQWLQALEVFDMLREQTFYQP 130

Query: 178 -------LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSIL 237
                  LLVLLG+SGQP RA+ LFD M+EE  EPT ELYTALLAAY R+NLIDDAFSIL
Sbjct: 131 KEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLEPTVELYTALLAAYTRSNLIDDAFSIL 190

Query: 238 NLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGK 297
           + MK+ P CQPD++TYS L+KACVD+S+F+LV+SLY+EM ERLITPNTVTQNIVLSGYG+
Sbjct: 191 DKMKSFPQCQPDVFTYSTLLKACVDASQFDLVDSLYKEMDERLITPNTVTQNIVLSGYGR 250

Query: 298 IGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETR 357
           +G++DQMEKVL  ML ST C+PDVWTMNIILSVFGN G+I+MME WYEKFRNFGIEPETR
Sbjct: 251 VGRFDQMEKVLSDMLVSTACKPDVWTMNIILSVFGNMGKIDMMESWYEKFRNFGIEPETR 310

Query: 358 TFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQM 417
           TFNILIG+YGKKRMYDKMSSVMEYMRKL+FPWTTSTYNN+IEAFADVGDAKNME TF QM
Sbjct: 311 TFNILIGSYGKKRMYDKMSSVMEYMRKLEFPWTTSTYNNIIEAFADVGDAKNMELTFDQM 370

Query: 418 RAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDL 477
           R+EGMKADTKTFCCLINGY+NAGLFHKVI SV+LA K EIPENT+FYNAVISACAKA+DL
Sbjct: 371 RSEGMKADTKTFCCLINGYANAGLFHKVISSVQLAAKFEIPENTAFYNAVISACAKADDL 430

Query: 478 MEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELERKQ 505
           +EM+RV+ RMK++ C  DS+T+ IM+EAYEKE MND+++YLE ER++
Sbjct: 431 IEMERVYIRMKERQCVCDSRTFEIMVEAYEKEGMNDKIYYLEQERQK 469

BLAST of Cp4.1LG12g03500 vs. TAIR10
Match: AT5G48730.1 (AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 290.8 bits (743), Expect = 1.7e-78
Identity = 159/397 (40.05%), Postives = 222/397 (55.92%), Query Frame = 1

Query: 122 RKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE----------------- 181
           R+   K++ +K  +K + K    TV E+L E IT  RW  A++                 
Sbjct: 95  RREATKSIIEK--KKGSKKLLPRTVLESLHERITALRWESAIQVFELLREQLWYKPNVGI 154

Query: 182 ---LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMK 241
              L+V+LG+  QP++A  LF  M+ E C    E+YTAL++AY R+   D AF++L  MK
Sbjct: 155 YVKLIVMLGKCKQPEKAHELFQEMINEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMK 214

Query: 242 TLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKY 301
           +   CQPD++TYSILIK+ +    F+ V+ L  +M  + I PNT+T N ++  YGK   +
Sbjct: 215 SSHNCQPDVHTYSILIKSFLQVFAFDKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMF 274

Query: 302 DQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNI 361
            +ME  L+ ML    C+PD WTMN  L  FG  GQIEMME  YEKF++ GIEP  RTFNI
Sbjct: 275 VEMESTLIQMLGEDDCKPDSWTMNSTLRAFGGNGQIEMMENCYEKFQSSGIEPNIRTFNI 334

Query: 362 LIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEG 421
           L+ +YGK   Y KMS+VMEYM+K  + WT  TYN VI+AF   GD K MEY F  M++E 
Sbjct: 335 LLDSYGKSGNYKKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGDLKQMEYLFRLMQSER 394

Query: 422 MKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEMD 481
           +     T C L+  Y  A    K+ G ++     +I  +  F+N ++ A  + E   EM 
Sbjct: 395 IFPSCVTLCSLVRAYGRASKADKIGGVLRFIENSDIRLDLVFFNCLVDAYGRMEKFAEMK 454

Query: 482 RVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYL 499
            V + M+ K  +PD  TY  M++AY    M   V  L
Sbjct: 455 GVLELMEKKGFKPDKITYRTMVKAYRISGMTTHVKEL 489

BLAST of Cp4.1LG12g03500 vs. TAIR10
Match: AT3G53170.1 (AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 284.6 bits (727), Expect = 1.2e-76
Identity = 145/394 (36.80%), Postives = 229/394 (58.12%), Query Frame = 1

Query: 126 LKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALELLVLL---------------- 185
           +K +++K + +     W   V E L E I   RW  AL++  LL                
Sbjct: 91  VKGIERKANSEKYLTLWPKAVLEALDEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKL 150

Query: 186 ----GRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMKTLPC 245
               G   QP +A LLF+ M+ E  +PT ++YT+L++ Y ++ L+D AFS L  MK++  
Sbjct: 151 FKVLGNCKQPDQASLLFEVMLSEGLKPTIDVYTSLISVYGKSELLDKAFSTLEYMKSVSD 210

Query: 246 CQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKYDQME 305
           C+PD++T+++LI  C    RF+LV+S+  EM+   +  +TVT N ++ GYGK G +++ME
Sbjct: 211 CKPDVFTFTVLISCCCKLGRFDLVKSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEME 270

Query: 306 KVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNILIGA 365
            VL  M+E     PDV T+N I+  +GN   +  ME WY +F+  G++P+  TFNILI +
Sbjct: 271 SVLADMIEDGDSLPDVCTLNSIIGSYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILS 330

Query: 366 YGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEGMKAD 425
           +GK  MY KM SVM++M K  F  TT TYN VIE F   G  + M+  F +M+ +G+K +
Sbjct: 331 FGKAGMYKKMCSVMDFMEKRFFSLTTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPN 390

Query: 426 TKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEMDRVFK 485
           + T+C L+N YS AGL  K+   ++     ++  +T F+N +I+A  +A DL  M  ++ 
Sbjct: 391 SITYCSLVNAYSKAGLVVKIDSVLRQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYI 450

Query: 486 RMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLE 500
           +M+++ C+PD  T++ MI+ Y    + D V  LE
Sbjct: 451 QMEERKCKPDKITFATMIKTYTAHGIFDAVQELE 484

BLAST of Cp4.1LG12g03500 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 150.6 bits (379), Expect = 2.7e-36
Identity = 95/332 (28.61%), Postives = 156/332 (46.99%), Query Frame = 1

Query: 165 LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMKTLP 224
           LL + G+S +P+ A  + + MV     P+   Y +L++AY R+ ++D+A  + N M    
Sbjct: 320 LLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKG 379

Query: 225 CCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKYDQM 284
             +PD++TY+ L+     + + E   S++EEM      PN  T N  +  YG  GK+ +M
Sbjct: 380 T-KPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEM 439

Query: 285 EKVLLGMLESTTC--RPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNIL 344
            K+     E   C   PD+ T N +L+VFG  G    +   +++ +  G  PE  TFN L
Sbjct: 440 MKIFD---EINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTL 499

Query: 345 IGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEGM 404
           I AY +   +++  +V   M         STYN V+ A A  G  +  E    +M     
Sbjct: 500 ISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRC 559

Query: 405 KADTKTFCCLINGYSNA---GLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLME 464
           K +  T+C L++ Y+N    GL H +   V       I         ++  C+K + L E
Sbjct: 560 KPNELTYCSLLHAYANGKEIGLMHSLAEEVYSG---VIEPRAVLLKTLVLVCSKCDLLPE 619

Query: 465 MDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAM 492
            +R F  +K++   PD  T + M+  Y +  M
Sbjct: 620 AERAFSELKERGFSPDITTLNSMVSIYGRRQM 644

BLAST of Cp4.1LG12g03500 vs. TAIR10
Match: AT4G39620.1 (AT4G39620.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 144.1 bits (362), Expect = 2.5e-34
Identity = 85/327 (25.99%), Postives = 153/327 (46.79%), Query Frame = 1

Query: 164 ELLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNN----LIDDAFSILNL 223
           +L+ ++G+ GQ + A  LF  M    C P   +Y AL+ A+         ++     L+ 
Sbjct: 138 KLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHTRDKAKALEKVRGYLDK 197

Query: 224 MKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIG 283
           MK +  CQP++ TY+IL++A   S + + V +L++++    ++P+  T N V+  YGK G
Sbjct: 198 MKGIERCQPNVVTYNILLRAFAQSGKVDQVNALFKDLDMSPVSPDVYTFNGVMDAYGKNG 257

Query: 284 KYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTF 343
              +ME VL  M  S  C+PD+ T N+++  +G K + E ME+ ++       +P   TF
Sbjct: 258 MIKEMEAVLTRM-RSNECKPDIITFNVLIDSYGKKQEFEKMEQTFKSLMRSKEKPTLPTF 317

Query: 344 NILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRA 403
           N +I  YGK RM DK   V + M  + +  +  TY  +I  +   G        F ++  
Sbjct: 318 NSMIINYGKARMIDKAEWVFKKMNDMNYIPSFITYECMIMMYGYCGSVSRAREIFEEVGE 377

Query: 404 EGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLME 463
                   T   ++  Y   GL+ +       A    +  + S Y  +  A  KA+   +
Sbjct: 378 SDRVLKASTLNAMLEVYCRNGLYIEADKLFHNASAFRVHPDASTYKFLYKAYTKADMKEQ 437

Query: 464 MDRVFKRMKDKHCQPDSKTYSIMIEAY 487
           +  + K+M+     P+ + +   +E +
Sbjct: 438 VQILMKKMEKDGIVPNKRFFLEALEVF 463

BLAST of Cp4.1LG12g03500 vs. NCBI nr
Match: gi|700190416|gb|KGN45620.1| (hypothetical protein Csa_6G000680 [Cucumis sativus])

HSP 1 Score: 787.7 bits (2033), Expect = 1.2e-224
Identity = 418/538 (77.70%), Postives = 437/538 (81.23%), Query Frame = 1

Query: 5   SFSFSSSLLPSSVSFDHGSSSFFPILISSSFDS----NSRVVRCTFAAPARKSPPPPSSS 64
           SFSFSSSLLPSSVSFDH S+  FPI +SSS  S    NSR+VRC FAAP RKSP P   S
Sbjct: 3   SFSFSSSLLPSSVSFDHVSTFLFPIFVSSSCSSSSHPNSRIVRCAFAAPTRKSPVP---S 62

Query: 65  SSSSPDKKRHWKQGEFPGITETSAPRMTPARKSPPPSSSSPVKKKHWKQGEFPGVTERFD 124
           +SSSP KKRHWKQGEFPG TETS  R  P ++         VKKK               
Sbjct: 63  TSSSPAKKRHWKQGEFPGTTETSTRRRAPLKR---------VKKKL-------------- 122

Query: 125 ARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE---------------- 184
                        DRK NAKAWANTVTE LS+HITNKRW+QALE                
Sbjct: 123 -------------DRKNNAKAWANTVTEALSDHITNKRWLQALEVFEMLREQPFYEPKEG 182

Query: 185 ----LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLM 244
               LLVLLGRSGQP RARLLFDTMV+E+CEPT ELYTALLAAYCRNNLIDDAFS LNLM
Sbjct: 183 TYMKLLVLLGRSGQPHRARLLFDTMVQERCEPTPELYTALLAAYCRNNLIDDAFSTLNLM 242

Query: 245 KTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGK 304
           KTLP CQPD+YTYSILIKACVD SRFE+VESLYEEMAERLITPNTVTQNIVLSGYGKIGK
Sbjct: 243 KTLPRCQPDVYTYSILIKACVDDSRFEIVESLYEEMAERLITPNTVTQNIVLSGYGKIGK 302

Query: 305 YDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFN 364
           YDQMEKVL+GMLESTTCRPDVWTMNIILSVFGNKG IEMMERWYEKFRNFGIEPETRTFN
Sbjct: 303 YDQMEKVLIGMLESTTCRPDVWTMNIILSVFGNKGHIEMMERWYEKFRNFGIEPETRTFN 362

Query: 365 ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAE 424
           ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTF QMRAE
Sbjct: 363 ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFEQMRAE 422

Query: 425 GMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEM 484
           GM+ADTKTFCCLINGY+NAGLFHKVIGSVKLAGK EIPENTSFYNAVISACAKAEDLMEM
Sbjct: 423 GMRADTKTFCCLINGYANAGLFHKVIGSVKLAGKLEIPENTSFYNAVISACAKAEDLMEM 482

Query: 485 DRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELERKQANLLVVRDKTANRE 519
           DRVFKRMKDKHCQPD+KTYSIM+EAY KE MNDRVHYLELE+KQ     V D  +N E
Sbjct: 483 DRVFKRMKDKHCQPDNKTYSIMMEAYGKEGMNDRVHYLELEKKQ-----VIDNASNNE 496

BLAST of Cp4.1LG12g03500 vs. NCBI nr
Match: gi|778708982|ref|XP_004138596.2| (PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic [Cucumis sativus])

HSP 1 Score: 787.7 bits (2033), Expect = 1.2e-224
Identity = 418/538 (77.70%), Postives = 437/538 (81.23%), Query Frame = 1

Query: 5   SFSFSSSLLPSSVSFDHGSSSFFPILISSSFDS----NSRVVRCTFAAPARKSPPPPSSS 64
           SFSFSSSLLPSSVSFDH S+  FPI +SSS  S    NSR+VRC FAAP RKSP P   S
Sbjct: 13  SFSFSSSLLPSSVSFDHVSTFLFPIFVSSSCSSSSHPNSRIVRCAFAAPTRKSPVP---S 72

Query: 65  SSSSPDKKRHWKQGEFPGITETSAPRMTPARKSPPPSSSSPVKKKHWKQGEFPGVTERFD 124
           +SSSP KKRHWKQGEFPG TETS  R  P ++         VKKK               
Sbjct: 73  TSSSPAKKRHWKQGEFPGTTETSTRRRAPLKR---------VKKKL-------------- 132

Query: 125 ARKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE---------------- 184
                        DRK NAKAWANTVTE LS+HITNKRW+QALE                
Sbjct: 133 -------------DRKNNAKAWANTVTEALSDHITNKRWLQALEVFEMLREQPFYEPKEG 192

Query: 185 ----LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLM 244
               LLVLLGRSGQP RARLLFDTMV+E+CEPT ELYTALLAAYCRNNLIDDAFS LNLM
Sbjct: 193 TYMKLLVLLGRSGQPHRARLLFDTMVQERCEPTPELYTALLAAYCRNNLIDDAFSTLNLM 252

Query: 245 KTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGK 304
           KTLP CQPD+YTYSILIKACVD SRFE+VESLYEEMAERLITPNTVTQNIVLSGYGKIGK
Sbjct: 253 KTLPRCQPDVYTYSILIKACVDDSRFEIVESLYEEMAERLITPNTVTQNIVLSGYGKIGK 312

Query: 305 YDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFN 364
           YDQMEKVL+GMLESTTCRPDVWTMNIILSVFGNKG IEMMERWYEKFRNFGIEPETRTFN
Sbjct: 313 YDQMEKVLIGMLESTTCRPDVWTMNIILSVFGNKGHIEMMERWYEKFRNFGIEPETRTFN 372

Query: 365 ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAE 424
           ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTF QMRAE
Sbjct: 373 ILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFEQMRAE 432

Query: 425 GMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEM 484
           GM+ADTKTFCCLINGY+NAGLFHKVIGSVKLAGK EIPENTSFYNAVISACAKAEDLMEM
Sbjct: 433 GMRADTKTFCCLINGYANAGLFHKVIGSVKLAGKLEIPENTSFYNAVISACAKAEDLMEM 492

Query: 485 DRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELERKQANLLVVRDKTANRE 519
           DRVFKRMKDKHCQPD+KTYSIM+EAY KE MNDRVHYLELE+KQ     V D  +N E
Sbjct: 493 DRVFKRMKDKHCQPDNKTYSIMMEAYGKEGMNDRVHYLELEKKQ-----VIDNASNNE 506

BLAST of Cp4.1LG12g03500 vs. NCBI nr
Match: gi|659116850|ref|XP_008458291.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic [Cucumis melo])

HSP 1 Score: 782.7 bits (2020), Expect = 3.9e-223
Identity = 413/535 (77.20%), Postives = 439/535 (82.06%), Query Frame = 1

Query: 5   SFSFSSSLLPSSVSFDHGSSSFFPILISSSFDSNSRVVRCTFAAPARKSPPPPSSSSSSS 64
           SFSFSSSLLPSSVSFDH S+  FPI +SSS                       SSSSSS 
Sbjct: 3   SFSFSSSLLPSSVSFDHVSTFLFPIFVSSS-----------------------SSSSSSH 62

Query: 65  PDKKRHWKQGEFPGITETSAPRMTPARKSPPPS-SSSPVKKKHWKQGEFPGVTERFDARK 124
           P+ +                    P RKSP PS SSSP KK+HWKQGEFPG+TE    RK
Sbjct: 63  PNSRI------------VRCAFAAPTRKSPVPSTSSSPAKKRHWKQGEFPGITETPTRRK 122

Query: 125 TPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE------------------- 184
           TPLKNVKKKLDRK NAKAWANTVTE LS+HI+NKRW++ALE                   
Sbjct: 123 TPLKNVKKKLDRKNNAKAWANTVTEALSDHISNKRWLEALEVFEMLREQPFYEPKEGTYM 182

Query: 185 -LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYCRNNLIDDAFSILNLMKTL 244
            LLVLLGRSGQP RARLLFDTM++E+CEPT+ELYTALLAAYCRNNLIDDAFS LNLMKTL
Sbjct: 183 KLLVLLGRSGQPHRARLLFDTMLQERCEPTSELYTALLAAYCRNNLIDDAFSTLNLMKTL 242

Query: 245 PCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNTVTQNIVLSGYGKIGKYDQ 304
           P CQPD+YTYSILIKACVD+SRFE+VESLYEEMA+RLITPNTVTQNIVLSGYGKIGKYDQ
Sbjct: 243 PGCQPDVYTYSILIKACVDASRFEIVESLYEEMAQRLITPNTVTQNIVLSGYGKIGKYDQ 302

Query: 305 MEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNILI 364
           MEKVL+GMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNILI
Sbjct: 303 MEKVLIGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYEKFRNFGIEPETRTFNILI 362

Query: 365 GAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFTQMRAEGMK 424
           GAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTF QMRAEGM+
Sbjct: 363 GAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVGDAKNMEYTFEQMRAEGMR 422

Query: 425 ADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYNAVISACAKAEDLMEMDRV 484
           ADTKTFCCLINGY+NAGLFHKVI SVKLAGK EIPENTSFYNAVISACAKAEDLMEMDRV
Sbjct: 423 ADTKTFCCLINGYANAGLFHKVISSVKLAGKLEIPENTSFYNAVISACAKAEDLMEMDRV 482

Query: 485 FKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELERKQANLLVVRDKTANRE 519
           FKRMKDKHCQPDSKTY+IM+EAY KE MNDRVHYLELE+K+     V D  +N E
Sbjct: 483 FKRMKDKHCQPDSKTYNIMMEAYGKEGMNDRVHYLELEKKE-----VIDYASNNE 497

BLAST of Cp4.1LG12g03500 vs. NCBI nr
Match: gi|694324867|ref|XP_009353413.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic-like [Pyrus x bretschneideri])

HSP 1 Score: 706.1 bits (1821), Expect = 4.7e-200
Identity = 358/482 (74.27%), Postives = 400/482 (82.99%), Query Frame = 1

Query: 47  AAPAR-KSPPPPSSSSSSSPDKKRHWKQGEFPGITETSAP---RMTPARKSPPPSSSSPV 106
           +APA  + PP  S+SS SS  +K+HWK+GEFPG++ETS P   R   AR    PSS +  
Sbjct: 84  SAPANYRKPPGRSTSSPSSDTRKKHWKRGEFPGVSETSTPATYRKPSARSPSYPSSDN-- 143

Query: 107 KKKHWKQGEFPGVTERFDA---RKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRW 166
           KKKHWKQGEFPGV+E       RK PLKNVKKKLDRK NAKAW NTVTE LS+ I  K+W
Sbjct: 144 KKKHWKQGEFPGVSETSIPAIYRKPPLKNVKKKLDRKNNAKAWVNTVTEALSDAIDKKQW 203

Query: 167 IQALE--------------------LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTA 226
           +QALE                    L+ +LGR GQP RAR LFDTMVEE CEPT ELYTA
Sbjct: 204 LQALEVFDMLREQPFYQPKEGTYMKLIGMLGRCGQPNRARQLFDTMVEEGCEPTLELYTA 263

Query: 227 LLAAYCRNNLIDDAFSILNLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAER 286
           LLAAYCRNNLID+AFS+LNLMKTLP CQPD++TYS LIK CVD  +F+LVESLYEEMAER
Sbjct: 264 LLAAYCRNNLIDEAFSVLNLMKTLPQCQPDVFTYSTLIKVCVDHLKFDLVESLYEEMAER 323

Query: 287 LITPNTVTQNIVLSGYGKIGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEM 346
           LITPNTVTQNIVLSGYGK GKYDQMEKVL GMLE T+C+PDVWTMN+ILSVFGNKGQI+M
Sbjct: 324 LITPNTVTQNIVLSGYGKAGKYDQMEKVLSGMLEGTSCKPDVWTMNVILSVFGNKGQIDM 383

Query: 347 MERWYEKFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIE 406
           MERWYEKFR+FGIEPETRT NILIGAYGKKR+YDKMS+VMEYMRKLQFPWTT+TYNNVIE
Sbjct: 384 MERWYEKFRDFGIEPETRTLNILIGAYGKKRLYDKMSTVMEYMRKLQFPWTTATYNNVIE 443

Query: 407 AFADVGDAKNMEYTFTQMRAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPE 466
           AFADVGDAKNMEYTF QMRAEGMKADTKTFCCLINGY+NAGLFHKV+  V+LAGK EIPE
Sbjct: 444 AFADVGDAKNMEYTFEQMRAEGMKADTKTFCCLINGYANAGLFHKVVSCVQLAGKFEIPE 503

Query: 467 NTSFYNAVISACAKAEDLMEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLE 502
           NT+FYNAVI+ACAKAEDLMEM+RVF RMK+K CQ DS TYS+M+EAY KE MND+++YL+
Sbjct: 504 NTTFYNAVIAACAKAEDLMEMERVFNRMKEKQCQADSTTYSVMVEAYSKEGMNDKIYYLK 563

BLAST of Cp4.1LG12g03500 vs. NCBI nr
Match: gi|657996020|ref|XP_008390371.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic-like [Malus domestica])

HSP 1 Score: 703.7 bits (1815), Expect = 2.3e-199
Identity = 351/476 (73.74%), Postives = 397/476 (83.40%), Query Frame = 1

Query: 52  KSPPPPSSSSSSSPDKKRHWKQGEFPGITETSAP---RMTPARKSPPPSSSSPVKKKHWK 111
           +  P  S+SS SS  +K+HWK+GEFPG++ETS P   R T AR +  PSS +  +KKHWK
Sbjct: 123 RKTPGRSTSSESSDTRKKHWKRGEFPGVSETSTPATYRKTSARSASYPSSDN--RKKHWK 182

Query: 112 QGEFPGVTERFDA---RKTPLKNVKKKLDRKMNAKAWANTVTETLSEHITNKRWIQALE- 171
           QGEFPGV+E       RK PLKNVKKKLDRK NAKAW NTVTE LS+ I  K+W+QALE 
Sbjct: 183 QGEFPGVSETSIPAIYRKPPLKNVKKKLDRKNNAKAWVNTVTEALSDAIDKKQWLQALEV 242

Query: 172 -------------------LLVLLGRSGQPQRARLLFDTMVEEKCEPTTELYTALLAAYC 231
                              L+ +LGR GQP RAR LFDTMVEE CEPT +LYTALLAAYC
Sbjct: 243 FDMLREQPFYQPKEGTYMKLIGMLGRCGQPNRARQLFDTMVEEGCEPTLDLYTALLAAYC 302

Query: 232 RNNLIDDAFSILNLMKTLPCCQPDIYTYSILIKACVDSSRFELVESLYEEMAERLITPNT 291
           RNNLID+AFS+LNLMKTLP CQPD++TYS LIK C+D  +F+LVESLYEEMAERLI PNT
Sbjct: 303 RNNLIDEAFSVLNLMKTLPQCQPDVFTYSTLIKVCIDHLKFDLVESLYEEMAERLIAPNT 362

Query: 292 VTQNIVLSGYGKIGKYDQMEKVLLGMLESTTCRPDVWTMNIILSVFGNKGQIEMMERWYE 351
           VTQNIVLSGYGK GKYDQMEKVL GMLE T+C+PDVWTMN++LSVFGNKGQI+MMERWYE
Sbjct: 363 VTQNIVLSGYGKAGKYDQMEKVLSGMLEGTSCKPDVWTMNVVLSVFGNKGQIDMMERWYE 422

Query: 352 KFRNFGIEPETRTFNILIGAYGKKRMYDKMSSVMEYMRKLQFPWTTSTYNNVIEAFADVG 411
           KFR+FGIEPETRT NILIGAYGKKR+YDKMS+VMEYMRKLQFPWTT+TYNNVIEAFADVG
Sbjct: 423 KFRDFGIEPETRTLNILIGAYGKKRLYDKMSTVMEYMRKLQFPWTTATYNNVIEAFADVG 482

Query: 412 DAKNMEYTFTQMRAEGMKADTKTFCCLINGYSNAGLFHKVIGSVKLAGKCEIPENTSFYN 471
           DAKNMEYTF QMRAEGMKADTKTFCCLINGY+NAGLFHKV+  V+LAGK EIPENT+FYN
Sbjct: 483 DAKNMEYTFEQMRAEGMKADTKTFCCLINGYANAGLFHKVVSCVQLAGKFEIPENTTFYN 542

Query: 472 AVISACAKAEDLMEMDRVFKRMKDKHCQPDSKTYSIMIEAYEKEAMNDRVHYLELE 502
           AVI+ACAKAEDLMEM+RVF RMK+K CQPDS TYS+M+EAY KE MND+++YL+ E
Sbjct: 543 AVIAACAKAEDLMEMERVFNRMKEKQCQPDSTTYSVMVEAYSKEGMNDKIYYLKQE 596

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP216_ARATH1.1e-18068.74Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidop... [more]
PP424_ARATH2.9e-7740.05Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop... [more]
PP279_ARATH2.1e-7536.80Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN... [more]
PP362_ARATH4.7e-3528.61Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN... [more]
PP358_ARATH4.4e-3325.99Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0K7C8_CUCSA8.5e-22577.70Uncharacterized protein OS=Cucumis sativus GN=Csa_6G000680 PE=4 SV=1[more]
A0A067K848_JATCU7.8e-19471.97Uncharacterized protein OS=Jatropha curcas GN=JCGZ_19079 PE=4 SV=1[more]
M5XJ12_PRUPE9.5e-19277.80Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa020554mg PE=4 S... [more]
A5BHU7_VITVI4.4e-18970.64Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025315 PE=4 SV=1[more]
F6HE37_VITVI4.4e-18970.64Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0020g00690 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
AT3G06430.16.1e-18268.74 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G48730.11.7e-7840.05 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53170.11.2e-7636.80 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G02860.12.7e-3628.61 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G39620.12.5e-3425.99 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|700190416|gb|KGN45620.1|1.2e-22477.70hypothetical protein Csa_6G000680 [Cucumis sativus][more]
gi|778708982|ref|XP_004138596.2|1.2e-22477.70PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic ... [more]
gi|659116850|ref|XP_008458291.1|3.9e-22377.20PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic ... [more]
gi|694324867|ref|XP_009353413.1|4.7e-20074.27PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic-... [more]
gi|657996020|ref|XP_008390371.1|2.3e-19973.74PREDICTED: pentatricopeptide repeat-containing protein At3g06430, chloroplastic-... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0003006 developmental process involved in reproduction
biological_process GO:0009790 embryo development
biological_process GO:0048229 gametophyte development
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0009553 embryo sac development
biological_process GO:0009555 pollen development
biological_process GO:0048868 pollen tube development
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0009507 chloroplast
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding
molecular_function GO:0019843 rRNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG12g03500.1Cp4.1LG12g03500.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 269..293
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 440..486
score: 2.8E-13coord: 193..240
score: 4.8E-12coord: 299..347
score: 7.
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 369..416
score: 3.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 168..193
score: 0.002coord: 196..229
score: 5.0E-6coord: 444..476
score: 4.9E-8coord: 231..264
score: 1.8E-5coord: 303..334
score: 0.0022coord: 266..300
score: 1.3E-4coord: 373..405
score: 0.0013coord: 338..366
score: 4.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 193..227
score: 8.879coord: 370..404
score: 9.131coord: 264..299
score: 8.835coord: 335..369
score: 8.55coord: 300..334
score: 9.657coord: 158..192
score: 6.621coord: 229..263
score: 10.918coord: 440..474
score: 11.235coord: 475..509
score: 6.226coord: 405..439
score: 6
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 75..501
score: 5.9E
NoneNo IPR availablePANTHERPTHR24015:SF301SUBFAMILY NOT NAMEDcoord: 75..501
score: 5.9E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG12g03500Cp4.1LG17g00030Cucurbita pepo (Zucchini)cpecpeB158
The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG12g03500Cucumber (Gy14) v2cgybcpeB179
Cp4.1LG12g03500Cucumber (Gy14) v2cgybcpeB303
Cp4.1LG12g03500Melon (DHL92) v3.6.1cpemedB170
Cp4.1LG12g03500Silver-seed gourdcarcpeB0393
Cp4.1LG12g03500Cucumber (Chinese Long) v3cpecucB0154
Cp4.1LG12g03500Cucurbita pepo (Zucchini)cpecpeB184
Cp4.1LG12g03500Cucurbita pepo (Zucchini)cpecpeB194
Cp4.1LG12g03500Watermelon (97103) v1cpewmB152