Cp4.1LG14g06120 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g06120
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG14 : 598176 .. 600494 (-)
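
The location field above can be read programmatically. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG14 : 598176 .. 600494 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the feature spans 2319 bp on the minus strand of Cp4.1LG14.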
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTATCAATCATGAACTCCTTATCGCTTGTGAGTTCGCGGATAGTCTGCAGATTGTTTTCGACTTCTTTTATTGTGCGTCGAACAATCTGTAATCGGAGCTTTGGTGGTGATGATCGATTTGAGTTTGTTGTGGAACCCAATAAGAAGAGGTTGGAGGGATCGGAGTTTGGTTACTTTACTGATGAAAATTCGGATTCCTGTGAGATTGGGAGTCGTAGAGGGGATGATCTGTCAATGAGAAGGGGTTTTCTAGAGGGTGCGAAAATTGGTGCTGAAAAAGTTATTGAGATTCTCAAACAGGACGGTCCTGGATTTGACACATTATTGGCTTTGGATGAACTGCAACTACAGGTCTCAGGGGTTCTTGTTAGGGAAGTTCTGAAGGGGATTTTGAGAAGTATAAATGTTCTAAACAAAACTCAATGTGCAAAATTGGGGTACAAGTTCTTCGTGTGGTCCAGTAAGGTCGAGAATTATCGACACACTGCGAGCTCGTATCATATGGTCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTACTTGATGAGATGACTGAGAAAGGGTACCCTGTGACTGCAAGGACTTTTATGATATTGATATGTACCTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTAAACACTCTTATAATGCGATTCTTCATGGACTACTTGTGATAAAACAGTACAAGTTGATTGAGTGGGTGTATCAGCAAATGCTGCTTGATGGTCACAGCTCAGACATTCTTACATATAATGTGGTGTTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGATTGCTTGATGAAATGGCTAGAAATGGGTTTTCTCCTGATTATCATACTTATAACATTCTTCTTTTTGTTCTTGGCAAGGGAGACAAACCTCTTGCAGCTCTAAATCTTTTGAACCACATGAGGGAAGTGGGTTTTGATCCGAGCATTCTTCATTTCACGACGCTTATCGATGGACTTAGTCGGGCTGGAAATTTGGNGTAAATGTTCTAAACAAAACTCAATGTGCAAAATCGGGGTACAAGTTCTTCGTGTGGTCCGGTAAGGTCGAGAATTATCGACACACTGCGAGCTCGTATCATATNATAAATGTTCTAAACAAAACTCAATGTGCAAAATTGGGGTACAAGTTCTTCGTGTGGTCCAGTAAGGTCGAGAATTATCGACACACTGCGAGCTCGTATCATATGGTCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTACTTGATGAGATGACTGAGAAAGGGTACCCTGTGACTGCAAGGACTTTTATGATATTGATATGTACCTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTANTTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTTGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTAAACACTCTTATAATGCGATTCTTCATGGACTACTTGTGATAAAACAGTACAAGTTGATTGAGTGGGTGTATCAGCAAATGCTGCTTGATGGTCACAGCTCAGACATTCTTACATATAATGTGGTGTTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGATTGCTTGATGAAATGGCTAGAAATGGGTTTTCTCCTGATTATCATACTTATAACATTCTTCTTTTTGTTCTTGGCAAGGGAGACAAACCTCTTGCAGCTCTAAATCTTTTGAACCACATGAGGGAAGTGGGTTTTGATCCGAGCATTCTTCATTTCACGACGCTTATCGATGGACTTAGTCGGGCTGGAAATTTGGATGCTTGCAAATATTTCTTTGATGAACTGGGAAACAAGGGGTGCATTCCTGATGTTGCTTGCTACACTGTGATGATCACAACCTA
CACAGTGGCTGGGGAGCATGAGAAAGCTAAGGAGCTTTTTGATGAGATGGTCATGAAGGGTCAGCTCCCTAATGTATTGACGTACAACTCCATGATTCGGGGTTTTTGTATGGCGGGTAAGTTCGATGAGGCTTACTCGATGCTGGCTGAAATGGAAACCCGTGGTTGCAGACCAAATTTTGTTGTATATAGTACCCTTGTAAGCTATTTGCGAAATGCTGGAAAGCTTTCTGAGGCTCATAAAGTTATAACGCGCATGGTGGAGAAGGGGCAATACGCCCATTTAGTGACGAAATTCAGGGGATATAGGAGATGCTAG

mRNA sequence

ATGGTCTGCAGATTGTTTTCGACTTCTTTTATTGTGCGTCGAACAATCTGTAATCGGAGCTTTGGTGGTGATGATCGATTTGAGTTTGTTGTGGAACCCAATAAGAAGAGGTTGGAGGGATCGGAGTTTGGTTACTTTACTGATGAAAATTCGGATTCCTGTGAGATTGGGAGTCGTAGAGGGGATGATCTGTCAATGAGAAGGGGTTTTCTAGAGGGTGCGAAAATTGGTGCTGAAAAAGTTATTGAGATTCTCAAACAGGACGGTCCTGGATTTGACACATTATTGGCTTTGGATGAACTGCAACTACAGGTCTCAGGGGTTCTTGTTAGGGAAGTTCTGAAGGGGATTTTGAGAAGTATAAATGTTCTAAACAAAACTCAATGTGCAAAATTGGGGTACAAGTTCTTCGTGTGGTCCAGTAAGGTCGAGAATTATCGACACACTGCGAGCTCGTATCATATGGTCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTACTTGATGAGATGACTGAGAAAGGGTACCCTGTGACTGCAAGGACTTTTATGATATTGATATGTACCTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTAAACACTCTTATAATGCGATTCTTCATGGACTACTTGTGATAAAACAGTACAAGTTGATTGAGTGGGTGTATCAGCAAATGCTGCTTGATGGTCACAGCTCAGACATTCTTACATATAATGTGGTGTTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGATTGCTTGATGAAATGGCTAGAAATGGGTTTTCTCCTGATTATCATACTTATAACATTCTTCTTTTTGTTCTTGGCAAGGGAGACAAACCTCTTGCAGCTCTAAATCTTTTGAACCACATGAGGGAAGTGGGTTTTGATCCGAGCATTCTTCATTTCACGACGCTTATCGATGGACTTAGTCGGGCTGGAAATTTGGNTAAGGTCGAGAATTATCGACACACTGCGAGCTCGTATCATATGGTCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTACTTGATGAGATGACTGAGAAAGGGTACCCTGTGACTGCAAGGACTTTTATGATATTGATATGTACCTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTANTTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTTGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTAAACACTCTTATAATGCGATTCTTCATGGACTACTTGTGATAAAACAGTACAAGTTGATTGAGTGGGTGTATCAGCAAATGCTGCTTGATGGTCACAGCTCAGACATTCTTACATATAATGTGGTGTTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGATTGCTTGATGAAATGGCTAGAAATGGGTTTTCTCCTGATTATCATACTTATAACATTCTTCTTTTTGTTCTTGGCAAGGGAGACAAACCTCTTGCAGCTCTAAATCTTTTGAACCACATGAGGGAAGTGGGTTTTGATCCGAGCATTCTTCATTTCACGACGCTTATCGATGGACTTAGTCGGGCTGGAAATTTGGATGCTTGCAAATATTTCTTTGATGAACTGGGAAACAAGGGGTGCATTCCTGATGTTGCTTGCTACACTGTGATGATCACAACCTACACAGTGGCTGGGGAGCATGAGAAAGCTAAGGAGCTTTTTGATGAGATGGTCATGAAGGGTCAGCTCCCTAATGTATTGACGTACAACTCCATGATTCGGGGTTTTTGTATGGCGGGTAAGTTCGATGAGGCTTACTCGATGCTGGCTGAAATGGAAACCCGTGGTTGCAGACCAAATTTTGTTGTATATAGTACCCTTGTAAGCTATTT
GCGAAATGCTGGAAAGCTTTCTGAGGCTCATAAAGTTATAACGCGCATGGTGGAGAAGGGGCAATACGCCCATTTAGTGACGAAATTCAGGGGATATAGGAGATGCTAG

Coding sequence (CDS)

ATGGTCTGCAGATTGTTTTCGACTTCTTTTATTGTGCGTCGAACAATCTGTAATCGGAGCTTTGGTGGTGATGATCGATTTGAGTTTGTTGTGGAACCCAATAAGAAGAGGTTGGAGGGATCGGAGTTTGGTTACTTTACTGATGAAAATTCGGATTCCTGTGAGATTGGGAGTCGTAGAGGGGATGATCTGTCAATGAGAAGGGGTTTTCTAGAGGGTGCGAAAATTGGTGCTGAAAAAGTTATTGAGATTCTCAAACAGGACGGTCCTGGATTTGACACATTATTGGCTTTGGATGAACTGCAACTACAGGTCTCAGGGGTTCTTGTTAGGGAAGTTCTGAAGGGGATTTTGAGAAGTATAAATGTTCTAAACAAAACTCAATGTGCAAAATTGGGGTACAAGTTCTTCGTGTGGTCCAGTAAGGTCGAGAATTATCGACACACTGCGAGCTCGTATCATATGGTCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTACTTGATGAGATGACTGAGAAAGGGTACCCTGTGACTGCAAGGACTTTTATGATATTGATATGTACCTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTAAACACTCTTATAATGCGATTCTTCATGGACTACTTGTGATAAAACAGTACAAGTTGATTGAGTGGGTGTATCAGCAAATGCTGCTTGATGGTCACAGCTCAGACATTCTTACATATAATGTGGTGTTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGATTGCTTGATGAAATGGCTAGAAATGGGTTTTCTCCTGATTATCATACTTATAACATTCTTCTTTTTGTTCTTGGCAAGGGAGACAAACCTCTTGCAGCTCTAAATCTTTTGAACCACATGAGGGAAGTGGGTTTTGATCCGAGCATTCTTCATTTCACGACGCTTATCGATGGACTTAGTCGGGCTGGAAATTTGGNTAAGGTCGAGAATTATCGACACACTGCGAGCTCGTATCATATGGTCATGAAAATATTTGCTGAATGTGAGGAGTTCAAGGCTATGTGGAGGTTACTTGATGAGATGACTGAGAAAGGGTACCCTGTGACTGCAAGGACTTTTATGATATTGATATGTACCTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTCGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTANTTGTGGTGATGCAGGCTTGGCTAGGAAAGTTGTTGAGAGGTTTATCAAATCAAAAACATTCAATTTTAGGCCGTTTAAACACTCTTATAATGCGATTCTTCATGGACTACTTGTGATAAAACAGTACAAGTTGATTGAGTGGGTGTATCAGCAAATGCTGCTTGATGGTCACAGCTCAGACATTCTTACATATAATGTGGTGTTGTATGCAAGGTGCAAATTGGGGAAATTGGATCAGTTTCACAGATTGCTTGATGAAATGGCTAGAAATGGGTTTTCTCCTGATTATCATACTTATAACATTCTTCTTTTTGTTCTTGGCAAGGGAGACAAACCTCTTGCAGCTCTAAATCTTTTGAACCACATGAGGGAAGTGGGTTTTGATCCGAGCATTCTTCATTTCACGACGCTTATCGATGGACTTAGTCGGGCTGGAAATTTGGATGCTTGCAAATATTTCTTTGATGAACTGGGAAACAAGGGGTGCATTCCTGATGTTGCTTGCTACACTGTGATGATCACAACCTACACAGTGGCTGGGGAGCATGAGAAAGCTAAGGAGCTTTTTGATGAGATGGTCATGAAGGGTCAGCTCCCTAATGTATTGACGTACAACTCCATGATTCGGGGTTTTTGTATGGCGGGTAAGTTCGATGAGGCTTACTCGATGCTGGCTGAAATGGAAACCCGTGGTTGCAGACCAAATTTTGTTGTATATAGTACCCTTGTAAGCTATTT
GCGAAATGCTGGAAAGCTTTCTGAGGCTCATAAAGTTATAACGCGCATGGTGGAGAAGGGGCAATACGCCCATTTAGTGACGAAATTCAGGGGATATAGGAGATGCTAG

Protein sequence

MVCRLFSTSFIVRRTICNRSFGGDDRFEFVVEPNKKRLEGSEFGYFTDENSDSCEIGSRRGDDLSMRRGFLEGAKIGAEKVIEILKQDGPGFDTLLALDELQLQVSGVLVREVLKGILRSINVLNKTQCAKLGYKFFVWSSKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLXKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARKVVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCIPDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSMLAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC
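
The correspondence between the coding sequence and the protein sequence above can be spot-checked with a small translation routine. This is a sketch under stated assumptions: it uses the standard nuclear genetic code, reports any codon containing an ambiguous base such as N as X (which appears to match the X residues in the protein above), and the function name `translate` is hypothetical rather than taken from the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1.

    Stop codons become '*'; codons with non-ACGT bases become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 21 bases of the CDS listed above.
print(translate("ATGGTCTGCAGATTGTTTTCG"))
# Last 12 bases of the CDS listed above, including the stop codon.
print(translate("AGGAGATGCTAG"))
```

The two printed fragments can be compared with the start (MVCRLFS) and end (RRC, followed by the stop) of the protein sequence.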
BLAST of Cp4.1LG14g06120 vs. Swiss-Prot
Match: PP288_ARATH (Pentatricopeptide repeat-containing protein At3g60050 OS=Arabidopsis thaliana GN=At3g60050 PE=2 SV=1)

HSP 1 Score: 491.9 bits (1265), Expect = 1.2e-137
Identity = 236/357 (66.11%), Positives = 279/357 (78.15%), Query Frame = 1

Query: 345 ENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARK 404
           E +RHT +SYH++MKIFAEC E+KAMWRL+DEM + G+P TARTF +LIC+CG+AGLA++
Sbjct: 143 ECFRHTVNSYHLLMKIFAECGEYKAMWRLVDEMVQDGFPTTARTFNLLICSCGEAGLAKQ 202

Query: 405 VVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYK 464
            V +F+KSKTFN+                          RPFKHSYNAIL+ LL +KQYK
Sbjct: 203 AVVQFMKSKTFNY--------------------------RPFKHSYNAILNSLLGVKQYK 262

Query: 465 LIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 524
           LIEWVY+QML DG S D+LTYN++L+   +LGK+D+F RL DEMAR+GFSPD +TYNILL
Sbjct: 263 LIEWVYKQMLEDGFSPDVLTYNILLWTNYRLGKMDRFDRLFDEMARDGFSPDSYTYNILL 322

Query: 525 FVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCI 584
            +LGKG+KPLAAL  LNHM+EVG DPS+LH+TTLIDGLSRAGNL+ACKYF DE+   GC 
Sbjct: 323 HILGKGNKPLAALTTLNHMKEVGIDPSVLHYTTLIDGLSRAGNLEACKYFLDEMVKAGCR 382

Query: 585 PDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSM 644
           PDV CYTVMIT Y V+GE +KAKE+F EM +KGQLPNV TYNSMIRG CMAG+F EA  +
Sbjct: 383 PDVVCYTVMITGYVVSGELDKAKEMFREMTVKGQLPNVFTYNSMIRGLCMAGEFREACWL 442

Query: 645 LAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRR 702
           L EME+RGC PNFVVYSTLVSYLR AGKLSEA KVI  MV+KG Y HLV K   YRR
Sbjct: 443 LKEMESRGCNPNFVVYSTLVSYLRKAGKLSEARKVIREMVKKGHYVHLVPKMMKYRR 473
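
The percentages on each HSP summary line follow directly from the match counts and the alignment length. The sketch below shows that arithmetic for the first hit. The helper names `parse_hsp_counts` and `percent` are hypothetical, and the parser assumes only that the line contains the identity fraction followed by the positives fraction.

```python
import re


def parse_hsp_counts(line):
    """Return ((identities, length), (positives, length)) from an HSP summary line."""
    pairs = re.findall(r"(\d+)/(\d+)", line)
    if len(pairs) < 2:
        raise ValueError(f"expected two count/length fractions in: {line!r}")
    (ident, ident_len), (pos, pos_len) = pairs[0], pairs[1]
    return (int(ident), int(ident_len)), (int(pos), int(pos_len))


def percent(matches, aligned_length):
    """Percentage of aligned columns that match, rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned length must be positive")
    return round(100.0 * matches / aligned_length, 2)


line = "Identity = 236/357 (66.11%), Positives = 279/357 (78.15%), Query Frame = 1"
identity, positives = parse_hsp_counts(line)
print(percent(*identity), percent(*positives))
```

The alignment length in the denominator (357 here) counts gap columns as well, which is why it can exceed the number of aligned query residues.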

BLAST of Cp4.1LG14g06120 vs. Swiss-Prot
Match: PPR81_ARATH (Pentatricopeptide repeat-containing protein At1g55630 OS=Arabidopsis thaliana GN=At1g55630 PE=2 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 2.4e-135
Identity = 232/357 (64.99%), Positives = 276/357 (77.31%), Query Frame = 1

Query: 345 ENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARK 404
           EN+RHTA+ YH++MKIFAEC E+KAM RL+DEM + GYP TA TF +LICTCG+AGL   
Sbjct: 146 ENFRHTANCYHLLMKIFAECGEYKAMCRLIDEMIKDGYPTTACTFNLLICTCGEAGL--- 205

Query: 405 VVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYK 464
                                  AR VVE+FIKSKTFN+RP+KHSYNAILH LL +KQYK
Sbjct: 206 -----------------------ARDVVEQFIKSKTFNYRPYKHSYNAILHSLLGVKQYK 265

Query: 465 LIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 524
           LI+WVY+QML DG + D+LTYN+V++A  +LGK D+ +RLLDEM ++GFSPD +TYNILL
Sbjct: 266 LIDWVYEQMLEDGFTPDVLTYNIVMFANFRLGKTDRLYRLLDEMVKDGFSPDLYTYNILL 325

Query: 525 FVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCI 584
             L  G+KPLAALNLLNHMREVG +P ++HFTTLIDGLSRAG L+ACKYF DE    GC 
Sbjct: 326 HHLATGNKPLAALNLLNHMREVGVEPGVIHFTTLIDGLSRAGKLEACKYFMDETVKVGCT 385

Query: 585 PDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSM 644
           PDV CYTVMIT Y   GE EKA+E+F EM  KGQLPNV TYNSMIRGFCMAGKF EA ++
Sbjct: 386 PDVVCYTVMITGYISGGELEKAEEMFKEMTEKGQLPNVFTYNSMIRGFCMAGKFKEACAL 445

Query: 645 LAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRR 702
           L EME+RGC PNFVVYSTLV+ L+NAGK+ EAH+V+  MVEKG Y HL++K + YRR
Sbjct: 446 LKEMESRGCNPNFVVYSTLVNNLKNAGKVLEAHEVVKDMVEKGHYVHLISKLKKYRR 476

BLAST of Cp4.1LG14g06120 vs. Swiss-Prot
Match: PPR18_ARATH (Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidopsis thaliana GN=At1g06710 PE=3 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 2.7e-49
Identity = 161/626 (25.72%), Positives = 272/626 (43.45%), Query Frame = 1

Query: 98  LDELQLQVSGVLVREVLKGILRSINVLNKTQCAKLGYKFFVWSSKVENYRHTASSYHMVM 157
           L + + ++S  LV EVL+ I R   V++          FFVW+ +   Y+HTA  Y+ ++
Sbjct: 123 LRQFREKLSESLVIEVLRLIARPSAVIS----------FFVWAGRQIGYKHTAPVYNALV 182

Query: 158 KIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILI---CTCGDAGLARKVVERFIKSKTF 217
            +    ++ K     L ++ +    V      +L+   C  G   +A   +E   + K F
Sbjct: 183 DLIVRDDDEKVPEEFLQQIRDDDKEVFGEFLNVLVRKHCRNGSFSIA---LEELGRLKDF 242

Query: 218 NFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQF 277
            FRP + +YN ++   L   +      ++++M L     D  T     Y+ CK+GK   +
Sbjct: 243 RFRPSRSTYNCLIQAFLKADRLDSASLIHREMSLANLRMDGFTLRCFAYSLCKVGK---W 302

Query: 278 HRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDG 337
              L  +    F PD   Y  L+  L +      A++ LN MR     P+++ ++TL+ G
Sbjct: 303 REALTLVETENFVPDTVFYTKLISGLCEASLFEEAMDFLNRMRATSCLPNVVTYSTLLCG 362

Query: 338 ---------LSRAGNLXKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVT 397
                      R  N+  +E    +   ++ ++  +    +    ++LL +M + G+   
Sbjct: 363 CLNKKQLGRCKRVLNMMMMEGCYPSPKIFNSLVHAYCTSGDHSYAYKLLKKMVKCGHMPG 422

Query: 398 ARTFMILI-CTCGD--------AGLARKVVERFI-------KSKTFNFRPFXCGDAGLAR 457
              + ILI   CGD          LA K     +       K    +F    C  AG   
Sbjct: 423 YVVYNILIGSICGDKDSLNCDLLDLAEKAYSEMLAAGVVLNKINVSSFTRCLC-SAGKYE 482

Query: 458 KVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVL 517
           K      +     F P   +Y+ +L+ L    + +L   ++++M   G  +D+ TY +++
Sbjct: 483 KAFSVIREMIGQGFIPDTSTYSKVLNYLCNASKMELAFLLFEEMKRGGLVADVYTYTIMV 542

Query: 518 YARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFD 577
            + CK G ++Q  +  +EM   G +P+  TY  L+    K  K   A  L   M   G  
Sbjct: 543 DSFCKAGLIEQARKWFNEMREVGCTPNVVTYTALIHAYLKAKKVSYANELFETMLSEGCL 602

Query: 578 PSILHFTTLIDGLSRAGNLD-ACKYFFDELGNKGCIPDVACYTVMITTYTVAGEHEKAKE 637
           P+I+ ++ LIDG  +AG ++ AC+ F    G+K  +PDV  Y                  
Sbjct: 603 PNIVTYSALIDGHCKAGQVEKACQIFERMCGSKD-VPDVDMY------------------ 662

Query: 638 LFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSMLAEMETRGCRPNFVVYSTLVSYLR 695
            F +     + PNV+TY +++ GFC + + +EA  +L  M   GC PN +VY  L+  L 
Sbjct: 663 -FKQYDDNSERPNVVTYGALLDGFCKSHRVEEARKLLDAMSMEGCEPNQIVYDALIDGLC 711

BLAST of Cp4.1LG14g06120 vs. Swiss-Prot
Match: PP445_ARATH (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 1.6e-46
Identity = 148/561 (26.38%), Positives = 256/561 (45.63%), Query Frame = 1

Query: 131 KLGYKFFVWSSKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMI 190
           K    F  W S+   Y+H+  SY  ++ +         ++++             R  MI
Sbjct: 104 KTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKI-------------RLLMI 163

Query: 191 LIC-TCGDAGLARKVVERFIKSKTFN--FRPFKHSYNAILHGLLVIKQYKLIEWVYQQML 250
             C + GDA     +  +  K + F   ++     YN +L+ L        ++ VY +ML
Sbjct: 164 KSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEML 223

Query: 251 LDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPL 310
            D    +I TYN ++   CKLG +++ ++ + ++   G  PD+ TY  L+    +     
Sbjct: 224 EDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLD 283

Query: 311 AALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLXKVENYRHTASSYHMVMKIFAECEEF 370
           +A  + N M   G   + + +T LI GL  A  + +              M +F + ++ 
Sbjct: 284 SAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEA-------------MDLFVKMKD- 343

Query: 371 KAMWRLLDEMTEKGYPVTARTFMILICT-CGDAGLARKVVERFIKSKTFNFRPFXCGDAG 430
                      ++ +P T RT+ +LI + CG         ER  KS+  N          
Sbjct: 344 -----------DECFP-TVRTYTVLIKSLCGS--------ER--KSEALN---------- 403

Query: 431 LARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYN 490
           L +++ E  IK       P  H+Y  ++  L    +++    +  QML  G   +++TYN
Sbjct: 404 LVKEMEETGIK-------PNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYN 463

Query: 491 VVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREV 550
            ++   CK G ++    +++ M     SP+  TYN L+    K +    A+ +LN M E 
Sbjct: 464 ALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKSNVH-KAMGVLNKMLER 523

Query: 551 GFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCIPDVACYTVMITTYTVAGEHEKA 610
              P ++ + +LIDG  R+GN D+       + ++G +PD   YT MI +   +   E+A
Sbjct: 524 KVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEA 583

Query: 611 KELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSMLAEMETRGCRPNFVVYSTLVSY 670
            +LFD +  KG  PNV+ Y ++I G+C AGK DEA+ ML +M ++ C PN + ++ L+  
Sbjct: 584 CDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHG 597

Query: 671 LRNAGKLSEAHKVITRMVEKG 688
           L   GKL EA  +  +MV+ G
Sbjct: 644 LCADGKLKEATLLEEKMVKIG 597

BLAST of Cp4.1LG14g06120 vs. Swiss-Prot
Match: PP444_ARATH (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 2.1e-46
Identity = 137/564 (24.29%), Positives = 254/564 (45.04%), Query Frame = 1

Query: 135 KFFVWSSKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICT 194
           + F W+     YRH+   Y +++       EFK + RLL +M ++G       F+ ++  
Sbjct: 96  ELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQMKDEGIVFKESLFISIMRD 155

Query: 195 CGDAGLARKVVERFIKSKT-FNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSS 254
              AG   +     ++ +  ++  P   SYN +L  L+    +K+   V+  ML      
Sbjct: 156 YDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSRKIPP 215

Query: 255 DILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLL 314
            + T+ VV+ A C + ++D    LL +M ++G  P+   Y  L+  L K ++   AL LL
Sbjct: 216 TLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEALQLL 275

Query: 315 NHMREVGFDPSILHFTTLIDGLSRAGNLXKVENYRHTASSYHMVMKIFAECEEFKAMWRL 374
             M  +G  P    F  +I GL +   + +     +      M+++ FA           
Sbjct: 276 EEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVN-----RMLIRGFAP---------- 335

Query: 375 LDEMTEKGYPVTARTFMILICTCGDAGLARKVVERFIKSKTFNFRPFXCGDAGLARKVVE 434
            D++T  GY +        +C  G    A+ +  R  K +   F     G     R    
Sbjct: 336 -DDITY-GYLMNG------LCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDA 395

Query: 435 RFIKSK---TFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVLY 494
           + + S    ++   P   +YN++++G        L   V   M   G   ++ +Y +++ 
Sbjct: 396 KAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVD 455

Query: 495 ARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFDP 554
             CKLGK+D+ + +L+EM+ +G  P+   +N L+    K  +   A+ +   M   G  P
Sbjct: 456 GFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKP 515

Query: 555 SILHFTTLIDGLSRAGNLDACKYFFDELGNKGCIPDVACYTVMITTYTVAGEHEKAKELF 614
            +  F +LI GL     +    +   ++ ++G + +   Y  +I  +   GE ++A++L 
Sbjct: 516 DVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLV 575

Query: 615 DEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSMLAEMETRGCRPNFVVYSTLVSYLRNA 674
           +EMV +G   + +TYNS+I+G C AG+ D+A S+  +M   G  P+ +  + L++ L  +
Sbjct: 576 NEMVFQGSPLDEITYNSLIKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRS 635

Query: 675 GKLSEAHKVITRMVEKGQYAHLVT 695
           G + EA +    MV +G    +VT
Sbjct: 636 GMVEEAVEFQKEMVLRGSTPDIVT 636

BLAST of Cp4.1LG14g06120 vs. TrEMBL
Match: A0A0A0L8P4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G182100 PE=4 SV=1)

HSP 1 Score: 605.1 bits (1559), Expect = 1.1e-169
Identity = 283/360 (78.61%), Positives = 315/360 (87.50%), Query Frame = 1

Query: 343 KVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLA 402
           ++ENYRHT +SYH++MKIFAECEEFKAMWR+LDEMTEKGYPVTARTFMILICT       
Sbjct: 143 RIENYRHTVNSYHIIMKIFAECEEFKAMWRVLDEMTEKGYPVTARTFMILICT------- 202

Query: 403 RKVVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQ 462
                              CG+AGLA++VVERFIKSKTFNFRP+KHSYNAILHGL+++KQ
Sbjct: 203 -------------------CGEAGLAKRVVERFIKSKTFNFRPYKHSYNAILHGLVIVKQ 262

Query: 463 YKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNI 522
           YKLI WVY QMLLD HS DILTYNV+L++ CKLGKLDQFHRLLDEMAR GFSPD+HTYNI
Sbjct: 263 YKLIGWVYDQMLLDDHSPDILTYNVLLFSSCKLGKLDQFHRLLDEMARKGFSPDFHTYNI 322

Query: 523 LLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKG 582
           LL+VLGKGDKPLAALNLLNHMREVGF P++LHFTTLI+GLSRAGNLDACKYFFDELGN G
Sbjct: 323 LLYVLGKGDKPLAALNLLNHMREVGFGPNVLHFTTLINGLSRAGNLDACKYFFDELGNNG 382

Query: 583 CIPDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAY 642
           CIPDV CYTVMIT++T AG+HEKA+  FDEM+MKGQLPNV TYNSMIRGFCM GKF EAY
Sbjct: 383 CIPDVVCYTVMITSFTEAGQHEKARAFFDEMIMKGQLPNVFTYNSMIRGFCMVGKFKEAY 442

Query: 643 SMLAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 702
           SML+EME+RGCRPNF+VYSTLVSYLRNAGKL EAHKVI +MVE GQYAHL+TKF+GYRRC
Sbjct: 443 SMLSEMESRGCRPNFLVYSTLVSYLRNAGKLGEAHKVIKQMVENGQYAHLMTKFKGYRRC 476

BLAST of Cp4.1LG14g06120 vs. TrEMBL
Match: M5XQE6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025794mg PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 1.1e-155
Identity = 263/360 (73.06%), Positives = 303/360 (84.17%), Query Frame = 1

Query: 343 KVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLA 402
           ++ENYRHTA++YH++MKIFA+CEEFKAMWRL+DEM EKGYP TA+TF ILICT       
Sbjct: 113 QLENYRHTANTYHLMMKIFADCEEFKAMWRLVDEMIEKGYPTTAQTFNILICT------- 172

Query: 403 RKVVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQ 462
                              CG+AGLA+KVVERFIKSKTFN+RPFKHSYNAILH L+V+KQ
Sbjct: 173 -------------------CGEAGLAKKVVERFIKSKTFNYRPFKHSYNAILHSLVVVKQ 232

Query: 463 YKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNI 522
           YKLIEWVYQQML DGH +DILTYNV++YA+ +LGKLDQFHRLL+EM R+GF+PD HTYNI
Sbjct: 233 YKLIEWVYQQMLADGHCTDILTYNVMMYAKYRLGKLDQFHRLLEEMGRSGFAPDLHTYNI 292

Query: 523 LLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKG 582
           LL VLGKGDKPLAALNLLNHM+EVG DPS+LHFTTLIDGLSR+GNLDACKYFFDE+    
Sbjct: 293 LLHVLGKGDKPLAALNLLNHMKEVGLDPSVLHFTTLIDGLSRSGNLDACKYFFDEMIKHE 352

Query: 583 CIPDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAY 642
           C PDV CYTVMI+ Y VAGE EKA+ +FDEM+  GQLPNV TYN+MIRG CMAGKF+EA 
Sbjct: 353 CFPDVVCYTVMISGYIVAGELEKAQGVFDEMIPNGQLPNVFTYNAMIRGLCMAGKFEEAC 412

Query: 643 SMLAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 702
           SML +ME+RGC PNF VYSTLVSYLRNAGKL++AH+VIT MVEKGQY HL++KF+GYRRC
Sbjct: 413 SMLKDMESRGCNPNFTVYSTLVSYLRNAGKLAKAHEVITHMVEKGQYTHLLSKFKGYRRC 446

BLAST of Cp4.1LG14g06120 vs. TrEMBL
Match: V4S4A4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004849mg PE=4 SV=1)

HSP 1 Score: 553.5 bits (1425), Expect = 3.6e-154
Identity = 262/358 (73.18%), Positives = 301/358 (84.08%), Query Frame = 1

Query: 345 ENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARK 404
           EN+RHTA+SYH++MKIFA+CEEFKAMWRL+DEM E G+P TARTF ILICT         
Sbjct: 156 ENFRHTANSYHLIMKIFADCEEFKAMWRLVDEMIENGFPTTARTFNILICT--------- 215

Query: 405 VVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYK 464
                            CG+ GLARKVVERFIKSK FNFRPFK+SYNAILH LL I+QYK
Sbjct: 216 -----------------CGEVGLARKVVERFIKSKLFNFRPFKNSYNAILHALLGIRQYK 275

Query: 465 LIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 524
           LIEWVYQQM  +G++ DILTYN+V+ A+ +LGKLDQFHRLLDEM R+GFSPD+HTYNILL
Sbjct: 276 LIEWVYQQMSDEGYAPDILTYNIVMCAKYRLGKLDQFHRLLDEMGRSGFSPDFHTYNILL 335

Query: 525 FVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCI 584
            VLGKGDKPLAALNLLNHM+EVGFDPS+LHFTTL+DGLSRAGNLDACKYFFDE+ NKGC+
Sbjct: 336 HVLGKGDKPLAALNLLNHMKEVGFDPSVLHFTTLMDGLSRAGNLDACKYFFDEMANKGCM 395

Query: 585 PDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSM 644
           PDV CYTVMIT+Y  AGE EKA++LFD M+ KGQLPNV TYNSMIRGFCMAGKFDEA +M
Sbjct: 396 PDVVCYTVMITSYIAAGELEKAQDLFDGMITKGQLPNVFTYNSMIRGFCMAGKFDEACTM 455

Query: 645 LAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 703
           + EME+RGC PNF+VY+TLVS LRNAGKL+EAH+VI  MVEKG+Y HLV+KF+ Y+RC
Sbjct: 456 MKEMESRGCNPNFLVYNTLVSNLRNAGKLAEAHEVIRHMVEKGKYIHLVSKFKRYKRC 487

BLAST of Cp4.1LG14g06120 vs. TrEMBL
Match: W9S7I4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015785 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 5.3e-153
Identity = 262/357 (73.39%), Positives = 293/357 (82.07%), Query Frame = 1

Query: 346 NYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARKV 405
           NYRHTA+SYH+++KIFAECEEFKAMWRL+DEM E+G+P TART  ILICT          
Sbjct: 168 NYRHTANSYHLLIKIFAECEEFKAMWRLVDEMIERGFPTTARTLNILICT---------- 227

Query: 406 VERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYKL 465
                           CG+AGLARKVVERFIKSKTFNFRPFKHSYNAILH LLV  QYKL
Sbjct: 228 ----------------CGEAGLARKVVERFIKSKTFNFRPFKHSYNAILHCLLVTNQYKL 287

Query: 466 IEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLF 525
           IEWVYQQML DG S DILTYNV++  + +LGKLDQFHRLLDEM R GFSPD HTYNILL 
Sbjct: 288 IEWVYQQMLADGFSPDILTYNVLMLTKYRLGKLDQFHRLLDEMGRRGFSPDLHTYNILLH 347

Query: 526 VLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCIP 585
           VLGKGDKPLAALNLLNHM+E G DP +LHFTTLIDGLSRAGNLDAC++FFDE+   GCIP
Sbjct: 348 VLGKGDKPLAALNLLNHMKETGIDPGVLHFTTLIDGLSRAGNLDACRFFFDEMPKNGCIP 407

Query: 586 DVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSML 645
           DV CYTVMIT Y VAGE EKA+ +FDEM+ KGQ+PNV TYNSMIRG CMAGKF+EA SML
Sbjct: 408 DVVCYTVMITGYVVAGELEKAQSIFDEMITKGQIPNVFTYNSMIRGLCMAGKFEEACSML 467

Query: 646 AEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 703
            +ME+RGC PNF+VY+TLVS LRNAGKLSEAH+V+  MVEKGQY HL++KF+GYRRC
Sbjct: 468 KDMESRGCNPNFLVYTTLVSNLRNAGKLSEAHQVVKHMVEKGQYVHLLSKFKGYRRC 498

BLAST of Cp4.1LG14g06120 vs. TrEMBL
Match: M1C1Y3_SOLTU (Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400022513 PE=4 SV=1)

HSP 1 Score: 542.7 bits (1397), Expect = 6.4e-151
Identity = 256/358 (71.51%), Positives = 298/358 (83.24%), Query Frame = 1

Query: 345 ENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARK 404
           ENYRHTA+SYH++MKIFAE +EFKAMWRL+DEM EKGYP TARTF +LICT         
Sbjct: 160 ENYRHTANSYHLIMKIFAESDEFKAMWRLVDEMIEKGYPTTARTFNLLICT--------- 219

Query: 405 VVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYK 464
                            CG+AGLARKVVERFIKSKTFN+RPF+HS+NAILH LL + QY+
Sbjct: 220 -----------------CGEAGLARKVVERFIKSKTFNYRPFRHSFNAILHSLLGVNQYR 279

Query: 465 LIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 524
           LIEWVYQQML++GH  DILTYN++L ++ +LGKLDQFHRLLDEM RNGFSPD++T+NILL
Sbjct: 280 LIEWVYQQMLVEGHIPDILTYNILLCSKYRLGKLDQFHRLLDEMGRNGFSPDFYTFNILL 339

Query: 525 FVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCI 584
            VLGKGDKPLAA+NLLNHM+EVG++PSILHFTTLIDGLSRAGNLDACKYFF+E+  +GC+
Sbjct: 340 HVLGKGDKPLAAVNLLNHMKEVGYEPSILHFTTLIDGLSRAGNLDACKYFFEEMIRQGCV 399

Query: 585 PDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSM 644
           PDV CYTVMIT Y VAGE +KA+ELF +M++ GQLPNV TYNSMIRG CMA KFDEA  M
Sbjct: 400 PDVVCYTVMITGYVVAGELDKAQELFTDMIVNGQLPNVFTYNSMIRGLCMAEKFDEACMM 459

Query: 645 LAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 703
           + EME RGC PNF+VYSTLVSYL+NAGKLS+AH+VI  MV+KGQY HLV KF+ YRRC
Sbjct: 460 VKEMELRGCNPNFMVYSTLVSYLKNAGKLSKAHEVIRHMVDKGQYIHLVPKFKRYRRC 491

BLAST of Cp4.1LG14g06120 vs. TAIR10
Match: AT3G60050.1 (AT3G60050.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 491.9 bits (1265), Expect = 6.6e-139
Identity = 236/357 (66.11%), Positives = 279/357 (78.15%), Query Frame = 1

Query: 345 ENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARK 404
           E +RHT +SYH++MKIFAEC E+KAMWRL+DEM + G+P TARTF +LIC+CG+AGLA++
Sbjct: 143 ECFRHTVNSYHLLMKIFAECGEYKAMWRLVDEMVQDGFPTTARTFNLLICSCGEAGLAKQ 202

Query: 405 VVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYK 464
            V +F+KSKTFN+                          RPFKHSYNAIL+ LL +KQYK
Sbjct: 203 AVVQFMKSKTFNY--------------------------RPFKHSYNAILNSLLGVKQYK 262

Query: 465 LIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 524
           LIEWVY+QML DG S D+LTYN++L+   +LGK+D+F RL DEMAR+GFSPD +TYNILL
Sbjct: 263 LIEWVYKQMLEDGFSPDVLTYNILLWTNYRLGKMDRFDRLFDEMARDGFSPDSYTYNILL 322

Query: 525 FVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCI 584
            +LGKG+KPLAAL  LNHM+EVG DPS+LH+TTLIDGLSRAGNL+ACKYF DE+   GC 
Sbjct: 323 HILGKGNKPLAALTTLNHMKEVGIDPSVLHYTTLIDGLSRAGNLEACKYFLDEMVKAGCR 382

Query: 585 PDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSM 644
           PDV CYTVMIT Y V+GE +KAKE+F EM +KGQLPNV TYNSMIRG CMAG+F EA  +
Sbjct: 383 PDVVCYTVMITGYVVSGELDKAKEMFREMTVKGQLPNVFTYNSMIRGLCMAGEFREACWL 442

Query: 645 LAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRR 702
           L EME+RGC PNFVVYSTLVSYLR AGKLSEA KVI  MV+KG Y HLV K   YRR
Sbjct: 443 LKEMESRGCNPNFVVYSTLVSYLRKAGKLSEARKVIREMVKKGHYVHLVPKMMKYRR 473

BLAST of Cp4.1LG14g06120 vs. TAIR10
Match: AT1G55630.1 (AT1G55630.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 484.2 bits (1245), Expect = 1.4e-136
Identity = 232/357 (64.99%), Positives = 276/357 (77.31%), Query Frame = 1

Query: 345 ENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARK 404
           EN+RHTA+ YH++MKIFAEC E+KAM RL+DEM + GYP TA TF +LICTCG+AGL   
Sbjct: 146 ENFRHTANCYHLLMKIFAECGEYKAMCRLIDEMIKDGYPTTACTFNLLICTCGEAGL--- 205

Query: 405 VVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYK 464
                                  AR VVE+FIKSKTFN+RP+KHSYNAILH LL +KQYK
Sbjct: 206 -----------------------ARDVVEQFIKSKTFNYRPYKHSYNAILHSLLGVKQYK 265

Query: 465 LIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 524
           LI+WVY+QML DG + D+LTYN+V++A  +LGK D+ +RLLDEM ++GFSPD +TYNILL
Sbjct: 266 LIDWVYEQMLEDGFTPDVLTYNIVMFANFRLGKTDRLYRLLDEMVKDGFSPDLYTYNILL 325

Query: 525 FVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCI 584
             L  G+KPLAALNLLNHMREVG +P ++HFTTLIDGLSRAG L+ACKYF DE    GC 
Sbjct: 326 HHLATGNKPLAALNLLNHMREVGVEPGVIHFTTLIDGLSRAGKLEACKYFMDETVKVGCT 385

Query: 585 PDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSM 644
           PDV CYTVMIT Y   GE EKA+E+F EM  KGQLPNV TYNSMIRGFCMAGKF EA ++
Sbjct: 386 PDVVCYTVMITGYISGGELEKAEEMFKEMTEKGQLPNVFTYNSMIRGFCMAGKFKEACAL 445

Query: 645 LAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRR 702
           L EME+RGC PNFVVYSTLV+ L+NAGK+ EAH+V+  MVEKG Y HL++K + YRR
Sbjct: 446 LKEMESRGCNPNFVVYSTLVNNLKNAGKVLEAHEVVKDMVEKGHYVHLISKLKKYRR 476

BLAST of Cp4.1LG14g06120 vs. TAIR10
Match: AT1G06710.1 (AT1G06710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 198.4 bits (503), Expect = 1.5e-50
Identity = 161/626 (25.72%), Positives = 272/626 (43.45%), Query Frame = 1

Query: 98  LDELQLQVSGVLVREVLKGILRSINVLNKTQCAKLGYKFFVWSSKVENYRHTASSYHMVM 157
           L + + ++S  LV EVL+ I R   V++          FFVW+ +   Y+HTA  Y+ ++
Sbjct: 123 LRQFREKLSESLVIEVLRLIARPSAVIS----------FFVWAGRQIGYKHTAPVYNALV 182

Query: 158 KIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILI---CTCGDAGLARKVVERFIKSKTF 217
            +    ++ K     L ++ +    V      +L+   C  G   +A   +E   + K F
Sbjct: 183 DLIVRDDDEKVPEEFLQQIRDDDKEVFGEFLNVLVRKHCRNGSFSIA---LEELGRLKDF 242

Query: 218 NFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQF 277
            FRP + +YN ++   L   +      ++++M L     D  T     Y+ CK+GK   +
Sbjct: 243 RFRPSRSTYNCLIQAFLKADRLDSASLIHREMSLANLRMDGFTLRCFAYSLCKVGK---W 302

Query: 278 HRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDG 337
              L  +    F PD   Y  L+  L +      A++ LN MR     P+++ ++TL+ G
Sbjct: 303 REALTLVETENFVPDTVFYTKLISGLCEASLFEEAMDFLNRMRATSCLPNVVTYSTLLCG 362

Query: 338 ---------LSRAGNLXKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVT 397
                      R  N+  +E    +   ++ ++  +    +    ++LL +M + G+   
Sbjct: 363 CLNKKQLGRCKRVLNMMMMEGCYPSPKIFNSLVHAYCTSGDHSYAYKLLKKMVKCGHMPG 422

Query: 398 ARTFMILI-CTCGD--------AGLARKVVERFI-------KSKTFNFRPFXCGDAGLAR 457
              + ILI   CGD          LA K     +       K    +F    C  AG   
Sbjct: 423 YVVYNILIGSICGDKDSLNCDLLDLAEKAYSEMLAAGVVLNKINVSSFTRCLC-SAGKYE 482

Query: 458 KVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVL 517
           K      +     F P   +Y+ +L+ L    + +L   ++++M   G  +D+ TY +++
Sbjct: 483 KAFSVIREMIGQGFIPDTSTYSKVLNYLCNASKMELAFLLFEEMKRGGLVADVYTYTIMV 542

Query: 518 YARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFD 577
            + CK G ++Q  +  +EM   G +P+  TY  L+    K  K   A  L   M   G  
Sbjct: 543 DSFCKAGLIEQARKWFNEMREVGCTPNVVTYTALIHAYLKAKKVSYANELFETMLSEGCL 602

Query: 578 PSILHFTTLIDGLSRAGNLD-ACKYFFDELGNKGCIPDVACYTVMITTYTVAGEHEKAKE 637
           P+I+ ++ LIDG  +AG ++ AC+ F    G+K  +PDV  Y                  
Sbjct: 603 PNIVTYSALIDGHCKAGQVEKACQIFERMCGSKD-VPDVDMY------------------ 662

Query: 638 LFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSMLAEMETRGCRPNFVVYSTLVSYLR 695
            F +     + PNV+TY +++ GFC + + +EA  +L  M   GC PN +VY  L+  L 
Sbjct: 663 -FKQYDDNSERPNVVTYGALLDGFCKSHRVEEARKLLDAMSMEGCEPNQIVYDALIDGLC 711

BLAST of Cp4.1LG14g06120 vs. TAIR10
Match: AT5G65560.1 (AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 189.1 bits (479), Expect = 9.2e-48
Identity = 148/561 (26.38%), Positives = 256/561 (45.63%), Query Frame = 1

Query: 131 KLGYKFFVWSSKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMI 190
           K    F  W S+   Y+H+  SY  ++ +         ++++             R  MI
Sbjct: 104 KTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFKI-------------RLLMI 163

Query: 191 LIC-TCGDAGLARKVVERFIKSKTFN--FRPFKHSYNAILHGLLVIKQYKLIEWVYQQML 250
             C + GDA     +  +  K + F   ++     YN +L+ L        ++ VY +ML
Sbjct: 164 KSCDSVGDALYVLDLCRKMNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMKQVYMEML 223

Query: 251 LDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPL 310
            D    +I TYN ++   CKLG +++ ++ + ++   G  PD+ TY  L+    +     
Sbjct: 224 EDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLD 283

Query: 311 AALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLXKVENYRHTASSYHMVMKIFAECEEF 370
           +A  + N M   G   + + +T LI GL  A  + +              M +F + ++ 
Sbjct: 284 SAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEA-------------MDLFVKMKD- 343

Query: 371 KAMWRLLDEMTEKGYPVTARTFMILICT-CGDAGLARKVVERFIKSKTFNFRPFXCGDAG 430
                      ++ +P T RT+ +LI + CG         ER  KS+  N          
Sbjct: 344 -----------DECFP-TVRTYTVLIKSLCGS--------ER--KSEALN---------- 403

Query: 431 LARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYN 490
           L +++ E  IK       P  H+Y  ++  L    +++    +  QML  G   +++TYN
Sbjct: 404 LVKEMEETGIK-------PNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYN 463

Query: 491 VVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREV 550
            ++   CK G ++    +++ M     SP+  TYN L+    K +    A+ +LN M E 
Sbjct: 464 ALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKSNVH-KAMGVLNKMLER 523

Query: 551 GFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCIPDVACYTVMITTYTVAGEHEKA 610
              P ++ + +LIDG  R+GN D+       + ++G +PD   YT MI +   +   E+A
Sbjct: 524 KVLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEA 583

Query: 611 KELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSMLAEMETRGCRPNFVVYSTLVSY 670
            +LFD +  KG  PNV+ Y ++I G+C AGK DEA+ ML +M ++ C PN + ++ L+  
Sbjct: 584 CDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHG 597

Query: 671 LRNAGKLSEAHKVITRMVEKG 688
           L   GKL EA  +  +MV+ G
Sbjct: 644 LCADGKLKEATLLEEKMVKIG 597

BLAST of Cp4.1LG14g06120 vs. TAIR10
Match: AT5G64320.1 (AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 188.7 bits (478), Expect = 1.2e-47
Identity = 137/564 (24.29%), Positives = 254/564 (45.04%), Query Frame = 1

Query: 135 KFFVWSSKVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICT 194
           + F W+     YRH+   Y +++       EFK + RLL +M ++G       F+ ++  
Sbjct: 96  ELFSWTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQMKDEGIVFKESLFISIMRD 155

Query: 195 CGDAGLARKVVERFIKSKT-FNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSS 254
              AG   +     ++ +  ++  P   SYN +L  L+    +K+   V+  ML      
Sbjct: 156 YDKAGFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSRKIPP 215

Query: 255 DILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLL 314
            + T+ VV+ A C + ++D    LL +M ++G  P+   Y  L+  L K ++   AL LL
Sbjct: 216 TLFTFGVVMKAFCAVNEIDSALSLLRDMTKHGCVPNSVIYQTLIHSLSKCNRVNEALQLL 275

Query: 315 NHMREVGFDPSILHFTTLIDGLSRAGNLXKVENYRHTASSYHMVMKIFAECEEFKAMWRL 374
             M  +G  P    F  +I GL +   + +     +      M+++ FA           
Sbjct: 276 EEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVN-----RMLIRGFAP---------- 335

Query: 375 LDEMTEKGYPVTARTFMILICTCGDAGLARKVVERFIKSKTFNFRPFXCGDAGLARKVVE 434
            D++T  GY +        +C  G    A+ +  R  K +   F     G     R    
Sbjct: 336 -DDITY-GYLMNG------LCKIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDA 395

Query: 435 RFIKSK---TFNFRPFKHSYNAILHGLLVIKQYKLIEWVYQQMLLDGHSSDILTYNVVLY 494
           + + S    ++   P   +YN++++G        L   V   M   G   ++ +Y +++ 
Sbjct: 396 KAVLSDMVTSYGIVPDVCTYNSLIYGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVD 455

Query: 495 ARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILLFVLGKGDKPLAALNLLNHMREVGFDP 554
             CKLGK+D+ + +L+EM+ +G  P+   +N L+    K  +   A+ +   M   G  P
Sbjct: 456 GFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKP 515

Query: 555 SILHFTTLIDGLSRAGNLDACKYFFDELGNKGCIPDVACYTVMITTYTVAGEHEKAKELF 614
            +  F +LI GL     +    +   ++ ++G + +   Y  +I  +   GE ++A++L 
Sbjct: 516 DVYTFNSLISGLCEVDEIKHALWLLRDMISEGVVANTVTYNTLINAFLRRGEIKEARKLV 575

Query: 615 DEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSMLAEMETRGCRPNFVVYSTLVSYLRNA 674
           +EMV +G   + +TYNS+I+G C AG+ D+A S+  +M   G  P+ +  + L++ L  +
Sbjct: 576 NEMVFQGSPLDEITYNSLIKGLCRAGEVDKARSLFEKMLRDGHAPSNISCNILINGLCRS 635

Query: 675 GKLSEAHKVITRMVEKGQYAHLVT 695
           G + EA +    MV +G    +VT
Sbjct: 636 GMVEEAVEFQKEMVLRGSTPDIVT 636

BLAST of Cp4.1LG14g06120 vs. NCBI nr
Match: gi|659077478|ref|XP_008439226.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis melo])

HSP 1 Score: 620.5 bits (1599), Expect = 3.5e-174
Identity = 293/360 (81.39%), Positives = 318/360 (88.33%), Query Frame = 1

Query: 343 KVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLA 402
           ++ENYRHT  SYHM+MKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICT       
Sbjct: 143 RIENYRHTVKSYHMIMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICT------- 202

Query: 403 RKVVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQ 462
                              CGDAGLA+KVVERFIKSKTFNFRP+KHSYNAILHGL+V+KQ
Sbjct: 203 -------------------CGDAGLAKKVVERFIKSKTFNFRPYKHSYNAILHGLVVVKQ 262

Query: 463 YKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNI 522
           YKLIEWVY+QMLLDGH  DILTYNV+L++RCKLGKLDQFHRLLDEMAR GFSPD+HTYNI
Sbjct: 263 YKLIEWVYEQMLLDGHGPDILTYNVLLFSRCKLGKLDQFHRLLDEMARKGFSPDFHTYNI 322

Query: 523 LLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKG 582
           LL+VLGKGDKPLAALNLLNHMREVGF P+ILHFTTLI+GLSRAGNLDACKYFFDELGN  
Sbjct: 323 LLYVLGKGDKPLAALNLLNHMREVGFGPNILHFTTLINGLSRAGNLDACKYFFDELGNHD 382

Query: 583 CIPDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAY 642
           CIPDV CYTVMIT+YT AG+HEKA+  FDEM+MKGQLPNV TYNSMIRGFCM GKF EAY
Sbjct: 383 CIPDVVCYTVMITSYTEAGQHEKARAFFDEMIMKGQLPNVFTYNSMIRGFCMVGKFKEAY 442

Query: 643 SMLAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 702
           SML+EME+RGCRPNF+VYSTLVSYLRNAGKL+EAHKVITRMVE GQYAHL+TKF+GYRRC
Sbjct: 443 SMLSEMESRGCRPNFLVYSTLVSYLRNAGKLAEAHKVITRMVENGQYAHLMTKFKGYRRC 476

BLAST of Cp4.1LG14g06120 vs. NCBI nr
Match: gi|449446161|ref|XP_004140840.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis sativus])

HSP 1 Score: 605.1 bits (1559), Expect = 1.5e-169
Identity = 283/360 (78.61%), Positives = 315/360 (87.50%), Query Frame = 1

Query: 343 KVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLA 402
           ++ENYRHT +SYH++MKIFAECEEFKAMWR+LDEMTEKGYPVTARTFMILICT       
Sbjct: 143 RIENYRHTVNSYHIIMKIFAECEEFKAMWRVLDEMTEKGYPVTARTFMILICT------- 202

Query: 403 RKVVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQ 462
                              CG+AGLA++VVERFIKSKTFNFRP+KHSYNAILHGL+++KQ
Sbjct: 203 -------------------CGEAGLAKRVVERFIKSKTFNFRPYKHSYNAILHGLVIVKQ 262

Query: 463 YKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNI 522
           YKLI WVY QMLLD HS DILTYNV+L++ CKLGKLDQFHRLLDEMAR GFSPD+HTYNI
Sbjct: 263 YKLIGWVYDQMLLDDHSPDILTYNVLLFSSCKLGKLDQFHRLLDEMARKGFSPDFHTYNI 322

Query: 523 LLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKG 582
           LL+VLGKGDKPLAALNLLNHMREVGF P++LHFTTLI+GLSRAGNLDACKYFFDELGN G
Sbjct: 323 LLYVLGKGDKPLAALNLLNHMREVGFGPNVLHFTTLINGLSRAGNLDACKYFFDELGNNG 382

Query: 583 CIPDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAY 642
           CIPDV CYTVMIT++T AG+HEKA+  FDEM+MKGQLPNV TYNSMIRGFCM GKF EAY
Sbjct: 383 CIPDVVCYTVMITSFTEAGQHEKARAFFDEMIMKGQLPNVFTYNSMIRGFCMVGKFKEAY 442

Query: 643 SMLAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 702
           SML+EME+RGCRPNF+VYSTLVSYLRNAGKL EAHKVI +MVE GQYAHL+TKF+GYRRC
Sbjct: 443 SMLSEMESRGCRPNFLVYSTLVSYLRNAGKLGEAHKVIKQMVENGQYAHLMTKFKGYRRC 476

BLAST of Cp4.1LG14g06120 vs. NCBI nr
Match: gi|596041207|ref|XP_007220061.1| (hypothetical protein PRUPE_ppa025794mg [Prunus persica])

HSP 1 Score: 558.5 bits (1438), Expect = 1.6e-155
Identity = 263/360 (73.06%), Positives = 303/360 (84.17%), Query Frame = 1

Query: 343 KVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLA 402
           ++ENYRHTA++YH++MKIFA+CEEFKAMWRL+DEM EKGYP TA+TF ILICT       
Sbjct: 113 QLENYRHTANTYHLMMKIFADCEEFKAMWRLVDEMIEKGYPTTAQTFNILICT------- 172

Query: 403 RKVVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQ 462
                              CG+AGLA+KVVERFIKSKTFN+RPFKHSYNAILH L+V+KQ
Sbjct: 173 -------------------CGEAGLAKKVVERFIKSKTFNYRPFKHSYNAILHSLVVVKQ 232

Query: 463 YKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNI 522
           YKLIEWVYQQML DGH +DILTYNV++YA+ +LGKLDQFHRLL+EM R+GF+PD HTYNI
Sbjct: 233 YKLIEWVYQQMLADGHCTDILTYNVMMYAKYRLGKLDQFHRLLEEMGRSGFAPDLHTYNI 292

Query: 523 LLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKG 582
           LL VLGKGDKPLAALNLLNHM+EVG DPS+LHFTTLIDGLSR+GNLDACKYFFDE+    
Sbjct: 293 LLHVLGKGDKPLAALNLLNHMKEVGLDPSVLHFTTLIDGLSRSGNLDACKYFFDEMIKHE 352

Query: 583 CIPDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAY 642
           C PDV CYTVMI+ Y VAGE EKA+ +FDEM+  GQLPNV TYN+MIRG CMAGKF+EA 
Sbjct: 353 CFPDVVCYTVMISGYIVAGELEKAQGVFDEMIPNGQLPNVFTYNAMIRGLCMAGKFEEAC 412

Query: 643 SMLAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 702
           SML +ME+RGC PNF VYSTLVSYLRNAGKL++AH+VIT MVEKGQY HL++KF+GYRRC
Sbjct: 413 SMLKDMESRGCNPNFTVYSTLVSYLRNAGKLAKAHEVITHMVEKGQYTHLLSKFKGYRRC 446

BLAST of Cp4.1LG14g06120 vs. NCBI nr
Match: gi|645257085|ref|XP_008234250.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g60050-like [Prunus mume])

HSP 1 Score: 557.0 bits (1434), Expect = 4.7e-155
Identity = 263/360 (73.06%), Positives = 303/360 (84.17%), Query Frame = 1

Query: 343 KVENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLA 402
           ++ENYRHTA++YH++MKIFA+CEEFKAMWRL+DEM EKGYP TA+TF ILI T       
Sbjct: 154 QLENYRHTANTYHLMMKIFADCEEFKAMWRLVDEMIEKGYPTTAQTFSILIRT------- 213

Query: 403 RKVVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQ 462
                              CG+AGLA+KVVERFIKSKTFN+RPFKHSYNAILH L+V+KQ
Sbjct: 214 -------------------CGEAGLAKKVVERFIKSKTFNYRPFKHSYNAILHSLVVVKQ 273

Query: 463 YKLIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNI 522
           YKLIEWVYQQML DGH +DILTYNV++YA+ +LGKLDQFHRLL+EM R+GF+PD+HTYNI
Sbjct: 274 YKLIEWVYQQMLADGHCTDILTYNVMMYAKYRLGKLDQFHRLLEEMGRSGFAPDFHTYNI 333

Query: 523 LLFVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKG 582
           LL VLGKGDKPLAALNLLNHM+EVGFDPS+LHFTTLIDGLSRAGNLDACKYFFDE+    
Sbjct: 334 LLHVLGKGDKPLAALNLLNHMKEVGFDPSVLHFTTLIDGLSRAGNLDACKYFFDEMIKHE 393

Query: 583 CIPDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAY 642
           C PDV CYTVMI+ Y VAGE EKA+ +FDEM+  GQLPNV TYN+MIRG CMAGKF+EA 
Sbjct: 394 CFPDVVCYTVMISGYIVAGELEKAQGVFDEMIPNGQLPNVFTYNAMIRGLCMAGKFEEAC 453

Query: 643 SMLAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 702
           SML +ME+RGC PNF VYSTLVSYLRNAGK ++AH+VIT MVEKGQY HL++KF+GYRRC
Sbjct: 454 SMLKDMESRGCNPNFTVYSTLVSYLRNAGKRAKAHEVITHMVEKGQYTHLLSKFKGYRRC 487

BLAST of Cp4.1LG14g06120 vs. NCBI nr
Match: gi|567858468|ref|XP_006421917.1| (hypothetical protein CICLE_v10004849mg [Citrus clementina])

HSP 1 Score: 553.5 bits (1425), Expect = 5.2e-154
Identity = 262/358 (73.18%), Positives = 301/358 (84.08%), Query Frame = 1

Query: 345 ENYRHTASSYHMVMKIFAECEEFKAMWRLLDEMTEKGYPVTARTFMILICTCGDAGLARK 404
           EN+RHTA+SYH++MKIFA+CEEFKAMWRL+DEM E G+P TARTF ILICT         
Sbjct: 156 ENFRHTANSYHLIMKIFADCEEFKAMWRLVDEMIENGFPTTARTFNILICT--------- 215

Query: 405 VVERFIKSKTFNFRPFXCGDAGLARKVVERFIKSKTFNFRPFKHSYNAILHGLLVIKQYK 464
                            CG+ GLARKVVERFIKSK FNFRPFK+SYNAILH LL I+QYK
Sbjct: 216 -----------------CGEVGLARKVVERFIKSKLFNFRPFKNSYNAILHALLGIRQYK 275

Query: 465 LIEWVYQQMLLDGHSSDILTYNVVLYARCKLGKLDQFHRLLDEMARNGFSPDYHTYNILL 524
           LIEWVYQQM  +G++ DILTYN+V+ A+ +LGKLDQFHRLLDEM R+GFSPD+HTYNILL
Sbjct: 276 LIEWVYQQMSDEGYAPDILTYNIVMCAKYRLGKLDQFHRLLDEMGRSGFSPDFHTYNILL 335

Query: 525 FVLGKGDKPLAALNLLNHMREVGFDPSILHFTTLIDGLSRAGNLDACKYFFDELGNKGCI 584
            VLGKGDKPLAALNLLNHM+EVGFDPS+LHFTTL+DGLSRAGNLDACKYFFDE+ NKGC+
Sbjct: 336 HVLGKGDKPLAALNLLNHMKEVGFDPSVLHFTTLMDGLSRAGNLDACKYFFDEMANKGCM 395

Query: 585 PDVACYTVMITTYTVAGEHEKAKELFDEMVMKGQLPNVLTYNSMIRGFCMAGKFDEAYSM 644
           PDV CYTVMIT+Y  AGE EKA++LFD M+ KGQLPNV TYNSMIRGFCMAGKFDEA +M
Sbjct: 396 PDVVCYTVMITSYIAAGELEKAQDLFDGMITKGQLPNVFTYNSMIRGFCMAGKFDEACTM 455

Query: 645 LAEMETRGCRPNFVVYSTLVSYLRNAGKLSEAHKVITRMVEKGQYAHLVTKFRGYRRC 703
           + EME+RGC PNF+VY+TLVS LRNAGKL+EAH+VI  MVEKG+Y HLV+KF+ Y+RC
Sbjct: 456 MKEMESRGCNPNFLVYNTLVSNLRNAGKLAEAHEVIRHMVEKGKYIHLVSKFKRYKRC 487
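
The percentages in each HSP summary line follow directly from the match counts (for example, 262/358 gives 73.18%). The sketch below shows one way to parse such a line and recompute the percentages as a consistency check. It is an illustration only: the names `parse_hsp_stats` and `percent` are hypothetical helpers, not part of the database or of any BLAST tool, and the pattern is written to accept both the "Positives" spelling and the "Postives" spelling seen in some exports.

```python
import re

# Hypothetical pattern for the "Identity = a/b (p%), Positives = c/d (q%)" line.
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return (matches, alignment length, reported percent) for identity and positives."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    line = "Identity = 262/358 (73.18%), Positives = 301/358 (84.08%), Query Frame = 1"
    stats = parse_hsp_stats(line)
    for key, (matches, length, reported) in stats.items():
        print(key, reported, percent(matches, length))
```

For the Citrus clementina hit above, the recomputed values agree with the reported ones.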

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP288_ARATH  1.2e-137  66.11     Pentatricopeptide repeat-containing protein At3g60050 OS=Arabidopsis thaliana GN...
PPR81_ARATH  2.4e-135  64.99     Pentatricopeptide repeat-containing protein At1g55630 OS=Arabidopsis thaliana GN...
PPR18_ARATH  2.7e-49   25.72     Pentatricopeptide repeat-containing protein At1g06710, mitochondrial OS=Arabidop...
PP445_ARATH  1.6e-46   26.38     Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana GN...
PP444_ARATH  2.1e-46   24.29     Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop...

Match Name        E-value   Identity  Description
A0A0A0L8P4_CUCSA  1.1e-169  78.61     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G182100 PE=4 SV=1
M5XQE6_PRUPE      1.1e-155  73.06     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025794mg PE=4 SV=1
V4S4A4_9ROSI      3.6e-154  73.18     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004849mg PE=4 SV=1
W9S7I4_9ROSA      5.3e-153  73.39     Uncharacterized protein OS=Morus notabilis GN=L484_015785 PE=4 SV=1
M1C1Y3_SOLTU      6.4e-151  71.51     Uncharacterized protein OS=Solanum tuberosum GN=PGSC0003DMG400022513 PE=4 SV=1

Match Name   E-value   Identity  Description
AT3G60050.1  6.6e-139  66.11     Pentatricopeptide repeat (PPR) superfamily protein
AT1G55630.1  1.4e-136  64.99     Pentatricopeptide repeat (PPR) superfamily protein
AT1G06710.1  1.5e-50   25.72     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G65560.1  9.2e-48   26.38     Pentatricopeptide repeat (PPR) superfamily protein
AT5G64320.1  1.2e-47   24.29     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity  Description
gi|659077478|ref|XP_008439226.1|  3.5e-174  81.39     PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis m...
gi|449446161|ref|XP_004140840.1|  1.5e-169  78.61     PREDICTED: pentatricopeptide repeat-containing protein At1g55630-like [Cucumis s...
gi|596041207|ref|XP_007220061.1|  1.6e-155  73.06     hypothetical protein PRUPE_ppa025794mg [Prunus persica]
gi|645257085|ref|XP_008234250.1|  4.7e-155  73.06     PREDICTED: pentatricopeptide repeat-containing protein At3g60050-like [Prunus mu...
gi|567858468|ref|XP_006421917.1|  5.2e-154  73.18     hypothetical protein CICLE_v10004849mg [Citrus clementina]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g06120.1  Cp4.1LG14g06120.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
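IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 328..345, score: 0.48
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 555..583, score: 7.
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 481..524, score: 2.8E-10
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 585..633, score: 2.8E-14
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 254..297, score: 2.8
IPR002885  Pentatricopeptide repeat  PFAM      PF13812          PPR_3                coord: 644..687, score: 4.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 291..324, score: 6.7E-4
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 555..587, score: 1.4E-6
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 257..289, score: 3.2E-6
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 658..687, score: 0.002
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 484..516, score: 3.2E-6
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 624..656, score: 1.2E-11
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 589..622, score: 2.3E-6
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 518..551, score: 6.7E-4
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 353..385, score: 0.0022
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 152..184, score: 0.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 184..218, score: 6.423
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 586..620, score: 11.17
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 254..288, score: 11.772
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 516..550, score: 9.712
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 621..655, score: 14.513
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 149..183, score: 8.977
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 350..384, score: 8.977
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 219..253, score: 6.686
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 446..480, score: 6.686
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 481..515, score: 11.772
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 289..323, score: 9.712
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 385..419, score: 6.423
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 551..585, score: 10.019
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 656..690, score: 9
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 78..100, score: 3.0E-233
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 124..330, score: 3.0E-233
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 488..690, score: 3.0E
None       No IPR available          PANTHER   PTHR24015:SF361  SUBFAMILY NOT NAMED  coord: 488..690, score: 3.0E-233
None       No IPR available          PANTHER   PTHR24015:SF361  SUBFAMILY NOT NAMED  coord: 78..100, score: 3.0E-233
None       No IPR available          PANTHER   PTHR24015:SF361  SUBFAMILY NOT NAMED  coord: 124..330, score: 3.0E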
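
The PROSITE profile (PS51375) hits in the table above can be checked for two properties: whether each hit spans the 35 residues expected of a pentatricopeptide repeat, and where consecutive hits leave uncovered stretches of the protein. The sketch below does this with the coordinates copied from the table. It assumes the coordinates are 1-based and inclusive, which the page does not state; `hit_lengths` and `gaps_between` are hypothetical helper names introduced here for illustration.

```python
# PS51375 hit coordinates as listed in the InterPro table above
# (assumed to be 1-based, inclusive protein positions).
PS51375_HITS = [
    (184, 218), (586, 620), (254, 288), (516, 550), (621, 655),
    (149, 183), (350, 384), (219, 253), (446, 480), (481, 515),
    (289, 323), (385, 419), (551, 585), (656, 690),
]


def hit_lengths(hits):
    """Length of each hit in residues, counting both endpoints."""
    return [end - start + 1 for start, end in hits]


def gaps_between(hits):
    """Uncovered (start, end) intervals between consecutive hits, in sequence order."""
    ordered = sorted(hits)
    gaps = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end + 1:
            gaps.append((prev_end + 1, next_start - 1))
    return gaps


if __name__ == "__main__":
    print(sorted(set(hit_lengths(PS51375_HITS))))
    print(gaps_between(PS51375_HITS))
```

Under that assumption, all 14 hits are 35 residues long, and the hits tile positions 149 to 690 except for two uncovered stretches, 324..349 and 420..445.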

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG14g06120  Cp4.1LG09g00920  Cucurbita pepo (Zucchini)  cpecpeB023