Cp4.1LG08g01180 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG08g01180
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG08 : 3634974 .. 3636788 (-)
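
The location field packs the sequence ID, start and end coordinates, and strand into a single string. A minimal Python sketch of unpacking it follows. It assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style features, not something this record states), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return {"seqid": seqid, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cp4.1LG08 : 3634974 .. 3636788 (-)")
print(loc["length"])      # 1815
print(loc["length"] % 3)  # 0
```

Under that assumption the feature spans 1815 bp, a multiple of three, and the `(-)` marks it as lying on the reverse strand of Cp4.1LG08.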
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTGTGCTTTTCAAGATGTGCACGCTACCTGTTTGTCAAAAGTCCCCAAAGAAAAAATTACACGATAGGTCTTGTGATCGATGCTACTTCAGCCAAAAAACAAGCATATTATGAAGACCCAGTTGGATTCTTTGCTCAACAAGAGGATGTAATTTCTTGGACGTCTAAAATCACCAATTTGGTAAGAACAGGCCAGCCGGACTCGGCTTTTGGCTTCTTCAAGATGATGTTCGCAAATGGGCACAGGCCAAATTATGTGACAATGCTGAGCGTATTAAGAGCGATTGACGCATTAAGCTGGGAATCAACGATTGAGGTGATGCATGGAGGAGTGATCAAAATGGGGTTTGAATCAGAAGTGGCAGTTTCAACGGCTCTTCTTGGGTTTTATTCAATGCGTGATATTGGGATCGTTTGGAAGTTGTTTTATCAGATACCTTATAAGGATGTTGTTTTGTGGAGTGCTGTGATCTCGGCATGCGTGAAACATGGCCAGTTTATTGAAGCATTTCATCTTTTCAGAGAGATGCAATATCAAGGTGTTCATCCAAACCATGTAAGCATTGTAAGCATTCTACCTGCTTGTGCTGATTTTGGTGCTCTATCTTTGGGTAAAGAGATTCATGCGTTTTCAATGAGAAGGGATTTTTATTCTATAGTTAACATTCAGAACTCACTCATGGATATGTATTCAAAATGTAGAAATCTTGAGGCATCAATTAGAGTTTTGAAGACTATGAGGAAAAAGGACATGGTGTCATGGAGAACTGTAACTCATGCGTGTATCCAAAACAATTGTCCTAGTAAAGCCTTTAAAATTTTCACAAGGATGCGGTCTTTTGGATTCGAATTGGGCGAAACGATGATGCTGGACTTTATAGCTGCTGTGTTATTAGTTGATGAACTTTTACTTGGCTTGGCTGTTCATTGTCATGCGCTGAAAGGTGGTTTTCTCTGTTTTATTTCAGTTGGAACTGAACTCCTCCAAATGTATGCTAAATTTGGTGAATTGGGGTTGGCCAAACTTACATTTGATGAGCTTGTTGACAAAGATATCATTGCATGGAGTGCAATGATCTCAGCATACTCTCATGGCGAAGAACCACTTAGTGCCATCCAGACATTTAAAATGATGCAATCAACTAATGAAAGGCCTAATGCTATAACTTTTGTAAGTCTAGTGAATGCCTGTTCTTCATTGGATGCTCAGGAACTGGGAGAAAGTATACATGCTCATATAACGAAATCTGGTTACTCGTCTAATACATGTTTGATGTCTGCTTTGGTTGATTTTTACTGCATACTTAGAAGGGTAAAGCTAGGAGAACATGTTTTTGATGAGATTGTGACAAAGGATTTAGTTTGTTGGAGTACGATGATTAAAGGGTACGGTACGAATGGCTGCGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCGAATGGGACGCTCTTCGTTTCTCTTTTATCGGCTTGTGCTCAATGTGGATTGGAAAAGGAAGGTTGGATGTGGTTTAATGCAATGATTGACGAGTATAACATTACTCCAACAGTGGCACATTATGCTTGTATGGTGGAGTTGCTTGCTAGGCAAGGAAAAATTAGAGAAGCCGTTGAATTTGTGAAGAAAATGGCAGTAGAACCTGATACAAGGATCTGGGGTGCTCTTTTTGCTGGTTGTAAATTAACTCATGGGTTCTCTGACATCGCTGATTCTATTGTTCAACAGCTCAATGCTTTAGAACCAAACAATTCTGACTTTCATGCAATGTTGCACAACTTTTGTATTGAGTAA

mRNA sequence

ATGCTGTGCTTTTCAAGATGTGCACGCTACCTGTTTGTCAAAAGTCCCCAAAGAAAAAATTACACGATAGGTCTTGTGATCGATGCTACTTCAGCCAAAAAACAAGCATATTATGAAGACCCAGTTGGATTCTTTGCTCAACAAGAGGATGTAATTTCTTGGACGTCTAAAATCACCAATTTGGTAAGAACAGGCCAGCCGGACTCGGCTTTTGGCTTCTTCAAGATGATGTTCGCAAATGGGCACAGGCCAAATTATGTGACAATGCTGAGCGTATTAAGAGCGATTGACGCATTAAGCTGGGAATCAACGATTGAGGTGATGCATGGAGGAGTGATCAAAATGGGGTTTGAATCAGAAGTGGCAGTTTCAACGGCTCTTCTTGGGTTTTATTCAATGCGTGATATTGGGATCGTTTGGAAGTTGTTTTATCAGATACCTTATAAGGATGTTGTTTTGTGGAGTGCTGTGATCTCGGCATGCGTGAAACATGGCCAGTTTATTGAAGCATTTCATCTTTTCAGAGAGATGCAATATCAAGGTGTTCATCCAAACCATGTAAGCATTGTAAGCATTCTACCTGCTTGTGCTGATTTTGGTGCTCTATCTTTGGGTAAAGAGATTCATGCGTTTTCAATGAGAAGGGATTTTTATTCTATAGTTAACATTCAGAACTCACTCATGGATATGTATTCAAAATGTAGAAATCTTGAGGCATCAATTAGAGTTTTGAAGACTATGAGGAAAAAGGACATGGTGTCATGGAGAACTGTAACTCATGCGTGTATCCAAAACAATTGTCCTAGTAAAGCCTTTAAAATTTTCACAAGGATGCGGTCTTTTGGATTCGAATTGGGCGAAACGATGATGCTGGACTTTATAGCTGCTGTGTTATTAGTTGATGAACTTTTACTTGGCTTGGCTGTTCATTGTCATGCGCTGAAAGGTGGTTTTCTCTGTTTTATTTCAGTTGGAACTGAACTCCTCCAAATGTATGCTAAATTTGGTGAATTGGGGTTGGCCAAACTTACATTTGATGAGCTTGTTGACAAAGATATCATTGCATGGAGTGCAATGATCTCAGCATACTCTCATGGCGAAGAACCACTTAGTGCCATCCAGACATTTAAAATGATGCAATCAACTAATGAAAGGCCTAATGCTATAACTTTTGTAAGTCTAGTGAATGCCTGTTCTTCATTGGATGCTCAGGAACTGGGAGAAAGTATACATGCTCATATAACGAAATCTGGTTACTCGTCTAATACATGTTTGATGTCTGCTTTGGTTGATTTTTACTGCATACTTAGAAGGGTAAAGCTAGGAGAACATGTTTTTGATGAGATTGTGACAAAGGATTTAGTTTGTTGGAGTACGATGATTAAAGGGTACGGTACGAATGGCTGCGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCGAATGGGACGCTCTTCGTTTCTCTTTTATCGGCTTGTGCTCAATGTGGATTGGAAAAGGAAGGTTGGATGTGGTTTAATGCAATGATTGACGAGTATAACATTACTCCAACAGTGGCACATTATGCTTGTATGGTGGAGTTGCTTGCTAGGCAAGGAAAAATTAGAGAAGCCGTTGAATTTGTGAAGAAAATGGCAGTAGAACCTGATACAAGGATCTGGGGTGCTCTTTTTGCTGGTTGTAAATTAACTCATGGGTTCTCTGACATCGCTGATTCTATTGTTCAACAGCTCAATGCTTTAGAACCAAACAATTCTGACTTTCATGCAATGTTGCACAACTTTTGTATTGAGTAA

Coding sequence (CDS)

ATGCTGTGCTTTTCAAGATGTGCACGCTACCTGTTTGTCAAAAGTCCCCAAAGAAAAAATTACACGATAGGTCTTGTGATCGATGCTACTTCAGCCAAAAAACAAGCATATTATGAAGACCCAGTTGGATTCTTTGCTCAACAAGAGGATGTAATTTCTTGGACGTCTAAAATCACCAATTTGGTAAGAACAGGCCAGCCGGACTCGGCTTTTGGCTTCTTCAAGATGATGTTCGCAAATGGGCACAGGCCAAATTATGTGACAATGCTGAGCGTATTAAGAGCGATTGACGCATTAAGCTGGGAATCAACGATTGAGGTGATGCATGGAGGAGTGATCAAAATGGGGTTTGAATCAGAAGTGGCAGTTTCAACGGCTCTTCTTGGGTTTTATTCAATGCGTGATATTGGGATCGTTTGGAAGTTGTTTTATCAGATACCTTATAAGGATGTTGTTTTGTGGAGTGCTGTGATCTCGGCATGCGTGAAACATGGCCAGTTTATTGAAGCATTTCATCTTTTCAGAGAGATGCAATATCAAGGTGTTCATCCAAACCATGTAAGCATTGTAAGCATTCTACCTGCTTGTGCTGATTTTGGTGCTCTATCTTTGGGTAAAGAGATTCATGCGTTTTCAATGAGAAGGGATTTTTATTCTATAGTTAACATTCAGAACTCACTCATGGATATGTATTCAAAATGTAGAAATCTTGAGGCATCAATTAGAGTTTTGAAGACTATGAGGAAAAAGGACATGGTGTCATGGAGAACTGTAACTCATGCGTGTATCCAAAACAATTGTCCTAGTAAAGCCTTTAAAATTTTCACAAGGATGCGGTCTTTTGGATTCGAATTGGGCGAAACGATGATGCTGGACTTTATAGCTGCTGTGTTATTAGTTGATGAACTTTTACTTGGCTTGGCTGTTCATTGTCATGCGCTGAAAGGTGGTTTTCTCTGTTTTATTTCAGTTGGAACTGAACTCCTCCAAATGTATGCTAAATTTGGTGAATTGGGGTTGGCCAAACTTACATTTGATGAGCTTGTTGACAAAGATATCATTGCATGGAGTGCAATGATCTCAGCATACTCTCATGGCGAAGAACCACTTAGTGCCATCCAGACATTTAAAATGATGCAATCAACTAATGAAAGGCCTAATGCTATAACTTTTGTAAGTCTAGTGAATGCCTGTTCTTCATTGGATGCTCAGGAACTGGGAGAAAGTATACATGCTCATATAACGAAATCTGGTTACTCGTCTAATACATGTTTGATGTCTGCTTTGGTTGATTTTTACTGCATACTTAGAAGGGTAAAGCTAGGAGAACATGTTTTTGATGAGATTGTGACAAAGGATTTAGTTTGTTGGAGTACGATGATTAAAGGGTACGGTACGAATGGCTGCGGGAATGAGGCACTCAATACATTTTCAGACATGTTAAGTTATGGTTTGAAGCCGAATGGGACGCTCTTCGTTTCTCTTTTATCGGCTTGTGCTCAATGTGGATTGGAAAAGGAAGGTTGGATGTGGTTTAATGCAATGATTGACGAGTATAACATTACTCCAACAGTGGCACATTATGCTTGTATGGTGGAGTTGCTTGCTAGGCAAGGAAAAATTAGAGAAGCCGTTGAATTTGTGAAGAAAATGGCAGTAGAACCTGATACAAGGATCTGGGGTGCTCTTTTTGCTGGTTGTAAATTAACTCATGGGTTCTCTGACATCGCTGATTCTATTGTTCAACAGCTCAATGCTTTAGAACCAAACAATTCTGACTTTCATGCAATGTTGCACAACTTTTGTATTGAGTAA

Protein sequence

MLCFSRCARYLFVKSPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPNNSDFHAMLHNFCIE
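
The protein sequence can be spot-checked against the coding sequence by translating a few codons. The sketch below assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names. It translates only the first 30 and last 12 nucleotides of the CDS shown above, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGCTGTGCTTTTCAAGATGTGCACGCTAC"))  # MLCFSRCARY
print(translate("TGTATTGAGTAA"))                    # CIE*
```

Both fragments agree with the start (`MLCFSRCARY`) and end (`CIE`) of the protein sequence, with the final `TAA` read as the stop codon.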
BLAST of Cp4.1LG08g01180 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 322.4 bits (825), Expect = 1.1e-86
Identity = 181/561 (32.26%), Positives = 298/561 (53.12%), Query Frame = 1

Query: 49  EDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVM 108
           E  + W   +  L ++G    + G FK M ++G   +  T   V ++  +L      E +
Sbjct: 158 EKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQL 217

Query: 109 HGGVIKMGFESEVAVSTALLGFYSMRD-IGIVWKLFYQIPYKDVVLWSAVISACVKHGQF 168
           HG ++K GF    +V  +L+ FY     +    K+F ++  +DV+ W+++I+  V +G  
Sbjct: 218 HGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLA 277

Query: 169 IEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSL 228
            +   +F +M   G+  +  +IVS+   CAD   +SLG+ +H+  ++  F       N+L
Sbjct: 278 EKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTL 337

Query: 229 MDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFE--- 288
           +DMYSKC +L+++  V + M  + +VS+ ++     +     +A K+F  M   G     
Sbjct: 338 LDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDV 397

Query: 289 LGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLT 348
              T +L+  A   L+DE   G  VH    +      I V   L+ MYAK G +  A+L 
Sbjct: 398 YTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 457

Query: 349 FDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMM-QSTNERPNAITFVSLVNACSSLDA 408
           F E+  KDII+W+ +I  YS       A+  F ++ +     P+  T   ++ AC+SL A
Sbjct: 458 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 517

Query: 409 QELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKG 468
            + G  IH +I ++GY S+  + ++LVD Y     + L   +FD+I +KDLV W+ MI G
Sbjct: 518 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 577

Query: 469 YGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITP 528
           YG +G G EA+  F+ M   G++ +   FVSLL AC+  GL  EGW +FN M  E  I P
Sbjct: 578 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 637

Query: 529 TVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQ 588
           TV HYAC+V++LAR G + +A  F++ M + PD  IWGAL  GC++ H    +A+ + ++
Sbjct: 638 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVK-LAEKVAEK 697

Query: 589 LNALEPNNSDFHAMLHNFCIE 605
           +  LEP N+ ++ ++ N   E
Sbjct: 698 VFELEPENTGYYVLMANIYAE 714
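
Each HSP summary line reports identities and positives as counts over the alignment length, where positives are aligned positions with a positive substitution-matrix score (identical residues included). A minimal sketch of recomputing the percentages from such a line is shown here. `parse_hsp_stats` and `percent` are hypothetical helpers, and the regular expression is an assumption about the line layout seen in this record.

```python
import re

def parse_hsp_stats(line):
    """Return {'identity': (hits, total), 'positives': (hits, total)} from an HSP line."""
    stats = {}
    # 'Posi?tives' also accepts the misspelling 'Postives'.
    for name, hits, total in re.findall(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)", line):
        key = "identity" if name == "Identity" else "positives"
        stats[key] = (int(hits), int(total))
    return stats

def percent(hits, total):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * hits / total, 2)

line = "Identity = 181/561 (32.26%), Positives = 298/561 (53.12%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(percent(*stats["identity"]))   # 32.26
print(percent(*stats["positives"]))  # 53.12
```

The denominator (561 here) is the number of alignment columns, gaps included, so it can exceed the number of query residues covered by the HSP.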

BLAST of Cp4.1LG08g01180 vs. Swiss-Prot
Match: PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN=DYW9 PE=2 SV=1)

HSP 1 Score: 303.1 bits (775), Expect = 6.6e-81
Identity = 167/567 (29.45%), Positives = 295/567 (52.03%), Query Frame = 1

Query: 37  YYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFFK-MMFANGHRPNYVTMLSVLRA 96
           YY   +    Q+ DV  +   +        P S+   F  +  +   +PN  T    + A
Sbjct: 69  YYARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISA 128

Query: 97  IDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFY-SMRDIGIVWKLFYQIPYKDVVLW 156
                 +    V+HG  +  G +SE+ + + ++  Y     +    K+F ++P KD +LW
Sbjct: 129 ASGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILW 188

Query: 157 SAVISACVKHGQFIEAFHLFREMQYQG-VHPNHVSIVSILPACADFGALSLGKEIHAFSM 216
           + +IS   K+  ++E+  +FR++  +     +  +++ ILPA A+   L LG +IH+ + 
Sbjct: 189 NTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLAT 248

Query: 217 RRDFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFK 276
           +   YS   +    + +YSKC  ++    + +  RK D+V++  + H    N     +  
Sbjct: 249 KTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLS 308

Query: 277 IFTRMRSFGFELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYA 336
           +F  +   G  L  + +   ++ V +   L+L  A+H + LK  FL   SV T L  +Y+
Sbjct: 309 LFKELMLSGARLRSSTL---VSLVPVSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYS 368

Query: 337 KFGELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVS 396
           K  E+  A+  FDE  +K + +W+AMIS Y+       AI  F+ MQ +   PN +T   
Sbjct: 369 KLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITC 428

Query: 397 LVNACSSLDAQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKD 456
           +++AC+ L A  LG+ +H  +  + + S+  + +AL+  Y     +     +FD +  K+
Sbjct: 429 ILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKN 488

Query: 457 LVCWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFN 516
            V W+TMI GYG +G G EALN F +ML+ G+ P    F+ +L AC+  GL KEG   FN
Sbjct: 489 EVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFN 548

Query: 517 AMIDEYNITPTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGF 576
           +MI  Y   P+V HYACMV++L R G ++ A++F++ M++EP + +W  L   C++ H  
Sbjct: 549 SMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRI-HKD 608

Query: 577 SDIADSIVQQLNALEPNNSDFHAMLHN 601
           +++A ++ ++L  L+P+N  +H +L N
Sbjct: 609 TNLARTVSEKLFELDPDNVGYHVLLSN 631

BLAST of Cp4.1LG08g01180 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 302.0 bits (772), Expect = 1.5e-80
Identity = 161/550 (29.27%), Positives = 294/550 (53.45%), Query Frame = 1

Query: 52  ISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGG 111
           + + + +    +    D A  FF  M  +   P       +L+     +     + +HG 
Sbjct: 101 VLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGL 160

Query: 112 VIKMGFESEVAVSTALLGFYSM-RDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEA 171
           ++K GF  ++   T L   Y+  R +    K+F ++P +D+V W+ +++   ++G    A
Sbjct: 161 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 220

Query: 172 FHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDM 231
             + + M  + + P+ ++IVS+LPA +    +S+GKEIH ++MR  F S+VNI  +L+DM
Sbjct: 221 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDM 280

Query: 232 YSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMM 291
           Y+KC +LE + ++   M ++++VSW ++  A +QN  P +A  IF +M   G +  +  +
Sbjct: 281 YAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSV 340

Query: 292 LDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVD 351
           +  + A   + +L  G  +H  +++ G    +SV   L+ MY K  E+  A   F +L  
Sbjct: 341 MGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS 400

Query: 352 KDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESI 411
           + +++W+AMI  ++    P+ A+  F  M+S   +P+  T+VS++ A + L      + I
Sbjct: 401 RTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWI 460

Query: 412 HAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCG 471
           H  + +S    N  + +ALVD Y     + +   +FD +  + +  W+ MI GYGT+G G
Sbjct: 461 HGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFG 520

Query: 472 NEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYAC 531
             AL  F +M    +KPNG  F+S++SAC+  GL + G   F  M + Y+I  ++ HY  
Sbjct: 521 KAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGA 580

Query: 532 MVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPN 591
           MV+LL R G++ EA +F+ +M V+P   ++GA+   C++ H   + A+   ++L  L P+
Sbjct: 581 MVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQI-HKNVNFAEKAAERLFELNPD 640

Query: 592 NSDFHAMLHN 601
           +  +H +L N
Sbjct: 641 DGGYHVLLAN 649

BLAST of Cp4.1LG08g01180 vs. Swiss-Prot
Match: PP359_ARATH (Pentatricopeptide repeat-containing protein At4g39952, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E98 PE=2 SV=2)

HSP 1 Score: 300.8 bits (769), Expect = 3.3e-80
Identity = 172/556 (30.94%), Positives = 290/556 (52.16%), Query Frame = 1

Query: 50  DVISWTSKITNLVRTGQPDSAFGFFKMMFANGH---RPNYVTMLSVLRAIDALSWESTIE 109
           DV++WT+ I+  V+ G+ +   G+   M + G    +PN  T+    +A   L       
Sbjct: 191 DVVAWTAIISGHVQNGESEGGLGYLCKMHSAGSDVDKPNPRTLECGFQACSNLGALKEGR 250

Query: 110 VMHGGVIKMGFESEVAVSTALLGFYSMR-DIGIVWKLFYQIPYKDVVLWSAVISACVKHG 169
            +HG  +K G  S   V +++  FYS   +    +  F ++  +D+  W+++I++  + G
Sbjct: 251 CLHGFAVKNGLASSKFVQSSMFSFYSKSGNPSEAYLSFRELGDEDMFSWTSIIASLARSG 310

Query: 170 QFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQN 229
              E+F +F EMQ +G+HP+ V I  ++        +  GK  H F +R  F     + N
Sbjct: 311 DMEESFDMFWEMQNKGMHPDGVVISCLINELGKMMLVPQGKAFHGFVIRHCFSLDSTVCN 370

Query: 230 SLMDMYSKCRNLEASIRVL-KTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFE 289
           SL+ MY K   L  + ++  +   + +  +W T+     +  C  K  ++F ++++ G E
Sbjct: 371 SLLSMYCKFELLSVAEKLFCRISEEGNKEAWNTMLKGYGKMKCHVKCIELFRKIQNLGIE 430

Query: 290 LGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLT 349
           +        I++   +  +LLG ++HC+ +K      ISV   L+ +Y K G+L +A   
Sbjct: 431 IDSASATSVISSCSHIGAVLLGKSLHCYVVKTSLDLTISVVNSLIDLYGKMGDLTVAWRM 490

Query: 350 FDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQ 409
           F E  D ++I W+AMI++Y H E+   AI  F  M S N +P++IT V+L+ AC +  + 
Sbjct: 491 FCE-ADTNVITWNAMIASYVHCEQSEKAIALFDRMVSENFKPSSITLVTLLMACVNTGSL 550

Query: 410 ELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGY 469
           E G+ IH +IT++ +  N  L +AL+D Y     ++    +FD    KD VCW+ MI GY
Sbjct: 551 ERGQMIHRYITETEHEMNLSLSAALIDMYAKCGHLEKSRELFDAGNQKDAVCWNVMISGY 610

Query: 470 GTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPT 529
           G +G    A+  F  M    +KP G  F++LLSAC   GL ++G   F  M  +Y++ P 
Sbjct: 611 GMHGDVESAIALFDQMEESDVKPTGPTFLALLSACTHAGLVEQGKKLFLKM-HQYDVKPN 670

Query: 530 VAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQL 589
           + HY+C+V+LL+R G + EA   V  M   PD  IWG L + C +THG  ++   + ++ 
Sbjct: 671 LKHYSCLVDLLSRSGNLEEAESTVMSMPFSPDGVIWGTLLSSC-MTHGEFEMGIRMAERA 730

Query: 590 NALEPNNSDFHAMLHN 601
            A +P N  ++ ML N
Sbjct: 731 VASDPQNDGYYIMLAN 743

BLAST of Cp4.1LG08g01180 vs. Swiss-Prot
Match: PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 294.7 bits (753), Expect = 2.4e-78
Identity = 168/562 (29.89%), Positives = 285/562 (50.71%), Query Frame = 1

Query: 45  FAQQEDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWEST 104
           F ++ D + W S +++   +G+       F+ M   G  PN  T++S L A D  S+   
Sbjct: 243 FQEKGDAVLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKL 302

Query: 105 IEVMHGGVIKMG-FESEVAVSTALLGFYSM-RDIGIVWKLFYQIPYKDVVLWSAVISACV 164
            + +H  V+K     SE+ V  AL+  Y+    +    ++  Q+   DVV W+++I   V
Sbjct: 303 GKEIHASVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYV 362

Query: 165 KHGQFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVN 224
           ++  + EA   F +M   G   + VS+ SI+ A      L  G E+HA+ ++  + S + 
Sbjct: 363 QNLMYKEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQ 422

Query: 225 IQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFG 284
           + N+L+DMYSKC       R    M  KD++SW TV     QN+C  +A ++F  +    
Sbjct: 423 VGNTLIDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKR 482

Query: 285 FELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAK 344
            E+ E ++   + A  ++  +L+   +HCH L+ G L  + +  EL+ +Y K   +G A 
Sbjct: 483 MEIDEMILGSILRASSVLKSMLIVKEIHCHILRKGLLDTV-IQNELVDVYGKCRNMGYAT 542

Query: 345 LTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLD 404
             F+ +  KD+++W++MIS+ +       A++ F+ M  T    +++  + +++A +SL 
Sbjct: 543 RVFESIKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLS 602

Query: 405 AQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIK 464
           A   G  IH ++ + G+     +  A+VD Y     ++  + VFD I  K L+ +++MI 
Sbjct: 603 ALNKGREIHCYLLRKGFCLEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMIN 662

Query: 465 GYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNIT 524
            YG +GCG  A+  F  M    + P+   F++LL AC+  GL  EG  +   M  EY + 
Sbjct: 663 AYGMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELE 722

Query: 525 PTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQ 584
           P   HY C+V++L R   + EA EFVK M  EP   +W AL A C+ +H   +I +   Q
Sbjct: 723 PWPEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACR-SHSEKEIGEIAAQ 782

Query: 585 QLNALEPNNSDFHAMLHNFCIE 605
           +L  LEP N     ++ N   E
Sbjct: 783 RLLELEPKNPGNLVLVSNVFAE 802

BLAST of Cp4.1LG08g01180 vs. TrEMBL
Match: A0A0A0L489_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G122420 PE=4 SV=1)

HSP 1 Score: 909.1 bits (2348), Expect = 2.9e-261
Identity = 446/550 (81.09%), Positives = 493/550 (89.64%), Query Frame = 1

Query: 1   MLCFSRCARYLFVKSPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITN 60
           MLCFSRCAR LFV S +RK+YTI  ++DATS KK+ Y+EDPV F+AQ+EDVISWTSKITN
Sbjct: 1   MLCFSRCARNLFVVSSKRKDYTIRSMVDATSTKKRVYFEDPVEFYAQREDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120
           LVRTGQP+SAFGFFKMMF+NGHRPNYVTMLSV+RAIDALSW+S IEVMHG VIKMGFESE
Sbjct: 61  LVRTGQPESAFGFFKMMFSNGHRPNYVTMLSVIRAIDALSWDSMIEVMHGVVIKMGFESE 120

Query: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREMQYQ 180
           VAVSTALLGFYS+RDI  VWKLF QIP KDVVLWSA+IS CVK+GQ+ EAF L REMQ Q
Sbjct: 121 VAVSTALLGFYSIRDIETVWKLFNQIPSKDVVLWSAIISVCVKNGQYNEAFDLLREMQDQ 180

Query: 181 GVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240
           GV PN V+IVSILPACADFG LSLGKE+HAFSMRRDFYS+V++QNSLMDMYSKCR  EAS
Sbjct: 181 GVQPNQVTIVSILPACADFGVLSLGKELHAFSMRRDFYSMVDLQNSLMDMYSKCRKFEAS 240

Query: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300
           IRVLK MRKKD VSW+ +THACIQNNCPSK FKIF+RMRSFGFEL ETMMLD I+AVLL+
Sbjct: 241 IRVLKLMRKKDAVSWKIITHACIQNNCPSKVFKIFSRMRSFGFELSETMMLDMISAVLLL 300

Query: 301 DELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360
           DELLLGLAVHC+ALKGGFLCFI VGTELLQMYAKFG+L LAKL FD LVDKDIIAWSAMI
Sbjct: 301 DELLLGLAVHCYALKGGFLCFILVGTELLQMYAKFGDLRLAKLVFDGLVDKDIIAWSAMI 360

Query: 361 SAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYS 420
           SAYSHGE+PL+AIQTFKMMQSTNE+PN  TFVSL++ACSSL A+ELGE+I AH  K GY+
Sbjct: 361 SAYSHGEDPLNAIQTFKMMQSTNEKPNERTFVSLMDACSSLGAKELGETIQAHTIKCGYT 420

Query: 421 SNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDM 480
           SNT LMSALV FYC L R+KLGEHVFDEI  KD++CW+ +IKGYG NGCGN+ALNTFSDM
Sbjct: 421 SNTHLMSALVGFYCKLGRIKLGEHVFDEISRKDVICWNALIKGYGLNGCGNKALNTFSDM 480

Query: 481 LSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMVELLARQGK 540
           LSYGLKPNG +F SLLSACAQCGLEKE  MWF +M DEY ITPT+AHYAC+V+LL RQGK
Sbjct: 481 LSYGLKPNGVVFASLLSACAQCGLEKEVRMWFRSMNDEYGITPTMAHYACIVDLLVRQGK 540

Query: 541 IREAVEFVKK 551
           IREAVEFVKK
Sbjct: 541 IREAVEFVKK 550

BLAST of Cp4.1LG08g01180 vs. TrEMBL
Match: A5C4V9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_040403 PE=4 SV=1)

HSP 1 Score: 683.7 bits (1763), Expect = 2.0e-193
Identity = 324/569 (56.94%), Positives = 424/569 (74.52%), Query Frame = 1

Query: 36  AYYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRA 95
           AYYE+PV F  ++++VISWTSKI++LV+  Q + A G FKMM     RPN+VT+LSV+RA
Sbjct: 39  AYYEEPVEFHGEKDNVISWTSKISSLVKQNQSELAVGLFKMMLMTEQRPNHVTVLSVIRA 98

Query: 96  IDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWS 155
           I  L  E  + V+ G VIK+GFESEV+V+TAL+GFYS  D+GIVWK+F Q P KD+VLWS
Sbjct: 99  ISGLGLEDMMRVICGSVIKLGFESEVSVATALIGFYSDYDMGIVWKIFNQTPIKDLVLWS 158

Query: 156 AVISACVKHGQFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRR 215
           A++SACVK GQ+ EAF +FR MQY GV PNHVSIVSILPACA+ GAL  GKEIH FS+++
Sbjct: 159 AMVSACVKSGQYGEAFEIFRAMQYDGVEPNHVSIVSILPACANVGALLFGKEIHGFSIKK 218

Query: 216 DFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIF 275
            F+ + N+ NSL+DMY+KCRN +AS+ V   + +KD++SW T+   CI+N+CP +AFK F
Sbjct: 219 MFHPLTNVHNSLVDMYAKCRNFKASMLVFDQILEKDLISWTTIIRGCIENDCPREAFKAF 278

Query: 276 TRMRSFGFELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKF 335
           +RM+   F   ET++ D I A++  DE   G+A H   LK G L F+S+GT LLQMYAKF
Sbjct: 279 SRMQFSCFGADETIVQDLIVAIIQADEHKFGIAFHGFLLKNGLLAFVSIGTALLQMYAKF 338

Query: 336 GELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLV 395
           GEL  A + FD+L  KD I+WSAMIS ++H   P +A++TFK MQST+ERPN ITFVSL+
Sbjct: 339 GELESAIIVFDQLNKKDYISWSAMISVHAHSRHPYNALETFKQMQSTDERPNEITFVSLL 398

Query: 396 NACSSLDAQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLV 455
            ACS + AQELGESI AH TK+GY SN  L SAL+D YC   R+  G  +F+EI TKDLV
Sbjct: 399 QACSLIGAQELGESIQAHATKAGYLSNAFLSSALIDLYCKFGRINQGRAIFNEIPTKDLV 458

Query: 456 CWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAM 515
           CWS+MI GYG NGCG+EAL TFS+ML+ G+KPN  +F+S+LSAC+ CGLE EGW  F++M
Sbjct: 459 CWSSMINGYGLNGCGDEALETFSNMLACGVKPNEVVFISVLSACSHCGLEHEGWSCFSSM 518

Query: 516 IDEYNITPTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSD 575
             +Y I P + HYACMV+L++R+G I  A++FV KM +EPD RIWGAL AGC+ THG  +
Sbjct: 519 EQKYGIIPKLPHYACMVDLISRRGNIEGALQFVNKMPMEPDKRIWGALLAGCRSTHGSIE 578

Query: 576 IADSIVQQLNALEPNNSDFHAMLHNFCIE 605
           IA+ + ++L  L+P N+ ++ +L N   E
Sbjct: 579 IAELVAERLIGLDPQNTSYYVILSNLYAE 607

BLAST of Cp4.1LG08g01180 vs. TrEMBL
Match: A0A061DV61_THECC (Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao GN=TCM_005333 PE=4 SV=1)

HSP 1 Score: 613.6 bits (1581), Expect = 2.6e-172
Identity = 305/589 (51.78%), Positives = 411/589 (69.78%), Query Frame = 1

Query: 15   SPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFF 74
            SP R   ++ L   +T   +   ++DPV  +A ++ VISWTS ++ LVR GQP+ A G F
Sbjct: 668  SPTR--ISVALSRYSTLCYRNPNHDDPVDPYADKDHVISWTSVLSKLVRQGQPEEAIGLF 727

Query: 75   KMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFYSMR 134
            K M  +  RPNYVT+LS+++A D L WE+   ++HG VIKMGFESE +V TAL+G YS+ 
Sbjct: 728  KTMLMSNQRPNYVTILSLVKAFDTLDWEALRMMVHGLVIKMGFESEPSVLTALIGSYSVY 787

Query: 135  DIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREMQYQGVHPNHVSIVSILP 194
             +G+ W LF QIP KDVVL SA++SACVK+G ++EA  LFR MQ  G+  NHVSIVSILP
Sbjct: 788  GMGVCWSLFNQIPNKDVVLRSAMVSACVKNGDYVEALELFRRMQVLGLKANHVSIVSILP 847

Query: 195  ACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVS 254
            ACA+ GAL LG+EIH F +RR    +  +QNSL+DMY+KCR+L+ +I V   M KKD+VS
Sbjct: 848  ACANLGALQLGREIHGFIIRRMICYVNTVQNSLVDMYAKCRSLQTAICVFNGMLKKDLVS 907

Query: 255  WRTVTHACIQNNCPSKAFKIFTRMRSFG-FELGETMMLDFIAAVLLVDELLLGLAVHCHA 314
            WRT+    ++N C  KA   F++M+    F L E ++ D I AVL   E  +G A HC+ 
Sbjct: 908  WRTLIRGYVENECGIKALDAFSKMQRLSFFALDEFVVRDMIMAVLQSGESKIGSAFHCYI 967

Query: 315  LKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAI 374
            LK GFL F+S+ T LLQMYAKF  +  A+  FD + +KD+IAW+AMISAY+    P +AI
Sbjct: 968  LKTGFLAFVSIATALLQMYAKFSMVASARNVFDHISNKDVIAWNAMISAYAQTGLPFNAI 1027

Query: 375  QTFKMMQSTNERPNAITFVSLVNACSSLDAQE----LGESIHAHITKSGYSSNTCLMSAL 434
             TF+ M   NE+P+  + VSL+  CS + +QE    +GE+IHA + K GYS N  L SAL
Sbjct: 1028 NTFRQMLLMNEKPSEFSLVSLLQICSLMASQEVSDKVGETIHAFVAKVGYSRNVYLSSAL 1087

Query: 435  VDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNG 494
            +DFYC   RVK G+ +FDE+ TKDL+CWS+MI GY  NG G EAL TF++ML  G+KPN 
Sbjct: 1088 IDFYCRFGRVKQGKALFDEVPTKDLICWSSMINGYVLNGYGIEALETFANMLDCGIKPND 1147

Query: 495  TLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMVELLARQGKIREAVEFVK 554
             +F+S+LSAC+ CGL+ EGW WF +M ++Y ITP +AHYACMV+LL+RQG I +A+ FVK
Sbjct: 1148 IIFLSVLSACSHCGLKNEGWNWFYSMKEKYGITPKLAHYACMVDLLSRQGHIEQALHFVK 1207

Query: 555  KMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPNNSDFHAML 599
            KM +EPD RIWGAL AGC+++ G   I + +V++L+ L+P NS  + M+
Sbjct: 1208 KMPMEPDKRIWGALLAGCRVSPGPIKIVEFVVERLSTLDPQNSTHYYMI 1254

BLAST of Cp4.1LG08g01180 vs. TrEMBL
Match: A0A0D2RE34_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G121600 PE=4 SV=1)

HSP 1 Score: 610.9 bits (1574), Expect = 1.7e-171
Identity = 312/600 (52.00%), Positives = 418/600 (69.67%), Query Frame = 1

Query: 1   MLCFSRCARYLFVKSPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDV-ISWTSKIT 60
           M   SR AR LF +  QR   +    I AT   + + ++D V   A+ + V +SWTSK++
Sbjct: 1   MFSQSRNARKLFAEITQRIKSS-PTRISATLFHRNSSHDDTVEPNAENDHVLVSWTSKLS 60

Query: 61  NLVRTGQPDSAFGFFKMMFA--NGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGF 120
            LV+ GQP+ A   FK M    +  RPNYVT+LS+++A+DAL W+  + ++HG V+KMGF
Sbjct: 61  KLVKQGQPEEAICLFKRMLLLKSNQRPNYVTILSLIKALDALHWDVLVMMVHGLVVKMGF 120

Query: 121 ESEVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREM 180
            SE +V TAL+G YS+  +G  W LF+QI  KDVVLWSAV+ ACVK+  ++EA  LFR M
Sbjct: 121 ISEPSVLTALIGSYSVYGMGTCWSLFHQIRDKDVVLWSAVVYACVKNKDYLEALELFRRM 180

Query: 181 QYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNL 240
           Q+ G+  NHVSIVSILPACA+ GAL LG+EIH F ++R F  ++++QNSL+DMY+KCRNL
Sbjct: 181 QFIGLKVNHVSIVSILPACANLGALRLGREIHGFIIKRMFSHVISVQNSLVDMYAKCRNL 240

Query: 241 EASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFEL-GETMMLDFIAA 300
           E+ IRV   M +KD+VSWRTV    I+N    +A  IF++M+   F    E ++ D I A
Sbjct: 241 ESGIRVFDGMLEKDLVSWRTVIRGYIENEFGIEAINIFSKMQLLSFFAPDEFVVRDMIMA 300

Query: 301 VLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAW 360
           VL   E  LG A HC+ +K GFL F+SV T LLQMYAKFG +  A+  FD + +KD+IAW
Sbjct: 301 VLQSGENKLGSAFHCYIMKNGFLAFVSVATALLQMYAKFGMVCSARSVFDHIGNKDVIAW 360

Query: 361 SAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQE----LGESIHA 420
           +AMISAY+  + P +A+ TF  M   N +PN  + +SL+  CS + +QE    LG+SIHA
Sbjct: 361 NAMISAYTQSKLPFNAVDTFTQMLHMNAKPNEFSLISLLQMCSLMASQEVSHELGDSIHA 420

Query: 421 HITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNE 480
            I K GYS N  L SAL+DFYC   RVK G+ +FDE+  KDL+CWS++I GYG NG G E
Sbjct: 421 FIEKVGYSRNVYLSSALIDFYCRSGRVKQGKALFDEVPVKDLICWSSLINGYGLNGYGIE 480

Query: 481 ALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMV 540
           AL TFS+ML  G+KPN  +F+S+LSAC+ CGLE EGW WF +M ++YN+TP +AHYACMV
Sbjct: 481 ALETFSNMLDCGIKPNEIIFLSVLSACSHCGLEYEGWNWFYSMKEKYNVTPKLAHYACMV 540

Query: 541 ELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPNNS 593
           +LL+RQG I +A++FVKKM +EPD RIWGA+ AGC+LT G  +I + +V+QL  L+P NS
Sbjct: 541 DLLSRQGNIEQALDFVKKMPMEPDKRIWGAILAGCRLTPGPIEIVEFVVEQLATLDPQNS 599

BLAST of Cp4.1LG08g01180 vs. TrEMBL
Match: A0A0J8D8C3_BETVU (Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g004700 PE=4 SV=1)

HSP 1 Score: 594.7 bits (1532), Expect = 1.2e-166
Identity = 283/569 (49.74%), Positives = 392/569 (68.89%), Query Frame = 1

Query: 36  AYYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRA 95
           ++ +DP+  + + ++V+ WTS I+ LV+  +P  A   FK+M  +  RPNYVT+++VLR+
Sbjct: 45  SFLKDPIENYGEYKEVVYWTSIISGLVKQNRPKDAIAKFKLMLLSEQRPNYVTLVTVLRS 104

Query: 96  IDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWS 155
           I AL +++   V+HG +IKMGF SE+ V TAL+G Y    +   WKLF  +P KD+V+WS
Sbjct: 105 ISALRFKTPAFVVHGLMIKMGFGSEIRVLTALIGVYGNMHLPFAWKLFSDMPMKDLVMWS 164

Query: 156 AVISACVKHGQFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRR 215
           A++S CVK G+F+ A  +FR M  +G+ PN VSIV++LPACA   +L+LGK++H F+++ 
Sbjct: 165 AMVSICVKSGEFMGAIEVFRNMICEGIVPNFVSIVNVLPACAKLDSLALGKQVHGFALKT 224

Query: 216 DFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIF 275
            F+ ++ IQNSL+DMY+KC     + RV   + +KD+V+W+T+   CI+   P KA  IF
Sbjct: 225 SFHYVIYIQNSLIDMYAKCGGFMYAARVFARVARKDVVTWKTMIRGCIETGNPGKALVIF 284

Query: 276 TRMRSFGFELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKF 335
             M     E+   ++ D I      +E   GL +HC++LK G    +SVGT LLQMYAKF
Sbjct: 285 AGMHQSCSEIDCGIICDVIGVTSDSEEFTFGLGLHCYSLKSGLAESVSVGTALLQMYAKF 344

Query: 336 GELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLV 395
           GE+  ++L FD+L  KD+IAW+AMISAY+       A+ TFK MQ T+E+PN ITFVSL+
Sbjct: 345 GEVHPSQLLFDQLHPKDLIAWTAMISAYAQSGYTSEALDTFKHMQLTSEKPNEITFVSLL 404

Query: 396 NACSSLDAQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLV 455
            ACSS   QE GESIH ++TK+GY  N+ L+SAL+D YC   R+K G  +FDE   KDL+
Sbjct: 405 QACSSSGTQEFGESIHGYVTKAGYLPNSFLISALIDLYCKFGRIKQGRALFDEHHIKDLI 464

Query: 456 CWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAM 515
            WS+MI GYG NG  NEAL  FS+ML+ G+KPN  +FVS+LS+C+ CGLE EGW WFN M
Sbjct: 465 IWSSMINGYGLNGFANEALEIFSNMLASGVKPNDVIFVSVLSSCSHCGLEDEGWYWFNCM 524

Query: 516 IDEYNITPTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSD 575
            D+Y I P +AHYACMV+LL+RQG + EAV+FV  M VEPD RIWG+L AGCK THG  D
Sbjct: 525 QDKYGIVPKIAHYACMVDLLSRQGNVEEAVDFVYNMQVEPDKRIWGSLLAGCKSTHGSID 584

Query: 576 IADSIVQQLNALEPNNSDFHAMLHNFCIE 605
           +A+ +VQQL  L+P N+ ++ +L N   E
Sbjct: 585 VAELVVQQLIRLDPKNTSYYVVLSNMYAE 613

BLAST of Cp4.1LG08g01180 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 322.4 bits (825), Expect = 6.0e-88
Identity = 181/561 (32.26%), Positives = 298/561 (53.12%), Query Frame = 1

Query: 49  EDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVM 108
           E  + W   +  L ++G    + G FK M ++G   +  T   V ++  +L      E +
Sbjct: 158 EKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQL 217

Query: 109 HGGVIKMGFESEVAVSTALLGFYSMRD-IGIVWKLFYQIPYKDVVLWSAVISACVKHGQF 168
           HG ++K GF    +V  +L+ FY     +    K+F ++  +DV+ W+++I+  V +G  
Sbjct: 218 HGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLA 277

Query: 169 IEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSL 228
            +   +F +M   G+  +  +IVS+   CAD   +SLG+ +H+  ++  F       N+L
Sbjct: 278 EKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTL 337

Query: 229 MDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFE--- 288
           +DMYSKC +L+++  V + M  + +VS+ ++     +     +A K+F  M   G     
Sbjct: 338 LDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDV 397

Query: 289 LGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLT 348
              T +L+  A   L+DE   G  VH    +      I V   L+ MYAK G +  A+L 
Sbjct: 398 YTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 457

Query: 349 FDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMM-QSTNERPNAITFVSLVNACSSLDA 408
           F E+  KDII+W+ +I  YS       A+  F ++ +     P+  T   ++ AC+SL A
Sbjct: 458 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 517

Query: 409 QELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKG 468
            + G  IH +I ++GY S+  + ++LVD Y     + L   +FD+I +KDLV W+ MI G
Sbjct: 518 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 577

Query: 469 YGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITP 528
           YG +G G EA+  F+ M   G++ +   FVSLL AC+  GL  EGW +FN M  E  I P
Sbjct: 578 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 637

Query: 529 TVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQ 588
           TV HYAC+V++LAR G + +A  F++ M + PD  IWGAL  GC++ H    +A+ + ++
Sbjct: 638 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVK-LAEKVAEK 697

Query: 589 LNALEPNNSDFHAMLHNFCIE 605
           +  LEP N+ ++ ++ N   E
Sbjct: 698 VFELEPENTGYYVLMANIYAE 714
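The percentages on each HSP summary line follow directly from the counts next to them: identical (or similar) positions divided by the aligned length. The sketch below is only an illustration and not part of the database output. The function name parse_hsp_summary and the returned dictionary keys are hypothetical, and the regular expression is an assumption about the line layout shown on this page.

```python
import re

# Pattern for an HSP summary line such as
# "Identity = 181/561 (32.26%), Positives = 298/561 (53.12%), Query Frame = 1".
# "Pos\w*" tolerates spelling variants of the second label.
HSP_LINE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recompute the percentages from the raw counts.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# Values taken from the AT4G18750.1 hit above.
summary = parse_hsp_summary(
    "Identity = 181/561 (32.26%), Positives = 298/561 (53.12%), Query Frame = 1"
)
print(summary["identity_pct"], summary["positives_pct"])
```

For the AT4G18750.1 hit, 181/561 and 298/561 reproduce the reported 32.26% and 53.12%.
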

BLAST of Cp4.1LG08g01180 vs. TAIR10
Match: AT4G30700.1 (AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 303.1 bits (775), Expect = 3.7e-82
Identity = 167/567 (29.45%), Positives = 295/567 (52.03%), Query Frame = 1

Query: 37  YYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFFK-MMFANGHRPNYVTMLSVLRA 96
           YY   +    Q+ DV  +   +        P S+   F  +  +   +PN  T    + A
Sbjct: 69  YYARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISA 128

Query: 97  IDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFY-SMRDIGIVWKLFYQIPYKDVVLW 156
                 +    V+HG  +  G +SE+ + + ++  Y     +    K+F ++P KD +LW
Sbjct: 129 ASGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILW 188

Query: 157 SAVISACVKHGQFIEAFHLFREMQYQG-VHPNHVSIVSILPACADFGALSLGKEIHAFSM 216
           + +IS   K+  ++E+  +FR++  +     +  +++ ILPA A+   L LG +IH+ + 
Sbjct: 189 NTMISGYRKNEMYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLAT 248

Query: 217 RRDFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFK 276
           +   YS   +    + +YSKC  ++    + +  RK D+V++  + H    N     +  
Sbjct: 249 KTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLS 308

Query: 277 IFTRMRSFGFELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYA 336
           +F  +   G  L  + +   ++ V +   L+L  A+H + LK  FL   SV T L  +Y+
Sbjct: 309 LFKELMLSGARLRSSTL---VSLVPVSGHLMLIYAIHGYCLKSNFLSHASVSTALTTVYS 368

Query: 337 KFGELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVS 396
           K  E+  A+  FDE  +K + +W+AMIS Y+       AI  F+ MQ +   PN +T   
Sbjct: 369 KLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITC 428

Query: 397 LVNACSSLDAQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKD 456
           +++AC+ L A  LG+ +H  +  + + S+  + +AL+  Y     +     +FD +  K+
Sbjct: 429 ILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKN 488

Query: 457 LVCWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFN 516
            V W+TMI GYG +G G EALN F +ML+ G+ P    F+ +L AC+  GL KEG   FN
Sbjct: 489 EVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFN 548

Query: 517 AMIDEYNITPTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGF 576
           +MI  Y   P+V HYACMV++L R G ++ A++F++ M++EP + +W  L   C++ H  
Sbjct: 549 SMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRI-HKD 608

Query: 577 SDIADSIVQQLNALEPNNSDFHAMLHN 601
           +++A ++ ++L  L+P+N  +H +L N
Sbjct: 609 TNLARTVSEKLFELDPDNVGYHVLLSN 631

BLAST of Cp4.1LG08g01180 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 302.0 bits (772), Expect = 8.3e-82
Identity = 161/550 (29.27%), Positives = 294/550 (53.45%), Query Frame = 1

Query: 52  ISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGG 111
           + + + +    +    D A  FF  M  +   P       +L+     +     + +HG 
Sbjct: 101 VLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGL 160

Query: 112 VIKMGFESEVAVSTALLGFYSM-RDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEA 171
           ++K GF  ++   T L   Y+  R +    K+F ++P +D+V W+ +++   ++G    A
Sbjct: 161 LVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMA 220

Query: 172 FHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDM 231
             + + M  + + P+ ++IVS+LPA +    +S+GKEIH ++MR  F S+VNI  +L+DM
Sbjct: 221 LEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDM 280

Query: 232 YSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMM 291
           Y+KC +LE + ++   M ++++VSW ++  A +QN  P +A  IF +M   G +  +  +
Sbjct: 281 YAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSV 340

Query: 292 LDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVD 351
           +  + A   + +L  G  +H  +++ G    +SV   L+ MY K  E+  A   F +L  
Sbjct: 341 MGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQS 400

Query: 352 KDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESI 411
           + +++W+AMI  ++    P+ A+  F  M+S   +P+  T+VS++ A + L      + I
Sbjct: 401 RTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWI 460

Query: 412 HAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCG 471
           H  + +S    N  + +ALVD Y     + +   +FD +  + +  W+ MI GYGT+G G
Sbjct: 461 HGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFG 520

Query: 472 NEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYAC 531
             AL  F +M    +KPNG  F+S++SAC+  GL + G   F  M + Y+I  ++ HY  
Sbjct: 521 KAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGA 580

Query: 532 MVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPN 591
           MV+LL R G++ EA +F+ +M V+P   ++GA+   C++ H   + A+   ++L  L P+
Sbjct: 581 MVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQI-HKNVNFAEKAAERLFELNPD 640

Query: 592 NSDFHAMLHN 601
           +  +H +L N
Sbjct: 641 DGGYHVLLAN 649

BLAST of Cp4.1LG08g01180 vs. TAIR10
Match: AT4G39952.1 (AT4G39952.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 300.8 bits (769), Expect = 1.9e-81
Identity = 172/556 (30.94%), Positives = 290/556 (52.16%), Query Frame = 1

Query: 50  DVISWTSKITNLVRTGQPDSAFGFFKMMFANGH---RPNYVTMLSVLRAIDALSWESTIE 109
           DV++WT+ I+  V+ G+ +   G+   M + G    +PN  T+    +A   L       
Sbjct: 191 DVVAWTAIISGHVQNGESEGGLGYLCKMHSAGSDVDKPNPRTLECGFQACSNLGALKEGR 250

Query: 110 VMHGGVIKMGFESEVAVSTALLGFYSMR-DIGIVWKLFYQIPYKDVVLWSAVISACVKHG 169
            +HG  +K G  S   V +++  FYS   +    +  F ++  +D+  W+++I++  + G
Sbjct: 251 CLHGFAVKNGLASSKFVQSSMFSFYSKSGNPSEAYLSFRELGDEDMFSWTSIIASLARSG 310

Query: 170 QFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQN 229
              E+F +F EMQ +G+HP+ V I  ++        +  GK  H F +R  F     + N
Sbjct: 311 DMEESFDMFWEMQNKGMHPDGVVISCLINELGKMMLVPQGKAFHGFVIRHCFSLDSTVCN 370

Query: 230 SLMDMYSKCRNLEASIRVL-KTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFE 289
           SL+ MY K   L  + ++  +   + +  +W T+     +  C  K  ++F ++++ G E
Sbjct: 371 SLLSMYCKFELLSVAEKLFCRISEEGNKEAWNTMLKGYGKMKCHVKCIELFRKIQNLGIE 430

Query: 290 LGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLT 349
           +        I++   +  +LLG ++HC+ +K      ISV   L+ +Y K G+L +A   
Sbjct: 431 IDSASATSVISSCSHIGAVLLGKSLHCYVVKTSLDLTISVVNSLIDLYGKMGDLTVAWRM 490

Query: 350 FDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQ 409
           F E  D ++I W+AMI++Y H E+   AI  F  M S N +P++IT V+L+ AC +  + 
Sbjct: 491 FCE-ADTNVITWNAMIASYVHCEQSEKAIALFDRMVSENFKPSSITLVTLLMACVNTGSL 550

Query: 410 ELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGY 469
           E G+ IH +IT++ +  N  L +AL+D Y     ++    +FD    KD VCW+ MI GY
Sbjct: 551 ERGQMIHRYITETEHEMNLSLSAALIDMYAKCGHLEKSRELFDAGNQKDAVCWNVMISGY 610

Query: 470 GTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPT 529
           G +G    A+  F  M    +KP G  F++LLSAC   GL ++G   F  M  +Y++ P 
Sbjct: 611 GMHGDVESAIALFDQMEESDVKPTGPTFLALLSACTHAGLVEQGKKLFLKM-HQYDVKPN 670

Query: 530 VAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQL 589
           + HY+C+V+LL+R G + EA   V  M   PD  IWG L + C +THG  ++   + ++ 
Sbjct: 671 LKHYSCLVDLLSRSGNLEEAESTVMSMPFSPDGVIWGTLLSSC-MTHGEFEMGIRMAERA 730

Query: 590 NALEPNNSDFHAMLHN 601
            A +P N  ++ ML N
Sbjct: 731 VASDPQNDGYYIMLAN 743

BLAST of Cp4.1LG08g01180 vs. TAIR10
Match: AT3G63370.1 (AT3G63370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 294.7 bits (753), Expect = 1.3e-79
Identity = 168/562 (29.89%), Positives = 285/562 (50.71%), Query Frame = 1

Query: 45  FAQQEDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWEST 104
           F ++ D + W S +++   +G+       F+ M   G  PN  T++S L A D  S+   
Sbjct: 243 FQEKGDAVLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKL 302

Query: 105 IEVMHGGVIKMG-FESEVAVSTALLGFYSM-RDIGIVWKLFYQIPYKDVVLWSAVISACV 164
            + +H  V+K     SE+ V  AL+  Y+    +    ++  Q+   DVV W+++I   V
Sbjct: 303 GKEIHASVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYV 362

Query: 165 KHGQFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVN 224
           ++  + EA   F +M   G   + VS+ SI+ A      L  G E+HA+ ++  + S + 
Sbjct: 363 QNLMYKEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQ 422

Query: 225 IQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFG 284
           + N+L+DMYSKC       R    M  KD++SW TV     QN+C  +A ++F  +    
Sbjct: 423 VGNTLIDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKR 482

Query: 285 FELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAK 344
            E+ E ++   + A  ++  +L+   +HCH L+ G L  + +  EL+ +Y K   +G A 
Sbjct: 483 MEIDEMILGSILRASSVLKSMLIVKEIHCHILRKGLLDTV-IQNELVDVYGKCRNMGYAT 542

Query: 345 LTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLD 404
             F+ +  KD+++W++MIS+ +       A++ F+ M  T    +++  + +++A +SL 
Sbjct: 543 RVFESIKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLS 602

Query: 405 AQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIK 464
           A   G  IH ++ + G+     +  A+VD Y     ++  + VFD I  K L+ +++MI 
Sbjct: 603 ALNKGREIHCYLLRKGFCLEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMIN 662

Query: 465 GYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNIT 524
            YG +GCG  A+  F  M    + P+   F++LL AC+  GL  EG  +   M  EY + 
Sbjct: 663 AYGMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELE 722

Query: 525 PTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQ 584
           P   HY C+V++L R   + EA EFVK M  EP   +W AL A C+ +H   +I +   Q
Sbjct: 723 PWPEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACR-SHSEKEIGEIAAQ 782

Query: 585 QLNALEPNNSDFHAMLHNFCIE 605
           +L  LEP N     ++ N   E
Sbjct: 783 RLLELEPKNPGNLVLVSNVFAE 802

BLAST of Cp4.1LG08g01180 vs. NCBI nr
Match: gi|659077727|ref|XP_008439351.1| (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g01580 [Cucumis melo])

HSP 1 Score: 941.0 bits (2431), Expect = 1.0e-270
Identity = 462/571 (80.91%), Positives = 506/571 (88.62%), Query Frame = 1

Query: 1   MLCFSRCARYLFVKSPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITN 60
           MLCFSRCAR LFV SP RKNYTI  ++DATS KK+ Y+EDPV F+AQ+EDVISWTSKITN
Sbjct: 1   MLCFSRCARNLFVISPNRKNYTIRSMMDATSTKKRGYFEDPVEFYAQREDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120
           LVR GQP+SAFGFFKMMF+NGHRPNYVTMLSV+RAIDALSW+S IEVMHG  IKMGFESE
Sbjct: 61  LVRAGQPESAFGFFKMMFSNGHRPNYVTMLSVIRAIDALSWDSMIEVMHGVTIKMGFESE 120

Query: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREMQYQ 180
           VAVSTALLGFYS+RDI  VWKLF QIP KDVV WSA+ISACVK+GQ+ EAF L REMQ Q
Sbjct: 121 VAVSTALLGFYSIRDIETVWKLFNQIPCKDVVFWSAIISACVKNGQYSEAFDLLREMQDQ 180

Query: 181 GVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240
           GV PN VSIVSILPACADFG LSLGKE+HAFSMR+DFYS+V+IQNSLMDMYSKCR  EAS
Sbjct: 181 GVQPNQVSIVSILPACADFGVLSLGKELHAFSMRKDFYSMVDIQNSLMDMYSKCRMFEAS 240

Query: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300
           I+VLK MRKKD VSW+ +THACIQNN PS+ FKIF+RMRS GFEL ETM+LD I+AVLLV
Sbjct: 241 IKVLKLMRKKDAVSWKIITHACIQNNYPSEVFKIFSRMRSLGFELSETMVLDMISAVLLV 300

Query: 301 DELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360
           DELLLGLAVHC+ALKGGFLCFI VGTELLQMYAKFG+L LAKL FDELVDKDIIAWSAMI
Sbjct: 301 DELLLGLAVHCYALKGGFLCFILVGTELLQMYAKFGDLRLAKLVFDELVDKDIIAWSAMI 360

Query: 361 SAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYS 420
           S YSHGE+PL+AIQTFKMMQSTNE+PN  TFVSL++ACSSL A+ELGESI AH  K GY+
Sbjct: 361 SVYSHGEDPLNAIQTFKMMQSTNEKPNERTFVSLMDACSSLGAKELGESIQAHTIKCGYT 420

Query: 421 SNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDM 480
           SNT LMSALV FYC L R+KLGEHVFDEI TKDL+CW+ MIKGYG NGCGN+ALNTFSDM
Sbjct: 421 SNTHLMSALVGFYCTLGRIKLGEHVFDEISTKDLICWNAMIKGYGLNGCGNKALNTFSDM 480

Query: 481 LSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMVELLARQGK 540
           LSYGLKPNG +F SLLSACAQCGLEKE  MWF +MID+Y ITPT AHYAC+V+LL R+GK
Sbjct: 481 LSYGLKPNGVVFASLLSACAQCGLEKEVRMWFRSMIDKYGITPTEAHYACIVDLLVRKGK 540

Query: 541 IREAVEFVKKMAVEPDTRIWGALFAGCKLTH 572
           I EAVEFVK M VEPDTRIWGAL  GCKLTH
Sbjct: 541 IGEAVEFVKXMPVEPDTRIWGALLLGCKLTH 571

BLAST of Cp4.1LG08g01180 vs. NCBI nr
Match: gi|700201385|gb|KGN56518.1| (hypothetical protein Csa_3G122420 [Cucumis sativus])

HSP 1 Score: 909.1 bits (2348), Expect = 4.2e-261
Identity = 446/550 (81.09%), Positives = 493/550 (89.64%), Query Frame = 1

Query: 1   MLCFSRCARYLFVKSPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITN 60
           MLCFSRCAR LFV S +RK+YTI  ++DATS KK+ Y+EDPV F+AQ+EDVISWTSKITN
Sbjct: 1   MLCFSRCARNLFVVSSKRKDYTIRSMVDATSTKKRVYFEDPVEFYAQREDVISWTSKITN 60

Query: 61  LVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESE 120
           LVRTGQP+SAFGFFKMMF+NGHRPNYVTMLSV+RAIDALSW+S IEVMHG VIKMGFESE
Sbjct: 61  LVRTGQPESAFGFFKMMFSNGHRPNYVTMLSVIRAIDALSWDSMIEVMHGVVIKMGFESE 120

Query: 121 VAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREMQYQ 180
           VAVSTALLGFYS+RDI  VWKLF QIP KDVVLWSA+IS CVK+GQ+ EAF L REMQ Q
Sbjct: 121 VAVSTALLGFYSIRDIETVWKLFNQIPSKDVVLWSAIISVCVKNGQYNEAFDLLREMQDQ 180

Query: 181 GVHPNHVSIVSILPACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEAS 240
           GV PN V+IVSILPACADFG LSLGKE+HAFSMRRDFYS+V++QNSLMDMYSKCR  EAS
Sbjct: 181 GVQPNQVTIVSILPACADFGVLSLGKELHAFSMRRDFYSMVDLQNSLMDMYSKCRKFEAS 240

Query: 241 IRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIFTRMRSFGFELGETMMLDFIAAVLLV 300
           IRVLK MRKKD VSW+ +THACIQNNCPSK FKIF+RMRSFGFEL ETMMLD I+AVLL+
Sbjct: 241 IRVLKLMRKKDAVSWKIITHACIQNNCPSKVFKIFSRMRSFGFELSETMMLDMISAVLLL 300

Query: 301 DELLLGLAVHCHALKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMI 360
           DELLLGLAVHC+ALKGGFLCFI VGTELLQMYAKFG+L LAKL FD LVDKDIIAWSAMI
Sbjct: 301 DELLLGLAVHCYALKGGFLCFILVGTELLQMYAKFGDLRLAKLVFDGLVDKDIIAWSAMI 360

Query: 361 SAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLVNACSSLDAQELGESIHAHITKSGYS 420
           SAYSHGE+PL+AIQTFKMMQSTNE+PN  TFVSL++ACSSL A+ELGE+I AH  K GY+
Sbjct: 361 SAYSHGEDPLNAIQTFKMMQSTNEKPNERTFVSLMDACSSLGAKELGETIQAHTIKCGYT 420

Query: 421 SNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDM 480
           SNT LMSALV FYC L R+KLGEHVFDEI  KD++CW+ +IKGYG NGCGN+ALNTFSDM
Sbjct: 421 SNTHLMSALVGFYCKLGRIKLGEHVFDEISRKDVICWNALIKGYGLNGCGNKALNTFSDM 480

Query: 481 LSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMVELLARQGK 540
           LSYGLKPNG +F SLLSACAQCGLEKE  MWF +M DEY ITPT+AHYAC+V+LL RQGK
Sbjct: 481 LSYGLKPNGVVFASLLSACAQCGLEKEVRMWFRSMNDEYGITPTMAHYACIVDLLVRQGK 540

Query: 541 IREAVEFVKK 551
           IREAVEFVKK
Sbjct: 541 IREAVEFVKK 550

BLAST of Cp4.1LG08g01180 vs. NCBI nr
Match: gi|147834193|emb|CAN75306.1| (hypothetical protein VITISV_040403 [Vitis vinifera])

HSP 1 Score: 683.7 bits (1763), Expect = 2.9e-193
Identity = 324/569 (56.94%), Positives = 424/569 (74.52%), Query Frame = 1

Query: 36  AYYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRA 95
           AYYE+PV F  ++++VISWTSKI++LV+  Q + A G FKMM     RPN+VT+LSV+RA
Sbjct: 39  AYYEEPVEFHGEKDNVISWTSKISSLVKQNQSELAVGLFKMMLMTEQRPNHVTVLSVIRA 98

Query: 96  IDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWS 155
           I  L  E  + V+ G VIK+GFESEV+V+TAL+GFYS  D+GIVWK+F Q P KD+VLWS
Sbjct: 99  ISGLGLEDMMRVICGSVIKLGFESEVSVATALIGFYSDYDMGIVWKIFNQTPIKDLVLWS 158

Query: 156 AVISACVKHGQFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRR 215
           A++SACVK GQ+ EAF +FR MQY GV PNHVSIVSILPACA+ GAL  GKEIH FS+++
Sbjct: 159 AMVSACVKSGQYGEAFEIFRAMQYDGVEPNHVSIVSILPACANVGALLFGKEIHGFSIKK 218

Query: 216 DFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIF 275
            F+ + N+ NSL+DMY+KCRN +AS+ V   + +KD++SW T+   CI+N+CP +AFK F
Sbjct: 219 MFHPLTNVHNSLVDMYAKCRNFKASMLVFDQILEKDLISWTTIIRGCIENDCPREAFKAF 278

Query: 276 TRMRSFGFELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKF 335
           +RM+   F   ET++ D I A++  DE   G+A H   LK G L F+S+GT LLQMYAKF
Sbjct: 279 SRMQFSCFGADETIVQDLIVAIIQADEHKFGIAFHGFLLKNGLLAFVSIGTALLQMYAKF 338

Query: 336 GELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLV 395
           GEL  A + FD+L  KD I+WSAMIS ++H   P +A++TFK MQST+ERPN ITFVSL+
Sbjct: 339 GELESAIIVFDQLNKKDYISWSAMISVHAHSRHPYNALETFKQMQSTDERPNEITFVSLL 398

Query: 396 NACSSLDAQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLV 455
            ACS + AQELGESI AH TK+GY SN  L SAL+D YC   R+  G  +F+EI TKDLV
Sbjct: 399 QACSLIGAQELGESIQAHATKAGYLSNAFLSSALIDLYCKFGRINQGRAIFNEIPTKDLV 458

Query: 456 CWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAM 515
           CWS+MI GYG NGCG+EAL TFS+ML+ G+KPN  +F+S+LSAC+ CGLE EGW  F++M
Sbjct: 459 CWSSMINGYGLNGCGDEALETFSNMLACGVKPNEVVFISVLSACSHCGLEHEGWSCFSSM 518

Query: 516 IDEYNITPTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSD 575
             +Y I P + HYACMV+L++R+G I  A++FV KM +EPD RIWGAL AGC+ THG  +
Sbjct: 519 EQKYGIIPKLPHYACMVDLISRRGNIEGALQFVNKMPMEPDKRIWGALLAGCRSTHGSIE 578

Query: 576 IADSIVQQLNALEPNNSDFHAMLHNFCIE 605
           IA+ + ++L  L+P N+ ++ +L N   E
Sbjct: 579 IAELVAERLIGLDPQNTSYYVILSNLYAE 607

BLAST of Cp4.1LG08g01180 vs. NCBI nr
Match: gi|731421553|ref|XP_010661790.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Vitis vinifera])

HSP 1 Score: 683.7 bits (1763), Expect = 2.9e-193
Identity = 324/569 (56.94%), Positives = 424/569 (74.52%), Query Frame = 1

Query: 36  AYYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFFKMMFANGHRPNYVTMLSVLRA 95
           AYYE+PV F  ++++VISWTSKI++LV+  Q + A G FKMM     RPN+VT+LSV+RA
Sbjct: 39  AYYEEPVEFHGEKDNVISWTSKISSLVKQNQSELAVGLFKMMLMTEQRPNHVTVLSVIRA 98

Query: 96  IDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFYSMRDIGIVWKLFYQIPYKDVVLWS 155
           I  L  E  + V+ G VIK+GFESEV+V+TAL+GFYS  D+GIVWK+F Q P KD+VLWS
Sbjct: 99  ISGLGLEDMMRVICGSVIKLGFESEVSVATALIGFYSDYDMGIVWKIFNQTPIKDLVLWS 158

Query: 156 AVISACVKHGQFIEAFHLFREMQYQGVHPNHVSIVSILPACADFGALSLGKEIHAFSMRR 215
           A++SACVK GQ+ EAF +FR MQY GV PNHVSIVSILPACA+ GAL  GKEIH FS+++
Sbjct: 159 AMVSACVKSGQYGEAFEIFRAMQYDGVEPNHVSIVSILPACANVGALLFGKEIHGFSIKK 218

Query: 216 DFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVSWRTVTHACIQNNCPSKAFKIF 275
            F+ + N+ NSL+DMY+KCRN +AS+ V   + +KD++SW T+   CI+N+CP +AFK F
Sbjct: 219 MFHPLTNVHNSLVDMYAKCRNFKASMLVFDQILEKDLISWTTIIRGCIENDCPREAFKAF 278

Query: 276 TRMRSFGFELGETMMLDFIAAVLLVDELLLGLAVHCHALKGGFLCFISVGTELLQMYAKF 335
           +RM+   F   ET++ D I A++  DE   G+A H   LK G L F+S+GT LLQMYAKF
Sbjct: 279 SRMQFSCFGADETIVQDLIVAIIQADEHKFGIAFHGFLLKNGLLAFVSIGTALLQMYAKF 338

Query: 336 GELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAIQTFKMMQSTNERPNAITFVSLV 395
           GEL  A + FD+L  KD I+WSAMIS ++H   P +A++TFK MQST+ERPN ITFVSL+
Sbjct: 339 GELESAIIVFDQLNKKDYISWSAMISVHAHSRHPYNALETFKQMQSTDERPNEITFVSLL 398

Query: 396 NACSSLDAQELGESIHAHITKSGYSSNTCLMSALVDFYCILRRVKLGEHVFDEIVTKDLV 455
            ACS + AQELGESI AH TK+GY SN  L SAL+D YC   R+  G  +F+EI TKDLV
Sbjct: 399 QACSLIGAQELGESIQAHATKAGYLSNAFLSSALIDLYCKFGRINQGRAIFNEIPTKDLV 458

Query: 456 CWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNGTLFVSLLSACAQCGLEKEGWMWFNAM 515
           CWS+MI GYG NGCG+EAL TFS+ML+ G+KPN  +F+S+LSAC+ CGLE EGW  F++M
Sbjct: 459 CWSSMINGYGLNGCGDEALETFSNMLACGVKPNEVVFISVLSACSHCGLEHEGWSCFSSM 518

Query: 516 IDEYNITPTVAHYACMVELLARQGKIREAVEFVKKMAVEPDTRIWGALFAGCKLTHGFSD 575
             +Y I P + HYACMV+L++R+G I  A++FV KM +EPD RIWGAL AGC+ THG  +
Sbjct: 519 EQKYGIIPKLPHYACMVDLISRRGNIEGALQFVNKMPMEPDKRIWGALLAGCRSTHGSIE 578

Query: 576 IADSIVQQLNALEPNNSDFHAMLHNFCIE 605
           IA+ + ++L  L+P N+ ++ +L N   E
Sbjct: 579 IAELVAERLIGLDPQNTSYYVILSNLYAE 607

BLAST of Cp4.1LG08g01180 vs. NCBI nr
Match: gi|590722123|ref|XP_007051809.1| (Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao])

HSP 1 Score: 613.6 bits (1581), Expect = 3.7e-172
Identity = 305/589 (51.78%), Positives = 411/589 (69.78%), Query Frame = 1

Query: 15   SPQRKNYTIGLVIDATSAKKQAYYEDPVGFFAQQEDVISWTSKITNLVRTGQPDSAFGFF 74
            SP R   ++ L   +T   +   ++DPV  +A ++ VISWTS ++ LVR GQP+ A G F
Sbjct: 668  SPTR--ISVALSRYSTLCYRNPNHDDPVDPYADKDHVISWTSVLSKLVRQGQPEEAIGLF 727

Query: 75   KMMFANGHRPNYVTMLSVLRAIDALSWESTIEVMHGGVIKMGFESEVAVSTALLGFYSMR 134
            K M  +  RPNYVT+LS+++A D L WE+   ++HG VIKMGFESE +V TAL+G YS+ 
Sbjct: 728  KTMLMSNQRPNYVTILSLVKAFDTLDWEALRMMVHGLVIKMGFESEPSVLTALIGSYSVY 787

Query: 135  DIGIVWKLFYQIPYKDVVLWSAVISACVKHGQFIEAFHLFREMQYQGVHPNHVSIVSILP 194
             +G+ W LF QIP KDVVL SA++SACVK+G ++EA  LFR MQ  G+  NHVSIVSILP
Sbjct: 788  GMGVCWSLFNQIPNKDVVLRSAMVSACVKNGDYVEALELFRRMQVLGLKANHVSIVSILP 847

Query: 195  ACADFGALSLGKEIHAFSMRRDFYSIVNIQNSLMDMYSKCRNLEASIRVLKTMRKKDMVS 254
            ACA+ GAL LG+EIH F +RR    +  +QNSL+DMY+KCR+L+ +I V   M KKD+VS
Sbjct: 848  ACANLGALQLGREIHGFIIRRMICYVNTVQNSLVDMYAKCRSLQTAICVFNGMLKKDLVS 907

Query: 255  WRTVTHACIQNNCPSKAFKIFTRMRSFG-FELGETMMLDFIAAVLLVDELLLGLAVHCHA 314
            WRT+    ++N C  KA   F++M+    F L E ++ D I AVL   E  +G A HC+ 
Sbjct: 908  WRTLIRGYVENECGIKALDAFSKMQRLSFFALDEFVVRDMIMAVLQSGESKIGSAFHCYI 967

Query: 315  LKGGFLCFISVGTELLQMYAKFGELGLAKLTFDELVDKDIIAWSAMISAYSHGEEPLSAI 374
            LK GFL F+S+ T LLQMYAKF  +  A+  FD + +KD+IAW+AMISAY+    P +AI
Sbjct: 968  LKTGFLAFVSIATALLQMYAKFSMVASARNVFDHISNKDVIAWNAMISAYAQTGLPFNAI 1027

Query: 375  QTFKMMQSTNERPNAITFVSLVNACSSLDAQE----LGESIHAHITKSGYSSNTCLMSAL 434
             TF+ M   NE+P+  + VSL+  CS + +QE    +GE+IHA + K GYS N  L SAL
Sbjct: 1028 NTFRQMLLMNEKPSEFSLVSLLQICSLMASQEVSDKVGETIHAFVAKVGYSRNVYLSSAL 1087

Query: 435  VDFYCILRRVKLGEHVFDEIVTKDLVCWSTMIKGYGTNGCGNEALNTFSDMLSYGLKPNG 494
            +DFYC   RVK G+ +FDE+ TKDL+CWS+MI GY  NG G EAL TF++ML  G+KPN 
Sbjct: 1088 IDFYCRFGRVKQGKALFDEVPTKDLICWSSMINGYVLNGYGIEALETFANMLDCGIKPND 1147

Query: 495  TLFVSLLSACAQCGLEKEGWMWFNAMIDEYNITPTVAHYACMVELLARQGKIREAVEFVK 554
             +F+S+LSAC+ CGL+ EGW WF +M ++Y ITP +AHYACMV+LL+RQG I +A+ FVK
Sbjct: 1148 IIFLSVLSACSHCGLKNEGWNWFYSMKEKYGITPKLAHYACMVDLLSRQGHIEQALHFVK 1207

Query: 555  KMAVEPDTRIWGALFAGCKLTHGFSDIADSIVQQLNALEPNNSDFHAML 599
            KM +EPD RIWGAL AGC+++ G   I + +V++L+ L+P NS  + M+
Sbjct: 1208 KMPMEPDKRIWGALLAGCRVSPGPIKIVEFVVERLSTLDPQNSTHYYMI 1254

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP320_ARATH  1.1e-86   32.26         Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PP341_ARATH  6.6e-81   29.45         Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN...
PPR32_ARATH  1.5e-80   29.27         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP359_ARATH  3.3e-80   30.94         Pentatricopeptide repeat-containing protein At4g39952, mitochondrial OS=Arabidop...
PP296_ARATH  2.4e-78   29.89         Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...
Match Name        E-value    Identity (%)  Description
A0A0A0L489_CUCSA  2.9e-261   81.09         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G122420 PE=4 SV=1
A5C4V9_VITVI      2.0e-193   56.94         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_040403 PE=4 SV=1
A0A061DV61_THECC  2.6e-172   51.78         Basic helix-loop-helix DNA-binding superfamily protein OS=Theobroma cacao GN=TCM...
A0A0D2RE34_GOSRA  1.7e-171   52.00         Uncharacterized protein OS=Gossypium raimondii GN=B456_008G121600 PE=4 SV=1
A0A0J8D8C3_BETVU  1.2e-166   49.74         Uncharacterized protein OS=Beta vulgaris subsp. vulgaris GN=BVRB_1g004700 PE=4 S...
Match Name   E-value   Identity (%)  Description
AT4G18750.1  6.0e-88   32.26         Pentatricopeptide repeat (PPR) superfamily protein
AT4G30700.1  3.7e-82   29.45         Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1  8.3e-82   29.27         Pentatricopeptide repeat (PPR) superfamily protein
AT4G39952.1  1.9e-81   30.94         Pentatricopeptide repeat (PPR) superfamily protein
AT3G63370.1  1.3e-79   29.89         Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value    Identity (%)  Description
gi|659077727|ref|XP_008439351.1|  1.0e-270   80.91         PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
gi|700201385|gb|KGN56518.1|       4.2e-261   81.09         hypothetical protein Csa_3G122420 [Cucumis sativus]
gi|147834193|emb|CAN75306.1|      2.9e-193   56.94         hypothetical protein VITISV_040403 [Vitis vinifera]
gi|731421553|ref|XP_010661790.1|  2.9e-193   56.94         PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Vitis...
gi|590722123|ref|XP_007051809.1|  3.7e-172   51.78         Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding
molecular_function GO:0005488 binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG08g01180.1  Cp4.1LG08g01180.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term  Source Description
    Alignment

IPR002885  Pentatricopeptide repeat  PFAM      PF01535      PPR
    coord: 253..283  score: 0.0016
    coord: 528..551  score: 0.18
    coord: 225..251  score: 0.037
    coord: 53..81    score:

IPR002885  Pentatricopeptide repeat  PFAM      PF13041      PPR_2
    coord: 149..197  score: 5.0E-10
    coord: 452..500  score: 5.1E-8
    coord: 351..399  score: 2.

IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756    TIGR00756
    coord: 253..285  score: 4.4E-4
    coord: 152..185  score: 4.9E-7
    coord: 455..488  score: 5.2E-7
    coord: 355..388  score: 9.

IPR002885  Pentatricopeptide repeat  PROFILE   PS51375      PPR
    coord: 352..386  score: 9.339
    coord: 387..421  score: 6.752
    coord: 422..452  score: 6.018
    coord: 220..250  score: 6.873
    coord: 150..184  score: 12.474
    coord: 251..285  score: 9.591
    coord: 524..554  score: 7.563
    coord: 453..487  score: 11.224
    coord: 488..523  score: 7.004
    coord: 50..84    score: 10

None       No IPR available          PANTHER   PTHR24015    FAMILY NOT NAMED
    coord: 322..600  score: 2.2E-205
    coord: 50..286   score: 2.2E
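
The "coord: start..end" entries in the InterPro alignment column are residue ranges on the protein, so the length of each predicted repeat can be read off them. The following is a minimal sketch, assuming inclusive 1-based coordinates; the helper name parse_coords and the sample string (coordinates copied from the PROFILE PS51375 row, scores left out) are illustrative only.

```python
import re

# Matches entries of the form "coord: 352..386".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return (start, end) pairs found in text, sorted by start position."""
    return sorted((int(start), int(end)) for start, end in COORD.findall(text))


# Coordinates of the PS51375 (PPR profile) matches listed above.
ps51375 = (
    "coord: 352..386 coord: 387..421 coord: 422..452 coord: 220..250 "
    "coord: 150..184 coord: 251..285 coord: 524..554 coord: 453..487 "
    "coord: 488..523 coord: 50..84"
)

spans = parse_coords(ps51375)
# Inclusive coordinates: a span a..b covers b - a + 1 residues.
lengths = [end - start + 1 for start, end in spans]

for (start, end), length in zip(spans, lengths):
    print(f"{start}..{end}\t{length} aa")
```

Sorting by start position puts the ten profile matches in order along the protein, which makes runs of adjacent repeats easy to spot.
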

The following gene(s) are paralogous to this gene:

None