Cp4.1LG01g19790 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g19790
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing family protein
Location: Cp4.1LG01 : 17001332 .. 17002945 (+)
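The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this), and the helper names `parse_location` and `span_length` are hypothetical, introduced only for illustration.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01 : 17001332 .. 17002945 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the span is 1614 bases, which is divisible by three, as one would expect if the whole span is coding.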
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATTTCTTCGCCAAAAACTGTCAACATTTATTCTCTCGTCGTCCAAAATTTCCTTCTGTTCATCCATACGAAAACTTAACTCAATTACAGCTGCAGATGAACTCATCAACAACGATGCAATCGTAAATTCAATATGCGATTCATTCACAAGACGCGAAAGCTGGGACACTCTGACTCGAAAATTCGAGACTCTGGAACTAAACGATTCGTTGGTCCAAAAAGTCTTGCTCAAATTCCAGCAGCTTGTTGATGCCAAACGCGCTCTAGGGTTCTTCCACTGGTCGGCCAAGAGGAAGAATTTCAATCATGGGTTTCAATCTTATGGTATTATGATTCATATTCTAGTGAAAGCTCGGTTGGTTATCGATGCTCGAGCTTTGCTCGAATCAATTTTGAAGAAAAATGAAGGTAGCTCTTTCAATTTCTCTATTGTAGATTCTCTATTGGATACTTATGAGGTGACTGATTCATCCCCATTTGTGTTCGATTTATTGATTCAAACTTGTGCTAAATTGAGATTGATTGATTTTGCTTTGAATATGTGTGCCCATTTGGAGGAACGTGGGTTCTCGTTGAGTTTGATAAGTTTCAATACTTTGCTTCATGTTGTGGAAAAATCTGATGAGAATCGTAAGGTTTGGAAGATTTATGAGCAAATGATTAGGAAAAGAGTTTACCCCAATGTGATTACTGTTAGGATCATGATCAATTCGTTGTGTAAGGAAGGTAAATTGCAAGAAATTTCTGATATGTTGAGTAGAATCCATGGCAGTCGTTGCTCTGCTTCTTTGATTGTCAATGGCTGTTTGATTTACAGGATTTTGGAGGAGGGGAGAGTTGAAGATGGTGTAATGTTGTTGAAGAGAATGTTGCAGAAAAACATGATTCTTGACGATATTGCTTATTCATTGATTGTTTATGCTAAACTGAAAATTGGGAATATAAAATCTGCACAGGAAGTGTTTGATGAAATGTCTAAAAGAGGATTTCAGGCAAATTCTTTCATTTATACATTGTTCATTGGCGCCCATTGTAGAGATGGAAGGATTGAAGAAGCTCATTGCTTGATGGAAGAGATGGAAAACATGGGTTTGAAGCCATATCCTGAAACCTTCAATCTTCTCATTGAAGGGTGTAGAGATTCAGAAGAAAGCTTGAGAATGTGTGAGAAAATGTTAGAAAGAGGGTTTGTTCCCAGTTGTTCATCTTTCAATGTGGCAATAGCTAAGATTTGTGAGGAAGGAGATGTGAAAAAGGCTAATGAAATGTTAACCATTTTATTAGATAAAGGGTTCTTGCCTGATGAAACCACTTACACCAATCTAATCATAGGATATGGGAAAATTGGTGAAACTCAGGAGATTCTTAAGCTATATTATGAAATGGATGCTAGATTACTTTCTCCTGGCGTGTCGGTTTTCTTTGCACTGATTGGGAGCTTTTGTCAATCTGGGAGACTGGAAGAAGCAGAGAAATATTTGAAGATTATGAAAGATAGGTCTATACAACCAAGTGTATCTATATATCAAACATTATCCTTGTTCTATTTGAAGAAGGGTAATAGAGCAAAGGCTCTAGAACATTATAATGAGATGATGTTCAATGGATAG

mRNA sequence

ATGGCATTTCTTCGCCAAAAACTGTCAACATTTATTCTCTCGTCGTCCAAAATTTCCTTCTGTTCATCCATACGAAAACTTAACTCAATTACAGCTGCAGATGAACTCATCAACAACGATGCAATCGTAAATTCAATATGCGATTCATTCACAAGACGCGAAAGCTGGGACACTCTGACTCGAAAATTCGAGACTCTGGAACTAAACGATTCGTTGGTCCAAAAAGTCTTGCTCAAATTCCAGCAGCTTGTTGATGCCAAACGCGCTCTAGGGTTCTTCCACTGGTCGGCCAAGAGGAAGAATTTCAATCATGGGTTTCAATCTTATGGTATTATGATTCATATTCTAGTGAAAGCTCGGTTGGTTATCGATGCTCGAGCTTTGCTCGAATCAATTTTGAAGAAAAATGAAGGTAGCTCTTTCAATTTCTCTATTGTAGATTCTCTATTGGATACTTATGAGGTGACTGATTCATCCCCATTTGTGTTCGATTTATTGATTCAAACTTGTGCTAAATTGAGATTGATTGATTTTGCTTTGAATATGTGTGCCCATTTGGAGGAACGTGGGTTCTCGTTGAGTTTGATAAGTTTCAATACTTTGCTTCATGTTGTGGAAAAATCTGATGAGAATCGTAAGGTTTGGAAGATTTATGAGCAAATGATTAGGAAAAGAGTTTACCCCAATGTGATTACTGTTAGGATCATGATCAATTCGTTGTGTAAGGAAGGTAAATTGCAAGAAATTTCTGATATGTTGAGTAGAATCCATGGCAGTCGTTGCTCTGCTTCTTTGATTGTCAATGGCTGTTTGATTTACAGGATTTTGGAGGAGGGGAGAGTTGAAGATGGTGTAATGTTGTTGAAGAGAATGTTGCAGAAAAACATGATTCTTGACGATATTGCTTATTCATTGATTGTTTATGCTAAACTGAAAATTGGGAATATAAAATCTGCACAGGAAGTGTTTGATGAAATGTCTAAAAGAGGATTTCAGGCAAATTCTTTCATTTATACATTGTTCATTGGCGCCCATTGTAGAGATGGAAGGATTGAAGAAGCTCATTGCTTGATGGAAGAGATGGAAAACATGGGTTTGAAGCCATATCCTGAAACCTTCAATCTTCTCATTGAAGGGTGTAGAGATTCAGAAGAAAGCTTGAGAATGTGTGAGAAAATGTTAGAAAGAGGGTTTGTTCCCAGTTGTTCATCTTTCAATGTGGCAATAGCTAAGATTTGTGAGGAAGGAGATGTGAAAAAGGCTAATGAAATGTTAACCATTTTATTAGATAAAGGGTTCTTGCCTGATGAAACCACTTACACCAATCTAATCATAGGATATGGGAAAATTGGTGAAACTCAGGAGATTCTTAAGCTATATTATGAAATGGATGCTAGATTACTTTCTCCTGGCGTGTCGGTTTTCTTTGCACTGATTGGGAGCTTTTGTCAATCTGGGAGACTGGAAGAAGCAGAGAAATATTTGAAGATTATGAAAGATAGGTCTATACAACCAAGTGTATCTATATATCAAACATTATCCTTGTTCTATTTGAAGAAGGGTAATAGAGCAAAGGCTCTAGAACATTATAATGAGATGATGTTCAATGGATAG

Coding sequence (CDS)

ATGGCATTTCTTCGCCAAAAACTGTCAACATTTATTCTCTCGTCGTCCAAAATTTCCTTCTGTTCATCCATACGAAAACTTAACTCAATTACAGCTGCAGATGAACTCATCAACAACGATGCAATCGTAAATTCAATATGCGATTCATTCACAAGACGCGAAAGCTGGGACACTCTGACTCGAAAATTCGAGACTCTGGAACTAAACGATTCGTTGGTCCAAAAAGTCTTGCTCAAATTCCAGCAGCTTGTTGATGCCAAACGCGCTCTAGGGTTCTTCCACTGGTCGGCCAAGAGGAAGAATTTCAATCATGGGTTTCAATCTTATGGTATTATGATTCATATTCTAGTGAAAGCTCGGTTGGTTATCGATGCTCGAGCTTTGCTCGAATCAATTTTGAAGAAAAATGAAGGTAGCTCTTTCAATTTCTCTATTGTAGATTCTCTATTGGATACTTATGAGGTGACTGATTCATCCCCATTTGTGTTCGATTTATTGATTCAAACTTGTGCTAAATTGAGATTGATTGATTTTGCTTTGAATATGTGTGCCCATTTGGAGGAACGTGGGTTCTCGTTGAGTTTGATAAGTTTCAATACTTTGCTTCATGTTGTGGAAAAATCTGATGAGAATCGTAAGGTTTGGAAGATTTATGAGCAAATGATTAGGAAAAGAGTTTACCCCAATGTGATTACTGTTAGGATCATGATCAATTCGTTGTGTAAGGAAGGTAAATTGCAAGAAATTTCTGATATGTTGAGTAGAATCCATGGCAGTCGTTGCTCTGCTTCTTTGATTGTCAATGGCTGTTTGATTTACAGGATTTTGGAGGAGGGGAGAGTTGAAGATGGTGTAATGTTGTTGAAGAGAATGTTGCAGAAAAACATGATTCTTGACGATATTGCTTATTCATTGATTGTTTATGCTAAACTGAAAATTGGGAATATAAAATCTGCACAGGAAGTGTTTGATGAAATGTCTAAAAGAGGATTTCAGGCAAATTCTTTCATTTATACATTGTTCATTGGCGCCCATTGTAGAGATGGAAGGATTGAAGAAGCTCATTGCTTGATGGAAGAGATGGAAAACATGGGTTTGAAGCCATATCCTGAAACCTTCAATCTTCTCATTGAAGGGTGTAGAGATTCAGAAGAAAGCTTGAGAATGTGTGAGAAAATGTTAGAAAGAGGGTTTGTTCCCAGTTGTTCATCTTTCAATGTGGCAATAGCTAAGATTTGTGAGGAAGGAGATGTGAAAAAGGCTAATGAAATGTTAACCATTTTATTAGATAAAGGGTTCTTGCCTGATGAAACCACTTACACCAATCTAATCATAGGATATGGGAAAATTGGTGAAACTCAGGAGATTCTTAAGCTATATTATGAAATGGATGCTAGATTACTTTCTCCTGGCGTGTCGGTTTTCTTTGCACTGATTGGGAGCTTTTGTCAATCTGGGAGACTGGAAGAAGCAGAGAAATATTTGAAGATTATGAAAGATAGGTCTATACAACCAAGTGTATCTATATATCAAACATTATCCTTGTTCTATTTGAAGAAGGGTAATAGAGCAAAGGCTCTAGAACATTATAATGAGATGATGTTCAATGGATAG

Protein sequence

MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLTRKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKARLVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSLCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEEMENMGLKPYPETFNLLIEGCRDSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGDVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMMFNG
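The relationship between the coding sequence and the protein sequence above can be checked by translating codons. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and only the first 30 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1 layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
cds_prefix = "ATGGCATTTCTTCGCCAAAAACTGTCAACA"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The result can be compared with the first ten residues of the protein sequence; applying the same function to the full CDS should reproduce the full protein if the standard code applies.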
BLAST of Cp4.1LG01g19790 vs. Swiss-Prot
Match: PP107_ARATH (Pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Arabidopsis thaliana GN=At1g66345 PE=3 SV=1)

HSP 1 Score: 481.5 bits (1238), Expect = 1.2e-134
Identity = 243/496 (48.99%), Positives = 344/496 (69.35%), Query Frame = 1

Query: 42  IVNSICDSFTRRESWDTLTRKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKN 101
           +++ I  S    ++W+TL+ KF +++L+DSL++ +LL+F+    AK+AL FFHWS+  +N
Sbjct: 49  LIDYISKSLQSNDTWETLSTKFSSIDLSDSLIETILLRFKNPETAKQALSFFHWSSHTRN 108

Query: 102 FNHGFQSYGIMIHILVKARLVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPF 161
             HG +SY + IHILVKARL+IDARAL+ES L  +   S    +VDSLLDTYE++ S+P 
Sbjct: 109 LRHGIKSYALTIHILVKARLLIDARALIESSLLNSPPDS---DLVDSLLDTYEISSSTPL 168

Query: 162 VFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQM 221
           VFDLL+Q  AK+R ++   ++   L + GF+LS+I+ NTL+H   KS  +  VW+IYE  
Sbjct: 169 VFDLLVQCYAKIRYLELGFDVFKRLCDCGFTLSVITLNTLIHYSSKSKIDDLVWRIYECA 228

Query: 222 IRKRVYPNVITVRIMINSLCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRV 281
           I KR+YPN IT+RIMI  LCKEG+L+E+ D+L RI G RC  S+IVN  L++R+LEE R+
Sbjct: 229 IDKRIYPNEITIRIMIQVLCKEGRLKEVVDLLDRICGKRCLPSVIVNTSLVFRVLEEMRI 288

Query: 282 EDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLF 341
           E+ + LLKR+L KNM++D I YS++VYAK K G++ SA++VFDEM +RGF ANSF+YT+F
Sbjct: 289 EESMSLLKRLLMKNMVVDTIGYSIVVYAKAKEGDLVSARKVFDEMLQRGFSANSFVYTVF 348

Query: 342 IGAHCRDGRIEEAHCLMEEMENMGLKPYPETFNLLIEGCRD---SEESLRMCEKMLERGF 401
           +   C  G ++EA  L+ EME  G+ PY ETFN LI G       E+ L  CE M+ RG 
Sbjct: 349 VRVCCEKGDVKEAERLLSEMEESGVSPYDETFNCLIGGFARFGWEEKGLEYCEVMVTRGL 408

Query: 402 VPSCSSFNVAIAKICEEGDVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILK 461
           +PSCS+FN  +  + +  +V +ANE+LT  +DKGF+PDE TY++LI G+ +  +  + LK
Sbjct: 409 MPSCSAFNEMVKSVSKIENVNRANEILTKSIDKGFVPDEHTYSHLIRGFIEGNDIDQALK 468

Query: 462 LYYEMDARLLSPGVSVFFALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYL 521
           L+YEM+ R +SPG  VF +LI   C  G++E  EKYLKIMK R I+P+  IY  L   + 
Sbjct: 469 LFYEMEYRKMSPGFEVFRSLIVGLCTCGKVEAGEKYLKIMKKRLIEPNADIYDALIKAFQ 528

Query: 522 KKGNRAKALEHYNEMM 535
           K G++  A   YNEM+
Sbjct: 529 KIGDKTNADRVYNEMI 541
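The percentages in each HSP header are the identical (or positively scoring) aligned positions divided by the alignment length, gaps included. A small sketch of that arithmetic for the hit above follows; the `hsp_percentages` helper is hypothetical and simply reads the `n/m` fractions in the order they appear on the header line.

```python
import re

def hsp_percentages(header_line):
    """Return the percentage for each 'n/m' fraction on an HSP header line."""
    fractions = re.findall(r"(\d+)/(\d+)", header_line)
    return [round(100 * int(num) / int(den), 2) for num, den in fractions]

# Counts taken from the PP107_ARATH hit: identical and positive columns
# out of 496 aligned columns.
header = "Identity = 243/496 (48.99%), Positives = 344/496 (69.35%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(header)
print(identity_pct, positives_pct)
```

Because the denominator is the alignment length rather than the query length, hits covering different portions of the query are not directly comparable by percentage alone.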

BLAST of Cp4.1LG01g19790 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 194.9 bits (494), Expect = 2.3e-48
Identity = 130/461 (28.20%), Positives = 226/461 (49.02%), Query Frame = 1

Query: 82  QLVDAKRALGFFHWSAKRKNF--NHGFQSYGIMIHILVKARLVIDARALLESILKKNEGS 141
           +LV  K AL F  W  K+     +H  Q   I  HILV+AR+   AR +L+ +   +  S
Sbjct: 46  RLVHGKLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKS 105

Query: 142 SFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSLISFN 201
           SF F    +L+ TY + +S+P V+D+LI+   +  +I  +L +   +   GF+ S+ + N
Sbjct: 106 SFVFG---ALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCN 165

Query: 202 TLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSLCKEGKLQEISDMLSRIHGS 261
            +L  V KS E+  VW   ++M+++++ P+V T  I+IN LC EG  ++ S ++ ++  S
Sbjct: 166 AILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKS 225

Query: 262 RCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIGNIKSA 321
             + +++    +++   ++GR +  + LL  M  K +  D   Y+++++   +   I   
Sbjct: 226 GYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKG 285

Query: 322 QEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEEMENMGLKPYPETFNLLIEG 381
             +  +M KR    N   Y   I     +G++  A  L+ EM + GL P   TFN LI+G
Sbjct: 286 YLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDG 345

Query: 382 C---RDSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGDVKKANEMLTILLDKGFLPD 441
                + +E+L+M   M  +G  PS  S+ V +  +C+  +   A      +   G    
Sbjct: 346 HISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVG 405

Query: 442 ETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFALIGSFCQSGRLEEAEKYLK 501
             TYT +I G  K G   E + L  EM    + P +  + ALI  FC+ GR + A++ + 
Sbjct: 406 RITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVC 465

Query: 502 IMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMMFNG 538
            +    + P+  IY TL     + G   +A+  Y  M+  G
Sbjct: 466 RIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEG 503

BLAST of Cp4.1LG01g19790 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 186.8 bits (473), Expect = 6.2e-46
Identity = 119/463 (25.70%), Positives = 229/463 (49.46%), Query Frame = 1

Query: 76  VLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKARLVIDARALLESILKK 135
           +LLK Q   D    L F +W+   + F    +   I +HIL K +L   A+ L E +  K
Sbjct: 54  LLLKSQN--DQALILKFLNWANPHQFFT--LRCKCITLHILTKFKLYKTAQILAEDVAAK 113

Query: 136 NEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSL 195
                +   +  SL +TY++  S+  VFDL++++ ++L LID AL++    +  GF   +
Sbjct: 114 TLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGV 173

Query: 196 ISFNTLLHVVEKSDENRKVWK-IYEQMIRKRVYPNVITVRIMINSLCKEGKLQEISDMLS 255
           +S+N +L    +S  N    + ++++M+  +V PNV T  I+I   C  G +     +  
Sbjct: 174 LSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD 233

Query: 256 RIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIG 315
           ++    C  +++    LI    +  +++DG  LL+ M  K +  + I+Y++++    + G
Sbjct: 234 KMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 293

Query: 316 NIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEEMENMGLKPYPETFN 375
            +K    V  EM++RG+  +   Y   I  +C++G   +A  +  EM   GL P   T+ 
Sbjct: 294 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYT 353

Query: 376 LLIEG-CR--DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGDVKKANEMLTILLDK 435
            LI   C+  +   ++   ++M  RG  P+  ++   +    ++G + +A  +L  + D 
Sbjct: 354 SLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDN 413

Query: 436 GFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFALIGSFCQSGRLEEA 495
           GF P   TY  LI G+   G+ ++ + +  +M  + LSP V  +  ++  FC+S  ++EA
Sbjct: 414 GFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEA 473

Query: 496 EKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMM 535
            +  + M ++ I+P    Y +L   + ++    +A + Y EM+
Sbjct: 474 LRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEML 512

BLAST of Cp4.1LG01g19790 vs. Swiss-Prot
Match: PP143_ARATH (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.6e-41
Identity = 130/483 (26.92%), Positives = 219/483 (45.34%), Query Frame = 1

Query: 56  WDT--LTRKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMI 115
           WD   L + F+ L L    V +VL++ ++  D K A  FF WS  R  F H  +SY I+ 
Sbjct: 93  WDDPGLEKLFD-LTLAPIWVPRVLVELKE--DPKLAFKFFKWSMTRNGFKHSVESYCIVA 152

Query: 116 HILVKARLVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKL 175
           HIL  AR+  DA     S+LK+   S  +  + D L  T  V      VFD L      L
Sbjct: 153 HILFCARMYYDAN----SVLKEMVLSKADCDVFDVLWSTRNVCVPGFGVFDALFSVLIDL 212

Query: 176 RLIDFALNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITV 235
            +++ A+   + ++         S N LLH   K  +   V + ++ MI     P V T 
Sbjct: 213 GMLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTY 272

Query: 236 RIMINSLCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQ 295
            IMI+ +CKEG ++    +   +         +    +I    + GR++D V   + M  
Sbjct: 273 NIMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKD 332

Query: 296 KNMILDDIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEE 355
                D I Y+ ++    K G +    E + EM   G + N   Y+  + A C++G +++
Sbjct: 333 MCCEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQ 392

Query: 356 AHCLMEEMENMGLKPYPETFNLLIE-GCR--DSEESLRMCEKMLERGFVPSCSSFNVAIA 415
           A     +M  +GL P   T+  LI+  C+  +  ++ R+  +ML+ G   +  ++   I 
Sbjct: 393 AIKFYVDMRRVGLVPNEYTYTSLIDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALID 452

Query: 416 KICEEGDVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSP 475
            +C+   +K+A E+   +   G +P+  +Y  LI G+ K       L+L  E+  R + P
Sbjct: 453 GLCDAERMKEAEELFGKMDTAGVIPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKP 512

Query: 476 GVSVFFALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHY 534
            + ++   I   C   ++E A+  +  MK+  I+ +  IY TL   Y K GN  + L   
Sbjct: 513 DLLLYGTFIWGLCSLEKIEAAKVVMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLL 568

BLAST of Cp4.1LG01g19790 vs. Swiss-Prot
Match: PPR39_ARATH (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 171.0 bits (432), Expect = 3.5e-41
Identity = 108/417 (25.90%), Positives = 210/417 (50.36%), Query Frame = 1

Query: 122 VIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTD--SSPFVFDLLIQTCAKLRLIDFA 181
           VID   L  +I K  +     + +V +L    E      S +   ++I    + R + +A
Sbjct: 88  VIDFNRLFSAIAKTKQ-----YELVLALCKQMESKGIAHSIYTLSIMINCFCRCRKLSYA 147

Query: 182 LNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINS 241
            +    + + G+    + FNTLL+ +       +  ++ ++M+     P +IT+  ++N 
Sbjct: 148 FSTMGKIMKLGYEPDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLNTLVNG 207

Query: 242 LCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILD 301
           LC  GK+ +   ++ R+  +    + +  G ++  + + G+    + LL++M ++N+ LD
Sbjct: 208 LCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLD 267

Query: 302 DIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLME 361
            + YS+I+    K G++ +A  +F+EM  +GF+A+   Y   IG  C  GR ++   L+ 
Sbjct: 268 AVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLR 327

Query: 362 EMENMGLKPYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKICEEG 421
           +M    + P   TF++LI+         E+ ++ ++M++RG  P+  ++N  I   C+E 
Sbjct: 328 DMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKEN 387

Query: 422 DVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFF 481
            +++A +M+ +++ KG  PD  T+  LI GY K     + L+L+ EM  R +      + 
Sbjct: 388 RLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYN 447

Query: 482 ALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEM 534
            L+  FCQSG+LE A+K  + M  R ++P +  Y+ L       G   KALE + ++
Sbjct: 448 TLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIFGKI 499

BLAST of Cp4.1LG01g19790 vs. TrEMBL
Match: A0A0A0KRP7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G423870 PE=4 SV=1)

HSP 1 Score: 884.0 bits (2283), Expect = 9.0e-254
Identity = 453/540 (83.89%), Positives = 489/540 (90.56%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60
           MA LRQKLS  +LSS KIS C S+R L SI  AD+LIN+DA VNSICDS TRR+SWDTL+
Sbjct: 1   MALLRQKLSPIVLSS-KISLCLSMRNLISIPVADKLINDDATVNSICDSLTRRQSWDTLS 60

Query: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120
           RKF+ LELND LVQKVLLKFQQ VDAKRALGFFHWSAKRKNFNHG QS+GIMIHILVKAR
Sbjct: 61  RKFQFLELNDFLVQKVLLKFQQPVDAKRALGFFHWSAKRKNFNHGPQSFGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALLESILKKNEG+SF++S+VDSL+D+YEVT SSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLESILKKNEGNSFDYSVVDSLMDSYEVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240
            +C+HLEERGFSLSLISFNTL+HVVEKSDEN KVWKIYEQMIRKRVYPN ITVRIMINSL
Sbjct: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENLKVWKIYEQMIRKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300
           CKEGKLQE SDML+RIHGSRCSASLIVN CLIYRILEEGRVEDG+ LLKRMLQKNM+LDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGITLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360
           IAYSLIVYAK+K G+I S  EVF+EMS+RGFQANSFIYTLFIG HCR G++EEAHCLM+E
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSERGFQANSFIYTLFIGVHCRGGKVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCR---DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420
           MENMGLKPYPETFNLLIEGC     SEE L MCEKMLERGF+PSCS FNVAI KICE+GD
Sbjct: 361 MENMGLKPYPETFNLLIEGCAISGHSEEILSMCEKMLERGFLPSCSVFNVAIDKICEKGD 420

Query: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480
           VKKAN +LTILLDKGFLPDETTYTNLIIGY K GE QEILKLYYEM ARLLSPGVSVFFA
Sbjct: 421 VKKANALLTILLDKGFLPDETTYTNLIIGYRKSGEIQEILKLYYEMGARLLSPGVSVFFA 480

Query: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMMFNG 538
           LIGS CQSGRLEEAEKYLKI+KD S+ P +SIYQ L L YLKKGNRAKALE YNEMMF+G
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPCLSIYQALILLYLKKGNRAKALELYNEMMFDG 539

BLAST of Cp4.1LG01g19790 vs. TrEMBL
Match: B9H373_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0004s08990g PE=4 SV=1)

HSP 1 Score: 599.7 bits (1545), Expect = 3.4e-168
Identity = 307/544 (56.43%), Positives = 400/544 (73.53%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISF----CSSIRKLNSITAADELINNDAIVNSICDSFTRRESW 60
           MA LR+   + I +  K S     CSS  ++           +D +V+SICDS  R  +W
Sbjct: 1   MALLRRTFPSLISTPLKYSIHPRVCSSWFEVARFLHDGTKTESDTVVSSICDSLRRGYNW 60

Query: 61  DTLTRKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHIL 120
           DTL RKFE+L+LN+ LV+ VLL+ ++  DAKRALGFFHWSA+R NF HG QSY +MIHIL
Sbjct: 61  DTLNRKFESLQLNNLLVKNVLLELKEPTDAKRALGFFHWSARR-NFVHGVQSYCLMIHIL 120

Query: 121 VKARLVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLI 180
           ++ARL++DA+ALLES+LKK+ G    F ++DSLL +Y++  SSP VFDLL+Q  AK R+ 
Sbjct: 121 IQARLIMDAQALLESLLKKSVGDPTKFLVLDSLLSSYKIIISSPLVFDLLVQAYAKQRMF 180

Query: 181 DFALNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIM 240
           +   ++C  LEE  F+LSLISFNTL+HVV+KSD++   WKIYE M+ +R YPN  T+  M
Sbjct: 181 EIGFDVCCRLEEHRFTLSLISFNTLIHVVQKSDKSPLAWKIYEHMLHRRTYPNEATIESM 240

Query: 241 INSLCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNM 300
           I++LCKEGKLQ I +ML +IHG RCS  +IVN CL++RILEEGRVE G+ LLK ML+KNM
Sbjct: 241 ISALCKEGKLQTIVNMLDKIHGKRCSPVVIVNTCLVFRILEEGRVEPGLALLKMMLRKNM 300

Query: 301 ILDDIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHC 360
           ILD +AYSLIVYAK+K+GN+ SA +V++EM KRGF ANSF+YT FIGA+C++ RIEEA+ 
Sbjct: 301 ILDTVAYSLIVYAKVKLGNLNSAMQVYEEMLKRGFNANSFVYTSFIGAYCKEERIEEANQ 360

Query: 361 LMEEMENMGLKPYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKIC 420
           L++EMENMGLKPY +TFN L+EGC  +   EE+L  C+KM+E G VPS S+FN  + K+C
Sbjct: 361 LLQEMENMGLKPYGDTFNFLLEGCAKAGRVEETLSYCKKMMEMGHVPSLSAFNEMVGKLC 420

Query: 421 EEGDVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVS 480
              DV +ANEMLT LLD+GFL DE TY+NLI GY K  + QE+LKLYYEM+ R LSPG+ 
Sbjct: 421 RIEDVTRANEMLTNLLDEGFLADEITYSNLISGYAKNNQIQEMLKLYYEMEYRSLSPGLM 480

Query: 481 VFFALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEM 538
            F +LI   C  G+LEEAEKYL+IM  RS+ P   +Y+ L   Y +KG++ +AL  YNEM
Sbjct: 481 GFTSLIKGLCNCGKLEEAEKYLRIMIGRSLNPREDVYEALIKVYFEKGDKRRALNLYNEM 540

BLAST of Cp4.1LG01g19790 vs. TrEMBL
Match: A5AQQ7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_012747 PE=4 SV=1)

HSP 1 Score: 598.6 bits (1542), Expect = 7.6e-168
Identity = 305/537 (56.80%), Positives = 403/537 (75.05%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60
           MA LR+ + + I  S+     +    +  +T  +   N+++ VN +CDS  R  +WD L 
Sbjct: 1   MALLRRVIPSLISISTNNPTRTIHLHVPQLTLGET--NSNSKVNMLCDSLRRGLNWDALN 60

Query: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120
           ++F +LEL +S V +VLL+ ++ +DAK+ALGFFHWSA+ KN  HG  SY I IHILV A 
Sbjct: 61  QRFGSLELTESFVGRVLLELKKPIDAKQALGFFHWSAQCKNLEHGVASYCITIHILVGAH 120

Query: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180
           L++DA++LLES LKKN GS   F +VDSLL +Y +T S+P VFDLL+Q+ +KLR+ +   
Sbjct: 121 LLMDAQSLLESTLKKNAGS--RFLVVDSLLSSYNITGSNPRVFDLLVQSYSKLRMFEICF 180

Query: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240
           ++C +LEE GFSLSLISFN LLHVV+KSD    VWKIYE MIR R YPN ++V +MI++L
Sbjct: 181 DVCCYLEEHGFSLSLISFNXLLHVVQKSDNYPLVWKIYEHMIRVRKYPNEVSVSVMISAL 240

Query: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300
           CKEG LQ+  DML RIHG RCS  +IVN C+I+R+LEEGRVE G+++LKR+LQKNMILD 
Sbjct: 241 CKEGALQKFVDMLDRIHGKRCSPIVIVNTCMIFRMLEEGRVEQGMLILKRLLQKNMILDT 300

Query: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360
           I+YSLI YAK+K G + SA EV++EM  RGF  N+F+YTLFIG+HC +GRIEEA+ LM++
Sbjct: 301 ISYSLIAYAKVKYGTLDSAWEVYEEMLNRGFHPNAFVYTLFIGSHCVEGRIEEANELMQD 360

Query: 361 MENMGLKPYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420
           MEN GL PY ETFNLLI GC  +   EE LR+CE+M++RG VPSC +FN+   K+CE G 
Sbjct: 361 MENAGLMPYDETFNLLIAGCSKAGRLEEGLRLCERMMQRGLVPSCWAFNLMAGKLCESGV 420

Query: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480
           VK+A+EMLT+LLDKGF+PDE TY+NLI  YGK+GE Q++LKLYYEM+ R LSPG+ VF +
Sbjct: 421 VKRADEMLTLLLDKGFVPDEITYSNLIASYGKLGEIQQVLKLYYEMEYRSLSPGLLVFES 480

Query: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMM 535
           +I S CQ  +LE+AEKYL+IMKDRSI  S  +Y+TL   Y +KG+  +A + +NEM+
Sbjct: 481 IIRSLCQCRKLEKAEKYLRIMKDRSIAISTCVYETLISGYFEKGDELRASQLHNEML 533

BLAST of Cp4.1LG01g19790 vs. TrEMBL
Match: B9SYW9_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0012930 PE=4 SV=1)

HSP 1 Score: 586.3 bits (1510), Expect = 3.9e-164
Identity = 302/533 (56.66%), Positives = 391/533 (73.36%), Query Frame = 1

Query: 8   LSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLTRKFETLE 67
           LS  + + S    CS    +      D   + DAIV +ICDS  R  +W +L+ KF+ +E
Sbjct: 12  LSRQVKNPSDSCLCSHQILVTRFVHNDAKPDGDAIVYAICDSLRRGHNWVSLSGKFQYVE 71

Query: 68  LNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKARLVIDARA 127
           LN  LV+KVLL+ ++ +DAKRALGFFHWSA+RKNF HG  SY +M++ILV+A+L+ DA+A
Sbjct: 72  LNHLLVEKVLLELKEPIDAKRALGFFHWSAQRKNFVHGVWSYCLMVNILVRAQLLNDAQA 131

Query: 128 LLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFALNMCAHLE 187
           LLESILKKN   S  F IVDSLLD+Y++  SSP VF+LL+Q  AKLRL +    +C +LE
Sbjct: 132 LLESILKKNVEDSSEFLIVDSLLDSYKIIVSSPLVFNLLVQAYAKLRLFEIGFKICFYLE 191

Query: 188 ERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSLCKEGKLQ 247
           E GF LSL+SFNTL+HVV+KSD+   VWKIYE MI KR+YPN  T+R MIN+LCKEGKLQ
Sbjct: 192 EHGFFLSLLSFNTLIHVVQKSDQYPLVWKIYEHMIHKRIYPNEATIRTMINALCKEGKLQ 251

Query: 248 EISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDDIAYSLIV 307
              D+L RIHG RC   +I+N C+++RIL+EGRV+ G+ +LK MLQKNMILD +AYSLIV
Sbjct: 252 MFVDILDRIHGKRCRPLVIINACMVFRILQEGRVDVGIGILKGMLQKNMILDTVAYSLIV 311

Query: 308 YAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEEMENMGLK 367
           +AK+++GN+ SA EV++ M KRGF ANSF++T+ IGA+C  G+IE+A+ L  EM  MGL+
Sbjct: 312 FAKVRLGNLDSALEVYEAMLKRGFNANSFVHTVLIGAYCNGGKIEKANQLFGEMGTMGLE 371

Query: 368 PYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKICEEGDVKKANEM 427
           PY ETFN LIEGC  +   EE L   EKM+ERG VPS  +FN  IAK+CE G+V +AN  
Sbjct: 372 PYDETFNFLIEGCAKAGRVEECLSYFEKMIERGLVPSLLAFNKMIAKLCETGEVNQANTF 431

Query: 428 LTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFALIGSFCQ 487
           LT LLDKGF PDETTY+ L+ GY +  + QE+LKLYYEM+ R LSPG+ VF  LI S C 
Sbjct: 432 LTRLLDKGFSPDETTYSYLMTGYERDNQIQEVLKLYYEMEYRPLSPGLLVFTPLIRSLCH 491

Query: 488 SGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMMFNG 538
            G+LE+AEKYL+IMK RS+ PS  +Y+ L   +L+K + A+AL+ YNEM+  G
Sbjct: 492 CGKLEQAEKYLRIMKGRSLNPSQQVYEALIAGHLEKSDTARALQLYNEMISKG 544

BLAST of Cp4.1LG01g19790 vs. TrEMBL
Match: M5X9J7_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa014874mg PE=4 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 3.3e-163
Identity = 291/483 (60.25%), Positives = 373/483 (77.23%), Query Frame = 1

Query: 30  ITAADELINNDA----IVNSICDSFTRRESWDTLTRKFETLELNDSLVQKVLLKFQQLVD 89
           I     LIN D+    +  +I DSF    +WDTLT KFE+++L+  LV  VLL+ ++ +D
Sbjct: 17  INQETRLINTDSAPKDVAKAIRDSFRSSWNWDTLTTKFESVKLDGGLVDSVLLELKEPID 76

Query: 90  AKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKARLVIDARALLESILKKNEGSSFNFSI 149
           AKRALGFFHW+A RK+F HG  SY I IHIL +ARL++DARALLES+LKK   +   FS+
Sbjct: 77  AKRALGFFHWAAHRKSFEHGVWSYSITIHILARARLLMDARALLESVLKKTAENGSKFSV 136

Query: 150 VDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSLISFNTLLHVV 209
           VDSLL +YEVT S+PFVFDLL+Q  AKLR+ +   ++C +L E G  LSLI++NTLLHVV
Sbjct: 137 VDSLLSSYEVTASNPFVFDLLLQAYAKLRMFETGFDVCCYLGEHGLPLSLITYNTLLHVV 196

Query: 210 EKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSLCKEGKLQEISDMLSRIHGSRCSASL 269
           +KSD+   VWKIYE M+ KR YPN  T++I+I++LCKEGKL++  DML RIHG RCS S+
Sbjct: 197 QKSDQTALVWKIYEHMVGKRNYPNEETIKILIDALCKEGKLKKCVDMLDRIHGKRCSPSV 256

Query: 270 IVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIGNIKSAQEVFDE 329
           IVN  L++ ILE GRVE+G+MLL+RMLQKNM+LD IAYSLIVYAK+K+G++ SA EV++E
Sbjct: 257 IVNTSLVFSILEGGRVEEGLMLLRRMLQKNMVLDTIAYSLIVYAKVKLGDVCSAWEVYEE 316

Query: 330 MSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEEMENMGLKPYPETFNLLIEGCRDS-- 389
           M KRGF+ANSF+YTLF+GAHC +GR+EEA  +M EMENM LKP+ E++NLLIEGC  +  
Sbjct: 317 MLKRGFRANSFVYTLFMGAHCEEGRMEEAQGMMNEMENMDLKPFDESYNLLIEGCAKAGR 376

Query: 390 -EESLRMCEKMLERGFVPSCSSFNVAIAKICEEGDVKKANEMLTILLDKGFLPDETTYTN 449
            E SL   +KM+E GF+P  S+FN  + K+CE GD ++AN M TILLDKGFLPD TTY +
Sbjct: 377 VEASLSYLKKMVESGFIPCRSAFNEMVGKLCETGDAEQANTMFTILLDKGFLPDSTTYGH 436

Query: 450 LIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFALIGSFCQSGRLEEAEKYLKIMKDRS 506
           LI GYG+ GE QE++KLYYEM++R LSPG  VF ++I SFCQ G++EEAE+Y  IMKDRS
Sbjct: 437 LIDGYGRKGEIQEVVKLYYEMESRSLSPGALVFTSVIKSFCQCGKVEEAERYFGIMKDRS 496

BLAST of Cp4.1LG01g19790 vs. TAIR10
Match: AT1G66345.1 (AT1G66345.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 481.5 bits (1238), Expect = 6.8e-136
Identity = 243/496 (48.99%), Positives = 344/496 (69.35%), Query Frame = 1

Query: 42  IVNSICDSFTRRESWDTLTRKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKN 101
           +++ I  S    ++W+TL+ KF +++L+DSL++ +LL+F+    AK+AL FFHWS+  +N
Sbjct: 49  LIDYISKSLQSNDTWETLSTKFSSIDLSDSLIETILLRFKNPETAKQALSFFHWSSHTRN 108

Query: 102 FNHGFQSYGIMIHILVKARLVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPF 161
             HG +SY + IHILVKARL+IDARAL+ES L  +   S    +VDSLLDTYE++ S+P 
Sbjct: 109 LRHGIKSYALTIHILVKARLLIDARALIESSLLNSPPDS---DLVDSLLDTYEISSSTPL 168

Query: 162 VFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQM 221
           VFDLL+Q  AK+R ++   ++   L + GF+LS+I+ NTL+H   KS  +  VW+IYE  
Sbjct: 169 VFDLLVQCYAKIRYLELGFDVFKRLCDCGFTLSVITLNTLIHYSSKSKIDDLVWRIYECA 228

Query: 222 IRKRVYPNVITVRIMINSLCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRV 281
           I KR+YPN IT+RIMI  LCKEG+L+E+ D+L RI G RC  S+IVN  L++R+LEE R+
Sbjct: 229 IDKRIYPNEITIRIMIQVLCKEGRLKEVVDLLDRICGKRCLPSVIVNTSLVFRVLEEMRI 288

Query: 282 EDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLF 341
           E+ + LLKR+L KNM++D I YS++VYAK K G++ SA++VFDEM +RGF ANSF+YT+F
Sbjct: 289 EESMSLLKRLLMKNMVVDTIGYSIVVYAKAKEGDLVSARKVFDEMLQRGFSANSFVYTVF 348

Query: 342 IGAHCRDGRIEEAHCLMEEMENMGLKPYPETFNLLIEGCRD---SEESLRMCEKMLERGF 401
           +   C  G ++EA  L+ EME  G+ PY ETFN LI G       E+ L  CE M+ RG 
Sbjct: 349 VRVCCEKGDVKEAERLLSEMEESGVSPYDETFNCLIGGFARFGWEEKGLEYCEVMVTRGL 408

Query: 402 VPSCSSFNVAIAKICEEGDVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILK 461
           +PSCS+FN  +  + +  +V +ANE+LT  +DKGF+PDE TY++LI G+ +  +  + LK
Sbjct: 409 MPSCSAFNEMVKSVSKIENVNRANEILTKSIDKGFVPDEHTYSHLIRGFIEGNDIDQALK 468

Query: 462 LYYEMDARLLSPGVSVFFALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYL 521
           L+YEM+ R +SPG  VF +LI   C  G++E  EKYLKIMK R I+P+  IY  L   + 
Sbjct: 469 LFYEMEYRKMSPGFEVFRSLIVGLCTCGKVEAGEKYLKIMKKRLIEPNADIYDALIKAFQ 528

Query: 522 KKGNRAKALEHYNEMM 535
           K G++  A   YNEM+
Sbjct: 529 KIGDKTNADRVYNEMI 541
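The same Arabidopsis protein appears in both the Swiss-Prot and TAIR10 searches with an identical bit score (481.5) but different Expect values (1.2e-134 and 6.8e-136). This is what the usual BLAST statistics predict: for a bit score S', the expected number of chance hits is roughly E = m * n * 2^(-S'), where m * n is the effective search space, so E scales with database size. The sketch below applies that relation to the two reported values; the function names are hypothetical, and because the bit score is rounded on this page the implied search spaces are only approximate.

```python
def implied_search_space(evalue, bit_score):
    """Invert E = (m * n) * 2**(-S') to estimate the effective search space m * n."""
    return evalue * 2.0 ** bit_score

def search_space_ratio(evalue_a, evalue_b):
    """For equal bit scores, the E-value ratio equals the search-space ratio."""
    return evalue_a / evalue_b

swissprot_e = 1.2e-134  # At1g66345 hit, Swiss-Prot search
tair10_e = 6.8e-136     # AT1G66345.1 hit, TAIR10 search
bit_score = 481.5       # reported for both hits

ratio = search_space_ratio(swissprot_e, tair10_e)
print(f"Swiss-Prot search space is about {ratio:.1f}x the TAIR10 one")
print(f"implied m*n (Swiss-Prot): {implied_search_space(swissprot_e, bit_score):.2e}")
print(f"implied m*n (TAIR10):     {implied_search_space(tair10_e, bit_score):.2e}")
```

In practice this means Expect values should be compared only within one database search; the bit score is the quantity that carries over between databases.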

BLAST of Cp4.1LG01g19790 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 194.9 bits (494), Expect = 1.3e-49
Identity = 130/461 (28.20%), Positives = 226/461 (49.02%), Query Frame = 1

Query: 82  QLVDAKRALGFFHWSAKRKNF--NHGFQSYGIMIHILVKARLVIDARALLESILKKNEGS 141
           +LV  K AL F  W  K+     +H  Q   I  HILV+AR+   AR +L+ +   +  S
Sbjct: 86  RLVHGKLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKS 145

Query: 142 SFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSLISFN 201
           SF F    +L+ TY + +S+P V+D+LI+   +  +I  +L +   +   GF+ S+ + N
Sbjct: 146 SFVFG---ALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCN 205

Query: 202 TLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSLCKEGKLQEISDMLSRIHGS 261
            +L  V KS E+  VW   ++M+++++ P+V T  I+IN LC EG  ++ S ++ ++  S
Sbjct: 206 AILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKS 265

Query: 262 RCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIGNIKSA 321
             + +++    +++   ++GR +  + LL  M  K +  D   Y+++++   +   I   
Sbjct: 266 GYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKG 325

Query: 322 QEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEEMENMGLKPYPETFNLLIEG 381
             +  +M KR    N   Y   I     +G++  A  L+ EM + GL P   TFN LI+G
Sbjct: 326 YLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDG 385

Query: 382 C---RDSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGDVKKANEMLTILLDKGFLPD 441
                + +E+L+M   M  +G  PS  S+ V +  +C+  +   A      +   G    
Sbjct: 386 HISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVG 445

Query: 442 ETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFALIGSFCQSGRLEEAEKYLK 501
             TYT +I G  K G   E + L  EM    + P +  + ALI  FC+ GR + A++ + 
Sbjct: 446 RITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVC 505

Query: 502 IMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMMFNG 538
            +    + P+  IY TL     + G   +A+  Y  M+  G
Sbjct: 506 RIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEG 543

BLAST of Cp4.1LG01g19790 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 186.8 bits (473), Expect = 3.5e-47
Identity = 119/463 (25.70%), Positives = 229/463 (49.46%), Query Frame = 1

Query: 76  VLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKARLVIDARALLESILKK 135
           +LLK Q   D    L F +W+   + F    +   I +HIL K +L   A+ L E +  K
Sbjct: 54  LLLKSQN--DQALILKFLNWANPHQFFT--LRCKCITLHILTKFKLYKTAQILAEDVAAK 113

Query: 136 NEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFALNMCAHLEERGFSLSL 195
                +   +  SL +TY++  S+  VFDL++++ ++L LID AL++    +  GF   +
Sbjct: 114 TLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGV 173

Query: 196 ISFNTLLHVVEKSDENRKVWK-IYEQMIRKRVYPNVITVRIMINSLCKEGKLQEISDMLS 255
           +S+N +L    +S  N    + ++++M+  +V PNV T  I+I   C  G +     +  
Sbjct: 174 LSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFD 233

Query: 256 RIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDDIAYSLIVYAKLKIG 315
           ++    C  +++    LI    +  +++DG  LL+ M  K +  + I+Y++++    + G
Sbjct: 234 KMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREG 293

Query: 316 NIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEEMENMGLKPYPETFN 375
            +K    V  EM++RG+  +   Y   I  +C++G   +A  +  EM   GL P   T+ 
Sbjct: 294 RMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYT 353

Query: 376 LLIEG-CR--DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGDVKKANEMLTILLDK 435
            LI   C+  +   ++   ++M  RG  P+  ++   +    ++G + +A  +L  + D 
Sbjct: 354 SLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDN 413

Query: 436 GFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFALIGSFCQSGRLEEA 495
           GF P   TY  LI G+   G+ ++ + +  +M  + LSP V  +  ++  FC+S  ++EA
Sbjct: 414 GFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEA 473

Query: 496 EKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMM 535
            +  + M ++ I+P    Y +L   + ++    +A + Y EM+
Sbjct: 474 LRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEML 512
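
Each HSP header on this page reports a bit score, a raw score in parentheses, an E-value, and two count/length fractions with their percentages. The sketch below, written under the assumption that the two header lines are available as plain text, extracts those numbers and recomputes the percentages from the fractions. `parse_hsp` and `percent` are hypothetical helper names introduced for illustration only.

```python
import re

# Hypothetical helpers for the HSP header lines as printed on this page;
# they are not part of BLAST or of any BLAST-parsing library.
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.]+(?:[eE][-+]?\d+)?)"
)
# Any "<label> = <count>/<length>" pair, e.g. the identity fraction.
FRACTION_RE = re.compile(r"(?P<label>\w+)\s*=\s*(?P<count>\d+)/(?P<length>\d+)")


def parse_hsp(text):
    """Parse bit score, raw score, E-value and count/length fractions."""
    m = SCORE_RE.search(text)
    if m is None:
        raise ValueError("no 'Score: ... Expect = ...' line found")
    fractions = [
        (f["label"], int(f["count"]), int(f["length"]))
        for f in FRACTION_RE.finditer(text)
    ]
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "fractions": fractions,
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


# Header of the AT5G39710.1 hit shown above.
hsp = parse_hsp(
    "HSP 1 Score: 186.8 bits (473), Expect = 3.5e-47\n"
    "Identity = 119/463 (25.70%), Positives = 229/463 (49.46%), Query Frame = 1"
)
for label, count, length in hsp["fractions"]:
    print(label, percent(count, length))
```

The fraction pattern matches any label followed by `count/length`, so it does not depend on how the labels are spelled in the header.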

BLAST of Cp4.1LG01g19790 vs. TAIR10
Match: AT2G02150.1 (AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 172.2 bits (435), Expect = 8.9e-43
Identity = 130/483 (26.92%), Positives = 219/483 (45.34%), Query Frame = 1

Query: 56  WDT--LTRKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMI 115
           WD   L + F+ L L    V +VL++ ++  D K A  FF WS  R  F H  +SY I+ 
Sbjct: 93  WDDPGLEKLFD-LTLAPIWVPRVLVELKE--DPKLAFKFFKWSMTRNGFKHSVESYCIVA 152

Query: 116 HILVKARLVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKL 175
           HIL  AR+  DA     S+LK+   S  +  + D L  T  V      VFD L      L
Sbjct: 153 HILFCARMYYDAN----SVLKEMVLSKADCDVFDVLWSTRNVCVPGFGVFDALFSVLIDL 212

Query: 176 RLIDFALNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITV 235
            +++ A+   + ++         S N LLH   K  +   V + ++ MI     P V T 
Sbjct: 213 GMLEEAIQCFSKMKRFRVFPKTRSCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTY 272

Query: 236 RIMINSLCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQ 295
            IMI+ +CKEG ++    +   +         +    +I    + GR++D V   + M  
Sbjct: 273 NIMIDCMCKEGDVEAARGLFEEMKFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKD 332

Query: 296 KNMILDDIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEE 355
                D I Y+ ++    K G +    E + EM   G + N   Y+  + A C++G +++
Sbjct: 333 MCCEPDVITYNALINCFCKFGKLPIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQ 392

Query: 356 AHCLMEEMENMGLKPYPETFNLLIE-GCR--DSEESLRMCEKMLERGFVPSCSSFNVAIA 415
           A     +M  +GL P   T+  LI+  C+  +  ++ R+  +ML+ G   +  ++   I 
Sbjct: 393 AIKFYVDMRRVGLVPNEYTYTSLIDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALID 452

Query: 416 KICEEGDVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSP 475
            +C+   +K+A E+   +   G +P+  +Y  LI G+ K       L+L  E+  R + P
Sbjct: 453 GLCDAERMKEAEELFGKMDTAGVIPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKP 512

Query: 476 GVSVFFALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHY 534
            + ++   I   C   ++E A+  +  MK+  I+ +  IY TL   Y K GN  + L   
Sbjct: 513 DLLLYGTFIWGLCSLEKIEAAKVVMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLL 568

BLAST of Cp4.1LG01g19790 vs. TAIR10
Match: AT1G12775.1 (AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 171.0 bits (432), Expect = 2.0e-42
Identity = 108/417 (25.90%), Positives = 210/417 (50.36%), Query Frame = 1

Query: 122 VIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTD--SSPFVFDLLIQTCAKLRLIDFA 181
           VID   L  +I K  +     + +V +L    E      S +   ++I    + R + +A
Sbjct: 88  VIDFNRLFSAIAKTKQ-----YELVLALCKQMESKGIAHSIYTLSIMINCFCRCRKLSYA 147

Query: 182 LNMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINS 241
            +    + + G+    + FNTLL+ +       +  ++ ++M+     P +IT+  ++N 
Sbjct: 148 FSTMGKIMKLGYEPDTVIFNTLLNGLCLECRVSEALELVDRMVEMGHKPTLITLNTLVNG 207

Query: 242 LCKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILD 301
           LC  GK+ +   ++ R+  +    + +  G ++  + + G+    + LL++M ++N+ LD
Sbjct: 208 LCLNGKVSDAVVLIDRMVETGFQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLD 267

Query: 302 DIAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLME 361
            + YS+I+    K G++ +A  +F+EM  +GF+A+   Y   IG  C  GR ++   L+ 
Sbjct: 268 AVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLR 327

Query: 362 EMENMGLKPYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKICEEG 421
           +M    + P   TF++LI+         E+ ++ ++M++RG  P+  ++N  I   C+E 
Sbjct: 328 DMIKRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKEN 387

Query: 422 DVKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFF 481
            +++A +M+ +++ KG  PD  T+  LI GY K     + L+L+ EM  R +      + 
Sbjct: 388 RLEEAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYN 447

Query: 482 ALIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEM 534
            L+  FCQSG+LE A+K  + M  R ++P +  Y+ L       G   KALE + ++
Sbjct: 448 TLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIVSYKILLDGLCDNGELEKALEIFGKI 499

BLAST of Cp4.1LG01g19790 vs. NCBI nr
Match: gi|659121166|ref|XP_008460527.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucumis melo])

HSP 1 Score: 895.2 bits (2312), Expect = 5.6e-257
Identity = 457/540 (84.63%), Positives = 491/540 (90.93%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60
           MA LRQKLS  +LSSSKIS C SIRKL S   AD+LIN+DA VNSIC+SFTRR+SWD L+
Sbjct: 1   MALLRQKLSPIVLSSSKISLCLSIRKLISTPVADDLINDDATVNSICESFTRRQSWDALS 60

Query: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120
           RKF+ LELND LVQKVLLKFQQ VDAK ALGFFHWSAKRKNFNHG QSYGIMIHILVKAR
Sbjct: 61  RKFQFLELNDLLVQKVLLKFQQPVDAKLALGFFHWSAKRKNFNHGSQSYGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALL+SILKKNEG+SF++S+VDSLLD+Y+VT SSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLQSILKKNEGNSFDYSVVDSLLDSYKVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240
             C+HLEERGFSLSLISFNTL+HV+EKSDENRKVWKIYEQMI KRVYPN ITVRIMINSL
Sbjct: 181 CFCSHLEERGFSLSLISFNTLIHVLEKSDENRKVWKIYEQMIGKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300
           CKEGKLQE SDML+RIHGSRCSASLIVN CLIYRILEEGRVEDGVMLLKRMLQKNM+LDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGVMLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360
           IAYSLIVYAK+K G+I S  EVF+EMSKRGFQANSFIYTL IG HCR G +EEAHCLM+E
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSKRGFQANSFIYTLLIGVHCRGGEVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCR---DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420
           MENMGLKPY ETFNLLIEGC     SEE LRMCEKMLERGF+PSCS FNVAIAKICEEGD
Sbjct: 361 MENMGLKPYSETFNLLIEGCAISGHSEEILRMCEKMLERGFLPSCSVFNVAIAKICEEGD 420

Query: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480
           VKKANE+LTILLDKGFLPDETTYTNLIIGY K GE  EILKLYYEM+ARLLSPG+SVFFA
Sbjct: 421 VKKANELLTILLDKGFLPDETTYTNLIIGYRKSGEILEILKLYYEMEARLLSPGISVFFA 480

Query: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMMFNG 538
           LIGS CQSGRLEEAEKYLKI+KD S+ PS+SIYQ L LFYLKKGNRAKALE YNEMMF+G
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPSLSIYQALILFYLKKGNRAKALELYNEMMFDG 540

BLAST of Cp4.1LG01g19790 vs. NCBI nr
Match: gi|778702614|ref|XP_004140361.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Cucumis sativus])

HSP 1 Score: 884.0 bits (2283), Expect = 1.3e-253
Identity = 453/540 (83.89%), Positives = 489/540 (90.56%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60
           MA LRQKLS  +LSS KIS C S+R L SI  AD+LIN+DA VNSICDS TRR+SWDTL+
Sbjct: 1   MALLRQKLSPIVLSS-KISLCLSMRNLISIPVADKLINDDATVNSICDSLTRRQSWDTLS 60

Query: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120
           RKF+ LELND LVQKVLLKFQQ VDAKRALGFFHWSAKRKNFNHG QS+GIMIHILVKAR
Sbjct: 61  RKFQFLELNDFLVQKVLLKFQQPVDAKRALGFFHWSAKRKNFNHGPQSFGIMIHILVKAR 120

Query: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180
           LV+DARALLESILKKNEG+SF++S+VDSL+D+YEVT SSPFVFDLL+QTCAKLRLIDFAL
Sbjct: 121 LVLDARALLESILKKNEGNSFDYSVVDSLMDSYEVTGSSPFVFDLLVQTCAKLRLIDFAL 180

Query: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240
            +C+HLEERGFSLSLISFNTL+HVVEKSDEN KVWKIYEQMIRKRVYPN ITVRIMINSL
Sbjct: 181 CVCSHLEERGFSLSLISFNTLIHVVEKSDENLKVWKIYEQMIRKRVYPNAITVRIMINSL 240

Query: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300
           CKEGKLQE SDML+RIHGSRCSASLIVN CLIYRILEEGRVEDG+ LLKRMLQKNM+LDD
Sbjct: 241 CKEGKLQETSDMLNRIHGSRCSASLIVNACLIYRILEEGRVEDGITLLKRMLQKNMVLDD 300

Query: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360
           IAYSLIVYAK+K G+I S  EVF+EMS+RGFQANSFIYTLFIG HCR G++EEAHCLM+E
Sbjct: 301 IAYSLIVYAKVKTGSITSTWEVFEEMSERGFQANSFIYTLFIGVHCRGGKVEEAHCLMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCR---DSEESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420
           MENMGLKPYPETFNLLIEGC     SEE L MCEKMLERGF+PSCS FNVAI KICE+GD
Sbjct: 361 MENMGLKPYPETFNLLIEGCAISGHSEEILSMCEKMLERGFLPSCSVFNVAIDKICEKGD 420

Query: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480
           VKKAN +LTILLDKGFLPDETTYTNLIIGY K GE QEILKLYYEM ARLLSPGVSVFFA
Sbjct: 421 VKKANALLTILLDKGFLPDETTYTNLIIGYRKSGEIQEILKLYYEMGARLLSPGVSVFFA 480

Query: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMMFNG 538
           LIGS CQSGRLEEAEKYLKI+KD S+ P +SIYQ L L YLKKGNRAKALE YNEMMF+G
Sbjct: 481 LIGSLCQSGRLEEAEKYLKIVKDSSLTPCLSIYQALILLYLKKGNRAKALELYNEMMFDG 539

BLAST of Cp4.1LG01g19790 vs. NCBI nr
Match: gi|1009126673|ref|XP_015880282.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial-like [Ziziphus jujuba])

HSP 1 Score: 612.8 bits (1579), Expect = 5.6e-172
Identity = 309/537 (57.54%), Positives = 412/537 (76.72%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60
           M+F+R  L   +++ S I  C  ++ +++ TAA++      +V +IC S     +WD L+
Sbjct: 1   MSFVRGLLIPSLIAKS-IKSCIHLQLIHTETAAND------VVKAICCSLRAGRNWDILS 60

Query: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120
           RKF +++L++ +V+KVLL+ ++ VDAKRALGFFHWSA      HG QSY I+IHILV+A 
Sbjct: 61  RKFGSVDLDEVVVKKVLLELKEPVDAKRALGFFHWSAHSTFQQHGLQSYCILIHILVRAG 120

Query: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180
           L +DARALLES+LKKN GSSF FS+VDSL+ +Y+VT S+PFVFD+L+Q  AKLR+ +   
Sbjct: 121 LNLDARALLESVLKKNSGSSFRFSVVDSLISSYKVTASNPFVFDMLVQVYAKLRMFEIGF 180

Query: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240
           ++C +L+ERGFSL+L SFN L+HVV+KSD+   VWKIYE MI +R+YPN  TVRI+IN+L
Sbjct: 181 DVCCYLDERGFSLNLSSFNILIHVVQKSDQFVLVWKIYEHMITRRMYPNEETVRILINAL 240

Query: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300
           CKEGKLQE  ++L RI G RCS S+IVN  L+ R+LEEGR+E+ ++LLKRMLQKNM+LD 
Sbjct: 241 CKEGKLQECVNILDRILGKRCSPSVIVNASLVLRVLEEGRIEESMVLLKRMLQKNMVLDT 300

Query: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360
           IAYSL+VYAK+KIGN++ A EVF+EM KRGFQ N F+YTLFIGAHC++GRIEE +C+M+E
Sbjct: 301 IAYSLVVYAKVKIGNLELAYEVFEEMLKRGFQPNPFVYTLFIGAHCKEGRIEEGNCMMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420
           MENMG KPY +T+N LIEG   +   EE LR  EKM+ERG +PSCS+FN  + K+CE GD
Sbjct: 361 MENMGFKPYGDTYNFLIEGSAKAGSLEEMLRNYEKMIERGMIPSCSTFNEMVGKLCENGD 420

Query: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480
           V++AN  LT LL+KGFL DE TY+ LI GYGK G+ QE+LKLYYEM+ + +SPG+ +F +
Sbjct: 421 VEEANRRLTALLEKGFLADEITYSYLIDGYGKKGDIQEVLKLYYEMEYKSMSPGLPIFSS 480

Query: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMM 535
           LI   C  G+LEEAEKYL IMKDRS+ PS  +Y+TL   Y+ +GN+ +A   Y++M+
Sbjct: 481 LIKGLCLCGKLEEAEKYLGIMKDRSLVPSECLYETLIAGYIAQGNKERAHFLYDDMV 530

BLAST of Cp4.1LG01g19790 vs. NCBI nr
Match: gi|1009126661|ref|XP_015880277.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial-like [Ziziphus jujuba])

HSP 1 Score: 612.5 bits (1578), Expect = 7.3e-172
Identity = 309/537 (57.54%), Positives = 411/537 (76.54%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60
           M+F+R  L   ++S S I  C  ++ +++ TA ++      +V +IC S     +WD L+
Sbjct: 1   MSFVRGLLIPSLISKS-IKSCIHLQLIHTETATND------VVKAICCSLRAGRNWDILS 60

Query: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120
           RKF +++L++ +V+KVLL+ ++ VDAKRALGFFHWSA      HG QSY I+IHILV+A 
Sbjct: 61  RKFGSVDLDEVVVKKVLLELKEPVDAKRALGFFHWSAHSTFQQHGLQSYCILIHILVRAG 120

Query: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180
           L +DARALLES+LKKN GSSF FS+VDSL+ +Y+VT S+PFVFD+L+Q  AKLR+ +   
Sbjct: 121 LNLDARALLESVLKKNSGSSFRFSVVDSLISSYKVTASNPFVFDMLVQVYAKLRMFEIGF 180

Query: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240
           ++C +L+ERGFSL+L SFN L+HVV+KSD+   VWKIYE MI +R+YPN  TVRI+IN+L
Sbjct: 181 DVCCYLDERGFSLNLSSFNILIHVVQKSDQFVLVWKIYEHMITRRMYPNEETVRILINAL 240

Query: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300
           CKEGKLQE  ++L RI G RCS S+IVN  L+ R+LEEGR+E+ ++LLKRMLQKNM+LD 
Sbjct: 241 CKEGKLQECVNILDRILGKRCSPSVIVNASLVLRVLEEGRIEESMVLLKRMLQKNMVLDT 300

Query: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360
           IAYSL+VYAK+KIGN++ A EVF+EM KRGFQ N F+YTLFIGAHC++GRIEE +C+M+E
Sbjct: 301 IAYSLVVYAKVKIGNLELAYEVFEEMLKRGFQPNPFVYTLFIGAHCKEGRIEEGNCMMQE 360

Query: 361 MENMGLKPYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420
           MENMG KPY +T+N LIEG   +   EE LR  EKM+ERG +PSCS+FN  + K+CE GD
Sbjct: 361 MENMGFKPYGDTYNFLIEGSAKAGSLEEMLRNYEKMIERGMIPSCSTFNEMVGKLCENGD 420

Query: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480
           V++AN  LT LL+KGFL DE TY+ LI GYGK G+ QE+LKLYYEM+ + +SPG+ +F +
Sbjct: 421 VEEANRRLTALLEKGFLADEITYSYLIDGYGKKGDIQEVLKLYYEMEYKSMSPGLPIFSS 480

Query: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEMM 535
           LI   C  G+LEEAEKYL IMKDRS+ PS  +Y+TL   Y+ +GN+ +A   Y++M+
Sbjct: 481 LIKGLCLCGKLEEAEKYLGIMKDRSLVPSECLYETLIAGYIAQGNKERAHFLYDDMV 530

BLAST of Cp4.1LG01g19790 vs. NCBI nr
Match: gi|645241556|ref|XP_008227131.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial [Prunus mume])

HSP 1 Score: 605.1 bits (1559), Expect = 1.2e-169
Identity = 308/536 (57.46%), Positives = 403/536 (75.19%), Query Frame = 1

Query: 1   MAFLRQKLSTFILSSSKISFCSSIRKLNSITAADELINNDAIVNSICDSFTRRESWDTLT 60
           MAFL +++S  ++    I      R +N+ +A  +      +  +I DSF    +WDTLT
Sbjct: 1   MAFLLRRISKSLIP---IPINQETRLINTDSAPKD------VAKAIRDSFRISWNWDTLT 60

Query: 61  RKFETLELNDSLVQKVLLKFQQLVDAKRALGFFHWSAKRKNFNHGFQSYGIMIHILVKAR 120
            KFE+++L+  LV+ VLL+ ++ +DAKRALGFFHW+A RK+F HG  SY I IHIL +AR
Sbjct: 61  TKFESVKLDGGLVESVLLELKEPIDAKRALGFFHWAAHRKSFEHGVWSYSITIHILARAR 120

Query: 121 LVIDARALLESILKKNEGSSFNFSIVDSLLDTYEVTDSSPFVFDLLIQTCAKLRLIDFAL 180
           L++DARALLES+LKK   +   FS+VDSLL +YEVT S+PFVFDLL+Q  AKLR+ +   
Sbjct: 121 LLMDARALLESVLKKTAENGSKFSVVDSLLSSYEVTASNPFVFDLLLQAYAKLRMFETGF 180

Query: 181 NMCAHLEERGFSLSLISFNTLLHVVEKSDENRKVWKIYEQMIRKRVYPNVITVRIMINSL 240
           ++C +L E G  LSLI++NTLLHVV+KSD+   VWKIYE M+ KR YPN +T++I+I++L
Sbjct: 181 DVCCYLGEHGLPLSLITYNTLLHVVQKSDQTALVWKIYEHMVGKRNYPNEVTIKILIDAL 240

Query: 241 CKEGKLQEISDMLSRIHGSRCSASLIVNGCLIYRILEEGRVEDGVMLLKRMLQKNMILDD 300
           CKEGKL++  DML RIHG RCS S+IVN  L++ ILE+GRVE+G+MLL+RMLQKNM+LD 
Sbjct: 241 CKEGKLKKYVDMLDRIHGKRCSPSVIVNTSLVFSILEDGRVEEGLMLLRRMLQKNMVLDT 300

Query: 301 IAYSLIVYAKLKIGNIKSAQEVFDEMSKRGFQANSFIYTLFIGAHCRDGRIEEAHCLMEE 360
           IAYSLIVYAK+K G++ SA EV++EM KRGF+ANSF+YTLF+GAHC  GRIEEA  +M E
Sbjct: 301 IAYSLIVYAKVKQGDVCSAWEVYEEMLKRGFRANSFVYTLFMGAHCEGGRIEEAQSMMNE 360

Query: 361 MENMGLKPYPETFNLLIEGCRDS---EESLRMCEKMLERGFVPSCSSFNVAIAKICEEGD 420
           MENM LKP+ E++NLLIEGC  +   E SL   +KM+E GF+P  S+FN  + K+CE GD
Sbjct: 361 MENMDLKPFDESYNLLIEGCAKAGRVEASLSYLKKMVENGFIPCRSAFNEMVGKLCETGD 420

Query: 421 VKKANEMLTILLDKGFLPDETTYTNLIIGYGKIGETQEILKLYYEMDARLLSPGVSVFFA 480
            ++AN M TILLDKGFLPD  TY +LI GYG+ GE QE++KLYYEM++R LSPG  VF +
Sbjct: 421 AEQANTMFTILLDKGFLPDSITYAHLIDGYGRKGEIQEVVKLYYEMESRSLSPGALVFTS 480

Query: 481 LIGSFCQSGRLEEAEKYLKIMKDRSIQPSVSIYQTLSLFYLKKGNRAKALEHYNEM 534
           +I SFCQ G++EEAE+YL IMKDRSI PS+ +Y+TL   +  KGN  +AL   NEM
Sbjct: 481 VIKSFCQCGKVEEAERYLGIMKDRSIAPSLCVYETLIANHFDKGNAERALHLKNEM 527

The following BLAST results are available for this feature:
Match Name     E-value     Identity   Description
PP107_ARATH    1.2e-134    48.99      Pentatricopeptide repeat-containing protein At1g66345, mitochondrial OS=Arabidop...
PP432_ARATH    2.3e-48     28.20      Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN...
PP407_ARATH    6.2e-46     25.70      Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PP143_ARATH    1.6e-41     26.92      Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th...
PPR39_ARATH    3.5e-41     25.90      Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop...

Match Name          E-value     Identity   Description
A0A0A0KRP7_CUCSA    9.0e-254    83.89      Uncharacterized protein OS=Cucumis sativus GN=Csa_5G423870 PE=4 SV=1
B9H373_POPTR        3.4e-168    56.43      Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP...
A5AQQ7_VITVI        7.6e-168    56.80      Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_012747 PE=4 SV=1
B9SYW9_RICCO        3.9e-164    56.66      Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
M5X9J7_PRUPE        3.3e-163    60.25      Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa014874mg PE=4 S...

Match Name     E-value     Identity   Description
AT1G66345.1    6.8e-136    48.99      Pentatricopeptide repeat (PPR) superfamily protein
AT5G55840.1    1.3e-49     28.20      Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1    3.5e-47     25.70      Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02150.1    8.9e-43     26.92      Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G12775.1    2.0e-42     25.90      Pentatricopeptide repeat (PPR) superfamily protein

Match Name                           E-value     Identity   Description
gi|659121166|ref|XP_008460527.1|     5.6e-257    84.63      PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial ...
gi|778702614|ref|XP_004140361.2|     1.3e-253    83.89      PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial ...
gi|1009126673|ref|XP_015880282.1|    5.6e-172    57.54      PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial-...
gi|1009126661|ref|XP_015880277.1|    7.3e-172    57.54      PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial-...
gi|645241556|ref|XP_008227131.1|     1.2e-169    57.46      PREDICTED: pentatricopeptide repeat-containing protein At1g66345, mitochondrial ...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term          Definition
IPR011990     TPR-like_helical_dom_sf
IPR002885     Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG01g19790.1    Cp4.1LG01g19790.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                          Source      Source Term         Source Description    Alignment
IPR002885    Pentatricopeptide repeat                 PFAM        PF01535             PPR
    coord: 108..136  score: 0.21
    coord: 302..331  score: 0.045
    coord: 439..464  score: 3.1E-4
    coord: 477..502  score: 1.
IPR002885    Pentatricopeptide repeat                 PFAM        PF13041             PPR_2
    coord: 334..380  score: 2.9E-10
    coord: 196..242  score: 1.1
IPR002885    Pentatricopeptide repeat                 TIGRFAMs    TIGR00756           TIGR00756
    coord: 439..465  score: 0.0018
    coord: 477..506  score: 2.4E-5
    coord: 337..368  score: 1.
IPR002885    Pentatricopeptide repeat                 PROFILE     PS51375             PPR
    coord: 436..470  score: 9.931
    coord: 299..333  score: 9.69
    coord: 264..298  score: 6.478
    coord: 105..139  score: 6.237
    coord: 194..228  score: 8.835
    coord: 401..435  score: 8.659
    coord: 334..368  score: 12.156
    coord: 471..505  score: 10.6
    coord: 159..193  score: 7.618
    coord: 506..537  score: 7.18
    coord: 229..263  score: 8
IPR011990    Tetratricopeptide-like helical domain    GENE3D      G3DSA:1.25.40.10
    coord: 277..386  score: 4.4E-7
    coord: 439..532  score: 4.
None         No IPR available                         PANTHER     PTHR24015           FAMILY NOT NAMED
    coord: 80..135  score: 3.7E-232
    coord: 154..533  score: 3.7E-232
    coord: 7..62  score: 3.7E
None         No IPR available                         PANTHER     PTHR24015:SF686     SUBFAMILY NOT NAMED
    coord: 154..533  score: 3.7E-232
    coord: 80..135  score: 3.7E-232
    coord: 7..62  score: 3.7E
None         No IPR available                         unknown     SSF81901            HCP-like
    coord: 381..532  score: 3.0
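
The Alignment entries above pair a residue range (`coord: start..end`) with a score, and they are not listed in sequence order. As a rough sketch, assuming the entries for one source term have been copied as text, the following Python collects the pairs, sorts them by start position, and reports the length of each span. `parse_hits` and `span_length` are hypothetical names, and scores are kept as strings because some values on this page are cut off.

```python
import re

# Matches "coord: <start>..<end>" followed by "score: <token>".
# The score token ends at whitespace, at end of text, or where the next
# "coord:" begins, so entries run together without a separator still parse.
HIT_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*(\S+?)(?=coord:|\s|$)")


def parse_hits(text):
    """Return (start, end, score_text) tuples sorted by start position."""
    hits = [(int(start), int(end), score) for start, end, score in HIT_RE.findall(text)]
    return sorted(hits)


def span_length(start, end):
    """Number of residues covered by an inclusive coordinate range."""
    return end - start + 1


# Three of the PS51375 (PPR profile) hits listed above.
profile_hits = parse_hits(
    "coord: 436..470 score: 9.931 "
    "coord: 299..333 score: 9.69 "
    "coord: 264..298 score: 6.478"
)
for start, end, score in profile_hits:
    print(start, end, span_length(start, end), score)
```

Applied to the full PS51375 list, this shows that most hits span 35 residues and that several of them follow one another directly along the protein.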