Cp4.1LG03g05680 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG03g05680
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein 11
LocationCp4.1LG03 : 4620193 .. 4621647 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTAATTTCCTCTGCGCCCATCAAATGTTCGAAGAATCACAATCGATTGTCCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGATCTTCGCCGCGATTCTTGAAATTACAGATACGCGTTGTTCGAATTTTGTATTTGATGCTTTGATGATTGCGTATTCGGATTCTGGGTTCATATCCGATGCGATTCAGTGCTTTAGGTTGGTCAGGAAGAGAAATTTTCAAATCCCGTTTCGTGGATGTGAGTACTTACTTGATAAAATGATGAATTCAAACTCCCCTGTTACGATTTGGACGTTTTATCTGGAAATTTTGGATTCTGGATTCCCACCTAAAGTAAAGTATTTCAACATTTTGATTAATAAGTTCTGTAAACAGGGAAGCATTAGAGATGCCAGGTTGATCTTCGATGAAATTGGGAAGAGGGGTTTTCGTCCCACAGCTGTTAGTTTCAATACCTTGATTAATGGTTTCTGTAAATCCCGAAATTTAGATGAGTGTTTTAGGTTGAAGAAAGTCATGGAAGAGAGTAGAATATATCCTGATGTTTACACTTACAGTGTTCTGATTCATGGGTTATGCAAGGAAGGTAAGGTAGATGATGCAGAACAACTGTTCGATGAAATGCGTCAGAGAGGATTGAGGGCAAACGACGTTACATTCACTGCTTTGATTGATGGGCAATGCAGGAGCGGACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAGCCATGGGAGTGAAACCAGATTTAGTTATGTATAACACACTCTTGAATGGCCTTTGCAAAGTGGGGGATGTTAGTAAAGCTAGGAAGCTAGTTGATGAAATGAAAATGGTGGGGATGAAACCAGATAAAATCACTTACACAACACTCATAGATGGTTACTGCAAAGAGGGAGATTTAGAATCAGCCATGGAGATTAGGAAAGGTATGAATGAAGAAGGGGTTGTTCTTGATAATGTAGCATTCACAGCCATTATTTCAGGTTTGTGTAGAGATGGAAGGGTGATGGATGCAGAGAGGACGTTGAGGGAGATGAAGGAAGCTGGGATGAAACCCGACGACGCGACGTATACTATGGTGATCGACGGGTATTGCAAGAACGGCGATGTTAAGACAGGGTTTAAGCTGCTGAAAGAGATGCAGAGAAATGGCCATAATCCTGGTGTGATAACTTACAATGTGCTTATGAATGGACTTTGCAAGCAAGGACAGATGAAGAATGCCAATATGCTGTTGGAAGCAATGCTTAACTTAGGAGTAACTCCTGATGACATTACATACAATATTCTGTTGGAAGGGCACTGTAAGAGTGGAAGAGCTGAAGATTTCCTTCACCTCAGAAATGAGAAGGGGGTCGTAGTAGACTACGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCGTTAAAGGATCGTCGAAAGAGGTGA

mRNA sequence

ATGGCTAATTTCCTCTGCGCCCATCAAATGTTCGAAGAATCACAATCGATTGTCCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGATCTTCGCCGCGATTCTTGAAATTACAGATACGCGTTGTTCGAATTTTGTATTTGATGCTTTGATGATTGCGTATTCGGATTCTGGGTTCATATCCGATGCGATTCAGTGCTTTAGGTTGGTCAGGAAGAGAAATTTTCAAATCCCGTTTCGTGGATGTGAGTACTTACTTGATAAAATGATGAATTCAAACTCCCCTGTTACGATTTGGACGTTTTATCTGGAAATTTTGGATTCTGGATTCCCACCTAAAGTAAAGTATTTCAACATTTTGATTAATAAGTTCTGTAAACAGGGAAGCATTAGAGATGCCAGGTTGATCTTCGATGAAATTGGGAAGAGGGGTTTTCGTCCCACAGCTGTTAGTTTCAATACCTTGATTAATGGTTTCTGTAAATCCCGAAATTTAGATGAGTGTTTTAGGTTGAAGAAAGTCATGGAAGAGAGTAGAATATATCCTGATGTTTACACTTACAGTGTTCTGATTCATGGGTTATGCAAGGAAGGTAAGGTAGATGATGCAGAACAACTGTTCGATGAAATGCGTCAGAGAGGATTGAGGGCAAACGACGTTACATTCACTGCTTTGATTGATGGGCAATGCAGGAGCGGACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAGCCATGGGAGTGAAACCAGATTTAGTTATGTATAACACACTCTTGAATGGCCTTTGCAAAGTGGGGGATGTTAGTAAAGCTAGGAAGCTAGTTGATGAAATGAAAATGGTGGGGATGAAACCAGATAAAATCACTTACACAACACTCATAGATGGTTACTGCAAAGAGGGAGATTTAGAATCAGCCATGGAGATTAGGAAAGGTATGAATGAAGAAGGGGTTGTTCTTGATAATGTAGCATTCACAGCCATTATTTCAGGTTTGTGTAGAGATGGAAGGGTGATGGATGCAGAGAGGACGTTGAGGGAGATGAAGGAAGCTGGGATGAAACCCGACGACGCGACGTATACTATGGTGATCGACGGGTATTGCAAGAACGGCGATGTTAAGACAGGGTTTAAGCTGCTGAAAGAGATGCAGAGAAATGGCCATAATCCTGGTGTGATAACTTACAATGTGCTTATGAATGGACTTTGCAAGCAAGGACAGATGAAGAATGCCAATATGCTGTTGGAAGCAATGCTTAACTTAGGAGTAACTCCTGATGACATTACATACAATATTCTGTTGGAAGGGCACTGTAAGAGTGGAAGAGCTGAAGATTTCCTTCACCTCAGAAATGAGAAGGGGGTCGTAGTAGACTACGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCGTTAAAGGATCGTCGAAAGAGGTGA

Coding sequence (CDS)

ATGGCTAATTTCCTCTGCGCCCATCAAATGTTCGAAGAATCACAATCGATTGTCCGATTTCTCGTCTCCCGCAAAGGTAAGGACTCGGCGGCTTCGATCTTCGCCGCGATTCTTGAAATTACAGATACGCGTTGTTCGAATTTTGTATTTGATGCTTTGATGATTGCGTATTCGGATTCTGGGTTCATATCCGATGCGATTCAGTGCTTTAGGTTGGTCAGGAAGAGAAATTTTCAAATCCCGTTTCGTGGATGTGAGTACTTACTTGATAAAATGATGAATTCAAACTCCCCTGTTACGATTTGGACGTTTTATCTGGAAATTTTGGATTCTGGATTCCCACCTAAAGTAAAGTATTTCAACATTTTGATTAATAAGTTCTGTAAACAGGGAAGCATTAGAGATGCCAGGTTGATCTTCGATGAAATTGGGAAGAGGGGTTTTCGTCCCACAGCTGTTAGTTTCAATACCTTGATTAATGGTTTCTGTAAATCCCGAAATTTAGATGAGTGTTTTAGGTTGAAGAAAGTCATGGAAGAGAGTAGAATATATCCTGATGTTTACACTTACAGTGTTCTGATTCATGGGTTATGCAAGGAAGGTAAGGTAGATGATGCAGAACAACTGTTCGATGAAATGCGTCAGAGAGGATTGAGGGCAAACGACGTTACATTCACTGCTTTGATTGATGGGCAATGCAGGAGCGGACGAATTGACTCAGCCATGAACACTTATCAGCAAATGTTAGCCATGGGAGTGAAACCAGATTTAGTTATGTATAACACACTCTTGAATGGCCTTTGCAAAGTGGGGGATGTTAGTAAAGCTAGGAAGCTAGTTGATGAAATGAAAATGGTGGGGATGAAACCAGATAAAATCACTTACACAACACTCATAGATGGTTACTGCAAAGAGGGAGATTTAGAATCAGCCATGGAGATTAGGAAAGGTATGAATGAAGAAGGGGTTGTTCTTGATAATGTAGCATTCACAGCCATTATTTCAGGTTTGTGTAGAGATGGAAGGGTGATGGATGCAGAGAGGACGTTGAGGGAGATGAAGGAAGCTGGGATGAAACCCGACGACGCGACGTATACTATGGTGATCGACGGGTATTGCAAGAACGGCGATGTTAAGACAGGGTTTAAGCTGCTGAAAGAGATGCAGAGAAATGGCCATAATCCTGGTGTGATAACTTACAATGTGCTTATGAATGGACTTTGCAAGCAAGGACAGATGAAGAATGCCAATATGCTGTTGGAAGCAATGCTTAACTTAGGAGTAACTCCTGATGACATTACATACAATATTCTGTTGGAAGGGCACTGTAAGAGTGGAAGAGCTGAAGATTTCCTTCACCTCAGAAATGAGAAGGGGGTCGTAGTAGACTACGCGTATTATACTTCTTTAGTCGGTGAATACGATAAATCGTTAAAGGATCGTCGAAAGAGGTGA

Protein sequence

MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEESRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPDDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKDRRKR
BLAST of Cp4.1LG03g05680 vs. Swiss-Prot
Match: PPR26_ARATH (Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis thaliana GN=At1g09680 PE=3 SV=1)

HSP 1 Score: 607.4 bits (1565), Expect = 1.3e-172
Identity = 290/484 (59.92%), Postives = 367/484 (75.83%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           +A FL  H+MF E+QS++  +VSRKGK+SA+S+F +++E+  T    F+ DALMI Y+D 
Sbjct: 124 LARFLAVHEMFTEAQSLIELVVSRKGKNSASSVFISLVEMRVTPMCGFLVDALMITYTDL 183

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GFI DAIQCFRL RK  F +P RGC  LLD+MM  N   TIW FY+EILD+GFP  V  F
Sbjct: 184 GFIPDAIQCFRLSRKHRFDVPIRGCGNLLDRMMKLNPTGTIWGFYMEILDAGFPLNVYVF 243

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           NIL+NKFCK+G+I DA+ +FDEI KR  +PT VSFNTLING+CK  NLDE FRLK  ME+
Sbjct: 244 NILMNKFCKEGNISDAQKVFDEITKRSLQPTVVSFNTLINGYCKVGNLDEGFRLKHQMEK 303

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           SR  PDV+TYS LI+ LCKE K+D A  LFDEM +RGL  NDV FT LI G  R+G ID 
Sbjct: 304 SRTRPDVFTYSALINALCKENKMDGAHGLFDEMCKRGLIPNDVIFTTLIHGHSRNGEIDL 363

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
              +YQ+ML+ G++PD+V+YNTL+NG CK GD+  AR +VD M   G++PDKITYTTLID
Sbjct: 364 MKESYQKMLSKGLQPDIVLYNTLVNGFCKNGDLVAARNIVDGMIRRGLRPDKITYTTLID 423

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G+C+ GD+E+A+EIRK M++ G+ LD V F+A++ G+C++GRV+DAER LREM  AG+KP
Sbjct: 424 GFCRGGDVETALEIRKEMDQNGIELDRVGFSALVCGMCKEGRVIDAERALREMLRAGIKP 483

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DD TYTM++D +CK GD +TGFKLLKEMQ +GH P V+TYNVL+NGLCK GQMKNA+MLL
Sbjct: 484 DDVTYTMMMDAFCKKGDAQTGFKLLKEMQSDGHVPSVVTYNVLLNGLCKLGQMKNADMLL 543

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           +AMLN+GV PDDITYN LLEGH +   +      + E G+V D A Y S+V E D++ KD
Sbjct: 544 DAMLNIGVVPDDITYNTLLEGHHRHANSSKRYIQKPEIGIVADLASYKSIVNELDRASKD 603

Query: 481 RRKR 485
            R R
Sbjct: 604 HRNR 607

BLAST of Cp4.1LG03g05680 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 8.2e-82
Identity = 157/488 (32.17%), Postives = 275/488 (56.35%), Query Frame = 1

Query: 3   NFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRC--SNFVFDALMIAYSDS 62
           + L   ++++ +Q +   + ++   D  AS+    L+ T   C  ++ VFD ++ +YS  
Sbjct: 88  HILTKFKLYKTAQILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRL 147

Query: 63  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIW-TFYLEILDSGFPPKVKY 122
             I  A+    L +   F         +LD  + S   ++     + E+L+S   P V  
Sbjct: 148 SLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFT 207

Query: 123 FNILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVME 182
           +NILI  FC  G+I  A  +FD++  +G  P  V++NTLI+G+CK R +D+ F+L + M 
Sbjct: 208 YNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMA 267

Query: 183 ESRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRID 242
              + P++ +Y+V+I+GLC+EG++ +   +  EM +RG   ++VT+  LI G C+ G   
Sbjct: 268 LKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFH 327

Query: 243 SAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLI 302
            A+  + +ML  G+ P ++ Y +L++ +CK G++++A + +D+M++ G+ P++ TYTTL+
Sbjct: 328 QALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLV 387

Query: 303 DGYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMK 362
           DG+ ++G +  A  + + MN+ G     V + A+I+G C  G++ DA   L +MKE G+ 
Sbjct: 388 DGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLS 447

Query: 363 PDDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANML 422
           PD  +Y+ V+ G+C++ DV    ++ +EM   G  P  ITY+ L+ G C+Q + K A  L
Sbjct: 448 PDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDL 507

Query: 423 LEAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRN---EKGVVVDYAYYTSLVGEYDK 482
            E ML +G+ PD+ TY  L+  +C  G  E  L L N   EKGV+ D   Y+ L+   +K
Sbjct: 508 YEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNK 567

Query: 483 SLKDRRKR 485
             + R  +
Sbjct: 568 QSRTREAK 575

BLAST of Cp4.1LG03g05680 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 276.9 bits (707), Expect = 4.1e-73
Identity = 148/479 (30.90%), Postives = 254/479 (53.03%), Query Frame = 1

Query: 3   NFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSN-FVFDALMIAYSDSG 62
           + L   +M++ ++ I++ L    GK S   +F A++       SN  V+D L+  Y   G
Sbjct: 80  HILVRARMYDPARHILKELSLMSGKSSF--VFGALMTTYRLCNSNPSVYDILIRVYLREG 139

Query: 63  FISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFN 122
            I D+++ FRL+    F      C  +L  ++ S   V++W+F  E+L     P V  FN
Sbjct: 140 MIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFN 199

Query: 123 ILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEES 182
           ILIN  C +GS   +  +  ++ K G+ PT V++NT+++ +CK         L   M+  
Sbjct: 200 ILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSK 259

Query: 183 RIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSA 242
            +  DV TY++LIH LC+  ++     L  +MR+R +  N+VT+  LI+G    G++  A
Sbjct: 260 GVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIA 319

Query: 243 MNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDG 302
                +ML+ G+ P+ V +N L++G    G+  +A K+   M+  G+ P +++Y  L+DG
Sbjct: 320 SQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDG 379

Query: 303 YCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPD 362
            CK  + + A      M   GV +  + +T +I GLC++G + +A   L EM + G+ PD
Sbjct: 380 LCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPD 439

Query: 363 DATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLE 422
             TY+ +I+G+CK G  KT  +++  + R G +P  I Y+ L+   C+ G +K A  + E
Sbjct: 440 IVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYE 499

Query: 423 AMLNLGVTPDDITYNILLEGHCKSGR---AEDFLHLRNEKGVVVDYAYYTSLVGEYDKS 478
           AM+  G T D  T+N+L+   CK+G+   AE+F+      G++ +   +  L+  Y  S
Sbjct: 500 AMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNS 556

BLAST of Cp4.1LG03g05680 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 274.2 bits (700), Expect = 2.6e-72
Identity = 137/430 (31.86%), Postives = 237/430 (55.12%), Query Frame = 1

Query: 49  VFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNS-NSPVTIWTFYLE 108
           VFD       D G + +A + F  +      +    C   L ++        T    + E
Sbjct: 177 VFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRLSKDCYKTATAIIVFRE 236

Query: 109 ILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRN 168
             + G    V  +NI+I+  C+ G I++A  +   +  +G+ P  +S++T++NG+C+   
Sbjct: 237 FPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGE 296

Query: 169 LDECFRLKKVMEESRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTA 228
           LD+ ++L +VM+   + P+ Y Y  +I  LC+  K+ +AE+ F EM ++G+  + V +T 
Sbjct: 297 LDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTT 356

Query: 229 LIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVG 288
           LIDG C+ G I +A   + +M +  + PD++ Y  +++G C++GD+ +A KL  EM   G
Sbjct: 357 LIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKG 416

Query: 289 MKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAE 348
           ++PD +T+T LI+GYCK G ++ A  +   M + G   + V +T +I GLC++G +  A 
Sbjct: 417 LEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSAN 476

Query: 349 RTLREMKEAGMKPDDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGL 408
             L EM + G++P+  TY  +++G CK+G+++   KL+ E +  G N   +TY  LM+  
Sbjct: 477 ELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAY 536

Query: 409 CKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRN---EKGVVVDY 468
           CK G+M  A  +L+ ML  G+ P  +T+N+L+ G C  G  ED   L N    KG+  + 
Sbjct: 537 CKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNA 596

Query: 469 AYYTSLVGEY 475
             + SLV +Y
Sbjct: 597 TTFNSLVKQY 606

BLAST of Cp4.1LG03g05680 vs. Swiss-Prot
Match: PPR27_ARATH (Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana GN=At1g09820 PE=2 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 5.2e-68
Identity = 138/452 (30.53%), Postives = 248/452 (54.87%), Query Frame = 1

Query: 24  RKGKDSAA-SIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPF 83
           R G D    SIF AI    +   ++ + D L++AY+++       + F+      +++  
Sbjct: 129 RNGSDHQVHSIFHAISMCDNVCVNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSA 188

Query: 84  RGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDE 143
             C+ L+  ++  N    +   Y E++     P V  FN++IN  CK G +  AR + ++
Sbjct: 189 LSCKPLMIALLKENRSADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMED 248

Query: 144 IGKRGFRPTAVSFNTLINGFCKSRNLDECFR---LKKVMEESRIYPDVYTYSVLIHGLCK 203
           +   G  P  VS+NTLI+G+CK     + ++   + K M E+ + P++ T+++LI G  K
Sbjct: 249 MKVYGCSPNVVSYNTLIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWK 308

Query: 204 EGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVM 263
           +  +  + ++F EM  + ++ N +++ +LI+G C  G+I  A++   +M++ GV+P+L+ 
Sbjct: 309 DDNLPGSMKVFKEMLDQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLIT 368

Query: 264 YNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMN 323
           YN L+NG CK   + +A  +   +K  G  P    Y  LID YCK G ++    +++ M 
Sbjct: 369 YNALINGFCKNDMLKEALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEME 428

Query: 324 EEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPDDATYTMVIDGYCKNGDVK 383
            EG+V D   +  +I+GLCR+G +  A++   ++   G+ PD  T+ ++++GYC+ G+ +
Sbjct: 429 REGIVPDVGTYNCLIAGLCRNGNIEAAKKLFDQLTSKGL-PDLVTFHILMEGYCRKGESR 488

Query: 384 TGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNA-NMLLEAMLNLGVTPDDITYNIL 443
               LLKEM + G  P  +TYN++M G CK+G +K A NM  +      +  +  +YN+L
Sbjct: 489 KAAMLLKEMSKMGLKPRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVL 548

Query: 444 LEGHCKSGRAEDFLHLRN---EKGVVVDYAYY 468
           L+G+ + G+ ED   L N   EKG+V +   Y
Sbjct: 549 LQGYSQKGKLEDANMLLNEMLEKGLVPNRITY 579

BLAST of Cp4.1LG03g05680 vs. TrEMBL
Match: M5XHR3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024153mg PE=4 SV=1)

HSP 1 Score: 728.8 bits (1880), Expect = 4.4e-207
Identity = 344/484 (71.07%), Postives = 414/484 (85.54%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           MA+FLCAHQM+ ++QS++R +VSRKGK++A+S+FA+ILE   T  SN+VFDALM AY D 
Sbjct: 122 MAHFLCAHQMYPQAQSLLRIVVSRKGKETASSVFASILETRGTHQSNYVFDALMNAYVDC 181

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GF+SDA QCFRL+RK NF+IPF  C  LLDKM+  NSPV  W FYLEILDSGFPPKV  F
Sbjct: 182 GFVSDACQCFRLLRKHNFRIPFHACGCLLDKMLKLNSPVVAWGFYLEILDSGFPPKVYNF 241

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           N+L++K CK+G IR+A+L+FDEIGKRG  PT VSFNTLING+CKSRNL+ECFRLK+ MEE
Sbjct: 242 NVLMHKLCKEGEIREAQLVFDEIGKRGLLPTVVSFNTLINGYCKSRNLEECFRLKRDMEE 301

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           SR  PDV+TYSVLI+GLCKE ++DDA  LFDEM +RGL  N+VT+T LIDGQC++GRID 
Sbjct: 302 SRTRPDVFTYSVLINGLCKELRLDDANLLFDEMCERGLVPNNVTYTTLIDGQCKNGRIDL 361

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           AM  YQ+ML +G+KPD++ YNTL+NGLCKVGD+ +ARKLV+EM + G+KPD ITYTTLID
Sbjct: 362 AMEVYQKMLGIGIKPDVITYNTLINGLCKVGDLKEARKLVEEMNIAGLKPDTITYTTLID 421

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G CKEG+L+SA+EIRKGM ++G+ LDNVAFTA+ISGLCR+G+ +DAERTLREM  +GMKP
Sbjct: 422 GCCKEGNLQSALEIRKGMIKQGIELDNVAFTALISGLCREGKTLDAERTLREMLNSGMKP 481

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTM+IDG+CK GDVK GFKLLKEMQ +G+ P V+TYN LMNGLCK GQMKNANMLL
Sbjct: 482 DDATYTMIIDGFCKKGDVKMGFKLLKEMQGDGYVPSVVTYNALMNGLCKLGQMKNANMLL 541

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           +AM+NLGV PDDITYNILLEGHCK G  EDF  LR+ KG+V+DYA YTSLV E++KS KD
Sbjct: 542 DAMINLGVAPDDITYNILLEGHCKHGNPEDFDKLRSGKGLVLDYASYTSLVSEFNKSSKD 601

Query: 481 RRKR 485
           RRKR
Sbjct: 602 RRKR 605

BLAST of Cp4.1LG03g05680 vs. TrEMBL
Match: D7TSI7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0006g00170 PE=4 SV=1)

HSP 1 Score: 682.6 bits (1760), Expect = 3.6e-193
Identity = 326/484 (67.36%), Postives = 393/484 (81.20%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           M +FLC H+M  E+QS+++F+VSRKGK+SA+S+F ++LE   T  SN VF  LM AY+DS
Sbjct: 109 MTHFLCTHKMLSEAQSLLQFVVSRKGKNSASSVFTSVLEARGTHQSNLVFSVLMNAYTDS 168

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           G+ SDAIQCFRLVRK N QIPF  C YL D++M  N     W FY EILD G+PP V  F
Sbjct: 169 GYFSDAIQCFRLVRKHNLQIPFHSCGYLFDRLMKLNLTSPAWAFYEEILDCGYPPDVCKF 228

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           N+L+++ CK+  I +A+L+F EIGKRG RPT VSFNTLING+CKS NLD+ FRLK+ M E
Sbjct: 229 NVLMHRLCKEHKINEAQLLFGEIGKRGLRPTVVSFNTLINGYCKSGNLDQGFRLKRFMME 288

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           +R++PDV+TYSVLI+GLCKEG++DDA +LF EM  RGL  NDVTFT LI+G C +GR D 
Sbjct: 289 NRVFPDVFTYSVLINGLCKEGQLDDANKLFLEMCDRGLVPNDVTFTTLINGHCVTGRADL 348

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
            M  YQQML  GVKPD++ YNTL+NGLCKVGD+ +A+KLV EM   G+KPDK TYT LID
Sbjct: 349 GMEIYQQMLRKGVKPDVITYNTLINGLCKVGDLREAKKLVIEMTQRGLKPDKFTYTMLID 408

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G CKEGDLESA+EIRK M +EG+ LDNVAFTA+ISG CR+G+V++AERTLREM EAG+KP
Sbjct: 409 GCCKEGDLESALEIRKEMVKEGIELDNVAFTALISGFCREGQVIEAERTLREMLEAGIKP 468

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTMVI G+CK GDVKTGFKLLKEMQ +GH PGV+TYNVL+NGLCKQGQMKNANMLL
Sbjct: 469 DDATYTMVIHGFCKKGDVKTGFKLLKEMQCDGHVPGVVTYNVLLNGLCKQGQMKNANMLL 528

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           +AMLNLGV PDDITYNILLEGHCK G  EDF  L++EKG+V DY  YTSL+G+  K+ K+
Sbjct: 529 DAMLNLGVVPDDITYNILLEGHCKHGNREDFDKLQSEKGLVQDYGSYTSLIGDLRKTCKE 588

Query: 481 RRKR 485
           R+KR
Sbjct: 589 RQKR 592

BLAST of Cp4.1LG03g05680 vs. TrEMBL
Match: A0A061EF62_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_017452 PE=4 SV=1)

HSP 1 Score: 667.9 bits (1722), Expect = 9.2e-189
Identity = 317/473 (67.02%), Postives = 388/473 (82.03%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           MA+FL AH+MF ++QS++ FLVSRKGK SA+ +F +I+E   T    FVFD+LMIAY D 
Sbjct: 117 MAHFLIAHKMFHQAQSLLHFLVSRKGKGSASLVFTSIIETKGTHQCGFVFDSLMIAYKDL 176

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GF+ DAIQCFRLVRK  F++PF+GC+YLLD+MM  +SP+    FYLEILD GF P V  F
Sbjct: 177 GFVPDAIQCFRLVRKHKFKLPFQGCKYLLDRMMKISSPMVSLGFYLEILDYGFSPSVYNF 236

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           NIL++K C+   I+DA+++F+EIGKRG R T VSFNTLING+CKS NL E FRLK+ ME+
Sbjct: 237 NILMHKLCRVSQIKDAQMVFNEIGKRGLRATVVSFNTLINGYCKSGNLGEGFRLKRAMED 296

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           S I PDV+TYSVLI+GLCKE ++DDA  LF+EM  RGL  NDVTFT LIDGQC++GRID 
Sbjct: 297 SGIRPDVFTYSVLINGLCKESRLDDANGLFEEMCNRGLVPNDVTFTTLIDGQCKNGRIDL 356

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           AM TYQ++L+ G+KPDLVM+NTL+NGLCK GD+ +A+ L+ EM + G+KPDK TYT LID
Sbjct: 357 AMTTYQRILSKGLKPDLVMFNTLINGLCKAGDLKEAKNLIAEMSLRGLKPDKFTYTILID 416

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G+CKEG++E A+EIR  M + G+ LDNVAFTA+ISGLCR+GR++DAERTLREM  AGMKP
Sbjct: 417 GFCKEGNMELAIEIRDEMVKHGIELDNVAFTALISGLCREGRLIDAERTLREMLSAGMKP 476

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTMVIDG+CKNG+VK GFKLLKEMQ +GH PGVITYNVLMNGLCK GQMKNANMLL
Sbjct: 477 DDATYTMVIDGFCKNGNVKMGFKLLKEMQSDGHVPGVITYNVLMNGLCKLGQMKNANMLL 536

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGE 474
           +AM+ LGV PDDITYNILL+GHCK G  +DF  L++E G+V DYA Y SL+ +
Sbjct: 537 DAMIGLGVVPDDITYNILLDGHCKKGNPKDFNRLKSEMGLVADYASYKSLISQ 589

BLAST of Cp4.1LG03g05680 vs. TrEMBL
Match: W9RWL8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004954 PE=4 SV=1)

HSP 1 Score: 666.4 bits (1718), Expect = 2.7e-188
Identity = 330/490 (67.35%), Postives = 400/490 (81.63%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNF--VFDALMIAYS 60
           MA+FL AH+M  ++QS++ FLVSRKGK SAAS+FAA+ E T  R +N   +FD L+IAY+
Sbjct: 132 MAHFLRAHRMGPDAQSLLNFLVSRKGKHSAASLFAAV-EATGGRYNNAHPLFDYLIIAYT 191

Query: 61  DSGFISDAIQCFRL-VRKRNFQ--IPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPP 120
           D GF+SDAIQCFRL VRKRN     PFRGC YL DKM+ SNSP  +W F+ EILD G+PP
Sbjct: 192 DCGFLSDAIQCFRLLVRKRNLNSHFPFRGCGYLFDKMLKSNSPWAVWLFFSEILDYGYPP 251

Query: 121 KVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLK 180
           KV  FN+L+N+FCK+G I  A+++FDEI KRG RP+ VSFNTLING+CKS NL+E FRLK
Sbjct: 252 KVYNFNVLMNRFCKEGRIEGAQMVFDEITKRGLRPSVVSFNTLINGYCKSGNLEEGFRLK 311

Query: 181 KVMEESRIY-PDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCR 240
           +VMEESR++ PDV+TYS LI GLCKE K D+A +LFDEM +RGL  N VT TAL+DGQC+
Sbjct: 312 RVMEESRMWAPDVFTYSALISGLCKECKSDNARKLFDEMCERGLVPNGVTVTALLDGQCK 371

Query: 241 SGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKIT 300
           SGR+D AM  Y  ML  G++PDLVMYN L+NGLC++GD+S+ARKLVDEM++ G+KPDKIT
Sbjct: 372 SGRVDKAMEMYWMMLKKGIEPDLVMYNALVNGLCRIGDLSEARKLVDEMRIRGLKPDKIT 431

Query: 301 YTTLIDGYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMK 360
           YTTLIDG  KEGD+E AMEI+K M +EGV LD+VAFTA++SGLCR+GR++DAERT+REM 
Sbjct: 432 YTTLIDGCFKEGDMELAMEIKKAMVKEGVELDSVAFTALVSGLCREGRIVDAERTMREML 491

Query: 361 EAGMKPDDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMK 420
            AGMKPDDATYTMVID +CK+GDVK GFKLL EMQR+   P + TYN LMNGLCK+GQM+
Sbjct: 492 SAGMKPDDATYTMVIDAFCKSGDVKMGFKLLNEMQRDSRVPNIATYNALMNGLCKKGQMR 551

Query: 421 NANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEY 480
           NA+MLL+AMLN+GV PDDITYNILLEGHCK G   DF  LR E+G+V DYA Y SLVG+ 
Sbjct: 552 NADMLLDAMLNVGVIPDDITYNILLEGHCKHGNRADFDKLRRERGLVSDYASYASLVGKS 611

Query: 481 DKSLKDRRKR 485
            KS KDR+KR
Sbjct: 612 SKSSKDRQKR 620

BLAST of Cp4.1LG03g05680 vs. TrEMBL
Match: U5FHA0_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0017s02740g PE=4 SV=1)

HSP 1 Score: 663.3 bits (1710), Expect = 2.3e-187
Identity = 317/484 (65.50%), Postives = 392/484 (80.99%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           M +FL AH+M ++++S++ F+VSRKGK SA+S+FA+ILE   T  S+FVFDALM  Y++ 
Sbjct: 116 MVHFLIAHRMNQQAESLLHFVVSRKGKGSASSVFASILETKGTLSSSFVFDALMSVYTEF 175

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           G++SDAIQCFRL +K N +IPF GC+ LL++M+  +SP+    FYLEILDSG+PP V  F
Sbjct: 176 GYVSDAIQCFRLTKKHNLKIPFNGCKCLLERMIKMSSPMVALEFYLEILDSGYPPNVYTF 235

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           N+L+N+ CK+G ++DA+LIFDEI K G +PTAVSFNTLING+CKS NL+E FRLK VMEE
Sbjct: 236 NVLMNRLCKEGKVKDAQLIFDEIRKTGLQPTAVSFNTLINGYCKSGNLEEGFRLKMVMEE 295

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
            R++PDV+TYS LI GLCKE ++DDA  LF EM  RGL  NDVTFT LI+GQC++GR+D 
Sbjct: 296 FRVFPDVFTYSALIDGLCKECQLDDANHLFKEMCDRGLVPNDVTFTTLINGQCKNGRVDL 355

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           A+  YQQM   G+K DLV+YNTL++GLCK G   +ARK V EM   G+ PDK TYTTL+D
Sbjct: 356 ALEIYQQMFTKGLKADLVLYNTLIDGLCKGGYFREARKFVGEMTKRGLIPDKFTYTTLLD 415

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G CKEGDLE A+E+RK M +EG+ LDNVAFTAIISGLCRDG+++DAERTLREM  AG+KP
Sbjct: 416 GSCKEGDLELALEMRKEMVKEGIQLDNVAFTAIISGLCRDGKIVDAERTLREMLRAGLKP 475

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DD TYTMV+DG+CK GDVK GFKLLKEMQ +GH PGVITYNVLMNGLCKQGQ+KNA+MLL
Sbjct: 476 DDGTYTMVMDGFCKKGDVKMGFKLLKEMQSDGHIPGVITYNVLMNGLCKQGQVKNADMLL 535

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
            AMLNLGV PDDITYNILL+GHCK G+  DF +++ E G+V DYA Y SL+ E  K+ KD
Sbjct: 536 NAMLNLGVVPDDITYNILLQGHCKHGKLGDFQNVKTEMGLVSDYASYRSLLHELTKASKD 595

Query: 481 RRKR 485
           R+KR
Sbjct: 596 RQKR 599

BLAST of Cp4.1LG03g05680 vs. TAIR10
Match: AT1G09680.1 (AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 607.4 bits (1565), Expect = 7.4e-174
Identity = 290/484 (59.92%), Postives = 367/484 (75.83%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           +A FL  H+MF E+QS++  +VSRKGK+SA+S+F +++E+  T    F+ DALMI Y+D 
Sbjct: 124 LARFLAVHEMFTEAQSLIELVVSRKGKNSASSVFISLVEMRVTPMCGFLVDALMITYTDL 183

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GFI DAIQCFRL RK  F +P RGC  LLD+MM  N   TIW FY+EILD+GFP  V  F
Sbjct: 184 GFIPDAIQCFRLSRKHRFDVPIRGCGNLLDRMMKLNPTGTIWGFYMEILDAGFPLNVYVF 243

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           NIL+NKFCK+G+I DA+ +FDEI KR  +PT VSFNTLING+CK  NLDE FRLK  ME+
Sbjct: 244 NILMNKFCKEGNISDAQKVFDEITKRSLQPTVVSFNTLINGYCKVGNLDEGFRLKHQMEK 303

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           SR  PDV+TYS LI+ LCKE K+D A  LFDEM +RGL  NDV FT LI G  R+G ID 
Sbjct: 304 SRTRPDVFTYSALINALCKENKMDGAHGLFDEMCKRGLIPNDVIFTTLIHGHSRNGEIDL 363

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
              +YQ+ML+ G++PD+V+YNTL+NG CK GD+  AR +VD M   G++PDKITYTTLID
Sbjct: 364 MKESYQKMLSKGLQPDIVLYNTLVNGFCKNGDLVAARNIVDGMIRRGLRPDKITYTTLID 423

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G+C+ GD+E+A+EIRK M++ G+ LD V F+A++ G+C++GRV+DAER LREM  AG+KP
Sbjct: 424 GFCRGGDVETALEIRKEMDQNGIELDRVGFSALVCGMCKEGRVIDAERALREMLRAGIKP 483

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DD TYTM++D +CK GD +TGFKLLKEMQ +GH P V+TYNVL+NGLCK GQMKNA+MLL
Sbjct: 484 DDVTYTMMMDAFCKKGDAQTGFKLLKEMQSDGHVPSVVTYNVLLNGLCKLGQMKNADMLL 543

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           +AMLN+GV PDDITYN LLEGH +   +      + E G+V D A Y S+V E D++ KD
Sbjct: 544 DAMLNIGVVPDDITYNTLLEGHHRHANSSKRYIQKPEIGIVADLASYKSIVNELDRASKD 603

Query: 481 RRKR 485
            R R
Sbjct: 604 HRNR 607

BLAST of Cp4.1LG03g05680 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 305.8 bits (782), Expect = 4.6e-83
Identity = 157/488 (32.17%), Postives = 275/488 (56.35%), Query Frame = 1

Query: 3   NFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRC--SNFVFDALMIAYSDS 62
           + L   ++++ +Q +   + ++   D  AS+    L+ T   C  ++ VFD ++ +YS  
Sbjct: 88  HILTKFKLYKTAQILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRL 147

Query: 63  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIW-TFYLEILDSGFPPKVKY 122
             I  A+    L +   F         +LD  + S   ++     + E+L+S   P V  
Sbjct: 148 SLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFT 207

Query: 123 FNILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVME 182
           +NILI  FC  G+I  A  +FD++  +G  P  V++NTLI+G+CK R +D+ F+L + M 
Sbjct: 208 YNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMA 267

Query: 183 ESRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRID 242
              + P++ +Y+V+I+GLC+EG++ +   +  EM +RG   ++VT+  LI G C+ G   
Sbjct: 268 LKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFH 327

Query: 243 SAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLI 302
            A+  + +ML  G+ P ++ Y +L++ +CK G++++A + +D+M++ G+ P++ TYTTL+
Sbjct: 328 QALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLV 387

Query: 303 DGYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMK 362
           DG+ ++G +  A  + + MN+ G     V + A+I+G C  G++ DA   L +MKE G+ 
Sbjct: 388 DGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLS 447

Query: 363 PDDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANML 422
           PD  +Y+ V+ G+C++ DV    ++ +EM   G  P  ITY+ L+ G C+Q + K A  L
Sbjct: 448 PDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDL 507

Query: 423 LEAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRN---EKGVVVDYAYYTSLVGEYDK 482
            E ML +G+ PD+ TY  L+  +C  G  E  L L N   EKGV+ D   Y+ L+   +K
Sbjct: 508 YEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNK 567

Query: 483 SLKDRRKR 485
             + R  +
Sbjct: 568 QSRTREAK 575

BLAST of Cp4.1LG03g05680 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 276.9 bits (707), Expect = 2.3e-74
Identity = 148/479 (30.90%), Postives = 254/479 (53.03%), Query Frame = 1

Query: 3   NFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSN-FVFDALMIAYSDSG 62
           + L   +M++ ++ I++ L    GK S   +F A++       SN  V+D L+  Y   G
Sbjct: 120 HILVRARMYDPARHILKELSLMSGKSSF--VFGALMTTYRLCNSNPSVYDILIRVYLREG 179

Query: 63  FISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFN 122
            I D+++ FRL+    F      C  +L  ++ S   V++W+F  E+L     P V  FN
Sbjct: 180 MIQDSLEIFRLMGLYGFNPSVYTCNAILGSVVKSGEDVSVWSFLKEMLKRKICPDVATFN 239

Query: 123 ILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEES 182
           ILIN  C +GS   +  +  ++ K G+ PT V++NT+++ +CK         L   M+  
Sbjct: 240 ILINVLCAEGSFEKSSYLMQKMEKSGYAPTIVTYNTVLHWYCKKGRFKAAIELLDHMKSK 299

Query: 183 RIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSA 242
            +  DV TY++LIH LC+  ++     L  +MR+R +  N+VT+  LI+G    G++  A
Sbjct: 300 GVDADVCTYNMLIHDLCRSNRIAKGYLLLRDMRKRMIHPNEVTYNTLINGFSNEGKVLIA 359

Query: 243 MNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDG 302
                +ML+ G+ P+ V +N L++G    G+  +A K+   M+  G+ P +++Y  L+DG
Sbjct: 360 SQLLNEMLSFGLSPNHVTFNALIDGHISEGNFKEALKMFYMMEAKGLTPSEVSYGVLLDG 419

Query: 303 YCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPD 362
            CK  + + A      M   GV +  + +T +I GLC++G + +A   L EM + G+ PD
Sbjct: 420 LCKNAEFDLARGFYMRMKRNGVCVGRITYTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPD 479

Query: 363 DATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLLE 422
             TY+ +I+G+CK G  KT  +++  + R G +P  I Y+ L+   C+ G +K A  + E
Sbjct: 480 IVTYSALINGFCKVGRFKTAKEIVCRIYRVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYE 539

Query: 423 AMLNLGVTPDDITYNILLEGHCKSGR---AEDFLHLRNEKGVVVDYAYYTSLVGEYDKS 478
           AM+  G T D  T+N+L+   CK+G+   AE+F+      G++ +   +  L+  Y  S
Sbjct: 540 AMILEGHTRDHFTFNVLVTSLCKAGKVAEAEEFMRCMTSDGILPNTVSFDCLINGYGNS 596

BLAST of Cp4.1LG03g05680 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 274.2 bits (700), Expect = 1.5e-73
Identity = 137/430 (31.86%), Postives = 237/430 (55.12%), Query Frame = 1

Query: 49  VFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNS-NSPVTIWTFYLE 108
           VFD       D G + +A + F  +      +    C   L ++        T    + E
Sbjct: 177 VFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTRLSKDCYKTATAIIVFRE 236

Query: 109 ILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRN 168
             + G    V  +NI+I+  C+ G I++A  +   +  +G+ P  +S++T++NG+C+   
Sbjct: 237 FPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGE 296

Query: 169 LDECFRLKKVMEESRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTA 228
           LD+ ++L +VM+   + P+ Y Y  +I  LC+  K+ +AE+ F EM ++G+  + V +T 
Sbjct: 297 LDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTT 356

Query: 229 LIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVG 288
           LIDG C+ G I +A   + +M +  + PD++ Y  +++G C++GD+ +A KL  EM   G
Sbjct: 357 LIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKG 416

Query: 289 MKPDKITYTTLIDGYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAE 348
           ++PD +T+T LI+GYCK G ++ A  +   M + G   + V +T +I GLC++G +  A 
Sbjct: 417 LEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSAN 476

Query: 349 RTLREMKEAGMKPDDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGL 408
             L EM + G++P+  TY  +++G CK+G+++   KL+ E +  G N   +TY  LM+  
Sbjct: 477 ELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAY 536

Query: 409 CKQGQMKNANMLLEAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRN---EKGVVVDY 468
           CK G+M  A  +L+ ML  G+ P  +T+N+L+ G C  G  ED   L N    KG+  + 
Sbjct: 537 CKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLNWMLAKGIAPNA 596

Query: 469 AYYTSLVGEY 475
             + SLV +Y
Sbjct: 597 TTFNSLVKQY 606

BLAST of Cp4.1LG03g05680 vs. TAIR10
Match: AT1G09820.1 (AT1G09820.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 260.0 bits (663), Expect = 2.9e-69
Identity = 138/452 (30.53%), Postives = 248/452 (54.87%), Query Frame = 1

Query: 24  RKGKDSAA-SIFAAILEITDTRCSNFVFDALMIAYSDSGFISDAIQCFRLVRKRNFQIPF 83
           R G D    SIF AI    +   ++ + D L++AY+++       + F+      +++  
Sbjct: 129 RNGSDHQVHSIFHAISMCDNVCVNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSA 188

Query: 84  RGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYFNILINKFCKQGSIRDARLIFDE 143
             C+ L+  ++  N    +   Y E++     P V  FN++IN  CK G +  AR + ++
Sbjct: 189 LSCKPLMIALLKENRSADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMED 248

Query: 144 IGKRGFRPTAVSFNTLINGFCKSRNLDECFR---LKKVMEESRIYPDVYTYSVLIHGLCK 203
           +   G  P  VS+NTLI+G+CK     + ++   + K M E+ + P++ T+++LI G  K
Sbjct: 249 MKVYGCSPNVVSYNTLIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWK 308

Query: 204 EGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDSAMNTYQQMLAMGVKPDLVM 263
           +  +  + ++F EM  + ++ N +++ +LI+G C  G+I  A++   +M++ GV+P+L+ 
Sbjct: 309 DDNLPGSMKVFKEMLDQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLIT 368

Query: 264 YNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLIDGYCKEGDLESAMEIRKGMN 323
           YN L+NG CK   + +A  +   +K  G  P    Y  LID YCK G ++    +++ M 
Sbjct: 369 YNALINGFCKNDMLKEALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEME 428

Query: 324 EEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKPDDATYTMVIDGYCKNGDVK 383
            EG+V D   +  +I+GLCR+G +  A++   ++   G+ PD  T+ ++++GYC+ G+ +
Sbjct: 429 REGIVPDVGTYNCLIAGLCRNGNIEAAKKLFDQLTSKGL-PDLVTFHILMEGYCRKGESR 488

Query: 384 TGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNA-NMLLEAMLNLGVTPDDITYNIL 443
               LLKEM + G  P  +TYN++M G CK+G +K A NM  +      +  +  +YN+L
Sbjct: 489 KAAMLLKEMSKMGLKPRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVL 548

Query: 444 LEGHCKSGRAEDFLHLRN---EKGVVVDYAYY 468
           L+G+ + G+ ED   L N   EKG+V +   Y
Sbjct: 549 LQGYSQKGKLEDANMLLNEMLEKGLVPNRITY 579

BLAST of Cp4.1LG03g05680 vs. NCBI nr
Match: gi|659114573|ref|XP_008457121.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucumis melo])

HSP 1 Score: 869.4 bits (2245), Expect = 3.0e-249
Identity = 421/484 (86.98%), Postives = 451/484 (93.18%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           MANFL AHQMFEE QSI+RFLVSRKGKDSAAS+FAAIL+I  TRCSNFVFDALMIAY DS
Sbjct: 107 MANFLSAHQMFEECQSIIRFLVSRKGKDSAASVFAAILDIAGTRCSNFVFDALMIAYWDS 166

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GF+SDAIQCFRLVRK NFQIPF GC YLLDKMMNSNSPVTIWTFY EILDSGFPP VKYF
Sbjct: 167 GFVSDAIQCFRLVRKSNFQIPFHGCGYLLDKMMNSNSPVTIWTFYSEILDSGFPPNVKYF 226

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           NILINKFCK+GS RDA+LIFDEI K G RPT VSFNTLING CKSRNLDE FRLKK+MEE
Sbjct: 227 NILINKFCKEGSTRDAKLIFDEIRKWGLRPTTVSFNTLINGLCKSRNLDEGFRLKKIMEE 286

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           +RIYPDV+TYSVLIH LCKEG++D+AEQLFDEM++RGLR N VTFTALIDGQC+  +IDS
Sbjct: 287 NRIYPDVFTYSVLIHRLCKEGRLDNAEQLFDEMQKRGLRPNGVTFTALIDGQCKRRQIDS 346

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           AMNTY QML MGVKPDLVMYNTLL GLCKVGDV+KARKL+DEM+MVGMKPDKI+YTTLID
Sbjct: 347 AMNTYHQMLTMGVKPDLVMYNTLLYGLCKVGDVNKARKLIDEMRMVGMKPDKISYTTLID 406

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           GYCKEGDLESA+EIRKGMNEEGVVLDNVAFTA+ISG CRDGRV DAERTLREM EAGMKP
Sbjct: 407 GYCKEGDLESALEIRKGMNEEGVVLDNVAFTALISGFCRDGRVRDAERTLREMMEAGMKP 466

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTMVIDGYCK GDVKTGFK+LKEMQ NGH PGVITYNVLMNGLCKQGQMKNA+MLL
Sbjct: 467 DDATYTMVIDGYCKKGDVKTGFKMLKEMQINGHKPGVITYNVLMNGLCKQGQMKNAHMLL 526

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           EAMLNLGVTPDDITYNILLEGHCK+G+AED L LRNEKG+++DYAYYTSLVGEYDKSLKD
Sbjct: 527 EAMLNLGVTPDDITYNILLEGHCKNGKAEDLLKLRNEKGLIIDYAYYTSLVGEYDKSLKD 586

Query: 481 RRKR 485
           R+KR
Sbjct: 587 RQKR 590

BLAST of Cp4.1LG03g05680 vs. NCBI nr
Match: gi|778679906|ref|XP_004140820.2| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucumis sativus])

HSP 1 Score: 867.8 bits (2241), Expect = 8.6e-249
Identity = 422/484 (87.19%), Postives = 450/484 (92.98%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           MANFL AHQMF+E QSI+RFLVSRKGKDSAAS+FAAIL+   TRCSNFVFDALMIAY DS
Sbjct: 121 MANFLSAHQMFQECQSIIRFLVSRKGKDSAASVFAAILDTAGTRCSNFVFDALMIAYWDS 180

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GF+SDAIQCFRLVR  NFQIPF GC YLLDKM+NSNSPVTIWTFY EIL+ GFPPKV+Y+
Sbjct: 181 GFVSDAIQCFRLVRNSNFQIPFHGCGYLLDKMINSNSPVTIWTFYSEILEYGFPPKVQYY 240

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           NILINKFCK+GSIRDA+LIF+EI KRG RPT VSFNTLING CKSRNLDE FRLKK MEE
Sbjct: 241 NILINKFCKEGSIRDAKLIFNEIRKRGLRPTTVSFNTLINGLCKSRNLDEGFRLKKTMEE 300

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           +RIYPDV+TYSVLIHGLCKEG++D AEQLFDEM+QRGLR N +TFTALIDGQCRS RIDS
Sbjct: 301 NRIYPDVFTYSVLIHGLCKEGRLDVAEQLFDEMQQRGLRPNGITFTALIDGQCRSRRIDS 360

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           AMNTY QML MGVKPDLVMYNTLLNGLCKVGDV+KARKLVDEM+MVGMKPDKITYTTLID
Sbjct: 361 AMNTYHQMLTMGVKPDLVMYNTLLNGLCKVGDVNKARKLVDEMRMVGMKPDKITYTTLID 420

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTA+ISG CRDGRV DAERTLREM EAGMKP
Sbjct: 421 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTALISGFCRDGRVRDAERTLREMVEAGMKP 480

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTMVIDGYCK G+VK GFKLLKEMQ NGH PGVITYNVLMNGLCKQGQMKNANMLL
Sbjct: 481 DDATYTMVIDGYCKKGNVKMGFKLLKEMQINGHKPGVITYNVLMNGLCKQGQMKNANMLL 540

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           EAMLNLGVTPDDITYNILLEGHCK+G+AED L LRNEKG++VDYAYYTSLV EY+KSLKD
Sbjct: 541 EAMLNLGVTPDDITYNILLEGHCKNGKAEDLLKLRNEKGLIVDYAYYTSLVSEYNKSLKD 600

Query: 481 RRKR 485
           R+KR
Sbjct: 601 RQKR 604

BLAST of Cp4.1LG03g05680 vs. NCBI nr
Match: gi|645246622|ref|XP_008229438.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Prunus mume])

HSP 1 Score: 729.2 bits (1881), Expect = 4.8e-207
Identity = 344/484 (71.07%), Postives = 413/484 (85.33%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           MA+FLCAHQM+ ++QS++R +VSRKGK++A+S+FA++LE   T  SN+VFDALM AY D 
Sbjct: 117 MAHFLCAHQMYPQAQSLLRIVVSRKGKETASSVFASVLETRGTHQSNYVFDALMNAYVDC 176

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GF+SDA QCFRL+RK NF+IPF  C  LLDKM+  NSPV  W FYLEILDSGFPPKV  F
Sbjct: 177 GFVSDACQCFRLLRKHNFRIPFHACGCLLDKMLKLNSPVVAWGFYLEILDSGFPPKVYNF 236

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           N+L++K CK+G IR+A+L+FDEIGKRG  PT VSFNTLING+CKSRNL+ECFRLK+ MEE
Sbjct: 237 NVLMHKLCKEGEIREAQLVFDEIGKRGLLPTVVSFNTLINGYCKSRNLEECFRLKRDMEE 296

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           SR  PDV+TYSVLI+GLCKE ++DDA  LFDEM +RGL  N+VT+T LIDGQC++GRID 
Sbjct: 297 SRTRPDVFTYSVLINGLCKELRLDDANLLFDEMCERGLVPNNVTYTTLIDGQCKNGRIDL 356

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           AM  YQ+ML +G+KPDL+ YNTL+NGLCKVGD+ + RKLV+EM + G+KPD ITYTTLID
Sbjct: 357 AMEVYQKMLGIGIKPDLITYNTLINGLCKVGDLKETRKLVEEMNIEGLKPDTITYTTLID 416

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G CKEGDL+SA+EIRKGM ++G+ LDNVAFTA+ISGLCR+G+ +DAERTLREM  +GMKP
Sbjct: 417 GCCKEGDLQSALEIRKGMIKQGIELDNVAFTALISGLCREGKTLDAERTLREMLNSGMKP 476

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTM+IDG+CK GDVK GFKLLKEMQ +G+ P V+TYN LMNGLCK GQMKNANMLL
Sbjct: 477 DDATYTMIIDGFCKKGDVKMGFKLLKEMQGDGYVPSVVTYNALMNGLCKLGQMKNANMLL 536

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           +AM+NLGV PDDITYNILLEGHCK G  EDF  LR+ KG+V+DYA YTSLV E++KS KD
Sbjct: 537 DAMINLGVAPDDITYNILLEGHCKHGNPEDFDKLRSGKGLVLDYASYTSLVNEFNKSSKD 596

Query: 481 RRKR 485
           RRKR
Sbjct: 597 RRKR 600

BLAST of Cp4.1LG03g05680 vs. NCBI nr
Match: gi|595968678|ref|XP_007217406.1| (hypothetical protein PRUPE_ppa024153mg [Prunus persica])

HSP 1 Score: 728.8 bits (1880), Expect = 6.3e-207
Identity = 344/484 (71.07%), Postives = 414/484 (85.54%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           MA+FLCAHQM+ ++QS++R +VSRKGK++A+S+FA+ILE   T  SN+VFDALM AY D 
Sbjct: 122 MAHFLCAHQMYPQAQSLLRIVVSRKGKETASSVFASILETRGTHQSNYVFDALMNAYVDC 181

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
           GF+SDA QCFRL+RK NF+IPF  C  LLDKM+  NSPV  W FYLEILDSGFPPKV  F
Sbjct: 182 GFVSDACQCFRLLRKHNFRIPFHACGCLLDKMLKLNSPVVAWGFYLEILDSGFPPKVYNF 241

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           N+L++K CK+G IR+A+L+FDEIGKRG  PT VSFNTLING+CKSRNL+ECFRLK+ MEE
Sbjct: 242 NVLMHKLCKEGEIREAQLVFDEIGKRGLLPTVVSFNTLINGYCKSRNLEECFRLKRDMEE 301

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           SR  PDV+TYSVLI+GLCKE ++DDA  LFDEM +RGL  N+VT+T LIDGQC++GRID 
Sbjct: 302 SRTRPDVFTYSVLINGLCKELRLDDANLLFDEMCERGLVPNNVTYTTLIDGQCKNGRIDL 361

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           AM  YQ+ML +G+KPD++ YNTL+NGLCKVGD+ +ARKLV+EM + G+KPD ITYTTLID
Sbjct: 362 AMEVYQKMLGIGIKPDVITYNTLINGLCKVGDLKEARKLVEEMNIAGLKPDTITYTTLID 421

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           G CKEG+L+SA+EIRKGM ++G+ LDNVAFTA+ISGLCR+G+ +DAERTLREM  +GMKP
Sbjct: 422 GCCKEGNLQSALEIRKGMIKQGIELDNVAFTALISGLCREGKTLDAERTLREMLNSGMKP 481

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTM+IDG+CK GDVK GFKLLKEMQ +G+ P V+TYN LMNGLCK GQMKNANMLL
Sbjct: 482 DDATYTMIIDGFCKKGDVKMGFKLLKEMQGDGYVPSVVTYNALMNGLCKLGQMKNANMLL 541

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           +AM+NLGV PDDITYNILLEGHCK G  EDF  LR+ KG+V+DYA YTSLV E++KS KD
Sbjct: 542 DAMINLGVAPDDITYNILLEGHCKHGNPEDFDKLRSGKGLVLDYASYTSLVSEFNKSSKD 601

Query: 481 RRKR 485
           RRKR
Sbjct: 602 RRKR 605

BLAST of Cp4.1LG03g05680 vs. NCBI nr
Match: gi|1000940553|ref|XP_015582938.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Ricinus communis])

HSP 1 Score: 705.7 bits (1820), Expect = 5.7e-200
Identity = 338/483 (69.98%), Postives = 405/483 (83.85%), Query Frame = 1

Query: 1   MANFLCAHQMFEESQSIVRFLVSRKGKDSAASIFAAILEITDTRCSNFVFDALMIAYSDS 60
           M +FL AHQM  ++QS++ F++SRKGKDS+ S+FA+ILE      S+FVFD LM AY+D 
Sbjct: 115 MVHFLSAHQMHSQAQSLLHFIISRKGKDSSFSVFASILETKGNHFSDFVFDGLMNAYTDL 174

Query: 61  GFISDAIQCFRLVRKRNFQIPFRGCEYLLDKMMNSNSPVTIWTFYLEILDSGFPPKVKYF 120
            F+SDAIQCFRLVRK NF+IPF GC+YLLD+M+ ++SP+    FYLEILDSG+PP VK F
Sbjct: 175 EFLSDAIQCFRLVRKHNFKIPFHGCKYLLDRMIRNSSPILARRFYLEILDSGYPPNVKSF 234

Query: 121 NILINKFCKQGSIRDARLIFDEIGKRGFRPTAVSFNTLINGFCKSRNLDECFRLKKVMEE 180
           N+LI++FCK+G I DA +IFDEIGK G +PT VSFNTLING+CK  NLDE FRLKKVMEE
Sbjct: 235 NMLISRFCKEGKINDAHMIFDEIGKWGLQPTVVSFNTLINGYCKLGNLDEGFRLKKVMEE 294

Query: 181 SRIYPDVYTYSVLIHGLCKEGKVDDAEQLFDEMRQRGLRANDVTFTALIDGQCRSGRIDS 240
           SRI+PDV+TYSVLI+GLCK+ +++DA QLFDEM ++GL  NDVTFT LI+GQC++GRID 
Sbjct: 295 SRIFPDVFTYSVLINGLCKDRRLNDANQLFDEMCEKGLVPNDVTFTTLINGQCKNGRIDL 354

Query: 241 AMNTYQQMLAMGVKPDLVMYNTLLNGLCKVGDVSKARKLVDEMKMVGMKPDKITYTTLID 300
           A+  YQQML  G+KPDL++YNTL+NGLCKVGD+ +ARKL DEM     KPDKITYTTLID
Sbjct: 355 AVEMYQQMLRKGLKPDLILYNTLINGLCKVGDMREARKLADEMNERCQKPDKITYTTLID 414

Query: 301 GYCKEGDLESAMEIRKGMNEEGVVLDNVAFTAIISGLCRDGRVMDAERTLREMKEAGMKP 360
           GYCKEGDLESA+EIR  M +EGV LD VAFTAIISGLC++G+V+DAER LREM +AG KP
Sbjct: 415 GYCKEGDLESALEIRNIMIKEGVELDIVAFTAIISGLCKEGKVIDAERALREMLKAGFKP 474

Query: 361 DDATYTMVIDGYCKNGDVKTGFKLLKEMQRNGHNPGVITYNVLMNGLCKQGQMKNANMLL 420
           DDATYTMV+DG+CK GD+KTGFKLLKEMQ +GH PGV+TYNVLMNG CKQ QMKNANMLL
Sbjct: 475 DDATYTMVMDGFCKKGDMKTGFKLLKEMQSDGHVPGVVTYNVLMNGYCKQSQMKNANMLL 534

Query: 421 EAMLNLGVTPDDITYNILLEGHCKSGRAEDFLHLRNEKGVVVDYAYYTSLVGEYDKSLKD 480
           +AM+NLGV PDDITYNILLEGHCK G  +DF  L++EKGVV DYA Y SL+ E  +S KD
Sbjct: 535 DAMMNLGVVPDDITYNILLEGHCKHGNLQDFHKLQSEKGVVADYASYKSLLNELSRSSKD 594

Query: 481 RRK 484
           R+K
Sbjct: 595 RQK 597

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR26_ARATH1.3e-17259.92Putative pentatricopeptide repeat-containing protein At1g09680 OS=Arabidopsis th... [more]
PP407_ARATH8.2e-8232.17Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN... [more]
PP432_ARATH4.1e-7330.90Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN... [more]
PPR12_ARATH2.6e-7231.86Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
PPR27_ARATH5.2e-6830.53Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
M5XHR3_PRUPE4.4e-20771.07Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa024153mg PE=4 SV=1[more]
D7TSI7_VITVI3.6e-19367.36Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0006g00170 PE=4 SV=... [more]
A0A061EF62_THECC9.2e-18967.02Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_017452 PE... [more]
W9RWL8_9ROSA2.7e-18867.35Uncharacterized protein OS=Morus notabilis GN=L484_004954 PE=4 SV=1[more]
U5FHA0_POPTR2.3e-18765.50Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP... [more]
Match NameE-valueIdentityDescription
AT1G09680.17.4e-17459.92 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G39710.14.6e-8332.17 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G55840.12.3e-7430.90 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G05670.11.5e-7331.86 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G09820.12.9e-6930.53 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659114573|ref|XP_008457121.1|3.0e-24986.98PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucum... [more]
gi|778679906|ref|XP_004140820.2|8.6e-24987.19PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Cucum... [more]
gi|645246622|ref|XP_008229438.1|4.8e-20771.07PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Prunu... [more]
gi|595968678|ref|XP_007217406.1|6.3e-20771.07hypothetical protein PRUPE_ppa024153mg [Prunus persica][more]
gi|1000940553|ref|XP_015582938.1|5.7e-20069.98PREDICTED: putative pentatricopeptide repeat-containing protein At1g09680 [Ricin... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g05680.1Cp4.1LG03g05680.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 120..148
score: 7.2E-4coord: 49..78
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 216..248
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 395..444
score: 5.5E-17coord: 150..199
score: 2.2E-17coord: 255..304
score: 4.4E-19coord: 326..374
score: 4.1
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 433..455
score: 0.0024coord: 293..326
score: 9.0E-9coord: 223..256
score: 3.7E-8coord: 258..291
score: 2.2E-10coord: 364..396
score: 2.5E-10coord: 328..361
score: 5.7E-9coord: 188..221
score: 1.7E-11coord: 153..187
score: 1.7E-8coord: 120..151
score: 2.4E-6coord: 398..431
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 221..255
score: 12.485coord: 46..80
score: 7.892coord: 431..461
score: 7.015coord: 186..220
score: 14.82coord: 396..430
score: 11.904coord: 326..360
score: 12.759coord: 256..290
score: 13.329coord: 151..185
score: 11.203coord: 116..150
score: 10.731coord: 361..395
score: 13.088coord: 291..325
score: 12
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 49..77
score: 3.0E-5coord: 388..426
score: 3.0E-5coord: 114..283
score: 3.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..24
score: 1.5E-194coord: 42..469
score: 1.5E
NoneNo IPR availablePANTHERPTHR24015:SF824SUBFAMILY NOT NAMEDcoord: 1..24
score: 1.5E-194coord: 42..469
score: 1.5E