Cp4.1LG01g08640 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g08640
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (Zucchini))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG01 : 4912194 .. 4914186 (+)
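
The location field can be read programmatically. The sketch below is only an illustration: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state. Under that assumption the feature spans 1,993 bp on the plus strand.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01 : 4912194 .. 4914186 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention is worth confirming against the source database before relying on the number.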
The following sequences are available for this feature:

Gene sequence (with intron)

GCGCTCAAAACTGTCGAAACTCGATTTTATTTCCTTTTTTTCCTGAAATTGGAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAGAACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGCTTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAAGCCCTTGAGGTGAGCGCAATTGAATAACACACTCTAATTTCTGCTTGATCTTTTGCCATTTCCCGTTAAAGGAATGTTTGTATTGTACATGATGTCTGTTGGATGGAGTAAAATTCGTGTGTTTTTATTGAATGAATGATGTTGAGAGAGGATTCTCAAATCGATTGATTGATGAAAATATTGTGGACGAAGGCAACAAACTGGAATTTCGCGTTTTGAATTCTGAAGTAATCACATGTAGTCAAGTTAATTGTGGTTTTTGATTCTAGCAGTTGTAATGCATTAGTTGCAGTTTCACTAGGTTGATGTCCTAAATTGCTCTCTTCCCATTTGGTTTTTCATCAATCCTAGATCTTATTTTTGAAATATCTATTAGATTTCTGGATATTTGATCATTTGATCTTCTTTTGGATTATTCAAATTGAAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTTTCCTTTACAACTAGAGACTTTGCTGTACAGCTTGATCTGATCGGCCGAGTTCAGGGGCTCGATTCTGCAGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTAGATAAGGCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGTTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAGAGAAATGGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCGAGGACAAGGTCAACCAAGACGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGCAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTCAAGGAATGGGAGTCATCTTGCGAGTGTTATGATTTTCGAGTTCCGAATGTTCTTCTCATCGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATCCCACCACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGGTGGAGGCCCAAACCTAGCGTCTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGACTGTTCCTTCCATGGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTAAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAGATTAA

mRNA sequence

GCGCTCAAAACTGTCGAAACTCGATTTTATTTCCTTTTTTTCCTGAAATTGGAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAGAACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGCTTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAAGCCCTTGAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTTTCCTTTACAACTAGAGACTTTGCTGTACAGCTTGATCTGATCGGCCGAGTTCAGGGGCTCGATTCTGCAGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTAGATAAGGCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGTTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAGAGAAATGGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCGAGGACAAGGTCAACCAAGACGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGCAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTCAAGGAATGGGAGTCATCTTGCGAGTGTTATGATTTTCGAGTTCCGAATGTTCTTCTCATCGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATCCCACCACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGGTGGAGGCCCAAACCTAGCGTCTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGACTGTTCCTTCCATGGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTAAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAGATTAA

Coding sequence (CDS)

GCGCTCAAAACTGTCGAAACTCGATTTTATTTCCTTTTTTTCCTGAAATTGGAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAGAACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGCTTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAAGCCCTTGAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTTTCCTTTACAACTAGAGACTTTGCTGTACAGCTTGATCTGATCGGCCGAGTTCAGGGGCTCGATTCTGCAGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGCCTTGTAGATAAGGCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCTGAAATGAAGGAGAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGTTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAGAGAAATGGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCGAGGACAAGGTCAACCAAGACGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGCAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTCAAGGAATGGGAGTCATCTTGCGAGTGTTATGATTTTCGAGTTCCGAATGTTCTTCTCATCGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATCCCACCACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGGTGGAGGCCCAAACCTAGCGTCTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGACTGTTCCTTCCATGGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTAAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAGATTAA

Protein sequence

ALKTVETRFYFLFFLKLEVSGAPALAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKNNDETTADALKKSQPCLAQVD
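
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch that assumes the standard genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative, not part of any database tool. The two fragments used in the example are the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("GCGCTCAAAACTGTCGAAACTCGATTTTAT"))  # start of the CDS
print(translate("GCTCAGGTAGATTAA"))                 # end of the CDS, including TAA
```

The first call yields the opening residues of the protein sequence (ALKTVETRFY) and the second yields its final residues (AQVD), with the terminal TAA read as a stop codon.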
BLAST of Cp4.1LG01g08640 vs. Swiss-Prot
Match: PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 2.6e-118
Identity = 215/482 (44.61%), Positives = 315/482 (65.35%), Query Frame = 1

Query: 56  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNC 115
           +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   E+ RIV DLR  
Sbjct: 12  IASRYYYTNRV-KKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRR 71

Query: 116 RRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYG 175
           +R+  ALEVS+WM   G+  F+  + AV LDLIGRV G  +AE+YF ++  Q +  K YG
Sbjct: 72  KRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYG 131

Query: 176 ALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKEN 235
           ALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMKE 
Sbjct: 132 ALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEE 191

Query: 236 GVLPDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQ 295
            V PDNYSYRICI+++GA  DL  +   LR+ME +  I+MDW TY++ A F+I  G  ++
Sbjct: 192 NVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDR 251

Query: 296 AMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG 355
           A+  L+  E+++  +D  G+NHLI+LY  LG+K EV+RLW L+K  CK+++N+DY+T+L 
Sbjct: 252 AVELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 356 CLVKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIP 415
            LVK++ L EAE+++ EW+SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+  
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 416 PPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEE 475
            P SW ++A  Y EK   E AFKCMK A+ V+  ++ WRP  ++++S+L W+ + G  +E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 476 LKEFLSSLKTVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQ 520
           ++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILSTRS 491
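
The percentages in each HSP summary line are simply the ratio of matching (or conservatively substituted) positions to the alignment length. The following sketch recomputes them from the text of a summary line; the regular expression and the function name `hsp_percentages` are assumptions made for illustration and are not part of BLAST itself. The pattern accepts any label beginning with "Pos" so that it tolerates spelling variants of "Positives".

```python
import re

# Matches e.g. "Identity = 215/482 (44.61%), Positives = 315/482 (65.35%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w*\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (computed identity %, computed positives %, reported identity %, reported positives %)."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    identity = round(100 * int(id_n) / int(id_len), 2)
    positives = round(100 * int(pos_n) / int(pos_len), 2)
    return identity, positives, float(id_pct), float(pos_pct)

line = "Identity = 215/482 (44.61%), Positives = 315/482 (65.35%), Query Frame = 1"
print(hsp_percentages(line))
```

For the hit above, 215/482 and 315/482 reproduce the reported 44.61% and 65.35%.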

BLAST of Cp4.1LG01g08640 vs. Swiss-Prot
Match: PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 292.7 bits (748), Expect = 7.8e-78
Identity = 151/427 (35.36%), Positives = 249/427 (58.31%), Query Frame = 1

Query: 75  RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLF 134
           R++  G P  S++ +LD W+ +G ++K  E+  I++ LR   R+  AL++S+WM    + 
Sbjct: 43  RVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEHRVH 102

Query: 135 SFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHM 194
             +  D A++LDLI +V GL  AEK+F ++  +     LYGALLNCY  + ++ KA    
Sbjct: 103 EISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAEQVF 162

Query: 195 QKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGAR 254
           Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM++  V PD ++    + +Y   
Sbjct: 163 QEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAYSVV 222

Query: 255 SDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL-- 314
           SD+ GM K L   E+   + +DW TY+  AN +IKAG+ E+A+  LRK E  VN      
Sbjct: 223 SDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQKRKH 282

Query: 315 GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWE 374
            +  L+S Y + G+K+EV RLW+L K+     N  YI+++  L+K++ +EE EK+++EWE
Sbjct: 283 AYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEKIMEEWE 342

Query: 375 SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPE 434
           +    +D R+P++L+ GY ++G++E+AE+++  ++   R+   ++W  +A GY      E
Sbjct: 343 AGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAGKME 402

Query: 435 RAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLS 494
           +A +  K A+ V +   GWRP   VL S + +L      E L++ L  L    S  G +S
Sbjct: 403 KAVEKWKRAIEVSK--PGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL----SERGHIS 461

Query: 495 NAFDELL 500
             +D+LL
Sbjct: 463 --YDQLL 461

BLAST of Cp4.1LG01g08640 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 270.0 bits (689), Expect = 5.4e-71
Identity = 152/454 (33.48%), Positives = 252/454 (55.51%), Query Frame = 1

Query: 61  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQ 120
           Y  R       ++ +IS +  PEL    +L+QW + GR +  +E+ R+V++LR  +R  Q
Sbjct: 58  YERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQ 117

Query: 121 ALEVSEWMRSKG-LFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLN 180
           ALEV +WM ++G  F  +  D A+QLDLIG+V+G+  AE++F  +    +  ++YG+LLN
Sbjct: 118 ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLN 177

Query: 181 CYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLP 240
            YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK+  +  
Sbjct: 178 AYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRL 237

Query: 241 DNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSY 300
           D YSY I +SS G+   +  M  V ++M+S   I  +WTT+S +A  +IK G  E+A   
Sbjct: 238 DIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDA 297

Query: 301 LRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK 360
           LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Sbjct: 298 LRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVR 357

Query: 361 LEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNS 420
           +  +E AEK+ +EW      YD R+PN+L+  Y +   +E AE +  +++  G  P  ++
Sbjct: 358 MGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSST 417

Query: 421 WGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEF 480
           W I+A G+  K+    A  C++ A +  E +  WRPK  +LS   +   E       +  
Sbjct: 418 WEILAVGHTRKRCISEALTCLRNAFSA-EGSSNWRPKVLMLSGFFKLCEEESDVTSKEAV 477

Query: 481 LSSLKTVPSMDGKLSNAFDELLET-LKNNDETTA 511
           L  L+    ++ K   A  ++ E    NN E  A
Sbjct: 478 LELLRQSGDLEDKSYLALIDVDENRTVNNSEIDA 510

BLAST of Cp4.1LG01g08640 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 5.6e-68
Identity = 146/457 (31.95%), Positives = 249/457 (54.49%), Query Frame = 1

Query: 56  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR 115
           +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 116 IVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQ 175
            ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL+ + + + + E YF  +   
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPET 120

Query: 176 EEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPN 235
            +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP 
Sbjct: 121 SKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPA 180

Query: 236 VLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFF 295
           ++ E+K   V+PD+Y+Y + + +  A +D+ G+ +V+ EM     ++ DWTTYS +A+ +
Sbjct: 181 MIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIY 240

Query: 296 IKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN 355
           + AG+ ++A   L++ E K  Q D   +  LI+LY  LG+  EV R+W +L+    K  N
Sbjct: 241 VDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 300

Query: 356 RDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQN 415
             Y+ M+  LVKL  L  AE L KEW+++C  YD R+ NVL+  Y+Q GLI++A ++ + 
Sbjct: 301 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 360

Query: 416 IISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKG-WRPKPSVLSSILRW 475
               G      +W I    Y++  +  RA +CM +AV++ + + G W P P  + +++ +
Sbjct: 361 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 420

Query: 476 LSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET 502
             +       +  L  LK     D   +  F+ L+ T
Sbjct: 421 FEQKKDVNGAENLLEILKN--GTDNIGAEIFEPLIRT 454

BLAST of Cp4.1LG01g08640 vs. Swiss-Prot
Match: PPR4_ARATH (Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidopsis thaliana GN=At1g02370 PE=2 SV=1)

HSP 1 Score: 238.8 bits (608), Expect = 1.3e-61
Identity = 129/389 (33.16%), Positives = 220/389 (56.56%), Query Frame = 1

Query: 69  RRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWM 128
           +R L+ ++S L     +V   L+Q+I EG  ++  ++ R  + LR  RR   A E+ +WM
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 129 RSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVS-NQEEIGKLYGALLNCYVREGLV 188
             + + +F+  D A+ LDLIG+ +GL++AE YF+++  + +     YGAL+NCY  E   
Sbjct: 130 EKRKM-TFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEE 189

Query: 189 DKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRIC 248
           +KA +H + M E+ F ++ L +N++M +Y+   Q +KVP ++  MK+ G+ P   +Y I 
Sbjct: 190 EKAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIW 249

Query: 249 ISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV 308
           + S G+ +DL G+ K++ EM   +     W T+S +A  + KAG++E+A S L+  E+K+
Sbjct: 250 MQSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKM 309

Query: 309 NQDALGFNH-LISLYTSLGRKDEVMRLWALQKKCKKQVNR-DYITMLGCLVKLEFLEEAE 368
           N +    +H L+SLY  + +  EV R+W   KK + +VN   Y+ ML  + KL  L+  +
Sbjct: 310 NPNNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIK 369

Query: 369 KLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGY 428
           K+  EWES C  YD R+ N+ +  Y +  + E AEK+L   +   + P   +  ++    
Sbjct: 370 KIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHL 429

Query: 429 LEKQNPERAFKCMKEAVAVQEQNK---GW 452
           LE    + A K ++ AV+   +NK   GW
Sbjct: 430 LENDKADLAMKHLEAAVSDSAENKDEWGW 457

BLAST of Cp4.1LG01g08640 vs. TrEMBL
Match: A0A0A0L7Y2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G104890 PE=4 SV=1)

HSP 1 Score: 816.6 bits (2108), Expect = 1.7e-233
Identity = 398/490 (81.22%), Positives = 444/490 (90.61%), Query Frame = 1

Query: 25  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 84
           +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+C+RRNL+ARISPLG PE 
Sbjct: 1   MAAASAMFKILSRSSSGCTRTLRPETDAFCFVALRLYSTRRSCDRRNLYARISPLGDPEC 60

Query: 85  SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQ 144
           +VVP+L+QWI+EGR IKDFE+RRIVRDLR CRRY QALEVSEWM SKGLFS TTRDFA+Q
Sbjct: 61  TVVPVLNQWIEEGRNIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGLFSLTTRDFAIQ 120

Query: 145 LDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 204
           LDLIG+V+GLDSAEKYF SVSNQ+EIGKLYGALLNCYVREGL+DK+L+HMQKMKEMG AS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSNQKEIGKLYGALLNCYVREGLIDKSLAHMQKMKEMGLAS 180

Query: 205 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVL 264
           SPLCYNDIMCLYLNTGQ DKVPNVLSEMKENGVLPDN+SYRICISSYGARSD+I M  VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 265 REMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 324
           +EME QTHISMDWTTYSMVA FFIKAGMH++AM+YLRKCEDKV++DALGFNHLIS YT+L
Sbjct: 241 KEMEGQTHISMDWTTYSMVAGFFIKAGMHDKAMNYLRKCEDKVDEDALGFNHLISHYTNL 300

Query: 325 GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPN 384
           G K+EVMRLWAL KK KKQ+NRDYITMLG LVKLE LEEAE LV EWESSC+CYDFRVPN
Sbjct: 301 GHKNEVMRLWALLKKGKKQLNRDYITMLGSLVKLELLEEAENLVMEWESSCQCYDFRVPN 360

Query: 385 VLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAV 444
           V+LIGYSQ+GLIE+AEKML+NII +G IP PNSWGIIA+GYLEKQN E+AF+CMKEA+AV
Sbjct: 361 VVLIGYSQKGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALAV 420

Query: 445 QEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKN 504
           + QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLKTVPSMD KL+NA DELLE + N
Sbjct: 421 KGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNNALDELLEIMAN 480

Query: 505 NDETTADALK 515
           +D  + D L+
Sbjct: 481 DDGISKDELE 490

BLAST of Cp4.1LG01g08640 vs. TrEMBL
Match: B9SNN7_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1010390 PE=4 SV=1)

HSP 1 Score: 594.7 bits (1532), Expect = 1.1e-166
Identity = 289/452 (63.94%), Positives = 361/452 (79.87%), Query Frame = 1

Query: 42  FTRTARTET-DAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMI 101
           FT   RT++  A   +  R Y+  RT +   LFARISPLG P++S+VP+LD W+QEG+ I
Sbjct: 7   FTILKRTQSLTANAILTRRYYNKARTASN-TLFARISPLGEPDISLVPVLDNWVQEGKKI 66

Query: 102 KDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKY 161
           + FE+++I+RDLR  RRY QAL+VSEWM  KG   F+  D AVQLDLIGRV+GL+SAE Y
Sbjct: 67  RGFELQKIIRDLRCHRRYTQALQVSEWMNGKGQSGFSPADHAVQLDLIGRVRGLESAESY 126

Query: 162 FSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTG 221
           F ++ NQ+   K YGALLNCYVREGLVDK+L HMQKMKE+GFASSPL YND+MCLY  TG
Sbjct: 127 FQNLVNQDRNDKTYGALLNCYVREGLVDKSLYHMQKMKELGFASSPLNYNDLMCLYTRTG 186

Query: 222 QVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTY 281
           Q++KV +VLSEMKENG+ PD +SYRIC+SS  ARSDL G+ ++L EME+Q+HIS+DW TY
Sbjct: 187 QLEKVTDVLSEMKENGITPDLFSYRICMSSCAARSDLKGVEEILEEMENQSHISIDWVTY 246

Query: 282 SMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEVMRLWALQK-K 341
           S VA+ ++KA + E+A+ YL+KCE KVN+DALG+NHLISL  SLG KDEVMRLW L K K
Sbjct: 247 STVASIYVKASLKEKALIYLKKCEQKVNRDALGYNHLISLNASLGIKDEVMRLWGLVKTK 306

Query: 342 CKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERA 401
           CKKQVNRDYITMLG LVKLE LEEA+KL++EWESSC+CYDFRVPNVLLIGY Q+GLIE+A
Sbjct: 307 CKKQVNRDYITMLGALVKLEELEEADKLLQEWESSCQCYDFRVPNVLLIGYCQQGLIEKA 366

Query: 402 EKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLS 461
           E ML++I+   + P PNSW IIAAGY+ KQN E+AF CMKEA+ VQ +NKGWRPK +++S
Sbjct: 367 EAMLKDIVKKQKNPTPNSWAIIAAGYVNKQNMEKAFNCMKEALTVQAENKGWRPKANLIS 426

Query: 462 SILRWLSENGRYEELKEFLSSLKTVPSMDGKL 492
           SIL WL ENG  E+++ F++ L+T    D ++
Sbjct: 427 SILSWLGENGDVEDVEAFVNLLETKVPKDREI 457

BLAST of Cp4.1LG01g08640 vs. TrEMBL
Match: F6H257_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0014g02920 PE=4 SV=1)

HSP 1 Score: 584.3 bits (1505), Expect = 1.4e-163
Identity = 271/425 (63.76%), Positives = 355/425 (83.53%), Query Frame = 1

Query: 71  NLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRS 130
           NL++RISPLG+P LS+VP+LDQW++EG+ ++D E+ RI+RDLR+ +RY QALEVSEWM S
Sbjct: 38  NLYSRISPLGTPNLSLVPVLDQWVEEGKKVRDVELHRIIRDLRSRKRYAQALEVSEWMSS 97

Query: 131 KGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKA 190
           K L  F+    AVQLDLIG+V+GL+SAE YF+++S +E+I K+YGALLNCYVRE ++DK+
Sbjct: 98  KELCPFSPSARAVQLDLIGQVRGLESAENYFNNMSAEEKIDKMYGALLNCYVRERVIDKS 157

Query: 191 LSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISS 250
           LSH+QKMKE+GFAS+PL YN +MCLY+NT Q++K+P+VLSEM+ENG+ PDN+SYR+CI+S
Sbjct: 158 LSHLQKMKELGFASTPLPYNGLMCLYINTDQLEKIPDVLSEMQENGISPDNFSYRLCINS 217

Query: 251 YGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQD 310
           YGARSDL  M K+L EMES++HI +DW TYSMVANF+IKAG++E+A+ +L+K E K+++D
Sbjct: 218 YGARSDLNSMEKILEEMESKSHIHIDWMTYSMVANFYIKAGLNEKALFFLKKAETKLHKD 277

Query: 311 ALGFNHLISLYTSLGRKDEVMRLWALQKKC-KKQVNRDYITMLGCLVKLEFLEEAEKLVK 370
            LG+NHLISLY SLG K E+MRLW  +K   KK +NRDYITMLG LVKL  LE+ E L+K
Sbjct: 278 PLGYNHLISLYASLGSKAEMMRLWERRKTASKKLINRDYITMLGSLVKLGELEDTEALLK 337

Query: 371 EWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQ 430
           EWESS  CYDFRVPN LLIG+ Q+GLIE+AE ML++I+ +G+ P PNSW I+AAGY+EKQ
Sbjct: 338 EWESSGNCYDFRVPNTLLIGFCQKGLIEKAESMLRDIVEEGKTPTPNSWSIVAAGYIEKQ 397

Query: 431 NPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDG 490
           N E+AF+CMKEA+AV  +NKGWRPKP V+SSIL WL +N   EE++ F+S+LK V  MD 
Sbjct: 398 NMEKAFECMKEAIAVLAENKGWRPKPKVISSILSWLGDNRDVEEVETFVSALKAVIPMDR 457

Query: 491 KLSNA 495
           ++ +A
Sbjct: 458 EMYHA 462

BLAST of Cp4.1LG01g08640 vs. TrEMBL
Match: M5WGV8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004720mg PE=4 SV=1)

HSP 1 Score: 577.4 bits (1487), Expect = 1.8e-161
Identity = 283/468 (60.47%), Positives = 366/468 (78.21%), Query Frame = 1

Query: 28  ALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVV 87
           A A+FK+L+   +     +  + D           AR T N RNLF+RISPLG P LSVV
Sbjct: 2   AFAVFKLLKRHQNLAADVSPIKFDC---------RARHTANTRNLFSRISPLGDPSLSVV 61

Query: 88  PILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDL 147
           P+LDQW+QEG  +  FE++RIVRDLR  +RY  AL+VSEWM SKGL  F   D AVQLDL
Sbjct: 62  PVLDQWVQEGGKVNYFELQRIVRDLRARKRYRHALDVSEWMSSKGLCQFLPGDHAVQLDL 121

Query: 148 IGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL 207
           IGRV+GLD+AE  FSS+S+ E+  K YGALLNCYVREGL+DK+LS+MQKMKE+GFA+S L
Sbjct: 122 IGRVRGLDAAESCFSSLSD-EDTSKSYGALLNCYVREGLIDKSLSYMQKMKELGFATS-L 181

Query: 208 CYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLREM 267
            YNDIM LY++TGQ +K+P+VLSEMKE GV PDN+SYRIC+SSYG RSD+  M KVL EM
Sbjct: 182 NYNDIMRLYIHTGQPEKIPDVLSEMKEEGVSPDNFSYRICMSSYGMRSDISSMEKVLEEM 241

Query: 268 ESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRK 327
           E + HISMDW TY++VAN +IKAG+H++A+ YL K E+KVN+DALG+NHLISLY SLG K
Sbjct: 242 EREPHISMDWLTYALVANLYIKAGLHDKALIYLEKSEEKVNKDALGYNHLISLYASLGCK 301

Query: 328 DEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPNVL 387
           D++MRLW+L+K KCKKQ+NRDYITMLG LVKL  LEE +KL+ EWE SC  YDFRVPN+L
Sbjct: 302 DDMMRLWSLEKTKCKKQINRDYITMLGSLVKLGELEETKKLLDEWELSCLSYDFRVPNIL 361

Query: 388 LIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQE 447
           LIGY Q+GL+E+AE  L++I+  G+ P PNSW I+AAGY++KQ  ++AF+CM EA+ ++ 
Sbjct: 362 LIGYCQKGLVEQAEDTLRDIVKKGKTPTPNSWAILAAGYVDKQKMQKAFECMTEALNLRA 421

Query: 448 QNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNA 495
           +N GWRPKP V+SS+L W+ +NG  E+++ F+S +KTV +++ ++ +A
Sbjct: 422 RNTGWRPKPGVVSSVLSWIGDNGDIEQVEAFVSLMKTVITVNREMYHA 458

BLAST of Cp4.1LG01g08640 vs. TrEMBL
Match: V4TX32_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007647mg PE=4 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 4.8e-159
Identity = 270/427 (63.23%), Positives = 351/427 (82.20%), Query Frame = 1

Query: 59  RLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRY 118
           R Y A +   R NL++RISPLG P++S+ P+LDQW+ EG+ I + E++RI+R LR+ +R+
Sbjct: 26  RAYRAAKPVARNNLYSRISPLGDPDVSLTPVLDQWVLEGQKISELELQRIIRQLRSRKRF 85

Query: 119 GQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALL 178
             AL+VSEWM  +GL +F+ RD AVQLDLIG+V+GL+SAE YF+S+++++++ KLYGALL
Sbjct: 86  KHALQVSEWMSGQGL-AFSVRDHAVQLDLIGKVRGLESAETYFNSLNDEDKVDKLYGALL 145

Query: 179 NCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVL 238
           NCYVREGLVD++LS MQKMKEMG   S L YN IMCLY NTGQ +K+P+VL +MKENGV 
Sbjct: 146 NCYVREGLVDESLSLMQKMKEMGSFGSALNYNGIMCLYTNTGQHEKIPDVLLDMKENGVS 205

Query: 239 PDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMS 298
           PDN+SYRICI+SYGARS+L  M  VL+EMESQ+HISMDW TYS VAN++I AG+ E+A+ 
Sbjct: 206 PDNFSYRICINSYGARSELSSMENVLQEMESQSHISMDWGTYSTVANYYIIAGLKEKAII 265

Query: 299 YLRKCEDKV--NQDALGFNHLISLYTSLGRKDEVMRLWALQK-KCKKQVNRDYITMLGCL 358
           YL+KCED V  ++DALG+NHLIS Y SL  KDE+M+ W LQK KCKKQ+NRDYIT+LG L
Sbjct: 266 YLKKCEDIVSKSKDALGYNHLISHYASLRNKDEMMKFWGLQKIKCKKQLNRDYITILGSL 325

Query: 359 VKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPP 418
           VK+  LEEAEK+++EWESSC CYDFRVPN++L+GYSQ+G+IE+A+ +L+ I+  G+ P P
Sbjct: 326 VKIGELEEAEKMLEEWESSCYCYDFRVPNIILLGYSQKGMIEKADAVLKEIVKKGKTPTP 385

Query: 419 NSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELK 478
           NSW IIAAGY +K N E+AF+CMKEA+AV E+NK WRPKPS++SSIL WL +N   EE++
Sbjct: 386 NSWSIIAAGYADKNNMEKAFECMKEALAVHEENKFWRPKPSLVSSILDWLGDNRDVEEVE 445

Query: 479 EFLSSLK 483
            F+SSLK
Sbjct: 446 AFVSSLK 451

BLAST of Cp4.1LG01g08640 vs. TAIR10
Match: AT4G21705.1 (AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 427.2 bits (1097), Expect = 1.5e-119
Identity = 215/482 (44.61%), Positives = 315/482 (65.35%), Query Frame = 1

Query: 56  VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNC 115
           +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   E+ RIV DLR  
Sbjct: 12  IASRYYYTNRV-KKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRR 71

Query: 116 RRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYG 175
           +R+  ALEVS+WM   G+  F+  + AV LDLIGRV G  +AE+YF ++  Q +  K YG
Sbjct: 72  KRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYG 131

Query: 176 ALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKEN 235
           ALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMKE 
Sbjct: 132 ALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEE 191

Query: 236 GVLPDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQ 295
            V PDNYSYRICI+++GA  DL  +   LR+ME +  I+MDW TY++ A F+I  G  ++
Sbjct: 192 NVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDR 251

Query: 296 AMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG 355
           A+  L+  E+++  +D  G+NHLI+LY  LG+K EV+RLW L+K  CK+++N+DY+T+L 
Sbjct: 252 AVELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 356 CLVKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIP 415
            LVK++ L EAE+++ EW+SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+  
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 416 PPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEE 475
            P SW ++A  Y EK   E AFKCMK A+ V+  ++ WRP  ++++S+L W+ + G  +E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 476 LKEFLSSLKTVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQ 520
           ++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILSTRS 491

BLAST of Cp4.1LG01g08640 vs. TAIR10
Match: AT2G20710.1 (AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 292.7 bits (748), Expect = 4.4e-79
Identity = 151/427 (35.36%), Positives = 249/427 (58.31%), Query Frame = 1

Query: 75  RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLF 134
           R++  G P  S++ +LD W+ +G ++K  E+  I++ LR   R+  AL++S+WM    + 
Sbjct: 43  RVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEHRVH 102

Query: 135 SFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHM 194
             +  D A++LDLI +V GL  AEK+F ++  +     LYGALLNCY  + ++ KA    
Sbjct: 103 EISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAEQVF 162

Query: 195 QKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGAR 254
           Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM++  V PD ++    + +Y   
Sbjct: 163 QEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAYSVV 222

Query: 255 SDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL-- 314
           SD+ GM K L   E+   + +DW TY+  AN +IKAG+ E+A+  LRK E  VN      
Sbjct: 223 SDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQKRKH 282

Query: 315 GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWE 374
            +  L+S Y + G+K+EV RLW+L K+     N  YI+++  L+K++ +EE EK+++EWE
Sbjct: 283 AYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEKIMEEWE 342

Query: 375 SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPE 434
           +    +D R+P++L+ GY ++G++E+AE+++  ++   R+   ++W  +A GY      E
Sbjct: 343 AGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAGKME 402

Query: 435 RAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLS 494
           +A +  K A+ V +   GWRP   VL S + +L      E L++ L  L    S  G +S
Sbjct: 403 KAVEKWKRAIEVSK--PGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL----SERGHIS 461

Query: 495 NAFDELL 500
             +D+LL
Sbjct: 463 --YDQLL 461

BLAST of Cp4.1LG01g08640 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 270.0 bits (689), Expect = 3.0e-72
Identity = 152/454 (33.48%), Positives = 252/454 (55.51%), Query Frame = 1

Query: 61  YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQ 120
           Y  R       ++ +IS +  PEL    +L+QW + GR +  +E+ R+V++LR  +R  Q
Sbjct: 58  YERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQ 117

Query: 121 ALEVSEWMRSKG-LFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLN 180
           ALEV +WM ++G  F  +  D A+QLDLIG+V+G+  AE++F  +    +  ++YG+LLN
Sbjct: 118 ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLN 177

Query: 181 CYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLP 240
            YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK+  +  
Sbjct: 178 AYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRL 237

Query: 241 DNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSY 300
           D YSY I +SS G+   +  M  V ++M+S   I  +WTT+S +A  +IK G  E+A   
Sbjct: 238 DIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDA 297

Query: 301 LRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK 360
           LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Sbjct: 298 LRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVR 357

Query: 361 LEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNS 420
           +  +E AEK+ +EW      YD R+PN+L+  Y +   +E AE +  +++  G  P  ++
Sbjct: 358 MGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSST 417

Query: 421 WGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEF 480
           W I+A G+  K+    A  C++ A +  E +  WRPK  +LS   +   E       +  
Sbjct: 418 WEILAVGHTRKRCISEALTCLRNAFSA-EGSSNWRPKVLMLSGFFKLCEEESDVTSKEAV 477

Query: 481 LSSLKTVPSMDGKLSNAFDELLET-LKNNDETTA 511
           L  L+    ++ K   A  ++ E    NN E  A
Sbjct: 478 LELLRQSGDLEDKSYLALIDVDENRTVNNSEIDA 510

BLAST of Cp4.1LG01g08640 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 260.0 bits (663), Expect = 3.2e-69
Identity = 146/457 (31.95%), Positives = 249/457 (54.49%), Query Frame = 1

Query: 56  VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR 115
           +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 116 IVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQ 175
            ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL+ + + + + E YF  +   
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPET 120

Query: 176 EEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPN 235
            +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP 
Sbjct: 121 SKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPA 180

Query: 236 VLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFF 295
           ++ E+K   V+PD+Y+Y + + +  A +D+ G+ +V+ EM     ++ DWTTYS +A+ +
Sbjct: 181 MIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIY 240

Query: 296 IKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN 355
           + AG+ ++A   L++ E K  Q D   +  LI+LY  LG+  EV R+W +L+    K  N
Sbjct: 241 VDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 300

Query: 356 RDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQN 415
             Y+ M+  LVKL  L  AE L KEW+++C  YD R+ NVL+  Y+Q GLI++A ++ + 
Sbjct: 301 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 360

Query: 416 IISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKG-WRPKPSVLSSILRW 475
               G      +W I    Y++  +  RA +CM +AV++ + + G W P P  + +++ +
Sbjct: 361 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 420

Query: 476 LSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLET 502
             +       +  L  LK     D   +  F+ L+ T
Sbjct: 421 FEQKKDVNGAENLLEILKN--GTDNIGAEIFEPLIRT 454

BLAST of Cp4.1LG01g08640 vs. TAIR10
Match: AT1G02370.1 (AT1G02370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 238.8 bits (608), Expect = 7.5e-63
Identity = 129/389 (33.16%), Positives = 220/389 (56.56%), Query Frame = 1

Query: 69  RRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWM 128
           +R L+ ++S L     +V   L+Q+I EG  ++  ++ R  + LR  RR   A E+ +WM
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 129 RSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVS-NQEEIGKLYGALLNCYVREGLV 188
             + + +F+  D A+ LDLIG+ +GL++AE YF+++  + +     YGAL+NCY  E   
Sbjct: 130 EKRKM-TFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEE 189

Query: 189 DKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRIC 248
           +KA +H + M E+ F ++ L +N++M +Y+   Q +KVP ++  MK+ G+ P   +Y I 
Sbjct: 190 EKAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIW 249

Query: 249 ISSYGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKV 308
           + S G+ +DL G+ K++ EM   +     W T+S +A  + KAG++E+A S L+  E+K+
Sbjct: 250 MQSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKM 309

Query: 309 NQDALGFNH-LISLYTSLGRKDEVMRLWALQKKCKKQVNR-DYITMLGCLVKLEFLEEAE 368
           N +    +H L+SLY  + +  EV R+W   KK + +VN   Y+ ML  + KL  L+  +
Sbjct: 310 NPNNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGIK 369

Query: 369 KLVKEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGY 428
           K+  EWES C  YD R+ N+ +  Y +  + E AEK+L   +   + P   +  ++    
Sbjct: 370 KIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIHL 429

Query: 429 LEKQNPERAFKCMKEAVAVQEQNK---GW 452
           LE    + A K ++ AV+   +NK   GW
Sbjct: 430 LENDKADLAMKHLEAAVSDSAENKDEWGW 457

BLAST of Cp4.1LG01g08640 vs. NCBI nr
Match: gi|449431834|ref|XP_004133705.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 816.6 bits (2108), Expect = 2.5e-233
Identity = 398/490 (81.22%), Positives = 444/490 (90.61%), Query Frame = 1

Query: 25  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 84
           +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+C+RRNL+ARISPLG PE 
Sbjct: 1   MAAASAMFKILSRSSSGCTRTLRPETDAFCFVALRLYSTRRSCDRRNLYARISPLGDPEC 60

Query: 85  SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQ 144
           +VVP+L+QWI+EGR IKDFE+RRIVRDLR CRRY QALEVSEWM SKGLFS TTRDFA+Q
Sbjct: 61  TVVPVLNQWIEEGRNIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGLFSLTTRDFAIQ 120

Query: 145 LDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 204
           LDLIG+V+GLDSAEKYF SVSNQ+EIGKLYGALLNCYVREGL+DK+L+HMQKMKEMG AS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSNQKEIGKLYGALLNCYVREGLIDKSLAHMQKMKEMGLAS 180

Query: 205 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVL 264
           SPLCYNDIMCLYLNTGQ DKVPNVLSEMKENGVLPDN+SYRICISSYGARSD+I M  VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 265 REMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 324
           +EME QTHISMDWTTYSMVA FFIKAGMH++AM+YLRKCEDKV++DALGFNHLIS YT+L
Sbjct: 241 KEMEGQTHISMDWTTYSMVAGFFIKAGMHDKAMNYLRKCEDKVDEDALGFNHLISHYTNL 300

Query: 325 GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPN 384
           G K+EVMRLWAL KK KKQ+NRDYITMLG LVKLE LEEAE LV EWESSC+CYDFRVPN
Sbjct: 301 GHKNEVMRLWALLKKGKKQLNRDYITMLGSLVKLELLEEAENLVMEWESSCQCYDFRVPN 360

Query: 385 VLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAV 444
           V+LIGYSQ+GLIE+AEKML+NII +G IP PNSWGIIA+GYLEKQN E+AF+CMKEA+AV
Sbjct: 361 VVLIGYSQKGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALAV 420

Query: 445 QEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKN 504
           + QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLKTVPSMD KL+NA DELLE + N
Sbjct: 421 KGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNNALDELLEIMAN 480

Query: 505 NDETTADALK 515
           +D  + D L+
Sbjct: 481 DDGISKDELE 490

BLAST of Cp4.1LG01g08640 vs. NCBI nr
Match: gi|659102689|ref|XP_008452263.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucumis melo])

HSP 1 Score: 797.0 bits (2057), Expect = 2.0e-227
Identity = 388/482 (80.50%), Positives = 434/482 (90.04%), Query Frame = 1

Query: 25  LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 84
           +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+CNRR L+A ISPLG P+ 
Sbjct: 1   MAAASAMFKILSRSSSGCTRTPRPETDAFCFVALRLYSTRRSCNRRKLYAMISPLGDPDS 60

Query: 85  SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQ 144
           SVVP+L+QWI+EGR IKDFE+RRIVRDLR CRRY QALEVSEWM SKG FS TTRDFA+Q
Sbjct: 61  SVVPVLNQWIKEGRKIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGRFSLTTRDFAIQ 120

Query: 145 LDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 204
           LDLIG+V+GLDSAEKYF SVS Q+EIGKLYG+LLNCYVREGL+DK+L+HMQKMKEMGFAS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSKQKEIGKLYGSLLNCYVREGLIDKSLAHMQKMKEMGFAS 180

Query: 205 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVL 264
           SPLCYNDIMCLYLNTGQ DKVPNVLSEMKENGVLPDN+SYRICISSYGARSD+I M  VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 265 REMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 324
           +EMESQTHISMDW TYSMVA FFIK  MH++A +YLRKCED+V+QDALGFNHLIS YT+L
Sbjct: 241 KEMESQTHISMDWITYSMVAGFFIKVVMHDKARNYLRKCEDRVDQDALGFNHLISHYTNL 300

Query: 325 GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPN 384
           G K+EVMRLWALQKK KKQ+NRDYITMLG LVKL+ LEEAE LV EWESSC+C DFRVPN
Sbjct: 301 GHKNEVMRLWALQKKAKKQLNRDYITMLGSLVKLDLLEEAENLVMEWESSCQCNDFRVPN 360

Query: 385 VLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAV 444
           V+LIGYSQ GLIE+AEKML+NII +G IP PNSWGIIA+GYLEKQN E+AF+CMKEA+AV
Sbjct: 361 VVLIGYSQNGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALAV 420

Query: 445 QEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDGKLSNAFDELLETLKN 504
           + QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLKTVPSMD KL++A DELLE ++N
Sbjct: 421 KGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNSALDELLEIMEN 480

Query: 505 ND 507
           +D
Sbjct: 481 DD 482

BLAST of Cp4.1LG01g08640 vs. NCBI nr
Match: gi|255573349|ref|XP_002527601.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Ricinus communis])

HSP 1 Score: 594.7 bits (1532), Expect = 1.5e-166
Identity = 289/452 (63.94%), Positives = 361/452 (79.87%), Query Frame = 1

Query: 42  FTRTARTET-DAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMI 101
           FT   RT++  A   +  R Y+  RT +   LFARISPLG P++S+VP+LD W+QEG+ I
Sbjct: 7   FTILKRTQSLTANAILTRRYYNKARTASN-TLFARISPLGEPDISLVPVLDNWVQEGKKI 66

Query: 102 KDFEMRRIVRDLRNCRRYGQALEVSEWMRSKGLFSFTTRDFAVQLDLIGRVQGLDSAEKY 161
           + FE+++I+RDLR  RRY QAL+VSEWM  KG   F+  D AVQLDLIGRV+GL+SAE Y
Sbjct: 67  RGFELQKIIRDLRCHRRYTQALQVSEWMNGKGQSGFSPADHAVQLDLIGRVRGLESAESY 126

Query: 162 FSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTG 221
           F ++ NQ+   K YGALLNCYVREGLVDK+L HMQKMKE+GFASSPL YND+MCLY  TG
Sbjct: 127 FQNLVNQDRNDKTYGALLNCYVREGLVDKSLYHMQKMKELGFASSPLNYNDLMCLYTRTG 186

Query: 222 QVDKVPNVLSEMKENGVLPDNYSYRICISSYGARSDLIGMLKVLREMESQTHISMDWTTY 281
           Q++KV +VLSEMKENG+ PD +SYRIC+SS  ARSDL G+ ++L EME+Q+HIS+DW TY
Sbjct: 187 QLEKVTDVLSEMKENGITPDLFSYRICMSSCAARSDLKGVEEILEEMENQSHISIDWVTY 246

Query: 282 SMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEVMRLWALQK-K 341
           S VA+ ++KA + E+A+ YL+KCE KVN+DALG+NHLISL  SLG KDEVMRLW L K K
Sbjct: 247 STVASIYVKASLKEKALIYLKKCEQKVNRDALGYNHLISLNASLGIKDEVMRLWGLVKTK 306

Query: 342 CKKQVNRDYITMLGCLVKLEFLEEAEKLVKEWESSCECYDFRVPNVLLIGYSQRGLIERA 401
           CKKQVNRDYITMLG LVKLE LEEA+KL++EWESSC+CYDFRVPNVLLIGY Q+GLIE+A
Sbjct: 307 CKKQVNRDYITMLGALVKLEELEEADKLLQEWESSCQCYDFRVPNVLLIGYCQQGLIEKA 366

Query: 402 EKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLS 461
           E ML++I+   + P PNSW IIAAGY+ KQN E+AF CMKEA+ VQ +NKGWRPK +++S
Sbjct: 367 EAMLKDIVKKQKNPTPNSWAIIAAGYVNKQNMEKAFNCMKEALTVQAENKGWRPKANLIS 426

Query: 462 SILRWLSENGRYEELKEFLSSLKTVPSMDGKL 492
           SIL WL ENG  E+++ F++ L+T    D ++
Sbjct: 427 SILSWLGENGDVEDVEAFVNLLETKVPKDREI 457

BLAST of Cp4.1LG01g08640 vs. NCBI nr
Match: gi|225461407|ref|XP_002282230.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Vitis vinifera])

HSP 1 Score: 584.3 bits (1505), Expect = 2.1e-163
Identity = 271/425 (63.76%), Positives = 355/425 (83.53%), Query Frame = 1

Query: 71  NLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRS 130
           NL++RISPLG+P LS+VP+LDQW++EG+ ++D E+ RI+RDLR+ +RY QALEVSEWM S
Sbjct: 38  NLYSRISPLGTPNLSLVPVLDQWVEEGKKVRDVELHRIIRDLRSRKRYAQALEVSEWMSS 97

Query: 131 KGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKA 190
           K L  F+    AVQLDLIG+V+GL+SAE YF+++S +E+I K+YGALLNCYVRE ++DK+
Sbjct: 98  KELCPFSPSARAVQLDLIGQVRGLESAENYFNNMSAEEKIDKMYGALLNCYVRERVIDKS 157

Query: 191 LSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISS 250
           LSH+QKMKE+GFAS+PL YN +MCLY+NT Q++K+P+VLSEM+ENG+ PDN+SYR+CI+S
Sbjct: 158 LSHLQKMKELGFASTPLPYNGLMCLYINTDQLEKIPDVLSEMQENGISPDNFSYRLCINS 217

Query: 251 YGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQD 310
           YGARSDL  M K+L EMES++HI +DW TYSMVANF+IKAG++E+A+ +L+K E K+++D
Sbjct: 218 YGARSDLNSMEKILEEMESKSHIHIDWMTYSMVANFYIKAGLNEKALFFLKKAETKLHKD 277

Query: 311 ALGFNHLISLYTSLGRKDEVMRLWALQKKC-KKQVNRDYITMLGCLVKLEFLEEAEKLVK 370
            LG+NHLISLY SLG K E+MRLW  +K   KK +NRDYITMLG LVKL  LE+ E L+K
Sbjct: 278 PLGYNHLISLYASLGSKAEMMRLWERRKTASKKLINRDYITMLGSLVKLGELEDTEALLK 337

Query: 371 EWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQ 430
           EWESS  CYDFRVPN LLIG+ Q+GLIE+AE ML++I+ +G+ P PNSW I+AAGY+EKQ
Sbjct: 338 EWESSGNCYDFRVPNTLLIGFCQKGLIEKAESMLRDIVEEGKTPTPNSWSIVAAGYIEKQ 397

Query: 431 NPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDG 490
           N E+AF+CMKEA+AV  +NKGWRPKP V+SSIL WL +N   EE++ F+S+LK V  MD 
Sbjct: 398 NMEKAFECMKEAIAVLAENKGWRPKPKVISSILSWLGDNRDVEEVETFVSALKAVIPMDR 457

Query: 491 KLSNA 495
           ++ +A
Sbjct: 458 EMYHA 462

BLAST of Cp4.1LG01g08640 vs. NCBI nr
Match: gi|302143027|emb|CBI20322.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 584.3 bits (1505), Expect = 2.1e-163
Identity = 271/425 (63.76%), Positives = 355/425 (83.53%), Query Frame = 1

Query: 71  NLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALEVSEWMRS 130
           NL++RISPLG+P LS+VP+LDQW++EG+ ++D E+ RI+RDLR+ +RY QALEVSEWM S
Sbjct: 38  NLYSRISPLGTPNLSLVPVLDQWVEEGKKVRDVELHRIIRDLRSRKRYAQALEVSEWMSS 97

Query: 131 KGLFSFTTRDFAVQLDLIGRVQGLDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKA 190
           K L  F+    AVQLDLIG+V+GL+SAE YF+++S +E+I K+YGALLNCYVRE ++DK+
Sbjct: 98  KELCPFSPSARAVQLDLIGQVRGLESAENYFNNMSAEEKIDKMYGALLNCYVRERVIDKS 157

Query: 191 LSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKENGVLPDNYSYRICISS 250
           LSH+QKMKE+GFAS+PL YN +MCLY+NT Q++K+P+VLSEM+ENG+ PDN+SYR+CI+S
Sbjct: 158 LSHLQKMKELGFASTPLPYNGLMCLYINTDQLEKIPDVLSEMQENGISPDNFSYRLCINS 217

Query: 251 YGARSDLIGMLKVLREMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQD 310
           YGARSDL  M K+L EMES++HI +DW TYSMVANF+IKAG++E+A+ +L+K E K+++D
Sbjct: 218 YGARSDLNSMEKILEEMESKSHIHIDWMTYSMVANFYIKAGLNEKALFFLKKAETKLHKD 277

Query: 311 ALGFNHLISLYTSLGRKDEVMRLWALQKKC-KKQVNRDYITMLGCLVKLEFLEEAEKLVK 370
            LG+NHLISLY SLG K E+MRLW  +K   KK +NRDYITMLG LVKL  LE+ E L+K
Sbjct: 278 PLGYNHLISLYASLGSKAEMMRLWERRKTASKKLINRDYITMLGSLVKLGELEDTEALLK 337

Query: 371 EWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQ 430
           EWESS  CYDFRVPN LLIG+ Q+GLIE+AE ML++I+ +G+ P PNSW I+AAGY+EKQ
Sbjct: 338 EWESSGNCYDFRVPNTLLIGFCQKGLIEKAESMLRDIVEEGKTPTPNSWSIVAAGYIEKQ 397

Query: 431 NPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKTVPSMDG 490
           N E+AF+CMKEA+AV  +NKGWRPKP V+SSIL WL +N   EE++ F+S+LK V  MD 
Sbjct: 398 NMEKAFECMKEAIAVLAENKGWRPKPKVISSILSWLGDNRDVEEVETFVSALKAVIPMDR 457

Query: 491 KLSNA 495
           ++ +A
Sbjct: 458 EMYHA 462
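
The Identity and Positives percentages in each HSP header are ratios of matching alignment columns to the alignment length. As a minimal sketch (the regular expression and the function name `check_hsp_line` are illustrative assumptions, not part of the database or of BLAST itself), the reported values can be recomputed from the counts:

```python
import re

# Matches ratio fields such as "Identity = 271/425 (63.76%)" in an HSP summary line.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line):
    """Return {label: (recomputed_percent, reported_percent)} for each ratio field."""
    out = {}
    for label, num, den, reported in FIELD.findall(line):
        recomputed = round(100 * int(num) / int(den), 2)
        out[label] = (recomputed, float(reported))
    return out

# Summary line copied from the Vitis vinifera hit above.
line = "Identity = 271/425 (63.76%), Positives = 355/425 (83.53%), Query Frame = 1"
result = check_hsp_line(line)
for label, (recomputed, reported) in result.items():
    print(label, recomputed, reported)
```

The "Query Frame" field is ignored by the pattern because it is not a ratio.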

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
PP334_ARATH   2.6e-118  44.61     Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
PP166_ARATH   7.8e-78   35.36     Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
PPR3_ARATH    5.4e-71   33.48     Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN...
PPR86_ARATH   5.6e-68   31.95     Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN...
PPR4_ARATH    1.3e-61   33.16     Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidop...

Match Name        E-value   Identity  Description
A0A0A0L7Y2_CUCSA  1.7e-233  81.22     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G104890 PE=4 SV=1
B9SNN7_RICCO      1.1e-166  63.94     Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
F6H257_VITVI      1.4e-163  63.76     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0014g02920 PE=4 SV=...
M5WGV8_PRUPE      1.8e-161  60.47     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004720mg PE=4 SV=1
V4TX32_9ROSI      4.8e-159  63.23     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10007647mg PE=4 SV=1

Match Name   E-value   Identity  Description
AT4G21705.1  1.5e-119  44.61     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G20710.1  4.4e-79   35.36     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1  3.0e-72   33.48     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1  3.2e-69   31.95     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02370.1  7.5e-63   33.16     Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value   Identity  Description
gi|449431834|ref|XP_004133705.1|  2.5e-233  81.22     PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-...
gi|659102689|ref|XP_008452263.1|  2.0e-227  80.50     PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-...
gi|255573349|ref|XP_002527601.1|  1.5e-166  63.94     PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ...
gi|225461407|ref|XP_002282230.1|  2.1e-163  63.76     PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ...
gi|302143027|emb|CBI20322.3|      2.1e-163  63.76     unnamed protein product [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0009451      RNA modification
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g08640.1  Cp4.1LG01g08640.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 384..410  score: 0.061
    coord: 174..202  score: 1.5E-5
    coord: 348..373  score:
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 208..251  score: 7.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 208..240  score: 2.3E-5
    coord: 173..202  score: 5.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 205..239  score: 9.876
    coord: 310..344  score: 6.665
    coord: 101..135  score: 5.415
    coord: 454..484  score: 5.448
    coord: 379..413  score: 8.331
    coord: 240..274  score: 6.741
    coord: 170..204  score: 9.098
    coord: 276..306  score: 6
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 158..222  score: 3.1E-14
    coord: 279..472  score: 3.1
IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like
    coord: 291..327  score: 3.29E-6
    coord: 180..229  score: 3.29E-6
    coord: 421..449  score: 3.2
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 71..488  score: 4.1E
None       No IPR available                       PANTHER   PTHR24015:SF671   SUBFAMILY NOT NAMED
    coord: 71..488  score: 4.1E
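
The PS51375 (PROFILE) coordinates listed above can be converted into match lengths to see how the predicted repeats tile the protein. This is a minimal sketch that assumes the coordinates are 1-based, inclusive amino-acid positions; the helper name `match_lengths` is hypothetical. Most of the resulting lengths are 35 residues, in line with the roughly 35-residue pentatricopeptide motif.

```python
import re

# Matches entries such as "coord: 205..239" and captures start and end.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")

def match_lengths(text):
    """Return (start, end, length) for each 'coord: a..b' entry, sorted by start.

    Length assumes inclusive coordinates, so length = end - start + 1.
    """
    spans = [(int(a), int(b)) for a, b in COORD.findall(text)]
    return [(a, b, b - a + 1) for a, b in sorted(spans)]

# Coordinates copied from the PS51375 row of the InterPro annotations.
ps51375 = ("coord: 205..239 coord: 310..344 coord: 101..135 coord: 454..484 "
           "coord: 379..413 coord: 240..274 coord: 170..204 coord: 276..306")

rows = match_lengths(ps51375)
for start, end, length in rows:
    print(f"{start:>4}..{end:<4} length {length}")
```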

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG01g08640  Wax gourd                  cpewgoB0485
Cp4.1LG01g08640  Wax gourd                  cpewgoB0489
Cp4.1LG01g08640  Cucurbita pepo (Zucchini)  cpecpeB233
Cp4.1LG01g08640  Cucurbita maxima (Rimu)    cmacpeB350
Cp4.1LG01g08640  Cucurbita maxima (Rimu)    cmacpeB720
Cp4.1LG01g08640  Cucurbita moschata (Rifu)  cmocpeB312
Cp4.1LG01g08640  Cucurbita moschata (Rifu)  cmocpeB673
Cp4.1LG01g08640  Silver-seed gourd          carcpeB0667