CmoCh04G004020 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G004020
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPentatricopeptide repeat-containing protein
LocationCmo_Chr04 : 1983861 .. 1986184 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAAAAAAAAAATGAAATGAAAAATAACCCACGATGTGGATGGGCAGGATTGTGGTTAAGCCCTAACAAAACCTCCGACGTATCACGCACCTCAGGACAAGTGATTTTCGAGAGAAATGAATCGTCGGAGTTTACTCTCGAGAGCGTCGGCAGGTTTGCGGCATCTCTGTACTTCAACCGCCGAGTCGAAGCGCGGTCCTGTGAATGATCAGCAGCGGCTATACCCGAGGCTGTCGAAGTTGGGTGCCACCGGCGGTAGCGTGGCGCAGACATTGAACCAGTACATTATGGAGGGAAAGATCGTCAAAAAATATGAGCTCGAGAGATGCATAAAGGAGCTCCGGAAGTACCGTAGATACCACCACGCCCTTCAGGTTTTTTCCTTAAACACATAGGTTCGTTCACTGTAGGAGAAATATAAATGCACAAAATTACTGCTGCTTTTCATTTGTTTTTCAAATTCTATTGCCAATTGTTGATTGTATTCTTCCTTTCGCTGCTAGATCATAAATTTTTTGTAAACGAAACATTCATAAAATATAAAGGAAATGGCAAAAACCTTGTGAAGAATTTACGATAAAAAATCATTGGATCTGGTAGCTATGAAAAAGCACTCTGGAGGGAGAACCAGCCAAAAAGTTGCTGTAATTGCAGGAGACCATAAGATCTTAGCTCATCCATTGCATTTGCTTCGGTTGTTGCAGATAATGGAATGGATGGAGATGAGGAAAATCAACTACTCATTCACTGACTACGCGCTGCGTTTAGATCTTATATCGAAAGTTAAAGGAATTGCTGCTGCGGAGAATTATTTCTGTGATCTGTCGTCGTCTGCGAAGAATCGATTTACTTATGGAGCTCTTTTGAATTGCTATTGCAAGGAATTGATGGAGGAAAAGGCATTGGCTCTTTCTAAGAAGATAGATGAGTTGAAGTTTGCTTCCAATTTGTCCTTTAACAATCTTATGACCATGTATATGAGAATGGATCAACCCGAGAAAGTACCTCCTCTTATAGATGAAATGAAGCGGAGAGGGATTTTTCTTAGCACGTACACATACAATGTTTGGATGAACAGTTGTGCTTCCCTGAATGGCGTTGGAAAAGTTGAAGAAATTCTCGAGGAGATGAAAAACGAAGACAGAAACAAATTTGATTGGACGACGTTTTCGAACTTGGCTGCTATCTATGTTAAGGCAGGACAGCTCGAGAAAGCTGAATTAGCTCTTAAGAAGGTAGAGAACGAGATCAAATCGAATAAGCAGCAGGATCGTCTAGCGTACCATTTCTTGATAAGCCTTTATGCATCGACGTCGAATCGGAGTGAGGTGTATAGGATATGGAATGCACTGAAATCAGTTTATCCAATGACAAATAACATGAGTTATCTCGTCATGCTTCAGGCTCTAAGCAAACTAAAGGATTTCGAGGGTCTTAAAAGCACTTATAAGGAATGGGAATCTAGTTGCTCGAGCTTTGATTTGCGGTTAGCGGATGTTACGATCGGGGCTTACCTACGACAGGACATGTACGAAGATGCTGCATTGGTCTTTGAGGATGCTATTAAGAGAAGTAAAGGACCTTTCTTTAGGGCTCGAGAAATGTTCATGATTTACTTCTTGAAGTTCAAGCAAGTCGATTTGGCACTCAGTCATTTGGAATCAGCTATATCGGAAAGCATGGACGATGAATGGCATCCATCACCGGCGATGGCGAATGCTTTTCTGATGTACTTTGAGGAAGAGAAAGACATTGAAGGCGCCGAAGATTTTGCCAGGATTTTGAAGAGATTTAAGTGTCTCGATGCTAGTGCATACCATCTATTGCTCAAGACCTATGCAGCTGCAGGAAAACGAGCCCCCGACATGCGACAAAGATTGATAGAAGACAACATTGAGGTAAGTAGTGAGCTTGAGGAGTTGTTAAGTAGACAACATTGAGACAAGTAGGAGGAAAATCCATGGGATGTTTTTTTTTTTATCTGATCTTTATAAGCTGATCCACTTCATTCCTTTTCTCTTAAACGAAATTTGTATTTTCGAGCATTAGTCTATTTTCATTTTCATTTTCTTGATAAGCTGATGGTGCACAACTTGTAGAAGTTCCCTTGGTTTGTGGTAGACTTGGACCAGGTTTTTGTCACCATGTTATCCGCATCGAAGAAGCCAGGGCTGGCAAGATAAGTTCACGTGGCCATGCCCTAAACCAATTGATAAGCTTTTTTTATTTGATTGATGAGAAAAATTACTACTTATTAGCTTAGCTAGCCTTCTGGGAGAATCGAGTCATTTAGTCCGTTCATATGAATATATCAATTTATT

mRNA sequence

AAAAAAAAAAAAAAATGAAATGAAAAATAACCCACGATGTGGATGGGCAGGATTGTGGTTAAGCCCTAACAAAACCTCCGACGTATCACGCACCTCAGGACAAGTGATTTTCGAGAGAAATGAATCGTCGGAGTTTACTCTCGAGAGCGTCGGCAGGTTTGCGGCATCTCTGTACTTCAACCGCCGAGTCGAAGCGCGGTCCTGTGAATGATCAGCAGCGGCTATACCCGAGGCTGTCGAAGTTGGGTGCCACCGGCGGTAGCGTGGCGCAGACATTGAACCAGTACATTATGGAGGGAAAGATCGTCAAAAAATATGAGCTCGAGAGATGCATAAAGGAGCTCCGGAAGTACCGTAGATACCACCACGCCCTTCAGATAATGGAATGGATGGAGATGAGGAAAATCAACTACTCATTCACTGACTACGCGCTGCGTTTAGATCTTATATCGAAAGTTAAAGGAATTGCTGCTGCGGAGAATTATTTCTGTGATCTGTCGTCGTCTGCGAAGAATCGATTTACTTATGGAGCTCTTTTGAATTGCTATTGCAAGGAATTGATGGAGGAAAAGGCATTGGCTCTTTCTAAGAAGATAGATGAGTTGAAGTTTGCTTCCAATTTGTCCTTTAACAATCTTATGACCATGTATATGAGAATGGATCAACCCGAGAAAGTACCTCCTCTTATAGATGAAATGAAGCGGAGAGGGATTTTTCTTAGCACGTACACATACAATGTTTGGATGAACAGTTGTGCTTCCCTGAATGGCGTTGGAAAAGTTGAAGAAATTCTCGAGGAGATGAAAAACGAAGACAGAAACAAATTTGATTGGACGACGTTTTCGAACTTGGCTGCTATCTATGTTAAGGCAGGACAGCTCGAGAAAGCTGAATTAGCTCTTAAGAAGGTAGAGAACGAGATCAAATCGAATAAGCAGCAGGATCGTCTAGCGTACCATTTCTTGATAAGCCTTTATGCATCGACGTCGAATCGGAGTGAGGTGTATAGGATATGGAATGCACTGAAATCAGTTTATCCAATGACAAATAACATGAGTTATCTCGTCATGCTTCAGGCTCTAAGCAAACTAAAGGATTTCGAGGGTCTTAAAAGCACTTATAAGGAATGGGAATCTAGTTGCTCGAGCTTTGATTTGCGGTTAGCGGATGTTACGATCGGGGCTTACCTACGACAGGACATGTACGAAGATGCTGCATTGGTCTTTGAGGATGCTATTAAGAGAAGTAAAGGACCTTTCTTTAGGGCTCGAGAAATGTTCATGATTTACTTCTTGAAGTTCAAGCAAGTCGATTTGGCACTCAGTCATTTGGAATCAGCTATATCGGAAAGCATGGACGATGAATGGCATCCATCACCGGCGATGGCGAATGCTTTTCTGATGTACTTTGAGGAAGAGAAAGACATTGAAGGCGCCGAAGATTTTGCCAGGATTTTGAAGAGATTTAAGTGTCTCGATGCTAGTGCATACCATCTATTGCTCAAGACCTATGCAGCTGCAGGAAAACGAGCCCCCGACATGCGACAAAGATTGATAGAAGACAACATTGAGGTAAGTAGTGAGCTTGAGGAGTTGTTAAGTAGACAACATTGAGACAAGTAGGAGGAAAATCCATGGGATGTTTTTTTTTTTATCTGATCTTTATAAGCTGATCCACTTCATTCCTTTTCTCTTAAACGAAATTTGTATTTTCGAGCATTAGTCTATTTTCATTTTCATTTTCTTGATAAGCTGATGGTGCACAACTTGTAGAAGTTCCCTTGGTTTGTGGTAGACTTGGACCAGGTTTTTGTCACCATGTTATCCGCATCGAAGAAGCCAGGGCTGGCAAGATAAGTTCACGTGGCCATGCCCTAAACCAATTGATAAGCTTTTTTTATTTGATTGATGAGAAAAATTACTACTTATTAGCTTAGCTAGCCTTCTGGGAGAATCGAGTCATTTAGTCCGTTCATATGAATATATCAATTTATT

Coding sequence (CDS)

ATGAATCGTCGGAGTTTACTCTCGAGAGCGTCGGCAGGTTTGCGGCATCTCTGTACTTCAACCGCCGAGTCGAAGCGCGGTCCTGTGAATGATCAGCAGCGGCTATACCCGAGGCTGTCGAAGTTGGGTGCCACCGGCGGTAGCGTGGCGCAGACATTGAACCAGTACATTATGGAGGGAAAGATCGTCAAAAAATATGAGCTCGAGAGATGCATAAAGGAGCTCCGGAAGTACCGTAGATACCACCACGCCCTTCAGATAATGGAATGGATGGAGATGAGGAAAATCAACTACTCATTCACTGACTACGCGCTGCGTTTAGATCTTATATCGAAAGTTAAAGGAATTGCTGCTGCGGAGAATTATTTCTGTGATCTGTCGTCGTCTGCGAAGAATCGATTTACTTATGGAGCTCTTTTGAATTGCTATTGCAAGGAATTGATGGAGGAAAAGGCATTGGCTCTTTCTAAGAAGATAGATGAGTTGAAGTTTGCTTCCAATTTGTCCTTTAACAATCTTATGACCATGTATATGAGAATGGATCAACCCGAGAAAGTACCTCCTCTTATAGATGAAATGAAGCGGAGAGGGATTTTTCTTAGCACGTACACATACAATGTTTGGATGAACAGTTGTGCTTCCCTGAATGGCGTTGGAAAAGTTGAAGAAATTCTCGAGGAGATGAAAAACGAAGACAGAAACAAATTTGATTGGACGACGTTTTCGAACTTGGCTGCTATCTATGTTAAGGCAGGACAGCTCGAGAAAGCTGAATTAGCTCTTAAGAAGGTAGAGAACGAGATCAAATCGAATAAGCAGCAGGATCGTCTAGCGTACCATTTCTTGATAAGCCTTTATGCATCGACGTCGAATCGGAGTGAGGTGTATAGGATATGGAATGCACTGAAATCAGTTTATCCAATGACAAATAACATGAGTTATCTCGTCATGCTTCAGGCTCTAAGCAAACTAAAGGATTTCGAGGGTCTTAAAAGCACTTATAAGGAATGGGAATCTAGTTGCTCGAGCTTTGATTTGCGGTTAGCGGATGTTACGATCGGGGCTTACCTACGACAGGACATGTACGAAGATGCTGCATTGGTCTTTGAGGATGCTATTAAGAGAAGTAAAGGACCTTTCTTTAGGGCTCGAGAAATGTTCATGATTTACTTCTTGAAGTTCAAGCAAGTCGATTTGGCACTCAGTCATTTGGAATCAGCTATATCGGAAAGCATGGACGATGAATGGCATCCATCACCGGCGATGGCGAATGCTTTTCTGATGTACTTTGAGGAAGAGAAAGACATTGAAGGCGCCGAAGATTTTGCCAGGATTTTGAAGAGATTTAAGTGTCTCGATGCTAGTGCATACCATCTATTGCTCAAGACCTATGCAGCTGCAGGAAAACGAGCCCCCGACATGCGACAAAGATTGATAGAAGACAACATTGAGGTAAGTAGTGAGCTTGAGGAGTTGTTAAGTAGACAACATTGA
BLAST of CmoCh04G004020 vs. Swiss-Prot
Match: PPR4_ARATH (Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidopsis thaliana GN=At1g02370 PE=2 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 7.8e-136
Identity = 247/466 (53.00%), Postives = 337/466 (72.32%), Query Frame = 1

Query: 32  QQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEWM 91
           Q+ LY +LS L  TGG+VA+TLNQ+IMEG  V+K +L RC K LRK+RR  HA +I +WM
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 92  EMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN-RFTYGALLNCYCKELMEE 151
           E RK+ +S +D+A+ LDLI K KG+ AAENYF +L  SAKN + TYGAL+NCYC EL EE
Sbjct: 130 EKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEE 189

Query: 152 KALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWM 211
           KA A  + +DEL F +N L FNN+M+MYMR+ QPEKVP L+D MK+RGI     TY++WM
Sbjct: 190 KAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWM 249

Query: 212 NSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKAGQLEKAELALKKVENEIK 271
            SC SLN +  +E+I++EM  +   K  W TFSNLAAIY KAG  EKA+ ALK +E ++ 
Sbjct: 250 QSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKMN 309

Query: 272 SNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDFEG 331
            N   +R ++HFL+SLYA  S   EVYR+W +LK   P  NN+SYLVMLQA+SKL D +G
Sbjct: 310 PN---NRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDG 369

Query: 332 LKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDAIKRSKGPFFRAREMFMI 391
           +K  + EWES C ++D+RLA++ I  YL+ +MYE+A  + + A+K+SKGPF +AR++ MI
Sbjct: 370 IKKIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMI 429

Query: 392 YFLKFKQVDLALSHLESAISESMD--DEWHPSPAMANAFLMYFEEEKDIEGAEDFARILK 451
           + L+  + DLA+ HLE+A+S+S +  DEW  S  + + F ++FE+ KD++GAEDF +IL 
Sbjct: 430 HLLENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCKILS 489

Query: 452 RFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSELEELL 494
            +K LD+     L+KTYAAA K +PDMR+RL +  IEVS E+++LL
Sbjct: 490 NWKPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIEVSEEIQDLL 532

BLAST of CmoCh04G004020 vs. Swiss-Prot
Match: PP300_ARATH (Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidopsis thaliana GN=At4g01990 PE=2 SV=1)

HSP 1 Score: 437.6 bits (1124), Expect = 1.9e-121
Identity = 234/488 (47.95%), Postives = 323/488 (66.19%), Query Frame = 1

Query: 17  LCTSTAE-------SKRGPVNDQQRLYPRLSKLGATGGS-VAQTLNQYIMEGKIVKKYEL 76
           L T+TAE       S        + +Y +LS LG  GG  + +TLNQ++MEG  VKK++L
Sbjct: 16  LATATAEISGEAAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDL 75

Query: 77  ERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSS 136
            R  K+LRK+R+   AL+I EWME ++I ++ +D+A+RL+LI+K KG+ AAE YF  L  
Sbjct: 76  IRYAKDLRKFRQPQRALEIFEWMERKEIAFTGSDHAIRLNLIAKSKGLEAAETYFNSLDD 135

Query: 137 SAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVP 196
           S KN+ TYG+LLNCYC E  E KA A  + + +L   SN L FNNLM MYM + QPEKVP
Sbjct: 136 SIKNQSTYGSLLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVP 195

Query: 197 PLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAI 256
            L+  MK + I     TY++W+ SC SL  +  VE++L+EMK E    F W TF+NLAAI
Sbjct: 196 ALVVAMKEKSITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAI 255

Query: 257 YVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYP 316
           Y+K G   KAE ALK +EN +  + +     YHFLI+LY   +N SEVYR+W+ LK  YP
Sbjct: 256 YIKVGLYGKAEEALKSLENNMNPDVRD---CYHFLINLYTGIANASEVYRVWDLLKKRYP 315

Query: 317 MTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAAL 376
             NN SYL ML+ALSKL D +G+K  + EWES+C ++D+R+A+V I +YL+Q+MYE+A  
Sbjct: 316 NVNNSSYLTMLRALSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEA 375

Query: 377 VFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFL 436
           VF  A+K+ KG F +AR++ M++ LK  Q DLAL H E+A+ +  D  W  S  + ++F 
Sbjct: 376 VFNGAMKKCKGQFSKARQLLMMHLLKNDQADLALKHFEAAVLD-QDKNWTWSSELISSFF 435

Query: 437 MYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSS 496
           ++FEE KD++GAE+F + L ++  L +  Y LL+KTY AAGK  PDM++RL E  I V  
Sbjct: 436 LHFEEAKDVDGAEEFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDE 495

BLAST of CmoCh04G004020 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 366.7 bits (940), Expect = 4.0e-100
Identity = 205/485 (42.27%), Postives = 303/485 (62.47%), Query Frame = 1

Query: 14  LRHLCTSTAESKRGPVND-QQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCI 73
           +RHL  S   +KR      ++ LY RL K G T   V Q LNQ++   K V K+E+   I
Sbjct: 3   MRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTI 62

Query: 74  KELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN 133
           K+LR    Y+ AL++ E ME R +N + +D A+ LDL++K + I A ENYF DL  ++K 
Sbjct: 63  KKLRNRGLYYPALKLSEVMEERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKT 122

Query: 134 RFTYGALLNCYCKELMEEKALALSKKIDELKFA-SNLSFNNLMTMYMRMDQPEKVPPLID 193
             TYG+LLNCYCKEL+ EKA  L  K+ EL    S++S+N+LMT+Y +  + EKVP +I 
Sbjct: 123 ELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQ 182

Query: 194 EMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKA 253
           E+K   +   +YTYNVWM + A+ N +  VE ++EEM  + R   DWTT+SN+A+IYV A
Sbjct: 183 ELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDA 242

Query: 254 GQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNN 313
           G  +KAE AL+++E +   N Q+D  AY FLI+LY      +EVYRIW +L+   P T+N
Sbjct: 243 GLSQKAEKALQELEMK---NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 302

Query: 314 MSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFED 373
           ++YL M+Q L KL D  G ++ +KEW+++CS++D+R+ +V IGAY ++ + + A  + E 
Sbjct: 303 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 362

Query: 374 AIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDD--EWHPSPAMANAFLMY 433
           A +R      +  E+FM Y++K   +  AL  +  A+S    D  +W PSP    A + Y
Sbjct: 363 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 422

Query: 434 FEEEKDIEGAEDFARILKR-FKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSE 493
           FE++KD+ GAE+   ILK     + A  +  L++TYAAAGK  P MR+RL  +N+EV+  
Sbjct: 423 FEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEA 482

BLAST of CmoCh04G004020 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 285.4 bits (729), Expect = 1.2e-75
Identity = 157/444 (35.36%), Postives = 256/444 (57.66%), Query Frame = 1

Query: 21  TAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRR 80
           T + +R P+     +Y ++S +       A  LNQ+   G+ + K+EL R +KELRKY+R
Sbjct: 55  TVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKR 114

Query: 81  YHHALQIMEWMEMR--KINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGA 140
            + AL++ +WM  R  +   S +D A++LDLI KV+GI  AE +F  L  + K+R  YG+
Sbjct: 115 ANQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGS 174

Query: 141 LLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRG 200
           LLN Y +    EKA AL   + +  +A + L FN +MT+YM + + +KV  ++ EMK++ 
Sbjct: 175 LLNAYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKD 234

Query: 201 IFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKAGQLEKA 260
           I L  Y+YN+W++SC SL  V K+E + ++MK++     +WTTFS +A +Y+K G+ EKA
Sbjct: 235 IRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKA 294

Query: 261 ELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVM 320
           E AL+KVE  I     ++R+ YH+L+SLY S  N+ E+YR+W+  KSV P   N+ Y  +
Sbjct: 295 EDALRKVEARITG---RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHAL 354

Query: 321 LQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDAIKRSK 380
           + +L ++ D EG +  Y+EW    SS+D R+ ++ + AY++ D  E A  +F+  ++   
Sbjct: 355 VSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGG 414

Query: 381 GPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDIE 440
            P     E+  +   + + +  AL+ L +A S      W P   M + F    EEE D+ 
Sbjct: 415 KPSSSTWEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVT 474

Query: 441 GAEDFARILKRFKCLDASAYHLLL 462
             E    +L++   L+  +Y  L+
Sbjct: 475 SKEAVLELLRQSGDLEDKSYLALI 495

BLAST of CmoCh04G004020 vs. Swiss-Prot
Match: PP302_ARATH (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 255.4 bits (651), Expect = 1.3e-66
Identity = 160/475 (33.68%), Postives = 261/475 (54.95%), Query Frame = 1

Query: 21  TAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRR 80
           +A  K   V  +  L  RL  L  T  S   T+ ++  EG  V+KYEL R ++ELRK +R
Sbjct: 49  SANKKETVVGGRDTLGGRLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKR 108

Query: 81  YHHALQIMEWMEMRK-INYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGAL 140
           Y HAL+I EWM +++ I     DYA+ LDLISK++G+ +AE +F D+    +      +L
Sbjct: 109 YKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSL 168

Query: 141 LNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGI 200
           L+ Y +  + +KA AL +K+ E  F  + L +N++++MY+   Q EKVP LI E+K R  
Sbjct: 169 LHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELKIR-T 228

Query: 201 FLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKAGQLEKAE 260
                TYN+W+ + AS N V   E++  + K E  N  DW T+S L  +Y K   +EKA 
Sbjct: 229 SPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEKLNP-DWVTYSVLTNLYAKTDNVEKAR 288

Query: 261 LALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVML 320
           LALK++E  +    +++R+AY  LISL+A+  ++  V   W  +KS +   N+  YL M+
Sbjct: 289 LALKEMEKLVS---KKNRVAYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMI 348

Query: 321 QALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDAIKRSKG 380
            A+ KL +FE  K  Y EWES   + D R+ ++ +  Y+ +D        +E  +++   
Sbjct: 349 SAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGIN 408

Query: 381 PFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDIEG 440
           P +   E+    +LK K ++  L     AI      +W  +  +        EE+ +++G
Sbjct: 409 PSYSTWEILTWAYLKRKDMEKVLDCFGKAIDSV--KKWTVNVRLVKGACKELEEQGNVKG 468

Query: 441 AEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSELEELL 494
           AE    +L++   ++   Y+ LL+TYA AG+ A  + +R+ +DN+E+  E +EL+
Sbjct: 469 AEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIVEERMAKDNVELDEETKELI 516

BLAST of CmoCh04G004020 vs. TrEMBL
Match: A0A0A0KUH1_CUCSA (Pentatricopeptide repeat-containing protein OS=Cucumis sativus GN=Csa_4G026260 PE=4 SV=1)

HSP 1 Score: 770.4 bits (1988), Expect = 1.3e-219
Identity = 389/493 (78.90%), Postives = 427/493 (86.61%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSTAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEG 60
           MNRRSL+SRA AG R LCTS  E  R P N+Q+ LYPRLS LGATGGSVA+T+NQ+IMEG
Sbjct: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60

Query: 61  KIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAE 120
            IVKKYELE+CIKELRKYRRYHH LQIMEWME RKINYSFTDYALRLDLISKV G+ AAE
Sbjct: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120

Query: 121 NYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMRM 180
            YF DL  SAKNR TYGALLNCYCKE+MEEKAL L KK+DELK +++LSFNNLMTMYMRM
Sbjct: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180

Query: 181 DQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTT 240
           D PEKVPPLI EMK+RG +L+T+TYNVWMNSCASLN +GKVEEILEEMK EDRNKFDWTT
Sbjct: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240

Query: 241 FSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWN 300
           +SNLA+ YVKAGQ EKAELALKK+E E+KS+K  DRL YH LISLYASTSN SEV RIWN
Sbjct: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDK-NDRLVYHCLISLYASTSNLSEVNRIWN 300

Query: 301 ALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQD 360
           ALKSVY    N+SYLVMLQAL KLKD EGLK TYKEWES+C +FDLR+ +  IGAYL+QD
Sbjct: 301 ALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQD 360

Query: 361 MYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSP 420
           MYEDAA++FEDA KRSKGPF RAREMFM+YFLK KQVD A SHLESA+SES + EWHPS 
Sbjct: 361 MYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSL 420

Query: 421 AMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIE 480
           A   AFL YFEEEKD+EGAEDFARILKR KCLDAS YHLLLKTY AAGK APDMR+RL E
Sbjct: 421 ATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKE 480

Query: 481 DNIEVSSELEELL 494
           D+IE+SSELEELL
Sbjct: 481 DDIEISSELEELL 492

BLAST of CmoCh04G004020 vs. TrEMBL
Match: M5X9A6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005037mg PE=4 SV=1)

HSP 1 Score: 572.8 bits (1475), Expect = 4.1e-160
Identity = 295/461 (63.99%), Postives = 356/461 (77.22%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTST---AESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYI 60
           MN    +S  +  +R LCT+     ES R    +  RLY RLS LGATGGSVA+TLNQYI
Sbjct: 1   MNSSRSISAGTWLVRKLCTAVEAATESARSQPGNPNRLYRRLSALGATGGSVAKTLNQYI 60

Query: 61  MEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIA 120
           MEGK++KKYELERCIKELRKYR++ HAL+IMEWME RK+NYS  D+A+RLDL SKVKGI 
Sbjct: 61  MEGKMLKKYELERCIKELRKYRKFQHALEIMEWMEFRKMNYSKADFAIRLDLTSKVKGIE 120

Query: 121 AAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTM 180
           AAE+YF  LS S K+RFTYGALLNCYCKELMEEKALAL + +DEL+FAS+ L FNNLM+M
Sbjct: 121 AAEDYFSGLSPSLKDRFTYGALLNCYCKELMEEKALALYETMDELEFASSSLVFNNLMSM 180

Query: 181 YMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKF 240
           +MR  QPEKV PL+ EMK+R I L T+TYN+WM S ASLN     E +L+EM+ +D N+ 
Sbjct: 181 HMRKQQPEKVAPLVQEMKQRNIPLDTFTYNIWMQSFASLNDFEGAERVLDEMQKQDGNQC 240

Query: 241 DWTTFSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVY 300
            W+T+SNLAAIYVKA   +KAELALKK E  +K  KQ++   YHFLISLYA TSN  EV 
Sbjct: 241 SWSTYSNLAAIYVKAKIFDKAELALKKSEEMMKPLKQRN--TYHFLISLYACTSNLGEVK 300

Query: 301 RIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAY 360
           R+W +LK  +P TNNMSYL+MLQAL KL D EGLK  ++EWE  CSS+D+RLA+  I  Y
Sbjct: 301 RVWESLKKAFPATNNMSYLIMLQALCKLNDIEGLKECFEEWECKCSSYDMRLANTAIRGY 360

Query: 361 LRQDMYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEW 420
           L QDMYE+AALVF DA KR+KGPFF+AREMFM+YFLK  QVDLA+S+L +A+SE+ D EW
Sbjct: 361 LSQDMYEEAALVFADACKRTKGPFFKAREMFMLYFLKNCQVDLAVSYLGAAVSETADGEW 420

Query: 421 HPSPAMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAY 458
           HPSP   +AF  YFEEEKD+E AE+F +ILKR  CL ++ Y
Sbjct: 421 HPSPDTTSAFFKYFEEEKDVESAENFCKILKRLNCLCSNEY 459

BLAST of CmoCh04G004020 vs. TrEMBL
Match: B9RNC6_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1346580 PE=4 SV=1)

HSP 1 Score: 550.1 bits (1416), Expect = 2.9e-153
Identity = 281/481 (58.42%), Postives = 362/481 (75.26%), Query Frame = 1

Query: 20  STAESKRGPV----NDQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKEL 79
           STAE+   P        ++LY +LS LGATGGSV++TLN++IMEGK + K EL RCI+EL
Sbjct: 21  STAEAAVPPAVVSPRQSEKLYHKLSALGATGGSVSRTLNEHIMEGKTITKIELSRCIREL 80

Query: 80  RKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFT 139
           RKYRR+ HA +IMEWME RK+N+S+ D A+RLDLI K +GIAAAE+YF  LS SAKN  T
Sbjct: 81  RKYRRFDHAFEIMEWMEKRKMNFSYADRAIRLDLIGKARGIAAAEDYFNGLSPSAKNHHT 140

Query: 140 -YGALLNCYCKELMEEKALALSKKIDELKFA-SNLSFNNLMTMYMRMDQPEKVPPLIDEM 199
            YGALLNCYCKELM +KALAL +++DE KF  S+L FNNLM+MYMR+ QPEKVPPL+DEM
Sbjct: 141 SYGALLNCYCKELMSDKALALFQEMDEKKFLYSSLPFNNLMSMYMRLGQPEKVPPLVDEM 200

Query: 200 KRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNED-RNKFDWTTFSNLAAIYVKAG 259
           K+R +   ++TYN+WM S   LN    V+ +L E+ N+  ++   WTT+SNLA IY+KAG
Sbjct: 201 KKRKVSPCSFTYNIWMQSYGCLNDFQGVDRVLREIVNDGGKDNLQWTTYSNLATIYLKAG 260

Query: 260 QLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNM 319
             EKAE ALKK+E  +     ++R AYHFLIS+YA T N +EV R+W  LKS + M NN+
Sbjct: 261 IFEKAESALKKLEAIMGF---RNREAYHFLISIYAGTGNSNEVNRVWGLLKSSFNMINNL 320

Query: 320 SYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDA 379
           SYLVMLQAL+KLKD EG+   ++EWES C+++D+R+A+V I  +L+ DMYE+A L+F+DA
Sbjct: 321 SYLVMLQALAKLKDVEGVAKCFREWESGCTNYDMRIANVAIRVFLQHDMYEEAELIFDDA 380

Query: 380 IKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEE 439
           +KR++GPFF+ARE FM++FLK  Q+DLAL H+ +A SES   EW P     NA+  YF  
Sbjct: 381 LKRTRGPFFKARERFMLFFLKIHQLDLALKHMRAAFSESEKHEWKPLQETVNAYFDYFRT 440

Query: 440 EKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSELEEL 494
           EKD++GAE  ++ILK   CL++S Y LLLKTY AAGK AP+MRQRL EDNIE+S ELE L
Sbjct: 441 EKDVDGAEKLSKILKHINCLNSSVYSLLLKTYIAAGKLAPEMRQRLEEDNIEISDELEYL 498

BLAST of CmoCh04G004020 vs. TrEMBL
Match: A0A067K8Z8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14045 PE=4 SV=1)

HSP 1 Score: 545.0 bits (1403), Expect = 9.2e-152
Identity = 277/497 (55.73%), Postives = 365/497 (73.44%), Query Frame = 1

Query: 6   LLSRASAGLRHLCTST------AESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIME 65
           LLS+AS   R LCT+         S  G ++   RLYPRLS LGA GGSV+ TLN+Y+ME
Sbjct: 8   LLSKASWLARKLCTAAEAVAEAVPSAVGSLDKPVRLYPRLSALGAKGGSVSMTLNEYVME 67

Query: 66  GKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAA 125
           G  ++K EL RCIKELRKY+R+ HAL+IMEWME RK+N+S  +YA++LDLI+K KG++AA
Sbjct: 68  GNTIRKAELTRCIKELRKYQRFDHALEIMEWMEKRKMNFSRAEYAIKLDLIAKTKGVSAA 127

Query: 126 ENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFAS-NLSFNNLMTMYM 185
           E+YF  LS +AK R TYGALLNCY K LM +KAL L +K+D +   S +L FNNLM++YM
Sbjct: 128 ESYFSSLSPNAKTRSTYGALLNCYTKGLMPDKALDLFEKLDAMNLLSTSLPFNNLMSLYM 187

Query: 186 RMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDW 245
           R+ QPEKVP L+ +MKRR I   +++YN+WM S   LN    VE +L E++ +  +   W
Sbjct: 188 RLGQPEKVPALVHDMKRRNIHPCSFSYNIWMQSYGCLNDFEGVERVLAEIEKDGEDNCKW 247

Query: 246 TTFSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRI 305
            T+SN+A IY+KAG  EKAE ALKK+E ++     ++R AYHFLIS+Y+ T N +EV R+
Sbjct: 248 NTYSNVATIYLKAGLFEKAESALKKLELKMGI---RNREAYHFLISIYSGTQNLNEVNRV 307

Query: 306 WNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLR 365
           WN+LK  +    N SYLVMLQAL+KLKD +G+   +KEWESSCSS+D+RLA+  I AYL 
Sbjct: 308 WNSLKKSFTTVTNTSYLVMLQALAKLKDVDGIAKLFKEWESSCSSYDMRLANTAIKAYLE 367

Query: 366 QDMYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHP 425
           QDMYE+A L+F+ A+KR+KGPFF+ REMFM++FLK  ++DLAL H++ A SE+ + +W P
Sbjct: 368 QDMYEEAELIFDGALKRAKGPFFKVREMFMVFFLKINELDLALEHMKVAFSETEEYQWKP 427

Query: 426 SPAMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRL 485
                +AF  YF EEKDI+GAE F +ILK   CLD++AY LLL+TY AA + APDMR+RL
Sbjct: 428 KAETVSAFFSYFCEEKDIDGAEKFCKILKHINCLDSNAYSLLLQTYIAADRLAPDMRKRL 487

Query: 486 IEDNIEVSSELEELLSR 496
            EDNI++S ELE+LL R
Sbjct: 488 EEDNIQISHELEDLLER 501

BLAST of CmoCh04G004020 vs. TrEMBL
Match: A0A061DR66_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma cacao GN=TCM_004828 PE=4 SV=1)

HSP 1 Score: 540.8 bits (1392), Expect = 1.7e-150
Identity = 283/502 (56.37%), Postives = 364/502 (72.51%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSTAESKR--------GPVNDQQRLYPRLSKLGATGGSVAQT 60
           MN R L+S  S  +R LCT+T+E  +         P+ +  RLYPRLS L ATGG+V++ 
Sbjct: 1   MNSRRLISSGSWLVRKLCTATSEKAKIKAAVAAASPMRN--RLYPRLSALAATGGTVSEA 60

Query: 61  LNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISK 120
           LN +IMEGK ++K EL RC+KELRKYRRY HAL IM+WME R ++ S  D+A+RLDLI+K
Sbjct: 61  LNDFIMEGKKIRKDELGRCVKELRKYRRYQHALDIMDWMERRNLHLSHVDHAIRLDLIAK 120

Query: 121 VKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFN 180
            KGI AAENY   L  SAKN+ TYGALLNCYC  LM++KA +L +K+DEL+F +N L FN
Sbjct: 121 TKGIDAAENYLSALPPSAKNQLTYGALLNCYCNNLMKDKASSLFQKMDELRFTNNTLPFN 180

Query: 181 NLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNE 240
           NLM +YMR+ QPEKVP L+DE+K R I    +TY VWM S A+LN +  VE +LEE+  +
Sbjct: 181 NLMCLYMRLGQPEKVPELVDELKLRNIPRCRFTYVVWMQSYANLNDIEGVERVLEELAQD 240

Query: 241 DRNKFDWTTFSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSN 300
             +K  WTT++NLAAIYVKAG  EKAE  LKK+E ++   +++   AYHFLISLYA TSN
Sbjct: 241 SEDKCTWTTYNNLAAIYVKAGLFEKAEACLKKLEKDMMPRQRE---AYHFLISLYAGTSN 300

Query: 301 RSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADV 360
            +EV+R+W ALK  +    N SYLVM+QAL+KLKD EGLK  + EWESSCS++D+RLA  
Sbjct: 301 LAEVHRVWEALKRAFSTVTNTSYLVMVQALAKLKDLEGLKKCFAEWESSCSAYDIRLATS 360

Query: 361 TIGAYLRQDMYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISES 420
           TI  YL  D+ E+A LV  +A+KRSKGPF + RE+FM+YFL+  Q DLAL H+E+ +SE 
Sbjct: 361 TIRGYLSGDLLEEAELVLGNAMKRSKGPFHKVRELFMVYFLEKCQFDLALQHVEAVVSEM 420

Query: 421 MDDEWHPSPAMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRA 480
            D  W P+P    AF  YF +E+D++ AE+F RILK    LD++AYHLLLKTY AAGK A
Sbjct: 421 GD--WRPAPETITAFFDYFMKERDVDAAEEFCRILKSKNGLDSNAYHLLLKTYVAAGKVA 480

Query: 481 PDMRQRLIEDNIEVSSELEELL 494
           PDMR+RL  D I++S EL++LL
Sbjct: 481 PDMRRRLEVDGIQLSQELQDLL 495

BLAST of CmoCh04G004020 vs. TAIR10
Match: AT1G02370.1 (AT1G02370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 485.3 bits (1248), Expect = 4.4e-137
Identity = 247/466 (53.00%), Postives = 337/466 (72.32%), Query Frame = 1

Query: 32  QQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEWM 91
           Q+ LY +LS L  TGG+VA+TLNQ+IMEG  V+K +L RC K LRK+RR  HA +I +WM
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 92  EMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN-RFTYGALLNCYCKELMEE 151
           E RK+ +S +D+A+ LDLI K KG+ AAENYF +L  SAKN + TYGAL+NCYC EL EE
Sbjct: 130 EKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEE 189

Query: 152 KALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWM 211
           KA A  + +DEL F +N L FNN+M+MYMR+ QPEKVP L+D MK+RGI     TY++WM
Sbjct: 190 KAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWM 249

Query: 212 NSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKAGQLEKAELALKKVENEIK 271
            SC SLN +  +E+I++EM  +   K  W TFSNLAAIY KAG  EKA+ ALK +E ++ 
Sbjct: 250 QSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKMN 309

Query: 272 SNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDFEG 331
            N   +R ++HFL+SLYA  S   EVYR+W +LK   P  NN+SYLVMLQA+SKL D +G
Sbjct: 310 PN---NRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDG 369

Query: 332 LKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDAIKRSKGPFFRAREMFMI 391
           +K  + EWES C ++D+RLA++ I  YL+ +MYE+A  + + A+K+SKGPF +AR++ MI
Sbjct: 370 IKKIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMI 429

Query: 392 YFLKFKQVDLALSHLESAISESMD--DEWHPSPAMANAFLMYFEEEKDIEGAEDFARILK 451
           + L+  + DLA+ HLE+A+S+S +  DEW  S  + + F ++FE+ KD++GAEDF +IL 
Sbjct: 430 HLLENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCKILS 489

Query: 452 RFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSELEELL 494
            +K LD+     L+KTYAAA K +PDMR+RL +  IEVS E+++LL
Sbjct: 490 NWKPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIEVSEEIQDLL 532

BLAST of CmoCh04G004020 vs. TAIR10
Match: AT4G01990.1 (AT4G01990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 437.6 bits (1124), Expect = 1.0e-122
Identity = 234/488 (47.95%), Postives = 323/488 (66.19%), Query Frame = 1

Query: 17  LCTSTAE-------SKRGPVNDQQRLYPRLSKLGATGGS-VAQTLNQYIMEGKIVKKYEL 76
           L T+TAE       S        + +Y +LS LG  GG  + +TLNQ++MEG  VKK++L
Sbjct: 16  LATATAEISGEAAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDL 75

Query: 77  ERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSS 136
            R  K+LRK+R+   AL+I EWME ++I ++ +D+A+RL+LI+K KG+ AAE YF  L  
Sbjct: 76  IRYAKDLRKFRQPQRALEIFEWMERKEIAFTGSDHAIRLNLIAKSKGLEAAETYFNSLDD 135

Query: 137 SAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVP 196
           S KN+ TYG+LLNCYC E  E KA A  + + +L   SN L FNNLM MYM + QPEKVP
Sbjct: 136 SIKNQSTYGSLLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVP 195

Query: 197 PLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAI 256
            L+  MK + I     TY++W+ SC SL  +  VE++L+EMK E    F W TF+NLAAI
Sbjct: 196 ALVVAMKEKSITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAI 255

Query: 257 YVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYP 316
           Y+K G   KAE ALK +EN +  + +     YHFLI+LY   +N SEVYR+W+ LK  YP
Sbjct: 256 YIKVGLYGKAEEALKSLENNMNPDVRD---CYHFLINLYTGIANASEVYRVWDLLKKRYP 315

Query: 317 MTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAAL 376
             NN SYL ML+ALSKL D +G+K  + EWES+C ++D+R+A+V I +YL+Q+MYE+A  
Sbjct: 316 NVNNSSYLTMLRALSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEA 375

Query: 377 VFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFL 436
           VF  A+K+ KG F +AR++ M++ LK  Q DLAL H E+A+ +  D  W  S  + ++F 
Sbjct: 376 VFNGAMKKCKGQFSKARQLLMMHLLKNDQADLALKHFEAAVLD-QDKNWTWSSELISSFF 435

Query: 437 MYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSS 496
           ++FEE KD++GAE+F + L ++  L +  Y LL+KTY AAGK  PDM++RL E  I V  
Sbjct: 436 LHFEEAKDVDGAEEFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDE 495

BLAST of CmoCh04G004020 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 366.7 bits (940), Expect = 2.3e-101
Identity = 205/485 (42.27%), Postives = 303/485 (62.47%), Query Frame = 1

Query: 14  LRHLCTSTAESKRGPVND-QQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCI 73
           +RHL  S   +KR      ++ LY RL K G T   V Q LNQ++   K V K+E+   I
Sbjct: 3   MRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTI 62

Query: 74  KELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN 133
           K+LR    Y+ AL++ E ME R +N + +D A+ LDL++K + I A ENYF DL  ++K 
Sbjct: 63  KKLRNRGLYYPALKLSEVMEERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKT 122

Query: 134 RFTYGALLNCYCKELMEEKALALSKKIDELKFA-SNLSFNNLMTMYMRMDQPEKVPPLID 193
             TYG+LLNCYCKEL+ EKA  L  K+ EL    S++S+N+LMT+Y +  + EKVP +I 
Sbjct: 123 ELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQ 182

Query: 194 EMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKA 253
           E+K   +   +YTYNVWM + A+ N +  VE ++EEM  + R   DWTT+SN+A+IYV A
Sbjct: 183 ELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDA 242

Query: 254 GQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNN 313
           G  +KAE AL+++E +   N Q+D  AY FLI+LY      +EVYRIW +L+   P T+N
Sbjct: 243 GLSQKAEKALQELEMK---NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 302

Query: 314 MSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFED 373
           ++YL M+Q L KL D  G ++ +KEW+++CS++D+R+ +V IGAY ++ + + A  + E 
Sbjct: 303 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 362

Query: 374 AIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDD--EWHPSPAMANAFLMY 433
           A +R      +  E+FM Y++K   +  AL  +  A+S    D  +W PSP    A + Y
Sbjct: 363 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 422

Query: 434 FEEEKDIEGAEDFARILKR-FKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSE 493
           FE++KD+ GAE+   ILK     + A  +  L++TYAAAGK  P MR+RL  +N+EV+  
Sbjct: 423 FEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEA 482

BLAST of CmoCh04G004020 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 285.4 bits (729), Expect = 6.6e-77
Identity = 157/444 (35.36%), Postives = 256/444 (57.66%), Query Frame = 1

Query: 21  TAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRR 80
           T + +R P+     +Y ++S +       A  LNQ+   G+ + K+EL R +KELRKY+R
Sbjct: 55  TVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKR 114

Query: 81  YHHALQIMEWMEMR--KINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGA 140
            + AL++ +WM  R  +   S +D A++LDLI KV+GI  AE +F  L  + K+R  YG+
Sbjct: 115 ANQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGS 174

Query: 141 LLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRG 200
           LLN Y +    EKA AL   + +  +A + L FN +MT+YM + + +KV  ++ EMK++ 
Sbjct: 175 LLNAYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKD 234

Query: 201 IFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKAGQLEKA 260
           I L  Y+YN+W++SC SL  V K+E + ++MK++     +WTTFS +A +Y+K G+ EKA
Sbjct: 235 IRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKA 294

Query: 261 ELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVM 320
           E AL+KVE  I     ++R+ YH+L+SLY S  N+ E+YR+W+  KSV P   N+ Y  +
Sbjct: 295 EDALRKVEARITG---RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHAL 354

Query: 321 LQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDAIKRSK 380
           + +L ++ D EG +  Y+EW    SS+D R+ ++ + AY++ D  E A  +F+  ++   
Sbjct: 355 VSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGG 414

Query: 381 GPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDIE 440
            P     E+  +   + + +  AL+ L +A S      W P   M + F    EEE D+ 
Sbjct: 415 KPSSSTWEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVT 474

Query: 441 GAEDFARILKRFKCLDASAYHLLL 462
             E    +L++   L+  +Y  L+
Sbjct: 475 SKEAVLELLRQSGDLEDKSYLALI 495

BLAST of CmoCh04G004020 vs. TAIR10
Match: AT4G02820.1 (AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 255.4 bits (651), Expect = 7.4e-68
Identity = 160/475 (33.68%), Postives = 261/475 (54.95%), Query Frame = 1

Query: 21  TAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRR 80
           +A  K   V  +  L  RL  L  T  S   T+ ++  EG  V+KYEL R ++ELRK +R
Sbjct: 49  SANKKETVVGGRDTLGGRLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKR 108

Query: 81  YHHALQIMEWMEMRK-INYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGAL 140
           Y HAL+I EWM +++ I     DYA+ LDLISK++G+ +AE +F D+    +      +L
Sbjct: 109 YKHALEICEWMVVQEDIKLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSL 168

Query: 141 LNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGI 200
           L+ Y +  + +KA AL +K+ E  F  + L +N++++MY+   Q EKVP LI E+K R  
Sbjct: 169 LHSYVQNKLSDKAEALFEKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELKIR-T 228

Query: 201 FLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKAGQLEKAE 260
                TYN+W+ + AS N V   E++  + K E  N  DW T+S L  +Y K   +EKA 
Sbjct: 229 SPDIVTYNLWLTAFASGNDVEGAEKVYLKAKEEKLNP-DWVTYSVLTNLYAKTDNVEKAR 288

Query: 261 LALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVML 320
           LALK++E  +    +++R+AY  LISL+A+  ++  V   W  +KS +   N+  YL M+
Sbjct: 289 LALKEMEKLVS---KKNRVAYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMI 348

Query: 321 QALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDAIKRSKG 380
            A+ KL +FE  K  Y EWES   + D R+ ++ +  Y+ +D        +E  +++   
Sbjct: 349 SAVVKLGEFEQAKGLYDEWESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGIN 408

Query: 381 PFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDIEG 440
           P +   E+    +LK K ++  L     AI      +W  +  +        EE+ +++G
Sbjct: 409 PSYSTWEILTWAYLKRKDMEKVLDCFGKAIDSV--KKWTVNVRLVKGACKELEEQGNVKG 468

Query: 441 AEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSELEELL 494
           AE    +L++   ++   Y+ LL+TYA AG+ A  + +R+ +DN+E+  E +EL+
Sbjct: 469 AEKLMTLLQKAGYVNTQLYNSLLRTYAKAGEMALIVEERMAKDNVELDEETKELI 516

BLAST of CmoCh04G004020 vs. NCBI nr
Match: gi|659107719|ref|XP_008453822.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X1 [Cucumis melo])

HSP 1 Score: 776.9 bits (2005), Expect = 2.1e-221
Identity = 397/493 (80.53%), Postives = 428/493 (86.82%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSTAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEG 60
           MNRRSL+SRA AGLR LCTS AE  R P N+ + LYPRLS LGATGGSVAQT+N++IMEG
Sbjct: 1   MNRRSLISRAPAGLRQLCTSVAELTRSPANNHRGLYPRLSVLGATGGSVAQTINRFIMEG 60

Query: 61  KIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAE 120
            IVKKYELE+CIKELRKYRRY H+LQIMEWME+RKINYSFTDYALRLDLISKV GI AAE
Sbjct: 61  NIVKKYELEKCIKELRKYRRYDHSLQIMEWMEIRKINYSFTDYALRLDLISKVNGITAAE 120

Query: 121 NYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMRM 180
            YF DL  SAKNR TYGALLNCYCKE+MEEKA  L KK+DELKF ++L+FNNLMTMYMRM
Sbjct: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKASTLFKKMDELKFVTSLAFNNLMTMYMRM 180

Query: 181 DQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTT 240
           DQPEKVPPLI EMK+RG +L+T+TYNVWMNSCASLN +GKVEEILEEMK ED NK DWTT
Sbjct: 181 DQPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDSNKLDWTT 240

Query: 241 FSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWN 300
           FSNLA+ YVKAGQLEKAELALKKVE EIKS+K +DRLAYH LISLYASTSN SEV RIWN
Sbjct: 241 FSNLASFYVKAGQLEKAELALKKVEEEIKSDK-KDRLAYHCLISLYASTSNLSEVNRIWN 300

Query: 301 ALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQD 360
            LKSVYP   N SYLVMLQALSKLKD EGLK TYKEWES C  FDLRL +V IGAYL+QD
Sbjct: 301 LLKSVYPTMTNTSYLVMLQALSKLKDIEGLKKTYKEWESICHIFDLRLVNVIIGAYLQQD 360

Query: 361 MYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSP 420
           MYEDAA++FEDAIKRSKGPF RARE FM+YFLK KQVD A SHLESAISES + EWHPS 
Sbjct: 361 MYEDAAMIFEDAIKRSKGPFSRAREKFMVYFLKLKQVDSAFSHLESAISESKEKEWHPSL 420

Query: 421 AMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIE 480
           A  NAFL YFEEEKD+EGAEDFARILKR KCLD S YHLLLKTY AAGK APDMRQRL E
Sbjct: 421 ATTNAFLNYFEEEKDVEGAEDFARILKRLKCLDESGYHLLLKTYVAAGKSAPDMRQRLKE 480

Query: 481 DNIEVSSELEELL 494
           D+I +SSELEELL
Sbjct: 481 DDIGISSELEELL 492

BLAST of CmoCh04G004020 vs. NCBI nr
Match: gi|778690383|ref|XP_004146883.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Cucumis sativus])

HSP 1 Score: 770.4 bits (1988), Expect = 1.9e-219
Identity = 389/493 (78.90%), Postives = 427/493 (86.61%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSTAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEG 60
           MNRRSL+SRA AG R LCTS  E  R P N+Q+ LYPRLS LGATGGSVA+T+NQ+IMEG
Sbjct: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60

Query: 61  KIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAE 120
            IVKKYELE+CIKELRKYRRYHH LQIMEWME RKINYSFTDYALRLDLISKV G+ AAE
Sbjct: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120

Query: 121 NYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMRM 180
            YF DL  SAKNR TYGALLNCYCKE+MEEKAL L KK+DELK +++LSFNNLMTMYMRM
Sbjct: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180

Query: 181 DQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTT 240
           D PEKVPPLI EMK+RG +L+T+TYNVWMNSCASLN +GKVEEILEEMK EDRNKFDWTT
Sbjct: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240

Query: 241 FSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWN 300
           +SNLA+ YVKAGQ EKAELALKK+E E+KS+K  DRL YH LISLYASTSN SEV RIWN
Sbjct: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDK-NDRLVYHCLISLYASTSNLSEVNRIWN 300

Query: 301 ALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQD 360
           ALKSVY    N+SYLVMLQAL KLKD EGLK TYKEWES+C +FDLR+ +  IGAYL+QD
Sbjct: 301 ALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQD 360

Query: 361 MYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSP 420
           MYEDAA++FEDA KRSKGPF RAREMFM+YFLK KQVD A SHLESA+SES + EWHPS 
Sbjct: 361 MYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSL 420

Query: 421 AMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIE 480
           A   AFL YFEEEKD+EGAEDFARILKR KCLDAS YHLLLKTY AAGK APDMR+RL E
Sbjct: 421 ATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKE 480

Query: 481 DNIEVSSELEELL 494
           D+IE+SSELEELL
Sbjct: 481 DDIEISSELEELL 492

BLAST of CmoCh04G004020 vs. NCBI nr
Match: gi|659107721|ref|XP_008453823.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X2 [Cucumis melo])

HSP 1 Score: 647.9 bits (1670), Expect = 1.4e-182
Identity = 330/406 (81.28%), Postives = 353/406 (86.95%), Query Frame = 1

Query: 88  MEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKEL 147
           MEWME+RKINYSFTDYALRLDLISKV GI AAE YF DL  SAKNR TYGALLNCYCKE+
Sbjct: 1   MEWMEIRKINYSFTDYALRLDLISKVNGITAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 60

Query: 148 MEEKALALSKKIDELKFASNLSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNV 207
           MEEKA  L KK+DELKF ++L+FNNLMTMYMRMDQPEKVPPLI EMK+RG +L+T+TYNV
Sbjct: 61  MEEKASTLFKKMDELKFVTSLAFNNLMTMYMRMDQPEKVPPLIGEMKQRGFYLTTFTYNV 120

Query: 208 WMNSCASLNGVGKVEEILEEMKNEDRNKFDWTTFSNLAAIYVKAGQLEKAELALKKVENE 267
           WMNSCASLN +GKVEEILEEMK ED NK DWTTFSNLA+ YVKAGQLEKAELALKKVE E
Sbjct: 121 WMNSCASLNDIGKVEEILEEMKMEDSNKLDWTTFSNLASFYVKAGQLEKAELALKKVEEE 180

Query: 268 IKSNKQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDF 327
           IKS+K+ DRLAYH LISLYASTSN SEV RIWN LKSVYP   N SYLVMLQALSKLKD 
Sbjct: 181 IKSDKK-DRLAYHCLISLYASTSNLSEVNRIWNLLKSVYPTMTNTSYLVMLQALSKLKDI 240

Query: 328 EGLKSTYKEWESSCSSFDLRLADVTIGAYLRQDMYEDAALVFEDAIKRSKGPFFRAREMF 387
           EGLK TYKEWES C  FDLRL +V IGAYL+QDMYEDAA++FEDAIKRSKGPF RARE F
Sbjct: 241 EGLKKTYKEWESICHIFDLRLVNVIIGAYLQQDMYEDAAMIFEDAIKRSKGPFSRAREKF 300

Query: 388 MIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDIEGAEDFARILK 447
           M+YFLK KQVD A SHLESAISES + EWHPS A  NAFL YFEEEKD+EGAEDFARILK
Sbjct: 301 MVYFLKLKQVDSAFSHLESAISESKEKEWHPSLATTNAFLNYFEEEKDVEGAEDFARILK 360

Query: 448 RFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIEDNIEVSSELEELL 494
           R KCLD S YHLLLKTY AAGK APDMRQRL ED+I +SSELEELL
Sbjct: 361 RLKCLDESGYHLLLKTYVAAGKSAPDMRQRLKEDDIGISSELEELL 405

BLAST of CmoCh04G004020 vs. NCBI nr
Match: gi|1009148013|ref|XP_015891718.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Ziziphus jujuba])

HSP 1 Score: 641.0 bits (1652), Expect = 1.8e-180
Identity = 325/495 (65.66%), Postives = 399/495 (80.61%), Query Frame = 1

Query: 1   MNRRSLLSRASAGL-RHLCTSTAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIME 60
           MN R L+S  +A L R L T+      G   +  RLY RLS LGATGGSV++TLN+YIME
Sbjct: 1   MNSRRLISAGAAWLVRQLSTAAETVAAGSTANGTRLYRRLSALGATGGSVSKTLNEYIME 60

Query: 61  GKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAA 120
           G+IVKK+ELERCIKELRKYRR+ HAL+IMEWMEMRKINYSFTD+ALRLDLI K KG+ AA
Sbjct: 61  GRIVKKFELERCIKELRKYRRFQHALEIMEWMEMRKINYSFTDHALRLDLICKTKGVDAA 120

Query: 121 ENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYM 180
           ENYF +L S+AKNR T+GALLNCYCKE ME+KALAL +K+D+L F SN L+FNNLM++YM
Sbjct: 121 ENYFDNLPSNAKNRLTFGALLNCYCKENMEDKALALFQKMDDLNFVSNSLAFNNLMSLYM 180

Query: 181 RMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDW 240
           RM +PEKVPPL+ EMK+R IF   +TY++WM S +SL  +  VE +LEEM   D +K +W
Sbjct: 181 RMGKPEKVPPLVQEMKQRNIFPCNFTYSIWMQSYSSLGDIEGVERVLEEMNKGDHDKCNW 240

Query: 241 TTFSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRI 300
            T++NLAAIYVKAG  EKA+LALKK+E E +   +Q   AYHF+ISLYA T N +EV R 
Sbjct: 241 KTYTNLAAIYVKAGHFEKADLALKKLEEETRPRGRQ---AYHFVISLYAGTGNLNEVNRA 300

Query: 301 WNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLR 360
           W  LKS+YP TNN+SYLV+LQALSKL D EGLK  +KEWESS S +D+RLA+V +G YLR
Sbjct: 301 WETLKSIYPETNNLSYLVLLQALSKLNDVEGLKKYFKEWESSFSFYDIRLANVAVGTYLR 360

Query: 361 QDMYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHP 420
            DMY++A+ VFEDA KR+KGPFF+AREMFM YFLKF+QVD ALS +E+AISE+ DD+W P
Sbjct: 361 NDMYKEASAVFEDATKRTKGPFFKAREMFMNYFLKFRQVDSALSFMEAAISEARDDDWRP 420

Query: 421 SPAMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRL 480
           SPA+A+AFL YFEEEKD++ AE F +IL+RF CL+++AYHLLLKTY AAGK AP+MR+RL
Sbjct: 421 SPAVASAFLKYFEEEKDVDSAEQFCKILRRFNCLNSNAYHLLLKTYLAAGKLAPEMRRRL 480

Query: 481 IEDNIEVSSELEELL 494
            E++IE+S ELE LL
Sbjct: 481 EEEDIEISVELESLL 492

BLAST of CmoCh04G004020 vs. NCBI nr
Match: gi|645243222|ref|XP_008227878.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Prunus mume])

HSP 1 Score: 618.2 bits (1593), Expect = 1.2e-173
Identity = 319/499 (63.93%), Postives = 390/499 (78.16%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTST---AESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYI 60
           MN    +S  +  +R LCT+     ES R    +  RLY RLS LGATGGSVA+TLNQYI
Sbjct: 1   MNSSRSISAGTWLVRKLCTAVEAATESARSQPGNPTRLYRRLSALGATGGSVAKTLNQYI 60

Query: 61  MEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIA 120
           MEGK++KKYELERCIKELRKYR++ HAL+IMEWME RK+NYS  D+A+RLDL SKVKGI 
Sbjct: 61  MEGKMLKKYELERCIKELRKYRKFQHALEIMEWMEFRKMNYSKADFAIRLDLTSKVKGIE 120

Query: 121 AAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTM 180
           AAE+YF  LS S K+RFTYGALLNCYCKELMEEKAL+L + +DEL+FAS+ L FNNLM+M
Sbjct: 121 AAEDYFSGLSPSLKDRFTYGALLNCYCKELMEEKALSLYETMDELEFASSSLVFNNLMSM 180

Query: 181 YMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKF 240
           +MR  QPEKV PL+ EMK+R I L T+TYN+WM S ASLN    VE +L+EM+ +D ++ 
Sbjct: 181 HMRKQQPEKVAPLVQEMKQRKIPLDTFTYNIWMQSFASLNNFEGVERVLDEMQKQDGDQC 240

Query: 241 DWTTFSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVY 300
            W+T+SNLAAIYVKA   +KAELALKK E  +K  KQ++   YHFLISLYA TSN  EV 
Sbjct: 241 SWSTYSNLAAIYVKAKIFDKAELALKKSEEMMKPLKQRN--TYHFLISLYACTSNLGEVK 300

Query: 301 RIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAY 360
           R+W +LK  +P TNN+SYL+MLQAL KL D EGLK  ++EWE  CSS+D+RLA+  I  Y
Sbjct: 301 RVWESLKKAFPATNNISYLIMLQALCKLNDIEGLKECFEEWECKCSSYDMRLANTAIRGY 360

Query: 361 LRQDMYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEW 420
           L QDMYE+AALVF DA KR+KGPFF+AREMFM+YFLK  QVDLA+S+L +A+SE++D+EW
Sbjct: 361 LSQDMYEEAALVFSDACKRTKGPFFKAREMFMLYFLKNCQVDLAVSYLGAAVSETVDEEW 420

Query: 421 HPSPAMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQ 480
           HPSP   +AF  YFEEEKD+E AE+F +ILKR  CL ++ Y+LLLKTY AAGK  P+MRQ
Sbjct: 421 HPSPDTTSAFFKYFEEEKDVESAENFCKILKRLNCLCSNEYYLLLKTYIAAGKLDPEMRQ 480

Query: 481 RLIEDNIEVSSELEELLSR 496
           RL E++IE+S ELE LL R
Sbjct: 481 RLKEEDIEISPELESLLER 497

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR4_ARATH7.8e-13653.00Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidop... [more]
PP300_ARATH1.9e-12147.95Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidop... [more]
PPR86_ARATH4.0e-10042.27Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN... [more]
PPR3_ARATH1.2e-7535.36Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN... [more]
PP302_ARATH1.3e-6633.68Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KUH1_CUCSA1.3e-21978.90Pentatricopeptide repeat-containing protein OS=Cucumis sativus GN=Csa_4G026260 P... [more]
M5X9A6_PRUPE4.1e-16063.99Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005037mg PE=4 SV=1[more]
B9RNC6_RICCO2.9e-15358.42Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A067K8Z8_JATCU9.2e-15255.73Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14045 PE=4 SV=1[more]
A0A061DR66_THECC1.7e-15056.37Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma c... [more]
Match NameE-valueIdentityDescription
AT1G02370.14.4e-13753.00 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G01990.11.0e-12247.95 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G60770.12.3e-10142.27 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G02150.16.6e-7735.36 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G02820.17.4e-6833.68 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659107719|ref|XP_008453822.1|2.1e-22180.53PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-... [more]
gi|778690383|ref|XP_004146883.2|1.9e-21978.90PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial ... [more]
gi|659107721|ref|XP_008453823.1|1.4e-18281.28PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-... [more]
gi|1009148013|ref|XP_015891718.1|1.8e-18065.66PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial ... [more]
gi|645243222|ref|XP_008227878.1|1.2e-17363.93PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G004020.1CmoCh04G004020.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 134..155
score: 0.031coord: 240..263
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 169..213
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 132..162
score: 7.366coord: 345..379
score: 6.511coord: 453..489
score: 5.24coord: 237..267
score: 6.544coord: 275..305
score: 5.59coord: 166..200
score: 8.506coord: 310..340
score: 6.084coord: 201..235
score: 8
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 128..335
score: 4.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 213..315
score: 6.8
NoneNo IPR availableunknownCoilCoilcoord: 254..274
score: -coord: 475..495
scor
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 45..492
score: 7.9E
NoneNo IPR availablePANTHERPTHR24015:SF504SUBFAMILY NOT NAMEDcoord: 45..492
score: 7.9E