CmaCh04G003810 (gene) Cucurbita maxima (Rimu)

NameCmaCh04G003810
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPentatricopeptide repeat-containing protein
LocationCma_Chr04 : 1908662 .. 1911797 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATCGTCGGAGTTTACTCTCGAGGGCGTCGGCAGGTTTGCGGCATCTCTGTACTTCAGCCGCCGAGTTGACGCGCGGTCCTGTGAATGATCAGCAGCAGCGGCTCTACCCGAGGCTGTCGAAGTTGGGTGCCACCGGCGGTAGCGTGGCGCAGACATTGAACCAGTACATTATGGAGGGAAAGATCGTCAAAAAATATGAGCTCGAGAGATGCATAAAGGAGCTCCGGAAGTACCGTAGATACCACCACGCCCTTCAGGTTTTTTCTTTAAACACATAGGTTCGTTCACCGTAGGAGAAATATAAATGCACAAAATTATTGCTGCTTTTCATTGGTTTTTCAAATTCTATTGCCAATTGTTGATTGTATTCTTCCTTTCGCTGCTAGATCATAAATTTTTTGTAAACGAAACATTCATGAAATATAAAGGGAATGACAAAAACCTTATGAAGAATTTACCACCAAAAATCATTGGATTTGGAAGCTATAAAAAAACAGTCTGGAGGGAGAACCAGCCAAAAAGTTGCTGTAATTGCAGGAGGCCATAAGATCTTAGCTCATCCATTGCATTTGCTTCGGTTGTTGCAGATAATGGAATGGATGGAGATGAGGAAAATCAACTACTCATTCACTGACTACGCGCTGCGTTTAGATCTTATATCGAAAGTTAAAGGAATTGCTGCTGCGGAGAATTATTTCTGTGATCTGTCGTCATCTGCGAAGAATCGATTTACTTATGGAGCTCTTTTGAATTGCTATTGCAAGGAATTGATGGAGGAAAAGGCTTTGGCTCTTTCTAAGAAGATAGATGAGTTGAAGTTTGCTTCGAATTTATCCTTTAACAATCTTATGACCATGTATATGAGAATGGATCAACCCGAGAAAGTACCTCCTCTTATAGATGAAATGAAGCGGAGAGGGATTTTTCTTAGCACGTACACATACAACGTGTGGATGAACAGTTGTGCTTCACTGAATGGTGTTGGAAAAGTGGAAGAAATTCTCGAGGAGATGAAAGACGAAGACAGAAACAAATTTGATTGGACGACGTTTTCGAACTTGGCTGCTATCTATGTTAAGGTAGGACAGCTCGAGAAAGCTGAATTAGCTCTTAAAAAGGTAGAGAACGAGATCAAATCGAGTAGGCAGCAGGATCGTCTAGCGTACCATTTCTTGATAAGCCTTTATGCATCGACGTCGAATCGGAGTGAGGTGTATAGGATATGGAATGCACTGAAATCAGTTTATCCAATGACAAATAACATGAGTTATCTCGTCATGCTTCAGGCTCTAAGCAAACTAAAGGATTTCGAGGGTCTTAAAAGAACTTATAAGGAATGGGAATCTAGTTGCTCGAGCTTTGATTTGCGGTTAGCGGATGTTACCATCGGGGCTTACCTACGACAGGACATGTACAAAGATGCTGTGTTGGTCTTCGAGGATGCCAATAAGAGAAGTAAAGGACCTTTCTTTAGGGCTCGAGAAATGTTCATGATTTACTTCTTGAAGTTCAAGCAAGTCGATTTGGCACTCAGTCATTTGGAATCAGCTATATCTGAAAGCATGGACGATGAATGGCATCCATCACCGGCGATGGCAAATGCTTTTCTGATGTACTTTGAAGAAGAGAAAGACGTTGAAGGCGCCGAAGATTTTGCCAGGATTTTGAAGAGATTTAAGTGTCTTGACGCTAGTGCATACCATCTATTGCTCAAGACCTATGCAGCTGCAGGAAAACGAGCCCCCGACATGCGACAAAGATTGAAAGAAGACAACATTGAGGTAAGTAGTGAGCTTGAGGAGTTGTTAAGTAGACAACATTGAGACAAGTAGGAGGAAAATCCATGGGATGTTTTTTTTTTAATCTGATCTTTATAAGCTGATCCACTTCATTCCCTTTCTCCTAAACGAAATTTGTATTTTCGAGCATTGGTCTATTTTCATTTTCTTGATAAGCTGATGGTGCACAACTTGTAGAAGTTCCCTTGGTTTGTGGTAGACTTGGACCGGGTTTTTGTCACCACGTTATCTGCATCGAAGAAGCCCGGGCTGGCAAGATAAGTTCACGTGGCCATGCCCTAAACCAATTGATAAGCTTTTTTATTTGATTGAAGGAAAAATTACTACTTATTAGCTTAGCTAGCCTTCTGGGAGAATCGAGTCATTTCGTCAGTTCATATGGATATATCAATTTATTTACACTTGGGATATACATTTATTTCGAGCATTCAAAAGCATGCACGTAACACGTCAAGAATTTCTAGGAGAATTGTTCTTTAACAATACCACATTGTTAGTAGCTTTTGAGGGAGGCCTATTGATCTTCTCTAATAATCTACTGAATACTAACTTTACTGGTATGTATGCTACAGTTGGGGGCCTACATGATTCGAGTTTGATCAATCCATTCATTCGACAAGTTACACTGTCATGCATTGGCACGTTACTATAATTCGGCTCGACAGCGATCGAAGACAAAGCTCTAGTCTTCTGGTTAGTTGATAAGCTTACTTTCTTTGACATCTTTTTTGGCAGTCTTCTATCACGAGAAGGTCGGCGACTGTGAGGGCCGCTTGCCTCTTCACCGCCCAGTACTGTTGTCGGTAAACGAGTGCGAGGGATGCGAACATTTGCTGTCCTTGCAGTCCTCCAAGCTGGTTGATTGCCTAGACAATCTGAAAGAAGAGCTTCAGAAAACCCGTCTTGTAGAGCTTCAATTTGAATAGGATGCCCAATTATAGCTTGCCCATTTAACTTGCTCACAAGTGATACAATAGGAACAGGCTCTTTCTGGTAATTTGCTTGGACTTTCAGATCAACCTCTGTTAATATAGTTCTTAACCTCCCACCAAAATGATGACGTGTACCGTATAAGCGATGTTTCACATCCCGATATCCTTTCAAATCAGGTTGCTGATCCTCCCAGTCCGACATGTAATTTCCTACAAGATTTTGATCCTTTGATACCCTTTTTGACGTTATGTAATACTCCTCTTCCATTGCCTCAGCTTGAGCTTGCTCTGTCGTGTCATCAAAATACTCATTTTTAAGTCTAACAGACTTTTTCGAAAGATTTCGAACGTTCCTTTTCCCCTTCAATTGCCACTTAGACATTAGGTCGTGATGATAAGCC

mRNA sequence

ATGAATCGTCGGAGTTTACTCTCGAGGGCGTCGGCAGGTTTGCGGCATCTCTGTACTTCAGCCGCCGAGTTGACGCGCGGTCCTGTGAATGATCAGCAGCAGCGGCTCTACCCGAGGCTGTCGAAGTTGGGTGCCACCGGCGGTAGCGTGGCGCAGACATTGAACCAGTACATTATGGAGGGAAAGATCGTCAAAAAATATGAGCTCGAGAGATGCATAAAGGAGCTCCGGAAGTACCGTAGATACCACCACGCCCTTCAGATAATGGAATGGATGGAGATGAGGAAAATCAACTACTCATTCACTGACTACGCGCTGCGTTTAGATCTTATATCGAAAGTTAAAGGAATTGCTGCTGCGGAGAATTATTTCTGTGATCTGTCGTCATCTGCGAAGAATCGATTTACTTATGGAGCTCTTTTGAATTGCTATTGCAAGGAATTGATGGAGGAAAAGGCTTTGGCTCTTTCTAAGAAGATAGATGAGTTGAAGTTTGCTTCGAATTTATCCTTTAACAATCTTATGACCATGTATATGAGAATGGATCAACCCGAGAAAGTACCTCCTCTTATAGATGAAATGAAGCGGAGAGGGATTTTTCTTAGCACGTACACATACAACGTGTGGATGAACAGTTGTGCTTCACTGAATGGTGTTGGAAAAGTGGAAGAAATTCTCGAGGAGATGAAAGACGAAGACAGAAACAAATTTGATTGGACGACGTTTTCGAACTTGGCTGCTATCTATGTTAAGGTAGGACAGCTCGAGAAAGCTGAATTAGCTCTTAAAAAGGTAGAGAACGAGATCAAATCGAGTAGGCAGCAGGATCGTCTAGCGTACCATTTCTTGATAAGCCTTTATGCATCGACGTCGAATCGGAGTGAGGTGTATAGGATATGGAATGCACTGAAATCAGTTTATCCAATGACAAATAACATGAGTTATCTCGTCATGCTTCAGGCTCTAAGCAAACTAAAGGATTTCGAGGGTCTTAAAAGAACTTATAAGGAATGGGAATCTAGTTGCTCGAGCTTTGATTTGCGGTTAGCGGATGTTACCATCGGGGCTTACCTACGACAGGACATGTACAAAGATGCTGTGTTGGTCTTCGAGGATGCCAATAAGAGAAGTAAAGGACCTTTCTTTAGGGCTCGAGAAATGTTCATGATTTACTTCTTGAAGTTCAAGCAAGTCGATTTGGCACTCAGTCATTTGGAATCAGCTATATCTGAAAGCATGGACGATGAATGGCATCCATCACCGGCGATGGCAAATGCTTTTCTGATGTACTTTGAAGAAGAGAAAGACGTTGAAGGCGCCGAAGATTTTGCCAGGATTTTGAAGAGATTTAAGTGTCTTGACGCTAGTGCATACCATCTATTGCTCAAGACCTATGCAGCTGCAGGAAAACGAGCCCCCGACATGCGACAAAGATTGAAAGAAGACAACATTGAGAAGTTCCCTTGGTTTGTGGTAGACTTGGACCGGGTTTTTGTCACCACGTTATCTGCATCGAAGAAGCCCGGGCTGGCAAGATAAGTTCACGTGGCCATGCCCTAAACCAATTGATAAGCTTTTTTATTTGATTGAAGGAAAAATTACTACTTATTAGCTTAGCTAGCCTTCTGGGAGAATCGAGTCATTTCGTCAGTTCATATGGATATATCAATTTATTTACACTTGGGATATACATTTATTTCGAGCATTCAAAAGCATGCACGTAACACGTCAAGAATTTCTAGGAGAATTGTTCTTTAACAATACCACATTGTTAGTAGCTTTTGAGGGAGGCCTATTGATCTTCTCTAATAATCTACTGAATACTAACTTTACTGGTATGTATGCTACAGTTGGGGGCCTACATGATTCGAGTTTGATCAATCCATTCATTCGACAAGTTACACTGTCATGCATTGGCACGTTACTATAATTCGGCTCGACAGCGATCGAAGACAAAGCTCTAGTCTTCTGGTTAGTTGATAAGCTTACTTTCTTTGACATCTTTTTTGGCAGTCTTCTATCACGAGAAGGTCGGCGACTGTGAGGGCCGCTTGCCTCTTCACCGCCCAGTACTGTTGTCGGTAAACGAGTGCGAGGGATGCGAACATTTGCTGTCCTTGCAGTCCTCCAAGCTGGTTGATTGCCTAGACAATCTGAAAGAAGAGCTTCAGAAAACCCGTCTTGTAGAGCTTCAATTTGAATAGGATGCCCAATTATAGCTTGCCCATTTAACTTGCTCACAAGTGATACAATAGGAACAGGCTCTTTCTGGTTGCTGATCCTCCCAGTCCGACATGTAATTTCCTACAAGATTTTGATCCTTTGATACCCTTTTTGACGTTATGTAATACTCCTCTTCCATTGCCTCAGCTTGAGCTTGCTCTGTCGTGTCATCAAAATACTCATTTTTAAGTCTAACAGACTTTTTCGAAAGATTTCGAACGTTCCTTTTCCCCTTCAATTGCCACTTAGACATTAGGTCGTGATGATAAGCC

Coding sequence (CDS)

ATGAATCGTCGGAGTTTACTCTCGAGGGCGTCGGCAGGTTTGCGGCATCTCTGTACTTCAGCCGCCGAGTTGACGCGCGGTCCTGTGAATGATCAGCAGCAGCGGCTCTACCCGAGGCTGTCGAAGTTGGGTGCCACCGGCGGTAGCGTGGCGCAGACATTGAACCAGTACATTATGGAGGGAAAGATCGTCAAAAAATATGAGCTCGAGAGATGCATAAAGGAGCTCCGGAAGTACCGTAGATACCACCACGCCCTTCAGATAATGGAATGGATGGAGATGAGGAAAATCAACTACTCATTCACTGACTACGCGCTGCGTTTAGATCTTATATCGAAAGTTAAAGGAATTGCTGCTGCGGAGAATTATTTCTGTGATCTGTCGTCATCTGCGAAGAATCGATTTACTTATGGAGCTCTTTTGAATTGCTATTGCAAGGAATTGATGGAGGAAAAGGCTTTGGCTCTTTCTAAGAAGATAGATGAGTTGAAGTTTGCTTCGAATTTATCCTTTAACAATCTTATGACCATGTATATGAGAATGGATCAACCCGAGAAAGTACCTCCTCTTATAGATGAAATGAAGCGGAGAGGGATTTTTCTTAGCACGTACACATACAACGTGTGGATGAACAGTTGTGCTTCACTGAATGGTGTTGGAAAAGTGGAAGAAATTCTCGAGGAGATGAAAGACGAAGACAGAAACAAATTTGATTGGACGACGTTTTCGAACTTGGCTGCTATCTATGTTAAGGTAGGACAGCTCGAGAAAGCTGAATTAGCTCTTAAAAAGGTAGAGAACGAGATCAAATCGAGTAGGCAGCAGGATCGTCTAGCGTACCATTTCTTGATAAGCCTTTATGCATCGACGTCGAATCGGAGTGAGGTGTATAGGATATGGAATGCACTGAAATCAGTTTATCCAATGACAAATAACATGAGTTATCTCGTCATGCTTCAGGCTCTAAGCAAACTAAAGGATTTCGAGGGTCTTAAAAGAACTTATAAGGAATGGGAATCTAGTTGCTCGAGCTTTGATTTGCGGTTAGCGGATGTTACCATCGGGGCTTACCTACGACAGGACATGTACAAAGATGCTGTGTTGGTCTTCGAGGATGCCAATAAGAGAAGTAAAGGACCTTTCTTTAGGGCTCGAGAAATGTTCATGATTTACTTCTTGAAGTTCAAGCAAGTCGATTTGGCACTCAGTCATTTGGAATCAGCTATATCTGAAAGCATGGACGATGAATGGCATCCATCACCGGCGATGGCAAATGCTTTTCTGATGTACTTTGAAGAAGAGAAAGACGTTGAAGGCGCCGAAGATTTTGCCAGGATTTTGAAGAGATTTAAGTGTCTTGACGCTAGTGCATACCATCTATTGCTCAAGACCTATGCAGCTGCAGGAAAACGAGCCCCCGACATGCGACAAAGATTGAAAGAAGACAACATTGAGAAGTTCCCTTGGTTTGTGGTAGACTTGGACCGGGTTTTTGTCACCACGTTATCTGCATCGAAGAAGCCCGGGCTGGCAAGATAA

Protein sequence

MNRRSLLSRASAGLRHLCTSAAELTRGPVNDQQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNIEKFPWFVVDLDRVFVTTLSASKKPGLAR
BLAST of CmaCh04G003810 vs. Swiss-Prot
Match: PPR4_ARATH (Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidopsis thaliana GN=At1g02370 PE=2 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 1.4e-132
Identity = 241/461 (52.28%), Postives = 330/461 (71.58%), Query Frame = 1

Query: 29  VNDQQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQI 88
           V  +Q+ LY +LS L  TGG+VA+TLNQ+IMEG  V+K +L RC K LRK+RR  HA +I
Sbjct: 66  VASRQRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEI 125

Query: 89  MEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN-RFTYGALLNCYCKE 148
            +WME RK+ +S +D+A+ LDLI K KG+ AAENYF +L  SAKN + TYGAL+NCYC E
Sbjct: 126 FDWMEKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVE 185

Query: 149 LMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTY 208
           L EEKA A  + +DEL F +N L FNN+M+MYMR+ QPEKVP L+D MK+RGI     TY
Sbjct: 186 LEEEKAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTY 245

Query: 209 NVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVE 268
           ++WM SC SLN +  +E+I++EM  +   K  W TFSNLAAIY K G  EKA+ ALK +E
Sbjct: 246 SIWMQSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSME 305

Query: 269 NEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLK 328
            ++  +   +R ++HFL+SLYA  S   EVYR+W +LK   P  NN+SYLVMLQA+SKL 
Sbjct: 306 EKMNPN---NRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLG 365

Query: 329 DFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRARE 388
           D +G+K+ + EWES C ++D+RLA++ I  YL+ +MY++A  + + A K+SKGPF +AR+
Sbjct: 366 DLDGIKKIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQ 425

Query: 389 MFMIYFLKFKQVDLALSHLESAISESMD--DEWHPSPAMANAFLMYFEEEKDVEGAEDFA 448
           + MI+ L+  + DLA+ HLE+A+S+S +  DEW  S  + + F ++FE+ KDV+GAEDF 
Sbjct: 426 LLMIHLLENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFC 485

Query: 449 RILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNIE 486
           +IL  +K LD+     L+KTYAAA K +PDMR+RL +  IE
Sbjct: 486 KILSNWKPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIE 523

BLAST of CmaCh04G003810 vs. Swiss-Prot
Match: PP300_ARATH (Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidopsis thaliana GN=At4g01990 PE=2 SV=1)

HSP 1 Score: 434.1 bits (1115), Expect = 2.1e-120
Identity = 229/476 (48.11%), Postives = 320/476 (67.23%), Query Frame = 1

Query: 17  LCTSAAELTRG-----PVNDQQQR-LYPRLSKLGATGGS-VAQTLNQYIMEGKIVKKYEL 76
           L T+ AE++       P   ++ R +Y +LS LG  GG  + +TLNQ++MEG  VKK++L
Sbjct: 16  LATATAEISGEAAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDL 75

Query: 77  ERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSS 136
            R  K+LRK+R+   AL+I EWME ++I ++ +D+A+RL+LI+K KG+ AAE YF  L  
Sbjct: 76  IRYAKDLRKFRQPQRALEIFEWMERKEIAFTGSDHAIRLNLIAKSKGLEAAETYFNSLDD 135

Query: 137 SAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVP 196
           S KN+ TYG+LLNCYC E  E KA A  + + +L   SN L FNNLM MYM + QPEKVP
Sbjct: 136 SIKNQSTYGSLLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVP 195

Query: 197 PLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAI 256
            L+  MK + I     TY++W+ SC SL  +  VE++L+EMK E    F W TF+NLAAI
Sbjct: 196 ALVVAMKEKSITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAI 255

Query: 257 YVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYP 316
           Y+KVG   KAE ALK +EN +    +     YHFLI+LY   +N SEVYR+W+ LK  YP
Sbjct: 256 YIKVGLYGKAEEALKSLENNMNPDVRD---CYHFLINLYTGIANASEVYRVWDLLKKRYP 315

Query: 317 MTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVL 376
             NN SYL ML+ALSKL D +G+K+ + EWES+C ++D+R+A+V I +YL+Q+MY++A  
Sbjct: 316 NVNNSSYLTMLRALSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEA 375

Query: 377 VFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFL 436
           VF  A K+ KG F +AR++ M++ LK  Q DLAL H E+A+ +  D  W  S  + ++F 
Sbjct: 376 VFNGAMKKCKGQFSKARQLLMMHLLKNDQADLALKHFEAAVLD-QDKNWTWSSELISSFF 435

Query: 437 MYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNI 485
           ++FEE KDV+GAE+F + L ++  L +  Y LL+KTY AAGK  PDM++RL+E  I
Sbjct: 436 LHFEEAKDVDGAEEFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGI 487

BLAST of CmaCh04G003810 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 363.6 bits (932), Expect = 3.5e-99
Identity = 201/476 (42.23%), Postives = 294/476 (61.76%), Query Frame = 1

Query: 14  LRHLCTSAAELTRGPVNDQQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCI 73
           +RHL  S     R      ++ LY RL K G T   V Q LNQ++   K V K+E+   I
Sbjct: 3   MRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTI 62

Query: 74  KELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN 133
           K+LR    Y+ AL++ E ME R +N + +D A+ LDL++K + I A ENYF DL  ++K 
Sbjct: 63  KKLRNRGLYYPALKLSEVMEERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKT 122

Query: 134 RFTYGALLNCYCKELMEEKALALSKKIDELKFA-SNLSFNNLMTMYMRMDQPEKVPPLID 193
             TYG+LLNCYCKEL+ EKA  L  K+ EL    S++S+N+LMT+Y +  + EKVP +I 
Sbjct: 123 ELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQ 182

Query: 194 EMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKV 253
           E+K   +   +YTYNVWM + A+ N +  VE ++EEM  + R   DWTT+SN+A+IYV  
Sbjct: 183 ELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDA 242

Query: 254 GQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNN 313
           G  +KAE AL+++E +   + Q+D  AY FLI+LY      +EVYRIW +L+   P T+N
Sbjct: 243 GLSQKAEKALQELEMK---NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 302

Query: 314 MSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFED 373
           ++YL M+Q L KL D  G +  +KEW+++CS++D+R+ +V IGAY ++ + + A  + E 
Sbjct: 303 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 362

Query: 374 ANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDD--EWHPSPAMANAFLMY 433
           A +R      +  E+FM Y++K   +  AL  +  A+S    D  +W PSP    A + Y
Sbjct: 363 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 422

Query: 434 FEEEKDVEGAEDFARILKR-FKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNIE 486
           FE++KDV GAE+   ILK     + A  +  L++TYAAAGK  P MR+RLK +N+E
Sbjct: 423 FEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVE 475

BLAST of CmaCh04G003810 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 279.6 bits (714), Expect = 6.7e-74
Identity = 155/434 (35.71%), Postives = 251/434 (57.83%), Query Frame = 1

Query: 32  QQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEW 91
           Q   +Y ++S +       A  LNQ+   G+ + K+EL R +KELRKY+R + AL++ +W
Sbjct: 65  QWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDW 124

Query: 92  MEMR--KINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKELM 151
           M  R  +   S +D A++LDLI KV+GI  AE +F  L  + K+R  YG+LLN Y +   
Sbjct: 125 MNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKS 184

Query: 152 EEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNV 211
            EKA AL   + +  +A + L FN +MT+YM + + +KV  ++ EMK++ I L  Y+YN+
Sbjct: 185 REKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNI 244

Query: 212 WMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVENE 271
           W++SC SL  V K+E + ++MK +     +WTTFS +A +Y+K+G+ EKAE AL+KVE  
Sbjct: 245 WLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEAR 304

Query: 272 IKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDF 331
           I     ++R+ YH+L+SLY S  N+ E+YR+W+  KSV P   N+ Y  ++ +L ++ D 
Sbjct: 305 ITG---RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDI 364

Query: 332 EGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRAREMF 391
           EG ++ Y+EW    SS+D R+ ++ + AY++ D  + A  +F+   +    P     E+ 
Sbjct: 365 EGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEIL 424

Query: 392 MIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDVEGAEDFARILK 451
            +   + + +  AL+ L +A S      W P   M + F    EEE DV   E    +L+
Sbjct: 425 AVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLR 484

Query: 452 RFKCLDASAYHLLL 463
           +   L+  +Y  L+
Sbjct: 485 QSGDLEDKSYLALI 495

BLAST of CmaCh04G003810 vs. Swiss-Prot
Match: PP302_ARATH (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 246.9 bits (629), Expect = 4.8e-64
Identity = 154/449 (34.30%), Postives = 249/449 (55.46%), Query Frame = 1

Query: 39  RLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRK-I 98
           RL  L  T  S   T+ ++  EG  V+KYEL R ++ELRK +RY HAL+I EWM +++ I
Sbjct: 66  RLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDI 125

Query: 99  NYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALS 158
                DYA+ LDLISK++G+ +AE +F D+    +      +LL+ Y +  + +KA AL 
Sbjct: 126 KLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALF 185

Query: 159 KKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASL 218
           +K+ E  F  + L +N++++MY+   Q EKVP LI E+K R       TYN+W+ + AS 
Sbjct: 186 EKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELKIR-TSPDIVTYNLWLTAFASG 245

Query: 219 NGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQD 278
           N V   E++  + K+E  N  DW T+S L  +Y K   +EKA LALK++E  +    +++
Sbjct: 246 NDVEGAEKVYLKAKEEKLNP-DWVTYSVLTNLYAKTDNVEKARLALKEMEKLVS---KKN 305

Query: 279 RLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYK 338
           R+AY  LISL+A+  ++  V   W  +KS +   N+  YL M+ A+ KL +FE  K  Y 
Sbjct: 306 RVAYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYD 365

Query: 339 EWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFK 398
           EWES   + D R+ ++ +  Y+ +D        +E   ++   P +   E+    +LK K
Sbjct: 366 EWESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRK 425

Query: 399 QVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDAS 458
            ++  L     AI      +W  +  +        EE+ +V+GAE    +L++   ++  
Sbjct: 426 DMEKVLDCFGKAIDSV--KKWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQ 485

Query: 459 AYHLLLKTYAAAGKRAPDMRQRLKEDNIE 486
            Y+ LL+TYA AG+ A  + +R+ +DN+E
Sbjct: 486 LYNSLLRTYAKAGEMALIVEERMAKDNVE 507

BLAST of CmaCh04G003810 vs. TrEMBL
Match: A0A0A0KUH1_CUCSA (Pentatricopeptide repeat-containing protein OS=Cucumis sativus GN=Csa_4G026260 PE=4 SV=1)

HSP 1 Score: 756.5 bits (1952), Expect = 2.1e-215
Identity = 381/485 (78.56%), Postives = 418/485 (86.19%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSAAELTRGPVNDQQQRLYPRLSKLGATGGSVAQTLNQYIME 60
           MNRRSL+SRA AG R LCTS  EL R P N+Q+  LYPRLS LGATGGSVA+T+NQ+IME
Sbjct: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRG-LYPRLSALGATGGSVAKTINQFIME 60

Query: 61  GKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAA 120
           G IVKKYELE+CIKELRKYRRYHH LQIMEWME RKINYSFTDYALRLDLISKV G+ AA
Sbjct: 61  GNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAA 120

Query: 121 ENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMR 180
           E YF DL  SAKNR TYGALLNCYCKE+MEEKAL L KK+DELK +++LSFNNLMTMYMR
Sbjct: 121 EKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMR 180

Query: 181 MDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWT 240
           MD PEKVPPLI EMK+RG +L+T+TYNVWMNSCASLN +GKVEEILEEMK EDRNKFDWT
Sbjct: 181 MDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWT 240

Query: 241 TFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIW 300
           T+SNLA+ YVK GQ EKAELALKK+E E+KS +  DRL YH LISLYASTSN SEV RIW
Sbjct: 241 TYSNLASFYVKAGQFEKAELALKKLEEEMKSDK-NDRLVYHCLISLYASTSNLSEVNRIW 300

Query: 301 NALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQ 360
           NALKSVY    N+SYLVMLQAL KLKD EGLKRTYKEWES+C +FDLR+ +  IGAYL+Q
Sbjct: 301 NALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQ 360

Query: 361 DMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPS 420
           DMY+DA ++FEDA KRSKGPF RAREMFM+YFLK KQVD A SHLESA+SES + EWHPS
Sbjct: 361 DMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPS 420

Query: 421 PAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLK 480
            A   AFL YFEEEKDVEGAEDFARILKR KCLDAS YHLLLKTY AAGK APDMR+RLK
Sbjct: 421 LATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLK 480

Query: 481 EDNIE 486
           ED+IE
Sbjct: 481 EDDIE 483

BLAST of CmaCh04G003810 vs. TrEMBL
Match: M5X9A6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005037mg PE=4 SV=1)

HSP 1 Score: 564.3 bits (1453), Expect = 1.5e-157
Identity = 290/461 (62.91%), Postives = 352/461 (76.36%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSAAELTRGPVND--QQQRLYPRLSKLGATGGSVAQTLNQYI 60
           MN    +S  +  +R LCT+    T    +      RLY RLS LGATGGSVA+TLNQYI
Sbjct: 1   MNSSRSISAGTWLVRKLCTAVEAATESARSQPGNPNRLYRRLSALGATGGSVAKTLNQYI 60

Query: 61  MEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIA 120
           MEGK++KKYELERCIKELRKYR++ HAL+IMEWME RK+NYS  D+A+RLDL SKVKGI 
Sbjct: 61  MEGKMLKKYELERCIKELRKYRKFQHALEIMEWMEFRKMNYSKADFAIRLDLTSKVKGIE 120

Query: 121 AAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTM 180
           AAE+YF  LS S K+RFTYGALLNCYCKELMEEKALAL + +DEL+FAS+ L FNNLM+M
Sbjct: 121 AAEDYFSGLSPSLKDRFTYGALLNCYCKELMEEKALALYETMDELEFASSSLVFNNLMSM 180

Query: 181 YMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKF 240
           +MR  QPEKV PL+ EMK+R I L T+TYN+WM S ASLN     E +L+EM+ +D N+ 
Sbjct: 181 HMRKQQPEKVAPLVQEMKQRNIPLDTFTYNIWMQSFASLNDFEGAERVLDEMQKQDGNQC 240

Query: 241 DWTTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVY 300
            W+T+SNLAAIYVK    +KAELALKK E  +K  +Q++   YHFLISLYA TSN  EV 
Sbjct: 241 SWSTYSNLAAIYVKAKIFDKAELALKKSEEMMKPLKQRN--TYHFLISLYACTSNLGEVK 300

Query: 301 RIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAY 360
           R+W +LK  +P TNNMSYL+MLQAL KL D EGLK  ++EWE  CSS+D+RLA+  I  Y
Sbjct: 301 RVWESLKKAFPATNNMSYLIMLQALCKLNDIEGLKECFEEWECKCSSYDMRLANTAIRGY 360

Query: 361 LRQDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEW 420
           L QDMY++A LVF DA KR+KGPFF+AREMFM+YFLK  QVDLA+S+L +A+SE+ D EW
Sbjct: 361 LSQDMYEEAALVFADACKRTKGPFFKAREMFMLYFLKNCQVDLAVSYLGAAVSETADGEW 420

Query: 421 HPSPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAY 459
           HPSP   +AF  YFEEEKDVE AE+F +ILKR  CL ++ Y
Sbjct: 421 HPSPDTTSAFFKYFEEEKDVESAENFCKILKRLNCLCSNEY 459

BLAST of CmaCh04G003810 vs. TrEMBL
Match: B9RNC6_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1346580 PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 3.0e-150
Identity = 277/486 (57.00%), Postives = 362/486 (74.49%), Query Frame = 1

Query: 4   RSLLSRASAGLRHLCTSAAELTRGPVNDQQ-QRLYPRLSKLGATGGSVAQTLNQYIMEGK 63
           R +L+ +    +   T+ A +    V+ +Q ++LY +LS LGATGGSV++TLN++IMEGK
Sbjct: 7   RLILTASCPSRQRFSTAEAAVPPAVVSPRQSEKLYHKLSALGATGGSVSRTLNEHIMEGK 66

Query: 64  IVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAEN 123
            + K EL RCI+ELRKYRR+ HA +IMEWME RK+N+S+ D A+RLDLI K +GIAAAE+
Sbjct: 67  TITKIELSRCIRELRKYRRFDHAFEIMEWMEKRKMNFSYADRAIRLDLIGKARGIAAAED 126

Query: 124 YFCDLSSSAKNRFT-YGALLNCYCKELMEEKALALSKKIDELKFA-SNLSFNNLMTMYMR 183
           YF  LS SAKN  T YGALLNCYCKELM +KALAL +++DE KF  S+L FNNLM+MYMR
Sbjct: 127 YFNGLSPSAKNHHTSYGALLNCYCKELMSDKALALFQEMDEKKFLYSSLPFNNLMSMYMR 186

Query: 184 MDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEE-MKDEDRNKFDW 243
           + QPEKVPPL+DEMK+R +   ++TYN+WM S   LN    V+ +L E + D  ++   W
Sbjct: 187 LGQPEKVPPLVDEMKKRKVSPCSFTYNIWMQSYGCLNDFQGVDRVLREIVNDGGKDNLQW 246

Query: 244 TTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRI 303
           TT+SNLA IY+K G  EKAE ALKK+E  I   R  +R AYHFLIS+YA T N +EV R+
Sbjct: 247 TTYSNLATIYLKAGIFEKAESALKKLE-AIMGFR--NREAYHFLISIYAGTGNSNEVNRV 306

Query: 304 WNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLR 363
           W  LKS + M NN+SYLVMLQAL+KLKD EG+ + ++EWES C+++D+R+A+V I  +L+
Sbjct: 307 WGLLKSSFNMINNLSYLVMLQALAKLKDVEGVAKCFREWESGCTNYDMRIANVAIRVFLQ 366

Query: 364 QDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHP 423
            DMY++A L+F+DA KR++GPFF+ARE FM++FLK  Q+DLAL H+ +A SES   EW P
Sbjct: 367 HDMYEEAELIFDDALKRTRGPFFKARERFMLFFLKIHQLDLALKHMRAAFSESEKHEWKP 426

Query: 424 SPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRL 483
                NA+  YF  EKDV+GAE  ++ILK   CL++S Y LLLKTY AAGK AP+MRQRL
Sbjct: 427 LQETVNAYFDYFRTEKDVDGAEKLSKILKHINCLNSSVYSLLLKTYIAAGKLAPEMRQRL 486

Query: 484 KEDNIE 486
           +EDNIE
Sbjct: 487 EEDNIE 489

BLAST of CmaCh04G003810 vs. TrEMBL
Match: A0A067K8Z8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14045 PE=4 SV=1)

HSP 1 Score: 533.1 bits (1372), Expect = 3.7e-148
Identity = 270/486 (55.56%), Postives = 358/486 (73.66%), Query Frame = 1

Query: 6   LLSRASAGLRHLCTSA---AELTRGPVN--DQQQRLYPRLSKLGATGGSVAQTLNQYIME 65
           LLS+AS   R LCT+A   AE     V   D+  RLYPRLS LGA GGSV+ TLN+Y+ME
Sbjct: 8   LLSKASWLARKLCTAAEAVAEAVPSAVGSLDKPVRLYPRLSALGAKGGSVSMTLNEYVME 67

Query: 66  GKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAA 125
           G  ++K EL RCIKELRKY+R+ HAL+IMEWME RK+N+S  +YA++LDLI+K KG++AA
Sbjct: 68  GNTIRKAELTRCIKELRKYQRFDHALEIMEWMEKRKMNFSRAEYAIKLDLIAKTKGVSAA 127

Query: 126 ENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFAS-NLSFNNLMTMYM 185
           E+YF  LS +AK R TYGALLNCY K LM +KAL L +K+D +   S +L FNNLM++YM
Sbjct: 128 ESYFSSLSPNAKTRSTYGALLNCYTKGLMPDKALDLFEKLDAMNLLSTSLPFNNLMSLYM 187

Query: 186 RMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDW 245
           R+ QPEKVP L+ +MKRR I   +++YN+WM S   LN    VE +L E++ +  +   W
Sbjct: 188 RLGQPEKVPALVHDMKRRNIHPCSFSYNIWMQSYGCLNDFEGVERVLAEIEKDGEDNCKW 247

Query: 246 TTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRI 305
            T+SN+A IY+K G  EKAE ALKK+E ++     ++R AYHFLIS+Y+ T N +EV R+
Sbjct: 248 NTYSNVATIYLKAGLFEKAESALKKLELKMGI---RNREAYHFLISIYSGTQNLNEVNRV 307

Query: 306 WNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLR 365
           WN+LK  +    N SYLVMLQAL+KLKD +G+ + +KEWESSCSS+D+RLA+  I AYL 
Sbjct: 308 WNSLKKSFTTVTNTSYLVMLQALAKLKDVDGIAKLFKEWESSCSSYDMRLANTAIKAYLE 367

Query: 366 QDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHP 425
           QDMY++A L+F+ A KR+KGPFF+ REMFM++FLK  ++DLAL H++ A SE+ + +W P
Sbjct: 368 QDMYEEAELIFDGALKRAKGPFFKVREMFMVFFLKINELDLALEHMKVAFSETEEYQWKP 427

Query: 426 SPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRL 485
                +AF  YF EEKD++GAE F +ILK   CLD++AY LLL+TY AA + APDMR+RL
Sbjct: 428 KAETVSAFFSYFCEEKDIDGAEKFCKILKHINCLDSNAYSLLLQTYIAADRLAPDMRKRL 487

BLAST of CmaCh04G003810 vs. TrEMBL
Match: A0A061DR66_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma cacao GN=TCM_004828 PE=4 SV=1)

HSP 1 Score: 532.3 bits (1370), Expect = 6.4e-148
Identity = 277/491 (56.42%), Postives = 352/491 (71.69%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSAAELTR-----GPVNDQQQRLYPRLSKLGATGGSVAQTLN 60
           MN R L+S  S  +R LCT+ +E  +        +  + RLYPRLS L ATGG+V++ LN
Sbjct: 1   MNSRRLISSGSWLVRKLCTATSEKAKIKAAVAAASPMRNRLYPRLSALAATGGTVSEALN 60

Query: 61  QYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVK 120
            +IMEGK ++K EL RC+KELRKYRRY HAL IM+WME R ++ S  D+A+RLDLI+K K
Sbjct: 61  DFIMEGKKIRKDELGRCVKELRKYRRYQHALDIMDWMERRNLHLSHVDHAIRLDLIAKTK 120

Query: 121 GIAAAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNL 180
           GI AAENY   L  SAKN+ TYGALLNCYC  LM++KA +L +K+DEL+F +N L FNNL
Sbjct: 121 GIDAAENYLSALPPSAKNQLTYGALLNCYCNNLMKDKASSLFQKMDELRFTNNTLPFNNL 180

Query: 181 MTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDR 240
           M +YMR+ QPEKVP L+DE+K R I    +TY VWM S A+LN +  VE +LEE+  +  
Sbjct: 181 MCLYMRLGQPEKVPELVDELKLRNIPRCRFTYVVWMQSYANLNDIEGVERVLEELAQDSE 240

Query: 241 NKFDWTTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRS 300
           +K  WTT++NLAAIYVK G  EKAE  LKK+E   K    + R AYHFLISLYA TSN +
Sbjct: 241 DKCTWTTYNNLAAIYVKAGLFEKAEACLKKLE---KDMMPRQREAYHFLISLYAGTSNLA 300

Query: 301 EVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTI 360
           EV+R+W ALK  +    N SYLVM+QAL+KLKD EGLK+ + EWESSCS++D+RLA  TI
Sbjct: 301 EVHRVWEALKRAFSTVTNTSYLVMVQALAKLKDLEGLKKCFAEWESSCSAYDIRLATSTI 360

Query: 361 GAYLRQDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMD 420
             YL  D+ ++A LV  +A KRSKGPF + RE+FM+YFL+  Q DLAL H+E+ +SE  D
Sbjct: 361 RGYLSGDLLEEAELVLGNAMKRSKGPFHKVRELFMVYFLEKCQFDLALQHVEAVVSEMGD 420

Query: 421 DEWHPSPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPD 480
             W P+P    AF  YF +E+DV+ AE+F RILK    LD++AYHLLLKTY AAGK APD
Sbjct: 421 --WRPAPETITAFFDYFMKERDVDAAEEFCRILKSKNGLDSNAYHLLLKTYVAAGKVAPD 480

Query: 481 MRQRLKEDNIE 486
           MR+RL+ D I+
Sbjct: 481 MRRRLEVDGIQ 486

BLAST of CmaCh04G003810 vs. TAIR10
Match: AT1G02370.1 (AT1G02370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 474.6 bits (1220), Expect = 8.0e-134
Identity = 241/461 (52.28%), Postives = 330/461 (71.58%), Query Frame = 1

Query: 29  VNDQQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQI 88
           V  +Q+ LY +LS L  TGG+VA+TLNQ+IMEG  V+K +L RC K LRK+RR  HA +I
Sbjct: 66  VASRQRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEI 125

Query: 89  MEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN-RFTYGALLNCYCKE 148
            +WME RK+ +S +D+A+ LDLI K KG+ AAENYF +L  SAKN + TYGAL+NCYC E
Sbjct: 126 FDWMEKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVE 185

Query: 149 LMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTY 208
           L EEKA A  + +DEL F +N L FNN+M+MYMR+ QPEKVP L+D MK+RGI     TY
Sbjct: 186 LEEEKAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTY 245

Query: 209 NVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVE 268
           ++WM SC SLN +  +E+I++EM  +   K  W TFSNLAAIY K G  EKA+ ALK +E
Sbjct: 246 SIWMQSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSME 305

Query: 269 NEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLK 328
            ++  +   +R ++HFL+SLYA  S   EVYR+W +LK   P  NN+SYLVMLQA+SKL 
Sbjct: 306 EKMNPN---NRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLG 365

Query: 329 DFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRARE 388
           D +G+K+ + EWES C ++D+RLA++ I  YL+ +MY++A  + + A K+SKGPF +AR+
Sbjct: 366 DLDGIKKIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQ 425

Query: 389 MFMIYFLKFKQVDLALSHLESAISESMD--DEWHPSPAMANAFLMYFEEEKDVEGAEDFA 448
           + MI+ L+  + DLA+ HLE+A+S+S +  DEW  S  + + F ++FE+ KDV+GAEDF 
Sbjct: 426 LLMIHLLENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFC 485

Query: 449 RILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNIE 486
           +IL  +K LD+     L+KTYAAA K +PDMR+RL +  IE
Sbjct: 486 KILSNWKPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIE 523

BLAST of CmaCh04G003810 vs. TAIR10
Match: AT4G01990.1 (AT4G01990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 434.1 bits (1115), Expect = 1.2e-121
Identity = 229/476 (48.11%), Postives = 320/476 (67.23%), Query Frame = 1

Query: 17  LCTSAAELTRG-----PVNDQQQR-LYPRLSKLGATGGS-VAQTLNQYIMEGKIVKKYEL 76
           L T+ AE++       P   ++ R +Y +LS LG  GG  + +TLNQ++MEG  VKK++L
Sbjct: 16  LATATAEISGEAAASVPTKAKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDL 75

Query: 77  ERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSS 136
            R  K+LRK+R+   AL+I EWME ++I ++ +D+A+RL+LI+K KG+ AAE YF  L  
Sbjct: 76  IRYAKDLRKFRQPQRALEIFEWMERKEIAFTGSDHAIRLNLIAKSKGLEAAETYFNSLDD 135

Query: 137 SAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVP 196
           S KN+ TYG+LLNCYC E  E KA A  + + +L   SN L FNNLM MYM + QPEKVP
Sbjct: 136 SIKNQSTYGSLLNCYCVEKEEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVP 195

Query: 197 PLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAI 256
            L+  MK + I     TY++W+ SC SL  +  VE++L+EMK E    F W TF+NLAAI
Sbjct: 196 ALVVAMKEKSITPCDITYSMWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAI 255

Query: 257 YVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYP 316
           Y+KVG   KAE ALK +EN +    +     YHFLI+LY   +N SEVYR+W+ LK  YP
Sbjct: 256 YIKVGLYGKAEEALKSLENNMNPDVRD---CYHFLINLYTGIANASEVYRVWDLLKKRYP 315

Query: 317 MTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVL 376
             NN SYL ML+ALSKL D +G+K+ + EWES+C ++D+R+A+V I +YL+Q+MY++A  
Sbjct: 316 NVNNSSYLTMLRALSKLDDIDGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEA 375

Query: 377 VFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFL 436
           VF  A K+ KG F +AR++ M++ LK  Q DLAL H E+A+ +  D  W  S  + ++F 
Sbjct: 376 VFNGAMKKCKGQFSKARQLLMMHLLKNDQADLALKHFEAAVLD-QDKNWTWSSELISSFF 435

Query: 437 MYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNI 485
           ++FEE KDV+GAE+F + L ++  L +  Y LL+KTY AAGK  PDM++RL+E  I
Sbjct: 436 LHFEEAKDVDGAEEFCKTLTKWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGI 487

BLAST of CmaCh04G003810 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 363.6 bits (932), Expect = 2.0e-100
Identity = 201/476 (42.23%), Postives = 294/476 (61.76%), Query Frame = 1

Query: 14  LRHLCTSAAELTRGPVNDQQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCI 73
           +RHL  S     R      ++ LY RL K G T   V Q LNQ++   K V K+E+   I
Sbjct: 3   MRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTI 62

Query: 74  KELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKN 133
           K+LR    Y+ AL++ E ME R +N + +D A+ LDL++K + I A ENYF DL  ++K 
Sbjct: 63  KKLRNRGLYYPALKLSEVMEERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKT 122

Query: 134 RFTYGALLNCYCKELMEEKALALSKKIDELKFA-SNLSFNNLMTMYMRMDQPEKVPPLID 193
             TYG+LLNCYCKEL+ EKA  L  K+ EL    S++S+N+LMT+Y +  + EKVP +I 
Sbjct: 123 ELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAMIQ 182

Query: 194 EMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKV 253
           E+K   +   +YTYNVWM + A+ N +  VE ++EEM  + R   DWTT+SN+A+IYV  
Sbjct: 183 ELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYVDA 242

Query: 254 GQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNN 313
           G  +KAE AL+++E +   + Q+D  AY FLI+LY      +EVYRIW +L+   P T+N
Sbjct: 243 GLSQKAEKALQELEMK---NTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 302

Query: 314 MSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFED 373
           ++YL M+Q L KL D  G +  +KEW+++CS++D+R+ +V IGAY ++ + + A  + E 
Sbjct: 303 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 362

Query: 374 ANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDD--EWHPSPAMANAFLMY 433
           A +R      +  E+FM Y++K   +  AL  +  A+S    D  +W PSP    A + Y
Sbjct: 363 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 422

Query: 434 FEEEKDVEGAEDFARILKR-FKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNIE 486
           FE++KDV GAE+   ILK     + A  +  L++TYAAAGK  P MR+RLK +N+E
Sbjct: 423 FEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVE 475

BLAST of CmaCh04G003810 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 279.6 bits (714), Expect = 3.8e-75
Identity = 155/434 (35.71%), Postives = 251/434 (57.83%), Query Frame = 1

Query: 32  QQQRLYPRLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEW 91
           Q   +Y ++S +       A  LNQ+   G+ + K+EL R +KELRKY+R + AL++ +W
Sbjct: 65  QWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDW 124

Query: 92  MEMR--KINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKELM 151
           M  R  +   S +D A++LDLI KV+GI  AE +F  L  + K+R  YG+LLN Y +   
Sbjct: 125 MNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKS 184

Query: 152 EEKALALSKKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNV 211
            EKA AL   + +  +A + L FN +MT+YM + + +KV  ++ EMK++ I L  Y+YN+
Sbjct: 185 REKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNI 244

Query: 212 WMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVENE 271
           W++SC SL  V K+E + ++MK +     +WTTFS +A +Y+K+G+ EKAE AL+KVE  
Sbjct: 245 WLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEAR 304

Query: 272 IKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDF 331
           I     ++R+ YH+L+SLY S  N+ E+YR+W+  KSV P   N+ Y  ++ +L ++ D 
Sbjct: 305 ITG---RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDI 364

Query: 332 EGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRAREMF 391
           EG ++ Y+EW    SS+D R+ ++ + AY++ D  + A  +F+   +    P     E+ 
Sbjct: 365 EGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEIL 424

Query: 392 MIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDVEGAEDFARILK 451
            +   + + +  AL+ L +A S      W P   M + F    EEE DV   E    +L+
Sbjct: 425 AVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLSGFFKLCEEESDVTSKEAVLELLR 484

Query: 452 RFKCLDASAYHLLL 463
           +   L+  +Y  L+
Sbjct: 485 QSGDLEDKSYLALI 495

BLAST of CmaCh04G003810 vs. TAIR10
Match: AT4G02820.1 (AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 246.9 bits (629), Expect = 2.7e-65
Identity = 154/449 (34.30%), Postives = 249/449 (55.46%), Query Frame = 1

Query: 39  RLSKLGATGGSVAQTLNQYIMEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRK-I 98
           RL  L  T  S   T+ ++  EG  V+KYEL R ++ELRK +RY HAL+I EWM +++ I
Sbjct: 66  RLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALEICEWMVVQEDI 125

Query: 99  NYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALS 158
                DYA+ LDLISK++G+ +AE +F D+    +      +LL+ Y +  + +KA AL 
Sbjct: 126 KLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQNKLSDKAEALF 185

Query: 159 KKIDELKFASN-LSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASL 218
           +K+ E  F  + L +N++++MY+   Q EKVP LI E+K R       TYN+W+ + AS 
Sbjct: 186 EKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELKIR-TSPDIVTYNLWLTAFASG 245

Query: 219 NGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQD 278
           N V   E++  + K+E  N  DW T+S L  +Y K   +EKA LALK++E  +    +++
Sbjct: 246 NDVEGAEKVYLKAKEEKLNP-DWVTYSVLTNLYAKTDNVEKARLALKEMEKLVS---KKN 305

Query: 279 RLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYK 338
           R+AY  LISL+A+  ++  V   W  +KS +   N+  YL M+ A+ KL +FE  K  Y 
Sbjct: 306 RVAYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEFEQAKGLYD 365

Query: 339 EWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFK 398
           EWES   + D R+ ++ +  Y+ +D        +E   ++   P +   E+    +LK K
Sbjct: 366 EWESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEILTWAYLKRK 425

Query: 399 QVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDAS 458
            ++  L     AI      +W  +  +        EE+ +V+GAE    +L++   ++  
Sbjct: 426 DMEKVLDCFGKAIDSV--KKWTVNVRLVKGACKELEEQGNVKGAEKLMTLLQKAGYVNTQ 485

Query: 459 AYHLLLKTYAAAGKRAPDMRQRLKEDNIE 486
            Y+ LL+TYA AG+ A  + +R+ +DN+E
Sbjct: 486 LYNSLLRTYAKAGEMALIVEERMAKDNVE 507

BLAST of CmaCh04G003810 vs. NCBI nr
Match: gi|659107719|ref|XP_008453822.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X1 [Cucumis melo])

HSP 1 Score: 761.5 bits (1965), Expect = 9.2e-217
Identity = 388/484 (80.17%), Postives = 419/484 (86.57%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSAAELTRGPVNDQQQRLYPRLSKLGATGGSVAQTLNQYIME 60
           MNRRSL+SRA AGLR LCTS AELTR P N+ +  LYPRLS LGATGGSVAQT+N++IME
Sbjct: 1   MNRRSLISRAPAGLRQLCTSVAELTRSPANNHRG-LYPRLSVLGATGGSVAQTINRFIME 60

Query: 61  GKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAA 120
           G IVKKYELE+CIKELRKYRRY H+LQIMEWME+RKINYSFTDYALRLDLISKV GI AA
Sbjct: 61  GNIVKKYELEKCIKELRKYRRYDHSLQIMEWMEIRKINYSFTDYALRLDLISKVNGITAA 120

Query: 121 ENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMR 180
           E YF DL  SAKNR TYGALLNCYCKE+MEEKA  L KK+DELKF ++L+FNNLMTMYMR
Sbjct: 121 EKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKASTLFKKMDELKFVTSLAFNNLMTMYMR 180

Query: 181 MDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWT 240
           MDQPEKVPPLI EMK+RG +L+T+TYNVWMNSCASLN +GKVEEILEEMK ED NK DWT
Sbjct: 181 MDQPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDSNKLDWT 240

Query: 241 TFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIW 300
           TFSNLA+ YVK GQLEKAELALKKVE EIKS + +DRLAYH LISLYASTSN SEV RIW
Sbjct: 241 TFSNLASFYVKAGQLEKAELALKKVEEEIKSDK-KDRLAYHCLISLYASTSNLSEVNRIW 300

Query: 301 NALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQ 360
           N LKSVYP   N SYLVMLQALSKLKD EGLK+TYKEWES C  FDLRL +V IGAYL+Q
Sbjct: 301 NLLKSVYPTMTNTSYLVMLQALSKLKDIEGLKKTYKEWESICHIFDLRLVNVIIGAYLQQ 360

Query: 361 DMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPS 420
           DMY+DA ++FEDA KRSKGPF RARE FM+YFLK KQVD A SHLESAISES + EWHPS
Sbjct: 361 DMYEDAAMIFEDAIKRSKGPFSRAREKFMVYFLKLKQVDSAFSHLESAISESKEKEWHPS 420

Query: 421 PAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLK 480
            A  NAFL YFEEEKDVEGAEDFARILKR KCLD S YHLLLKTY AAGK APDMRQRLK
Sbjct: 421 LATTNAFLNYFEEEKDVEGAEDFARILKRLKCLDESGYHLLLKTYVAAGKSAPDMRQRLK 480

Query: 481 EDNI 485
           ED+I
Sbjct: 481 EDDI 482

BLAST of CmaCh04G003810 vs. NCBI nr
Match: gi|778690383|ref|XP_004146883.2| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Cucumis sativus])

HSP 1 Score: 756.5 bits (1952), Expect = 3.0e-215
Identity = 381/485 (78.56%), Postives = 418/485 (86.19%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSAAELTRGPVNDQQQRLYPRLSKLGATGGSVAQTLNQYIME 60
           MNRRSL+SRA AG R LCTS  EL R P N+Q+  LYPRLS LGATGGSVA+T+NQ+IME
Sbjct: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRG-LYPRLSALGATGGSVAKTINQFIME 60

Query: 61  GKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAA 120
           G IVKKYELE+CIKELRKYRRYHH LQIMEWME RKINYSFTDYALRLDLISKV G+ AA
Sbjct: 61  GNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAA 120

Query: 121 ENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMR 180
           E YF DL  SAKNR TYGALLNCYCKE+MEEKAL L KK+DELK +++LSFNNLMTMYMR
Sbjct: 121 EKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMR 180

Query: 181 MDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDWT 240
           MD PEKVPPLI EMK+RG +L+T+TYNVWMNSCASLN +GKVEEILEEMK EDRNKFDWT
Sbjct: 181 MDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWT 240

Query: 241 TFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRIW 300
           T+SNLA+ YVK GQ EKAELALKK+E E+KS +  DRL YH LISLYASTSN SEV RIW
Sbjct: 241 TYSNLASFYVKAGQFEKAELALKKLEEEMKSDK-NDRLVYHCLISLYASTSNLSEVNRIW 300

Query: 301 NALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLRQ 360
           NALKSVY    N+SYLVMLQAL KLKD EGLKRTYKEWES+C +FDLR+ +  IGAYL+Q
Sbjct: 301 NALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQ 360

Query: 361 DMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPS 420
           DMY+DA ++FEDA KRSKGPF RAREMFM+YFLK KQVD A SHLESA+SES + EWHPS
Sbjct: 361 DMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPS 420

Query: 421 PAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLK 480
            A   AFL YFEEEKDVEGAEDFARILKR KCLDAS YHLLLKTY AAGK APDMR+RLK
Sbjct: 421 LATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLK 480

Query: 481 EDNIE 486
           ED+IE
Sbjct: 481 EDDIE 483

BLAST of CmaCh04G003810 vs. NCBI nr
Match: gi|1009148013|ref|XP_015891718.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Ziziphus jujuba])

HSP 1 Score: 637.5 bits (1643), Expect = 2.0e-179
Identity = 320/486 (65.84%), Postives = 392/486 (80.66%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSAAELTRGPVNDQQQRLYPRLSKLGATGGSVAQTLNQYIME 60
           MN R L+S  +A L    ++AAE           RLY RLS LGATGGSV++TLN+YIME
Sbjct: 1   MNSRRLISAGAAWLVRQLSTAAETVAAGSTANGTRLYRRLSALGATGGSVSKTLNEYIME 60

Query: 61  GKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAA 120
           G+IVKK+ELERCIKELRKYRR+ HAL+IMEWMEMRKINYSFTD+ALRLDLI K KG+ AA
Sbjct: 61  GRIVKKFELERCIKELRKYRRFQHALEIMEWMEMRKINYSFTDHALRLDLICKTKGVDAA 120

Query: 121 ENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTMYM 180
           ENYF +L S+AKNR T+GALLNCYCKE ME+KALAL +K+D+L F SN L+FNNLM++YM
Sbjct: 121 ENYFDNLPSNAKNRLTFGALLNCYCKENMEDKALALFQKMDDLNFVSNSLAFNNLMSLYM 180

Query: 181 RMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKFDW 240
           RM +PEKVPPL+ EMK+R IF   +TY++WM S +SL  +  VE +LEEM   D +K +W
Sbjct: 181 RMGKPEKVPPLVQEMKQRNIFPCNFTYSIWMQSYSSLGDIEGVERVLEEMNKGDHDKCNW 240

Query: 241 TTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVYRI 300
            T++NLAAIYVK G  EKA+LALKK+E E   +R + R AYHF+ISLYA T N +EV R 
Sbjct: 241 KTYTNLAAIYVKAGHFEKADLALKKLEEE---TRPRGRQAYHFVISLYAGTGNLNEVNRA 300

Query: 301 WNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAYLR 360
           W  LKS+YP TNN+SYLV+LQALSKL D EGLK+ +KEWESS S +D+RLA+V +G YLR
Sbjct: 301 WETLKSIYPETNNLSYLVLLQALSKLNDVEGLKKYFKEWESSFSFYDIRLANVAVGTYLR 360

Query: 361 QDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHP 420
            DMYK+A  VFEDA KR+KGPFF+AREMFM YFLKF+QVD ALS +E+AISE+ DD+W P
Sbjct: 361 NDMYKEASAVFEDATKRTKGPFFKAREMFMNYFLKFRQVDSALSFMEAAISEARDDDWRP 420

Query: 421 SPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRL 480
           SPA+A+AFL YFEEEKDV+ AE F +IL+RF CL+++AYHLLLKTY AAGK AP+MR+RL
Sbjct: 421 SPAVASAFLKYFEEEKDVDSAEQFCKILRRFNCLNSNAYHLLLKTYLAAGKLAPEMRRRL 480

Query: 481 KEDNIE 486
           +E++IE
Sbjct: 481 EEEDIE 483

BLAST of CmaCh04G003810 vs. NCBI nr
Match: gi|659107721|ref|XP_008453823.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X2 [Cucumis melo])

HSP 1 Score: 632.1 bits (1629), Expect = 8.5e-178
Identity = 319/396 (80.56%), Postives = 342/396 (86.36%), Query Frame = 1

Query: 89  MEWMEMRKINYSFTDYALRLDLISKVKGIAAAENYFCDLSSSAKNRFTYGALLNCYCKEL 148
           MEWME+RKINYSFTDYALRLDLISKV GI AAE YF DL  SAKNR TYGALLNCYCKE+
Sbjct: 1   MEWMEIRKINYSFTDYALRLDLISKVNGITAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 60

Query: 149 MEEKALALSKKIDELKFASNLSFNNLMTMYMRMDQPEKVPPLIDEMKRRGIFLSTYTYNV 208
           MEEKA  L KK+DELKF ++L+FNNLMTMYMRMDQPEKVPPLI EMK+RG +L+T+TYNV
Sbjct: 61  MEEKASTLFKKMDELKFVTSLAFNNLMTMYMRMDQPEKVPPLIGEMKQRGFYLTTFTYNV 120

Query: 209 WMNSCASLNGVGKVEEILEEMKDEDRNKFDWTTFSNLAAIYVKVGQLEKAELALKKVENE 268
           WMNSCASLN +GKVEEILEEMK ED NK DWTTFSNLA+ YVK GQLEKAELALKKVE E
Sbjct: 121 WMNSCASLNDIGKVEEILEEMKMEDSNKLDWTTFSNLASFYVKAGQLEKAELALKKVEEE 180

Query: 269 IKSSRQQDRLAYHFLISLYASTSNRSEVYRIWNALKSVYPMTNNMSYLVMLQALSKLKDF 328
           IKS ++ DRLAYH LISLYASTSN SEV RIWN LKSVYP   N SYLVMLQALSKLKD 
Sbjct: 181 IKSDKK-DRLAYHCLISLYASTSNLSEVNRIWNLLKSVYPTMTNTSYLVMLQALSKLKDI 240

Query: 329 EGLKRTYKEWESSCSSFDLRLADVTIGAYLRQDMYKDAVLVFEDANKRSKGPFFRAREMF 388
           EGLK+TYKEWES C  FDLRL +V IGAYL+QDMY+DA ++FEDA KRSKGPF RARE F
Sbjct: 241 EGLKKTYKEWESICHIFDLRLVNVIIGAYLQQDMYEDAAMIFEDAIKRSKGPFSRAREKF 300

Query: 389 MIYFLKFKQVDLALSHLESAISESMDDEWHPSPAMANAFLMYFEEEKDVEGAEDFARILK 448
           M+YFLK KQVD A SHLESAISES + EWHPS A  NAFL YFEEEKDVEGAEDFARILK
Sbjct: 301 MVYFLKLKQVDSAFSHLESAISESKEKEWHPSLATTNAFLNYFEEEKDVEGAEDFARILK 360

Query: 449 RFKCLDASAYHLLLKTYAAAGKRAPDMRQRLKEDNI 485
           R KCLD S YHLLLKTY AAGK APDMRQRLKED+I
Sbjct: 361 RLKCLDESGYHLLLKTYVAAGKSAPDMRQRLKEDDI 395

BLAST of CmaCh04G003810 vs. NCBI nr
Match: gi|645243222|ref|XP_008227878.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Prunus mume])

HSP 1 Score: 604.4 bits (1557), Expect = 1.9e-169
Identity = 312/500 (62.40%), Postives = 384/500 (76.80%), Query Frame = 1

Query: 1   MNRRSLLSRASAGLRHLCTSAAELTRGPVND--QQQRLYPRLSKLGATGGSVAQTLNQYI 60
           MN    +S  +  +R LCT+    T    +      RLY RLS LGATGGSVA+TLNQYI
Sbjct: 1   MNSSRSISAGTWLVRKLCTAVEAATESARSQPGNPTRLYRRLSALGATGGSVAKTLNQYI 60

Query: 61  MEGKIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIA 120
           MEGK++KKYELERCIKELRKYR++ HAL+IMEWME RK+NYS  D+A+RLDL SKVKGI 
Sbjct: 61  MEGKMLKKYELERCIKELRKYRKFQHALEIMEWMEFRKMNYSKADFAIRLDLTSKVKGIE 120

Query: 121 AAENYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASN-LSFNNLMTM 180
           AAE+YF  LS S K+RFTYGALLNCYCKELMEEKAL+L + +DEL+FAS+ L FNNLM+M
Sbjct: 121 AAEDYFSGLSPSLKDRFTYGALLNCYCKELMEEKALSLYETMDELEFASSSLVFNNLMSM 180

Query: 181 YMRMDQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKDEDRNKF 240
           +MR  QPEKV PL+ EMK+R I L T+TYN+WM S ASLN    VE +L+EM+ +D ++ 
Sbjct: 181 HMRKQQPEKVAPLVQEMKQRKIPLDTFTYNIWMQSFASLNNFEGVERVLDEMQKQDGDQC 240

Query: 241 DWTTFSNLAAIYVKVGQLEKAELALKKVENEIKSSRQQDRLAYHFLISLYASTSNRSEVY 300
            W+T+SNLAAIYVK    +KAELALKK E  +K  +Q++   YHFLISLYA TSN  EV 
Sbjct: 241 SWSTYSNLAAIYVKAKIFDKAELALKKSEEMMKPLKQRN--TYHFLISLYACTSNLGEVK 300

Query: 301 RIWNALKSVYPMTNNMSYLVMLQALSKLKDFEGLKRTYKEWESSCSSFDLRLADVTIGAY 360
           R+W +LK  +P TNN+SYL+MLQAL KL D EGLK  ++EWE  CSS+D+RLA+  I  Y
Sbjct: 301 RVWESLKKAFPATNNISYLIMLQALCKLNDIEGLKECFEEWECKCSSYDMRLANTAIRGY 360

Query: 361 LRQDMYKDAVLVFEDANKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEW 420
           L QDMY++A LVF DA KR+KGPFF+AREMFM+YFLK  QVDLA+S+L +A+SE++D+EW
Sbjct: 361 LSQDMYEEAALVFSDACKRTKGPFFKAREMFMLYFLKNCQVDLAVSYLGAAVSETVDEEW 420

Query: 421 HPSPAMANAFLMYFEEEKDVEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQ 480
           HPSP   +AF  YFEEEKDVE AE+F +ILKR  CL ++ Y+LLLKTY AAGK  P+MRQ
Sbjct: 421 HPSPDTTSAFFKYFEEEKDVESAENFCKILKRLNCLCSNEYYLLLKTYIAAGKLDPEMRQ 480

Query: 481 RLKEDNIEKFPWFVVDLDRV 498
           RLKE++IE  P     L+RV
Sbjct: 481 RLKEEDIEISPELESLLERV 498

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR4_ARATH1.4e-13252.28Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidop... [more]
PP300_ARATH2.1e-12048.11Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidop... [more]
PPR86_ARATH3.5e-9942.23Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN... [more]
PPR3_ARATH6.7e-7435.71Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN... [more]
PP302_ARATH4.8e-6434.30Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KUH1_CUCSA2.1e-21578.56Pentatricopeptide repeat-containing protein OS=Cucumis sativus GN=Csa_4G026260 P... [more]
M5X9A6_PRUPE1.5e-15762.91Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa005037mg PE=4 SV=1[more]
B9RNC6_RICCO3.0e-15057.00Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A067K8Z8_JATCU3.7e-14855.56Uncharacterized protein OS=Jatropha curcas GN=JCGZ_14045 PE=4 SV=1[more]
A0A061DR66_THECC6.4e-14856.42Tetratricopeptide repeat (TPR)-like superfamily protein, putative OS=Theobroma c... [more]
Match NameE-valueIdentityDescription
AT1G02370.18.0e-13452.28 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G01990.11.2e-12148.11 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G60770.12.0e-10042.23 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G02150.13.8e-7535.71 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G02820.12.7e-6534.30 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659107719|ref|XP_008453822.1|9.2e-21780.17PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-... [more]
gi|778690383|ref|XP_004146883.2|3.0e-21578.56PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial ... [more]
gi|1009148013|ref|XP_015891718.1|2.0e-17965.84PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial ... [more]
gi|659107721|ref|XP_008453823.1|8.5e-17880.56PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-... [more]
gi|645243222|ref|XP_008227878.1|1.9e-16962.40PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial ... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh04G003810.1CmaCh04G003810.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 135..156
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 170..214
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 311..341
score: 6.27coord: 238..268
score: 6.259coord: 276..306
score: 5.59coord: 167..201
score: 8.506coord: 202..236
score: 8.122coord: 133..163
score: 7.366coord: 346..380
score: 6
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 130..371
score: 5.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 242..373
score: 2.28E-5coord: 134..208
score: 2.2
NoneNo IPR availableunknownCoilCoilcoord: 255..275
scor
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 46..493
score: 5.0E
NoneNo IPR availablePANTHERPTHR24015:SF504SUBFAMILY NOT NAMEDcoord: 46..493
score: 5.0E