CmaCh05G004760 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh05G004760
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr05: 2242261 .. 2244297 (-)
RNA-Seq Expression: CmaCh05G004760
Synteny: CmaCh05G004760
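
The Location field above can be turned into a structured record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of annotation, but not stated on the page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cma_Chr05: 2242261 .. 2244297 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+?):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Cma_Chr05: 2242261 .. 2244297 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cma_Chr05 - 2037
```

Under that assumption the gene spans 2037 bp on the minus strand, which is a whole number of codons (2037 = 3 × 679).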
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAACTCTCGTAGATGGGTTTCTTTCGTCAAACAACACTTCTCCTGCTCTTCTTCCATCGTCTTCCAAGTTAAACTTTGACCTCTGTCCCAGTTTCAGATTTTCTCGAAATTCCATGAATGTAGCTTGTAGGATGCATTCCACTGCGATATCGGCCCATAATAGATCCCAGTGTCGATTTGCTCCAGTTGCTAAATGTCCGGATAGTAATGATGCAGGTTCTAACGTTCCAATCGCTCGTAGTTTCGCTTTGTTTAATCGTAATGTCCAGTTTGTTAAATTAAATGCTCGGCGAGTTGATAGTTTGATTGGAAACAAGCTGGCAAAGGTTTGTGCGAAGTGTGCGACGTGCGTGGATAGTGACCGTAAGGTATTCGATGAAATGCCTGAGAGACCGCTGCCTGCGTATACAGCTTTGATTAGGGCGTATTGTCGGTCAGAGAAGTGGAATGAGCTCTTTGCGGCATTCGGATCGATGGTTGAGGAGGGCATACTACCTGATAAATACCTCGTGCCCACGATTCTTAAAGCATGTTCCAAAATACAAGCGGTGAAGACAGGTAAAATGATTCATGGGTATGCCATTAGGAAGAGGTTGGTCTCTGATATTTTCATTGGGAATGCTCTTATGGATTTCTATGGTAATTGTGGGGATTTGAGATTTTCGATCAATGTTTTTGATTCAATGAGTGAAAAAGATGTGGTGTCATGGACTGCGCTTGTGTCTGCTTACATGGAAGAAGGTCTTTTGGATGAGGCGATGGAAGCATTTCACTCCATGCAGTCTAGCGGATTGAAGCCTGATTTGATATCATGGAATGCATTGGTCTCTGGGTTTGCTCGACATGGAAAGATTGGCACTGCTCTCAAATACTTGGAAGCTATGCAAGAACAAGGATTGAGCCCAAGGGTTAATTCATGGAATGGAGTCATATCAGGCTGTGTTCTGAATGGATATTTCAAAGATGCTTTGTATGTATTCATTAACATGCTGTTGTTTCCTGAGAATCCAAATTCTGTCACTGTTGCGAGTGTTCTACCAGCTTGTGCAGGATTGAGATATCTGGGTTTAGGCAGGGCTGTTCATGCCTACGCTCTTAAGTGCGAGCTGTGTACGAACATCTACGTCGAAGGATCGTTAGTCAATATGTATTCGAAATGCGGGCAAGACGATTATGCTGAAGAAATTTTTGCTAAAGCAGAGAAGAAAAACATTACATTGTGGAATGAAATTATTGCAACTTATGTGAATCAGGGAAGAACTAGCCAGGCATTAGAACGTTTTAGATCAATGCAGCATCATGGACTAAGACCTGATGTTGTAACCTACAACACACTGCTGGCTGGATATGCAAAAAATGGGCAGAAAGTTGAAGCATATAACTTGCTAACTGAGATGTTGCAGAAAGACCTGGCACCTAATGTTGTATCTTTGAATGCTTTAGTGTCTGGATTTCAACAATCTGGGTTAAGTTATGAAGCTCTTGAGTTATTCCAGACCATGCTTTACACTGCTTGCCTTGTTGATAAGGTGATTACTTCGCCAATCAGACCGAATATCGTCATCACTATAACTGCAGCTCTGGCTGCTTGTGCTAGCTTGAATTTATTGCACAAAGGGAAAGAAATCCATGGATATATGTTGAGGAACGGTTTTGAAGACAACCACATTGTTTCGAGTGCTCTCATTGACATGTACTCGAAGTGCGAGTGTATTGATTCGGTGATTCAAGTATTTGGGGGAATAAAGAACAGGAATGAAGTTTGTTGGAATGCCTTGATTGCAGGTTTTAGGAGAGTTATGCAGCCCAAAATGGCGGTTGAACTCTTCTGTCAAATGCTAGTAGAAGGCATAAAACCAAGTTCAGACAGCTTTTCGATACTTCTCCCTGCCTTGGCTAGGACAGATTTGATAATGAGGAGACAGCTACATTCCTATATCATCAAGAGTCAACTCGTCGAATCGTGCGATGATCTCTCATATGTCTTAAGTTCAAA
CGAGTTTGATGGAGGAGTTATGCTTCATGGAACATAA

mRNA sequence

ATGGCAACTCTCGTAGATGGGTTTCTTTCGTCAAACAACACTTCTCCTGCTCTTCTTCCATCGTCTTCCAAGTTAAACTTTGACCTCTGTCCCAGTTTCAGATTTTCTCGAAATTCCATGAATGTAGCTTGTAGGATGCATTCCACTGCGATATCGGCCCATAATAGATCCCAGTGTCGATTTGCTCCAGTTGCTAAATGTCCGGATAGTAATGATGCAGGTTCTAACGTTCCAATCGCTCGTAGTTTCGCTTTGTTTAATCGTAATGTCCAGTTTGTTAAATTAAATGCTCGGCGAGTTGATAGTTTGATTGGAAACAAGCTGGCAAAGGTTTGTGCGAAGTGTGCGACGTGCGTGGATAGTGACCGTAAGGTATTCGATGAAATGCCTGAGAGACCGCTGCCTGCGTATACAGCTTTGATTAGGGCGTATTGTCGGTCAGAGAAGTGGAATGAGCTCTTTGCGGCATTCGGATCGATGGTTGAGGAGGGCATACTACCTGATAAATACCTCGTGCCCACGATTCTTAAAGCATGTTCCAAAATACAAGCGGTGAAGACAGGTAAAATGATTCATGGGTATGCCATTAGGAAGAGGTTGGTCTCTGATATTTTCATTGGGAATGCTCTTATGGATTTCTATGGTAATTGTGGGGATTTGAGATTTTCGATCAATGTTTTTGATTCAATGAGTGAAAAAGATGTGGTGTCATGGACTGCGCTTGTGTCTGCTTACATGGAAGAAGGTCTTTTGGATGAGGCGATGGAAGCATTTCACTCCATGCAGTCTAGCGGATTGAAGCCTGATTTGATATCATGGAATGCATTGGTCTCTGGGTTTGCTCGACATGGAAAGATTGGCACTGCTCTCAAATACTTGGAAGCTATGCAAGAACAAGGATTGAGCCCAAGGGTTAATTCATGGAATGGAGTCATATCAGGCTGTGTTCTGAATGGATATTTCAAAGATGCTTTGTATGTATTCATTAACATGCTGTTGTTTCCTGAGAATCCAAATTCTGTCACTGTTGCGAGTGTTCTACCAGCTTGTGCAGGATTGAGATATCTGGGTTTAGGCAGGGCTGTTCATGCCTACGCTCTTAAGTGCGAGCTGTGTACGAACATCTACGTCGAAGGATCGTTAGTCAATATGTATTCGAAATGCGGGCAAGACGATTATGCTGAAGAAATTTTTGCTAAAGCAGAGAAGAAAAACATTACATTGTGGAATGAAATTATTGCAACTTATGTGAATCAGGGAAGAACTAGCCAGGCATTAGAACGTTTTAGATCAATGCAGCATCATGGACTAAGACCTGATGTTGTAACCTACAACACACTGCTGGCTGGATATGCAAAAAATGGGCAGAAAGTTGAAGCATATAACTTGCTAACTGAGATGTTGCAGAAAGACCTGGCACCTAATGTTGTATCTTTGAATGCTTTAGTGTCTGGATTTCAACAATCTGGGTTAAGTTATGAAGCTCTTGAGTTATTCCAGACCATGCTTTACACTGCTTGCCTTGTTGATAAGGTGATTACTTCGCCAATCAGACCGAATATCGTCATCACTATAACTGCAGCTCTGGCTGCTTGTGCTAGCTTGAATTTATTGCACAAAGGGAAAGAAATCCATGGATATATGTTGAGGAACGGTTTTGAAGACAACCACATTGTTTCGAGTGCTCTCATTGACATGTACTCGAAGTGCGAGTGTATTGATTCGGTGATTCAAGTATTTGGGGGAATAAAGAACAGGAATGAAGTTTGTTGGAATGCCTTGATTGCAGGTTTTAGGAGAGTTATGCAGCCCAAAATGGCGGTTGAACTCTTCTGTCAAATGCTAGTAGAAGGCATAAAACCAAGTTCAGACAGCTTTTCGATACTTCTCCCTGCCTTGGCTAGGACAGATTTGATAATGAGGAGACAGCTACATTCCTATATCATCAAGAGTCAACTCGTCGAATCGTGCGATGATCTCTCATATGTCTTAAGTTCAAA
CGAGTTTGATGGAGGAGTTATGCTTCATGGAACATAA

Coding sequence (CDS)

ATGGCAACTCTCGTAGATGGGTTTCTTTCGTCAAACAACACTTCTCCTGCTCTTCTTCCATCGTCTTCCAAGTTAAACTTTGACCTCTGTCCCAGTTTCAGATTTTCTCGAAATTCCATGAATGTAGCTTGTAGGATGCATTCCACTGCGATATCGGCCCATAATAGATCCCAGTGTCGATTTGCTCCAGTTGCTAAATGTCCGGATAGTAATGATGCAGGTTCTAACGTTCCAATCGCTCGTAGTTTCGCTTTGTTTAATCGTAATGTCCAGTTTGTTAAATTAAATGCTCGGCGAGTTGATAGTTTGATTGGAAACAAGCTGGCAAAGGTTTGTGCGAAGTGTGCGACGTGCGTGGATAGTGACCGTAAGGTATTCGATGAAATGCCTGAGAGACCGCTGCCTGCGTATACAGCTTTGATTAGGGCGTATTGTCGGTCAGAGAAGTGGAATGAGCTCTTTGCGGCATTCGGATCGATGGTTGAGGAGGGCATACTACCTGATAAATACCTCGTGCCCACGATTCTTAAAGCATGTTCCAAAATACAAGCGGTGAAGACAGGTAAAATGATTCATGGGTATGCCATTAGGAAGAGGTTGGTCTCTGATATTTTCATTGGGAATGCTCTTATGGATTTCTATGGTAATTGTGGGGATTTGAGATTTTCGATCAATGTTTTTGATTCAATGAGTGAAAAAGATGTGGTGTCATGGACTGCGCTTGTGTCTGCTTACATGGAAGAAGGTCTTTTGGATGAGGCGATGGAAGCATTTCACTCCATGCAGTCTAGCGGATTGAAGCCTGATTTGATATCATGGAATGCATTGGTCTCTGGGTTTGCTCGACATGGAAAGATTGGCACTGCTCTCAAATACTTGGAAGCTATGCAAGAACAAGGATTGAGCCCAAGGGTTAATTCATGGAATGGAGTCATATCAGGCTGTGTTCTGAATGGATATTTCAAAGATGCTTTGTATGTATTCATTAACATGCTGTTGTTTCCTGAGAATCCAAATTCTGTCACTGTTGCGAGTGTTCTACCAGCTTGTGCAGGATTGAGATATCTGGGTTTAGGCAGGGCTGTTCATGCCTACGCTCTTAAGTGCGAGCTGTGTACGAACATCTACGTCGAAGGATCGTTAGTCAATATGTATTCGAAATGCGGGCAAGACGATTATGCTGAAGAAATTTTTGCTAAAGCAGAGAAGAAAAACATTACATTGTGGAATGAAATTATTGCAACTTATGTGAATCAGGGAAGAACTAGCCAGGCATTAGAACGTTTTAGATCAATGCAGCATCATGGACTAAGACCTGATGTTGTAACCTACAACACACTGCTGGCTGGATATGCAAAAAATGGGCAGAAAGTTGAAGCATATAACTTGCTAACTGAGATGTTGCAGAAAGACCTGGCACCTAATGTTGTATCTTTGAATGCTTTAGTGTCTGGATTTCAACAATCTGGGTTAAGTTATGAAGCTCTTGAGTTATTCCAGACCATGCTTTACACTGCTTGCCTTGTTGATAAGGTGATTACTTCGCCAATCAGACCGAATATCGTCATCACTATAACTGCAGCTCTGGCTGCTTGTGCTAGCTTGAATTTATTGCACAAAGGGAAAGAAATCCATGGATATATGTTGAGGAACGGTTTTGAAGACAACCACATTGTTTCGAGTGCTCTCATTGACATGTACTCGAAGTGCGAGTGTATTGATTCGGTGATTCAAGTATTTGGGGGAATAAAGAACAGGAATGAAGTTTGTTGGAATGCCTTGATTGCAGGTTTTAGGAGAGTTATGCAGCCCAAAATGGCGGTTGAACTCTTCTGTCAAATGCTAGTAGAAGGCATAAAACCAAGTTCAGACAGCTTTTCGATACTTCTCCCTGCCTTGGCTAGGACAGATTTGATAATGAGGAGACAGCTACATTCCTATATCATCAAGAGTCAACTCGTCGAATCGTGCGATGATCTCTCATATGTCTTAAGTTCAAA
CGAGTTTGATGGAGGAGTTATGCTTCATGGAACATAA

Protein sequence

MATLVDGFLSSNNTSPALLPSSSKLNFDLCPSFRFSRNSMNVACRMHSTAISAHNRSQCRFAPVAKCPDSNDAGSNVPIARSFALFNRNVQFVKLNARRVDSLIGNKLAKVCAKCATCVDSDRKVFDEMPERPLPAYTALIRAYCRSEKWNELFAAFGSMVEEGILPDKYLVPTILKACSKIQAVKTGKMIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFSINVFDSMSEKDVVSWTALVSAYMEEGLLDEAMEAFHSMQSSGLKPDLISWNALVSGFARHGKIGTALKYLEAMQEQGLSPRVNSWNGVISGCVLNGYFKDALYVFINMLLFPENPNSVTVASVLPACAGLRYLGLGRAVHAYALKCELCTNIYVEGSLVNMYSKCGQDDYAEEIFAKAEKKNITLWNEIIATYVNQGRTSQALERFRSMQHHGLRPDVVTYNTLLAGYAKNGQKVEAYNLLTEMLQKDLAPNVVSLNALVSGFQQSGLSYEALELFQTMLYTACLVDKVITSPIRPNIVITITAALAACASLNLLHKGKEIHGYMLRNGFEDNHIVSSALIDMYSKCECIDSVIQVFGGIKNRNEVCWNALIAGFRRVMQPKMAVELFCQMLVEGIKPSSDSFSILLPALARTDLIMRRQLHSYIIKSQLVESCDDLSYVLSSNEFDGGVMLHGT
Homology
BLAST of CmaCh05G004760 vs. ExPASy Swiss-Prot
Match: Q9FXH1 (Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana OX=3702 GN=DYW7 PE=2 SV=1)

HSP 1 Score: 351.7 bits (901), Expect = 1.9e-95
Identity = 191/534 (35.77%), Positives = 304/534 (56.93%), Query Frame = 0

Query: 101 DSLIGNKLAKVCAKCATCVDSDRKVFDEMPERPLPAYTALIRAYCRSEKWNELFAAFGSM 160
           D  +  KL  + AKC  C+   RKVFD M ER L  ++A+I AY R  +W E+   F  M
Sbjct: 114 DVFVETKLLSMYAKCG-CIADARKVFDSMRERNLFTWSAMIGAYSRENRWREVAKLFRLM 173

Query: 161 VEEGILPDKYLVPTILKACSKIQAVKTGKMIHGYAIRKRLVSDIFIGNALMDFYGNCGDL 220
           +++G+LPD +L P IL+ C+    V+ GK+IH   I+  + S + + N+++  Y  CG+L
Sbjct: 174 MKDGVLPDDFLFPKILQGCANCGDVEAGKVIHSVVIKLGMSSCLRVSNSILAVYAKCGEL 233

Query: 221 RFSINVFDSMSEKDVVSWTALVSAYMEEGLLDEAMEAFHSMQSSGLKPDLISWNALVSGF 280
            F+   F  M E+DV++W +++ AY + G  +EA+E    M+  G+ P L++WN L+ G+
Sbjct: 234 DFATKFFRRMRERDVIAWNSVLLAYCQNGKHEEAVELVKEMEKEGISPGLVTWNILIGGY 293

Query: 281 ARHGKIGTALKYLEAMQEQGLSPRVNSWNGVISGCVLNGYFKDALYVFINMLLFPENPNS 340
            + GK   A+  ++ M+  G++  V +W  +ISG + NG    AL +F  M L    PN+
Sbjct: 294 NQLGKCDAAMDLMQKMETFGITADVFTWTAMISGLIHNGMRYQALDMFRKMFLAGVVPNA 353

Query: 341 VTVASVLPACAGLRYLGLGRAVHAYALKCELCTNIYVEGSLVNMYSKCGQDDYAEEIFAK 400
           VT+ S + AC+ L+ +  G  VH+ A+K     ++ V  SLV+MYSKCG+ + A ++F  
Sbjct: 354 VTIMSAVSACSCLKVINQGSEVHSIAVKMGFIDDVLVGNSLVDMYSKCGKLEDARKVFDS 413

Query: 401 AEKKNITLWNEIIATYVNQGRTSQALERFRSMQHHGLRPDVVTYNTLLAGYAKNGQKVEA 460
            + K++  WN +I  Y   G   +A E F  MQ   LRP+++T+NT+++GY KNG + EA
Sbjct: 414 VKNKDVYTWNSMITGYCQAGYCGKAYELFTRMQDANLRPNIITWNTMISGYIKNGDEGEA 473

Query: 461 YNLLTEMLQKD--LAPNVVSLNALVSGFQQSGLSYEALELFQTMLYTACLVDKVITSPIR 520
            +L   M +KD  +  N  + N +++G+ Q+G   EALELF+ M +          S   
Sbjct: 474 MDLFQRM-EKDGKVQRNTATWNLIIAGYIQNGKKDEALELFRKMQF----------SRFM 533

Query: 521 PNIVITITAALAACASLNLLHKGKEIHGYMLRNGFEDNHIVSSALIDMYSKCECIDSVIQ 580
           PN V TI + L ACA+L      +EIHG +LR   +  H V +AL D Y+K   I+    
Sbjct: 534 PNSV-TILSLLPACANLLGAKMVREIHGCVLRRNLDAIHAVKNALTDTYAKSGDIEYSRT 593

Query: 581 VFGGIKNRNEVCWNALIAGFRRVMQPKMAVELFCQMLVEGIKPSSDSFSILLPA 633
           +F G++ ++ + WN+LI G+        A+ LF QM  +GI P+  + S ++ A
Sbjct: 594 IFLGMETKDIITWNSLIGGYVLHGSYGPALALFNQMKTQGITPNRGTLSSIILA 634
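
The percentages in an HSP statistics line are the identical (or positive) residue counts divided by the alignment length. The sketch below parses such a line and recomputes them; the function name `parse_hsp_stats` is hypothetical, and the pattern tolerates both "Positives" and the "Postives" spelling that some BLAST report renderings emit.

```python
import re

# Matches e.g. "Identity = 191/534 (35.77%), Positives = 304/534 (56.93%)"
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned = int(m.group(1)), int(m.group(2))
    positives = int(m.group(3))
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "pct_identity": round(100 * identical / aligned, 2),
        "pct_positives": round(100 * positives / aligned, 2),
    }

stats = parse_hsp_stats(
    "Identity = 191/534 (35.77%), Positives = 304/534 (56.93%), Query Frame = 0"
)
print(stats["pct_identity"], stats["pct_positives"])  # 35.77 56.93
```

Recomputing the values from the raw counts is a cheap consistency check when the report text has passed through scraping or reformatting.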

BLAST of CmaCh05G004760 vs. ExPASy Swiss-Prot
Match: Q9SV26 (Pentatricopeptide repeat-containing protein At4g01030, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H65 PE=3 SV=2)

HSP 1 Score: 280.0 bits (715), Expect = 7.0e-74
Identity = 163/531 (30.70%), Positives = 280/531 (52.73%), Query Frame = 0

Query: 124 KVFDEMPERPLPAYTALIRAYCRSEKWNELFAAFGSMVEEGILPDKYLVPTILKACSKIQ 183
           K+FDEMP+R   A+  ++    RS  W +    F  M   G       +  +L+ CS  +
Sbjct: 44  KLFDEMPKRDDLAWNEIVMVNLRSGNWEKAVELFREMQFSGAKAYDSTMVKLLQVCSNKE 103

Query: 184 AVKTGKMIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFSINVFDSMSEKDVVSWTALVS 243
               G+ IHGY +R  L S++ + N+L+  Y   G L  S  VF+SM ++++ SW +++S
Sbjct: 104 GFAEGRQIHGYVLRLGLESNVSMCNSLIVMYSRNGKLELSRKVFNSMKDRNLSSWNSILS 163

Query: 244 AYMEEGLLDEAMEAFHSMQSSGLKPDLISWNALVSGFARHGKIGTALKYLEAMQEQGLSP 303
           +Y + G +D+A+     M+  GLKPD+++WN+L+SG+A  G    A+  L+ MQ  GL P
Sbjct: 164 SYTKLGYVDDAIGLLDEMEICGLKPDIVTWNSLLSGYASKGLSKDAIAVLKRMQIAGLKP 223

Query: 304 RVNSWNGVISGCVLNGYFKDALYVFINMLLFPENPNSVTVASVLPACAGLRYLGLGRAVH 363
             +S                                   ++S+L A A   +L LG+A+H
Sbjct: 224 STSS-----------------------------------ISSLLQAVAEPGHLKLGKAIH 283

Query: 364 AYALKCELCTNIYVEGSLVNMYSKCGQDDYAEEIFAKAEKKNITLWNEIIA--TYVNQGR 423
            Y L+ +L  ++YVE +L++MY K G   YA  +F   + KNI  WN +++  +Y    +
Sbjct: 284 GYILRNQLWYDVYVETTLIDMYIKTGYLPYARMVFDMMDAKNIVAWNSLVSGLSYACLLK 343

Query: 424 TSQALERFRSMQHHGLRPDVVTYNTLLAGYAKNGQKVEAYNLLTEMLQKDLAPNVVSLNA 483
            ++AL     M+  G++PD +T+N+L +GYA  G+  +A +++ +M +K +APNVVS  A
Sbjct: 344 DAEAL--MIRMEKEGIKPDAITWNSLASGYATLGKPEKALDVIGKMKEKGVAPNVVSWTA 403

Query: 484 LVSGFQQSGLSYEALELFQTMLYTACLVDKVITSPIRPNIVITITAALAACASLNLLHKG 543
           + SG  ++G    AL++F           K+    + PN   T++  L     L+LLH G
Sbjct: 404 IFSGCSKNGNFRNALKVF----------IKMQEEGVGPN-AATMSTLLKILGCLSLLHSG 463

Query: 544 KEIHGYMLRNGFEDNHIVSSALIDMYSKCECIDSVIQVFGGIKNRNEVCWNALIAGFRRV 603
           KE+HG+ LR     +  V++AL+DMY K   + S I++F GIKN++   WN ++ G+   
Sbjct: 464 KEVHGFCLRKNLICDAYVATALVDMYGKSGDLQSAIEIFWGIKNKSLASWNCMLMGYAMF 523

Query: 604 MQPKMAVELFCQMLVEGIKPSSDSFSILLPALARTDLIMRRQLHSYIIKSQ 653
            + +  +  F  ML  G++P + +F+ +L     + L+     +  +++S+
Sbjct: 524 GRGEEGIAAFSVMLEAGMEPDAITFTSVLSVCKNSGLVQEGWKYFDLMRSR 526

BLAST of CmaCh05G004760 vs. ExPASy Swiss-Prot
Match: Q9FM64 (Pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR21 PE=2 SV=1)

HSP 1 Score: 275.8 bits (704), Expect = 1.3e-72
Identity = 170/588 (28.91%), Positives = 304/588 (51.70%), Query Frame = 0

Query: 104 IGNKLAKVCAKCATCVDSDRKVFDEMPERPLPAYTALIRAYCRSEKWNELFAAFGSMVEE 163
           I  KL    AKC   ++    +F ++  R + ++ A+I   CR          F  M+E 
Sbjct: 109 IETKLVIFYAKC-DALEIAEVLFSKLRVRNVFSWAAIIGVKCRIGLCEGALMGFVEMLEN 168

Query: 164 GILPDKYLVPTILKACSKIQAVKTGKMIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFS 223
            I PD ++VP + KAC  ++  + G+ +HGY ++  L   +F+ ++L D YG CG L  +
Sbjct: 169 EIFPDNFVVPNVCKACGALKWSRFGRGVHGYVVKSGLEDCVFVASSLADMYGKCGVLDDA 228

Query: 224 INVFDSMSEKDVVSWTALVSAYMEEGLLDEAMEAFHSMQSSGLKPDLISWN--------- 283
             VFD + +++ V+W AL+  Y++ G  +EA+  F  M+  G++P  ++ +         
Sbjct: 229 SKVFDEIPDRNAVAWNALMVGYVQNGKNEEAIRLFSDMRKQGVEPTRVTVSTCLSASANM 288

Query: 284 ------------ALVSGFARHGKIGTAL----------KYLEAMQEQGLSPRVNSWNGVI 343
                       A+V+G      +GT+L          +Y E + ++     V +WN +I
Sbjct: 289 GGVEEGKQSHAIAIVNGMELDNILGTSLLNFYCKVGLIEYAEMVFDRMFEKDVVTWNLII 348

Query: 344 SGCVLNGYFKDALYVFINMLLFPENPNSVTVASVLPACAGLRYLGLGRAVHAYALKCELC 403
           SG V  G  +DA+Y+   M L     + VT+A+++ A A    L LG+ V  Y ++    
Sbjct: 349 SGYVQQGLVEDAIYMCQLMRLEKLKYDCVTLATLMSAAARTENLKLGKEVQCYCIRHSFE 408

Query: 404 TNIYVEGSLVNMYSKCGQDDYAEEIFAKAEKKNITLWNEIIATYVNQGRTSQALERFRSM 463
           ++I +  ++++MY+KCG    A+++F    +K++ LWN ++A Y   G + +AL  F  M
Sbjct: 409 SDIVLASTVMDMYAKCGSIVDAKKVFDSTVEKDLILWNTLLAAYAESGLSGEALRLFYGM 468

Query: 464 QHHGLRPDVVTYNTLLAGYAKNGQKVEAYNLLTEMLQKDLAPNVVSLNALVSGFQQSGLS 523
           Q  G+ P+V+T+N ++    +NGQ  EA ++  +M    + PN++S   +++G  Q+G S
Sbjct: 469 QLEGVPPNVITWNLIILSLLRNGQVDEAKDMFLQMQSSGIIPNLISWTTMMNGMVQNGCS 528

Query: 524 YEALELFQTMLYTACLVDKVITSPIRPNIVITITAALAACASLNLLHKGKEIHGYMLRNG 583
            EA+            + K+  S +RPN   +IT AL+ACA L  LH G+ IHGY++RN 
Sbjct: 529 EEAI----------LFLRKMQESGLRPN-AFSITVALSACAHLASLHIGRTIHGYIIRN- 588

Query: 584 FEDNHIVS--SALIDMYSKCECIDSVIQVFGGIKNRNEVCWNALIAGFRRVMQPKMAVEL 643
            + + +VS  ++L+DMY+KC  I+   +VFG          NA+I+ +      K A+ L
Sbjct: 589 LQHSSLVSIETSLVDMYAKCGDINKAEKVFGSKLYSELPLSNAMISAYALYGNLKEAIAL 648

Query: 644 FCQMLVEGIKPSSDSFSILLPALART-DLIMRRQLHSYIIKSQLVESC 658
           +  +   G+KP + + + +L A     D+    ++ + I+  + ++ C
Sbjct: 649 YRSLEGVGLKPDNITITNVLSACNHAGDINQAIEIFTDIVSKRSMKPC 683

BLAST of CmaCh05G004760 vs. ExPASy Swiss-Prot
Match: Q9SS83 (Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E88 PE=2 SV=1)

HSP 1 Score: 252.7 bits (644), Expect = 1.2e-65
Identity = 197/667 (29.54%), Positives = 319/667 (47.83%), Query Frame = 0

Query: 18  LLPSSSKLNFDLCPSFRFSRN---SMNVACRMHSTAISAHNRSQCRFAPV----AKCPDS 77
           L PSS+   FD   SF F R    S ++  R++   + +H++   R   +     K   S
Sbjct: 7   LTPSSAM--FD---SFSFVRRLSYSPDLGRRIYGHVLPSHDQIHQRLLEICLGQCKLFKS 66

Query: 78  NDAGSNVPIARSFALFNRNVQFVKLNARRVDS--LIGNKLAKVCAKCATCVDSDRKVFDE 137
                 +P   + AL        K     +DS   +GN +  + AKCA  V    K FD 
Sbjct: 67  RKVFDEMPQRLALALRIGKAVHSKSLILGIDSEGRLGNAIVDLYAKCAQ-VSYAEKQFDF 126

Query: 138 MPERPLPAYTALIRAYCRSEKWNELFAAFGSMVEEGILPDKYLVPTILKACSKIQAVKTG 197
           + E+ + A+ +++  Y    K  ++  +F S+ E  I P+K+    +L  C++   V+ G
Sbjct: 127 L-EKDVTAWNSMLSMYSSIGKPGKVLRSFVSLFENQIFPNKFTFSIVLSTCARETNVEFG 186

Query: 198 KMIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFSINVFDSMSEKDVVSWTALVSAYMEE 257
           + IH   I+  L  + + G AL+D Y  C  +  +  VF+ + + + V WT L S Y++ 
Sbjct: 187 RQIHCSMIKMGLERNSYCGGALVDMYAKCDRISDARRVFEWIVDPNTVCWTCLFSGYVKA 246

Query: 258 GLLDEAMEAFHSMQSSGLKPDLISWNALVSGFARHGKIGTALKYLEAMQEQGLSPRVNSW 317
           GL +EA+  F  M+  G +PD +++  +++ + R GK+  A      M     SP V +W
Sbjct: 247 GLPEEAVLVFERMRDEGHRPDHLAFVTVINTYIRLGKLKDARLLFGEMS----SPDVVAW 306

Query: 318 NGVISGCVLNGYFKDALYVFINMLLFPENPNSVTVASVLPACAGLRYLGLGRAVHAYALK 377
           N +ISG    G    A+  F NM          T+ SVL A   +  L LG  VHA A+K
Sbjct: 307 NVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVANLDLGLVVHAEAIK 366

Query: 378 CELCTNIYVEGSLVNMYSKCGQDDYAEEIFAKAEKKNITLWNEIIATYVNQGRTSQALER 437
             L +NIYV  SLV+MYSKC + + A ++F   E+KN   WN +I  Y + G + + +E 
Sbjct: 367 LGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMEL 426

Query: 438 FRSMQHHGLRPDVVTYNTLLAGYAKNGQKVEAYNLLTEMLQKDLAPNVVSLNALVSGFQQ 497
           F  M+  G   D  T+ +LL+  A +          + +++K LA N+   NALV  + +
Sbjct: 427 FMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAK 486

Query: 498 SGLSYEALELFQTMLYTACLVDKVITSPI------------------RPNIV------IT 557
            G   +A ++F+ M    C  D V  + I                  R N+         
Sbjct: 487 CGALEDARQIFERM----CDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGAC 546

Query: 558 ITAALAACASLNLLHKGKEIHGYMLRNGFEDNHIVSSALIDMYSKCECIDSVIQVFGGIK 617
           + + L AC  ++ L++GK++H   ++ G + +    S+LIDMYSKC  I    +VF  + 
Sbjct: 547 LASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLP 606

Query: 618 NRNEVCWNALIAGFRRVMQPKMAVELFCQMLVEGIKPSSDSFSILLPALARTD-LIMRRQ 651
             + V  NALIAG+ +    + AV LF +ML  G+ PS  +F+ ++ A  + + L +  Q
Sbjct: 607 EWSVVSMNALIAGYSQ-NNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQ 657

BLAST of CmaCh05G004760 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 9.8e-60
Identity = 150/561 (26.74%), Positives = 273/561 (48.66%), Query Frame = 0

Query: 130 PERPLPAYTALIRAYCRSEKWNELFAAFGSMVEEGILPDKYLVPTILKACSKIQAVKTGK 189
           P + +  + ++IRA+ ++  + E    +G + E  + PDKY  P+++KAC+ +   + G 
Sbjct: 67  PAKNVYLWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGD 126

Query: 190 MIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFSINVFDSMSEKDVVSWTALVSAYMEEG 249
           +++   +     SD+F+GNAL+D Y   G L  +  VFD M  +D+VSW +L+S Y   G
Sbjct: 127 LVYEQILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHG 186

Query: 250 LLDEAMEAFHSMQSSGLKPDLISWNALVSGF--------------------------ARH 309
             +EA+E +H +++S + PD  + ++++  F                            +
Sbjct: 187 YYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNN 246

Query: 310 GKIGTALKYLEAMQEQGLSPRVNSWNGVISGCVLNGYFKDALYVFINMLLFPEN-----P 369
           G +   LK+      + +   ++  + V    ++ GY K  + V  ++ +F EN     P
Sbjct: 247 GLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEM-VEESVRMFLENLDQFKP 306

Query: 370 NSVTVASVLPACAGLRYLGLGRAVHAYALKCELCTNIYVEGSLVNMYSKCGQDDYAEEIF 429
           + +TV+SVL AC  LR L L + ++ Y LK        V   L+++Y+KCG    A ++F
Sbjct: 307 DLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVF 366

Query: 430 AKAEKKNITLWNEIIATYVNQGRTSQALERFRSMQHHGLRPDVVTYNTLLAGYAKNGQKV 489
              E K+   WN II+ Y+  G   +A++ F+ M     + D +TY  L++   +     
Sbjct: 367 NSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLK 426

Query: 490 EAYNLLTEMLQKDLAPNVVSLNALVSGFQQSGLSYEALELFQTM----------LYTACL 549
               L +  ++  +  ++   NAL+  + + G   ++L++F +M          + +AC+
Sbjct: 427 FGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACV 486

Query: 550 ------VDKVITSPIRPNIVI----TITAALAACASLNLLHKGKEIHGYMLRNGFEDNHI 609
                     +T+ +R + V+    T    L  CASL     GKEIH  +LR G+E    
Sbjct: 487 RFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQ 546

Query: 610 VSSALIDMYSKCECIDSVIQVFGGIKNRNEVCWNALIAGFRRVMQPKMAVELFCQMLVEG 640
           + +ALI+MYSKC C+++  +VF  +  R+ V W  +I  +    + + A+E F  M   G
Sbjct: 547 IGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSG 606

BLAST of CmaCh05G004760 vs. TAIR 10
Match: AT1G19720.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 351.7 bits (901), Expect = 1.3e-96
Identity = 191/534 (35.77%), Positives = 304/534 (56.93%), Query Frame = 0

Query: 101 DSLIGNKLAKVCAKCATCVDSDRKVFDEMPERPLPAYTALIRAYCRSEKWNELFAAFGSM 160
           D  +  KL  + AKC  C+   RKVFD M ER L  ++A+I AY R  +W E+   F  M
Sbjct: 114 DVFVETKLLSMYAKCG-CIADARKVFDSMRERNLFTWSAMIGAYSRENRWREVAKLFRLM 173

Query: 161 VEEGILPDKYLVPTILKACSKIQAVKTGKMIHGYAIRKRLVSDIFIGNALMDFYGNCGDL 220
           +++G+LPD +L P IL+ C+    V+ GK+IH   I+  + S + + N+++  Y  CG+L
Sbjct: 174 MKDGVLPDDFLFPKILQGCANCGDVEAGKVIHSVVIKLGMSSCLRVSNSILAVYAKCGEL 233

Query: 221 RFSINVFDSMSEKDVVSWTALVSAYMEEGLLDEAMEAFHSMQSSGLKPDLISWNALVSGF 280
            F+   F  M E+DV++W +++ AY + G  +EA+E    M+  G+ P L++WN L+ G+
Sbjct: 234 DFATKFFRRMRERDVIAWNSVLLAYCQNGKHEEAVELVKEMEKEGISPGLVTWNILIGGY 293

Query: 281 ARHGKIGTALKYLEAMQEQGLSPRVNSWNGVISGCVLNGYFKDALYVFINMLLFPENPNS 340
            + GK   A+  ++ M+  G++  V +W  +ISG + NG    AL +F  M L    PN+
Sbjct: 294 NQLGKCDAAMDLMQKMETFGITADVFTWTAMISGLIHNGMRYQALDMFRKMFLAGVVPNA 353

Query: 341 VTVASVLPACAGLRYLGLGRAVHAYALKCELCTNIYVEGSLVNMYSKCGQDDYAEEIFAK 400
           VT+ S + AC+ L+ +  G  VH+ A+K     ++ V  SLV+MYSKCG+ + A ++F  
Sbjct: 354 VTIMSAVSACSCLKVINQGSEVHSIAVKMGFIDDVLVGNSLVDMYSKCGKLEDARKVFDS 413

Query: 401 AEKKNITLWNEIIATYVNQGRTSQALERFRSMQHHGLRPDVVTYNTLLAGYAKNGQKVEA 460
            + K++  WN +I  Y   G   +A E F  MQ   LRP+++T+NT+++GY KNG + EA
Sbjct: 414 VKNKDVYTWNSMITGYCQAGYCGKAYELFTRMQDANLRPNIITWNTMISGYIKNGDEGEA 473

Query: 461 YNLLTEMLQKD--LAPNVVSLNALVSGFQQSGLSYEALELFQTMLYTACLVDKVITSPIR 520
            +L   M +KD  +  N  + N +++G+ Q+G   EALELF+ M +          S   
Sbjct: 474 MDLFQRM-EKDGKVQRNTATWNLIIAGYIQNGKKDEALELFRKMQF----------SRFM 533

Query: 521 PNIVITITAALAACASLNLLHKGKEIHGYMLRNGFEDNHIVSSALIDMYSKCECIDSVIQ 580
           PN V TI + L ACA+L      +EIHG +LR   +  H V +AL D Y+K   I+    
Sbjct: 534 PNSV-TILSLLPACANLLGAKMVREIHGCVLRRNLDAIHAVKNALTDTYAKSGDIEYSRT 593

Query: 581 VFGGIKNRNEVCWNALIAGFRRVMQPKMAVELFCQMLVEGIKPSSDSFSILLPA 633
           +F G++ ++ + WN+LI G+        A+ LF QM  +GI P+  + S ++ A
Sbjct: 594 IFLGMETKDIITWNSLIGGYVLHGSYGPALALFNQMKTQGITPNRGTLSSIILA 634

BLAST of CmaCh05G004760 vs. TAIR 10
Match: AT4G01030.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 280.0 bits (715), Expect = 5.0e-75
Identity = 163/531 (30.70%), Positives = 280/531 (52.73%), Query Frame = 0

Query: 124 KVFDEMPERPLPAYTALIRAYCRSEKWNELFAAFGSMVEEGILPDKYLVPTILKACSKIQ 183
           K+FDEMP+R   A+  ++    RS  W +    F  M   G       +  +L+ CS  +
Sbjct: 44  KLFDEMPKRDDLAWNEIVMVNLRSGNWEKAVELFREMQFSGAKAYDSTMVKLLQVCSNKE 103

Query: 184 AVKTGKMIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFSINVFDSMSEKDVVSWTALVS 243
               G+ IHGY +R  L S++ + N+L+  Y   G L  S  VF+SM ++++ SW +++S
Sbjct: 104 GFAEGRQIHGYVLRLGLESNVSMCNSLIVMYSRNGKLELSRKVFNSMKDRNLSSWNSILS 163

Query: 244 AYMEEGLLDEAMEAFHSMQSSGLKPDLISWNALVSGFARHGKIGTALKYLEAMQEQGLSP 303
           +Y + G +D+A+     M+  GLKPD+++WN+L+SG+A  G    A+  L+ MQ  GL P
Sbjct: 164 SYTKLGYVDDAIGLLDEMEICGLKPDIVTWNSLLSGYASKGLSKDAIAVLKRMQIAGLKP 223

Query: 304 RVNSWNGVISGCVLNGYFKDALYVFINMLLFPENPNSVTVASVLPACAGLRYLGLGRAVH 363
             +S                                   ++S+L A A   +L LG+A+H
Sbjct: 224 STSS-----------------------------------ISSLLQAVAEPGHLKLGKAIH 283

Query: 364 AYALKCELCTNIYVEGSLVNMYSKCGQDDYAEEIFAKAEKKNITLWNEIIA--TYVNQGR 423
            Y L+ +L  ++YVE +L++MY K G   YA  +F   + KNI  WN +++  +Y    +
Sbjct: 284 GYILRNQLWYDVYVETTLIDMYIKTGYLPYARMVFDMMDAKNIVAWNSLVSGLSYACLLK 343

Query: 424 TSQALERFRSMQHHGLRPDVVTYNTLLAGYAKNGQKVEAYNLLTEMLQKDLAPNVVSLNA 483
            ++AL     M+  G++PD +T+N+L +GYA  G+  +A +++ +M +K +APNVVS  A
Sbjct: 344 DAEAL--MIRMEKEGIKPDAITWNSLASGYATLGKPEKALDVIGKMKEKGVAPNVVSWTA 403

Query: 484 LVSGFQQSGLSYEALELFQTMLYTACLVDKVITSPIRPNIVITITAALAACASLNLLHKG 543
           + SG  ++G    AL++F           K+    + PN   T++  L     L+LLH G
Sbjct: 404 IFSGCSKNGNFRNALKVF----------IKMQEEGVGPN-AATMSTLLKILGCLSLLHSG 463

Query: 544 KEIHGYMLRNGFEDNHIVSSALIDMYSKCECIDSVIQVFGGIKNRNEVCWNALIAGFRRV 603
           KE+HG+ LR     +  V++AL+DMY K   + S I++F GIKN++   WN ++ G+   
Sbjct: 464 KEVHGFCLRKNLICDAYVATALVDMYGKSGDLQSAIEIFWGIKNKSLASWNCMLMGYAMF 523

Query: 604 MQPKMAVELFCQMLVEGIKPSSDSFSILLPALARTDLIMRRQLHSYIIKSQ 653
            + +  +  F  ML  G++P + +F+ +L     + L+     +  +++S+
Sbjct: 524 GRGEEGIAAFSVMLEAGMEPDAITFTSVLSVCKNSGLVQEGWKYFDLMRSR 526

BLAST of CmaCh05G004760 vs. TAIR 10
Match: AT5G55740.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 275.8 bits (704), Expect = 9.4e-74
Identity = 170/588 (28.91%), Positives = 304/588 (51.70%), Query Frame = 0

Query: 104 IGNKLAKVCAKCATCVDSDRKVFDEMPERPLPAYTALIRAYCRSEKWNELFAAFGSMVEE 163
           I  KL    AKC   ++    +F ++  R + ++ A+I   CR          F  M+E 
Sbjct: 109 IETKLVIFYAKC-DALEIAEVLFSKLRVRNVFSWAAIIGVKCRIGLCEGALMGFVEMLEN 168

Query: 164 GILPDKYLVPTILKACSKIQAVKTGKMIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFS 223
            I PD ++VP + KAC  ++  + G+ +HGY ++  L   +F+ ++L D YG CG L  +
Sbjct: 169 EIFPDNFVVPNVCKACGALKWSRFGRGVHGYVVKSGLEDCVFVASSLADMYGKCGVLDDA 228

Query: 224 INVFDSMSEKDVVSWTALVSAYMEEGLLDEAMEAFHSMQSSGLKPDLISWN--------- 283
             VFD + +++ V+W AL+  Y++ G  +EA+  F  M+  G++P  ++ +         
Sbjct: 229 SKVFDEIPDRNAVAWNALMVGYVQNGKNEEAIRLFSDMRKQGVEPTRVTVSTCLSASANM 288

Query: 284 ------------ALVSGFARHGKIGTAL----------KYLEAMQEQGLSPRVNSWNGVI 343
                       A+V+G      +GT+L          +Y E + ++     V +WN +I
Sbjct: 289 GGVEEGKQSHAIAIVNGMELDNILGTSLLNFYCKVGLIEYAEMVFDRMFEKDVVTWNLII 348

Query: 344 SGCVLNGYFKDALYVFINMLLFPENPNSVTVASVLPACAGLRYLGLGRAVHAYALKCELC 403
           SG V  G  +DA+Y+   M L     + VT+A+++ A A    L LG+ V  Y ++    
Sbjct: 349 SGYVQQGLVEDAIYMCQLMRLEKLKYDCVTLATLMSAAARTENLKLGKEVQCYCIRHSFE 408

Query: 404 TNIYVEGSLVNMYSKCGQDDYAEEIFAKAEKKNITLWNEIIATYVNQGRTSQALERFRSM 463
           ++I +  ++++MY+KCG    A+++F    +K++ LWN ++A Y   G + +AL  F  M
Sbjct: 409 SDIVLASTVMDMYAKCGSIVDAKKVFDSTVEKDLILWNTLLAAYAESGLSGEALRLFYGM 468

Query: 464 QHHGLRPDVVTYNTLLAGYAKNGQKVEAYNLLTEMLQKDLAPNVVSLNALVSGFQQSGLS 523
           Q  G+ P+V+T+N ++    +NGQ  EA ++  +M    + PN++S   +++G  Q+G S
Sbjct: 469 QLEGVPPNVITWNLIILSLLRNGQVDEAKDMFLQMQSSGIIPNLISWTTMMNGMVQNGCS 528

Query: 524 YEALELFQTMLYTACLVDKVITSPIRPNIVITITAALAACASLNLLHKGKEIHGYMLRNG 583
            EA+            + K+  S +RPN   +IT AL+ACA L  LH G+ IHGY++RN 
Sbjct: 529 EEAI----------LFLRKMQESGLRPN-AFSITVALSACAHLASLHIGRTIHGYIIRN- 588

Query: 584 FEDNHIVS--SALIDMYSKCECIDSVIQVFGGIKNRNEVCWNALIAGFRRVMQPKMAVEL 643
            + + +VS  ++L+DMY+KC  I+   +VFG          NA+I+ +      K A+ L
Sbjct: 589 LQHSSLVSIETSLVDMYAKCGDINKAEKVFGSKLYSELPLSNAMISAYALYGNLKEAIAL 648

Query: 644 FCQMLVEGIKPSSDSFSILLPALART-DLIMRRQLHSYIIKSQLVESC 658
           +  +   G+KP + + + +L A     D+    ++ + I+  + ++ C
Sbjct: 649 YRSLEGVGLKPDNITITNVLSACNHAGDINQAIEIFTDIVSKRSMKPC 683

BLAST of CmaCh05G004760 vs. TAIR 10
Match: AT3G09040.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 252.7 bits (644), Expect = 8.5e-67
Identity = 197/667 (29.54%), Positives = 319/667 (47.83%), Query Frame = 0

Query: 18  LLPSSSKLNFDLCPSFRFSRN---SMNVACRMHSTAISAHNRSQCRFAPV----AKCPDS 77
           L PSS+   FD   SF F R    S ++  R++   + +H++   R   +     K   S
Sbjct: 7   LTPSSAM--FD---SFSFVRRLSYSPDLGRRIYGHVLPSHDQIHQRLLEICLGQCKLFKS 66

Query: 78  NDAGSNVPIARSFALFNRNVQFVKLNARRVDS--LIGNKLAKVCAKCATCVDSDRKVFDE 137
                 +P   + AL        K     +DS   +GN +  + AKCA  V    K FD 
Sbjct: 67  RKVFDEMPQRLALALRIGKAVHSKSLILGIDSEGRLGNAIVDLYAKCAQ-VSYAEKQFDF 126

Query: 138 MPERPLPAYTALIRAYCRSEKWNELFAAFGSMVEEGILPDKYLVPTILKACSKIQAVKTG 197
           + E+ + A+ +++  Y    K  ++  +F S+ E  I P+K+    +L  C++   V+ G
Sbjct: 127 L-EKDVTAWNSMLSMYSSIGKPGKVLRSFVSLFENQIFPNKFTFSIVLSTCARETNVEFG 186

Query: 198 KMIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFSINVFDSMSEKDVVSWTALVSAYMEE 257
           + IH   I+  L  + + G AL+D Y  C  +  +  VF+ + + + V WT L S Y++ 
Sbjct: 187 RQIHCSMIKMGLERNSYCGGALVDMYAKCDRISDARRVFEWIVDPNTVCWTCLFSGYVKA 246

Query: 258 GLLDEAMEAFHSMQSSGLKPDLISWNALVSGFARHGKIGTALKYLEAMQEQGLSPRVNSW 317
           GL +EA+  F  M+  G +PD +++  +++ + R GK+  A      M     SP V +W
Sbjct: 247 GLPEEAVLVFERMRDEGHRPDHLAFVTVINTYIRLGKLKDARLLFGEMS----SPDVVAW 306

Query: 318 NGVISGCVLNGYFKDALYVFINMLLFPENPNSVTVASVLPACAGLRYLGLGRAVHAYALK 377
           N +ISG    G    A+  F NM          T+ SVL A   +  L LG  VHA A+K
Sbjct: 307 NVMISGHGKRGCETVAIEYFFNMRKSSVKSTRSTLGSVLSAIGIVANLDLGLVVHAEAIK 366

Query: 378 CELCTNIYVEGSLVNMYSKCGQDDYAEEIFAKAEKKNITLWNEIIATYVNQGRTSQALER 437
             L +NIYV  SLV+MYSKC + + A ++F   E+KN   WN +I  Y + G + + +E 
Sbjct: 367 LGLASNIYVGSSLVSMYSKCEKMEAAAKVFEALEEKNDVFWNAMIRGYAHNGESHKVMEL 426

Query: 438 FRSMQHHGLRPDVVTYNTLLAGYAKNGQKVEAYNLLTEMLQKDLAPNVVSLNALVSGFQQ 497
           F  M+  G   D  T+ +LL+  A +          + +++K LA N+   NALV  + +
Sbjct: 427 FMDMKSSGYNIDDFTFTSLLSTCAASHDLEMGSQFHSIIIKKKLAKNLFVGNALVDMYAK 486

Query: 498 SGLSYEALELFQTMLYTACLVDKVITSPI------------------RPNIV------IT 557
            G   +A ++F+ M    C  D V  + I                  R N+         
Sbjct: 487 CGALEDARQIFERM----CDRDNVTWNTIIGSYVQDENESEAFDLFKRMNLCGIVSDGAC 546

Query: 558 ITAALAACASLNLLHKGKEIHGYMLRNGFEDNHIVSSALIDMYSKCECIDSVIQVFGGIK 617
           + + L AC  ++ L++GK++H   ++ G + +    S+LIDMYSKC  I    +VF  + 
Sbjct: 547 LASTLKACTHVHGLYQGKQVHCLSVKCGLDRDLHTGSSLIDMYSKCGIIKDARKVFSSLP 606

Query: 618 NRNEVCWNALIAGFRRVMQPKMAVELFCQMLVEGIKPSSDSFSILLPALARTD-LIMRRQ 651
             + V  NALIAG+ +    + AV LF +ML  G+ PS  +F+ ++ A  + + L +  Q
Sbjct: 607 EWSVVSMNALIAGYSQ-NNLEEAVVLFQEMLTRGVNPSEITFATIVEACHKPESLTLGTQ 657

BLAST of CmaCh05G004760 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 233.0 bits (593), Expect = 7.0e-61
Identity = 150/561 (26.74%), Positives = 273/561 (48.66%), Query Frame = 0

Query: 130 PERPLPAYTALIRAYCRSEKWNELFAAFGSMVEEGILPDKYLVPTILKACSKIQAVKTGK 189
           P + +  + ++IRA+ ++  + E    +G + E  + PDKY  P+++KAC+ +   + G 
Sbjct: 67  PAKNVYLWNSIIRAFSKNGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGD 126

Query: 190 MIHGYAIRKRLVSDIFIGNALMDFYGNCGDLRFSINVFDSMSEKDVVSWTALVSAYMEEG 249
           +++   +     SD+F+GNAL+D Y   G L  +  VFD M  +D+VSW +L+S Y   G
Sbjct: 127 LVYEQILDMGFESDLFVGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHG 186

Query: 250 LLDEAMEAFHSMQSSGLKPDLISWNALVSGF--------------------------ARH 309
             +EA+E +H +++S + PD  + ++++  F                            +
Sbjct: 187 YYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNN 246

Query: 310 GKIGTALKYLEAMQEQGLSPRVNSWNGVISGCVLNGYFKDALYVFINMLLFPEN-----P 369
           G +   LK+      + +   ++  + V    ++ GY K  + V  ++ +F EN     P
Sbjct: 247 GLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEM-VEESVRMFLENLDQFKP 306

Query: 370 NSVTVASVLPACAGLRYLGLGRAVHAYALKCELCTNIYVEGSLVNMYSKCGQDDYAEEIF 429
           + +TV+SVL AC  LR L L + ++ Y LK        V   L+++Y+KCG    A ++F
Sbjct: 307 DLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVF 366

Query: 430 AKAEKKNITLWNEIIATYVNQGRTSQALERFRSMQHHGLRPDVVTYNTLLAGYAKNGQKV 489
              E K+   WN II+ Y+  G   +A++ F+ M     + D +TY  L++   +     
Sbjct: 367 NSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLK 426

Query: 490 EAYNLLTEMLQKDLAPNVVSLNALVSGFQQSGLSYEALELFQTM----------LYTACL 549
               L +  ++  +  ++   NAL+  + + G   ++L++F +M          + +AC+
Sbjct: 427 FGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACV 486

Query: 550 ------VDKVITSPIRPNIVI----TITAALAACASLNLLHKGKEIHGYMLRNGFEDNHI 609
                     +T+ +R + V+    T    L  CASL     GKEIH  +LR G+E    
Sbjct: 487 RFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQ 546

Query: 610 VSSALIDMYSKCECIDSVIQVFGGIKNRNEVCWNALIAGFRRVMQPKMAVELFCQMLVEG 640
           + +ALI+MYSKC C+++  +VF  +  R+ V W  +I  +    + + A+E F  M   G
Sbjct: 547 IGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSG 606

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9FXH1 | 1.9e-95 | 35.77 | Pentatricopeptide repeat-containing protein At1g19720 OS=Arabidopsis thaliana OX...
Q9SV26 | 7.0e-74 | 30.70 | Pentatricopeptide repeat-containing protein At4g01030, mitochondrial OS=Arabidop...
Q9FM64 | 1.3e-72 | 28.91 | Pentatricopeptide repeat-containing protein At5g55740, chloroplastic OS=Arabidop...
Q9SS83 | 1.2e-65 | 29.54 | Pentatricopeptide repeat-containing protein At3g09040, mitochondrial OS=Arabidop...
Q9SS60 | 9.8e-60 | 26.74 | Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity | Description
AT1G19720.1 | 1.3e-96 | 35.77 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT4G01030.1 | 5.0e-75 | 30.70 | pentatricopeptide (PPR) repeat-containing protein
AT5G55740.1 | 9.4e-74 | 28.91 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G09040.1 | 8.5e-67 | 29.54 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G03580.1 | 7.0e-61 | 26.74 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 409..510, e-value: 6.2E-27, score: 96.1; coord: 83..189, e-value: 7.7E-13, score: 50.2; coord: 190..285, e-value: 6.2E-23, score: 83.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 286..408, e-value: 7.3E-16, score: 60.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 511..657, e-value: 7.4E-23, score: 83.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 381..470
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 271..303, e-value: 7.3E-7, score: 27.0; coord: 442..476, e-value: 1.1E-6, score: 26.4; coord: 307..332, e-value: 0.0013, score: 16.8; coord: 409..441, e-value: 8.1E-5, score: 20.5; coord: 236..269, e-value: 2.3E-8, score: 31.7; coord: 136..168, e-value: 3.9E-7, score: 27.8; coord: 589..622, e-value: 5.9E-6, score: 24.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 380..406, e-value: 0.91, score: 9.9; coord: 271..301, e-value: 1.2E-5, score: 25.2; coord: 307..332, e-value: 0.0063, score: 16.7; coord: 136..165, e-value: 1.2E-4, score: 22.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 587..635, e-value: 2.5E-10, score: 40.4; coord: 439..486, e-value: 5.9E-12, score: 45.6; coord: 233..270, e-value: 1.5E-8, score: 34.7
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 475..509, score: 8.549871
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 269..303, score: 12.364404
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 234..268, score: 12.967276
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 133..167, score: 10.632519
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 440..474, score: 12.506901
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 587..621, score: 10.764054
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 405..439, score: 10.64348
None | No IPR available | PANTHER | PTHR47928:SF72 | SUBFAMILY NOT NAMED | coord: 15..630
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 15..630
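
The PROSITE PS51375 coordinates listed above can be checked against the expected PPR motif size and scanned for tandem repeats. This is a minimal sketch, assuming the coordinates are 1-based inclusive residue positions on the protein; the helper names are hypothetical.

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro table above.
ppr_hits = [
    (475, 509), (269, 303), (234, 268), (133, 167),
    (440, 474), (587, 621), (405, 439),
]

def hit_lengths(hits):
    """Length of each hit, assuming 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in hits]

def tandem_pairs(hits):
    """Pairs of hits where one starts on the residue right after the other ends."""
    ordered = sorted(hits)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b[0] == a[1] + 1]

print(set(hit_lengths(ppr_hits)))   # {35}
for first, second in tandem_pairs(ppr_hits):
    print(first, "->", second)
```

Every hit is 35 residues long, matching the length that gives the pentatricopeptide repeat its name, and three pairs of hits are directly adjacent (234..268/269..303, 405..439/440..474, 440..474/475..509).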

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh05G004760.1 | CmaCh05G004760.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding