CmaCh05G005210.1 (mRNA) Cucurbita maxima (Rimu) v1.1

Overview
NameCmaCh05G005210.1
TypemRNA
OrganismCucurbita maxima (Cucurbita maxima (Rimu) v1.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCma_Chr05: 2492979 .. 2494950 (+)
Sequence length1585
RNA-Seq ExpressionCmaCh05G005210.1
SyntenyCmaCh05G005210.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGCCTCTGGAGGTTTATTGTGTTCTTTTATGTTATGATCGCTCGGTAATGTTATGGGGATCGCTTTGTTGGGGTTCAAGTCTAACAAATATCGAACGGCGCTGATGAATAAGGCGTTTTGTCGCCGGTTTAGGGCAATATTTATAGCTTCGGTTTTTTTGGACCCACGAAAAGGAAGACCATTGTATACCTTTGTGCCAAGGCGCACACAAGGCAATCGATGACTCGGTGGCTCGGCGCTTCAGAGGTCGCCGCTCGAACGAAATCGGTGATGAAGACGACTCTCCGCTCCGGCACCGGACGGTAAGCCCTGTACTTTTACCTCTCTGTTTTCCTAGCTATCATTCCGACAGCTCCGAGTTTCATCGTTTTATTATCCACTTCTGATCTGAACCAAAAAGCTGTTGATTCCATAATTACAGAATTCATTTTCTTCTCCGAGCATGGAGGCCTATATCGTCACATACTTCGCCAGTCGACTCTTCGGACATCATTTCTGCCGTATCTCCCAAAACCCTACAAAAAGGTGGCGACAATGGCAACTTCAAGCAGGACTTTATGAAGCTCAAGCAACCTCGGAGTAGCGTGACCACCGTACTTCAGAGTGGTAACAGTGAAGCTCAAAGAACCTCAATCACTAAGCTCCGGCGTGTTGTCAAGAAGCTTTGCAAATCGAAGCGATTTGAGCGTGCACTAGAGGTTGGCAATTAACAGTTTGGAAATTGAAATTACTTCATTTTCAATGGAAGAATAGAATCATGATCACAAAGTAACGCATCTCACTGTATAATTCATAGGTTTCTTATAGAATGCAATACATTCTTACATTCAGGTACTAATGTTGATGGAAACTCAAAATAAGATCCGTATGTCCCCAGCTGATCATGCTCTCCGATTGGAATTGACCATCAAAGTGCACGGTTTGCTGAAAGCTGAAGAGCACTTCAATCAATTGCCCAATATAGCCTCTCAGAAAGCTTCATCTCTCCTTCTTCTTCATGGTTATGTCAAAGAGCAGAACACAGAGAAAGCCGAGGCTTTCATGGCGAAGCTAAAGGGCTTGGGACTGGTTGTGGATCCGCATCTTTACAATGAGATGATGAAACTGTATGTCGCCACATCTCAAAATGAGAAAGTTCCTCTTGTTATAAAGGACATGAAGCAAAATCAAATACTGAGAAATGTTCTCTCCTACAATCTTTGGATGAATGCATGCAGTGAGCTATATGGGGTTGGGTCAGTGGAATTGGTGTATGAAGAAATGCTGAGTGACAAGAATGTTCAAGTGGGATGGAGCACTCTGTCTACCCTGGCTAAAGTGTATACACAGGCAGGCCTTGTTCACAAAGCGTTTGCAGCCTTGAGAGAAGCAGAGAAGAAGCTATCCTCAGGTAACAGGCTTGGATATTTCTTCTTAATTACATTATATGCCACTTTGAACGATAAGGAGGGAGTTCTTCGAGTTTGGACAGCTAGTAAAGCTGTAAGTGACAGACCCACTTGTGCCAACTACATGTGTATATTGCTATGTTTGGTGAAGCTAGGAGATATCAATGAGGCAGAGAAAATATTCAAGGAATGGGAGCTCGACTGTCGTAATTATGACATTAGAGTCTCCAACGTTCTTCTGGGCGCATATGTGAGGAATGGACTGTTAGATCAGGCCGAGTCATTACATATGCACACATTAGAGAGAGGTGGTACTCCGAATTATAAGACATGGGAGATTCTCATGGAGGGATGTGTGAGAAGCCAAAACTTGGACAGAGGTATTAATTTTCTCAATAACGATTTTGCTGGGTTGAAATGTCATGATTAAAGTCAACATTTTTCTTTTTTGGTCGGATTCAAGTATAGTTAGGTAGATAAAGTACGAACTAAAGAAACATCATATATTTCATAAGCTTTCATTATTCAATCTCTGCAAGTTGCATCCATGCTACATGAACACAGCAGGCACCACACAGGGTAA

mRNA sequence

GGGCCTCTGGAGGTTTATTGTGTTCTTTTATGTTATGATCGCTCGGTAATGTTATGGGGATCGCTTTGTTGGGGTTCAAGTCTAACAAATATCGAACGGCGCTGATGAATAAGGCGTTTTGTCGCCGGTTTAGGGCAATATTTATAGCTTCGGTTTTTTTGGACCCACGAAAAGGAAGACCATTGTATACCTTTGTGCCAAGGCGCACACAAGGCAATCGATGACTCGGTGGCTCGGCGCTTCAGAGGTCGCCGCTCGAACGAAATCGGTGATGAAGACGACTCTCCGCTCCGGCACCGGACGAATTCATTTTCTTCTCCGAGCATGGAGGCCTATATCGTCACATACTTCGCCAGTCGACTCTTCGGACATCATTTCTGCCGTATCTCCCAAAACCCTACAAAAAGGTGGCGACAATGGCAACTTCAAGCAGGACTTTATGAAGCTCAAGCAACCTCGGAGTAGCGTGACCACCGTACTTCAGAGTGGTAACAGTGAAGCTCAAAGAACCTCAATCACTAAGCTCCGGCGTGTTGTCAAGAAGCTTTGCAAATCGAAGCGATTTGAGCGTGCACTAGAGGTACTAATGTTGATGGAAACTCAAAATAAGATCCGTATGTCCCCAGCTGATCATGCTCTCCGATTGGAATTGACCATCAAAGTGCACGGTTTGCTGAAAGCTGAAGAGCACTTCAATCAATTGCCCAATATAGCCTCTCAGAAAGCTTCATCTCTCCTTCTTCTTCATGGTTATGTCAAAGAGCAGAACACAGAGAAAGCCGAGGCTTTCATGGCGAAGCTAAAGGGCTTGGGACTGGTTGTGGATCCGCATCTTTACAATGAGATGATGAAACTGTATGTCGCCACATCTCAAAATGAGAAAGTTCCTCTTGTTATAAAGGACATGAAGCAAAATCAAATACTGAGAAATGTTCTCTCCTACAATCTTTGGATGAATGCATGCAGTGAGCTATATGGGGTTGGGTCAGTGGAATTGGTGTATGAAGAAATGCTGAGTGACAAGAATGTTCAAGTGGGATGGAGCACTCTGTCTACCCTGGCTAAAGTGTATACACAGGCAGGCCTTGTTCACAAAGCGTTTGCAGCCTTGAGAGAAGCAGAGAAGAAGCTATCCTCAGGTAACAGGCTTGGATATTTCTTCTTAATTACATTATATGCCACTTTGAACGATAAGGAGGGAGTTCTTCGAGTTTGGACAGCTAGTAAAGCTGTAAGTGACAGACCCACTTGTGCCAACTACATGTGTATATTGCTATGTTTGGTGAAGCTAGGAGATATCAATGAGGCAGAGAAAATATTCAAGGAATGGGAGCTCGACTGTCGTAATTATGACATTAGAGTCTCCAACGTTCTTCTGGGCGCATATGTGAGGAATGGACTGTTAGATCAGGCCGAGTCATTACATATGCACACATTAGAGAGAGGTGGTACTCCGAATTATAAGACATGGGAGATTCTCATGGAGGGATGTGTGAGAAGCCAAAACTTGGACAGAGCTTTCATTATTCAATCTCTGCAAGTTGCATCCATGCTACATGAACACAGCAGGCACCACACAGGGTAA

Coding sequence (CDS)

ATGACTCGGTGGCTCGGCGCTTCAGAGGTCGCCGCTCGAACGAAATCGGTGATGAAGACGACTCTCCGCTCCGGCACCGGACGAATTCATTTTCTTCTCCGAGCATGGAGGCCTATATCGTCACATACTTCGCCAGTCGACTCTTCGGACATCATTTCTGCCGTATCTCCCAAAACCCTACAAAAAGGTGGCGACAATGGCAACTTCAAGCAGGACTTTATGAAGCTCAAGCAACCTCGGAGTAGCGTGACCACCGTACTTCAGAGTGGTAACAGTGAAGCTCAAAGAACCTCAATCACTAAGCTCCGGCGTGTTGTCAAGAAGCTTTGCAAATCGAAGCGATTTGAGCGTGCACTAGAGGTACTAATGTTGATGGAAACTCAAAATAAGATCCGTATGTCCCCAGCTGATCATGCTCTCCGATTGGAATTGACCATCAAAGTGCACGGTTTGCTGAAAGCTGAAGAGCACTTCAATCAATTGCCCAATATAGCCTCTCAGAAAGCTTCATCTCTCCTTCTTCTTCATGGTTATGTCAAAGAGCAGAACACAGAGAAAGCCGAGGCTTTCATGGCGAAGCTAAAGGGCTTGGGACTGGTTGTGGATCCGCATCTTTACAATGAGATGATGAAACTGTATGTCGCCACATCTCAAAATGAGAAAGTTCCTCTTGTTATAAAGGACATGAAGCAAAATCAAATACTGAGAAATGTTCTCTCCTACAATCTTTGGATGAATGCATGCAGTGAGCTATATGGGGTTGGGTCAGTGGAATTGGTGTATGAAGAAATGCTGAGTGACAAGAATGTTCAAGTGGGATGGAGCACTCTGTCTACCCTGGCTAAAGTGTATACACAGGCAGGCCTTGTTCACAAAGCGTTTGCAGCCTTGAGAGAAGCAGAGAAGAAGCTATCCTCAGGTAACAGGCTTGGATATTTCTTCTTAATTACATTATATGCCACTTTGAACGATAAGGAGGGAGTTCTTCGAGTTTGGACAGCTAGTAAAGCTGTAAGTGACAGACCCACTTGTGCCAACTACATGTGTATATTGCTATGTTTGGTGAAGCTAGGAGATATCAATGAGGCAGAGAAAATATTCAAGGAATGGGAGCTCGACTGTCGTAATTATGACATTAGAGTCTCCAACGTTCTTCTGGGCGCATATGTGAGGAATGGACTGTTAGATCAGGCCGAGTCATTACATATGCACACATTAGAGAGAGGTGGTACTCCGAATTATAAGACATGGGAGATTCTCATGGAGGGATGTGTGAGAAGCCAAAACTTGGACAGAGCTTTCATTATTCAATCTCTGCAAGTTGCATCCATGCTACATGAACACAGCAGGCACCACACAGGGTAA

Protein sequence

MTRWLGASEVAARTKSVMKTTLRSGTGRIHFLLRAWRPISSHTSPVDSSDIISAVSPKTLQKGGDNGNFKQDFMKLKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKKLCKSKRFERALEVLMLMETQNKIRMSPADHALRLELTIKVHGLLKAEEHFNQLPNIASQKASSLLLLHGYVKEQNTEKAEAFMAKLKGLGLVVDPHLYNEMMKLYVATSQNEKVPLVIKDMKQNQILRNVLSYNLWMNACSELYGVGSVELVYEEMLSDKNVQVGWSTLSTLAKVYTQAGLVHKAFAALREAEKKLSSGNRLGYFFLITLYATLNDKEGVLRVWTASKAVSDRPTCANYMCILLCLVKLGDINEAEKIFKEWELDCRNYDIRVSNVLLGAYVRNGLLDQAESLHMHTLERGGTPNYKTWEILMEGCVRSQNLDRAFIIQSLQVASMLHEHSRHHTG
Homology
BLAST of CmaCh05G005210.1 vs. ExPASy Swiss-Prot
Match: Q3E911 (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX=3702 GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 3.4e-109
Identity = 204/389 (52.44%), Postives = 274/389 (70.44%), Query Frame = 0

Query: 49  SDIISAVSPKTLQKGGDNGNFKQDFMKLKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKK 108
           S ++S+++  +      N N  ++ ++   PR SVT++LQ         S+++LR + K+
Sbjct: 20  SRLVSSLADGSDTSSVANRNSLKEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKR 79

Query: 109 LCKSKRFERALEVLMLMETQNKIRMSPADHALRLELTIKVHGLLKAEEHFNQL----PNI 168
           L +S R++ AL+++  ME Q  I  S  D ALRL+L IK HGL + EE+F +L     ++
Sbjct: 80  LIRSNRYDLALQMMEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSM 139

Query: 169 ASQKASSLLLLHGYVKEQNTEKAEAFMAKLKGLGLVVDPHLYNEMMKLYVATSQNEKVPL 228
              K++ L LL  YVK +  ++AEA M KL GLG +V PH +NEMMKLY A+ Q EKV +
Sbjct: 140 RVAKSAYLPLLRAYVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVM 199

Query: 229 VIKDMKQNQILRNVLSYNLWMNACSELYGVGSVELVYEEMLSDKNVQVGWSTLSTLAKVY 288
           V+  MK N+I RNVLSYNLWMNAC E+ GV +VE VY+EM+ DK+V+VGWS+L TLA VY
Sbjct: 200 VVSMMKGNKIPRNVLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVY 259

Query: 289 TQAGLVHKAFAALREAEKKLSSGNRLGYFFLITLYATLNDKEGVLRVWTASKAVSDRPTC 348
            ++G   KA   L +AEK L+  NRLGYFFLITLYA+L +KEGV+R+W  SK+V  R +C
Sbjct: 260 IKSGFDEKARLVLEDAEKMLNRSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISC 319

Query: 349 ANYMCILLCLVKLGDINEAEKIFKEWELDCRNYDIRVSNVLLGAYVRNGLLDQAESLHMH 408
            NY+C+L  LVK GD+ EAE++F EWE  C NYD+RVSNVLLGAYVRNG + +AESLH  
Sbjct: 320 VNYICVLSSLVKTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGC 379

Query: 409 TLERGGTPNYKTWEILMEGCVRSQNLDRA 434
            LERGGTPNYKTWEILMEG V+ +N+++A
Sbjct: 380 VLERGGTPNYKTWEILMEGWVKCENMEKA 408

BLAST of CmaCh05G005210.1 vs. ExPASy Swiss-Prot
Match: Q8LPS6 (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 240.0 bits (611), Expect = 5.4e-62
Identity = 132/359 (36.77%), Postives = 211/359 (58.77%), Query Frame = 0

Query: 76  LKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKKLCKSKRFERALEVLMLMETQ-NKIRMS 135
           +++P     +VL       ++ +  +L RVVK+L K KR  +ALEV   M  +  + R+S
Sbjct: 76  MEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGERFRLS 135

Query: 136 PADHALRLELTIKVHGLLKAEEHFNQLPNIASQKASSLLLLHGYVKEQNTEKAEAFMAKL 195
            +D A++L+L  KV G+  AEE F QLP     +     LL+ YV+ ++ EKAEA +  +
Sbjct: 136 ASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEALLNTM 195

Query: 196 KGLGLVVDPHLYNEMMKLYVATSQNEKVPLVIKDMKQNQILRNVLSYNLWMNACSELYGV 255
           +  G  + P  +N MM LY+   + +KV  ++ +MKQ  I  ++ SYN+W+++C  L  V
Sbjct: 196 RDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSV 255

Query: 256 GSVELVYEEMLSDKNVQVGWSTLSTLAKVYTQAGLVHKAFAALREAEKKLSSGNRLGYFF 315
             +ELVY++M SD ++   W+T ST+A +Y + G   KA  ALR+ E +++  NR+ Y +
Sbjct: 256 EKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGRNRIPYHY 315

Query: 316 LITLYATLNDKEGVLRVWTASKAVSDRPTCANYMCILLCLVKLGDINEAEKIFKEWELDC 375
           L++LY +L +K+ + RVW   K+V        Y  ++  LV++GDI  AEK+++EW    
Sbjct: 316 LLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVK 375

Query: 376 RNYDIRVSNVLLGAYVRNGLLDQAESLHMHTLERGGTPNYKTWEILMEGCVRSQNLDRA 434
            +YD R+ N+L+ AYV+N  L+ AE L  H +E GG P+  TWEIL  G  R + +  A
Sbjct: 376 SSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKRCISEA 434

BLAST of CmaCh05G005210.1 vs. ExPASy Swiss-Prot
Match: O22714 (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=1 SV=1)

HSP 1 Score: 219.2 bits (557), Expect = 9.8e-56
Identity = 123/337 (36.50%), Postives = 184/337 (54.60%), Query Frame = 0

Query: 106 VKKLCKSKRFERALEVLMLMETQNKIRMSPADHALRLELTIKVHGLLKAEEHFNQLPNIA 165
           +KKL     +  AL++  +ME +  +  + +D A+ L+L  K   +   E +F  LP  +
Sbjct: 62  IKKLRNRGLYYPALKLSEVME-ERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETS 121

Query: 166 SQKASSLLLLHGYVKEQNTEKAEAFMAKLKGLGLVVDPHLYNEMMKLYVATSQNEKVPLV 225
             + +   LL+ Y KE  TEKAE  + K+K L +      YN +M LY  T + EKVP +
Sbjct: 122 KTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAM 181

Query: 226 IKDMKQNQILRNVLSYNLWMNACSELYGVGSVELVYEEMLSDKNVQVGWSTLSTLAKVYT 285
           I+++K   ++ +  +YN+WM A +    +  VE V EEM  D  V   W+T S +A +Y 
Sbjct: 182 IQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYV 241

Query: 286 QAGLVHKAFAALREAEKKLSSGNRLGYFFLITLYATLNDKEGVLRVWTASKAVSDRPTCA 345
            AGL  KA  AL+E E K +  +   Y FLITLY  L     V R+W + +    + +  
Sbjct: 242 DAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNV 301

Query: 346 NYMCILLCLVKLGDINEAEKIFKEWELDCRNYDIRVSNVLLGAYVRNGLLDQAESLHMHT 405
            Y+ ++  LVKL D+  AE +FKEW+ +C  YDIR+ NVL+GAY + GL+ +A  L    
Sbjct: 302 AYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKA 361

Query: 406 LERGGTPNYKTWEILMEGCVRSQNLDRAFIIQSLQVA 443
             RGG  N KTWEI M+  V+S ++ RA    S  V+
Sbjct: 362 PRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVS 397

BLAST of CmaCh05G005210.1 vs. ExPASy Swiss-Prot
Match: Q9SY07 (Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g02820 PE=2 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.9e-51
Identity = 130/432 (30.09%), Postives = 222/432 (51.39%), Query Frame = 0

Query: 15  KSVMKTTLRSGTGRIH--FLLRAWRPISSHTSPVDSSDIISAVSPKT-LQKGGDNGNFKQ 74
           K+++  + R     IH  F   A   + + T+PV        V P++   KGG++ N K+
Sbjct: 3   KNMLVRSARPTLASIHRLFSAAAAATVDTATAPV--------VKPRSGGGKGGESANKKE 62

Query: 75  D-----------FMKLKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKKLCKSKRFERALE 134
                        + L   + S    ++    E       +L R+V++L K KR++ ALE
Sbjct: 63  TVVGGRDTLGGRLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALE 122

Query: 135 VLMLMETQNKIRMSPADHALRLELTIKVHGLLKAEEHFNQLPNIASQKASSLLLLHGYVK 194
           +   M  Q  I++   D+A+ L+L  K+ GL  AE+ F  +P+     A+   LLH YV+
Sbjct: 123 ICEWMVVQEDIKLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQ 182

Query: 195 EQNTEKAEAFMAKLKGLGLVVDPHLYNEMMKLYVATSQNEKVPLVIKDMKQNQILRNVLS 254
            + ++KAEA   K+   G +     YN M+ +Y++  Q EKVP++IK++K  +   ++++
Sbjct: 183 NKLSDKAEALFEKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELK-IRTSPDIVT 242

Query: 255 YNLWMNACSELYGVGSVELVYEEMLSDKNVQVGWSTLSTLAKVYTQAGLVHKAFAALREA 314
           YNLW+ A +    V   E VY +   +K +   W T S L  +Y +   V KA  AL+E 
Sbjct: 243 YNLWLTAFASGNDVEGAEKVYLKAKEEK-LNPDWVTYSVLTNLYAKTDNVEKARLALKEM 302

Query: 315 EKKLSSGNRLGYFFLITLYATLNDKEGVLRVWTASKAVSDRPTCANYMCILLCLVKLGDI 374
           EK +S  NR+ Y  LI+L+A L DK+GV   W   K+   +   A Y+ ++  +VKLG+ 
Sbjct: 303 EKLVSKKNRVAYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEF 362

Query: 375 NEAEKIFKEWELDCRNYDIRVSNVLLGAYVRNGLLDQAESLHMHTLERGGTPNYKTWEIL 433
            +A+ ++ EWE      D R+ N++L  Y+    +   E  +   +E+G  P+Y TWEIL
Sbjct: 363 EQAKGLYDEWESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEIL 422

BLAST of CmaCh05G005210.1 vs. ExPASy Swiss-Prot
Match: Q84JR3 (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 3.1e-49
Identity = 116/359 (32.31%), Postives = 185/359 (51.53%), Query Frame = 0

Query: 76  LKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKKLCKSKRFERALEVLMLMETQNKIRMSP 135
           L  P+SSV   LQ+     ++ S+ +L R+V  L + KRF  ALEV   M        SP
Sbjct: 34  LGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSP 93

Query: 136 ADHALRLELTIKVHGLLKAEEHFNQLPNIASQKASSLLLLHGYVKEQNTEKAEAFMAKLK 195
            +HA+ L+L  +V+G + AEE+F  L        +   LL+ YV++QN EK+     K+K
Sbjct: 94  TEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMK 153

Query: 196 GLGLVVDPHLYNEMMKLYVATSQNEKVPLVIKDMKQNQILRNVLSYNLWMNACSELYGVG 255
            +G V     YN +M LY    Q+EKVP V+++MK+  +  +  SY + +NA   +Y + 
Sbjct: 154 EMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLE 213

Query: 256 SVELVYEEMLSDKNVQVGWSTLSTLAKVYTQAGLVHKAFAALREAEKKLSSGNRLGYFFL 315
            +     +M   +++ + W+T +  AK Y   G   +A   L+ +E +L   +  GY  L
Sbjct: 214 RIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRAVELLKMSENRLEKKDGEGYNHL 273

Query: 316 ITLYATLNDKEGVLRVWTASKAVSDRPTCANYMCILLCLVKLGDINEAEKIFKEWELDCR 375
           ITLYA L  K  VLR+W   K V  R    +Y+ +L  LVK+  + EAE++  EW+    
Sbjct: 274 ITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGN 333

Query: 376 NYDIRVSNVLLGAYVRNGLLDQAESLHMHTLERGGTPNYKTWEILMEGCVRSQNLDRAF 435
            YD RV N ++  Y+   + ++AE++      RG     ++WE++         L+ AF
Sbjct: 334 CYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAF 392

BLAST of CmaCh05G005210.1 vs. TAIR 10
Match: AT5G27460.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 396.7 bits (1018), Expect = 2.4e-110
Identity = 204/389 (52.44%), Postives = 274/389 (70.44%), Query Frame = 0

Query: 49  SDIISAVSPKTLQKGGDNGNFKQDFMKLKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKK 108
           S ++S+++  +      N N  ++ ++   PR SVT++LQ         S+++LR + K+
Sbjct: 20  SRLVSSLADGSDTSSVANRNSLKEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKR 79

Query: 109 LCKSKRFERALEVLMLMETQNKIRMSPADHALRLELTIKVHGLLKAEEHFNQL----PNI 168
           L +S R++ AL+++  ME Q  I  S  D ALRL+L IK HGL + EE+F +L     ++
Sbjct: 80  LIRSNRYDLALQMMEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSM 139

Query: 169 ASQKASSLLLLHGYVKEQNTEKAEAFMAKLKGLGLVVDPHLYNEMMKLYVATSQNEKVPL 228
              K++ L LL  YVK +  ++AEA M KL GLG +V PH +NEMMKLY A+ Q EKV +
Sbjct: 140 RVAKSAYLPLLRAYVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVM 199

Query: 229 VIKDMKQNQILRNVLSYNLWMNACSELYGVGSVELVYEEMLSDKNVQVGWSTLSTLAKVY 288
           V+  MK N+I RNVLSYNLWMNAC E+ GV +VE VY+EM+ DK+V+VGWS+L TLA VY
Sbjct: 200 VVSMMKGNKIPRNVLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVY 259

Query: 289 TQAGLVHKAFAALREAEKKLSSGNRLGYFFLITLYATLNDKEGVLRVWTASKAVSDRPTC 348
            ++G   KA   L +AEK L+  NRLGYFFLITLYA+L +KEGV+R+W  SK+V  R +C
Sbjct: 260 IKSGFDEKARLVLEDAEKMLNRSNRLGYFFLITLYASLGNKEGVVRLWEVSKSVCGRISC 319

Query: 349 ANYMCILLCLVKLGDINEAEKIFKEWELDCRNYDIRVSNVLLGAYVRNGLLDQAESLHMH 408
            NY+C+L  LVK GD+ EAE++F EWE  C NYD+RVSNVLLGAYVRNG + +AESLH  
Sbjct: 320 VNYICVLSSLVKTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGC 379

Query: 409 TLERGGTPNYKTWEILMEGCVRSQNLDRA 434
            LERGGTPNYKTWEILMEG V+ +N+++A
Sbjct: 380 VLERGGTPNYKTWEILMEGWVKCENMEKA 408

BLAST of CmaCh05G005210.1 vs. TAIR 10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 240.0 bits (611), Expect = 3.8e-63
Identity = 132/359 (36.77%), Postives = 211/359 (58.77%), Query Frame = 0

Query: 76  LKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKKLCKSKRFERALEVLMLMETQ-NKIRMS 135
           +++P     +VL       ++ +  +L RVVK+L K KR  +ALEV   M  +  + R+S
Sbjct: 76  MEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNRGERFRLS 135

Query: 136 PADHALRLELTIKVHGLLKAEEHFNQLPNIASQKASSLLLLHGYVKEQNTEKAEAFMAKL 195
            +D A++L+L  KV G+  AEE F QLP     +     LL+ YV+ ++ EKAEA +  +
Sbjct: 136 ASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKAEALLNTM 195

Query: 196 KGLGLVVDPHLYNEMMKLYVATSQNEKVPLVIKDMKQNQILRNVLSYNLWMNACSELYGV 255
           +  G  + P  +N MM LY+   + +KV  ++ +MKQ  I  ++ SYN+W+++C  L  V
Sbjct: 196 RDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSV 255

Query: 256 GSVELVYEEMLSDKNVQVGWSTLSTLAKVYTQAGLVHKAFAALREAEKKLSSGNRLGYFF 315
             +ELVY++M SD ++   W+T ST+A +Y + G   KA  ALR+ E +++  NR+ Y +
Sbjct: 256 EKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGRNRIPYHY 315

Query: 316 LITLYATLNDKEGVLRVWTASKAVSDRPTCANYMCILLCLVKLGDINEAEKIFKEWELDC 375
           L++LY +L +K+ + RVW   K+V        Y  ++  LV++GDI  AEK+++EW    
Sbjct: 316 LLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVK 375

Query: 376 RNYDIRVSNVLLGAYVRNGLLDQAESLHMHTLERGGTPNYKTWEILMEGCVRSQNLDRA 434
            +YD R+ N+L+ AYV+N  L+ AE L  H +E GG P+  TWEIL  G  R + +  A
Sbjct: 376 SSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHTRKRCISEA 434

BLAST of CmaCh05G005210.1 vs. TAIR 10
Match: AT1G60770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 219.2 bits (557), Expect = 7.0e-57
Identity = 123/337 (36.50%), Postives = 184/337 (54.60%), Query Frame = 0

Query: 106 VKKLCKSKRFERALEVLMLMETQNKIRMSPADHALRLELTIKVHGLLKAEEHFNQLPNIA 165
           +KKL     +  AL++  +ME +  +  + +D A+ L+L  K   +   E +F  LP  +
Sbjct: 62  IKKLRNRGLYYPALKLSEVME-ERGMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETS 121

Query: 166 SQKASSLLLLHGYVKEQNTEKAEAFMAKLKGLGLVVDPHLYNEMMKLYVATSQNEKVPLV 225
             + +   LL+ Y KE  TEKAE  + K+K L +      YN +M LY  T + EKVP +
Sbjct: 122 KTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPAM 181

Query: 226 IKDMKQNQILRNVLSYNLWMNACSELYGVGSVELVYEEMLSDKNVQVGWSTLSTLAKVYT 285
           I+++K   ++ +  +YN+WM A +    +  VE V EEM  D  V   W+T S +A +Y 
Sbjct: 182 IQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIYV 241

Query: 286 QAGLVHKAFAALREAEKKLSSGNRLGYFFLITLYATLNDKEGVLRVWTASKAVSDRPTCA 345
            AGL  KA  AL+E E K +  +   Y FLITLY  L     V R+W + +    + +  
Sbjct: 242 DAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNV 301

Query: 346 NYMCILLCLVKLGDINEAEKIFKEWELDCRNYDIRVSNVLLGAYVRNGLLDQAESLHMHT 405
            Y+ ++  LVKL D+  AE +FKEW+ +C  YDIR+ NVL+GAY + GL+ +A  L    
Sbjct: 302 AYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKA 361

Query: 406 LERGGTPNYKTWEILMEGCVRSQNLDRAFIIQSLQVA 443
             RGG  N KTWEI M+  V+S ++ RA    S  V+
Sbjct: 362 PRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVS 397

BLAST of CmaCh05G005210.1 vs. TAIR 10
Match: AT4G02820.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 204.9 bits (520), Expect = 1.4e-52
Identity = 130/432 (30.09%), Postives = 222/432 (51.39%), Query Frame = 0

Query: 15  KSVMKTTLRSGTGRIH--FLLRAWRPISSHTSPVDSSDIISAVSPKT-LQKGGDNGNFKQ 74
           K+++  + R     IH  F   A   + + T+PV        V P++   KGG++ N K+
Sbjct: 3   KNMLVRSARPTLASIHRLFSAAAAATVDTATAPV--------VKPRSGGGKGGESANKKE 62

Query: 75  D-----------FMKLKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKKLCKSKRFERALE 134
                        + L   + S    ++    E       +L R+V++L K KR++ ALE
Sbjct: 63  TVVGGRDTLGGRLLSLVYTKRSAVVTIRKWKEEGHSVRKYELNRIVRELRKIKRYKHALE 122

Query: 135 VLMLMETQNKIRMSPADHALRLELTIKVHGLLKAEEHFNQLPNIASQKASSLLLLHGYVK 194
           +   M  Q  I++   D+A+ L+L  K+ GL  AE+ F  +P+     A+   LLH YV+
Sbjct: 123 ICEWMVVQEDIKLQAGDYAVHLDLISKIRGLNSAEKFFEDMPDQMRGHAACTSLLHSYVQ 182

Query: 195 EQNTEKAEAFMAKLKGLGLVVDPHLYNEMMKLYVATSQNEKVPLVIKDMKQNQILRNVLS 254
            + ++KAEA   K+   G +     YN M+ +Y++  Q EKVP++IK++K  +   ++++
Sbjct: 183 NKLSDKAEALFEKMGECGFLKSCLPYNHMLSMYISRGQFEKVPVLIKELK-IRTSPDIVT 242

Query: 255 YNLWMNACSELYGVGSVELVYEEMLSDKNVQVGWSTLSTLAKVYTQAGLVHKAFAALREA 314
           YNLW+ A +    V   E VY +   +K +   W T S L  +Y +   V KA  AL+E 
Sbjct: 243 YNLWLTAFASGNDVEGAEKVYLKAKEEK-LNPDWVTYSVLTNLYAKTDNVEKARLALKEM 302

Query: 315 EKKLSSGNRLGYFFLITLYATLNDKEGVLRVWTASKAVSDRPTCANYMCILLCLVKLGDI 374
           EK +S  NR+ Y  LI+L+A L DK+GV   W   K+   +   A Y+ ++  +VKLG+ 
Sbjct: 303 EKLVSKKNRVAYASLISLHANLGDKDGVNLTWKKVKSSFKKMNDAEYLSMISAVVKLGEF 362

Query: 375 NEAEKIFKEWELDCRNYDIRVSNVLLGAYVRNGLLDQAESLHMHTLERGGTPNYKTWEIL 433
            +A+ ++ EWE      D R+ N++L  Y+    +   E  +   +E+G  P+Y TWEIL
Sbjct: 363 EQAKGLYDEWESVSGTGDARIPNLILAEYMNRDEVLLGEKFYERIVEKGINPSYSTWEIL 422

BLAST of CmaCh05G005210.1 vs. TAIR 10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 197.6 bits (501), Expect = 2.2e-50
Identity = 116/359 (32.31%), Postives = 185/359 (51.53%), Query Frame = 0

Query: 76  LKQPRSSVTTVLQSGNSEAQRTSITKLRRVVKKLCKSKRFERALEVLMLMETQNKIRMSP 135
           L  P+SSV   LQ+     ++ S+ +L R+V  L + KRF  ALEV   M        SP
Sbjct: 34  LGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSP 93

Query: 136 ADHALRLELTIKVHGLLKAEEHFNQLPNIASQKASSLLLLHGYVKEQNTEKAEAFMAKLK 195
            +HA+ L+L  +V+G + AEE+F  L        +   LL+ YV++QN EK+     K+K
Sbjct: 94  TEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMK 153

Query: 196 GLGLVVDPHLYNEMMKLYVATSQNEKVPLVIKDMKQNQILRNVLSYNLWMNACSELYGVG 255
            +G V     YN +M LY    Q+EKVP V+++MK+  +  +  SY + +NA   +Y + 
Sbjct: 154 EMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLE 213

Query: 256 SVELVYEEMLSDKNVQVGWSTLSTLAKVYTQAGLVHKAFAALREAEKKLSSGNRLGYFFL 315
            +     +M   +++ + W+T +  AK Y   G   +A   L+ +E +L   +  GY  L
Sbjct: 214 RIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDRAVELLKMSENRLEKKDGEGYNHL 273

Query: 316 ITLYATLNDKEGVLRVWTASKAVSDRPTCANYMCILLCLVKLGDINEAEKIFKEWELDCR 375
           ITLYA L  K  VLR+W   K V  R    +Y+ +L  LVK+  + EAE++  EW+    
Sbjct: 274 ITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGN 333

Query: 376 NYDIRVSNVLLGAYVRNGLLDQAESLHMHTLERGGTPNYKTWEILMEGCVRSQNLDRAF 435
            YD RV N ++  Y+   + ++AE++      RG     ++WE++         L+ AF
Sbjct: 334 CYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAF 392

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q3E9113.4e-10952.44Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX... [more]
Q8LPS65.4e-6236.77Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX... [more]
O227149.8e-5636.50Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX... [more]
Q9SY071.9e-5130.09Pentatricopeptide repeat-containing protein At4g02820, mitochondrial OS=Arabidop... [more]
Q84JR33.1e-4932.31Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT5G27460.12.4e-11052.44Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G02150.13.8e-6336.77Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G60770.17.0e-5736.50Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G02820.11.4e-5230.09Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G21705.12.2e-5032.31Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 69..255
e-value: 2.5E-12
score: 48.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 282..452
e-value: 3.9E-21
score: 77.7
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 194..248
e-value: 0.0013
score: 18.8
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 347..370
e-value: 0.2
score: 12.0
coord: 107..127
e-value: 1.4
score: 9.4
coord: 383..404
e-value: 0.054
score: 13.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 378..412
score: 9.229472
NoneNo IPR availablePANTHERPTHR45717:SF13OS02G0796400 PROTEINcoord: 42..436
NoneNo IPR availablePANTHERPTHR45717OS12G0527900 PROTEINcoord: 42..436

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmaCh05G005210CmaCh05G005210gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh05G005210.1:exon:2196CmaCh05G005210.1:exon:2196exon
CmaCh05G005210.1:exon:2197CmaCh05G005210.1:exon:2197exon
CmaCh05G005210.1:exon:2198CmaCh05G005210.1:exon:2198exon
CmaCh05G005210.1:exon:2199CmaCh05G005210.1:exon:2199exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh05G005210.1:five_prime_utrCmaCh05G005210.1:five_prime_utrfive_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmaCh05G005210.1:cdsCmaCh05G005210.1:cdsCDS
CmaCh05G005210.1:cdsCmaCh05G005210.1:cds_2CDS
CmaCh05G005210.1:cdsCmaCh05G005210.1:cds_3CDS
CmaCh05G005210.1:cdsCmaCh05G005210.1:cds_4CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmaCh05G005210.1CmaCh05G005210.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005739 mitochondrion
molecular_function GO:0005515 protein binding