Cla021114 (gene) Watermelon (97103) v1

Name: Cla021114
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7M6Q4_ARALL); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr5 : 535407 .. 537230 (-)
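
As an illustration, the location field of this record can be parsed to recover the chromosome, strand, and span of the feature. This is a minimal sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr5 : 535407 .. 537230 (-)'."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("Chr5 : 535407 .. 537230 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 1824 bases on the minus strand of Chr5.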
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGGATGGAAGAATGAGATCAAAGCTTTGTGCATTCTCGCTTCTGGGTCGTACGTGTTTCAGTTGGTCTTCTATTAAATCCACCCAACATGTCGAACTCGGAGCTCTTACCAATGCTACCATGGCCTCTGGTTTTAGTTTGTGGCCTACACGCATTCATACTGCCGACACTGCTACTGCATTGAACTCCAATGTTGGGGATCTCTTCTCGTTTGGCTTTTGTTCTCGTTCTTATATCTCTCCGACCTCAAACCACGATATTTTGAATGCAACTTCCAAAGAATCTTGTCACTATGCAGCCAGTGATGATCATGTTAGTGGTGATGATGACGATGATCGTGAAGAACATGAAGAAGAGGACATGAACATCAATGACGAGGGTGTGATACAAGATGTAGATGCCATTATGGATATATTTCGTGGGTTCAGGAATGTAAATCGAACTCAAGTGAAAAATAAGCTTGAGCATTGCTGTATTAAGGTATCAGGAGAGTTGGTAGTAGCAGTACTTTCTCGGATCCGTAATGATTGGGAAGCAGCATTCACTTTTTTTGTTTGGGCTGGTAAGCAGCCTGGTTATGCCCACTCTGTGCGTGAATATCATTCAATGATTTCAATTCTTGGCAAAATGAGAAAGTTCGACACGGCATGGGCCTTGATCGATGAGATGAGAGGAGGGACAACGGGCTCTTCTCTAGTGAGCCCTCAAACCCTTCTAATCATGATTAGGAAATACTGTGCTGTTCATGATGTTGGTAAAGCTATAAATACCTTCCATGCTCATAAAAGATTTGGATTTAACATTGGACTAGAGGAATTTCAGAACCTTCTCTCCGCCCTCTGCCGTTACAAAAATGTGAAAGATGCTGAATATTTGCTGTTTTGCAACAAAGATGTGTTCCCATTTAACACAAAGAGTTTCAATATTATCCTCAATGGGTGGTGTAATATAATTGGCAGTCTTCGGGATGCCGAGAGAGTTTGGAAGGAGATGATGCAGAGAGGTATCAGTCATGATGCTGTCTCATATGCAAGTTTCATTTCTTGTTACTCAAAAACTCGCAACCTTTATAAGGTGCTCAGGCTTTTTGACGATATGAAGCAAATGAAAGTTGAACCTGATAGGAAAGTCTACAATGCAGTCATTCACGCTCTTGCCAAAGGTAGGCTTCTGAAAGAAGCTATCAATCTCATAAAAACAATGGAAGATAAGGGTATTATTGCAAATGTTGTTACTTATAACTCAGTAATCAAACCTCTATGCAAGGCCTGGAAATTTGATGAAGCTAGAGCGGTTTTTGATGAGCTGTTGCAGCGGGGCCTCTGCCCAACAATTCAGACTTATCATGCCTTCTTGAGATTCCTGAGAACGGAGGAAGAAATATTTGAGCTCTTGGAGAAGATGAGAAAAATGGGGTGCGATCCAACCGCTGAAACCTACATAATGTTGATCAGAAAGTTTTGCCGATGGCGCCAGCTTGATAATGTTTCCAGAATATGGCATGAAATGAGTGAAAATGGAATTAGTCCTGATCGCAGCTCTTATATGGTCTTAATACATGGTCTCTTCTTGAATGGAAAATTAGAAGATGCATATAAATACTATTTGGAAATGAAGGAAAAAGAGTTGCTTCCTGAACCAAAGATAGATGAGATACTACAAGCTTGGCTTGCTGGTAAGCCAATGTTTCAAGAGAAGCCATCAGATTGTTCTGGGGAAGGCAAGAACTCCAGATTGTTCCCCAAAAGGATTGATTTTCACCGACAACCTGAGATTAGAAAGGTTTCAAGAGATTGTGGTTTTTCATTTTGGAAGCAGTAG

mRNA sequence

ATGTGGGATGGAAGAATGAGATCAAAGCTTTGTGCATTCTCGCTTCTGGGTCGTACGTGTTTCAGTTGGTCTTCTATTAAATCCACCCAACATGTCGAACTCGGAGCTCTTACCAATGCTACCATGGCCTCTGGTTTTAGTTTGTGGCCTACACGCATTCATACTGCCGACACTGCTACTGCATTGAACTCCAATGTTGGGGATCTCTTCTCGTTTGGCTTTTGTTCTCGTTCTTATATCTCTCCGACCTCAAACCACGATATTTTGAATGCAACTTCCAAAGAATCTTGTCACTATGCAGCCAGTGATGATCATGTTAGTGGTGATGATGACGATGATCGTGAAGAACATGAAGAAGAGGACATGAACATCAATGACGAGGGTGTGATACAAGATGTAGATGCCATTATGGATATATTTCGTGGGTTCAGGAATGTAAATCGAACTCAAGTGAAAAATAAGCTTGAGCATTGCTGTATTAAGGTATCAGGAGAGTTGGTAGTAGCAGTACTTTCTCGGATCCGTAATGATTGGGAAGCAGCATTCACTTTTTTTGTTTGGGCTGGTAAGCAGCCTGGTTATGCCCACTCTGTGCGTGAATATCATTCAATGATTTCAATTCTTGGCAAAATGAGAAAGTTCGACACGGCATGGGCCTTGATCGATGAGATGAGAGGAGGGACAACGGGCTCTTCTCTAGTGAGCCCTCAAACCCTTCTAATCATGATTAGGAAATACTGTGCTGTTCATGATGTTGGTAAAGCTATAAATACCTTCCATGCTCATAAAAGATTTGGATTTAACATTGGACTAGAGGAATTTCAGAACCTTCTCTCCGCCCTCTGCCGTTACAAAAATGTGAAAGATGCTGAATATTTGCTGTTTTGCAACAAAGATGTGTTCCCATTTAACACAAAGAGTTTCAATATTATCCTCAATGGGTGGTGTAATATAATTGGCAGTCTTCGGGATGCCGAGAGAGTTTGGAAGGAGATGATGCAGAGAGGTATCAGTCATGATGCTGTCTCATATGCAAGTTTCATTTCTTGTTACTCAAAAACTCGCAACCTTTATAAGGTGCTCAGGCTTTTTGACGATATGAAGCAAATGAAAGTTGAACCTGATAGGAAAGTCTACAATGCAGTCATTCACGCTCTTGCCAAAGGTAGGCTTCTGAAAGAAGCTATCAATCTCATAAAAACAATGGAAGATAAGGGTATTATTGCAAATGTTGTTACTTATAACTCAGTAATCAAACCTCTATGCAAGGCCTGGAAATTTGATGAAGCTAGAGCGGTTTTTGATGAGCTGTTGCAGCGGGGCCTCTGCCCAACAATTCAGACTTATCATGCCTTCTTGAGATTCCTGAGAACGGAGGAAGAAATATTTGAGCTCTTGGAGAAGATGAGAAAAATGGGGTGCGATCCAACCGCTGAAACCTACATAATGTTGATCAGAAAGTTTTGCCGATGGCGCCAGCTTGATAATGTTTCCAGAATATGGCATGAAATGAGTGAAAATGGAATTAGTCCTGATCGCAGCTCTTATATGGTCTTAATACATGGTCTCTTCTTGAATGGAAAATTAGAAGATGCATATAAATACTATTTGGAAATGAAGGAAAAAGAGTTGCTTCCTGAACCAAAGATAGATGAGATACTACAAGCTTGGCTTGCTGGTAAGCCAATGTTTCAAGAGAAGCCATCAGATTGTTCTGGGGAAGGCAAGAACTCCAGATTGTTCCCCAAAAGGATTGATTTTCACCGACAACCTGAGATTAGAAAGGTTTCAAGAGATTGTGGTTTTTCATTTTGGAAGCAGTAG

Coding sequence (CDS)

ATGTGGGATGGAAGAATGAGATCAAAGCTTTGTGCATTCTCGCTTCTGGGTCGTACGTGTTTCAGTTGGTCTTCTATTAAATCCACCCAACATGTCGAACTCGGAGCTCTTACCAATGCTACCATGGCCTCTGGTTTTAGTTTGTGGCCTACACGCATTCATACTGCCGACACTGCTACTGCATTGAACTCCAATGTTGGGGATCTCTTCTCGTTTGGCTTTTGTTCTCGTTCTTATATCTCTCCGACCTCAAACCACGATATTTTGAATGCAACTTCCAAAGAATCTTGTCACTATGCAGCCAGTGATGATCATGTTAGTGGTGATGATGACGATGATCGTGAAGAACATGAAGAAGAGGACATGAACATCAATGACGAGGGTGTGATACAAGATGTAGATGCCATTATGGATATATTTCGTGGGTTCAGGAATGTAAATCGAACTCAAGTGAAAAATAAGCTTGAGCATTGCTGTATTAAGGTATCAGGAGAGTTGGTAGTAGCAGTACTTTCTCGGATCCGTAATGATTGGGAAGCAGCATTCACTTTTTTTGTTTGGGCTGGTAAGCAGCCTGGTTATGCCCACTCTGTGCGTGAATATCATTCAATGATTTCAATTCTTGGCAAAATGAGAAAGTTCGACACGGCATGGGCCTTGATCGATGAGATGAGAGGAGGGACAACGGGCTCTTCTCTAGTGAGCCCTCAAACCCTTCTAATCATGATTAGGAAATACTGTGCTGTTCATGATGTTGGTAAAGCTATAAATACCTTCCATGCTCATAAAAGATTTGGATTTAACATTGGACTAGAGGAATTTCAGAACCTTCTCTCCGCCCTCTGCCGTTACAAAAATGTGAAAGATGCTGAATATTTGCTGTTTTGCAACAAAGATGTGTTCCCATTTAACACAAAGAGTTTCAATATTATCCTCAATGGGTGGTGTAATATAATTGGCAGTCTTCGGGATGCCGAGAGAGTTTGGAAGGAGATGATGCAGAGAGGTATCAGTCATGATGCTGTCTCATATGCAAGTTTCATTTCTTGTTACTCAAAAACTCGCAACCTTTATAAGGTGCTCAGGCTTTTTGACGATATGAAGCAAATGAAAGTTGAACCTGATAGGAAAGTCTACAATGCAGTCATTCACGCTCTTGCCAAAGGTAGGCTTCTGAAAGAAGCTATCAATCTCATAAAAACAATGGAAGATAAGGGTATTATTGCAAATGTTGTTACTTATAACTCAGTAATCAAACCTCTATGCAAGGCCTGGAAATTTGATGAAGCTAGAGCGGTTTTTGATGAGCTGTTGCAGCGGGGCCTCTGCCCAACAATTCAGACTTATCATGCCTTCTTGAGATTCCTGAGAACGGAGGAAGAAATATTTGAGCTCTTGGAGAAGATGAGAAAAATGGGGTGCGATCCAACCGCTGAAACCTACATAATGTTGATCAGAAAGTTTTGCCGATGGCGCCAGCTTGATAATGTTTCCAGAATATGGCATGAAATGAGTGAAAATGGAATTAGTCCTGATCGCAGCTCTTATATGGTCTTAATACATGGTCTCTTCTTGAATGGAAAATTAGAAGATGCATATAAATACTATTTGGAAATGAAGGAAAAAGAGTTGCTTCCTGAACCAAAGATAGATGAGATACTACAAGCTTGGCTTGCTGGTAAGCCAATGTTTCAAGAGAAGCCATCAGATTGTTCTGGGGAAGGCAAGAACTCCAGATTGTTCCCCAAAAGGATTGATTTTCACCGACAACCTGAGATTAGAAAGGTTTCAAGAGATTGTGGTTTTTCATTTTGGAAGCAGTAG

Protein sequence

MWDGRMRSKLCAFSLLGRTCFSWSSIKSTQHVELGALTNATMASGFSLWPTRIHTADTATALNSNVGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGDDDDDREEHEEEDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWEAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTLLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTAETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEMKEKELLPEPKIDEILQAWLAGKPMFQEKPSDCSGEGKNSRLFPKRIDFHRQPEIRKVSRDCGFSFWKQ
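
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table), and the `translate` helper is a hypothetical name introduced for illustration. Only the first and last codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last three codons of the CDS shown above.
print(translate("ATGTGGGATGGAAGAATGAGATCAAAGCTT"))
print(translate("AAGCAGTAG"))
```

The two fragments translate to the first ten residues (MWDGRMRSKL) and the last two residues (KQ) of the protein sequence, with the final TAG read as the stop codon.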
BLAST of Cla021114 vs. Swiss-Prot
Match: PP383_ARATH (Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidopsis thaliana GN=At5g15010 PE=2 SV=2)

HSP 1 Score: 633.6 bits (1633), Expect = 2.2e-180
Identity = 314/514 (61.09%), Positives = 390/514 (75.88%), Query Frame = 1

Query: 102 SDDHVSGDDDDDREEHEEEDMN----INDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEH 161
           +D    G    + E+ +E D++    I+DE V +DV  I  + +   + +R +++NKLE 
Sbjct: 62  ADSEQVGFTRSNIEKDDESDIDLGCSISDELVSEDVGKISKLVKDCGS-DRKELRNKLEE 121

Query: 162 CCIKVSGELVVAVLSRIRNDWEAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTA 221
           C +K S ELVV +LSR+RNDWE AFTFFVWAGKQ GY  SVREYHSMISILGKMRKFDTA
Sbjct: 122 CDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGKMRKFDTA 181

Query: 222 WALIDEMRGGTTGSSLVSPQTLLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNL 281
           W LIDEMR      SLV+ QTLLIMIRKYCAVHDVGKAINTFHA+KRF   +G+++FQ+L
Sbjct: 182 WTLIDEMR--KFSPSLVNSQTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGIDDFQSL 241

Query: 282 LSALCRYKNVKDAEYLLFCNKDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGI 341
           LSALCRYKNV DA +L+FCNKD +PF+ KSFNI+LNGWCN+IGS R+AERVW EM   G+
Sbjct: 242 LSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEMGNVGV 301

Query: 342 SHDAVSYASFISCYSKTRNLYKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAIN 401
            HD VSY+S ISCYSK  +L KVL+LFD MK+  +EPDRKVYNAV+HALAK   + EA N
Sbjct: 302 KHDVVSYSSMISCYSKGGSLNKVLKLFDRMKKECIEPDRKVYNAVVHALAKASFVSEARN 361

Query: 402 LIKTM-EDKGIIANVVTYNSVIKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFL 461
           L+KTM E+KGI  NVVTYNS+IKPLCKA K +EA+ VFDE+L++GL PTI+TYHAF+R L
Sbjct: 362 LMKTMEEEKGIEPNVVTYNSLIKPLCKARKTEEAKQVFDEMLEKGLFPTIRTYHAFMRIL 421

Query: 462 RTEEEIFELLEKMRKMGCDPTAETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSY 521
           RT EE+FELL KMRKMGC+PT ETYIMLIRK CRWR  DNV  +W EM E  + PD SSY
Sbjct: 422 RTGEEVFELLAKMRKMGCEPTVETYIMLIRKLCRWRDFDNVLLLWDEMKEKTVGPDLSSY 481

Query: 522 MVLIHGLFLNGKLEDAYKYYLEMKEKELLPEPKIDEILQAWLAGKPMFQEKPSDCSGEGK 581
           +V+IHGLFLNGK+E+AY YY EMK+K + P   +++++Q+W +GK   +++ +D  GE  
Sbjct: 482 IVMIHGLFLNGKIEEAYGYYKEMKDKGMRPNENVEDMIQSWFSGKQYAEQRITDSKGEVN 541

Query: 582 NSRLFPK---RIDFHRQPEIRKVSRDCGFSFWKQ 608
              +  K     +F +QPE+RKV R+ G+SFW +
Sbjct: 542 KGAIVKKSEREKNFLQQPEVRKVVREHGYSFWDE 572

BLAST of Cla021114 vs. Swiss-Prot
Match: PP233_ARATH (Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis thaliana GN=At3g15200 PE=3 SV=1)

HSP 1 Score: 263.1 bits (671), Expect = 7.7e-69
Identity = 142/432 (32.87%), Positives = 244/432 (56.48%), Query Frame = 1

Query: 125 NDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWEAAFTF 184
           ND+   Q    + +I +  R  +  ++K  L+ C I ++ ELV+ V++R R+DW+ A+  
Sbjct: 70  NDDKDKQSALDVHNIIKHHRGSSPEKIKRILDKCGIDLTEELVLEVVNRNRSDWKPAYIL 129

Query: 185 FVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTLLIMIR 244
                KQ  +  S   Y+ ++ +LGKMR+F+    + DEM   +     V+ +T  +++ 
Sbjct: 130 SQLVVKQSVHLSSSMLYNEILDVLGKMRRFEEFHQVFDEM---SKRDGFVNEKTYEVLLN 189

Query: 245 KYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKDVFPFN 304
           +Y A H V +A+  F   K FG +  L  F  LL  LCRYK+V+ AE L    +  F  +
Sbjct: 190 RYAAAHKVDEAVGVFERRKEFGIDDDLVAFHGLLMWLCRYKHVEFAETLFCSRRREFGCD 249

Query: 305 TKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYKVLRLF 364
            K+ N+ILNGWC ++G++ +A+R WK+++      D VSY + I+  +K   L K + L+
Sbjct: 250 IKAMNMILNGWC-VLGNVHEAKRFWKDIIASKCRPDVVSYGTMINALTKKGKLGKAMELY 309

Query: 365 DDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIKPLCKA 424
             M   +  PD K+ N VI AL   + + EA+ + + + +KG   NVVTYNS++K LCK 
Sbjct: 310 RAMWDTRRNPDVKICNNVIDALCFKKRIPEALEVFREISEKGPDPNVVTYNSLLKHLCKI 369

Query: 425 WKFDEARAVFDEL-LQRGLC-PTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTAETYI 484
            + ++   + +E+ L+ G C P   T+   L++ +  +++  +LE+M K  C+ T++ Y 
Sbjct: 370 RRTEKVWELVEEMELKGGSCSPNDVTFSYLLKYSQRSKDVDIVLERMAKNKCEMTSDLYN 429

Query: 485 MLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEMKEK 544
           ++ R + +W + + V  IW EM  +G+ PD+ +Y + IHGL   GK+ +A  Y+ EM  K
Sbjct: 430 LMFRLYVQWDKEEKVREIWSEMERSGLGPDQRTYTIRIHGLHTKGKIGEALSYFQEMMSK 489

Query: 545 ELLPEPKIDEIL 555
            ++PEP+ + +L
Sbjct: 490 GMVPEPRTEMLL 497

BLAST of Cla021114 vs. Swiss-Prot
Match: PP137_ARATH (Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidopsis thaliana GN=At1g80880 PE=2 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 5.0e-68
Identity = 144/437 (32.95%), Positives = 241/437 (55.15%), Query Frame = 1

Query: 120 EDMNINDEGVIQDVDAIMDIFRGFRNV-NRTQVKNKLEHCCIKVSGELVVAVLSRIRNDW 179
           E  +IN   +      ++D+ R    + +       LE     ++ +   +++  +R++W
Sbjct: 78  ETFDINLTALAPLEKGLIDLIRQVSELESEADAMASLEDSSFDLNHDSFYSLIWELRDEW 137

Query: 180 EAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQT 239
             AF  F W  K+       +    MI +LG  +KF+ AW LI +M       S  + + 
Sbjct: 138 RLAFLAFKWGEKRG--CDDQKSCDLMIWVLGNHQKFNIAWCLIRDM----FNVSKDTRKA 197

Query: 240 LLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNK 299
           + +M+ +Y A +D  +AI TF    +F      E FQ LL ALCR+ +++ AE  +  +K
Sbjct: 198 MFLMMDRYAAANDTSQAIRTFDIMDKFKHTPYDEAFQGLLCALCRHGHIEKAEEFMLASK 257

Query: 300 DVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLY 359
            +FP + + FN+ILNGWCNI   + +A+R+W+EM    I+ +  SY+  ISC+SK  NL+
Sbjct: 258 KLFPVDVEGFNVILNGWCNIWTDVTEAKRIWREMGNYCITPNKDSYSHMISCFSKVGNLF 317

Query: 360 KVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVI 419
             LRL+D+MK+  + P  +VYN++++ L +     EA+ L+K + ++G+  + VTYNS+I
Sbjct: 318 DSLRLYDEMKKRGLAPGIEVYNSLVYVLTREDCFDEAMKLMKKLNEEGLKPDSVTYNSMI 377

Query: 420 KPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTA 479
           +PLC+A K D AR V   ++   L PT+ T+HAFL  +  E+ + E+L +M+     PT 
Sbjct: 378 RPLCEAGKLDVARNVLATMISENLSPTVDTFHAFLEAVNFEKTL-EVLGQMKISDLGPTE 437

Query: 480 ETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLE 539
           ET+++++ K  + +Q +N  +IW EM    I  + + Y+  I GL   G LE A + Y E
Sbjct: 438 ETFLLILGKLFKGKQPENALKIWAEMDRFEIVANPALYLATIQGLLSCGWLEKAREIYSE 497

Query: 540 MKEKELLPEPKIDEILQ 556
           MK K  +  P + ++L+
Sbjct: 498 MKSKGFVGNPMLQKLLE 507

BLAST of Cla021114 vs. Swiss-Prot
Match: PP112_ARATH (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 220.3 bits (560), Expect = 5.7e-56
Identity = 124/429 (28.90%), Positives = 214/429 (49.88%), Query Frame = 1

Query: 119 EEDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDW 178
           E  ++ ND    QD + I  I   F +   ++V+  L    +K+S  L+  VL ++ N  
Sbjct: 54  ETQVSANDAS--QDAERICKILTKFTD---SKVETLLNEASVKLSPALIEEVLKKLSNAG 113

Query: 179 EAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQT 238
             A + F WA  Q G+ H+   Y+++I  LGK+++F   W+L+D+M+       L+S +T
Sbjct: 114 VLALSVFKWAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLVDDMKA----KKLLSKET 173

Query: 239 LLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLF-CN 298
             ++ R+Y     V +AI  FH  + FGF +   +F  +L  L + +NV DA+ +     
Sbjct: 174 FALISRRYARARKVKEAIGAFHKMEEFGFKMESSDFNRMLDTLSKSRNVGDAQKVFDKMK 233

Query: 299 KDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNL 358
           K  F  + KS+ I+L GW   +  LR  + V +EM   G   D V+Y   I+ + K +  
Sbjct: 234 KKRFEPDIKSYTILLEGWGQELNLLR-VDEVNREMKDEGFEPDVVAYGIIINAHCKAKKY 293

Query: 359 YKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSV 418
            + +R F++M+Q   +P   ++ ++I+ L   + L +A+   +  +  G      TYN++
Sbjct: 294 EEAIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSSGFPLEAPTYNAL 353

Query: 419 IKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPT 478
           +   C + + ++A    DE+  +G+ P  +TY   L  L   +   E  E  + M C+PT
Sbjct: 354 VGAYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEAYEVYQTMSCEPT 413

Query: 479 AETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYL 538
             TY +++R FC   +LD   +IW EM   G+ P    +  LI  L    KL++A +Y+ 
Sbjct: 414 VSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLITALCHENKLDEACEYFN 472

Query: 539 EMKEKELLP 547
           EM +  + P
Sbjct: 474 EMLDVGIRP 472

BLAST of Cla021114 vs. Swiss-Prot
Match: PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 1.2e-53
Identity = 123/445 (27.64%), Positives = 225/445 (50.56%), Query Frame = 1

Query: 130 IQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWEAAFTFFVWAG 189
           ++D+D + D F   ++ +   V  +L  C + V+  LV+ VL R  N W  A+ FF+WA 
Sbjct: 99  VEDIDKVCD-FLNKKDTSHEDVVKELSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWAN 158

Query: 190 KQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTLLIMIRKYCAV 249
            Q GY HS   Y++M+ +LGK R FD  W L++EM      S LV+  T+  ++R+    
Sbjct: 159 SQTGYVHSGHTYNAMVDVLGKCRNFDLMWELVNEMNKNEE-SKLVTLDTMSKVMRRLAKS 218

Query: 250 HDVGKAINTF-HAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKDVFPFNTKSF 309
               KA++ F    K +G         +L+ AL +  +++ A  +     D    + ++F
Sbjct: 219 GKYNKAVDAFLEMEKSYGVKTDTIAMNSLMDALVKENSIEHAHEVFLKLFDTIKPDARTF 278

Query: 310 NIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYKVLRLFDDMK 369
           NI+++G+C       DA  +   M     + D V+Y SF+  Y K  +  +V  + ++M+
Sbjct: 279 NILIHGFCK-ARKFDDARAMMDLMKVTEFTPDVVTYTSFVEAYCKEGDFRRVNEMLEEMR 338

Query: 370 QMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIKPLCKAWKFD 429
           +    P+   Y  V+H+L K + + EA+ + + M++ G + +   Y+S+I  L K  +F 
Sbjct: 339 ENGCNPNVVTYTIVMHSLGKSKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKTGRFK 398

Query: 430 EARAVFDELLQRGLCPTIQTYHAFLRFL---RTEEEIFELLEKMRK---MGCDPTAETYI 489
           +A  +F+++  +G+   +  Y+  +        +E    LL++M       C P  ETY 
Sbjct: 399 DAAEIFEDMTNQGVRRDVLVYNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNVETYA 458

Query: 490 MLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEMKEK 549
            L++  C  +++  +  + H M +N +S D S+Y++LI GL ++GK+E+A  ++ E   K
Sbjct: 459 PLLKMCCHKKKMKLLGILLHHMVKNDVSIDVSTYILLIRGLCMSGKVEEACLFFEEAVRK 518

Query: 550 ELLPEPKIDEILQAWLAGKPMFQEK 568
            ++P     ++L   L  K M + K
Sbjct: 519 GMVPRDSTCKMLVDELEKKNMAEAK 540

BLAST of Cla021114 vs. TrEMBL
Match: A0A0A0L308_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G118150 PE=4 SV=1)

HSP 1 Score: 1070.8 bits (2768), Expect = 5.9e-310
Identity = 521/602 (86.54%), Positives = 559/602 (92.86%), Query Frame = 1

Query: 6   MRSKLCAFSLLGRTCFSWSSIKSTQHVELGALTNATMASGFSLWPTRIHTADTATALNSN 65
           MRSKLCAFSLLGRT FSWSS+K   HV   ALT+ATMASGFSLWPTRIHT DTAT  NSN
Sbjct: 1   MRSKLCAFSLLGRTYFSWSSVKYAHHVNHRALTSATMASGFSLWPTRIHTVDTATVSNSN 60

Query: 66  VGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGDDDDDREEHEEEDMNIN 125
           VGDLFS GFCS SY+SP+SNHDIL ATS++SCH+AA  DH S DD D+ EE EEEDM+IN
Sbjct: 61  VGDLFSLGFCSHSYVSPSSNHDILPATSEQSCHHAAESDHESDDDHDNLEECEEEDMDIN 120

Query: 126 DEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWEAAFTFF 185
           D+GVI+DVDAIMDIFRGFR+ NR QV+NKLEHC IKVSGELVVAVLSRIRNDWEAAFTFF
Sbjct: 121 DKGVIKDVDAIMDIFRGFRDANRIQVRNKLEHCFIKVSGELVVAVLSRIRNDWEAAFTFF 180

Query: 186 VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTLLIMIRK 245
           VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGT GSSLV+PQTLLIMIR+
Sbjct: 181 VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTPGSSLVTPQTLLIMIRR 240

Query: 246 YCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKDVFPFNT 305
           YCAVHDV KAINTF+AHKRFGFNIGLEEFQ+LLSALCRYKNVKDAEYLLFCNKDVFPFNT
Sbjct: 241 YCAVHDVAKAINTFYAHKRFGFNIGLEEFQSLLSALCRYKNVKDAEYLLFCNKDVFPFNT 300

Query: 306 KSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYKVLRLFD 365
           KSFNIILNGWC +IGSLRD ERVWKEM +RGISHDAVSYAS ISCYSK RNL+KVLRLF+
Sbjct: 301 KSFNIILNGWC-VIGSLRDTERVWKEMTRRGISHDAVSYASCISCYSKVRNLHKVLRLFE 360

Query: 366 DMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIKPLCKAW 425
           DMK+MK++PDRKVYNAVIH+LAKGR LKEA +LIKTME+KGIIANVVTYNSVIKPLCKA 
Sbjct: 361 DMKRMKIDPDRKVYNAVIHSLAKGRCLKEAADLIKTMEEKGIIANVVTYNSVIKPLCKAR 420

Query: 426 KFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTAETYIMLI 485
           +FDEARAVF+ELLQRGLCPTIQTYHAFLRFLRTEEEIFELL+KMR MGC+PT +TYIMLI
Sbjct: 421 RFDEARAVFEELLQRGLCPTIQTYHAFLRFLRTEEEIFELLKKMRTMGCNPTTDTYIMLI 480

Query: 486 RKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEMKEKELL 545
           RKFCRWRQLDNVSRIWHEMSENGISPDRSSY+VLIHGLFLNGKLEDA+KYYLEMKEK+LL
Sbjct: 481 RKFCRWRQLDNVSRIWHEMSENGISPDRSSYIVLIHGLFLNGKLEDAHKYYLEMKEKDLL 540

Query: 546 PEPKIDEILQAWLAGKPMFQEKPSDCSGEGKNSRLFPKRIDFHRQPEIRKVSRDCGFSFW 605
           PEPKIDE+LQ WLAGK +FQE PSDCSGEGKNS LFP + DFHRQPEIRKVSR  GFSFW
Sbjct: 541 PEPKIDEVLQTWLAGKSVFQENPSDCSGEGKNSSLFPNKNDFHRQPEIRKVSRHRGFSFW 600

Query: 606 KQ 608
           KQ
Sbjct: 601 KQ 601

BLAST of Cla021114 vs. TrEMBL
Match: B9SHK0_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1122470 PE=4 SV=1)

HSP 1 Score: 718.8 bits (1854), Expect = 5.7e-204
Identity = 359/569 (63.09%), Positives = 440/569 (77.33%), Query Frame = 1

Query: 50  PTRIHTADTATALNSNVGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGD 109
           P+ I   +T   +  +  +L S  F S +  +P +        S E C   +    VS D
Sbjct: 21  PSAISIIETINIIKPSSSNL-SVAFSSSNLETPIAPKI---PNSHEGCSSESDHSDVSDD 80

Query: 110 DDDDREEHEEEDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVA 169
           DDDD    +   +N+ DEG++QDV  IM I       N  ++KNK+EHC +KVS ELV+ 
Sbjct: 81  DDDDDNIRQ---LNLKDEGLVQDVTIIMSILHQLSG-NPVEMKNKIEHCGVKVSRELVLE 140

Query: 170 VLSRIRNDWEAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTT 229
           VLSRIRNDWEAAFTFF+WAG+Q GY+HSVREYH+MISILGKMRKFDTAWALIDEMRG  T
Sbjct: 141 VLSRIRNDWEAAFTFFLWAGRQLGYSHSVREYHAMISILGKMRKFDTAWALIDEMRGVKT 200

Query: 230 GSSLVSPQTLLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKD 289
           G SLV+PQTLLIMIRKYCAVHDVG+AINTF+AHKRF F++G++EFQ+LLSALCRYKNV+D
Sbjct: 201 GISLVTPQTLLIMIRKYCAVHDVGRAINTFYAHKRFKFDLGIDEFQSLLSALCRYKNVQD 260

Query: 290 AEYLLFCNKDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFIS 349
           AE+L+FCNKDVFPFNTKSFNI+LNGWCN+IGS R+A+R+W+EM +R I +D VSYAS IS
Sbjct: 261 AEHLMFCNKDVFPFNTKSFNIVLNGWCNVIGSPREADRIWREMCKRRIHYDVVSYASIIS 320

Query: 350 CYSKTRNLYKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIA 409
           CYSK  NLYKV +L++ MK++ +EPDRK+YN+VI ALAKGR + EAINL+KTME+KGI  
Sbjct: 321 CYSKAGNLYKVFKLYNQMKEVGIEPDRKIYNSVIFALAKGRHVSEAINLMKTMEEKGIAP 380

Query: 410 NVVTYNSVIKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKM 469
           N VTYNS+IKPLC+A K DEAR +FDE+LQ G  PTI+TYHAF R LRT EE+F LLE M
Sbjct: 381 NTVTYNSLIKPLCRARKIDEARGLFDEMLQHGHSPTIRTYHAFFRSLRTGEEVFALLENM 440

Query: 470 RKMGCDPTAETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKL 529
           RK+GC PT +TYIMLIRKFCRWRQ D+V ++W+++SENG+ PDRSSY+VLIHGLFLNGKL
Sbjct: 441 RKLGCHPTIDTYIMLIRKFCRWRQFDDVFKLWNQISENGLGPDRSSYIVLIHGLFLNGKL 500

Query: 530 EDAYKYYLEMKEKELLPEPKIDEILQAWLAGK-----PMFQEKPS--DCSGEGKNSRLFP 589
           E+AYK+Y +MKEK+LLP+PK+DE+LQ WL+ K      M + K    DCS  G   R   
Sbjct: 501 EEAYKFYADMKEKQLLPDPKLDEMLQTWLSNKQVAECQMTESKVDQLDCSRLGNQRRATS 560

Query: 590 KRI----DFHRQPEIRKVSRDCGFSFWKQ 608
           KRI    DF +Q EIR+V R+ GFSFWKQ
Sbjct: 561 KRINHEKDFLQQAEIRRVVRERGFSFWKQ 581

BLAST of Cla021114 vs. TrEMBL
Match: A0A061F1G5_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_026033 PE=4 SV=1)

HSP 1 Score: 718.0 bits (1852), Expect = 9.7e-204
Identity = 374/619 (60.42%), Positives = 455/619 (73.51%), Query Frame = 1

Query: 1   MWDGRM-RSKLCAFSLLGRTCFSWSSIKSTQHVELGALTNATMASGFSLWPTRIHTADTA 60
           MW  R  RSK+  FS+   T      +K TQ+      T  T  +  +++P         
Sbjct: 1   MWGIRFTRSKVYVFSVFANTHLRTIHLKPTQYRNPIFETVGTRRTVANIYPMN------- 60

Query: 61  TALNSNVGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGDDDDDREEHEE 120
                     F F F   S+   TSN    +    E+    + +D+  G+D ++ + +  
Sbjct: 61  ----------FCFPFQLFSFC--TSNIGFPSGPKIETFDEHSDNDNDDGEDSENFDGNSV 120

Query: 121 EDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWE 180
              +++DE V QD  AIMDI R   + N  ++KNKLEHC I+VS ELVV +LSRIRNDWE
Sbjct: 121 NGSSLSDEAV-QDGKAIMDIIRETGS-NYVEMKNKLEHCRIRVSSELVVEILSRIRNDWE 180

Query: 181 AAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTL 240
            AFTFF+WAGKQPGYAHS+RE HSMISILGKMRKFDTAWALIDEMRGG  G  LV+PQTL
Sbjct: 181 VAFTFFLWAGKQPGYAHSLRECHSMISILGKMRKFDTAWALIDEMRGGRAGPCLVTPQTL 240

Query: 241 LIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKD 300
           LIMIR+YCAVHDVG+AINTF+A+K+F F++G+EEFQ+LLSALCRYKNV+DAE+L+FCNKD
Sbjct: 241 LIMIRRYCAVHDVGRAINTFYAYKKFKFDVGIEEFQSLLSALCRYKNVQDAEHLMFCNKD 300

Query: 301 VFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYK 360
           VFPFN KSFNIILNGWCN IGS R AERVW+EM +RG+ HD VSYAS +SCYSK  NL+K
Sbjct: 301 VFPFNIKSFNIILNGWCNAIGSPRQAERVWREMSKRGVQHDVVSYASVMSCYSKACNLHK 360

Query: 361 VLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIK 420
           VL+LF  MK+M +EPDRKVYNAVIHALAK R +KEAINL+KTME+KGI  NVVTYNS+IK
Sbjct: 361 VLKLFSQMKRMGIEPDRKVYNAVIHALAKARHVKEAINLLKTMEEKGIAPNVVTYNSLIK 420

Query: 421 PLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTAE 480
           PLCKA K DEAR VFDE+LQR L PTI+TYHAF R LR  EE+FELLEKMRKMGC PT +
Sbjct: 421 PLCKAQKIDEARQVFDEMLQRDLSPTIRTYHAFFRILRNGEEVFELLEKMRKMGCQPTND 480

Query: 481 TYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEM 540
           TYIMLIRKF RW Q DNV ++W+EM+E+G+  DRSSY+VLIHGLFLNGKL++AYKYY EM
Sbjct: 481 TYIMLIRKFSRWCQFDNVFKLWNEMTEDGVGHDRSSYIVLIHGLFLNGKLDEAYKYYTEM 540

Query: 541 KEKELLPEPKIDEILQAWLAGKPMFQEKPSDCSGE-------GKNSRLFPKRI----DFH 600
           K+K+LLPEPKIDE+LQAW++GK   + + +D            +  R+  K+I    DF 
Sbjct: 541 KDKQLLPEPKIDEMLQAWVSGKQFAERQMADLKNNQLLDNQLDEQVRVESKKIDQEKDFL 598

Query: 601 RQPEIRKVSRDCGFSFWKQ 608
           R PE R+V R+ GFSFW+Q
Sbjct: 601 RLPETRRVIRERGFSFWEQ 598

BLAST of Cla021114 vs. TrEMBL
Match: U7DWX5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0671s00210g PE=4 SV=1)

HSP 1 Score: 698.7 bits (1802), Expect = 6.1e-198
Identity = 354/578 (61.25%), Positives = 438/578 (75.78%), Query Frame = 1

Query: 50  PTRIHTADTATALNSNVGDLFSFGFCSRSYISP-TSNHDILNATSKESCHYAASDDHVSG 109
           PT          L+S     FS  F + S  +P T  H  LN   + S     +   +  
Sbjct: 29  PTHFANLTVERPLSSKFSTAFS-SFSTSSLETPITPKH--LNEPEECSSDNDDNGSEMEN 88

Query: 110 DDDDDREEHEEEDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVV 169
           DDDDD    +   +N++D+G+ QD   I++I +   + NR ++K+K+E C IKVS ELV+
Sbjct: 89  DDDDD----DVRGLNLSDDGLFQDAKTIVNILQESCD-NRVEMKSKIEQCGIKVSQELVL 148

Query: 170 AVLSRIRNDWEAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGT 229
            VLSR+RNDWEAAFTFF+WA +QPGYAHSVREYHSMISILGKMRKFDTAW LIDEMRG  
Sbjct: 149 EVLSRVRNDWEAAFTFFLWAARQPGYAHSVREYHSMISILGKMRKFDTAWVLIDEMRGVK 208

Query: 230 TGSSLVSPQTLLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVK 289
           TG SLV+PQTLLIMIRKYCAVHDVG+AINTF+A+KRF F++G+EEFQ+LLSALCRYKNV+
Sbjct: 209 TGVSLVTPQTLLIMIRKYCAVHDVGRAINTFYAYKRFKFDMGIEEFQSLLSALCRYKNVQ 268

Query: 290 DAEYLLFCNKDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFI 349
           DAE L++CNK VFP NTKSFNI+LNGWCN+IGS R++ERVW+EM +RGI  D VSYAS +
Sbjct: 269 DAEQLMYCNKAVFPLNTKSFNIVLNGWCNLIGSPRESERVWREMSKRGIRFDVVSYASMM 328

Query: 350 SCYSKTRNLYKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGII 409
           SCYSK  +LY+VLRL+  MK++ +EPDRKVYNAVIHALAKGRL+ EA NL++TME+KG+ 
Sbjct: 329 SCYSKAGSLYRVLRLYKQMKKIGIEPDRKVYNAVIHALAKGRLVNEAFNLMRTMEEKGVA 388

Query: 410 ANVVTYNSVIKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEK 469
            N+VTYNS+IKPLC+A K +EA+  FD++L+R + PTI+TYHAFLR LRT EE+F LLEK
Sbjct: 389 PNIVTYNSLIKPLCRARKVEEAKGAFDDMLKRCISPTIRTYHAFLRILRTGEEVFALLEK 448

Query: 470 MRKMGCDPTAETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGK 529
           MRKMGC P  +TYIMLIRKFCRWRQL+NV ++W EMSENGISPDRSSY+VLIHGLFLNG+
Sbjct: 449 MRKMGCQPINDTYIMLIRKFCRWRQLENVFKLWDEMSENGISPDRSSYIVLIHGLFLNGE 508

Query: 530 LEDAYKYYLEMKEKELLPEPKIDEILQAWLAGKPMFQEKPS-----------------DC 589
           L+ A+KYY EMKEK+LLPEPKIDE+LQ WL+ K + + + +                 DC
Sbjct: 509 LDAAHKYYTEMKEKQLLPEPKIDEMLQTWLSNKQIAEGQTTESRSNQCQTTELRSNQLDC 568

Query: 590 SGEGKNSRLFPKRI---DFHRQPEIRKVSRDCGFSFWK 607
           S   + +R  PKR    +F RQ E RKV R+ G SFW+
Sbjct: 569 SQSREQTRGIPKRSHERNFIRQAETRKVVRERGISFWE 598

BLAST of Cla021114 vs. TrEMBL
Match: B9I5U4_POPTR (Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POPTR_0013s05050g PE=4 SV=2)

HSP 1 Score: 698.4 bits (1801), Expect = 7.9e-198
Identity = 339/517 (65.57%), Positives = 420/517 (81.24%), Query Frame = 1

Query: 110 DDDDREEHEEEDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVA 169
           ++DD ++ +   +N++D+G+ QD   I++I +   + NR ++K+K+E C IKVS ELV+ 
Sbjct: 2   ENDDDDDDDVRGLNLSDDGLFQDAKTIVNILQESCD-NRVEMKSKIEQCGIKVSQELVLE 61

Query: 170 VLSRIRNDWEAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTT 229
           VLSR+RNDWEAAFTFF+WA +QPGYAHSVREYHSMISILGKMRKFDTAW LIDEMRG  T
Sbjct: 62  VLSRVRNDWEAAFTFFLWAARQPGYAHSVREYHSMISILGKMRKFDTAWVLIDEMRGVKT 121

Query: 230 GSSLVSPQTLLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKD 289
           G SLV+PQTLLIMIRKYCAVHDVG+AINTF+A+KRF F++G+EEFQ+LLSALCRYKNV+D
Sbjct: 122 GVSLVTPQTLLIMIRKYCAVHDVGRAINTFYAYKRFKFDMGIEEFQSLLSALCRYKNVQD 181

Query: 290 AEYLLFCNKDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFIS 349
           AE L++CNK VFP NTKSFNI+LNGWCN+IGS R++ERVW+EM +RGI  D VSYAS +S
Sbjct: 182 AEQLMYCNKAVFPLNTKSFNIVLNGWCNLIGSPRESERVWREMSKRGIRFDVVSYASMMS 241

Query: 350 CYSKTRNLYKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIA 409
           CYSK  +LY+VLRL+  MK++ +EPDRKVYNAVIHALAKGRL+ EA NL+KTME+KG+  
Sbjct: 242 CYSKAGSLYRVLRLYKQMKKIGIEPDRKVYNAVIHALAKGRLVNEAFNLMKTMEEKGVAP 301

Query: 410 NVVTYNSVIKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKM 469
           N+VTYNS+IKPLC+A K +EA+  FD++L+R + PTI+TYHAFLR LRT EE+F LLEKM
Sbjct: 302 NIVTYNSLIKPLCRARKVEEAKGAFDDMLKRCISPTIRTYHAFLRILRTGEEVFALLEKM 361

Query: 470 RKMGCDPTAETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKL 529
           RKMGC P  +TYIMLIRKFCRWRQL+NV ++W EMSENGISPDRSSY+VLIHGLFLNG+L
Sbjct: 362 RKMGCQPINDTYIMLIRKFCRWRQLENVFKLWDEMSENGISPDRSSYIVLIHGLFLNGEL 421

Query: 530 EDAYKYYLEMKEKELLPEPKIDEILQAWLAGKPMFQEKPS-----------------DCS 589
           + A+KYY EMKEK+LLPEPKIDE+LQ WL+ K + + + +                 DCS
Sbjct: 422 DAAHKYYTEMKEKQLLPEPKIDEMLQTWLSNKQIAEGQTTESRSNQCQTTELRSNQLDCS 481

Query: 590 GEGKNSRLFPKRI---DFHRQPEIRKVSRDCGFSFWK 607
              + +R  PKR    +F RQ E RKV R+ GFSFW+
Sbjct: 482 QSREQTRDIPKRSHERNFIRQAETRKVVRERGFSFWE 517

BLAST of Cla021114 vs. NCBI nr
Match: gi|659074841|ref|XP_008437825.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g15010, mitochondrial [Cucumis melo])

HSP 1 Score: 1073.9 bits (2776), Expect = 9.4e-311
Identity = 523/602 (86.88%), Positives = 561/602 (93.19%), Query Frame = 1

Query: 6   MRSKLCAFSLLGRTCFSWSSIKSTQHVELGALTNATMASGFSLWPTRIHTADTATALNSN 65
           MRSKLCAFSLLGRT FSWSSIK   HV L ALT+ TMASGFS WPTRIHT D AT  NSN
Sbjct: 1   MRSKLCAFSLLGRTYFSWSSIKYAHHVNLRALTSVTMASGFSSWPTRIHTVDAATVPNSN 60

Query: 66  VGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGDDDDDREEHEEEDMNIN 125
           +G+ FSFGFCS SY+S +SNHDILNAT +ESCHYAA +DH SGDD D+ EE EEEDM+IN
Sbjct: 61  IGNFFSFGFCSHSYVSSSSNHDILNATPEESCHYAADNDHESGDDHDNLEECEEEDMDIN 120

Query: 126 DEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWEAAFTFF 185
           DEGVI+DVDAIMDIFRGFRNVNR +VKNKLEHC IKVSGELVVAVLSRIRNDWEAAFTFF
Sbjct: 121 DEGVIKDVDAIMDIFRGFRNVNRIEVKNKLEHCFIKVSGELVVAVLSRIRNDWEAAFTFF 180

Query: 186 VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTLLIMIRK 245
           VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGT GSSLV+PQTLLIMIR+
Sbjct: 181 VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTAGSSLVTPQTLLIMIRR 240

Query: 246 YCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKDVFPFNT 305
           YCAVHDV KAINTFHAHKRFGFNIGLEEFQ+LLSALCRYKNVKDAEYLLFCNKDVFPFNT
Sbjct: 241 YCAVHDVAKAINTFHAHKRFGFNIGLEEFQSLLSALCRYKNVKDAEYLLFCNKDVFPFNT 300

Query: 306 KSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYKVLRLFD 365
           KSFNIILNGWC IIGSLRD ERVWKEM +RGISHDA+SYAS ISCYSK RNL+KVL+LF+
Sbjct: 301 KSFNIILNGWC-IIGSLRDTERVWKEMTRRGISHDAISYASSISCYSKVRNLHKVLKLFE 360

Query: 366 DMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIKPLCKAW 425
           DMK+MK++PDRKVYNAVIH+LAKGR LKEA +LIKTME+KGII NVVTYNSVIKPLCKA 
Sbjct: 361 DMKRMKIDPDRKVYNAVIHSLAKGRCLKEAADLIKTMEEKGIIPNVVTYNSVIKPLCKAR 420

Query: 426 KFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTAETYIMLI 485
           +FDEARAVF+E+LQRGLCPTIQTYHAFLRFLRTEEEIFELL+KMR MGC+PTA+TYIMLI
Sbjct: 421 RFDEARAVFEEILQRGLCPTIQTYHAFLRFLRTEEEIFELLKKMRTMGCNPTADTYIMLI 480

Query: 486 RKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEMKEKELL 545
           RKFCRWRQLDNVSRIWHEMSENGISPDRSSY+VLIHGLFLNGKLEDA+KYYLEMKEKELL
Sbjct: 481 RKFCRWRQLDNVSRIWHEMSENGISPDRSSYIVLIHGLFLNGKLEDAHKYYLEMKEKELL 540

Query: 546 PEPKIDEILQAWLAGKPMFQEKPSDCSGEGKNSRLFPKRIDFHRQPEIRKVSRDCGFSFW 605
           PEPKIDE+LQAWLAG+ MFQE PSDCSGEGKNSRLFP++ DFHRQPEI K SRD GFSFW
Sbjct: 541 PEPKIDEVLQAWLAGRSMFQENPSDCSGEGKNSRLFPEK-DFHRQPEIIKGSRDRGFSFW 600

Query: 606 KQ 608
           +Q
Sbjct: 601 RQ 600

BLAST of Cla021114 vs. NCBI nr
Match: gi|778676742|ref|XP_011650654.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g15010, mitochondrial [Cucumis sativus])

HSP 1 Score: 1070.8 bits (2768), Expect = 8.5e-310
Identity = 521/602 (86.54%), Positives = 559/602 (92.86%), Query Frame = 1

Query: 6   MRSKLCAFSLLGRTCFSWSSIKSTQHVELGALTNATMASGFSLWPTRIHTADTATALNSN 65
           MRSKLCAFSLLGRT FSWSS+K   HV   ALT+ATMASGFSLWPTRIHT DTAT  NSN
Sbjct: 1   MRSKLCAFSLLGRTYFSWSSVKYAHHVNHRALTSATMASGFSLWPTRIHTVDTATVSNSN 60

Query: 66  VGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGDDDDDREEHEEEDMNIN 125
           VGDLFS GFCS SY+SP+SNHDIL ATS++SCH+AA  DH S DD D+ EE EEEDM+IN
Sbjct: 61  VGDLFSLGFCSHSYVSPSSNHDILPATSEQSCHHAAESDHESDDDHDNLEECEEEDMDIN 120

Query: 126 DEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWEAAFTFF 185
           D+GVI+DVDAIMDIFRGFR+ NR QV+NKLEHC IKVSGELVVAVLSRIRNDWEAAFTFF
Sbjct: 121 DKGVIKDVDAIMDIFRGFRDANRIQVRNKLEHCFIKVSGELVVAVLSRIRNDWEAAFTFF 180

Query: 186 VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTLLIMIRK 245
           VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGT GSSLV+PQTLLIMIR+
Sbjct: 181 VWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTPGSSLVTPQTLLIMIRR 240

Query: 246 YCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKDVFPFNT 305
           YCAVHDV KAINTF+AHKRFGFNIGLEEFQ+LLSALCRYKNVKDAEYLLFCNKDVFPFNT
Sbjct: 241 YCAVHDVAKAINTFYAHKRFGFNIGLEEFQSLLSALCRYKNVKDAEYLLFCNKDVFPFNT 300

Query: 306 KSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYKVLRLFD 365
           KSFNIILNGWC +IGSLRD ERVWKEM +RGISHDAVSYAS ISCYSK RNL+KVLRLF+
Sbjct: 301 KSFNIILNGWC-VIGSLRDTERVWKEMTRRGISHDAVSYASCISCYSKVRNLHKVLRLFE 360

Query: 366 DMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIKPLCKAW 425
           DMK+MK++PDRKVYNAVIH+LAKGR LKEA +LIKTME+KGIIANVVTYNSVIKPLCKA 
Sbjct: 361 DMKRMKIDPDRKVYNAVIHSLAKGRCLKEAADLIKTMEEKGIIANVVTYNSVIKPLCKAR 420

Query: 426 KFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTAETYIMLI 485
           +FDEARAVF+ELLQRGLCPTIQTYHAFLRFLRTEEEIFELL+KMR MGC+PT +TYIMLI
Sbjct: 421 RFDEARAVFEELLQRGLCPTIQTYHAFLRFLRTEEEIFELLKKMRTMGCNPTTDTYIMLI 480

Query: 486 RKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEMKEKELL 545
           RKFCRWRQLDNVSRIWHEMSENGISPDRSSY+VLIHGLFLNGKLEDA+KYYLEMKEK+LL
Sbjct: 481 RKFCRWRQLDNVSRIWHEMSENGISPDRSSYIVLIHGLFLNGKLEDAHKYYLEMKEKDLL 540

Query: 546 PEPKIDEILQAWLAGKPMFQEKPSDCSGEGKNSRLFPKRIDFHRQPEIRKVSRDCGFSFW 605
           PEPKIDE+LQ WLAGK +FQE PSDCSGEGKNS LFP + DFHRQPEIRKVSR  GFSFW
Sbjct: 541 PEPKIDEVLQTWLAGKSVFQENPSDCSGEGKNSSLFPNKNDFHRQPEIRKVSRHRGFSFW 600

Query: 606 KQ 608
           KQ
Sbjct: 601 KQ 601
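
The percentages in an HSP summary line are the match counts divided by the alignment length. A minimal sketch that re-derives them from the Cucumis sativus summary line above; the helper name `parse_hsp_stats` and its regular expression are illustrative assumptions, not part of the database's own tooling.

```python
import re

# Hypothetical helper: pulls the match counts out of an HSP summary line.
# The label is matched loosely (Posi?tives) to tolerate spelling variants
# in exported reports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return recomputed identity/positives percentages for one HSP line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return {
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
        "alignment_length": ident_len,
    }

stats = parse_hsp_stats(
    "Identity = 521/602 (86.54%), Positives = 559/602 (92.86%), Query Frame = 1"
)
print(stats)
```

The recomputed values agree with the printed 86.54% and 92.86%, which is a quick consistency check when summary lines are copied by hand.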

BLAST of Cla021114 vs. NCBI nr
Match: gi|223535282|gb|EEF36959.1| (pentatricopeptide repeat-containing protein, putative [Ricinus communis])

HSP 1 Score: 718.8 bits (1854), Expect = 8.1e-204
Identity = 359/569 (63.09%), Positives = 440/569 (77.33%), Query Frame = 1

Query: 50  PTRIHTADTATALNSNVGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGD 109
           P+ I   +T   +  +  +L S  F S +  +P +        S E C   +    VS D
Sbjct: 21  PSAISIIETINIIKPSSSNL-SVAFSSSNLETPIAPKI---PNSHEGCSSESDHSDVSDD 80

Query: 110 DDDDREEHEEEDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVA 169
           DDDD    +   +N+ DEG++QDV  IM I       N  ++KNK+EHC +KVS ELV+ 
Sbjct: 81  DDDDDNIRQ---LNLKDEGLVQDVTIIMSILHQLSG-NPVEMKNKIEHCGVKVSRELVLE 140

Query: 170 VLSRIRNDWEAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTT 229
           VLSRIRNDWEAAFTFF+WAG+Q GY+HSVREYH+MISILGKMRKFDTAWALIDEMRG  T
Sbjct: 141 VLSRIRNDWEAAFTFFLWAGRQLGYSHSVREYHAMISILGKMRKFDTAWALIDEMRGVKT 200

Query: 230 GSSLVSPQTLLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKD 289
           G SLV+PQTLLIMIRKYCAVHDVG+AINTF+AHKRF F++G++EFQ+LLSALCRYKNV+D
Sbjct: 201 GISLVTPQTLLIMIRKYCAVHDVGRAINTFYAHKRFKFDLGIDEFQSLLSALCRYKNVQD 260

Query: 290 AEYLLFCNKDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFIS 349
           AE+L+FCNKDVFPFNTKSFNI+LNGWCN+IGS R+A+R+W+EM +R I +D VSYAS IS
Sbjct: 261 AEHLMFCNKDVFPFNTKSFNIVLNGWCNVIGSPREADRIWREMCKRRIHYDVVSYASIIS 320

Query: 350 CYSKTRNLYKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIA 409
           CYSK  NLYKV +L++ MK++ +EPDRK+YN+VI ALAKGR + EAINL+KTME+KGI  
Sbjct: 321 CYSKAGNLYKVFKLYNQMKEVGIEPDRKIYNSVIFALAKGRHVSEAINLMKTMEEKGIAP 380

Query: 410 NVVTYNSVIKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKM 469
           N VTYNS+IKPLC+A K DEAR +FDE+LQ G  PTI+TYHAF R LRT EE+F LLE M
Sbjct: 381 NTVTYNSLIKPLCRARKIDEARGLFDEMLQHGHSPTIRTYHAFFRSLRTGEEVFALLENM 440

Query: 470 RKMGCDPTAETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKL 529
           RK+GC PT +TYIMLIRKFCRWRQ D+V ++W+++SENG+ PDRSSY+VLIHGLFLNGKL
Sbjct: 441 RKLGCHPTIDTYIMLIRKFCRWRQFDDVFKLWNQISENGLGPDRSSYIVLIHGLFLNGKL 500

Query: 530 EDAYKYYLEMKEKELLPEPKIDEILQAWLAGK-----PMFQEKPS--DCSGEGKNSRLFP 589
           E+AYK+Y +MKEK+LLP+PK+DE+LQ WL+ K      M + K    DCS  G   R   
Sbjct: 501 EEAYKFYADMKEKQLLPDPKLDEMLQTWLSNKQVAECQMTESKVDQLDCSRLGNQRRATS 560

Query: 590 KRI----DFHRQPEIRKVSRDCGFSFWKQ 608
           KRI    DF +Q EIR+V R+ GFSFWKQ
Sbjct: 561 KRINHEKDFLQQAEIRRVVRERGFSFWKQ 581

BLAST of Cla021114 vs. NCBI nr
Match: gi|1000953115|ref|XP_002525469.2| (PREDICTED: pentatricopeptide repeat-containing protein At5g15010, mitochondrial [Ricinus communis])

HSP 1 Score: 718.8 bits (1854), Expect = 8.1e-204
Identity = 359/569 (63.09%), Positives = 440/569 (77.33%), Query Frame = 1

Query: 50  PTRIHTADTATALNSNVGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGD 109
           P+ I   +T   +  +  +L S  F S +  +P +        S E C   +    VS D
Sbjct: 32  PSAISIIETINIIKPSSSNL-SVAFSSSNLETPIAPKI---PNSHEGCSSESDHSDVSDD 91

Query: 110 DDDDREEHEEEDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVA 169
           DDDD    +   +N+ DEG++QDV  IM I       N  ++KNK+EHC +KVS ELV+ 
Sbjct: 92  DDDDDNIRQ---LNLKDEGLVQDVTIIMSILHQLSG-NPVEMKNKIEHCGVKVSRELVLE 151

Query: 170 VLSRIRNDWEAAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTT 229
           VLSRIRNDWEAAFTFF+WAG+Q GY+HSVREYH+MISILGKMRKFDTAWALIDEMRG  T
Sbjct: 152 VLSRIRNDWEAAFTFFLWAGRQLGYSHSVREYHAMISILGKMRKFDTAWALIDEMRGVKT 211

Query: 230 GSSLVSPQTLLIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKD 289
           G SLV+PQTLLIMIRKYCAVHDVG+AINTF+AHKRF F++G++EFQ+LLSALCRYKNV+D
Sbjct: 212 GISLVTPQTLLIMIRKYCAVHDVGRAINTFYAHKRFKFDLGIDEFQSLLSALCRYKNVQD 271

Query: 290 AEYLLFCNKDVFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFIS 349
           AE+L+FCNKDVFPFNTKSFNI+LNGWCN+IGS R+A+R+W+EM +R I +D VSYAS IS
Sbjct: 272 AEHLMFCNKDVFPFNTKSFNIVLNGWCNVIGSPREADRIWREMCKRRIHYDVVSYASIIS 331

Query: 350 CYSKTRNLYKVLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIA 409
           CYSK  NLYKV +L++ MK++ +EPDRK+YN+VI ALAKGR + EAINL+KTME+KGI  
Sbjct: 332 CYSKAGNLYKVFKLYNQMKEVGIEPDRKIYNSVIFALAKGRHVSEAINLMKTMEEKGIAP 391

Query: 410 NVVTYNSVIKPLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKM 469
           N VTYNS+IKPLC+A K DEAR +FDE+LQ G  PTI+TYHAF R LRT EE+F LLE M
Sbjct: 392 NTVTYNSLIKPLCRARKIDEARGLFDEMLQHGHSPTIRTYHAFFRSLRTGEEVFALLENM 451

Query: 470 RKMGCDPTAETYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKL 529
           RK+GC PT +TYIMLIRKFCRWRQ D+V ++W+++SENG+ PDRSSY+VLIHGLFLNGKL
Sbjct: 452 RKLGCHPTIDTYIMLIRKFCRWRQFDDVFKLWNQISENGLGPDRSSYIVLIHGLFLNGKL 511

Query: 530 EDAYKYYLEMKEKELLPEPKIDEILQAWLAGK-----PMFQEKPS--DCSGEGKNSRLFP 589
           E+AYK+Y +MKEK+LLP+PK+DE+LQ WL+ K      M + K    DCS  G   R   
Sbjct: 512 EEAYKFYADMKEKQLLPDPKLDEMLQTWLSNKQVAECQMTESKVDQLDCSRLGNQRRATS 571

Query: 590 KRI----DFHRQPEIRKVSRDCGFSFWKQ 608
           KRI    DF +Q EIR+V R+ GFSFWKQ
Sbjct: 572 KRINHEKDFLQQAEIRRVVRERGFSFWKQ 592

BLAST of Cla021114 vs. NCBI nr
Match: gi|590641441|ref|XP_007030231.1| (Pentatricopeptide repeat-containing protein, putative [Theobroma cacao])

HSP 1 Score: 718.0 bits (1852), Expect = 1.4e-203
Identity = 374/619 (60.42%), Positives = 455/619 (73.51%), Query Frame = 1

Query: 1   MWDGRM-RSKLCAFSLLGRTCFSWSSIKSTQHVELGALTNATMASGFSLWPTRIHTADTA 60
           MW  R  RSK+  FS+   T      +K TQ+      T  T  +  +++P         
Sbjct: 1   MWGIRFTRSKVYVFSVFANTHLRTIHLKPTQYRNPIFETVGTRRTVANIYPMN------- 60

Query: 61  TALNSNVGDLFSFGFCSRSYISPTSNHDILNATSKESCHYAASDDHVSGDDDDDREEHEE 120
                     F F F   S+   TSN    +    E+    + +D+  G+D ++ + +  
Sbjct: 61  ----------FCFPFQLFSFC--TSNIGFPSGPKIETFDEHSDNDNDDGEDSENFDGNSV 120

Query: 121 EDMNINDEGVIQDVDAIMDIFRGFRNVNRTQVKNKLEHCCIKVSGELVVAVLSRIRNDWE 180
              +++DE V QD  AIMDI R   + N  ++KNKLEHC I+VS ELVV +LSRIRNDWE
Sbjct: 121 NGSSLSDEAV-QDGKAIMDIIRETGS-NYVEMKNKLEHCRIRVSSELVVEILSRIRNDWE 180

Query: 181 AAFTFFVWAGKQPGYAHSVREYHSMISILGKMRKFDTAWALIDEMRGGTTGSSLVSPQTL 240
            AFTFF+WAGKQPGYAHS+RE HSMISILGKMRKFDTAWALIDEMRGG  G  LV+PQTL
Sbjct: 181 VAFTFFLWAGKQPGYAHSLRECHSMISILGKMRKFDTAWALIDEMRGGRAGPCLVTPQTL 240

Query: 241 LIMIRKYCAVHDVGKAINTFHAHKRFGFNIGLEEFQNLLSALCRYKNVKDAEYLLFCNKD 300
           LIMIR+YCAVHDVG+AINTF+A+K+F F++G+EEFQ+LLSALCRYKNV+DAE+L+FCNKD
Sbjct: 241 LIMIRRYCAVHDVGRAINTFYAYKKFKFDVGIEEFQSLLSALCRYKNVQDAEHLMFCNKD 300

Query: 301 VFPFNTKSFNIILNGWCNIIGSLRDAERVWKEMMQRGISHDAVSYASFISCYSKTRNLYK 360
           VFPFN KSFNIILNGWCN IGS R AERVW+EM +RG+ HD VSYAS +SCYSK  NL+K
Sbjct: 301 VFPFNIKSFNIILNGWCNAIGSPRQAERVWREMSKRGVQHDVVSYASVMSCYSKACNLHK 360

Query: 361 VLRLFDDMKQMKVEPDRKVYNAVIHALAKGRLLKEAINLIKTMEDKGIIANVVTYNSVIK 420
           VL+LF  MK+M +EPDRKVYNAVIHALAK R +KEAINL+KTME+KGI  NVVTYNS+IK
Sbjct: 361 VLKLFSQMKRMGIEPDRKVYNAVIHALAKARHVKEAINLLKTMEEKGIAPNVVTYNSLIK 420

Query: 421 PLCKAWKFDEARAVFDELLQRGLCPTIQTYHAFLRFLRTEEEIFELLEKMRKMGCDPTAE 480
           PLCKA K DEAR VFDE+LQR L PTI+TYHAF R LR  EE+FELLEKMRKMGC PT +
Sbjct: 421 PLCKAQKIDEARQVFDEMLQRDLSPTIRTYHAFFRILRNGEEVFELLEKMRKMGCQPTND 480

Query: 481 TYIMLIRKFCRWRQLDNVSRIWHEMSENGISPDRSSYMVLIHGLFLNGKLEDAYKYYLEM 540
           TYIMLIRKF RW Q DNV ++W+EM+E+G+  DRSSY+VLIHGLFLNGKL++AYKYY EM
Sbjct: 481 TYIMLIRKFSRWCQFDNVFKLWNEMTEDGVGHDRSSYIVLIHGLFLNGKLDEAYKYYTEM 540

Query: 541 KEKELLPEPKIDEILQAWLAGKPMFQEKPSDCSGE-------GKNSRLFPKRI----DFH 600
           K+K+LLPEPKIDE+LQAW++GK   + + +D            +  R+  K+I    DF 
Sbjct: 541 KDKQLLPEPKIDEMLQAWVSGKQFAERQMADLKNNQLLDNQLDEQVRVESKKIDQEKDFL 598

Query: 601 RQPEIRKVSRDCGFSFWKQ 608
           R PE R+V R+ GFSFW+Q
Sbjct: 601 RLPETRRVIRERGFSFWEQ 598
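
In BLAST pairwise output, the unlabelled middle line of each block carries a letter where query and subject residues are identical and a `+` where the substitution scores as conservative. A small sketch, assuming the midline has been copied without its leading indentation, counts both per block; the function name `count_midline` is hypothetical.

```python
def count_midline(midline):
    """Return (identities, positives) for one alignment midline string.

    Letters mark identical residues; '+' marks conservative substitutions.
    Positives include the identities, as in the HSP summary line.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return identities, identities + conservative

# Midline of the final block of the Theobroma cacao alignment
last_block = count_midline("R PE R+V R+ GFSFW+Q")
print(last_block)
```

Summing these pairs over every block of an HSP should reproduce the Identity and Positives counts in its summary line.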

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP383_ARATH  2.2e-180  61.09     Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidop...
PP233_ARATH  7.7e-69   32.87     Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis th...
PP137_ARATH  5.0e-68   32.95     Pentatricopeptide repeat-containing protein At1g80880, mitochondrial OS=Arabidop...
PP112_ARATH  5.7e-56   28.90     Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop...
PP248_ARATH  1.2e-53   27.64     Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop...

Match Name        E-value   Identity  Description
A0A0A0L308_CUCSA  5.9e-310  86.54     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G118150 PE=4 SV=1
B9SHK0_RICCO      5.7e-204  63.09     Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
A0A061F1G5_THECC  9.7e-204  60.42     Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_...
U7DWX5_POPTR      6.1e-198  61.25     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0671s00210g PE=4 SV=1
B9I5U4_POPTR      7.9e-198  65.57     Pentatricopeptide repeat-containing family protein OS=Populus trichocarpa GN=POP...

Match Name                         E-value   Identity  Description
gi|659074841|ref|XP_008437825.1|   9.4e-311  86.88     PREDICTED: pentatricopeptide repeat-containing protein At5g15010, mitochondrial ...
gi|778676742|ref|XP_011650654.1|   8.5e-310  86.54     PREDICTED: pentatricopeptide repeat-containing protein At5g15010, mitochondrial ...
gi|223535282|gb|EEF36959.1|        8.1e-204  63.09     pentatricopeptide repeat-containing protein, putative [Ricinus communis]
gi|1000953115|ref|XP_002525469.2|  8.1e-204  63.09     PREDICTED: pentatricopeptide repeat-containing protein At5g15010, mitochondrial ...
gi|590641441|ref|XP_007030231.1|   1.4e-203  60.42     Pentatricopeptide repeat-containing protein, putative [Theobroma cacao]
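
Two of the NCBI nr E-values (9.4e-311 and 8.5e-310) lie below the smallest normal double-precision number (about 2.2e-308), so converting them to a binary float lands in the subnormal range and loses precision. The sketch below ranks the hits with `decimal.Decimal` instead. The cutoff of 1e-250 and the function name `best_hits` are arbitrary assumptions chosen for illustration.

```python
from decimal import Decimal
import sys

# (match name, E-value as printed, percent identity) from the NCBI nr summary
hits = [
    ("gi|659074841|ref|XP_008437825.1|", "9.4e-311", 86.88),
    ("gi|778676742|ref|XP_011650654.1|", "8.5e-310", 86.54),
    ("gi|223535282|gb|EEF36959.1|", "8.1e-204", 63.09),
    ("gi|1000953115|ref|XP_002525469.2|", "8.1e-204", 63.09),
    ("gi|590641441|ref|XP_007030231.1|", "1.4e-203", 60.42),
]

def best_hits(rows, max_evalue="1e-250"):
    """Keep hits at or below the cutoff, most significant first."""
    cutoff = Decimal(max_evalue)
    kept = [r for r in rows if Decimal(r[1]) <= cutoff]
    return sorted(kept, key=lambda r: Decimal(r[1]))

top = best_hits(hits)
print([name for name, _, _ in top])

# True: the best E-value is smaller than the smallest normal double
below_normal = Decimal("9.4e-311") < Decimal(str(sys.float_info.min))
print(below_normal)
```

With this cutoff only the two accessions with E-values near 1e-310 are kept; the Ricinus communis and Theobroma cacao hits fall outside it.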
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0009451      RNA modification
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0005515      protein binding
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU54493      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla021114     Cla021114.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU54493      WMU54493     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR
    coord: 201..225  score: 0
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2
    coord: 481..523  score: 2.6E-7
    coord: 340..388  score: 8.9E-10
    coord: 410..454  score: 9.9
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756
    coord: 515..546  score: 4.2E-5
    coord: 201..225  score: 9.2E-4
    coord: 378..410  score: 4.2E-5
    coord: 480..512  score: 4.7E-7
    coord: 412..445  score: 8.0E-6
    coord: 307..340  score: 0.0017
    coord: 342..375  score: 4.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR
    coord: 375..409  score: 10.435
    coord: 340..374  score: 11.126
    coord: 197..227  score: 7.75
    coord: 512..546  score: 10.676
    coord: 477..511  score: 10.742
    coord: 235..269  score: 6.544
    coord: 410..444  score: 12.704
    coord: 304..339  score: 10
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED
    coord: 175..589  score: 1.0E-231
    coord: 114..135  score: 1.0E
None       No IPR available          PANTHER   PTHR24015:SF227  SUBFAMILY NOT NAMED
    coord: 175..589  score: 1.0E-231
    coord: 114..135  score: 1.0E
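
The `coord: start..end` fragments give the position of each predicted repeat on the protein. As a rough sketch, the PROFILE (PS51375) coordinates listed above can be sorted to show repeat lengths and the spacing between consecutive repeats. The helper `parse_coords` and the space-joined input string are assumptions made for illustration; scores are left out.

```python
import re

COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def parse_coords(text):
    """Return sorted (start, end) pairs found in 'coord: a..b' fragments."""
    return sorted((int(a), int(b)) for a, b in COORD.findall(text))

# PS51375 coordinates copied from the InterPro annotation of Cla021114
raw = ("coord: 375..409 coord: 340..374 coord: 197..227 coord: 512..546 "
       "coord: 477..511 coord: 235..269 coord: 410..444 coord: 304..339")

repeats = parse_coords(raw)
# Inclusive length of each repeat, in residues
lengths = [end - start + 1 for start, end in repeats]
# Residues between the end of one repeat and the start of the next
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(repeats, repeats[1:])]

print(repeats)
print(lengths)
print(gaps)
```

Most of the lengths come out at 35 residues, in line with the usual description of the pentatricopeptide repeat as a 35-amino-acid motif, and a gap of 0 marks repeats arranged in tandem.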