Cla023458 (gene) Watermelon (97103) v1

Name: Cla023458
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat (AHRD V1 ***- A2Q460_MEDTR); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr11 : 20909540 .. 20911264 (+)
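
As a quick consistency check on the location above, the span length can be worked out from the coordinates. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the whole span is coding sequence (both are assumptions, not stated on this page). The function names are illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends in a single stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


length_bp = span_length(20909540, 20911264)
print(length_bp, expected_protein_length(length_bp))
```

Under those assumptions the span is 1725 bp, which corresponds to 574 codons plus one stop codon.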
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATTCGGCGCTTCCCGGCGTTTTCTTTCCCATCAATTCAGAGGGTGCTTTTTGGGACGTCTCGCCAGTTGCAGGTATCACTATCCCTTACTTTACCCGCCCTCGCCCTCATCGGCTTTATCATTCTTGTTTTCAACCCTAGACGAACCACCAAATCTATTTGATGATGGTGTTTTGGCTGATGGGACTCGGAATCAACGCAGCATAGACGAGCGTTTCGTTATCAGCGAACTTTCTGATCTCTTACTAGTTAATCCTCATGGTTCGGTCTCTAACACTCTCAATGAGAATCCTTCTGAGAAACAGATGCCAATTAGGGCGGTTGATGGATTTTTACTACCGGAAGAAAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACTGCAATTGAACATGCTTTAGCTAATACTGATGTGAACTTGAGCCAAGATGTTGTCAATAAAGTACTAAACACGGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATACTTCAAGTTACAACATAATTCTTAAAGCTCTAGGTAGAAGAAGGTTTTTTGATTCCATGATGCATGTTTTGCACAAAATGACACGGGAGGGAGTGAATGCGAACATGGAAACAGTGTCCATTGTGGTAGACAGTTTGGTGAAGGCTCACCAAGTTTCTAAGGCACTTCAGTTATTCAGAAACTTGAAAGAAATTGGGTTGGAATGTGATACTGAAACCTTGAATATTCTTCTAGAATGCATGTGTCGACGATCCCACGTGGGTGCTGCAAACTCCTTCTTTAATTTAATCAAGGGGAATGTTCCTTTCAATGCTATGTCATATAACATTATAATTGGTGGATGGTCAAGATATGGTAGGCACAGTGAAGTCGAGCGAATTTTGAAAGCAATGGAAGTTGATGGATTTTCTCCAGATTGTCTGACCTACACTTATCTTATTGAGTGTCTTGGCAGAGCTAATCGCATTGACGACGCTGTCAAGATCTTTGATAAAATGTATGAAAATGGCTGTAAGCCAGATGTCAATGCTTATAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATGAATGCCTGACCTATTACAAGCGTATGTTGAGTAATAGATGTGAACCTGACATCAACACCTATTCCAATTTGATTATTGGCTTTCTTAAAGCCAAGAAAGTGGCCGATGCACTAGAAATGTTTGATGAGATGGTGCCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATGAAACTTAGCTGTAGTTATGGTCCTCCGCATGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCTTATAAATTGTTGCTAATGCGGCTTTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGATACTTATGAGCATGCCATTGACTGTCTTTGTAAAACCGGGCAGCTTGAAAATGCTGTACTCGTCATGGAGGAATGCTTACGTCAGGGTTTCTTTCCAAGTAGGCAAATACGTAGTAAGCTTAATAACAAACTATTGGACTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAATTTGCAAAGATGTTGGCGTGCTAAGGGATGGCATCATTGA

mRNA sequence

ATGGCATTCGGCGCTTCCCGGCGTTTTCTTTCCCATCAATTCAGAGGGTGCTTTTTGGGACGTCTCGCCAGTTGCAGGTATCACTATCCCTTACTTTACCCGCCCTCGCCCTCATCGGCTTTATCATTCTTGTTTTCAACCCTAGACGAACCACCAAATCTATTTGATGATGGTGTTTTGGCTGATGGGACTCGGAATCAACGCAGCATAGACGAGCGTTTCGTTATCAGCGAACTTTCTGATCTCTTACTAGTTAATCCTCATGGTTCGGTCTCTAACACTCTCAATGAGAATCCTTCTGAGAAACAGATGCCAATTAGGGCGGTTGATGGATTTTTACTACCGGAAGAAAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACTGCAATTGAACATGCTTTAGCTAATACTGATGTGAACTTGAGCCAAGATGTTGTCAATAAAGTACTAAACACGGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATACTTCAAGTTACAACATAATTCTTAAAGCTCTAGGTAGAAGAAGGTTTTTTGATTCCATGATGCATGTTTTGCACAAAATGACACGGGAGGGAGTGAATGCGAACATGGAAACAGTGTCCATTGTGGTAGACAGTTTGGTGAAGGCTCACCAAGTTTCTAAGGCACTTCAGTTATTCAGAAACTTGAAAGAAATTGGGTTGGAATGTGATACTGAAACCTTGAATATTCTTCTAGAATGCATGTGTCGACGATCCCACGTGGGTGCTGCAAACTCCTTCTTTAATTTAATCAAGGGGAATGTTCCTTTCAATGCTATGTCATATAACATTATAATTGGTGGATGGTCAAGATATGGTAGGCACAGTGAAGTCGAGCGAATTTTGAAAGCAATGGAAGTTGATGGATTTTCTCCAGATTGTCTGACCTACACTTATCTTATTGAGTGTCTTGGCAGAGCTAATCGCATTGACGACGCTGTCAAGATCTTTGATAAAATGTATGAAAATGGCTGTAAGCCAGATGTCAATGCTTATAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATGAATGCCTGACCTATTACAAGCGTATGTTGAGTAATAGATGTGAACCTGACATCAACACCTATTCCAATTTGATTATTGGCTTTCTTAAAGCCAAGAAAGTGGCCGATGCACTAGAAATGTTTGATGAGATGGTGCCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATGAAACTTAGCTGTAGTTATGGTCCTCCGCATGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCTTATAAATTGTTGCTAATGCGGCTTTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGATACTTATGAGCATGCCATTGACTGTCTTTGTAAAACCGGGCAGCTTGAAAATGCTGTACTCGTCATGGAGGAATGCTTACGTCAGGGTTTCTTTCCAAGTAGGCAAATACGTAGTAAGCTTAATAACAAACTATTGGACTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAATTTGCAAAGATGTTGGCGTGCTAAGGGATGGCATCATTGA

Coding sequence (CDS)

ATGGCATTCGGCGCTTCCCGGCGTTTTCTTTCCCATCAATTCAGAGGGTGCTTTTTGGGACGTCTCGCCAGTTGCAGGTATCACTATCCCTTACTTTACCCGCCCTCGCCCTCATCGGCTTTATCATTCTTGTTTTCAACCCTAGACGAACCACCAAATCTATTTGATGATGGTGTTTTGGCTGATGGGACTCGGAATCAACGCAGCATAGACGAGCGTTTCGTTATCAGCGAACTTTCTGATCTCTTACTAGTTAATCCTCATGGTTCGGTCTCTAACACTCTCAATGAGAATCCTTCTGAGAAACAGATGCCAATTAGGGCGGTTGATGGATTTTTACTACCGGAAGAAAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACTGCAATTGAACATGCTTTAGCTAATACTGATGTGAACTTGAGCCAAGATGTTGTCAATAAAGTACTAAACACGGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATACTTCAAGTTACAACATAATTCTTAAAGCTCTAGGTAGAAGAAGGTTTTTTGATTCCATGATGCATGTTTTGCACAAAATGACACGGGAGGGAGTGAATGCGAACATGGAAACAGTGTCCATTGTGGTAGACAGTTTGGTGAAGGCTCACCAAGTTTCTAAGGCACTTCAGTTATTCAGAAACTTGAAAGAAATTGGGTTGGAATGTGATACTGAAACCTTGAATATTCTTCTAGAATGCATGTGTCGACGATCCCACGTGGGTGCTGCAAACTCCTTCTTTAATTTAATCAAGGGGAATGTTCCTTTCAATGCTATGTCATATAACATTATAATTGGTGGATGGTCAAGATATGGTAGGCACAGTGAAGTCGAGCGAATTTTGAAAGCAATGGAAGTTGATGGATTTTCTCCAGATTGTCTGACCTACACTTATCTTATTGAGTGTCTTGGCAGAGCTAATCGCATTGACGACGCTGTCAAGATCTTTGATAAAATGTATGAAAATGGCTGTAAGCCAGATGTCAATGCTTATAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATGAATGCCTGACCTATTACAAGCGTATGTTGAGTAATAGATGTGAACCTGACATCAACACCTATTCCAATTTGATTATTGGCTTTCTTAAAGCCAAGAAAGTGGCCGATGCACTAGAAATGTTTGATGAGATGGTGCCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATGAAACTTAGCTGTAGTTATGGTCCTCCGCATGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCTTATAAATTGTTGCTAATGCGGCTTTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGATACTTATGAGCATGCCATTGACTGTCTTTGTAAAACCGGGCAGCTTGAAAATGCTGTACTCGTCATGGAGGAATGCTTACGTCAGGGTTTCTTTCCAAGTAGGCAAATACGTAGTAAGCTTAATAACAAACTATTGGACTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAATTTGCAAAGATGTTGGCGTGCTAAGGGATGGCATCATTGA

Protein sequence

MAFGASRRFLSHQFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVLADGTRNQRSIDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLRGVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECDTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPRIIPTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHH
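
The protein sequence above can be reproduced from the coding sequence by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1), which is an assumption for this nuclear gene rather than something the page states. `translate` is a hypothetical helper and not part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 nt of the coding sequence listed in this record.
print(translate("ATGGCATTCGGCGCTTCCCGGCGTTTTCTTTCCCATCAATTCAGAGGGTGC"))
```

The printed prefix can be compared by eye with the start of the protein sequence given above.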
BLAST of Cla023458 vs. Swiss-Prot
Match: PP416_ARATH (Putative pentatricopeptide repeat-containing protein At5g43820 OS=Arabidopsis thaliana GN=At5g43820 PE=3 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 3.9e-163
Identity = 280/514 (54.47%), Positives = 376/514 (73.15%), Query Frame = 1

Query: 61  ADGTRNQRSIDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLR 120
           A  + N   +DE +V++ELS LL ++ +   S +  ++ S+ Q+   A+D FL  E+KLR
Sbjct: 36  ASESLNHGVVDESYVLAELSSLLPISSN-KTSVSKEDSSSKNQV---AIDSFLSAEDKLR 95

Query: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDT 180
           GVFLQKL GK+AI+ +L++  + LS D+V  VLN G+L  EAMVTFF WA+++P + KD 
Sbjct: 96  GVFLQKLKGKSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDV 155

Query: 181 SSYNIILKALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRN 240
            SY++IL+ALGRR+ F  MM VL  M  EGVN ++E ++I +DS V+ H V +A++LF  
Sbjct: 156 GSYSVILRALGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEE 215

Query: 241 LKEIGLECDTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRH 300
            +  G++C TE+ N LL C+C RSHV AA S FN  KGN+PF++ SYNI+I GWS+ G  
Sbjct: 216 SESFGVKCSTESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEV 275

Query: 301 SEVERILKAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAM 360
            E+E++LK M   GF PDCL+Y++LIE LGR  RI+D+V+IFD +   G  PD N YNAM
Sbjct: 276 EEMEKVLKEMVESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAM 335

Query: 361 ISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPR-I 420
           I NFI   DFDE + YY+RML   CEP++ TYS L+ G +K +KV+DALE+F+EM+ R +
Sbjct: 336 ICNFISARDFDESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGV 395

Query: 421 IPTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLN 480
           +PTTG +TSF+K  CSYGPPHAAM+IY+K+RK GCRIS++AYKLLL RLS FGK GMLLN
Sbjct: 396 LPTTGLVTSFLKPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLN 455

Query: 481 IWNEMQESGYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLL 540
           +W+EMQESGY  DV+ YE+ +D LC  G LENAVLVMEE +R+GF P+R + S+L++KL+
Sbjct: 456 VWDEMQESGYPSDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLSSKLM 515

Query: 541 DCNRTEMAYKLWLKIKVARHQENLQRCWRAKGWH 574
             N+TE+AYKL+LKIK AR  EN +  WR+ GWH
Sbjct: 516 ASNKTELAYKLFLKIKKARATENARSFWRSNGWH 545
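
The identity and positives percentages in each HSP summary are ratios over the alignment length. The sketch below recomputes them from a summary line. The regular expression and the function name `parse_hsp_summary` are assumptions made for illustration, written against the line layout used in this report.

```python
import re

# Matches e.g. "Identity = 280/514 (54.47%), Positives = 376/514 (73.15%)".
# "Posi?tives" also accepts the "Postives" spelling seen in some exports.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line: str) -> dict:
    """Recompute percent identity and percent positives from an HSP summary."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, len_id, positive, len_pos = map(int, match.groups())
    return {
        "identity_pct": round(100 * identical / len_id, 2),
        "positives_pct": round(100 * positive / len_pos, 2),
        "alignment_length": len_id,
    }


print(parse_hsp_summary(
    "Identity = 280/514 (54.47%), Positives = 376/514 (73.15%), Query Frame = 1"
))
```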

BLAST of Cla023458 vs. Swiss-Prot
Match: PP293_ARATH (Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidopsis thaliana GN=At3g62470 PE=2 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 1.2e-39
Identity = 131/514 (25.49%), Positives = 223/514 (43.39%), Query Frame = 1

Query: 13  QFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVLADGTRNQRSIDE 72
           +F G    R+     ++P    P P S++  L ++L              G R   S   
Sbjct: 52  RFSGALFSRMIHSSTYHPYRQIPLPHSSVQLLDASL--------------GCRGFSSGS- 111

Query: 73  RFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLR--GVFLQKLNGK 132
               S +SD       G      +E  ++++  +  V+    PEE  R   V  +     
Sbjct: 112 ----SNVSD-------GCDEEVESECDNDEETGVSCVESSTNPEEVERVCKVIDELFALD 171

Query: 133 TAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKAL 192
             +E  L    ++LS D++ +VL       +    FF WA ++     D+ +YN ++  L
Sbjct: 172 RNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHDSRTYNSMMSIL 231

Query: 193 GRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECDT 252
            + R F++M+ VL +M  +G+   MET +I + +   A +  KA+ +F  +K+   +   
Sbjct: 232 AKTRQFETMVSVLEEMGTKGL-LTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGV 291

Query: 253 ETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKAM 312
           ET+N LL+ + R      A   F+ +K     N M+Y +++ GW R     E  RI   M
Sbjct: 292 ETINCLLDSLGRAKLGKEAQVLFDKLKERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDM 351

Query: 313 EVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGDF 372
              G  PD + +  ++E L R+ +  DA+K+F  M   G  P+V +Y  MI +F      
Sbjct: 352 IDQGLKPDIVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSM 411

Query: 373 DECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPRIIPTTG-AITSF 432
           +  + Y+  M+ +  +PD   Y+ LI GF   KK+    E+  EM  +  P  G    + 
Sbjct: 412 ETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNAL 471

Query: 433 MKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGY 492
           +KL  +   P  A  IY K  +     S + + +++    +   + M   +W EM + G 
Sbjct: 472 IKLMANQKMPEHATRIYNKMIQNEIEPSIHTFNMIMKSYFMARNYEMGRAVWEEMIKKGI 531

Query: 493 DPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQG 524
            PD ++Y   I  L   G+   A   +EE L +G
Sbjct: 532 CPDDNSYTVLIRGLIGEGKSREACRYLEEMLDKG 538

BLAST of Cla023458 vs. Swiss-Prot
Match: PP275_ARATH (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 154.5 bits (389), Expect = 3.6e-36
Identity = 114/468 (24.36%), Positives = 206/468 (44.02%), Query Frame = 1

Query: 91  VSNTLNENPSEKQMPIRAVDGFLLPEEKLRGVFLQKLNGKTAIEHALANTDVNLSQDVVN 150
           V +T  +N      P +  D F    EK+  +     +    +E AL  + ++L   ++ 
Sbjct: 42  VESTERKNGVGLVCPEKHEDEFAGEVEKIYRILRNHHSRVPKLELALNESGIDLRPGLII 101

Query: 151 KVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMHVLHKMTREG 210
           +VL+           FF WA KQP           ++  L + R F ++  ++ +M +  
Sbjct: 102 RVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTN 161

Query: 211 VNA-NMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECDTETLNILLECMCRRSHVGAA 270
                 E   +++     A+ V KA+++   + + GLE D      LL+ +C+   V  A
Sbjct: 162 PELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKEA 221

Query: 271 NSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTYLIECL 330
           +  F  ++   P N   +  ++ GW R G+  E + +L  M+  G  PD + +T L+   
Sbjct: 222 SKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGY 281

Query: 331 GRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGD--FDECLTYYKRMLSNRCEP 390
             A ++ DA  + + M + G +P+VN Y  +I   +C  +   DE +  +  M    CE 
Sbjct: 282 AHAGKMADAYDLMNDMRKRGFEPNVNCYTVLI-QALCRTEKRMDEAMRVFVEMERYGCEA 341

Query: 391 DINTYSNLIIGFLKAKKVADALEMFDEMVPR-IIPTTGAITSFMKLSCSYGPPHAAMLIY 450
           DI TY+ LI GF K   +     + D+M  + ++P+       M            + + 
Sbjct: 342 DIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELI 401

Query: 451 KKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVDTYEHAIDCLCKT 510
           +K ++ GC      Y +++      G+    + +WNEM+ +G  P VDT+   I+     
Sbjct: 402 EKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQ 461

Query: 511 GQLENAVLVMEECLRQGFFPSRQ---IRSKLNNKLLDCNRTEMAYKLW 552
           G L  A    +E + +G F + Q   ++S LNN + D ++ EMA  +W
Sbjct: 462 GFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRD-DKLEMAKDVW 507


HSP 2 Score: 85.1 bits (209), Expect = 2.7e-15
Identity = 69/343 (20.12%), Positives = 144/343 (41.98%), Query Frame = 1

Query: 202 VLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECDTETLNILLECMC 261
           VL +M   G+  ++   + ++     A +++ A  L  ++++ G E +     +L++ +C
Sbjct: 258 VLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALC 317

Query: 262 R--RSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDC 321
           R  +    A   F  + +     + ++Y  +I G+ ++G   +   +L  M   G  P  
Sbjct: 318 RTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQ 377

Query: 322 LTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGDFDECLTYYKR 381
           +TY  ++    +  + ++ +++ +KM   GC PD+  YN +I     +G+  E +  +  
Sbjct: 378 VTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNE 437

Query: 382 MLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPRII---PTTGAITSFMKLSCSY 441
           M +N   P ++T+  +I GF     + +A   F EMV R I   P  G + S +      
Sbjct: 438 MEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRD 497

Query: 442 GPPHAAMLIYK--KARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVD 501
                A  ++     +   C ++ +A+ + +  L   G      +   +M E    P  +
Sbjct: 498 DKLEMAKDVWSCISNKTSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPN 557

Query: 502 TYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNK 538
           TY   +  L K      A  + E+ ++      R++  K+  K
Sbjct: 558 TYAKLMKGLNKLYNRTIAAEITEKVVKMA--SEREMSFKMYKK 598

BLAST of Cla023458 vs. Swiss-Prot
Match: PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 151.8 bits (382), Expect = 2.3e-35
Identity = 108/421 (25.65%), Positives = 197/421 (46.79%), Query Frame = 1

Query: 137 LANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFF 196
           L+  DV +++ +V +VL   S G      FF WA  Q        +YN ++  LG+ R F
Sbjct: 123 LSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWANSQTGYVHSGHTYNAMVDVLGKCRNF 182

Query: 197 DSMMHVLHKMTR--EGVNANMETVSIVVDSLVKAHQVSKALQLFRNL-KEIGLECDTETL 256
           D M  ++++M +  E     ++T+S V+  L K+ + +KA+  F  + K  G++ DT  +
Sbjct: 183 DLMWELVNEMNKNEESKLVTLDTMSKVMRRLAKSGKYNKAVDAFLEMEKSYGVKTDTIAM 242

Query: 257 NILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKAMEVD 316
           N L++ + + + +  A+  F  +   +  +A ++NI+I G+ +  +  +   ++  M+V 
Sbjct: 243 NSLMDALVKENSIEHAHEVFLKLFDTIKPDARTFNILIHGFCKARKFDDARAMMDLMKVT 302

Query: 317 GFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGDFDEC 376
            F+PD +TYT  +E   +        ++ ++M ENGC P+V  Y  ++ +        E 
Sbjct: 303 EFTPDVVTYTSFVEAYCKEGDFRRVNEMLEEMRENGCNPNVVTYTIVMHSLGKSKQVAEA 362

Query: 377 LTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPRIIPTTGAI-TSFMKL 436
           L  Y++M  + C PD   YS+LI    K  +  DA E+F++M  + +     +  + +  
Sbjct: 363 LGVYEKMKEDGCVPDAKFYSSLIHILSKTGRFKDAAEIFEDMTNQGVRRDVLVYNTMISA 422

Query: 437 SCSYGPPHAAMLIYKKARK---VGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGY 496
           +  +     A+ + K+        C  +   Y  LL       K  +L  + + M ++  
Sbjct: 423 ALHHSRDEMALRLLKRMEDEEGESCSPNVETYAPLLKMCCHKKKMKLLGILLHHMVKNDV 482

Query: 497 DPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEMAYK 551
             DV TY   I  LC +G++E A L  EE +R+G  P       L ++L   N  E   K
Sbjct: 483 SIDVSTYILLIRGLCMSGKVEEACLFFEEAVRKGMVPRDSTCKMLVDELEKKNMAEAKLK 542

BLAST of Cla023458 vs. Swiss-Prot
Match: PPR54_ARATH (Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidopsis thaliana GN=At1g20300 PE=2 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 5.2e-35
Identity = 99/402 (24.63%), Positives = 180/402 (44.78%), Query Frame = 1

Query: 164 VTFFYWAIKQPSIP-KDTSSYNIILKALGRRRFFDSMMHVLHKMTREGVNANMETVSIVV 223
           + FF WA  +     K    YN ++   G+ R FD   H++  M    V  ++ET +I++
Sbjct: 134 LAFFNWATSRDDYDHKSPHPYNEMIDLSGKVRQFDLAWHLIDLMKSRNVEISIETFTILI 193

Query: 224 DSLVKAHQVSKALQLFRNLKEIGLECDTETLNILLECMCRRSHVGAANSFFNLIKGNVPF 283
              V+A   S+A+  F  +++ G   D    +I++  + R+     A SFF+ +K     
Sbjct: 194 RRYVRAGLASEAVHCFNRMEDYGCVPDKIAFSIVISNLSRKRRASEAQSFFDSLKDRFEP 253

Query: 284 NAMSYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIF 343
           + + Y  ++ GW R G  SE E++ K M++ G  P+  TY+ +I+ L R  +I  A  +F
Sbjct: 254 DVIVYTNLVRGWCRAGEISEAEKVFKEMKLAGIEPNVYTYSIVIDALCRCGQISRAHDVF 313

Query: 344 DKMYENGCKPDVNAYNAMISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKA 403
             M ++GC P+   +N ++   +  G  ++ L  Y +M    CEPD  TY+ LI    + 
Sbjct: 314 ADMLDSGCAPNAITFNNLMRVHVKAGRTEKVLQVYNQMKKLGCEPDTITYNFLIEAHCRD 373

Query: 404 KKVADALEMFDEMVPRIIPTTGA-ITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAY 463
           + + +A+++ + M+ +      +   +  +        + A  +Y K  +  C  +   Y
Sbjct: 374 ENLENAVKVLNTMIKKKCEVNASTFNTIFRYIEKKRDVNGAHRMYSKMMEAKCEPNTVTY 433

Query: 464 KLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVDTYEHAIDCLCKTGQLENAV-----LVM 523
            +L+          M+L +  EM +   +P+V+TY   +   C  G   NA      +V 
Sbjct: 434 NILMRMFVGSKSTDMVLKMKKEMDDKEVEPNVNTYRLLVTMFCGMGHWNNAYKLFKEMVE 493

Query: 524 EECLRQGFFPSRQIRSKLNNKLLDCNRTEMAYKLWLKIKVAR 559
           E+CL         + ++L          E+  K+  K  VAR
Sbjct: 494 EKCLTPSLSLYEMVLAQLRRAGQLKKHEELVEKMIQKGLVAR 535

BLAST of Cla023458 vs. TrEMBL
Match: A0A0A0L3E0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G663730 PE=4 SV=1)

HSP 1 Score: 1031.9 bits (2667), Expect = 2.9e-298
Identity = 505/574 (87.98%), Positives = 537/574 (93.55%), Query Frame = 1

Query: 1   MAFGASRRFLSHQFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVL 60
           MAFGASRR + +Q R CFLG +AS RYHYPL++ PSP  ALS+LFSTLDEP NLFDDG+ 
Sbjct: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSP--ALSYLFSTLDEPSNLFDDGLS 60

Query: 61  ADGTRNQRSIDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLR 120
            +G RNQR IDERFVISELSDLLLVNP+GSV NTL EN  EKQMP+RAVDGFLLPEEKLR
Sbjct: 61  GNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLR 120

Query: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDT 180
           GVFLQKLNGKTAIEHALANTDV LSQDVV+KVLNTGSLGSEAMVTFFYWAIKQPSIPKD 
Sbjct: 121 GVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDA 180

Query: 181 SSYNIILKALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRN 240
           SSYNIILKALGRR FFDSMM VL+ MTREGV A +E VSIVVDSLVK HQVSKALQ FRN
Sbjct: 181 SSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRN 240

Query: 241 LKEIGLECDTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRH 300
           LKEIGL+CDTETLNILL+CMCRRSHVGAANSFFNL KGN+PFN M+YNI+IGGWSRYGRH
Sbjct: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRH 300

Query: 301 SEVERILKAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAM 360
            EVE++LKAME+DGFSPDCLT+TYLIECLGRAN+IDDAVKIFDKM ENGC PDV+AYNAM
Sbjct: 301 GEVEQMLKAMELDGFSPDCLTHTYLIECLGRANQIDDAVKIFDKMDENGCTPDVDAYNAM 360

Query: 361 ISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPRII 420
           ISNFICIGDFD+CLTYY+RMLSNRCEPD+NTYSNLI GFLKAKKVADALEMFDEMV RII
Sbjct: 361 ISNFICIGDFDQCLTYYERMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVARII 420

Query: 421 PTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480
           PTTGAITSF++LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI
Sbjct: 421 PTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480

Query: 481 WNEMQESGYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLD 540
           WNEMQESGYDPDV+TYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSR+ RSKLNNKLL 
Sbjct: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLA 540

Query: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHH 575
           CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWH+
Sbjct: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of Cla023458 vs. TrEMBL
Match: A5C8V0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018999 PE=4 SV=1)

HSP 1 Score: 671.0 bits (1730), Expect = 1.3e-189
Identity = 337/565 (59.65%), Positives = 425/565 (75.22%), Query Frame = 1

Query: 10  LSHQFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVLADGTRNQRS 69
           ++ Q +G FL R +  RYH   L    PSS   F FSTL    N   D    +  +   +
Sbjct: 1   MASQLQG-FLSRFSRTRYHTRYL----PSSVSLFQFSTLQVTSNPLMDEPTDNQIKRPSN 60

Query: 70  IDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLRGVFLQKLNG 129
            +ER V+ +LS LL +  + S+S    EN  ++Q+  RAVDGFL P EKLRGVF+Q+L G
Sbjct: 61  FNERDVLYQLSGLLPICCNTSISKPFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLRG 120

Query: 130 KTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKA 189
           K AIE AL N  ++L+ D+V++V N G+LG EAMV FF WA+KQP+IPKD  +YN+I+KA
Sbjct: 121 KAAIELALTNVGIDLTIDIVSEVXNRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKA 180

Query: 190 LGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECD 249
           LGRR+F +  + VL  M  +G++ N ET+SIV+DS +KA QVSKA+++FRNL+E G +CD
Sbjct: 181 LGRRKFIEFXVXVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCD 240

Query: 250 TETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKA 309
           TE+LN+LL+C+C+RSHVGAAN FFN +KG +PFN M+YNIIIGGWS+YG+  E+ER LKA
Sbjct: 241 TESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKA 300

Query: 310 MEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGD 369
           M  DGFSP+CLT+++LIE LGRA RIDDAV++F  M E GC P+   YNA+ISNFI   D
Sbjct: 301 MVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTRD 360

Query: 370 FDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPR-IIPTTGAITS 429
           FDECL YY  M+S+ C+P+++TY+ LI+ FLKA+KVADALEM DEMV R +IPTTGAITS
Sbjct: 361 FDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITS 420

Query: 430 FMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESG 489
           F++  C YGPPHAAM+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLN+W+EMQESG
Sbjct: 421 FIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESG 480

Query: 490 YDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEMAY 549
           Y  D + YE+ I+ LC  GQL+ AVLVMEE L +GF PSR IRSKLNNKLL  N+ EMAY
Sbjct: 481 YSSDTEVYEYVINGLCNIGQLDTAVLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAY 540

Query: 550 KLWLKIKVARHQENLQRCWRAKGWH 574
           KL+LKIK AR  +N +R WR  GWH
Sbjct: 541 KLFLKIKXARQNDNARRFWRGNGWH 560

BLAST of Cla023458 vs. TrEMBL
Match: M5WDR4_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023340mg PE=4 SV=1)

HSP 1 Score: 663.3 bits (1710), Expect = 2.7e-187
Identity = 341/548 (62.23%), Positives = 421/548 (76.82%), Query Frame = 1

Query: 29  YPLLY-PPSPSSALSFLFSTLDEPPNLFDDGVLADGTRNQRSIDERFVISELSDLLLVNP 88
           YPL Y   SP S  S LFSTL    N   D       ++Q ++DE FV+  LS+LL ++ 
Sbjct: 19  YPLSYLVHSPIS--SSLFSTLYAQSNSLHD---EHRIKSQSTLDESFVLDRLSNLLPISR 78

Query: 89  HGSVSNTLNE-NPSEKQMPIRAVDGFLLPEEKLRGVFLQKLNGKTAIEHALANTDVNLSQ 148
             S + TL E + S+KQ+ IR VDGFLLP+EKLRGVFLQKL G  AIEHAL N  V+LS 
Sbjct: 79  SNSSTATLFEPSNSDKQIEIRTVDGFLLPDEKLRGVFLQKLRGTAAIEHALDNGGVDLSV 138

Query: 149 DVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMHVLHKM 208
           DVV +V+N G LG+EAM+ FF WAI++P+I K   +Y+IILKALGRR+FF  MM +LH M
Sbjct: 139 DVVAQVVNRGGLGAEAMLVFFNWAIRKPTIAKYIETYHIILKALGRRKFFTHMMQILHHM 198

Query: 209 TREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECDTETLNILLECMCRRSHV 268
             +G++ N+ET+SIV+DS V+A  VSKA+Q+FRNL+EIGLECDTE+LN+LL+C+C+RSHV
Sbjct: 199 RAQGISPNLETISIVMDSFVRAQHVSKAIQMFRNLEEIGLECDTESLNLLLQCLCQRSHV 258

Query: 269 GAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTYLI 328
           GAANSF N +KG + FN  +YNIIIGGWSR+GR SE+ERIL+AM  DGFS D  T+++++
Sbjct: 259 GAANSFLNSVKGKIQFNGNTYNIIIGGWSRHGRVSEIERILEAMVADGFSADSSTFSFIL 318

Query: 329 ECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGDFDECLTYYKRMLSNRCE 388
           E LGRA RIDDAV+IFD M   GC PD   YNAMISNFI + +FDEC+ YYK M SN C+
Sbjct: 319 EGLGRAGRIDDAVEIFDSMKGKGCMPDTRVYNAMISNFISVRNFDECVRYYKGMSSNSCD 378

Query: 389 PDINTYSNLIIGFLKAKKVADALEMFDEMVPR-IIPTTGAITSFMKLSCSYGPPHAAMLI 448
           P+I+TY+ LI  FLKA+KVA ALEMFDEM+ R ++PTTG ITSF++  CSYGPP+AAM+I
Sbjct: 379 PNIDTYTKLIAAFLKARKVAGALEMFDEMLGRGLVPTTGTITSFIEPLCSYGPPYAAMMI 438

Query: 449 YKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVDTYEHAIDCLCK 508
           YKKARKVGCRIS +AYKLLLMRLS FGK GMLLNIW +MQE GY  D + Y++ I+ LC 
Sbjct: 439 YKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNIWEDMQECGYASDKEVYDYVINGLCN 498

Query: 509 TGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEMAYKLWLKIKVARHQENLQR 568
            G LENAVLVMEE L++GF PSR + SKLNNKLL  N+ E AYKL+LKIK AR  +N QR
Sbjct: 499 IGHLENAVLVMEESLQKGFCPSRLVYSKLNNKLLASNKVERAYKLFLKIKHARRYDNAQR 558

Query: 569 CWRAKGWH 574
            WR+KGWH
Sbjct: 559 FWRSKGWH 561

BLAST of Cla023458 vs. TrEMBL
Match: D7UDB7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g01120 PE=4 SV=1)

HSP 1 Score: 656.8 bits (1693), Expect = 2.5e-185
Identity = 318/506 (62.85%), Positives = 403/506 (79.64%), Query Frame = 1

Query: 69  SIDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLRGVFLQKLN 128
           + +ER V+ +LS LL +  + S+S    EN  ++Q+  RAVDGFL P EKLRGVF+Q+L 
Sbjct: 14  NFNERDVLYQLSGLLPICCNTSISKPFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLR 73

Query: 129 GKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILK 188
           GK AIE AL N  ++L+ D+V++V+N G+LG EAMV FF WA+KQP+IPKD  +YN+I+K
Sbjct: 74  GKAAIELALTNVGIDLTIDIVSEVINRGNLGGEAMVIFFNWAVKQPTIPKDVDTYNVIIK 133

Query: 189 ALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLEC 248
           ALGRR+F + ++ VL  M  +G++ N ET+SIV+DS +KA QVSKA+++FRNL+E G +C
Sbjct: 134 ALGRRKFIEFVVKVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKC 193

Query: 249 DTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILK 308
           DTE+LN+LL+C+C+RSHVGAAN FFN +KG +PFN M+YNIIIGGWS+YG+  E+ER LK
Sbjct: 194 DTESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLK 253

Query: 309 AMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIG 368
           AM  DGFSP+CLT+++LIE LGRA RIDDAV++F  M E GC P+   YNA+ISNFI   
Sbjct: 254 AMVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTR 313

Query: 369 DFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPR-IIPTTGAIT 428
           DFDECL YY  M+S+ C+P+++TY+ LI+ FLKA+KVADALEM DEMV R +IPTTGAIT
Sbjct: 314 DFDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAIT 373

Query: 429 SFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQES 488
           SF++  C YGPPHAAM+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLN+W+EMQES
Sbjct: 374 SFIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQES 433

Query: 489 GYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEMA 548
           GY  D + YE+ I+ LC  GQL+ AVLVMEE L +GF PSR IRSKLNNKLL  N+ EMA
Sbjct: 434 GYSSDTEVYEYVINGLCNIGQLDTAVLVMEESLHKGFCPSRLIRSKLNNKLLASNKVEMA 493

Query: 549 YKLWLKIKVARHQENLQRCWRAKGWH 574
           YKL+LKIK+AR  +N +R WR  GWH
Sbjct: 494 YKLFLKIKIARQNDNARRFWRGNGWH 519

BLAST of Cla023458 vs. TrEMBL
Match: A0A061FE27_THECC (Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_034412 PE=4 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 7.1e-180
Identity = 327/567 (57.67%), Positives = 412/567 (72.66%), Query Frame = 1

Query: 9   FLSHQFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVLADGTRNQR 68
           F  H   G  L      R H P +   S + + S L  +  + P+        +   NQ 
Sbjct: 3   FHFHHLHGVSLS-FNRARNHLPCINSFSSAFSFSTLSDSSIKEPSF-------NQISNQS 62

Query: 69  SIDERFVISELSDLL-LVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLRGVFLQKL 128
           ++DER V+ ELSDL    + + +V     E+   KQ+   AVD +LLPEEKLRGVFLQKL
Sbjct: 63  TVDERRVLGELSDLFQFSHSNATVPYPYRESYPPKQIESGAVDEYLLPEEKLRGVFLQKL 122

Query: 129 NGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIIL 188
            GKTAIEHAL+N  V LS D++ KV+N G+LG EAMV FF WA+KQP I +D  SY II+
Sbjct: 123 RGKTAIEHALSNVPVELSIDIIAKVVNIGNLGGEAMVLFFNWAMKQPGIARDIHSYYIII 182

Query: 189 KALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLE 248
           KALGRR+FF  M+  LH M +EG+  ++ET+SIV+DS ++A +V KA++ F NL+E+GL+
Sbjct: 183 KALGRRKFFKFMIETLHDMVKEGIKPDVETLSIVMDSFIRAQRVQKAIETFENLEELGLK 242

Query: 249 CDTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERIL 308
            DT++LN+LL+C+CRR+HVGAANS FN + G V FN  +YNI+I GWS+ GR S++ERIL
Sbjct: 243 RDTKSLNVLLQCLCRRAHVGAANSLFNAVNGKVKFNCDTYNIMISGWSKLGRVSKIERIL 302

Query: 309 KAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICI 368
           KAM  D F+PDC T++YLIE LGRA RIDDAV+IFD M E GC PD   YNAMISNFI +
Sbjct: 303 KAMIADEFTPDCSTFSYLIEGLGRAGRIDDAVEIFDHMKEKGCIPDTRVYNAMISNFISV 362

Query: 369 GDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEM-VPRIIPTTGAI 428
           G+FDEC+ YYK +L++  +PD++TY+ LI  FLKA+ VADALE+FDEM V  I+PTTG +
Sbjct: 363 GNFDECMKYYKGLLNSNSDPDVDTYTKLISAFLKAQNVADALEIFDEMLVQGIVPTTGTL 422

Query: 429 TSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQE 488
           TSF++  CSYGPP+AAM+ YKKARK GC+IS +AYKLLLMRLS FGK GMLLNIW+EMQE
Sbjct: 423 TSFVEPLCSYGPPYAAMMFYKKARKFGCKISLSAYKLLLMRLSRFGKCGMLLNIWDEMQE 482

Query: 489 SGYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEM 548
           SG+  D++ YEH I+ LC  G LENAVLVMEE LR+GF PSR + SKLNNKLL  N  E 
Sbjct: 483 SGHTSDMEVYEHVINGLCNIGHLENAVLVMEEALRKGFCPSRVLYSKLNNKLLASNEVEK 542

Query: 549 AYKLWLKIKVARHQENLQRCWRAKGWH 574
           AYKL+LKIK AR  EN +R WRA GWH
Sbjct: 543 AYKLFLKIKNARRDENARRYWRANGWH 561

BLAST of Cla023458 vs. NCBI nr
Match: gi|659104798|ref|XP_008452985.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis melo])

HSP 1 Score: 1033.9 bits (2672), Expect = 1.1e-298
Identity = 510/574 (88.85%), Positives = 539/574 (93.90%), Query Frame = 1

Query: 1   MAFGASRRFLSHQFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVL 60
           MAFGASRR L +Q + CFLG +AS RYHYPL++ PSP  ALS+LFSTLDEP NLFDDGV 
Sbjct: 1   MAFGASRRLLPYQVKACFLGLIASGRYHYPLIHSPSP--ALSYLFSTLDEPSNLFDDGVS 60

Query: 61  ADGTRNQRSIDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLR 120
            +G RNQR IDERFVISELSDLLLVNPHGSVSNT+ EN +EKQ+PIRAVDGFLLPEEKLR
Sbjct: 61  GNGDRNQRCIDERFVISELSDLLLVNPHGSVSNTVKENLTEKQVPIRAVDGFLLPEEKLR 120

Query: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDT 180
           GVFLQKLNGKTAIEHALANTDVNLSQDVV+KVLNTGSLGSEAMVTFFYW+IKQPSIPKD 
Sbjct: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVSKVLNTGSLGSEAMVTFFYWSIKQPSIPKDA 180

Query: 181 SSYNIILKALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRN 240
           SSYNIILKALGRR FFDSMM VL+ MTREGV+A +ETVSIVVDSLVKAHQVSKALQ FRN
Sbjct: 181 SSYNIILKALGRRGFFDSMMDVLYSMTREGVDATLETVSIVVDSLVKAHQVSKALQFFRN 240

Query: 241 LKEIGLECDTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRH 300
           LKEIGL+CDTETLNILL+CMCRRSHVGAANSF NL KG++PFN M+YNIIIGGWSRYGRH
Sbjct: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFLNLTKGSIPFNVMTYNIIIGGWSRYGRH 300

Query: 301 SEVERILKAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAM 360
           SEVE+ LKAMEVDGFSPD LT+TYLIECLGRANRIDDAVKIFDKM E GC PDV AYNAM
Sbjct: 301 SEVEQTLKAMEVDGFSPDYLTHTYLIECLGRANRIDDAVKIFDKMDEKGCTPDVAAYNAM 360

Query: 361 ISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPRII 420
           ISNFICIGDFD+CLTYYKRMLSNRCEPD+NTYSNLI GFLKAKKVADALEMFDEMV RII
Sbjct: 361 ISNFICIGDFDQCLTYYKRMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVARII 420

Query: 421 PTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480
           PTTGAITSF++LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLL+I
Sbjct: 421 PTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLSI 480

Query: 481 WNEMQESGYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLD 540
           WNEMQESGYDPDV+TYEHAI CLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLL 
Sbjct: 481 WNEMQESGYDPDVETYEHAIGCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLA 540

Query: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHH 575
           CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWH+
Sbjct: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of Cla023458 vs. NCBI nr
Match: gi|778697198|ref|XP_011654277.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis sativus])

HSP 1 Score: 1031.9 bits (2667), Expect = 4.1e-298
Identity = 505/574 (87.98%), Positives = 537/574 (93.55%), Query Frame = 1

Query: 1   MAFGASRRFLSHQFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVL 60
           MAFGASRR + +Q R CFLG +AS RYHYPL++ PSP  ALS+LFSTLDEP NLFDDG+ 
Sbjct: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSP--ALSYLFSTLDEPSNLFDDGLS 60

Query: 61  ADGTRNQRSIDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLR 120
            +G RNQR IDERFVISELSDLLLVNP+GSV NTL EN  EKQMP+RAVDGFLLPEEKLR
Sbjct: 61  GNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLR 120

Query: 121 GVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDT 180
           GVFLQKLNGKTAIEHALANTDV LSQDVV+KVLNTGSLGSEAMVTFFYWAIKQPSIPKD 
Sbjct: 121 GVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDA 180

Query: 181 SSYNIILKALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRN 240
           SSYNIILKALGRR FFDSMM VL+ MTREGV A +E VSIVVDSLVK HQVSKALQ FRN
Sbjct: 181 SSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRN 240

Query: 241 LKEIGLECDTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRH 300
           LKEIGL+CDTETLNILL+CMCRRSHVGAANSFFNL KGN+PFN M+YNI+IGGWSRYGRH
Sbjct: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRH 300

Query: 301 SEVERILKAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAM 360
            EVE++LKAME+DGFSPDCLT+TYLIECLGRAN+IDDAVKIFDKM ENGC PDV+AYNAM
Sbjct: 301 GEVEQMLKAMELDGFSPDCLTHTYLIECLGRANQIDDAVKIFDKMDENGCTPDVDAYNAM 360

Query: 361 ISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPRII 420
           ISNFICIGDFD+CLTYY+RMLSNRCEPD+NTYSNLI GFLKAKKVADALEMFDEMV RII
Sbjct: 361 ISNFICIGDFDQCLTYYERMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVARII 420

Query: 421 PTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480
           PTTGAITSF++LSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI
Sbjct: 421 PTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480

Query: 481 WNEMQESGYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLD 540
           WNEMQESGYDPDV+TYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSR+ RSKLNNKLL 
Sbjct: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLA 540

Query: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHH 575
           CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWH+
Sbjct: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572
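
The identity and positive-match figures in an HSP header can be rechecked from the alignment midline: a letter marks an identical residue, a `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of that arithmetic, not the code used to generate this page; the function names `midline_counts` and `percent` are hypothetical.

```python
from typing import Tuple


def midline_counts(midline: str) -> Tuple[int, int]:
    """Return (identities, positives) for one BLAST midline string.

    Letters are identical residues; '+' marks a conservative substitution.
    Positives include identities, as in the HSP header.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives


def percent(matches: int, length: int) -> float:
    """Percentage of matches over the alignment length, to two decimals."""
    return round(100.0 * matches / length, 2)


# Midline of the last row of the Cucumis sativus alignment (34 columns).
last_row = "CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWH+"
row_identities, row_positives = midline_counts(last_row)

# Totals reported in the header of the same HSP (alignment length 574).
identity_pct = percent(505, 574)
positive_pct = percent(537, 574)
```

Summing the per-row counts over every row of an HSP should reproduce the header totals, provided the midline strings are copied with their spacing intact.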

BLAST of Cla023458 vs. NCBI nr
Match: gi|147865347|emb|CAN84084.1| (hypothetical protein VITISV_018999 [Vitis vinifera])

HSP 1 Score: 671.0 bits (1730), Expect = 1.8e-189
Identity = 337/565 (59.65%), Positives = 425/565 (75.22%), Query Frame = 1

Query: 10  LSHQFRGCFLGRLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFDDGVLADGTRNQRS 69
           ++ Q +G FL R +  RYH   L    PSS   F FSTL    N   D    +  +   +
Sbjct: 1   MASQLQG-FLSRFSRTRYHTRYL----PSSVSLFQFSTLQVTSNPLMDEPTDNQIKRPSN 60

Query: 70  IDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPEEKLRGVFLQKLNG 129
            +ER V+ +LS LL +  + S+S    EN  ++Q+  RAVDGFL P EKLRGVF+Q+L G
Sbjct: 61  FNERDVLYQLSGLLPICCNTSISKPFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLRG 120

Query: 130 KTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKA 189
           K AIE AL N  ++L+ D+V++V N G+LG EAMV FF WA+KQP+IPKD  +YN+I+KA
Sbjct: 121 KAAIELALTNVGIDLTIDIVSEVXNRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKA 180

Query: 190 LGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECD 249
           LGRR+F +  + VL  M  +G++ N ET+SIV+DS +KA QVSKA+++FRNL+E G +CD
Sbjct: 181 LGRRKFIEFXVXVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCD 240

Query: 250 TETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKA 309
           TE+LN+LL+C+C+RSHVGAAN FFN +KG +PFN M+YNIIIGGWS+YG+  E+ER LKA
Sbjct: 241 TESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKA 300

Query: 310 MEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGD 369
           M  DGFSP+CLT+++LIE LGRA RIDDAV++F  M E GC P+   YNA+ISNFI   D
Sbjct: 301 MVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVYNALISNFISTRD 360

Query: 370 FDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMVPR-IIPTTGAITS 429
           FDECL YY  M+S+ C+P+++TY+ LI+ FLKA+KVADALEM DEMV R +IPTTGAITS
Sbjct: 361 FDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITS 420

Query: 430 FMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESG 489
           F++  C YGPPHAAM+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLN+W+EMQESG
Sbjct: 421 FIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESG 480

Query: 490 YDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEMAY 549
           Y  D + YE+ I+ LC  GQL+ AVLVMEE L +GF PSR IRSKLNNKLL  N+ EMAY
Sbjct: 481 YSSDTEVYEYVINGLCNIGQLDTAVLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAY 540

Query: 550 KLWLKIKVARHQENLQRCWRAKGWH 574
           KL+LKIK AR  +N +R WR  GWH
Sbjct: 541 KLFLKIKXARQNDNARRFWRGNGWH 560

BLAST of Cla023458 vs. NCBI nr
Match: gi|645265054|ref|XP_008237970.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Prunus mume])

HSP 1 Score: 667.2 bits (1720), Expect = 2.7e-188
Identity = 342/550 (62.18%), Positives = 421/550 (76.55%), Query Frame = 1

Query: 29  YPLLY---PPSPSSALSFLFSTLDEPPNLFDDGVLADGTRNQRSIDERFVISELSDLLLV 88
           YPL Y    P PSS    LFSTL    N   D       +NQ ++DE FV+ +LS+LL +
Sbjct: 19  YPLSYLVRSPIPSS----LFSTLYAQSNSLHD---EHRIKNQSTLDESFVLDQLSNLLPI 78

Query: 89  NPHGSVSNTLNE-NPSEKQMPIRAVDGFLLPEEKLRGVFLQKLNGKTAIEHALANTDVNL 148
               S + TL E + S+KQ+ IRAVDGFLLP+EKLRGVFLQKL G  AIEHAL N  V+L
Sbjct: 79  CRSNSSTATLFEPSNSDKQIEIRAVDGFLLPDEKLRGVFLQKLRGTAAIEHALDNGGVDL 138

Query: 149 SQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSIPKDTSSYNIILKALGRRRFFDSMMHVLH 208
           S DVV +V+N G LG+EAM+ FF WAI++P+I K+  +++IILKALGRR+FF  MM +LH
Sbjct: 139 SVDVVAQVVNRGGLGAEAMLVFFNWAIRKPTIAKNIETFHIILKALGRRKFFTHMMQILH 198

Query: 209 KMTREGVNANMETVSIVVDSLVKAHQVSKALQLFRNLKEIGLECDTETLNILLECMCRRS 268
            M  +G+  N+ET+SIV+DS V+A  VSKA+Q+FRNL+EIGLECDTE+LN+LL+C+C+RS
Sbjct: 199 HMRAQGIRPNLETISIVMDSFVRAQHVSKAIQMFRNLEEIGLECDTESLNLLLQCLCQRS 258

Query: 269 HVGAANSFFNLIKGNVPFNAMSYNIIIGGWSRYGRHSEVERILKAMEVDGFSPDCLTYTY 328
           HVGAANSF N +KG + FN  +YNIIIGGWSR+GR SE+ERIL+AM  DGFS D  T+++
Sbjct: 259 HVGAANSFLNSVKGKIQFNGNTYNIIIGGWSRHGRVSEIERILEAMVADGFSADSSTFSF 318

Query: 329 LIECLGRANRIDDAVKIFDKMYENGCKPDVNAYNAMISNFICIGDFDECLTYYKRMLSNR 388
           ++E LGRA  IDDAV+IFD M   GC PD   YNAMISNFI + +FDEC+ YYK M SN 
Sbjct: 319 ILEGLGRAGHIDDAVEIFDSMKGKGCMPDTRVYNAMISNFISVRNFDECVRYYKGMSSNS 378

Query: 389 CEPDINTYSNLIIGFLKAKKVADALEMFDEMVPR-IIPTTGAITSFMKLSCSYGPPHAAM 448
           C P+I+TY+ LI  FLKA+KVADALEMFDEM+ R ++PTTG ITSF++  CSYGPP+AAM
Sbjct: 379 CNPNIDTYTKLIAAFLKARKVADALEMFDEMLGRGLVPTTGTITSFIEPLCSYGPPYAAM 438

Query: 449 LIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVDTYEHAIDCL 508
           +IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLNIW +MQE GY  D + Y++ I+ L
Sbjct: 439 MIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNIWEDMQECGYASDKEVYDYVINGL 498

Query: 509 CKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLDCNRTEMAYKLWLKIKVARHQENL 568
           C  G LENAVLVMEE L++GF PSR + SKLNNKLL  N+ E AYKL+LKIK AR  +N 
Sbjct: 499 CNIGHLENAVLVMEESLQKGFCPSRLVYSKLNNKLLASNKVERAYKLFLKIKHARRYDNA 558

Query: 569 QRCWRAKGWH 574
           QR WR+KGWH
Sbjct: 559 QRFWRSKGWH 561

BLAST of Cla023458 vs. NCBI nr
Match: gi|1009155752|ref|XP_015895879.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Ziziphus jujuba])

HSP 1 Score: 665.6 bits (1716), Expect = 7.7e-188
Identity = 339/578 (58.65%), Positives = 434/578 (75.09%), Query Frame = 1

Query: 1   MAFGASRR-FLSHQFRGCFLG---RLASCRYHYPLLYPPSPSSALSFLFSTLDEPPNLFD 60
           MAFG     FL+ Q +  +LG    L   R+ +P L  P P     F FSTL +    F+
Sbjct: 1   MAFGGIPWCFLASQSQR-YLGLSRHLRRARHPFPCLRLPIPL----FSFSTLSDSSYTFN 60

Query: 61  DGVLADGTRNQRSIDERFVISELSDLLLVNPHGSVSNTLNENPSEKQMPIRAVDGFLLPE 120
           D  L    ++Q ++DER V+ ELS+LL V+   S +N   +  +E ++ IRA DGFL PE
Sbjct: 61  DEYLI---KHQSTLDERNVLDELSNLLPVSCSTSATNLYKKEYAENKIDIRAADGFLSPE 120

Query: 121 EKLRGVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMVTFFYWAIKQPSI 180
           +KLRGVFLQKL GKTAIEHAL+N  V L+ +VV +V+N GSLGSE +V F  WAIKQP I
Sbjct: 121 DKLRGVFLQKLKGKTAIEHALSNVGVELNLNVVAEVVNRGSLGSEDIVIFSNWAIKQPLI 180

Query: 181 PKDTSSYNIILKALGRRRFFDSMMHVLHKMTREGVNANMETVSIVVDSLVKAHQVSKALQ 240
            KD   Y+IIL+ALGRR+FF  M+ +L  M  +G+N N+ET+SIV+DS ++A QVSKA+Q
Sbjct: 181 SKDIHFYHIILRALGRRKFFKDMIKILRDMRTKGINPNLETISIVMDSFLRARQVSKAIQ 240

Query: 241 LFRNLKEIGLECDTETLNILLECMCRRSHVGAANSFFNLIKGNVPFNAMSYNIIIGGWSR 300
            FRNL+E+GL C+T+TLN+LL+C+C+RSHVG ANSF N +KG +PFN  +YNI++ GWS+
Sbjct: 241 TFRNLEEVGLNCETKTLNVLLQCLCQRSHVGTANSFLNSMKGKIPFNGTTYNIVVNGWSK 300

Query: 301 YGRHSEVERILKAMEVDGFSPDCLTYTYLIECLGRANRIDDAVKIFDKMYENGCKPDVNA 360
           +GR SE+ER+L  M VDG SPD LT+T+LI+  GRA +ID A++IF+ M + GC  + ++
Sbjct: 301 FGRISEMERLLDEMVVDGISPDSLTFTHLIDGFGRAGQIDKAIEIFENMKQGGCLLNTSS 360

Query: 361 YNAMISNFICIGDFDECLTYYKRMLSNRCEPDINTYSNLIIGFLKAKKVADALEMFDEMV 420
           YNAMISNFI +GDFDE   YY+ MLSN CE DI+TY+++I GFLKA+KVADALEMFDEM+
Sbjct: 361 YNAMISNFIYVGDFDEATKYYRSMLSNNCEADIDTYTSIITGFLKARKVADALEMFDEML 420

Query: 421 PR-IIPTTGAITSFMKLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFG 480
            R + P TG +TSF+K  CSYGPPHAAM++Y+KA+ VGCR S +AYKLLLMRLS FGK G
Sbjct: 421 ARGVFPPTGTLTSFIKTLCSYGPPHAAMIVYRKAKAVGCRFSSSAYKLLLMRLSRFGKCG 480

Query: 481 MLLNIWNEMQESGYDPDVDTYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLN 540
           MLL+IWNEMQE GY  DV+ YE+ I+ LC  GQLENAVLVMEECLR+GF PSR I SK+N
Sbjct: 481 MLLSIWNEMQECGYSSDVEVYEYVINGLCNVGQLENAVLVMEECLRKGFCPSRLICSKVN 540

Query: 541 NKLLDCNRTEMAYKLWLKIKVARHQENLQRCWRAKGWH 574
           +KLLD N+ E AYKL+LKIKVAR  +N +R WR+KGWH
Sbjct: 541 HKLLDSNKVEKAYKLFLKIKVARRNDNARRFWRSKGWH 570

The following BLAST results are available for this feature:
Match Name     E-value     Identity (%)   Description
PP416_ARATH    3.9e-163    54.47          Putative pentatricopeptide repeat-containing protein At5g43820 OS=Arabidopsis th...
PP293_ARATH    1.2e-39     25.49          Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidop...
PP275_ARATH    3.6e-36     24.36          Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana GN...
PP248_ARATH    2.3e-35     25.65          Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop...
PPR54_ARATH    5.2e-35     24.63          Pentatricopeptide repeat-containing protein At1g20300, mitochondrial OS=Arabidop...

Match Name           E-value     Identity (%)   Description
A0A0A0L3E0_CUCSA     2.9e-298    87.98          Uncharacterized protein OS=Cucumis sativus GN=Csa_4G663730 PE=4 SV=1
A5C8V0_VITVI         1.3e-189    59.65          Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_018999 PE=4 SV=1
M5WDR4_PRUPE         2.7e-187    62.23          Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa023340mg PE=4 SV=1
D7UDB7_VITVI         2.5e-185    62.85          Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0122g01120 PE=4 SV=...
A0A061FE27_THECC     7.1e-180    57.67          Pentatricopeptide repeat (PPR) superfamily protein, putative isoform 1 OS=Theobr...

Match Name                           E-value     Identity (%)   Description
gi|659104798|ref|XP_008452985.1|     1.1e-298    88.85          PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucum...
gi|778697198|ref|XP_011654277.1|     4.1e-298    87.98          PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucum...
gi|147865347|emb|CAN84084.1|         1.8e-189    59.65          hypothetical protein VITISV_018999 [Vitis vinifera]
gi|645265054|ref|XP_008237970.1|     2.7e-188    62.18          PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Prunu...
gi|1009155752|ref|XP_015895879.1|    7.7e-188    58.65          PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Zizip...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0090305        nucleic acid phosphodiester bond hydrolysis
biological_process    GO:0009451        RNA modification
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0043231        intracellular membrane-bounded organelle
molecular_function    GO:0005515        protein binding
molecular_function    GO:0004519        endonuclease activity
molecular_function    GO:0003723        RNA binding
This gene is associated with the following unigenes:
Unigene Name    Analysis Name                            Sequence type in Unigene
WMU46387        watermelon EST collection version 2.0    transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cla023458       Cla023458.1    mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name    Unique Name    Type
WMU46387        WMU46387       transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term     IPR Description                          Source      Source Term         Source Description     Alignment
IPR002885    Pentatricopeptide repeat                 PFAM        PF01535             PPR                    coord: 217..246, score: 0.18; coord: 495..524, score: 0.059; coord: 182..211, score: 0.
IPR002885    Pentatricopeptide repeat                 PFAM        PF12854             PPR_1                  coord: 384..416, score: 1.
IPR002885    Pentatricopeptide repeat                 PFAM        PF13812             PPR_3                  coord: 305..363, score: 3.7
IPR002885    Pentatricopeptide repeat                 TIGRFAMs    TIGR00756           TIGR00756              coord: 391..416, score: 1.2E-4; coord: 182..214, score: 3.7E-5; coord: 286..319, score: 4.9E-6; coord: 356..389, score: 4.9E-8; coord: 321..354, score: 6.3
IPR002885    Pentatricopeptide repeat                 PROFILE     PS51375             PPR                    coord: 457..491, score: 7.892; coord: 353..387, score: 10.523; coord: 388..422, score: 9.372; coord: 249..279, score: 6.763; coord: 283..317, score: 10.852; coord: 214..248, score: 8.659; coord: 179..213, score: 10.084; coord: 492..526, score: 10.358; coord: 318..352, score: 12
IPR011990    Tetratricopeptide-like helical domain    GENE3D      G3DSA:1.25.40.10    -                      coord: 490..520, score: 2.7E-9; coord: 230..415, score: 2.7E-9; coord: 447..448, score: 2.
None         No IPR available                         PANTHER     PTHR24015           FAMILY NOT NAMED       coord: 90..568, score: 1.8E
None         No IPR available                         PANTHER     PTHR24015:SF378     SUBFAMILY NOT NAMED    coord: 90..568, score: 1.8E
None         No IPR available                         unknown     SSF81901            HCP-like               coord: 229..453, score: 6.2
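
The `coord:` ranges in the Alignment column are listed in no particular order, so the repeat layout is easier to read once they are sorted by start position. The sketch below does this for the PROSITE profile PS51375 (PPR) hits; it is an illustration only, and the names `ps51375_alignment` and `parse_coords` are hypothetical. Scores are left out because the last score of each group is cut off in the listing.

```python
import re

# 'coord:' fragments of the PS51375 (PPR) row, copied from the table above.
ps51375_alignment = (
    "coord: 457..491 coord: 353..387 coord: 388..422 coord: 249..279 "
    "coord: 283..317 coord: 214..248 coord: 179..213 coord: 492..526 "
    "coord: 318..352"
)


def parse_coords(text):
    """Extract (start, end) pairs from 'coord: a..b' fragments, sorted by start."""
    pairs = [
        (int(start), int(end))
        for start, end in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)
    ]
    return sorted(pairs)


hits = parse_coords(ps51375_alignment)

# Length of each hit in residues (coordinates are inclusive).
lengths = [end - start + 1 for start, end in hits]

# Number of residues between the end of one hit and the start of the next.
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(hits, hits[1:])]
```

Sorted this way, the nine PS51375 hits run from residue 179 to residue 526. Most are 35 residues long and directly adjacent, with a 3-residue gap after 249..279 and a 34-residue gap between 388..422 and 457..491.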