Cla015352.1 (mRNA) Watermelon (97103) v1

NameCla015352
TypemRNA
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionPentatricopeptide repeat-containing protein (AHRD V1 ***- D7KHY5_ARALL); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
LocationChr9 : 2209725 .. 2211959 (+)
Sequence length2235
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCAACCCCGGTGCCTTTCAGGACTCTGTTGCATCATCGTCATGTTACAAATTCCAAGCAAATGGCCACCATAGCCACCATCTCCTCAGCTTCAAGCTCCTTCTCTCCACCGACCCACCCTCTAATCTCTCTTCTCGAGACCTGCAAATCCATGGACCAGCTTCAGCAGATCCACTGTCAAGCAATCAAAACAGCTTTCAATGCTAACCCAGTTCTGCAAAACAAAGTCATGTCCTTTTGTTGTACTCATGAATGCGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAACCGAATTTGTTCATCTGGAACACCATGATCAGAGGCTACTCCCGTTTGGATTCTCCTGAGCTCGGAGTTTCTTTGTATCTGGAAATGTTGAGGAGAGGTTTCAAGCCTGATCGTTACACCTTCCCTTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTGGAATATGGAAGAGAGCTTCATGGCCATGTTCTGAAGCTTGGACTTCAGTCTAATGTCTTTGTTCACACTGCTTTGGTGCAAATGTATCTTTTGTGTGGCCAACTTGATACGGCTCGTAGGGTTTTGGATGTTTGTTCTAAAGCTGATGTGATTGCTTGGAATATGATGATTTCTGCTTACAATAAAGTTGGTGAGTTTGAGGAATCAAGAAGAATTTTTCTTGGTATGGAGGAAAAACAAGTGCTGCCCACCACAGTGACCCTTGTTTTAATCCTGTCAGCTTGCTCCAAGTTGAAGGATTTAAAAACTGGGAAGCAGGTTCATAGTTATGTGAACAACTGCAAGGTTGAGAGCAATTTGGTTCTTGAAAATGCTCTGATTGATATGTATGCTACTTGTGGGGAAATGGATTCTGCCCTTGGGATATTCAGGAGTATGAATAACAAAGATATCATTTCTTGGACGACCATTGTATCTGGGTTTACCAACTTGGGAGAAATTGATGTTGCTCGGAACTACTTCGACAAGATGCCAGAGAAAGATTATGTTTCATGGACTGCCATGATTGATGGATACATCCGCTCAAATCGATTCAAAGAAGCATTGGAGTTATTCCGCAATATGCAAGCAACCAATGTAAAGCCTGATGAGTTCACTATGGTTAGTATTCTGACTGCTTGTGCACATCTAGGGGCCCTTGAGTTAGGAGAATGGATAAGAACTTACATTGATCGGAACAAGATCAACAATGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCAGAAAGAATATTCAGAGAGATGTGTCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTAAATGGCCATGGTGAGAAAGCTCTTGATATGTTTTCTCAAATGCTAAAAGCTTCAATTTTGCCAGATGAGATTACTTACATTGGTGTTCTTTCTGCTTGTACACACACTGGCATGGTAGAGAAAGGACGAGAGTATTTTCTTAGCATGACAACCCAACATGGTATTGAACCCAATATAGCACACTACGGTTGTCTGGTTGATCTTCTTGCTCGAGCTGGTTGTCTAAAAGAAGCCCATGAAGTCATCGAGAACATGCCAATGAAACCCAATTCCATTGTCTGGGGGGCTCTTCTAGCTGGTTGTAGAGTTTATAGAGAAGCCGATATGGCTGAAATGGTTGTTAAGCAGATTCTTGATTTGGAGCCTGAGAATGGTGCTGTCTATGTTCTCCTGTGTAATATTTATGCAGCTTGCAAGAGATGGAATGACCTGCGAGAGTTGAGGCAGATGATGATGGACAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGATAGAGATGAATGGCACAGTTCATGAATTTGTAGCTGGGGACCGATCACATCCTCAAACTGAAAAAATTGATGTTAAGCTAAACAAAATGACCCAAGACCTGAAATTTGCAGGGTATTCACCTGATGTCTCAGAAGTGTTCCTTGACATAGCAGAAGAGGATAAAGAGAACTCAGTCTTTCGTCACAGTGAGAAGTTGGCCATTGCTTTTGGACTCATTAATTCCCCACCTGGGGTCACGATTAGAATCGTGAAGAACCTTCGAATGTGCATGGATTGTCACAATATGGCGAAGTTAGTCTCAAAGGTGTATAATAGAGAAGTAATTGTTAGGGACAGAACCAGATTCCACCATTTCAAACATGGTTTATGTTCGTGTAAAGACTACTGGTGA

mRNA sequence

ATGCCAACCCCGGTGCCTTTCAGGACTCTGTTGCATCATCGTCATGTTACAAATTCCAAGCAAATGGCCACCATAGCCACCATCTCCTCAGCTTCAAGCTCCTTCTCTCCACCGACCCACCCTCTAATCTCTCTTCTCGAGACCTGCAAATCCATGGACCAGCTTCAGCAGATCCACTGTCAAGCAATCAAAACAGCTTTCAATGCTAACCCAGTTCTGCAAAACAAAGTCATGTCCTTTTGTTGTACTCATGAATGCGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAACCGAATTTGTTCATCTGGAACACCATGATCAGAGGCTACTCCCGTTTGGATTCTCCTGAGCTCGGAGTTTCTTTGTATCTGGAAATGTTGAGGAGAGGTTTCAAGCCTGATCGTTACACCTTCCCTTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTGGAATATGGAAGAGAGCTTCATGGCCATGTTCTGAAGCTTGGACTTCAGTCTAATGTCTTTGTTCACACTGCTTTGGTGCAAATGTATCTTTTGTGTGGCCAACTTGATACGGCTCGTAGGGTTTTGGATGTTTGTTCTAAAGCTGATGTGATTGCTTGGAATATGATGATTTCTGCTTACAATAAAGTTGGTGAGTTTGAGGAATCAAGAAGAATTTTTCTTGGTATGGAGGAAAAACAAGTGCTGCCCACCACAGTGACCCTTGTTTTAATCCTGTCAGCTTGCTCCAAGTTGAAGGATTTAAAAACTGGGAAGCAGGTTCATAGTTATGTGAACAACTGCAAGGTTGAGAGCAATTTGGTTCTTGAAAATGCTCTGATTGATATGTATGCTACTTGTGGGGAAATGGATTCTGCCCTTGGGATATTCAGGAGTATGAATAACAAAGATATCATTTCTTGGACGACCATTGTATCTGGGTTTACCAACTTGGGAGAAATTGATGTTGCTCGGAACTACTTCGACAAGATGCCAGAGAAAGATTATGTTTCATGGACTGCCATGATTGATGGATACATCCGCTCAAATCGATTCAAAGAAGCATTGGAGTTATTCCGCAATATGCAAGCAACCAATGTAAAGCCTGATGAGTTCACTATGGTTAGTATTCTGACTGCTTGTGCACATCTAGGGGCCCTTGAGTTAGGAGAATGGATAAGAACTTACATTGATCGGAACAAGATCAACAATGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCAGAAAGAATATTCAGAGAGATGTGTCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTAAATGGCCATGGTGAGAAAGCTCTTGATATGTTTTCTCAAATGCTAAAAGCTTCAATTTTGCCAGATGAGATTACTTACATTGGTGTTCTTTCTGCTTGTACACACACTGGCATGGTAGAGAAAGGACGAGAGTATTTTCTTAGCATGACAACCCAACATGGTATTGAACCCAATATAGCACACTACGGTTGTCTGGTTGATCTTCTTGCTCGAGCTGGTTGTCTAAAAGAAGCCCATGAAGTCATCGAGAACATGCCAATGAAACCCAATTCCATTGTCTGGGGGGCTCTTCTAGCTGGTTGTAGAGTTTATAGAGAAGCCGATATGGCTGAAATGGTTGTTAAGCAGATTCTTGATTTGGAGCCTGAGAATGGTGCTGTCTATGTTCTCCTGTGTAATATTTATGCAGCTTGCAAGAGATGGAATGACCTGCGAGAGTTGAGGCAGATGATGATGGACAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGATAGAGATGAATGGCACAGTTCATGAATTTGTAGCTGGGGACCGATCACATCCTCAAACTGAAAAAATTGATGTTAAGCTAAACAAAATGACCCAAGACCTGAAATTTGCAGGGTATTCACCTGATGTCTCAGAAGTGTTCCTTGACATAGCAGAAGAGGATAAAGAGAACTCAGTCTTTCGTCACAGTGAGAAGTTGGCCATTGCTTTTGGACTCATTAATTCCCCACCTGGGGTCACGATTAGAATCGTGAAGAACCTTCGAATGTGCATGGATTGTCACAATATGGCGAAGTTAGTCTCAAAGGTGTATAATAGAGAAGTAATTGTTAGGGACAGAACCAGATTCCACCATTTCAAACATGGTTTATGTTCGTGTAAAGACTACTGGTGA

Coding sequence (CDS)

ATGCCAACCCCGGTGCCTTTCAGGACTCTGTTGCATCATCGTCATGTTACAAATTCCAAGCAAATGGCCACCATAGCCACCATCTCCTCAGCTTCAAGCTCCTTCTCTCCACCGACCCACCCTCTAATCTCTCTTCTCGAGACCTGCAAATCCATGGACCAGCTTCAGCAGATCCACTGTCAAGCAATCAAAACAGCTTTCAATGCTAACCCAGTTCTGCAAAACAAAGTCATGTCCTTTTGTTGTACTCATGAATGCGGTGACTTGAAATATGCACATCACCTGTTTGATGAAATTCCTGAACCGAATTTGTTCATCTGGAACACCATGATCAGAGGCTACTCCCGTTTGGATTCTCCTGAGCTCGGAGTTTCTTTGTATCTGGAAATGTTGAGGAGAGGTTTCAAGCCTGATCGTTACACCTTCCCTTTCCTGTTCAAGGGATTTACAAGAGACATTGCATTGGAATATGGAAGAGAGCTTCATGGCCATGTTCTGAAGCTTGGACTTCAGTCTAATGTCTTTGTTCACACTGCTTTGGTGCAAATGTATCTTTTGTGTGGCCAACTTGATACGGCTCGTAGGGTTTTGGATGTTTGTTCTAAAGCTGATGTGATTGCTTGGAATATGATGATTTCTGCTTACAATAAAGTTGGTGAGTTTGAGGAATCAAGAAGAATTTTTCTTGGTATGGAGGAAAAACAAGTGCTGCCCACCACAGTGACCCTTGTTTTAATCCTGTCAGCTTGCTCCAAGTTGAAGGATTTAAAAACTGGGAAGCAGGTTCATAGTTATGTGAACAACTGCAAGGTTGAGAGCAATTTGGTTCTTGAAAATGCTCTGATTGATATGTATGCTACTTGTGGGGAAATGGATTCTGCCCTTGGGATATTCAGGAGTATGAATAACAAAGATATCATTTCTTGGACGACCATTGTATCTGGGTTTACCAACTTGGGAGAAATTGATGTTGCTCGGAACTACTTCGACAAGATGCCAGAGAAAGATTATGTTTCATGGACTGCCATGATTGATGGATACATCCGCTCAAATCGATTCAAAGAAGCATTGGAGTTATTCCGCAATATGCAAGCAACCAATGTAAAGCCTGATGAGTTCACTATGGTTAGTATTCTGACTGCTTGTGCACATCTAGGGGCCCTTGAGTTAGGAGAATGGATAAGAACTTACATTGATCGGAACAAGATCAACAATGATGCATTTGTTAGAAATGCTTTAATAGACATGTACTTCAAGTGTGGAAATGTTGACAAAGCAGAAAGAATATTCAGAGAGATGTGTCAGAGAGACAAGTTTACATGGACAGCCATGATAGTTGGCCTTGCAGTAAATGGCCATGGTGAGAAAGCTCTTGATATGTTTTCTCAAATGCTAAAAGCTTCAATTTTGCCAGATGAGATTACTTACATTGGTGTTCTTTCTGCTTGTACACACACTGGCATGGTAGAGAAAGGACGAGAGTATTTTCTTAGCATGACAACCCAACATGGTATTGAACCCAATATAGCACACTACGGTTGTCTGGTTGATCTTCTTGCTCGAGCTGGTTGTCTAAAAGAAGCCCATGAAGTCATCGAGAACATGCCAATGAAACCCAATTCCATTGTCTGGGGGGCTCTTCTAGCTGGTTGTAGAGTTTATAGAGAAGCCGATATGGCTGAAATGGTTGTTAAGCAGATTCTTGATTTGGAGCCTGAGAATGGTGCTGTCTATGTTCTCCTGTGTAATATTTATGCAGCTTGCAAGAGATGGAATGACCTGCGAGAGTTGAGGCAGATGATGATGGACAAAGGAATCAAGAAAACACCTGGTTGCAGTTTGATAGAGATGAATGGCACAGTTCATGAATTTGTAGCTGGGGACCGATCACATCCTCAAACTGAAAAAATTGATGTTAAGCTAAACAAAATGACCCAAGACCTGAAATTTGCAGGGTATTCACCTGATGTCTCAGAAGTGTTCCTTGACATAGCAGAAGAGGATAAAGAGAACTCAGTCTTTCGTCACAGTGAGAAGTTGGCCATTGCTTTTGGACTCATTAATTCCCCACCTGGGGTCACGATTAGAATCGTGAAGAACCTTCGAATGTGCATGGATTGTCACAATATGGCGAAGTTAGTCTCAAAGGTGTATAATAGAGAAGTAATTGTTAGGGACAGAACCAGATTCCACCATTTCAAACATGGTTTATGTTCGTGTAAAGACTACTGGTGA

Protein sequence

MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW
BLAST of Cla015352 vs. Swiss-Prot
Match: PP235_ARATH (Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis thaliana GN=PCMP-E51 PE=3 SV=2)

HSP 1 Score: 750.7 bits (1937), Expect = 1.5e-215
Identity = 357/640 (55.78%), Postives = 469/640 (73.28%), Query Frame = 1

Query: 28  ISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHECG 87
           +S+ + S S      IS+L  CK+ DQ +Q+H Q+I      NP  Q K+  F C+   G
Sbjct: 23  MSTITESISNDYSRFISILGVCKTTDQFKQLHSQSITRGVAPNPTFQKKLFVFWCSRLGG 82

Query: 88  DLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLFK 147
            + YA+ LF +IPEP++ +WN MI+G+S++D    GV LYL ML+ G  PD +TFPFL  
Sbjct: 83  HVSYAYKLFVKIPEPDVVVWNNMIKGWSKVDCDGEGVRLYLNMLKEGVTPDSHTFPFLLN 142

Query: 148 GFTRDI-ALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVI 207
           G  RD  AL  G++LH HV+K GL SN++V  ALV+MY LCG +D AR V D   K DV 
Sbjct: 143 GLKRDGGALACGKKLHCHVVKFGLGSNLYVQNALVKMYSLCGLMDMARGVFDRRCKEDVF 202

Query: 208 AWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYV 267
           +WN+MIS YN++ E+EES  + + ME   V PT+VTL+L+LSACSK+KD    K+VH YV
Sbjct: 203 SWNLMISGYNRMKEYEESIELLVEMERNLVSPTSVTLLLVLSACSKVKDKDLCKRVHEYV 262

Query: 268 NNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVAR 327
           + CK E +L LENAL++ YA CGEMD A+ IFRSM  +D+ISWT+IV G+   G + +AR
Sbjct: 263 SECKTEPSLRLENALVNAYAACGEMDIAVRIFRSMKARDVISWTSIVKGYVERGNLKLAR 322

Query: 328 NYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHLG 387
            YFD+MP +D +SWT MIDGY+R+  F E+LE+FR MQ+  + PDEFTMVS+LTACAHLG
Sbjct: 323 TYFDQMPVRDRISWTIMIDGYLRAGCFNESLEIFREMQSAGMIPDEFTMVSVLTACAHLG 382

Query: 388 ALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIV 447
           +LE+GEWI+TYID+NKI ND  V NALIDMYFKCG  +KA+++F +M QRDKFTWTAM+V
Sbjct: 383 SLEIGEWIKTYIDKNKIKNDVVVGNALIDMYFKCGCSEKAQKVFHDMDQRDKFTWTAMVV 442

Query: 448 GLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIE 507
           GLA NG G++A+ +F QM   SI PD+ITY+GVLSAC H+GMV++ R++F  M + H IE
Sbjct: 443 GLANNGQGQEAIKVFFQMQDMSIQPDDITYLGVLSACNHSGMVDQARKFFAKMRSDHRIE 502

Query: 508 PNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVKQ 567
           P++ HYGC+VD+L RAG +KEA+E++  MPM PNSIVWGALL   R++ +  MAE+  K+
Sbjct: 503 PSLVHYGCMVDMLGRAGLVKEAYEILRKMPMNPNSIVWGALLGASRLHNDEPMAELAAKK 562

Query: 568 ILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVA 627
           IL+LEP+NGAVY LLCNIYA CKRW DLRE+R+ ++D  IKKTPG SLIE+NG  HEFVA
Sbjct: 563 ILELEPDNGAVYALLCNIYAGCKRWKDLREVRRKIVDVAIKKTPGFSLIEVNGFAHEFVA 622

Query: 628 GDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAE 667
           GD+SH Q+E+I +KL ++ Q+  FA Y PD SE+  +  +
Sbjct: 623 GDKSHLQSEEIYMKLEELAQESTFAAYLPDTSELLFEAGD 662

BLAST of Cla015352 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 634.8 bits (1636), Expect = 1.2e-180
Identity = 308/722 (42.66%), Postives = 472/722 (65.37%), Query Frame = 1

Query: 34  SFSPPTHPL--------ISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHE 93
           +FS P  P         ISL+E C S+ QL+Q H   I+T   ++P   +K+ +      
Sbjct: 17  NFSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSS 76

Query: 94  CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRG-FKPDRYTFPF 153
              L+YA  +FDEIP+PN F WNT+IR Y+    P L +  +L+M+      P++YTFPF
Sbjct: 77  FASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPF 136

Query: 154 LFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKAD 213
           L K      +L  G+ LHG  +K  + S+VFV  +L+  Y  CG LD+A +V     + D
Sbjct: 137 LIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKD 196

Query: 214 VIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHS 273
           V++WN MI+ + + G  +++  +F  ME + V  + VT+V +LSAC+K+++L+ G+QV S
Sbjct: 197 VVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCS 256

Query: 274 YVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDV 333
           Y+   +V  NL L NA++DMY  CG ++ A  +F +M  KD ++WTT++ G+    + + 
Sbjct: 257 YIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEA 316

Query: 334 ARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQAT-NVKPDEFTMVSILTACA 393
           AR   + MP+KD V+W A+I  Y ++ +  EAL +F  +Q   N+K ++ T+VS L+ACA
Sbjct: 317 AREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACA 376

Query: 394 HLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTA 453
            +GALELG WI +YI ++ I  +  V +ALI MY KCG+++K+  +F  + +RD F W+A
Sbjct: 377 QVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSA 436

Query: 454 MIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQH 513
           MI GLA++G G +A+DMF +M +A++ P+ +T+  V  AC+HTG+V++    F  M + +
Sbjct: 437 MIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNY 496

Query: 514 GIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMV 573
           GI P   HY C+VD+L R+G L++A + IE MP+ P++ VWGALL  C+++   ++AEM 
Sbjct: 497 GIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMA 556

Query: 574 VKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHE 633
             ++L+LEP N   +VLL NIYA   +W ++ ELR+ M   G+KK PGCS IE++G +HE
Sbjct: 557 CTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHE 616

Query: 634 FVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEED-KENSVFRHSEKLAI 693
           F++GD +HP +EK+  KL+++ + LK  GY P++S+V   I EE+ KE S+  HSEKLAI
Sbjct: 617 FLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAI 676

Query: 694 AFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKD 745
            +GLI++     IR++KNLR+C DCH++AKL+S++Y+RE+IVRDR RFHHF++G CSC D
Sbjct: 677 CYGLISTEAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCND 736

BLAST of Cla015352 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 7.0e-165
Identity = 288/730 (39.45%), Postives = 447/730 (61.23%), Query Frame = 1

Query: 33  SSFSPP-----THPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCC-THEC 92
           SS  PP      HP +SLL  CK++  L+ IH Q IK   +      +K++ FC  +   
Sbjct: 22  SSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHF 81

Query: 93  GDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLF 152
             L YA  +F  I EPNL IWNTM RG++    P   + LY+ M+  G  P+ YTFPF+ 
Sbjct: 82  EGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVL 141

Query: 153 KGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVI 212
           K   +  A + G+++HGHVLKLG   +++VHT+L+ MY+  G+L+ A +V D     DV+
Sbjct: 142 KSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVV 201

Query: 213 AWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYV 272
           ++  +I  Y   G  E ++++F  +  K V    V+   ++S  ++  + K   ++   +
Sbjct: 202 SYTALIKGYASRGYIENAQKLFDEIPVKDV----VSWNAMISGYAETGNYKEALELFKDM 261

Query: 273 NNCKVESNLVLENALIDMYATCGEMDSALGIFRSM----------NNKDIISWTTIVSGF 332
               V  +   E+ ++ + + C +  S + + R +          +N  I++   ++  +
Sbjct: 262 MKTNVRPD---ESTMVTVVSACAQSGS-IELGRQVHLWIDDHGFGSNLKIVN--ALIDLY 321

Query: 333 TNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMV 392
           +  GE++ A   F+++P KD +SW  +I GY   N +KEAL LF+ M  +   P++ TM+
Sbjct: 322 SKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTML 381

Query: 393 SILTACAHLGALELGEWIRTYIDR--NKINNDAFVRNALIDMYFKCGNVDKAERIFREMC 452
           SIL ACAHLGA+++G WI  YID+    + N + +R +LIDMY KCG+++ A ++F  + 
Sbjct: 382 SILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL 441

Query: 453 QRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGRE 512
            +   +W AMI G A++G  + + D+FS+M K  I PD+IT++G+LSAC+H+GM++ GR 
Sbjct: 442 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRH 501

Query: 513 YFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVY 572
            F +MT  + + P + HYGC++DLL  +G  KEA E+I  M M+P+ ++W +LL  C+++
Sbjct: 502 IFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMH 561

Query: 573 READMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSL 632
              ++ E   + ++ +EPEN   YVLL NIYA+  RWN++ + R ++ DKG+KK PGCS 
Sbjct: 562 GNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSS 621

Query: 633 IEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVF 692
           IE++  VHEF+ GD+ HP+  +I   L +M   L+ AG+ PD SEV  ++ EE KE ++ 
Sbjct: 622 IEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALR 681

Query: 693 RHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFK 745
            HSEKLAIAFGLI++ PG  + IVKNLR+C +CH   KL+SK+Y RE+I RDRTRFHHF+
Sbjct: 682 HHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFR 741

BLAST of Cla015352 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 577.8 bits (1488), Expect = 1.7e-163
Identity = 285/690 (41.30%), Postives = 430/690 (62.32%), Query Frame = 1

Query: 57  QIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSR 116
           QIH   +K  +  +  +QN ++ F    ECG+L  A  +FDE+ E N+  W +MI GY+R
Sbjct: 155 QIHGLIVKMGYAKDLFVQNSLVHFYA--ECGELDSARKVFDEMSERNVVSWTSMICGYAR 214

Query: 117 LDSPELGVSLYLEMLR-RGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVF 176
            D  +  V L+  M+R     P+  T   +     +   LE G +++  +   G++ N  
Sbjct: 215 RDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDL 274

Query: 177 VHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQ 236
           + +ALV MY+ C  +D A+R+ D    +++   N M S Y + G   E+  +F  M +  
Sbjct: 275 MVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSG 334

Query: 237 VLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSAL 296
           V P  ++++  +S+CS+L+++  GK  H YV     ES   + NALIDMY  C   D+A 
Sbjct: 335 VRPDRISMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAF 394

Query: 297 GIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKE 356
            IF  M+NK +++W +IV+G+   GE+D A   F+ MPEK+ VSW  +I G ++ + F+E
Sbjct: 395 RIFDRMSNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEE 454

Query: 357 ALELFRNMQATN-VKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALI 416
           A+E+F +MQ+   V  D  TM+SI +AC HLGAL+L +WI  YI++N I  D  +   L+
Sbjct: 455 AIEVFCSMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLV 514

Query: 417 DMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEI 476
           DM+ +CG+ + A  IF  +  RD   WTA I  +A+ G+ E+A+++F  M++  + PD +
Sbjct: 515 DMFSRCGDPESAMSIFNSLTNRDVSAWTAAIGAMAMAGNAERAIELFDDMIEQGLKPDGV 574

Query: 477 TYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIEN 536
            ++G L+AC+H G+V++G+E F SM   HG+ P   HYGC+VDLL RAG L+EA ++IE+
Sbjct: 575 AFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLGRAGLLEEAVQLIED 634

Query: 537 MPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDL 596
           MPM+PN ++W +LLA CRV    +MA    ++I  L PE    YVLL N+YA+  RWND+
Sbjct: 635 MPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVLLSNVYASAGRWNDM 694

Query: 597 RELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYS 656
            ++R  M +KG++K PG S I++ G  HEF +GD SHP+   I+  L++++Q     G+ 
Sbjct: 695 AKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDESHPEMPNIEAMLDEVSQRASHLGHV 754

Query: 657 PDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLV 716
           PD+S V +D+ E++K   + RHSEKLA+A+GLI+S  G TIRIVKNLR+C DCH+ AK  
Sbjct: 755 PDLSNVLMDVDEKEKIFMLSRHSEKLAMAYGLISSNKGTTIRIVKNLRVCSDCHSFAKFA 814

Query: 717 SKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           SKVYNRE+I+RD  RFH+ + G CSC D+W
Sbjct: 815 SKVYNREIILRDNNRFHYIRQGKCSCGDFW 842


HSP 2 Score: 242.7 bits (618), Expect = 1.3e-62
Identity = 146/489 (29.86%), Postives = 247/489 (50.51%), Query Frame = 1

Query: 22  MATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFC 81
           +AT  T   +  + S  T    S L+ CK++D+L+  H    K   + +     K+++  
Sbjct: 15  LATTTTTKPSLLNQSKCTKATPSSLKNCKTIDELKMFHRSLTKQGLDNDVSTITKLVARS 74

Query: 82  C---THECGDLKYAHHLFDEIPE-PNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKP 141
           C   T E   L +A  +F+        F++N++IRGY+        + L+L M+  G  P
Sbjct: 75  CELGTRE--SLSFAKEVFENSESYGTCFMYNSLIRGYASSGLCNEAILLFLRMMNSGISP 134

Query: 142 DRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVL 201
           D+YTFPF      +  A   G ++HG ++K+G   ++FV  +LV  Y  CG+LD+AR+V 
Sbjct: 135 DKYTFPFGLSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAECGELDSARKVF 194

Query: 202 DVCSKADVIAWNMMISAYNKVGEFEESRRIFLGM-EEKQVLPTTVTLVLILSACSKLKDL 261
           D  S+ +V++W  MI  Y +    +++  +F  M  +++V P +VT+V ++SAC+KL+DL
Sbjct: 195 DEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDL 254

Query: 262 KTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGF 321
           +TG++V++++ N  +E N ++ +AL+DMY  C                            
Sbjct: 255 ETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNA-------------------------- 314

Query: 322 TNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMV 381
                IDVA+  FD+    +     AM   Y+R    +EAL +F  M  + V+PD  +M+
Sbjct: 315 -----IDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISML 374

Query: 382 SILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQR 441
           S +++C+ L  +  G+    Y+ RN   +   + NALIDMY KC   D A RIF  M  +
Sbjct: 375 SAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNK 434

Query: 442 DKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYF 501
              TW +++ G   NG  + A + F  M + +I    +++  ++S      + E+  E F
Sbjct: 435 TVVTWNSIVAGYVENGEVDAAWETFETMPEKNI----VSWNTIISGLVQGSLFEEAIEVF 466

Query: 502 LSMTTQHGI 506
            SM +Q G+
Sbjct: 495 CSMQSQEGV 466


HSP 3 Score: 153.7 bits (387), Expect = 8.0e-36
Identity = 111/404 (27.48%), Postives = 181/404 (44.80%), Query Frame = 1

Query: 162 HGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKAD----VIAWNMMISAYNK 221
           H  + K GL ++V   T LV      G  ++     +V   ++       +N +I  Y  
Sbjct: 52  HRSLTKQGLDNDVSTITKLVARSCELGTRESLSFAKEVFENSESYGTCFMYNSLIRGYAS 111

Query: 222 VGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVL 281
            G   E+  +FL M    + P   T    LSAC+K +    G Q+H  +       +L +
Sbjct: 112 SGLCNEAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGLIVKMGYAKDLFV 171

Query: 282 ENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDY 341
           +N+L+  YA CGE+DSA                               R  FD+M E++ 
Sbjct: 172 QNSLVHFYAECGELDSA-------------------------------RKVFDEMSERNV 231

Query: 342 VSWTAMIDGYIRSNRFKEALELF-RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRT 401
           VSWT+MI GY R +  K+A++LF R ++   V P+  TMV +++ACA L  LE GE +  
Sbjct: 232 VSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYA 291

Query: 402 YIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEK 461
           +I  + I  +  + +AL+DMY KC  +D A+R+F E    +     AM       G   +
Sbjct: 292 FIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTRE 351

Query: 462 ALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLV 521
           AL +F+ M+ + + PD I+ +  +S+C+    +  G+        ++G E        L+
Sbjct: 352 ALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCH-GYVLRNGFESWDNICNALI 411

Query: 522 DLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMA 561
           D+  +      A  + + M  K   + W +++AG     E D A
Sbjct: 412 DMYMKCHRQDTAFRIFDRMSNK-TVVTWNSIVAGYVENGEVDAA 422

BLAST of Cla015352 vs. Swiss-Prot
Match: PP311_ARATH (Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN=PCMP-H3 PE=2 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 1.8e-160
Identity = 280/717 (39.05%), Postives = 442/717 (61.65%), Query Frame = 1

Query: 36  SPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHL 95
           S   + ++  L  CKS++ ++Q+H   ++T  N    L + + +   +    +L YA ++
Sbjct: 9   STAANTILEKLSFCKSLNHIKQLHAHILRTVINHK--LNSFLFNLSVSSSSINLSYALNV 68

Query: 96  FDEIPEP-NLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIA 155
           F  IP P    ++N  +R  SR   P   +  Y  +   G + D+++F  + K  ++  A
Sbjct: 69  FSSIPSPPESIVFNPFLRDLSRSSEPRATILFYQRIRHVGGRLDQFSFLPILKAVSKVSA 128

Query: 156 LEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADVIAWNMMISA 215
           L  G ELHG   K+    + FV T  + MY  CG+++ AR V D  S  DV+ WN MI  
Sbjct: 129 LFEGMELHGVAFKIATLCDPFVETGFMDMYASCGRINYARNVFDEMSHRDVVTWNTMIER 188

Query: 216 YNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSYVNNCKVESN 275
           Y + G  +E+ ++F  M++  V+P  + L  I+SAC +  +++  + ++ ++    V  +
Sbjct: 189 YCRFGLVDEAFKLFEEMKDSNVMPDEMILCNIVSACGRTGNMRYNRAIYEFLIENDVRMD 248

Query: 276 LVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPE 335
             L  AL+ MYA  G MD A   FR M+ +++   T +VSG++  G +D A+  FD+  +
Sbjct: 249 THLLTALVTMYAGAGCMDMAREFFRKMSVRNLFVSTAMVSGYSKCGRLDDAQVIFDQTEK 308

Query: 336 KDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHLGALELGEWI 395
           KD V WT MI  Y+ S+  +EAL +F  M  + +KPD  +M S+++ACA+LG L+  +W+
Sbjct: 309 KDLVCWTTMISAYVESDYPQEALRVFEEMCCSGIKPDVVSMFSVISACANLGILDKAKWV 368

Query: 396 RTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHG 455
            + I  N + ++  + NALI+MY KCG +D    +F +M +R+  +W++MI  L+++G  
Sbjct: 369 HSCIHVNGLESELSINNALINMYAKCGGLDATRDVFEKMPRRNVVSWSSMINALSMHGEA 428

Query: 456 EKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGC 515
             AL +F++M + ++ P+E+T++GVL  C+H+G+VE+G++ F SMT ++ I P + HYGC
Sbjct: 429 SDALSLFARMKQENVEPNEVTFVGVLYGCSHSGLVEEGKKIFASMTDEYNITPKLEHYGC 488

Query: 516 LVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVKQILDLEPEN 575
           +VDL  RA  L+EA EVIE+MP+  N ++WG+L++ CR++ E ++ +   K+IL+LEP++
Sbjct: 489 MVDLFGRANLLREALEVIESMPVASNVVIWGSLMSACRIHGELELGKFAAKRILELEPDH 548

Query: 576 GAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQT 635
               VL+ NIYA  +RW D+R +R++M +K + K  G S I+ NG  HEF+ GD+ H Q+
Sbjct: 549 DGALVLMSNIYAREQRWEDVRNIRRVMEEKNVFKEKGLSRIDQNGKSHEFLIGDKRHKQS 608

Query: 636 EKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAIAFGLINSPP--- 695
            +I  KL+++   LK AGY PD   V +D+ EE+K++ V  HSEKLA+ FGL+N      
Sbjct: 609 NEIYAKLDEVVSKLKLAGYVPDCGSVLVDVEEEEKKDLVLWHSEKLALCFGLMNEEKEEE 668

Query: 696 ----GVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
               GV IRIVKNLR+C DCH   KLVSKVY RE+IVRDRTRFH +K+GLCSC+DYW
Sbjct: 669 KDSCGV-IRIVKNLRVCEDCHLFFKLVSKVYEREIIVRDRTRFHCYKNGLCSCRDYW 722

BLAST of Cla015352 vs. TrEMBL
Match: A0A0A0K6A7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G432370 PE=4 SV=1)

HSP 1 Score: 1320.8 bits (3417), Expect = 0.0e+00
Identity = 645/716 (90.08%), Postives = 675/716 (94.27%), Query Frame = 1

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLHHRHV   KQM TIA  SSA  SFSPPTHPLISLLETC+SMDQLQQ+HC
Sbjct: 1   MPVPVRFRTLLHHRHVKKPKQMTTIAATSSALKSFSPPTHPLISLLETCESMDQLQQVHC 60

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIK   NANPVLQN+VM+FCCTHE GD +YA  LFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 61  QAIKKGLNANPVLQNRVMTFCCTHEYGDFQYARRLFDEIPEPNLFIWNTMIRGYSRLDFP 120

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ NVFVHTAL
Sbjct: 121 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQYNVFVHTAL 180

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR V DVC KADVI WNM+ISAYNKVG+FEESRR+FL ME+KQVLPTT
Sbjct: 181 VQMYLLCGQLDTARGVFDVCPKADVITWNMIISAYNKVGKFEESRRLFLVMEDKQVLPTT 240

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENA+IDMYA CGEMDSALGIFRS
Sbjct: 241 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENAMIDMYADCGEMDSALGIFRS 300

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 301 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYIDRNKI ND FVRNALIDMYFKC
Sbjct: 361 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYIDRNKIKNDLFVRNALIDMYFKC 420

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAE IFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 421 GDVDKAESIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 480

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA+EVIENMP+K N
Sbjct: 481 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYEVIENMPIKAN 540

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYRE+DMAEMVVKQIL+LEP+NGAVYVLLCNIYAACKRWNDLRELRQM
Sbjct: 541 SIVWGALLAGCRVYRESDMAEMVVKQILELEPDNGAVYVLLCNIYAACKRWNDLRELRQM 600

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKKTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 601 MMDKGIKKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 660

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSK 717
           FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSK
Sbjct: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSK 716

BLAST of Cla015352 vs. TrEMBL
Match: U5GP98_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s02590g PE=4 SV=1)

HSP 1 Score: 1072.8 bits (2773), Expect = 1.6e-310
Identity = 514/754 (68.17%), Postives = 615/754 (81.56%), Query Frame = 1

Query: 12  HHRHVTNSKQMATIATISSASSSFSPPT-HPLISLLETCKSMDQLQQIHCQAIKTAFNAN 71
           HH H +  K+M     IS A S  SP T +P +SL ETCKSM  L+QIH + IKT    N
Sbjct: 16  HHLHSSFLKKM-----ISMACSQSSPVTENPPLSLFETCKSMYHLKQIHSRTIKTGIICN 75

Query: 72  PVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEM 131
           P++QNK++SFCC+ E GD+ YA  LFD IPEP++F WN M +GYSR+  P+LGVSLYLEM
Sbjct: 76  PIIQNKILSFCCSREFGDMCYARQLFDTIPEPSVFSWNIMFKGYSRIACPKLGVSLYLEM 135

Query: 132 LRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQL 191
           L R  KPD YT+PFLFKGFTR +AL+ GRELH HV+K GL SNVF H AL+ MY LCG +
Sbjct: 136 LERNVKPDCYTYPFLFKGFTRSVALQLGRELHCHVVKYGLDSNVFAHNALINMYSLCGLI 195

Query: 192 DTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSAC 251
           D AR + D+  K+DV+ WN MIS YN++ +++E+R++F  MEEK +LPT+VT V +LSAC
Sbjct: 196 DMARGIFDMSCKSDVVTWNAMISGYNRIKKYDEARKLFDMMEEKGILPTSVTCVSVLSAC 255

Query: 252 SKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWT 311
           SKLKDL+ GK+V  Y+ N  VE NL +ENALIDMYA+CGEM+ ALGIF +M N+D+ISWT
Sbjct: 256 SKLKDLECGKRVQKYIRNGVVEVNLKVENALIDMYASCGEMNVALGIFENMKNRDVISWT 315

Query: 312 TIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKP 371
            IV+GF N G++D AR YF KMPE+D+VSWTAMIDGY+R N +KEAL LFR MQ + +KP
Sbjct: 316 AIVTGFVNTGQVDAARKYFHKMPERDHVSWTAMIDGYLRLNCYKEALMLFREMQTSKIKP 375

Query: 372 DEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIF 431
           DEFTMVS+LTACA LGALELGEWIRTYID+NK+ ND FV NALIDMYFKCGNV+ A  IF
Sbjct: 376 DEFTMVSVLTACAQLGALELGEWIRTYIDKNKVKNDTFVGNALIDMYFKCGNVEMALSIF 435

Query: 432 REMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVE 491
             + QRDKFTWTAM+VGLA+NG GE+AL+MFSQMLKAS+ PDE+TY+GVLSACTHTGMV+
Sbjct: 436 NTLPQRDKFTWTAMVVGLAINGCGEEALNMFSQMLKASVTPDEVTYVGVLSACTHTGMVD 495

Query: 492 KGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAG 551
           +G+++F SMT +HGIEPNIAHYGC+VDLL +AG LKEAHE+I+NMPMKPNSIVWGALL  
Sbjct: 496 EGKKFFASMTARHGIEPNIAHYGCMVDLLGKAGHLKEAHEIIKNMPMKPNSIVWGALLGA 555

Query: 552 CRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTP 611
           CR++++A+MAE  ++QIL+LEP NGAVYVL CNIYAAC +W+ LRELRQ+MMD+GIKKTP
Sbjct: 556 CRIHKDAEMAERAIEQILELEPNNGAVYVLQCNIYAACNKWDKLRELRQVMMDRGIKKTP 615

Query: 612 GCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKE 671
           GCSLIEMNG VHEFVAGD+SHPQT++I  KLNKMT DLK AGYSP+ SEVFLDIAEEDKE
Sbjct: 616 GCSLIEMNGIVHEFVAGDQSHPQTKEIYGKLNKMTSDLKIAGYSPNTSEVFLDIAEEDKE 675

Query: 672 NSVFRHSEKLAIAFGLINSPPGV--------------------TIRIVKNLRMCMDCHNM 731
           N+V+RHSEKLAIAFGLINS PGV                    TIRIVKNLRMC+DCH++
Sbjct: 676 NAVYRHSEKLAIAFGLINSGPGVTIRIVKNLRMCIDCHHVAKFTIRIVKNLRMCIDCHHV 735

Query: 732 AKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           AKLVSKVY+REVIVRDRTRFHHF+HG CSCKDYW
Sbjct: 736 AKLVSKVYDREVIVRDRTRFHHFRHGSCSCKDYW 764

BLAST of Cla015352 vs. TrEMBL
Match: A0A067KYX9_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06115 PE=4 SV=1)

HSP 1 Score: 1059.7 bits (2739), Expect = 1.7e-306
Identity = 498/719 (69.26%), Postives = 607/719 (84.42%), Query Frame = 1

Query: 26  ATISSASSSFSPPTHPLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHE 85
           +T+  +S +  P  +P  SLL+TCKSMDQL+QIH  AIKT    NP+ QNK+++ CCT E
Sbjct: 3   STLPLSSPTHIPSENPPFSLLQTCKSMDQLKQIHSLAIKTGIACNPITQNKIIALCCTKE 62

Query: 86  CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFL 145
            GD+ YA  LFD I EP +F+WNTM++GYSR   P+LGVS YLEML+R F PD YT+PFL
Sbjct: 63  FGDMDYARQLFDTISEPTVFLWNTMLKGYSRTGYPKLGVSTYLEMLKRSFIPDCYTYPFL 122

Query: 146 FKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADV 205
            KGFTRDIALE G+ELH HV+K GL S+VF+  AL+ MY LCG +D AR + D+  K+DV
Sbjct: 123 MKGFTRDIALECGKELHCHVVKYGLGSSVFIQNALINMYSLCGLIDMARGIFDMSCKSDV 182

Query: 206 IAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSY 265
           + WN +IS YN++ +++ES+++F  ME+K VLP++VT+VL+LSACSKLKDL+ G+QVH Y
Sbjct: 183 VTWNTVISGYNRIKQYDESKKLFCKMEKKGVLPSSVTVVLVLSACSKLKDLECGQQVHKY 242

Query: 266 VNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVA 325
           V +  VESNL +ENALIDMYA CGEM  AL IF++M  +D+ISWT IV+GF N+G++D+A
Sbjct: 243 VTDRIVESNLTVENALIDMYAACGEMSVALQIFKNMKKRDVISWTAIVTGFVNIGQLDMA 302

Query: 326 RNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHL 385
           RNYFD+MPE+DYVSWTAMI+GYIR N FKEAL LFR MQA+NVKPDEFTMVS+LTACA L
Sbjct: 303 RNYFDQMPERDYVSWTAMINGYIRVNCFKEALILFRQMQASNVKPDEFTMVSVLTACAQL 362

Query: 386 GALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMI 445
           GALELGEW++TYID+NK+ NDAFV NALIDMYFKCG V+KA  IF  + QRDKFTWTAMI
Sbjct: 363 GALELGEWVKTYIDKNKVKNDAFVGNALIDMYFKCGEVEKARSIFNGISQRDKFTWTAMI 422

Query: 446 VGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGI 505
           VGLA+NGHG++ALD+F+QMLKAS+ PDEITY+GVL ACTHTGMV +GR++F+ MTTQHGI
Sbjct: 423 VGLAINGHGKEALDVFAQMLKASVTPDEITYVGVLCACTHTGMVNEGRKFFVGMTTQHGI 482

Query: 506 EPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVK 565
           +PN+AHYGC+VDLL RAG LKEAHEVI+NMPMKPNSIVWGALL  CRV+++A+MAEM  K
Sbjct: 483 DPNVAHYGCMVDLLGRAGHLKEAHEVIKNMPMKPNSIVWGALLGACRVHKDAEMAEMAAK 542

Query: 566 QILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFV 625
           QIL+L+P NGAVYV+L NIY AC + ++LRELR+ MMD+GIKK PGCSLIEMNG VHEFV
Sbjct: 543 QILELDPANGAVYVILRNIYIACNKRDNLRELRKTMMDRGIKKIPGCSLIEMNGAVHEFV 602

Query: 626 AGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAIAFG 685
           AGD+SHPQ + I +KL++MT DLK AGYSPD SEVFL+I EEDKE++V+ HSEKLAIAFG
Sbjct: 603 AGDQSHPQKKAIYLKLDEMTSDLKLAGYSPDTSEVFLEIGEEDKESAVYLHSEKLAIAFG 662

Query: 686 LINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           LI+S  G TIRIVKNLR+C+DCH MAKLVSKVY+REVIVRDRTRFHHF+HG CSC+DYW
Sbjct: 663 LISSGAGTTIRIVKNLRICVDCHQMAKLVSKVYDREVIVRDRTRFHHFRHGSCSCEDYW 721

BLAST of Cla015352 vs. TrEMBL
Match: A0A067ECK2_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005000mg PE=4 SV=1)

HSP 1 Score: 1059.7 bits (2739), Expect = 1.7e-306
Identity = 495/719 (68.85%), Postives = 607/719 (84.42%), Query Frame = 1

Query: 31  ASSSFSPPTH-----PLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCCTHE 90
           ++SS SPP+      PLIS +ETC+SM QL+QIH Q IK     NP +QNK+++FCC+ E
Sbjct: 3   SNSSISPPSTLTQETPLISPIETCESMHQLKQIHSQTIKLGLLTNPTVQNKLVTFCCS-E 62

Query: 91  CGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTFPFL 150
            GD+KYA  +F +IP P++ +WNTMI+GYSR+DS + GV +YL+ML+   +PD YTFPFL
Sbjct: 63  KGDMKYACKVFRKIPRPSVCLWNTMIKGYSRIDSHKNGVLIYLDMLKSDVRPDNYTFPFL 122

Query: 151 FKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSKADV 210
            KGFTRDIA+E+G+ELH HVLK G  S+VFV  AL+  Y LCG++D AR + DV  K DV
Sbjct: 123 LKGFTRDIAVEFGKELHCHVLKFGFDSSVFVQNALISTYCLCGEVDMARGIFDVSYKDDV 182

Query: 211 IAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQVHSY 270
           + WN M S Y +V +F+E+R++F  ME K VLPT+VT+VL+LSAC+KLKDL  GK+ H Y
Sbjct: 183 VTWNAMFSGYKRVKQFDETRKLFGEMERKGVLPTSVTIVLVLSACAKLKDLDVGKRAHRY 242

Query: 271 VNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEIDVA 330
           V  CK+  NL+LENAL DMYA CGEM  AL IF ++ NKD+ISWT IV+G+ N G++D+A
Sbjct: 243 VKECKIVPNLILENALTDMYAACGEMGFALEIFGNIKNKDVISWTAIVTGYINRGQVDMA 302

Query: 331 RNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTACAHL 390
           R YFD+MPE+DYV WTAMIDGY+R NRF+EAL LFR MQ +N++PDEFT+VSILTACA+L
Sbjct: 303 RQYFDQMPERDYVLWTAMIDGYLRVNRFREALTLFREMQTSNIRPDEFTIVSILTACANL 362

Query: 391 GALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWTAMI 450
           GALELGEW++TYID+NK+ ND FV NALIDMY KCG+V+KA+R+FREM ++DKFTWTAMI
Sbjct: 363 GALELGEWVKTYIDKNKVKNDIFVGNALIDMYCKCGDVEKAQRVFREMLRKDKFTWTAMI 422

Query: 451 VGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQHGI 510
           VGLA+NGHG+K+LDMFSQML+ASI+PDE+TY+GVLSACTHTGMV++GREYF  MT QHGI
Sbjct: 423 VGLAINGHGDKSLDMFSQMLRASIIPDEVTYVGVLSACTHTGMVDEGREYFADMTIQHGI 482

Query: 511 EPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEMVVK 570
           EPN AHYGC+VDLL RAG L EA EVI+NMPMKPNSIVWGALL  CRV+R+A+MAEM  K
Sbjct: 483 EPNEAHYGCMVDLLGRAGHLNEALEVIKNMPMKPNSIVWGALLGACRVHRDAEMAEMAAK 542

Query: 571 QILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVHEFV 630
           QIL+L+P+N AVYVLLCNIYAAC RW++ RELRQM++D+GIKKTPGCS+IEMNG VHEFV
Sbjct: 543 QILELDPDNEAVYVLLCNIYAACNRWDNFRELRQMILDRGIKKTPGCSMIEMNGVVHEFV 602

Query: 631 AGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAIAFG 690
           AGD+SHPQT++I +KL++MT DLKF GY PD+SEVFLD+ EEDKE +V++HSEKLA+AFG
Sbjct: 603 AGDKSHPQTKEIYLKLDEMTSDLKFVGYMPDISEVFLDVGEEDKERAVYQHSEKLAMAFG 662

Query: 691 LINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           LI+S PGVTIRIVKNLRMC+DCH MAKLVS VY+REVIVRD+TRFHHFKHG CSCKDYW
Sbjct: 663 LISSGPGVTIRIVKNLRMCVDCHRMAKLVSMVYDREVIVRDKTRFHHFKHGSCSCKDYW 720

BLAST of Cla015352 vs. TrEMBL
Match: F6HXG1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_09s0002g07350 PE=4 SV=1)

HSP 1 Score: 1047.0 bits (2706), Expect = 1.1e-302
Identity = 494/722 (68.42%), Postives = 600/722 (83.10%), Query Frame = 1

Query: 30  SASSSFSPPTH-------PLISLLETCKSMDQLQQIHCQAIKTAFNANPVLQNKVMSFCC 89
           SA++   PPTH       P +SL++TCKSM QL+QIH Q I T   +NP++  ++++FCC
Sbjct: 3   SATTLSPPPTHLPSLPQTPPLSLIKTCKSMAQLKQIHSQTICTGLISNPIVPAQIIAFCC 62

Query: 90  THECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEMLRRGFKPDRYTF 149
            HE GD++YA  +FD +P PN F+WN MI+GYSR+  P   VS+Y EML RG  PD YT+
Sbjct: 63  KHELGDMEYARMVFDTMPGPNHFVWNNMIKGYSRVGCPNSAVSMYCEMLERGVMPDEYTY 122

Query: 150 PFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQLDTARRVLDVCSK 209
           PFL K FTRD A++ GRELH H++KLG  SNVFV  AL+ +Y L G++  AR V D  SK
Sbjct: 123 PFLLKRFTRDTAVKCGRELHDHIVKLGFSSNVFVQNALIHLYSLSGEVSVARGVFDRSSK 182

Query: 210 ADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSACSKLKDLKTGKQV 269
            DV+ WN+MIS YN+  +F+ES ++F  ME  +VLP+++TLV +LSACSKLKDL  GK+V
Sbjct: 183 GDVVTWNVMISGYNRSKQFDESMKLFDEMERMRVLPSSITLVSVLSACSKLKDLNVGKRV 242

Query: 270 HSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWTTIVSGFTNLGEI 329
           H YV + K+E   VLENALIDMYA CG+MD+ALGIF +M ++D+ISWT IV+GFTNLG++
Sbjct: 243 HRYVKDLKIEPVRVLENALIDMYAACGDMDTALGIFDNMKSRDVISWTAIVTGFTNLGQV 302

Query: 330 DVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKPDEFTMVSILTAC 389
            +ARNYFDKMPE+D+VSWTAMIDGY++ NRFKE L LFR MQA N+KPDEFTMVSILTAC
Sbjct: 303 GLARNYFDKMPERDFVSWTAMIDGYLQVNRFKEVLSLFREMQAANIKPDEFTMVSILTAC 362

Query: 390 AHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIFREMCQRDKFTWT 449
           AHLGALELGEWI+ YID+N+I  D+FV NALIDMYF CGNV+KA RIF  M  RDK +WT
Sbjct: 363 AHLGALELGEWIKAYIDKNEIKIDSFVGNALIDMYFNCGNVEKAIRIFNAMPHRDKISWT 422

Query: 450 AMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVEKGREYFLSMTTQ 509
           A+I GLA+NG+GE+ALDMFSQMLKASI PDE+T IGVL ACTH+GMV+KG+++F  MTTQ
Sbjct: 423 AVIFGLAINGYGEEALDMFSQMLKASITPDEVTCIGVLCACTHSGMVDKGKKFFARMTTQ 482

Query: 510 HGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAGCRVYREADMAEM 569
           HGIEPN+AHYGC+VDLL RAG LKEAHEVI+NMP+KPNSIVWG+LL  CRV+R+ +MAEM
Sbjct: 483 HGIEPNVAHYGCMVDLLGRAGHLKEAHEVIKNMPVKPNSIVWGSLLGACRVHRDEEMAEM 542

Query: 570 VVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTPGCSLIEMNGTVH 629
             +QIL+LEPENGAVYVLLCNIYAAC RW  L E+R++MMD+GIKKTPGCSLIEMNG+VH
Sbjct: 543 AAQQILELEPENGAVYVLLCNIYAACNRWEKLHEVRKLMMDRGIKKTPGCSLIEMNGSVH 602

Query: 630 EFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKENSVFRHSEKLAI 689
           EFVAGD+ HPQ+++I  KL++M+ DLKFAGYSPD SEVFLDI EE+KE++V+RHSEKLAI
Sbjct: 603 EFVAGDQVHPQSKEIYSKLDEMSVDLKFAGYSPDTSEVFLDIGEEEKESAVYRHSEKLAI 662

Query: 690 AFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRFHHFKHGLCSCKD 745
           AFGLI+S PGVTIRIVKNLRMC+DCH +AKLVSKVYNREVIVRDRTRFHHF+HG CSCKD
Sbjct: 663 AFGLISSGPGVTIRIVKNLRMCVDCHYVAKLVSKVYNREVIVRDRTRFHHFRHGSCSCKD 722

BLAST of Cla015352 vs. NCBI nr
Match: gi|778729028|ref|XP_004136090.2| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis sativus])

HSP 1 Score: 1385.2 bits (3584), Expect = 0.0e+00
Identity = 673/744 (90.46%), Postives = 703/744 (94.49%), Query Frame = 1

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLHHRHV   KQM TIA  SSA  SFSPPTHPLISLLETC+SMDQLQQ+HC
Sbjct: 1   MPVPVRFRTLLHHRHVKKPKQMTTIAATSSALKSFSPPTHPLISLLETCESMDQLQQVHC 60

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIK   NANPVLQN+VM+FCCTHE GD +YA  LFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 61  QAIKKGLNANPVLQNRVMTFCCTHEYGDFQYARRLFDEIPEPNLFIWNTMIRGYSRLDFP 120

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ NVFVHTAL
Sbjct: 121 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQYNVFVHTAL 180

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR V DVC KADVI WNM+ISAYNKVG+FEESRR+FL ME+KQVLPTT
Sbjct: 181 VQMYLLCGQLDTARGVFDVCPKADVITWNMIISAYNKVGKFEESRRLFLVMEDKQVLPTT 240

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENA+IDMYA CGEMDSALGIFRS
Sbjct: 241 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENAMIDMYADCGEMDSALGIFRS 300

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 301 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYIDRNKI ND FVRNALIDMYFKC
Sbjct: 361 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYIDRNKIKNDLFVRNALIDMYFKC 420

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAE IFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 421 GDVDKAESIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 480

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA+EVIENMP+K N
Sbjct: 481 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYEVIENMPIKAN 540

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYRE+DMAEMVVKQIL+LEP+NGAVYVLLCNIYAACKRWNDLRELRQM
Sbjct: 541 SIVWGALLAGCRVYRESDMAEMVVKQILELEPDNGAVYVLLCNIYAACKRWNDLRELRQM 600

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKKTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 601 MMDKGIKKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 660

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNR 720
           FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSKVYNR
Sbjct: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSKVYNR 720

Query: 721 EVIVRDRTRFHHFKHGLCSCKDYW 745
           EVIVRDRTRFHHFKHGLCSCKDYW
Sbjct: 721 EVIVRDRTRFHHFKHGLCSCKDYW 744

BLAST of Cla015352 vs. NCBI nr
Match: gi|659122425|ref|XP_008461137.1| (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At3g15930 [Cucumis melo])

HSP 1 Score: 1377.5 bits (3564), Expect = 0.0e+00
Identity = 671/744 (90.19%), Postives = 703/744 (94.49%), Query Frame = 1

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLH  HV  SKQM TIA  SSAS SFSPPT PLI LLETCKSMDQLQQ+HC
Sbjct: 13  MPVPVRFRTLLHRFHVKESKQMPTIAATSSASKSFSPPTRPLIYLLETCKSMDQLQQVHC 72

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIKT  NANPVLQN+VMSFCCT + GD +YA HLFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 73  QAIKTGLNANPVLQNRVMSFCCTDDYGDFQYARHLFDEIPEPNLFIWNTMIRGYSRLDFP 132

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ+NVFVHTAL
Sbjct: 133 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQNNVFVHTAL 192

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR VLDVCSKADVI WNM+ISAYNKVG+FEESRR+FL ME KQVL TT
Sbjct: 193 VQMYLLCGQLDTARGVLDVCSKADVITWNMIISAYNKVGKFEESRRLFLVMENKQVLATT 252

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENALIDMYA CGEMDSALGIFRS
Sbjct: 253 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENALIDMYADCGEMDSALGIFRS 312

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 313 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 372

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYI+RNKINND FVRNALIDMYFKC
Sbjct: 373 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYINRNKINNDLFVRNALIDMYFKC 432

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAERIFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 433 GDVDKAERIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 492

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA++VI+NMP+K N
Sbjct: 493 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYDVIKNMPIKAN 552

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYREADMAEMVVK IL+LEP+NGAVYVLLCNIYAACKRWN+LRELRQM
Sbjct: 553 SIVWGALLAGCRVYREADMAEMVVKHILELEPDNGAVYVLLCNIYAACKRWNELRELRQM 612

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGI KTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 613 MMDKGIXKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 672

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNR 720
           FLD+AEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSKVYNR
Sbjct: 673 FLDVAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSKVYNR 732

Query: 721 EVIVRDRTRFHHFKHGLCSCKDYW 745
           EVIVRDRTRFHHFKHGLCSCKDYW
Sbjct: 733 EVIVRDRTRFHHFKHGLCSCKDYW 756

BLAST of Cla015352 vs. NCBI nr
Match: gi|700190022|gb|KGN45255.1| (hypothetical protein Csa_7G432370 [Cucumis sativus])

HSP 1 Score: 1320.8 bits (3417), Expect = 0.0e+00
Identity = 645/716 (90.08%), Postives = 675/716 (94.27%), Query Frame = 1

Query: 1   MPTPVPFRTLLHHRHVTNSKQMATIATISSASSSFSPPTHPLISLLETCKSMDQLQQIHC 60
           MP PV FRTLLHHRHV   KQM TIA  SSA  SFSPPTHPLISLLETC+SMDQLQQ+HC
Sbjct: 1   MPVPVRFRTLLHHRHVKKPKQMTTIAATSSALKSFSPPTHPLISLLETCESMDQLQQVHC 60

Query: 61  QAIKTAFNANPVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSP 120
           QAIK   NANPVLQN+VM+FCCTHE GD +YA  LFDEIPEPNLFIWNTMIRGYSRLD P
Sbjct: 61  QAIKKGLNANPVLQNRVMTFCCTHEYGDFQYARRLFDEIPEPNLFIWNTMIRGYSRLDFP 120

Query: 121 ELGVSLYLEMLRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTAL 180
           +LGVSLYLEMLRRG KPDRYTFPFLFKGFTRDIALEYGR+LHGHVLK GLQ NVFVHTAL
Sbjct: 121 QLGVSLYLEMLRRGVKPDRYTFPFLFKGFTRDIALEYGRQLHGHVLKHGLQYNVFVHTAL 180

Query: 181 VQMYLLCGQLDTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTT 240
           VQMYLLCGQLDTAR V DVC KADVI WNM+ISAYNKVG+FEESRR+FL ME+KQVLPTT
Sbjct: 181 VQMYLLCGQLDTARGVFDVCPKADVITWNMIISAYNKVGKFEESRRLFLVMEDKQVLPTT 240

Query: 241 VTLVLILSACSKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRS 300
           VTLVL+LSACSKLKDL+TGK+VHSYV NCKVESNLVLENA+IDMYA CGEMDSALGIFRS
Sbjct: 241 VTLVLVLSACSKLKDLRTGKKVHSYVKNCKVESNLVLENAMIDMYADCGEMDSALGIFRS 300

Query: 301 MNNKDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360
           MNN+DIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF
Sbjct: 301 MNNRDIISWTTIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELF 360

Query: 361 RNMQATNVKPDEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKC 420
           RNMQATNVKPDEFTMVS+LTACAHLGALELGEWIRTYIDRNKI ND FVRNALIDMYFKC
Sbjct: 361 RNMQATNVKPDEFTMVSVLTACAHLGALELGEWIRTYIDRNKIKNDLFVRNALIDMYFKC 420

Query: 421 GNVDKAERIFREMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVL 480
           G+VDKAE IFREM QRDKFTWTAMIVGLAVNGHGEKALDMFS MLKASILPDEITYIGVL
Sbjct: 421 GDVDKAESIFREMSQRDKFTWTAMIVGLAVNGHGEKALDMFSNMLKASILPDEITYIGVL 480

Query: 481 SACTHTGMVEKGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPN 540
           SACTHTG+V+KGR+YFL MT+QHGIEPNIAHYGCLVDLLARAG LKEA+EVIENMP+K N
Sbjct: 481 SACTHTGLVDKGRKYFLRMTSQHGIEPNIAHYGCLVDLLARAGRLKEAYEVIENMPIKAN 540

Query: 541 SIVWGALLAGCRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQM 600
           SIVWGALLAGCRVYRE+DMAEMVVKQIL+LEP+NGAVYVLLCNIYAACKRWNDLRELRQM
Sbjct: 541 SIVWGALLAGCRVYRESDMAEMVVKQILELEPDNGAVYVLLCNIYAACKRWNDLRELRQM 600

Query: 601 MMDKGIKKTPGCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEV 660
           MMDKGIKKTPGCSLIEMNG VHEFVAGDRSHPQT+ ID KL+KMTQDLK AGYSPD+SEV
Sbjct: 601 MMDKGIKKTPGCSLIEMNGRVHEFVAGDRSHPQTKNIDAKLDKMTQDLKLAGYSPDISEV 660

Query: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSK 717
           FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRI KNLRMCMDCHNMAKLVSK
Sbjct: 661 FLDIAEEDKENSVFRHSEKLAIAFGLINSPPGVTIRITKNLRMCMDCHNMAKLVSK 716

BLAST of Cla015352 vs. NCBI nr
Match: gi|743939749|ref|XP_011014327.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g15930 [Populus euphratica])

HSP 1 Score: 1075.5 bits (2780), Expect = 0.0e+00
Identity = 507/734 (69.07%), Postives = 612/734 (83.38%), Query Frame = 1

Query: 12  HHRHVTNSKQMATIATISSASSSFSPPT-HPLISLLETCKSMDQLQQIHCQAIKTAFNAN 71
           HH H +   +M     IS A S  SP T +P +SL ETCKSM  L+QIH + IKT    N
Sbjct: 42  HHLHSSFLNKM-----ISMACSQSSPVTENPPLSLFETCKSMYHLKQIHSRTIKTGIICN 101

Query: 72  PVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEM 131
           P++QNK++SFCC+ E  D+ YA  LFD IPEP++F WN M +GYSR+  P+LGVSLYLEM
Sbjct: 102 PIIQNKILSFCCSRELSDMCYARQLFDTIPEPSVFSWNIMFKGYSRIACPKLGVSLYLEM 161

Query: 132 LRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQL 191
           L R  KPD YT+PFLFKGFTR +AL++GRELH HV+K GL SNVF H AL+ MY LCG +
Sbjct: 162 LERNVKPDCYTYPFLFKGFTRSVALQFGRELHCHVVKYGLDSNVFAHNALINMYSLCGLI 221

Query: 192 DTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSAC 251
           D AR + D+  K+DV+ WN MIS YN++ +++E+R++F  MEEK +LPT+VT V +LSAC
Sbjct: 222 DMARGIFDMSCKSDVVTWNAMISGYNRIKKYDEARKLFDMMEEKGILPTSVTCVSVLSAC 281

Query: 252 SKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWT 311
           SKLKDL+ GK+V  Y+ N  VE NL + NALID+YA CGEM+ ALGIF +M N+D+ISWT
Sbjct: 282 SKLKDLECGKRVQEYIRNGVVEVNLKVGNALIDLYAACGEMNVALGIFENMKNRDVISWT 341

Query: 312 TIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKP 371
            IV+GF N G++D AR YF KMPE+D++SWTAMIDGY+R N +KEAL LFR MQ + +KP
Sbjct: 342 AIVTGFVNTGQVDAARKYFHKMPERDHISWTAMIDGYLRLNYYKEALMLFREMQTSKIKP 401

Query: 372 DEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIF 431
           DEFTMVSILTACA LGALELGEWIRTYID+NK+ ND FV NALIDMYFKCG+V+KA  IF
Sbjct: 402 DEFTMVSILTACAQLGALELGEWIRTYIDKNKVKNDTFVGNALIDMYFKCGHVEKALSIF 461

Query: 432 REMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVE 491
             + QRDKFTWTAM+VGLA+NG GE+AL+MFSQMLKA + PDE+TY+GVLSACTHTGMV+
Sbjct: 462 NTLPQRDKFTWTAMVVGLAINGCGEEALNMFSQMLKACVTPDEVTYVGVLSACTHTGMVD 521

Query: 492 KGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAG 551
           +G+++F SMT +HGIEPN+AHYGC+VDLL +AG LKEAHE+I+NMPMKPNSIVWGALL  
Sbjct: 522 EGKKFFASMTARHGIEPNVAHYGCMVDLLGKAGHLKEAHEIIKNMPMKPNSIVWGALLGA 581

Query: 552 CRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTP 611
           CR++++A+MAE  ++QIL+LEP NGAVYVL CNIYAAC +W+ LRELR++MMD+GIKKTP
Sbjct: 582 CRIHKDAEMAERAIEQILELEPNNGAVYVLQCNIYAACNKWDKLRELRRVMMDRGIKKTP 641

Query: 612 GCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKE 671
           GCSLIEMNG VHEFVAGDRSHPQT++I  KLNKMT DLK AGYSP+ SEVFLDI+EEDKE
Sbjct: 642 GCSLIEMNGIVHEFVAGDRSHPQTKEIYGKLNKMTSDLKIAGYSPNTSEVFLDISEEDKE 701

Query: 672 NSVFRHSEKLAIAFGLINSPPGVTIRIVKNLRMCMDCHNMAKLVSKVYNREVIVRDRTRF 731
           N+V+RHSEKLAIAFGLINS PGVTIRIVKNLRMC+DCH++AKLVSKVY+REVIVRDRTRF
Sbjct: 702 NAVYRHSEKLAIAFGLINSGPGVTIRIVKNLRMCIDCHHVAKLVSKVYDREVIVRDRTRF 761

Query: 732 HHFKHGLCSCKDYW 745
           HHF+HG CSCKDYW
Sbjct: 762 HHFRHGSCSCKDYW 770

BLAST of Cla015352 vs. NCBI nr
Match: gi|566160501|ref|XP_006385300.1| (hypothetical protein POPTR_0003s02590g [Populus trichocarpa])

HSP 1 Score: 1072.8 bits (2773), Expect = 2.3e-310
Identity = 514/754 (68.17%), Postives = 615/754 (81.56%), Query Frame = 1

Query: 12  HHRHVTNSKQMATIATISSASSSFSPPT-HPLISLLETCKSMDQLQQIHCQAIKTAFNAN 71
           HH H +  K+M     IS A S  SP T +P +SL ETCKSM  L+QIH + IKT    N
Sbjct: 16  HHLHSSFLKKM-----ISMACSQSSPVTENPPLSLFETCKSMYHLKQIHSRTIKTGIICN 75

Query: 72  PVLQNKVMSFCCTHECGDLKYAHHLFDEIPEPNLFIWNTMIRGYSRLDSPELGVSLYLEM 131
           P++QNK++SFCC+ E GD+ YA  LFD IPEP++F WN M +GYSR+  P+LGVSLYLEM
Sbjct: 76  PIIQNKILSFCCSREFGDMCYARQLFDTIPEPSVFSWNIMFKGYSRIACPKLGVSLYLEM 135

Query: 132 LRRGFKPDRYTFPFLFKGFTRDIALEYGRELHGHVLKLGLQSNVFVHTALVQMYLLCGQL 191
           L R  KPD YT+PFLFKGFTR +AL+ GRELH HV+K GL SNVF H AL+ MY LCG +
Sbjct: 136 LERNVKPDCYTYPFLFKGFTRSVALQLGRELHCHVVKYGLDSNVFAHNALINMYSLCGLI 195

Query: 192 DTARRVLDVCSKADVIAWNMMISAYNKVGEFEESRRIFLGMEEKQVLPTTVTLVLILSAC 251
           D AR + D+  K+DV+ WN MIS YN++ +++E+R++F  MEEK +LPT+VT V +LSAC
Sbjct: 196 DMARGIFDMSCKSDVVTWNAMISGYNRIKKYDEARKLFDMMEEKGILPTSVTCVSVLSAC 255

Query: 252 SKLKDLKTGKQVHSYVNNCKVESNLVLENALIDMYATCGEMDSALGIFRSMNNKDIISWT 311
           SKLKDL+ GK+V  Y+ N  VE NL +ENALIDMYA+CGEM+ ALGIF +M N+D+ISWT
Sbjct: 256 SKLKDLECGKRVQKYIRNGVVEVNLKVENALIDMYASCGEMNVALGIFENMKNRDVISWT 315

Query: 312 TIVSGFTNLGEIDVARNYFDKMPEKDYVSWTAMIDGYIRSNRFKEALELFRNMQATNVKP 371
            IV+GF N G++D AR YF KMPE+D+VSWTAMIDGY+R N +KEAL LFR MQ + +KP
Sbjct: 316 AIVTGFVNTGQVDAARKYFHKMPERDHVSWTAMIDGYLRLNCYKEALMLFREMQTSKIKP 375

Query: 372 DEFTMVSILTACAHLGALELGEWIRTYIDRNKINNDAFVRNALIDMYFKCGNVDKAERIF 431
           DEFTMVS+LTACA LGALELGEWIRTYID+NK+ ND FV NALIDMYFKCGNV+ A  IF
Sbjct: 376 DEFTMVSVLTACAQLGALELGEWIRTYIDKNKVKNDTFVGNALIDMYFKCGNVEMALSIF 435

Query: 432 REMCQRDKFTWTAMIVGLAVNGHGEKALDMFSQMLKASILPDEITYIGVLSACTHTGMVE 491
             + QRDKFTWTAM+VGLA+NG GE+AL+MFSQMLKAS+ PDE+TY+GVLSACTHTGMV+
Sbjct: 436 NTLPQRDKFTWTAMVVGLAINGCGEEALNMFSQMLKASVTPDEVTYVGVLSACTHTGMVD 495

Query: 492 KGREYFLSMTTQHGIEPNIAHYGCLVDLLARAGCLKEAHEVIENMPMKPNSIVWGALLAG 551
           +G+++F SMT +HGIEPNIAHYGC+VDLL +AG LKEAHE+I+NMPMKPNSIVWGALL  
Sbjct: 496 EGKKFFASMTARHGIEPNIAHYGCMVDLLGKAGHLKEAHEIIKNMPMKPNSIVWGALLGA 555

Query: 552 CRVYREADMAEMVVKQILDLEPENGAVYVLLCNIYAACKRWNDLRELRQMMMDKGIKKTP 611
           CR++++A+MAE  ++QIL+LEP NGAVYVL CNIYAAC +W+ LRELRQ+MMD+GIKKTP
Sbjct: 556 CRIHKDAEMAERAIEQILELEPNNGAVYVLQCNIYAACNKWDKLRELRQVMMDRGIKKTP 615

Query: 612 GCSLIEMNGTVHEFVAGDRSHPQTEKIDVKLNKMTQDLKFAGYSPDVSEVFLDIAEEDKE 671
           GCSLIEMNG VHEFVAGD+SHPQT++I  KLNKMT DLK AGYSP+ SEVFLDIAEEDKE
Sbjct: 616 GCSLIEMNGIVHEFVAGDQSHPQTKEIYGKLNKMTSDLKIAGYSPNTSEVFLDIAEEDKE 675

Query: 672 NSVFRHSEKLAIAFGLINSPPGV--------------------TIRIVKNLRMCMDCHNM 731
           N+V+RHSEKLAIAFGLINS PGV                    TIRIVKNLRMC+DCH++
Sbjct: 676 NAVYRHSEKLAIAFGLINSGPGVTIRIVKNLRMCIDCHHVAKFTIRIVKNLRMCIDCHHV 735

Query: 732 AKLVSKVYNREVIVRDRTRFHHFKHGLCSCKDYW 745
           AKLVSKVY+REVIVRDRTRFHHF+HG CSCKDYW
Sbjct: 736 AKLVSKVYDREVIVRDRTRFHHFRHGSCSCKDYW 764

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP235_ARATH1.5e-21555.78Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis th... [more]
PP175_ARATH1.2e-18042.66Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
PPR21_ARATH7.0e-16539.45Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
PP249_ARATH1.7e-16341.30Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN... [more]
PP311_ARATH1.8e-16039.05Pentatricopeptide repeat-containing protein At4g14820 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0K6A7_CUCSA0.0e+0090.08Uncharacterized protein OS=Cucumis sativus GN=Csa_7G432370 PE=4 SV=1[more]
U5GP98_POPTR1.6e-31068.17Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0003s02590g PE=4 SV=1[more]
A0A067KYX9_JATCU1.7e-30669.26Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06115 PE=4 SV=1[more]
A0A067ECK2_CITSI1.7e-30668.85Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005000mg PE=4 SV=1[more]
F6HXG1_VITVI1.1e-30268.42Putative uncharacterized protein OS=Vitis vinifera GN=VIT_09s0002g07350 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
gi|778729028|ref|XP_004136090.2|0.0e+0090.46PREDICTED: putative pentatricopeptide repeat-containing protein At3g15930 [Cucum... [more]
gi|659122425|ref|XP_008461137.1|0.0e+0090.19PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro... [more]
gi|700190022|gb|KGN45255.1|0.0e+0090.08hypothetical protein Csa_7G432370 [Cucumis sativus][more]
gi|743939749|ref|XP_011014327.1|0.0e+0069.07PREDICTED: putative pentatricopeptide repeat-containing protein At3g15930 [Popul... [more]
gi|566160501|ref|XP_006385300.1|2.3e-31068.17hypothetical protein POPTR_0003s02590g [Populus trichocarpa][more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0008568 microtubule-severing ATPase activity

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cla015352Cla015352gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cla015352Cla015352.1-proteinpolypeptide


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cla015352.1.cds1Cla015352.1.cds1CDS


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 411..436
score: 6.1E-6coord: 279..305
score: 0.0049coord: 511..536
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 437..483
score: 2.6E-9coord: 204..251
score: 3.5E-7coord: 335..383
score: 1.8E-12coord: 102..150
score: 2.7
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 338..372
score: 3.4E-9coord: 106..138
score: 2.1E-8coord: 439..473
score: 2.7E-7coord: 411..436
score: 3.0E-6coord: 207..239
score: 1.0E-5coord: 474..509
score: 3.1E-4coord: 307..337
score: 5.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 336..370
score: 12.847coord: 204..238
score: 11.542coord: 274..304
score: 8.089coord: 540..570
score: 5.393coord: 437..471
score: 10.983coord: 508..538
score: 7.794coord: 70..102
score: 5.733coord: 305..335
score: 8.912coord: 138..172
score: 5.875coord: 103..137
score: 12.025coord: 371..405
score: 5.525coord: 406..436
score: 10.095coord: 239..273
score: 5.744coord: 472..507
score: 8.747coord: 574..608
score: 7.706coord: 173..203
score: 6
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 334..535
score: 1.6E-11coord: 568..593
score: 1.6E-11coord: 184..235
score: 1.6
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 400..599
score: 3.79E-6coord: 174..231
score: 3.79E-6coord: 301..370
score: 1.38E-6coord: 407..443
score: 1.38E-6coord: 558..596
score: 1.3
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 4..23
score: 0.0coord: 50..615
score:
NoneNo IPR availablePANTHERPTHR24015:SF373SUBFAMILY NOT NAMEDcoord: 50..615
score: 0.0coord: 4..23
score: