Cla011294 (gene) Watermelon (97103) v1

Name: Cla011294
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7L966_ARALL); contains InterPro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr3 : 27297526 .. 27298980 (+)
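The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), and the helper name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Chr3 coordinates copied from the Location field of this record.
start, end = 27297526, 27298980
length_bp = feature_length(start, end)

# If the whole span is coding, it should split into complete codons.
codons, remainder = divmod(length_bp, 3)
print(length_bp, codons, remainder)  # 1455 485 0
```

Under that assumption the span is 1455 bp, i.e. 485 codons with no remainder, which would correspond to 484 residues plus a stop codon.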
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGCTTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACATACGAATTTCTTCCCGTCAACTTAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATTCCCCTCCCCTTACGCCAAACTAATCGCCCATTCCTGCAAGAAATCTTCCCCAGAAGCCATCGCCTACGCCCAGTTGATATTCCGGCACCACCAATACCCTCCAAGTCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCAACATTCCATCTCCACTTTTGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGATTTAACTTTCATTTTCGTGCTCGGAGCCTGCGCGCGAGCCCCATCGCTGTCTACGTTAATGATCGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACGATGATACATTTTTATGCGATTAACAAAGATGTGGGTATCGCACGGAAGGTGTTTGATGAAATGTGTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGTACTGCTCACAAAGTGGAAAGGTTGCTCAGAGATATGCCCGAGATTCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACAATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCATGAAACCGGCTCTTGTGTACATGCATATATCGAGAAGACAATTGATTCTCCCGAAAATGATCTGTTTATTGGCACTGGTTTGGTTAATATGTACTCGAAATGTGGGTGTATTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGACAGGACTGGCCGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAACTCATGGTGTAAAGCCAAACGCAGTAACTTTCACAAGTCTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCCGGGCACTTGAGAGAGGCATATGACTTGATACTTGGAATGCCAGTGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTTTGGTCCATGGCGATGTTCAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGCGAGAGTTCTGATGATGAGTGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTGTATGCTTCTGCTGAAAGGTGGGGGGATGTGGAGGCTGTAAGGGAGGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGATGTAGTTCGGTTCAAACTACGGGTTCTCAAGGCTTGGAGGTTTTATAG

mRNA sequence

ATGCGCTTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACATACGAATTTCTTCCCGTCAACTTAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATTCCCCTCCCCTTACGCCAAACTAATCGCCCATTCCTGCAAGAAATCTTCCCCAGAAGCCATCGCCTACGCCCAGTTGATATTCCGGCACCACCAATACCCTCCAAGTCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCAACATTCCATCTCCACTTTTGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGATTTAACTTTCATTTTCGTGCTCGGAGCCTGCGCGCGAGCCCCATCGCTGTCTACGTTAATGATCGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACGATGATACATTTTTATGCGATTAACAAAGATGTGGGTATCGCACGGAAGGTGTTTGATGAAATGTGTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGTACTGCTCACAAAGTGGAAAGGTTGCTCAGAGATATGCCCGAGATTCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACAATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCATGAAACCGGCTCTTGTGTACATGCATATATCGAGAAGACAATTGATTCTCCCGAAAATGATCTGTTTATTGGCACTGGTTTGGTTAATATGTACTCGAAATGTGGGTGTATTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGACAGGACTGGCCGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAACTCATGGTGTAAAGCCAAACGCAGTAACTTTCACAAGTCTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCCGGGCACTTGAGAGAGGCATATGACTTGATACTTGGAATGCCAGTGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTTTGGTCCATGGCGATGTTCAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGCGAGAGTTCTGATGATGAGTGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTGTATGCTTCTGCTGAAAGGTGGGGGGATGTGGAGGCTGTAAGGGAGGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGATGTAGTTCGGTTCAAACTACGGGTTCTCAAGGCTTGGAGGTTTTATAG

Coding sequence (CDS)

ATGCGCTTAGCCCCAAGACTGAGGTGCATCCACCACCTAAATAACATACGAATTTCTTCCCGTCAACTTAAACAAATTCACGCCCAATTGATAACCAATGGCTTCAAATTCCCCTCCCCTTACGCCAAACTAATCGCCCATTCCTGCAAGAAATCTTCCCCAGAAGCCATCGCCTACGCCCAGTTGATATTCCGGCACCACCAATACCCTCCAAGTCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCAACATTCCATCTCCACTTTTGCCACTTGGGTCTCCACCCCCCACTTCGAATTCGACGATTTAACTTTCATTTTCGTGCTCGGAGCCTGCGCGCGAGCCCCATCGCTGTCTACGTTAATGATCGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACGATGATACATTTTTATGCGATTAACAAAGATGTGGGTATCGCACGGAAGGTGTTTGATGAAATGTGTGTGAGAAATAGTGTTACCTGGAATGCGATGATTGCAGGGTACTGCTCACAAAGTGGAAAGGTTGCTCAGAGATATGCCCGAGATTCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTCTGAGGTGAAACCAACGGATACTACAATGGTTTGCCTTCTTTCAGCTGCATCTCAACTGGGTGTGCATGAAACCGGCTCTTGTGTACATGCATATATCGAGAAGACAATTGATTCTCCCGAAAATGATCTGTTTATTGGCACTGGTTTGGTTAATATGTACTCGAAATGTGGGTGTATTAACAGTGCTTCATCAGTTTTTAAGCAGATGAAGCAGAGGAACGTTTTGACGTGGACAGCCATGGCGACAGGACTGGCCGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAACTCATGGTGTAAAGCCAAACGCAGTAACTTTCACAAGTCTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCATGTCATGGAGAGGAAGTTTGGGGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCCGGGCACTTGAGAGAGGCATATGACTTGATACTTGGAATGCCAGTGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTTTGGTCCATGGCGATGTTCAAATGGGAGAGAGGGTGGGTAAGTTGCTTGTGGAGAGACAGGGAGGCGAGAGTTCTGATGATGAGTGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTGTATGCTTCTGCTGAAAGGTGGGGGGATGTGGAGGCTGTAAGGGAGGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGATGTAGTTCGGTTCAAACTACGGGTTCTCAAGGCTTGGAGGTTTTATAG

Protein sequence

MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQGLEVL
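The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard nuclear genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
print(translate("ATGCGCTTAGCCCCAAGACTGAGG"))  # MRLAPRLR
```

Passing the full CDS string to `translate` should give the protein sequence listed above, provided the assumptions about genetic code and frame hold.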

BLAST of Cla011294 vs. Swiss-Prot
Match: PP243_ARATH (Pentatricopeptide repeat-containing protein At3g18970 OS=Arabidopsis thaliana GN=PCMP-E93 PE=2 SV=1)

HSP 1 Score: 433.3 bits (1113), Expect = 3.4e-120
Identity = 236/457 (51.64%), Positives = 306/457 (66.96%), Query Frame = 1

Query: 22  QLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIA-YAQLIFRHHQYPPSLFLFNTLI 81
           Q KQIHAQL+ NG    S + KLI H C K S E+ +  A L+       P  FLFNTL+
Sbjct: 23  QAKQIHAQLVINGCHDNSLFGKLIGHYCSKPSTESSSKLAHLLVFPRFGHPDKFLFNTLL 82

Query: 82  RCAPPQHSISTFATWVSTPHFEF-DDLTFIFVLGACARAPSLSTLMIGRQIHTHILKRGI 141
           +C+ P+ SI  FA + S     + ++ TF+FVLGACAR+ S S L +GR +H  + K G 
Sbjct: 83  KCSKPEDSIRIFANYASKSSLLYLNERTFVFVLGACARSASSSALRVGRIVHGMVKKLGF 142

Query: 142 V-SNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQRYARD 201
           +  +  + TT++HFYA N D+  ARKVFDEM  R SVTWNAMI GYCS   K     AR 
Sbjct: 143 LYESELIGTTLLHFYAKNGDLRYARKVFDEMPERTSVTWNAMIGGYCSHKDK-GNHNARK 202

Query: 202 SLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTIDSPENDLFI 261
           ++ LFR        S V+PTDTTMVC+LSA SQ G+ E GS VH YIEK   +PE D+FI
Sbjct: 203 AMVLFRRF--SCCGSGVRPTDTTMVCVLSAISQTGLLEIGSLVHGYIEKLGFTPEVDVFI 262

Query: 262 GTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDAMGTHGV 321
           GT LV+MYSKCGC+N+A SVF+ MK +NV TWT+MATGLA++GRG E   LL+ M   G+
Sbjct: 263 GTALVDMYSKCGCLNNAFSVFELMKVKNVFTWTSMATGLALNGRGNETPNLLNRMAESGI 322

Query: 322 KPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAY 381
           KPN +TFTSLLSA  H GL+EEG+ LF  M+ +FGV P ++HYGCIVDLLG++G ++EAY
Sbjct: 323 KPNEITFTSLLSAYRHIGLVEEGIELFKSMKTRFGVTPVIEHYGCIVDLLGKAGRIQEAY 382

Query: 382 DLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSDDEWCVGS--EDF 441
             IL MP++PD +L RSL ++C ++G+  MGE +GK L+E +     +DE   GS  ED+
Sbjct: 383 QFILAMPIKPDAILLRSLCNACSIYGETVMGEEIGKALLEIE----REDEKLSGSECEDY 442

Query: 442 VALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSV 474
           VALSNV A   +W +VE +R+EMK + I+ + G S V
Sbjct: 443 VALSNVLAHKGKWVEVEKLRKEMKERRIKTRPGYSFV 472

BLAST of Cla011294 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 270.4 bits (690), Expect = 3.8e-71
Identity = 161/475 (33.89%), Positives = 269/475 (56.63%), Query Frame = 1

Query: 8   RCIHHLNNIRISS-RQLKQIHAQLITNGFKFPSPYA--KLIAHSCKKSSPEAIAYAQLIF 67
           +CI+ L    +SS  +L+QIHA  I +G           LI +     SP  ++YA  +F
Sbjct: 17  KCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVF 76

Query: 68  RHHQYPPSLFLFNTLIR-CAPPQHSISTFATWVS---TPHFEFDDLTFIFVLGACARAPS 127
              + P ++F++NTLIR  A   +SIS F+ +     +   E D  T+ F++ A     +
Sbjct: 77  SKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVT---T 136

Query: 128 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 187
           ++ + +G  IH+ +++ G  S I+VQ +++H YA   DV  A KVFD+M  ++ V WN++
Sbjct: 137 MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 196

Query: 188 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 247
           I G+ +++GK  +  A         +  E  +  +KP   T+V LLSA +++G    G  
Sbjct: 197 INGF-AENGKPEEALA---------LYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKR 256

Query: 248 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 307
           VH Y+ K       +L     L+++Y++CG +  A ++F +M  +N ++WT++  GLAV+
Sbjct: 257 VHVYMIKV--GLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVN 316

Query: 308 GRGKEALELLDAM-GTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 367
           G GKEA+EL   M  T G+ P  +TF  +L AC H G+++EG   F  M  ++ + P+++
Sbjct: 317 GFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIE 376

Query: 368 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 427
           H+GC+VDLL R+G +++AY+ I  MP++P+ V+WR+LL +C VHGD  + E     +++ 
Sbjct: 377 HFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQL 436

Query: 428 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQ 475
           +   S          D+V LSN+YAS +RW DV+ +R++M   G++   G S V+
Sbjct: 437 EPNHSG---------DYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVE 467

BLAST of Cla011294 vs. Swiss-Prot
Match: PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana GN=PCMP-H51 PE=2 SV=2)

HSP 1 Score: 266.2 bits (679), Expect = 7.2e-70
Identity = 169/469 (36.03%), Positives = 250/469 (53.30%), Query Frame = 1

Query: 22  QLKQIHAQLITNGF-KFPSP---YAKLIAHSCKKSSPEAIAYAQLIFRHHQYPPSLFLFN 81
           QLKQ+HA  +   + + P+    Y K++  S   SS   + YA  +F   +   S F++N
Sbjct: 63  QLKQLHAFTLRTTYPEEPATLFLYGKILQLS---SSFSDVNYAFRVFDSIENHSS-FMWN 122

Query: 82  TLIR-CAPP----QHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLSTLMIGRQIHT 141
           TLIR CA      + +   +   +       D  TF FVL ACA     S    G+Q+H 
Sbjct: 123 TLIRACAHDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSE---GKQVHC 182

Query: 142 HILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVA 201
            I+K G   +++V   +IH Y     + +ARKVFDEM  R+ V+WN+MI         V 
Sbjct: 183 QIVKHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMI------DALVR 242

Query: 202 QRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTID-S 261
                 +L+LFR M         +P   TM  +LSA + LG    G+  HA++ +  D  
Sbjct: 243 FGEYDSALQLFREM-----QRSFEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVD 302

Query: 262 PENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLD 321
              D+ +   L+ MY KCG +  A  VF+ M++R++ +W AM  G A HGR +EA+   D
Sbjct: 303 VAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFD 362

Query: 322 AM--GTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLG 381
            M      V+PN+VTF  LL AC H G + +G   F +M R + + P ++HYGCIVDL+ 
Sbjct: 363 RMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIA 422

Query: 382 RSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHG-DVQMGERVGKLLVERQGGESSDDE 441
           R+G++ EA D+++ MP++PD V+WRSLL +C   G  V++ E + + ++  +    S + 
Sbjct: 423 RAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNG 482

Query: 442 WCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTG 478
            C G+  +V LS VYASA RW DV  VR+ M   GI  + GCSS++  G
Sbjct: 483 NCSGA--YVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSIEING 511

BLAST of Cla011294 vs. Swiss-Prot
Match: PP415_ARATH (Pentatricopeptide repeat-containing protein At5g43790 OS=Arabidopsis thaliana GN=PCMP-E30 PE=2 SV=1)

HSP 1 Score: 260.8 bits (665), Expect = 3.0e-68
Identity = 167/477 (35.01%), Positives = 256/477 (53.67%), Query Frame = 1

Query: 8   RCIHHLNNIRISSRQLKQIHAQLITNGFKFPS-PYAKLIAHSCKKSSPEAIAYAQLIFRH 67
           RC++ ++  + S + LKQIHAQ+IT G    + P +KL+      SS   ++YA  I R 
Sbjct: 11  RCLNLISKCK-SLQNLKQIHAQIITIGLSHHTYPLSKLL----HLSSTVCLSYALSILR- 70

Query: 68  HQYP-PSLFLFNTLIRCAPPQH-SISTFATWVSTPHFEFDDLTFI----FVLGACARAPS 127
            Q P PS+FL+NTLI      H S  T   +            F+    F   +  +A  
Sbjct: 71  -QIPNPSVFLYNTLISSIVSNHNSTQTHLAFSLYDQILSSRSNFVRPNEFTYPSLFKASG 130

Query: 128 LSTLMI--GRQIHTHILK--RGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVT 187
                   GR +H H+LK    +  + +VQ  ++ FYA    +  AR +F+ +   +  T
Sbjct: 131 FDAQWHRHGRALHAHVLKFLEPVNHDRFVQAALVGFYANCGKLREARSLFERIREPDLAT 190

Query: 188 WNAMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHE 247
           WN ++A Y +           + ++    +L+     +V+P + ++V L+ + + LG   
Sbjct: 191 WNTLLAAYANS----------EEIDSDEEVLLLFMRMQVRPNELSLVALIKSCANLGEFV 250

Query: 248 TGSCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATG 307
            G   H Y+ K  ++   + F+GT L+++YSKCGC++ A  VF +M QR+V  + AM  G
Sbjct: 251 RGVWAHVYVLK--NNLTLNQFVGTSLIDLYSKCGCLSFARKVFDEMSQRDVSCYNAMIRG 310

Query: 308 LAVHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVP 367
           LAVHG G+E +EL  ++ + G+ P++ TF   +SAC H GL++EGL +F+ M+  +G+ P
Sbjct: 311 LAVHGFGQEGIELYKSLISQGLVPDSATFVVTISACSHSGLVDEGLQIFNSMKAVYGIEP 370

Query: 368 QMQHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLL 427
           +++HYGC+VDLLGRSG L EA + I  MPV+P+  LWRS L S   HGD + GE   K L
Sbjct: 371 KVEHYGCLVDLLGRSGRLEEAEECIKKMPVKPNATLWRSFLGSSQTHGDFERGEIALKHL 430

Query: 428 VERQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSV 474
           +   G E  +      S ++V LSN+YA   RW DVE  RE MK   +    G S++
Sbjct: 431 L---GLEFEN------SGNYVLLSNIYAGVNRWTDVEKTRELMKDHRVNKSPGISTL 459

BLAST of Cla011294 vs. Swiss-Prot
Match: PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 256.5 bits (654), Expect = 5.7e-67
Identity = 165/476 (34.66%), Positives = 251/476 (52.73%), Query Frame = 1

Query: 12  HLNNIRISSR---QLKQIHAQLI-TNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHH 71
           HL ++ +SS     L+QIHA L+ T+  +    +   ++       P  I Y+  +F   
Sbjct: 13  HLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQ- 72

Query: 72  QYPPSLFLFNTLIRC----APPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLST 131
           +  P+L   NT+IR       P      F +         + L+  F L  C ++     
Sbjct: 73  RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKS---GD 132

Query: 132 LMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAG 191
           L+ G QIH  I   G +S+  + TT++  Y+  ++   A KVFDE+  R++V+WN + + 
Sbjct: 133 LLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSC 192

Query: 192 YCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHA 251
           Y      +  +  RD L LF  M     +  VKP   T +  L A + LG  + G  VH 
Sbjct: 193 Y------LRNKRTRDVLVLFDKMK-NDVDGCVKPDGVTCLLALQACANLGALDFGKQVHD 252

Query: 252 YIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRG 311
           +I++  +     L +   LV+MYS+CG ++ A  VF  M++RNV++WTA+ +GLA++G G
Sbjct: 253 FIDE--NGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFG 312

Query: 312 KEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMER-KFGVVPQMQHYG 371
           KEA+E  + M   G+ P   T T LLSAC H GL+ EG+  F  M   +F + P + HYG
Sbjct: 313 KEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYG 372

Query: 372 CIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGG 431
           C+VDLLGR+  L +AY LI  M ++PD  +WR+LL +C VHGDV++GERV   L+E +  
Sbjct: 373 CVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAE 432

Query: 432 ESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 479
           E+          D+V L N Y++  +W  V  +R  MK K I  K GCS+++  G+
Sbjct: 433 EAG---------DYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGT 466

BLAST of Cla011294 vs. TrEMBL
Match: A0A0A0LZ63_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G573660 PE=4 SV=1)

HSP 1 Score: 877.9 bits (2267), Expect = 5.8e-252
Identity = 426/481 (88.57%), Positives = 453/481 (94.18%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRL CIHHL+N+RISS QL QIHAQLITNGFK PSPYAKLI H CKKSS E+IA+A
Sbjct: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIFRHHQY P+LFLFNTLIRCAPP HSIS FATWVST HFEFDD TFIFVLGACARAPS
Sbjct: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           +STLMIGRQIHTHILKRGIVSNIWVQTTMIHFY+INKDVG ARK+FDEM +RNSVTWNAM
Sbjct: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQ GKV+Q+YARD+LELFRGMLVESTN EVKPTDTTMVC+LSAASQLG+ ETGSC
Sbjct: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYI+KT+DSPE D+FIGTGLVNMYSKCG +NSASSVFKQMKQ+NVLTWT+MATGLAVH
Sbjct: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           GRGKEALELLDAMG HGVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQH
Sbjct: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY LIL MP+EPDGVLWRSLLSSC++HGDV+MGERVGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420

Query: 421 GGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQG 480
           GGES DDEWCVGSEDFVALSNVYAS ERW DVEA+R+EMKIKGIENKAGCSS+QTTGSQG
Sbjct: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480

Query: 481 L 482
           L
Sbjct: 481 L 481

BLAST of Cla011294 vs. TrEMBL
Match: M5WFE9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025445mg PE=4 SV=1)

HSP 1 Score: 604.0 bits (1556), Expect = 1.6e-169
Identity = 308/486 (63.37%), Positives = 370/486 (76.13%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGF-KFPSPYAKLIAHSCKKSSPEAI-A 60
           M   PR+R +  LN    S+ QLK+ HAQLIT+G  K P+ YAKLI      S P++   
Sbjct: 1   MHHLPRVRALFLLNLKLKSTHQLKRTHAQLITSGLLKSPTLYAKLIQQYGALSDPQSTNL 60

Query: 61  YAQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARA 120
           YA  +F+H    P+LFL NTLIRC  P+ SI  FA WVS     FDD T+ FVLGACAR 
Sbjct: 61  YAHFVFKHFD-EPNLFLLNTLIRCTQPKDSILVFANWVSKATLIFDDFTYKFVLGACARL 120

Query: 121 PSLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWN 180
           PS+STL++G QIH  I+K  +VSNI VQTT++HFYA NKD   AR+VFDEM V+NSVTWN
Sbjct: 121 PSVSTLLVGSQIHARIIKHDVVSNILVQTTLVHFYASNKDFVSARRVFDEMAVKNSVTWN 180

Query: 181 AMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETG 240
           AMI GYCSQ     +  ARD+L LFR ML +     VKPTDTTMVC+LSAASQLGV ETG
Sbjct: 181 AMITGYCSQ-----RESARDALVLFRDMLDDVCG--VKPTDTTMVCVLSAASQLGVLETG 240

Query: 241 SCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLA 300
           +CVH YIEK I  P ND+FIGTGLV MYSKCGC++ A S+FK+MK++N+LTWTAMATGLA
Sbjct: 241 ACVHGYIEKAIWVPHNDVFIGTGLVGMYSKCGCVDGALSIFKRMKEKNILTWTAMATGLA 300

Query: 301 VHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQM 360
           +HG+G EAL LLD M  +G+KPNAVTFTSLLSACCH GL+EEGLHLFH+M+  F V+PQM
Sbjct: 301 IHGKGNEALVLLDVMEAYGIKPNAVTFTSLLSACCHSGLVEEGLHLFHMMKSNFDVMPQM 360

Query: 361 QHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVE 420
           QHYGCIVD+L R G+L+EAY+ ++GMPVEPD VLWRSLLS+C VHGDV MGE+VGK L+ 
Sbjct: 361 QHYGCIVDMLSRRGYLKEAYEFVVGMPVEPDAVLWRSLLSACKVHGDVAMGEKVGKKLLH 420

Query: 421 RQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 480
            Q  ++  D   + SED+VALSN+YASAERW DVE VR+EMK+KGIENKAGCSS+QT+ +
Sbjct: 421 IQSAQTCAD-LTLKSEDYVALSNIYASAERWEDVEMVRQEMKVKGIENKAGCSSIQTSSN 477

Query: 481 QGLEVL 485
               VL
Sbjct: 481 ISNHVL 477

BLAST of Cla011294 vs. TrEMBL
Match: D7TUI5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0030g02210 PE=4 SV=1)

HSP 1 Score: 590.9 bits (1522), Expect = 1.4e-165
Identity = 309/459 (67.32%), Positives = 347/459 (75.60%), Query Frame = 1

Query: 19  SSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYAQLIFRHHQYPPSLFLFNT 78
           S   +KQ+HA LITN    P   AKLI H C  SSP    YA   F H +  P+LFLFNT
Sbjct: 24  SIHHIKQLHAHLITNAVSSPPLLAKLIHHYCAFSSPH---YAYTFFIHLR-SPNLFLFNT 83

Query: 79  LIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLSTLMIGRQIHTHILKRG 138
           LI+C PP  SI  FA WVS     FDD T+IF LGACAR+PSL     GRQIH  ILK+G
Sbjct: 84  LIKCLPPSSSILVFADWVSREALVFDDFTYIFALGACARSPSLWE---GRQIHARILKQG 143

Query: 139 IVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAGYCSQSGKVAQRYARD 198
           + SN+ VQTT IHFYA N DV +AR VFDEM  R+SVTWNAMI GYCSQ GKV   YARD
Sbjct: 144 VWSNVLVQTTAIHFYANNNDVALARLVFDEMRKRSSVTWNAMITGYCSQRGKVVC-YARD 203

Query: 199 SLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHAYIEKTIDSPENDLFI 258
           +L LFR MLV++    VKPTDTTMVC+LSAASQLGV ETG  VH YIEKT+ +P ND+F+
Sbjct: 204 ALVLFRAMLVDACG--VKPTDTTMVCVLSAASQLGVLETGVGVHGYIEKTVLAPANDVFV 263

Query: 259 GTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRGKEALELLDAMGTHGV 318
           GTGLV+MYSKCGC+ SA  +F  MK+RNVLTWTAM TGLA HGRGKEALELLD M  +GV
Sbjct: 264 GTGLVDMYSKCGCLGSALCIFWGMKERNVLTWTAMITGLARHGRGKEALELLDEMVAYGV 323

Query: 319 KPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGCIVDLLGRSGHLREAY 378
           KPNAVTFTSL SACCH GL+EEGL LFH M  KFGV P +QHYGCIVDLLGR+GHL+EAY
Sbjct: 324 KPNAVTFTSLFSACCHAGLVEEGLQLFHSMRSKFGVTPGIQHYGCIVDLLGRAGHLKEAY 383

Query: 379 DLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGESSDDEWCVGSEDFVA 438
           D + GMPVEPD +LWRSLLS+C VH DV MGE VGKLL++ Q  +S  D     SEDF+A
Sbjct: 384 DFVRGMPVEPDAILWRSLLSACKVHRDVVMGEEVGKLLLQLQPQQSFAD-LVAASEDFIA 443

Query: 439 LSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTG 478
           LSNVYASAERW DVE VRE MK+KGIE K GCSSVQT+G
Sbjct: 444 LSNVYASAERWEDVETVREAMKVKGIETKPGCSSVQTSG 471

BLAST of Cla011294 vs. TrEMBL
Match: W9S8I0_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_004418 PE=4 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 2.4e-165
Identity = 302/472 (63.98%), Positives = 363/472 (76.91%), Query Frame = 1

Query: 5   PRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEA-IAYAQLI 64
           PR R I  L+    S  QLKQIH+QLI NG   PS  AK+I      S+P     YA L+
Sbjct: 5   PRFRAIALLDLKLKSICQLKQIHSQLIVNGLNSPSLVAKVIQQYFTSSNPHNNYHYAHLV 64

Query: 65  FRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPSLST 124
           F+H    P++FL NTLIRC+ P+ +I  F+ WVS    +FDD T+IFVLGACAR+PS+ +
Sbjct: 65  FKHFD-KPNVFLLNTLIRCSQPKEAILVFSNWVSRGSLDFDDFTYIFVLGACARSPSVPS 124

Query: 125 LMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAMIAG 184
           +  G QIH  I++ GIVSNI VQTT+IHFYA NKD+  AR+VFDEM VRNSVTWNAMI G
Sbjct: 125 IWTGSQIHARIMRHGIVSNIMVQTTLIHFYASNKDIDSARRVFDEMLVRNSVTWNAMITG 184

Query: 185 YCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSCVHA 244
           YCSQ G      A D+L LFR ML +   +  KPTDTT+VC+LSAASQLGV ETG+CVH 
Sbjct: 185 YCSQKGS-----ACDALLLFRDMLDDVCGA--KPTDTTIVCILSAASQLGVLETGACVHG 244

Query: 245 YIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVHGRG 304
           Y++KTI  PE+D+FIGTGLV+MYSKCGC+NSA ++F +MK++N+LTWTAMATGLA+HG+G
Sbjct: 245 YMQKTICVPEDDVFIGTGLVDMYSKCGCLNSALAIFTRMKEKNILTWTAMATGLAIHGKG 304

Query: 305 KEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQHYGC 364
           KEAL L DAMG +G+KPNAVTFTSLL ACCH GL+EEGLHLFH M  KF VVPQMQHY C
Sbjct: 305 KEALVLFDAMGAYGIKPNAVTFTSLLLACCHAGLVEEGLHLFHSMS-KFNVVPQMQHYSC 364

Query: 365 IVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQGGE 424
           IVDLLGR+G L+EAY+ I GMPVEPD +LWRSLLS+  +HGDV MGE+VGKLL+ RQ   
Sbjct: 365 IVDLLGRTGLLKEAYEFIKGMPVEPDAILWRSLLSASKIHGDVTMGEKVGKLLLHRQPEP 424

Query: 425 SSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQT 476
           S D    V SED++ALSN+YASA +W +VE VREEMK+K IENKAGCSS+QT
Sbjct: 425 SLD----VTSEDYIALSNIYASAGKWENVEMVREEMKVKRIENKAGCSSLQT 463

BLAST of Cla011294 vs. TrEMBL
Match: A0A061F154_THECC (Mitochondrial editing factor 20 OS=Theobroma cacao GN=TCM_026348 PE=4 SV=1)

HSP 1 Score: 568.5 bits (1464), Expect = 7.6e-159
Identity = 296/476 (62.18%), Positives = 355/476 (74.58%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLN-NIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAY 60
           M   PR  C+  LN    I S  +KQIHAQLI NG K PS  AKLI + C   SP+   Y
Sbjct: 1   MHSLPRFSCLPLLNLKSPIPSHHIKQIHAQLIINGLKEPSFLAKLIENYCFSPSPQNTKY 60

Query: 61  AQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAP 120
           AQL+ +      SLFLFNTL+RC+ P+ SI TFA WVS  H  FDD TFIFVLGACAR+ 
Sbjct: 61  AQLVNKQFD-TQSLFLFNTLLRCSQPKVSIITFANWVSKGHLVFDDFTFIFVLGACARSH 120

Query: 121 SLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNA 180
           SLSTL +GRQIH   LK G++SN+ V+TT+IHFYA NKD+  AR+VFDEM  R+SVTWNA
Sbjct: 121 SLSTLWLGRQIHVKALKFGVMSNLLVETTLIHFYAKNKDILSARRVFDEMTERSSVTWNA 180

Query: 181 MIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGS 240
           +I GYCSQ  + A+   R++L LFR ML +   S VKPTDTTMVC+LSA SQLG   +G+
Sbjct: 181 IIKGYCSQKER-AKECCREALVLFRDMLNDV--SGVKPTDTTMVCVLSACSQLGELYSGA 240

Query: 241 CVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAV 300
           C+H +IEKT   PEND+FIGTG V+MY+KCGCINSA  VF+ M+ +NVLTWTAM TGLAV
Sbjct: 241 CIHGFIEKTFFRPENDVFIGTGFVDMYAKCGCINSALCVFRLMRVKNVLTWTAMGTGLAV 300

Query: 301 HGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 360
           HGRG+EALELLDAM   GVKPN VTFTSL SACCH GL+E+GLHLFH M  +F + PQ+Q
Sbjct: 301 HGRGEEALELLDAMEGSGVKPNPVTFTSLFSACCHAGLVEQGLHLFHSMGSRFCLKPQIQ 360

Query: 361 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 420
           HYGCIVDLLGR+GHL EAYD I+ MP++PD +LWRSLLS+C VHGDV M E+VGK+L+  
Sbjct: 361 HYGCIVDLLGRAGHLNEAYDFIIEMPMKPDAILWRSLLSACNVHGDVVMAEKVGKILLRL 420

Query: 421 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQT 476
           +   S  D     SED+VALSNVYASA RW  VE VR++MK+K +E + G SS+QT
Sbjct: 421 KPPNSYVD-MATTSEDYVALSNVYASAGRWQQVEMVRKKMKLKRVETEPGGSSIQT 471

BLAST of Cla011294 vs. NCBI nr
Match: gi|778662661|ref|XP_011659935.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis sativus])

HSP 1 Score: 877.9 bits (2267), Expect = 8.4e-252
Identity = 426/481 (88.57%), Positives = 453/481 (94.18%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRL CIHHL+N+RISS QL QIHAQLITNGFK PSPYAKLI H CKKSS E+IA+A
Sbjct: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIFRHHQY P+LFLFNTLIRCAPP HSIS FATWVST HFEFDD TFIFVLGACARAPS
Sbjct: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           +STLMIGRQIHTHILKRGIVSNIWVQTTMIHFY+INKDVG ARK+FDEM +RNSVTWNAM
Sbjct: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQ GKV+Q+YARD+LELFRGMLVESTN EVKPTDTTMVC+LSAASQLG+ ETGSC
Sbjct: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYI+KT+DSPE D+FIGTGLVNMYSKCG +NSASSVFKQMKQ+NVLTWT+MATGLAVH
Sbjct: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           GRGKEALELLDAMG HGVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQH
Sbjct: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY LIL MP+EPDGVLWRSLLSSC++HGDV+MGERVGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420

Query: 421 GGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQG 480
           GGES DDEWCVGSEDFVALSNVYAS ERW DVEA+R+EMKIKGIENKAGCSS+QTTGSQG
Sbjct: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480

Query: 481 L 482
           L
Sbjct: 481 L 481

BLAST of Cla011294 vs. NCBI nr
Match: gi|659099715|ref|XP_008450741.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis melo])

HSP 1 Score: 869.4 bits (2245), Expect = 3.0e-249
Identity = 426/481 (88.57%), Positives = 451/481 (93.76%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIAYA 60
           M LAPRL CI+HL+NIRISS QL QIHAQ ITNGFK PSPYAKLI H CKKSS E+IA+A
Sbjct: 9   MHLAPRLSCINHLSNIRISSLQLLQIHAQFITNGFKSPSPYAKLITHLCKKSSSESIAHA 68

Query: 61  QLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAPS 120
            LIFRHHQ+ P+LFLFNTLIRCAPPQ+SIS FA WVSTPHFEFDD TFIFVLGACARAPS
Sbjct: 69  HLIFRHHQHSPNLFLFNTLIRCAPPQYSISIFANWVSTPHFEFDDFTFIFVLGACARAPS 128

Query: 121 LSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNAM 180
           +STLMIGRQIHTHILKRGIVSNIW QTTMIHFY+ NKDVG ARKVFDEM VRNSVTWNAM
Sbjct: 129 VSTLMIGRQIHTHILKRGIVSNIWAQTTMIHFYSTNKDVGSARKVFDEMSVRNSVTWNAM 188

Query: 181 IAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGSC 240
           IAGYCSQSGKV+Q+YARD+LELFRGMLVESTN EVKPTDTTMVC+LSAAS LG+ ETG C
Sbjct: 189 IAGYCSQSGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASHLGMLETGVC 248

Query: 241 VHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAVH 300
           VHAYI+KTIDSPE D+FIGTGLVNMYSKCG ++SASSVFKQMKQRNVLTWT+MATGLAVH
Sbjct: 249 VHAYIKKTIDSPEKDVFIGTGLVNMYSKCGLLSSASSVFKQMKQRNVLTWTSMATGLAVH 308

Query: 301 GRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQH 360
           GRGKEALELLDAMG HGVKPNAVTFTSLLSACCHGGLIEEGLHLF VMERKFGVVPQMQH
Sbjct: 309 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 368

Query: 361 YGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY+LIL MP+EPDGVLWRSLLSSC++HGDV+MGERVGKLLVERQ
Sbjct: 369 YGCIVDLLGRSGHLREAYELILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 428

Query: 421 GGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQG 480
           GGES DDEWCVGSEDFVALSNVYASAERW DVEA+REEMKIKGIENKAG SSVQTTGSQG
Sbjct: 429 GGESFDDEWCVGSEDFVALSNVYASAERWDDVEALREEMKIKGIENKAGFSSVQTTGSQG 488

Query: 481 L 482
           L
Sbjct: 489 L 489

BLAST of Cla011294 vs. NCBI nr
Match: gi|1009117454|ref|XP_015875328.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Ziziphus jujuba])

HSP 1 Score: 610.5 bits (1573), Expect = 2.5e-171
Identity = 310/480 (64.58%), Positives = 370/480 (77.08%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAIA-Y 60
           M   PRLR +  L+    S+ Q+KQIHAQLI NG   PS  AKLI   C    P++I  Y
Sbjct: 1   MHSLPRLRALALLHQKLTSNSQVKQIHAQLIINGLNSPSILAKLIQQYCTLPDPQSIQNY 60

Query: 61  AQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAP 120
           A  +F+H    P+LFLFNTLIRC+ P+ SI  FA WVS     FD  T+IFVLGACAR+P
Sbjct: 61  AYSVFKHFD-KPNLFLFNTLIRCSQPKESILVFADWVSRGDLVFDHFTYIFVLGACARSP 120

Query: 121 SLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNA 180
           S+ TL +GRQ H  +LKRG +SNI +QTT IHFYA NKDV  AR +FDEM VRNS+TWN 
Sbjct: 121 SVPTLWVGRQTHAQMLKRGTMSNILLQTTAIHFYASNKDVSSARGMFDEMIVRNSITWNV 180

Query: 181 MIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGS 240
           MI GYCSQ     +  A ++L LFRGML +     VKPTDTTMVC+LSAASQLGV ETG+
Sbjct: 181 MIKGYCSQ-----REIASEALVLFRGMLEDDCG--VKPTDTTMVCILSAASQLGVLETGA 240

Query: 241 CVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAV 300
           CVH YIEKTI  PEND+FIGTGL++MYSKCGC++SA ++F +M+++NVLTWTAMATGLA+
Sbjct: 241 CVHGYIEKTIPFPENDVFIGTGLIDMYSKCGCLDSALTIFMRMEEKNVLTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 360
           HG+GKEAL+L D M  +GVKPN+VTFTSLLSACCHGGL+EEGLHLFHVM R+F V P MQ
Sbjct: 301 HGKGKEALKLFDEMDAYGVKPNSVTFTSLLSACCHGGLVEEGLHLFHVM-REFNVTPSMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 420
           HYGCIVD+LGRS  LREAY+ I GMPV+PD +LWRSLLS+C VHGDV MGE+VGKLL++ 
Sbjct: 361 HYGCIVDILGRSALLREAYEFIKGMPVKPDAILWRSLLSACKVHGDVTMGEKVGKLLLQL 420

Query: 421 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGSQ 480
           Q  +SS D      ED+VALSN+YASAERW DV+ +REEMK+KGIENKAG SS+QT  +Q
Sbjct: 421 QPEQSSVDA-SPNGEDYVALSNIYASAERWEDVQMIREEMKVKGIENKAGSSSIQTISNQ 470

BLAST of Cla011294 vs. NCBI nr
Match: gi|694375800|ref|XP_009364498.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Pyrus x bretschneideri])

HSP 1 Score: 604.4 bits (1557), Expect = 1.8e-169
Identity = 311/477 (65.20%), Positives = 369/477 (77.36%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGFKFPSPYAKLIAHSCKKSSPEAI-AY 60
           M   PR R +  LN    S+ QLK+IHAQLITNG K PS +AKLI      SSP++   Y
Sbjct: 1   MHHLPRARALFLLNLNLKSTHQLKKIHAQLITNGLKSPSLHAKLIQQYSAFSSPKSTNLY 60

Query: 61  AQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARAP 120
           A L+F+H Q  P+LFL NTLIRC  P+ S+  F+ WVS     FDD T+IFVLGACAR P
Sbjct: 61  AHLVFQHFQ-EPNLFLLNTLIRCFRPRDSLLVFSNWVSKSTLVFDDSTYIFVLGACARLP 120

Query: 121 SLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWNA 180
           S+ TL++GRQIH  I+K  +VSNI VQTT+IHFY  N DV  AR VFDEM VRN VTWNA
Sbjct: 121 SVPTLLVGRQIHGRIVKHDVVSNISVQTTLIHFYGSNDDVVSARNVFDEMPVRNCVTWNA 180

Query: 181 MIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETGS 240
           MI GYCSQ   V     RD+L LFR ML +     VKPTDTTMVC+LSAASQLGV ETG+
Sbjct: 181 MITGYCSQRESV-----RDALVLFRDMLDDVCG--VKPTDTTMVCVLSAASQLGVLETGA 240

Query: 241 CVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLAV 300
           CVH Y+EKT+  P+ND+F+GTGLV+MYSKCGC++SA SVFK+MK++NVLTWTAMATGLA+
Sbjct: 241 CVHGYVEKTMCVPDNDVFMGTGLVDMYSKCGCVDSAFSVFKRMKEKNVLTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQMQ 360
           HG+G EALELLD M   G+KPNAVTFT LLSACCH GL+EEGLHLFH+M+ KF VVP+MQ
Sbjct: 301 HGKGNEALELLDVMQGSGIKPNAVTFTGLLSACCHSGLVEEGLHLFHLMKSKFDVVPRMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVER 420
           HYGCIVD+L R GHL+EAY+ I+GMPVEPD VLWRSLLS+C +HGDV MGE+VGK L+  
Sbjct: 361 HYGCIVDMLSRGGHLKEAYEFIVGMPVEPDAVLWRSLLSACNLHGDVAMGEKVGKKLLRV 420

Query: 421 QGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTT 477
           Q  +S  D   + SED+VALSN+YA AERW D+E VR+EMK+KGIENKAG SS+QT+
Sbjct: 421 QSTQSYADS-TLKSEDYVALSNLYAFAERWEDLEMVRQEMKVKGIENKAGWSSLQTS 468

BLAST of Cla011294 vs. NCBI nr
Match: gi|595834005|ref|XP_007206779.1| (hypothetical protein PRUPE_ppa025445mg [Prunus persica])

HSP 1 Score: 604.0 bits (1556), Expect = 2.3e-169
Identity = 308/486 (63.37%), Positives = 370/486 (76.13%), Query Frame = 1

Query: 1   MRLAPRLRCIHHLNNIRISSRQLKQIHAQLITNGF-KFPSPYAKLIAHSCKKSSPEAI-A 60
           M   PR+R +  LN    S+ QLK+ HAQLIT+G  K P+ YAKLI      S P++   
Sbjct: 1   MHHLPRVRALFLLNLKLKSTHQLKRTHAQLITSGLLKSPTLYAKLIQQYGALSDPQSTNL 60

Query: 61  YAQLIFRHHQYPPSLFLFNTLIRCAPPQHSISTFATWVSTPHFEFDDLTFIFVLGACARA 120
           YA  +F+H    P+LFL NTLIRC  P+ SI  FA WVS     FDD T+ FVLGACAR 
Sbjct: 61  YAHFVFKHFD-EPNLFLLNTLIRCTQPKDSILVFANWVSKATLIFDDFTYKFVLGACARL 120

Query: 121 PSLSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYAINKDVGIARKVFDEMCVRNSVTWN 180
           PS+STL++G QIH  I+K  +VSNI VQTT++HFYA NKD   AR+VFDEM V+NSVTWN
Sbjct: 121 PSVSTLLVGSQIHARIIKHDVVSNILVQTTLVHFYASNKDFVSARRVFDEMAVKNSVTWN 180

Query: 181 AMIAGYCSQSGKVAQRYARDSLELFRGMLVESTNSEVKPTDTTMVCLLSAASQLGVHETG 240
           AMI GYCSQ     +  ARD+L LFR ML +     VKPTDTTMVC+LSAASQLGV ETG
Sbjct: 181 AMITGYCSQ-----RESARDALVLFRDMLDDVCG--VKPTDTTMVCVLSAASQLGVLETG 240

Query: 241 SCVHAYIEKTIDSPENDLFIGTGLVNMYSKCGCINSASSVFKQMKQRNVLTWTAMATGLA 300
           +CVH YIEK I  P ND+FIGTGLV MYSKCGC++ A S+FK+MK++N+LTWTAMATGLA
Sbjct: 241 ACVHGYIEKAIWVPHNDVFIGTGLVGMYSKCGCVDGALSIFKRMKEKNILTWTAMATGLA 300

Query: 301 VHGRGKEALELLDAMGTHGVKPNAVTFTSLLSACCHGGLIEEGLHLFHVMERKFGVVPQM 360
           +HG+G EAL LLD M  +G+KPNAVTFTSLLSACCH GL+EEGLHLFH+M+  F V+PQM
Sbjct: 301 IHGKGNEALVLLDVMEAYGIKPNAVTFTSLLSACCHSGLVEEGLHLFHMMKSNFDVMPQM 360

Query: 361 QHYGCIVDLLGRSGHLREAYDLILGMPVEPDGVLWRSLLSSCLVHGDVQMGERVGKLLVE 420
           QHYGCIVD+L R G+L+EAY+ ++GMPVEPD VLWRSLLS+C VHGDV MGE+VGK L+ 
Sbjct: 361 QHYGCIVDMLSRRGYLKEAYEFVVGMPVEPDAVLWRSLLSACKVHGDVAMGEKVGKKLLH 420

Query: 421 RQGGESSDDEWCVGSEDFVALSNVYASAERWGDVEAVREEMKIKGIENKAGCSSVQTTGS 480
            Q  ++  D   + SED+VALSN+YASAERW DVE VR+EMK+KGIENKAGCSS+QT+ +
Sbjct: 421 IQSAQTCAD-LTLKSEDYVALSNIYASAERWEDVEMVRQEMKVKGIENKAGCSSIQTSSN 477

Query: 481 QGLEVL 485
               VL
Sbjct: 481 ISNHVL 477
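
The percentages in each HSP statistics line follow from the residue counts (for example, 310/480 = 64.58%). As a minimal sketch, assuming the statistics line keeps the format shown in the alignments above, the Python below parses one such line and recomputes both percentages. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of BLAST or of this database.

```python
import re

# Hypothetical helper for illustration only.
# The optional "i" tolerates the "Postives" misspelling found in some page dumps.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse an HSP statistics line and recompute its percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed on the page, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 310/480 (64.58%), Positives = 370/480 (77.08%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a cheap consistency check when alignment statistics are copied between files.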

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PP243_ARATH  3.4e-120  51.64         Pentatricopeptide repeat-containing protein At3g18970 OS=Arabidopsis thaliana GN...
PP330_ARATH  3.8e-71   33.89         Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PPR85_ARATH  7.2e-70   36.03         Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri...
PP415_ARATH  3.0e-68   35.01         Pentatricopeptide repeat-containing protein At5g43790 OS=Arabidopsis thaliana GN...
PP267_ARATH  5.7e-67   34.66         Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity (%)  Description
A0A0A0LZ63_CUCSA  5.8e-252  88.57         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G573660 PE=4 SV=1
M5WFE9_PRUPE      1.6e-169  63.37         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa025445mg PE=4 SV=1
D7TUI5_VITVI      1.4e-165  67.32         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_14s0030g02210 PE=4 SV=...
W9S8I0_9ROSA      2.4e-165  63.98         Uncharacterized protein OS=Morus notabilis GN=L484_004418 PE=4 SV=1
A0A061F154_THECC  7.6e-159  62.18         Mitochondrial editing factor 20 OS=Theobroma cacao GN=TCM_026348 PE=4 SV=1

Match Name                         E-value   Identity (%)  Description
gi|778662661|ref|XP_011659935.1|   8.4e-252  88.57         PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis sativu...
gi|659099715|ref|XP_008450741.1|   3.0e-249  88.57         PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis melo]
gi|1009117454|ref|XP_015875328.1|  2.5e-171  64.58         PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Ziziphus jujub...
gi|694375800|ref|XP_009364498.1|   1.8e-169  65.20         PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Pyrus x bretsc...
gi|595834005|ref|XP_007206779.1|   2.3e-169  63.37         hypothetical protein PRUPE_ppa025445mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008152      metabolic process
biological_process  GO:0008150      biological_process
biological_process  GO:0080156      mitochondrial mRNA modification
biological_process  GO:0090305      nucleic acid phosphodiester bond hydrolysis
biological_process  GO:0009451      RNA modification
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005739      mitochondrion
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0004519      endonuclease activity
molecular_function  GO:0003723      RNA binding
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla011294     Cla011294.1  mRNA


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description           Source    Source Term      Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                 coord: 175..186, score: 0.12; coord: 147..172, score: 0
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2               coord: 286..334, score: 1.9
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756           coord: 323..356, score: 1.6E-6; coord: 289..322, score: 2.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                 coord: 321..356, score: 10.128; coord: 357..387, score: 5.766; coord: 389..419, score: 5.941; coord: 255..285, score: 7.552; coord: 432..466, score: 6.719; coord: 142..172, score: 6.204; coord: 286..320, score: 10.841; coord: 173..210, score:
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED    coord: 21..473, score: 3.4E
None       No IPR available          PANTHER   PTHR24015:SF873  SUBFAMILY NOT NAMED coord: 21..473, score: 3.4E