Cla009208 (gene) Watermelon (97103) v1

Name: Cla009208
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat-containing protein (AHRD V1 ***- D7KHY5_ARALL); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr6 : 4530977 .. 4532776 (+)
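The location field can be turned into a feature length with a few lines of Python. This is only a sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr6 : 4530977 .. 4532776 (+)'."""
    m = re.search(r"(\w+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("Chr6 : 4530977 .. 4532776 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr6 + 1800
```

Under that assumption the feature spans 1800 bp, which is a multiple of three.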
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGCTCTCTGTGTGTCTATCCTGAGTCCATCAATCTGTCCGACCTCCTTCAGGGCCGCATGAACAATTCCCATCTTCGTCAAATCCACGCCCGAGTCTTTCGTCTGCTGAAGCATCAAGACAATCTAATTGCAACTCGACTTATTGGCCACTACCCACCTTCTGTTGGACTCAGAGTTTTCAATCAACTCATTCGCCCCAACATATTTCCTTGCAATGCCATTATCAGAGTACTTGCTGAACACAACTCTTCCTTCCTTGCCTTTTCCATCTTCAAATCTTTGAAGCACTTTTCACTTTCCCCTAATGACTTCACTTTTTCTTTCCTTCTCAAAGCGTTTCACCGTTCCAGCCATGCTCTAAATTTGAAACAAGTTCATACCCATGTCCTTAAAATGGGTTATTTGGGTGATTCTTTTATCTCCAATGCGCTTCTTGCAGTCTACGCGAGAGGCTTGAAGGATATGGCTTCTGCACATAAGGTGTTCGATGAAATGTCGGATAGAGATATGGTTTGTTGTTGGACTTCTTTGATTGCTGGCTATGCCCAGATGGGTCTTGCTGAAAAGGCTCTGCTGCTTTTTGTGATGATGATCAAAGAGAATATGCAGCCGGAGGATGACACCATGGTCAGTGTTCTGTCTGCTTGTTCCAAGTTTCATATTGCTGAAATTGAGAAATGGGTTGTAGAATTAAGACAACTGGTTAATAAATTTGATTCCAAGAGCTCTTGTTGTGATTCAATCAATATTGTTCTTATTTATCTATATGGGAAGTGGGGGATGATTGAGAAGAGTGAAGAAAAGTTTAATGAAATTGTTGATAAGAGAAGTTTGCTTGTTTGGAATTCAATGATAAATGCATATTTTCAGAATGGTTTCCCTGTTGAAGCCTTGGCCCTTTTCCGTCTAATGGTTGAAAATTCCCATTGCAAACCTAACCACGTTACAATGGTTACTGTCCTTTCAGCTTGTGCTCAAATAGGAGATTTGCAGCTCGGTTGTTGGGTTCATGAAGTTCTCAAATGTGGTGGCCGTAGAGGTATCATTGCATCAAACAAAATGTTAGCCACTGCATTGATTGATATGTATTGTAAAAGTGGTAGCTTGGAGAGGGCAAAAGAAGTTTTTCATCAACTAATCAACAAAGATGTAATCTCCTTCAATGCCATGATCATGGGCCTTGCAGTTAACGGCAAAGGCGATGAGGCATTGAAGCTTTTTGCCCAAATGCAAGAGGTTGATATAAGGCCAACGACCGGAACATTTATTGGTTTATTATCTGCTTGTAGCCATTCGGGCTTTCTAGAACAAGGGCGTCAAATTTTTATTGAAATGACTACCCGCTATTTAATATTGCCTAGTTTAGAACACTATGCTTGTTACATTGATCTCCTTGCTCGGGCAGGCCACTTCGAAGATGCTCTTGAAGTTGTTTCAACCATGCCTTTTGAACCTAATAATTTTGTTTGGAGTTCTCTGCTGAGAGGCTGCCTACTCCATTCAAAATTTGAGTTGGCACAATATGTTTCCAAAAAGCTTGTTGAAGTGGATCCTGAAAACTCTGCTGGCTATGTAATGCAGGCTAATTCATTTGCTACAGATCTTCAATGGGATGATGTCTCGGCTTTGAGATGGTTTATGAGAGAAAAGGGTATCCGTAAGCAGCCAGGCCAGAGTTGGATCAGTATAGATGGAATTGTACATGAATTTTTCTCAGCAACCAAATCACATCCTTATGTTGATCTATTATACACGTTGAATGTGCTTGAAAAACAAATGAAGCTAGTAATCCCATAG

mRNA sequence

ATGCGCTCTCTGTGTGTCTATCCTGAGTCCATCAATCTGTCCGACCTCCTTCAGGGCCGCATGAACAATTCCCATCTTCGTCAAATCCACGCCCGAGTCTTTCGTCTGCTGAAGCATCAAGACAATCTAATTGCAACTCGACTTATTGGCCACTACCCACCTTCTGTTGGACTCAGAGTTTTCAATCAACTCATTCGCCCCAACATATTTCCTTGCAATGCCATTATCAGAGTACTTGCTGAACACAACTCTTCCTTCCTTGCCTTTTCCATCTTCAAATCTTTGAAGCACTTTTCACTTTCCCCTAATGACTTCACTTTTTCTTTCCTTCTCAAAGCGTTTCACCGTTCCAGCCATGCTCTAAATTTGAAACAAGTTCATACCCATGTCCTTAAAATGGGTTATTTGGGTGATTCTTTTATCTCCAATGCGCTTCTTGCAGTCTACGCGAGAGGCTTGAAGGATATGGCTTCTGCACATAAGGTGTTCGATGAAATGTCGGATAGAGATATGGTTTGTTGTTGGACTTCTTTGATTGCTGGCTATGCCCAGATGGGTCTTGCTGAAAAGGCTCTGCTGCTTTTTGTGATGATGATCAAAGAGAATATGCAGCCGGAGGATGACACCATGGTCAGTGTTCTGTCTGCTTGTTCCAAGTTTCATATTGCTGAAATTGAGAAATGGGTTGTAGAATTAAGACAACTGGTTAATAAATTTGATTCCAAGAGCTCTTGTTGTGATTCAATCAATATTGTTCTTATTTATCTATATGGGAAGTGGGGGATGATTGAGAAGAGTGAAGAAAAGTTTAATGAAATTGTTGATAAGAGAAGTTTGCTTGTTTGGAATTCAATGATAAATGCATATTTTCAGAATGGTTTCCCTGTTGAAGCCTTGGCCCTTTTCCGTCTAATGGTTGAAAATTCCCATTGCAAACCTAACCACGTTACAATGGTTACTGTCCTTTCAGCTTGTGCTCAAATAGGAGATTTGCAGCTCGGTTGTTGGGTTCATGAAGTTCTCAAATGTGGTGGCCGTAGAGGTATCATTGCATCAAACAAAATGTTAGCCACTGCATTGATTGATATGTATTGTAAAAGTGGTAGCTTGGAGAGGGCAAAAGAAGTTTTTCATCAACTAATCAACAAAGATGTAATCTCCTTCAATGCCATGATCATGGGCCTTGCAGTTAACGGCAAAGGCGATGAGGCATTGAAGCTTTTTGCCCAAATGCAAGAGGTTGATATAAGGCCAACGACCGGAACATTTATTGGTTTATTATCTGCTTGTAGCCATTCGGGCTTTCTAGAACAAGGGCGTCAAATTTTTATTGAAATGACTACCCGCTATTTAATATTGCCTAGTTTAGAACACTATGCTTGTTACATTGATCTCCTTGCTCGGGCAGGCCACTTCGAAGATGCTCTTGAAGTTGTTTCAACCATGCCTTTTGAACCTAATAATTTTGTTTGGAGTTCTCTGCTGAGAGGCTGCCTACTCCATTCAAAATTTGAGTTGGCACAATATGTTTCCAAAAAGCTTGTTGAAGTGGATCCTGAAAACTCTGCTGGCTATGTAATGCAGGCTAATTCATTTGCTACAGATCTTCAATGGGATGATGTCTCGGCTTTGAGATGGTTTATGAGAGAAAAGGGTATCCGTAAGCAGCCAGGCCAGAGTTGGATCAGTATAGATGGAATTGTACATGAATTTTTCTCAGCAACCAAATCACATCCTTATGTTGATCTATTATACACGTTGAATGTGCTTGAAAAACAAATGAAGCTAGTAATCCCATAG

Coding sequence (CDS)

ATGCGCTCTCTGTGTGTCTATCCTGAGTCCATCAATCTGTCCGACCTCCTTCAGGGCCGCATGAACAATTCCCATCTTCGTCAAATCCACGCCCGAGTCTTTCGTCTGCTGAAGCATCAAGACAATCTAATTGCAACTCGACTTATTGGCCACTACCCACCTTCTGTTGGACTCAGAGTTTTCAATCAACTCATTCGCCCCAACATATTTCCTTGCAATGCCATTATCAGAGTACTTGCTGAACACAACTCTTCCTTCCTTGCCTTTTCCATCTTCAAATCTTTGAAGCACTTTTCACTTTCCCCTAATGACTTCACTTTTTCTTTCCTTCTCAAAGCGTTTCACCGTTCCAGCCATGCTCTAAATTTGAAACAAGTTCATACCCATGTCCTTAAAATGGGTTATTTGGGTGATTCTTTTATCTCCAATGCGCTTCTTGCAGTCTACGCGAGAGGCTTGAAGGATATGGCTTCTGCACATAAGGTGTTCGATGAAATGTCGGATAGAGATATGGTTTGTTGTTGGACTTCTTTGATTGCTGGCTATGCCCAGATGGGTCTTGCTGAAAAGGCTCTGCTGCTTTTTGTGATGATGATCAAAGAGAATATGCAGCCGGAGGATGACACCATGGTCAGTGTTCTGTCTGCTTGTTCCAAGTTTCATATTGCTGAAATTGAGAAATGGGTTGTAGAATTAAGACAACTGGTTAATAAATTTGATTCCAAGAGCTCTTGTTGTGATTCAATCAATATTGTTCTTATTTATCTATATGGGAAGTGGGGGATGATTGAGAAGAGTGAAGAAAAGTTTAATGAAATTGTTGATAAGAGAAGTTTGCTTGTTTGGAATTCAATGATAAATGCATATTTTCAGAATGGTTTCCCTGTTGAAGCCTTGGCCCTTTTCCGTCTAATGGTTGAAAATTCCCATTGCAAACCTAACCACGTTACAATGGTTACTGTCCTTTCAGCTTGTGCTCAAATAGGAGATTTGCAGCTCGGTTGTTGGGTTCATGAAGTTCTCAAATGTGGTGGCCGTAGAGGTATCATTGCATCAAACAAAATGTTAGCCACTGCATTGATTGATATGTATTGTAAAAGTGGTAGCTTGGAGAGGGCAAAAGAAGTTTTTCATCAACTAATCAACAAAGATGTAATCTCCTTCAATGCCATGATCATGGGCCTTGCAGTTAACGGCAAAGGCGATGAGGCATTGAAGCTTTTTGCCCAAATGCAAGAGGTTGATATAAGGCCAACGACCGGAACATTTATTGGTTTATTATCTGCTTGTAGCCATTCGGGCTTTCTAGAACAAGGGCGTCAAATTTTTATTGAAATGACTACCCGCTATTTAATATTGCCTAGTTTAGAACACTATGCTTGTTACATTGATCTCCTTGCTCGGGCAGGCCACTTCGAAGATGCTCTTGAAGTTGTTTCAACCATGCCTTTTGAACCTAATAATTTTGTTTGGAGTTCTCTGCTGAGAGGCTGCCTACTCCATTCAAAATTTGAGTTGGCACAATATGTTTCCAAAAAGCTTGTTGAAGTGGATCCTGAAAACTCTGCTGGCTATGTAATGCAGGCTAATTCATTTGCTACAGATCTTCAATGGGATGATGTCTCGGCTTTGAGATGGTTTATGAGAGAAAAGGGTATCCGTAAGCAGCCAGGCCAGAGTTGGATCAGTATAGATGGAATTGTACATGAATTTTTCTCAGCAACCAAATCACATCCTTATGTTGATCTATTATACACGTTGAATGTGCTTGAAAAACAAATGAAGCTAGTAATCCCATAG

Protein sequence

MRSLCVYPESINLSDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRVFNQLIRPNIFPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTHVLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINIVLIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGIRKQPGQSWISIDGIVHEFFSATKSHPYVDLLYTLNVLEKQMKLVIP
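The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. The sketch below assumes the standard code applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, not part of any tool used by the database. Only the first 30 and last 21 nucleotides of the CDS are used as examples.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCGCTCTCTGTGTGTCTATCCTGAGTCC"))  # MRSLCVYPES
print(translate("ATGAAGCTAGTAATCCCATAG"))           # MKLVIP
```

Both fragments agree with the start (MRSLCVYPES) and end (MKLVIP) of the listed protein sequence.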
BLAST of Cla009208 vs. Swiss-Prot
Match: PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 3.4e-93
Identity = 201/585 (34.36%), Positives = 319/585 (54.53%), Query Frame = 1

Query: 24  SHLRQIHARVFRLLKHQD----NLIATRLIGHYPPSVGLRVFNQLIRPNIFPCNAIIRVL 83
           +HL+QIH  +     H D    NL+  R +          +F+    PNIF  N++I   
Sbjct: 27  NHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTKYSYLLFSHTQFPNIFLYNSLINGF 86

Query: 84  AEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTHVLKMGYLGDS 143
             ++       +F S++   L  + FTF  +LKA  R+S       +H+ V+K G+  D 
Sbjct: 87  VNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKACTRASSRKLGIDLHSLVVKCGFNHDV 146

Query: 144 FISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKALLLFVMMI 203
               +LL++Y+ G   +  AHK+FDE+ DR +V  WT+L +GY   G   +A+ LF  M+
Sbjct: 147 AAMTSLLSIYS-GSGRLNDAHKLFDEIPDRSVVT-WTALFSGYTTSGRHREAIDLFKKMV 206

Query: 204 KENMQPEDDTMVSVLSACSKFHIAEIE--KWVV----ELRQLVNKFDSKSSCCDSINIVL 263
           +  ++P+   +V VLSAC   H+ +++  +W+V    E+    N F         +   L
Sbjct: 207 EMGVKPDSYFIVQVLSAC--VHVGDLDSGEWIVKYMEEMEMQKNSF---------VRTTL 266

Query: 264 IYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCKP 323
           + LY K G +EK+   F+ +V+K  ++ W++MI  Y  N FP E + LF  M++ +  KP
Sbjct: 267 VNLYAKCGKMEKARSVFDSMVEK-DIVTWSTMIQGYASNSFPKEGIELFLQMLQEN-LKP 326

Query: 324 NHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLERA 383
           +  ++V  LS+CA +G L LG W   ++     R    +N  +A ALIDMY K G++ R 
Sbjct: 327 DQFSIVGFLSSCASLGALDLGEWGISLID----RHEFLTNLFMANALIDMYAKCGAMARG 386

Query: 384 KEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSHS 443
            EVF ++  KD++  NA I GLA NG    +  +F Q +++ I P   TF+GLL  C H+
Sbjct: 387 FEVFKEMKEKDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHA 446

Query: 444 GFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWSS 503
           G ++ G + F  ++  Y +  ++EHY C +DL  RAG  +DA  ++  MP  PN  VW +
Sbjct: 447 GLIQDGLRFFNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRPNAIVWGA 506

Query: 504 LLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGI 563
           LL GC L    +LA+ V K+L+ ++P N+  YV  +N ++   +WD+ + +R  M +KG+
Sbjct: 507 LLSGCRLVKDTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGM 566

Query: 564 RKQPGQSWISIDGIVHEFFSATKSHPYVDLLYT-LNVLEKQMKLV 598
           +K PG SWI ++G VHEF +  KSHP  D +Y  L  L  +M+L+
Sbjct: 567 KKIPGYSWIELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLM 592
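The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. A minimal sketch for recomputing them from such a line follows; the regular expression and the name `hsp_percentages` are assumptions made for illustration, and the pattern tolerates both the "Positives" and "Postives" spellings.

```python
import re

# Matches lines such as:
# "Identity = 201/585 (34.36%), Positives = 319/585 (54.53%), Query Frame = 1"
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

line = "Identity = 201/585 (34.36%), Positives = 319/585 (54.53%), Query Frame = 1"
print(hsp_percentages(line))  # (34.36, 54.53)
```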

BLAST of Cla009208 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 7.0e-91
Identity = 206/603 (34.16%), Positives = 325/603 (53.90%), Query Frame = 1

Query: 16  LLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPS------VGLRVFNQLIRPNI 75
           L++  ++   L+Q H  + R     D   A++L      S         +VF+++ +PN 
Sbjct: 36  LIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNS 95

Query: 76  FPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLS-PNDFTFSFLLKAFHRSSHALNLKQVHT 135
           F  N +IR  A      L+   F  +   S   PN +TF FL+KA    S     + +H 
Sbjct: 96  FAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHG 155

Query: 136 HVLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLA 195
             +K     D F++N+L+  Y     D+ SA KVF  + ++D+V  W S+I G+ Q G  
Sbjct: 156 MAVKSAVGSDVFVANSLIHCYF-SCGDLDSACKVFTTIKEKDVVS-WNSMINGFVQKGSP 215

Query: 196 EKALLLFVMMIKENMQPEDDTMVSVLSACSKFH--------IAEIEKWVVELR-QLVNKF 255
           +KAL LF  M  E+++    TMV VLSAC+K           + IE+  V +   L N  
Sbjct: 216 DKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAM 275

Query: 256 DSKSSCCDSI--------------NIVLIYLYGKWGMIEKSE---EKFNEIVDKRSLLVW 315
               + C SI              N+    +   + + E  E   E  N +  K  ++ W
Sbjct: 276 LDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQK-DIVAW 335

Query: 316 NSMINAYFQNGFPVEALALFRLMVENSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLK 375
           N++I+AY QNG P EAL +F  +    + K N +T+V+ LSACAQ+G L+LG W+H  +K
Sbjct: 336 NALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIK 395

Query: 376 CGGRRGIIASNKMLATALIDMYCKSGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGD 435
             G    I  N  + +ALI MY K G LE+++EVF+ +  +DV  ++AMI GLA++G G+
Sbjct: 396 KHG----IRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGN 455

Query: 436 EALKLFAQMQEVDIRPTTGTFIGLLSACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACY 495
           EA+ +F +MQE +++P   TF  +  ACSH+G +++   +F +M + Y I+P  +HYAC 
Sbjct: 456 EAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACI 515

Query: 496 IDLLARAGHFEDALEVVSTMPFEPNNFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENS 555
           +D+L R+G+ E A++ +  MP  P+  VW +LL  C +H+   LA+    +L+E++P N 
Sbjct: 516 VDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRND 575

Query: 556 AGYVMQANSFATDLQWDDVSALRWFMREKGIRKQPGQSWISIDGIVHEFFSATKSHPYVD 586
             +V+ +N +A   +W++VS LR  MR  G++K+PG S I IDG++HEF S   +HP  +
Sbjct: 576 GAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSE 631

BLAST of Cla009208 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 3.9e-89
Identity = 202/590 (34.24%), Positives = 331/590 (56.10%), Query Frame = 1

Query: 14  SDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVG-----LRVFNQLIRPN 73
           + L+    + + L+QIHAR+  L       + T+LI H   S G      +VF+ L RP 
Sbjct: 25  ASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLI-HASSSFGDITFARQVFDDLPRPQ 84

Query: 74  IFPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHT 133
           IFP NAIIR  + +N    A  ++ +++   +SP+ FTF  LLKA    SH    + VH 
Sbjct: 85  IFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHA 144

Query: 134 HVLKMGYLGDSFISNALLAVYARGLKDMASAHKVFD--EMSDRDMVCCWTSLIAGYAQMG 193
            V ++G+  D F+ N L+A+YA+  + + SA  VF+   + +R +V  WT++++ YAQ G
Sbjct: 145 QVFRLGFDADVFVQNGLIALYAK-CRRLGSARTVFEGLPLPERTIVS-WTAIVSAYAQNG 204

Query: 194 LAEKALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCC 253
              +AL +F  M K +++P+   +VSVL+A   F   +  K    +   V K   +    
Sbjct: 205 EPMEALEIFSQMRKMDVKPDWVALVSVLNA---FTCLQDLKQGRSIHASVVKMGLEIE-- 264

Query: 254 DSINIVLIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALALFRLMV 313
             + I L  +Y K G +  ++  F+++    +L++WN+MI+ Y +NG+  EA+ +F  M+
Sbjct: 265 PDLLISLNTMYAKCGQVATAKILFDKMKSP-NLILWNAMISGYAKNGYAREAIDMFHEMI 324

Query: 314 ENSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCK 373
            N   +P+ +++ + +SACAQ+G L+    ++E +     R     +  +++ALIDM+ K
Sbjct: 325 -NKDVRPDTISITSAISACAQVGSLEQARSMYEYVG----RSDYRDDVFISSALIDMFAK 384

Query: 374 SGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGL 433
            GS+E A+ VF + +++DV+ ++AMI+G  ++G+  EA+ L+  M+   + P   TF+GL
Sbjct: 385 CGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGL 444

Query: 434 LSACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEP 493
           L AC+HSG + +G   F  M   + I P  +HYAC IDLL RAGH + A EV+  MP +P
Sbjct: 445 LMACNHSGMVREGWWFFNRMAD-HKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQP 504

Query: 494 NNFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRW 553
              VW +LL  C  H   EL +Y +++L  +DP N+  YV  +N +A    WD V+ +R 
Sbjct: 505 GVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRV 564

Query: 554 FMREKGIRKQPGQSWISIDGIVHEFFSATKSHP-YVDLLYTLNVLEKQMK 596
            M+EKG+ K  G SW+ + G +  F    KSHP Y ++   +  +E ++K
Sbjct: 565 RMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLK 599

BLAST of Cla009208 vs. Swiss-Prot
Match: PP261_ARATH (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 330.5 bits (846), Expect = 3.9e-89
Identity = 190/572 (33.22%), Positives = 323/572 (56.47%), Query Frame = 1

Query: 22  NNSHLRQIHARVFRLLKHQDNLIATRLIGHYP----PSVGLRVFNQLIRPNIFPCNAIIR 81
           N + ++Q+HA++ R   H+D  IA +LI         ++ +RVFNQ+  PN+  CN++IR
Sbjct: 31  NLNQVKQLHAQIIRRNLHEDLHIAPKLISALSLCRQTNLAVRVFNQVQEPNVHLCNSLIR 90

Query: 82  VLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTHVLKMGYLG 141
             A+++  + AF +F  ++ F L  ++FT+ FLLKA    S    +K +H H+ K+G   
Sbjct: 91  AHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFLLKACSGQSWLPVVKMMHNHIEKLGLSS 150

Query: 142 DSFISNALLAVYAR-GLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKALLLFV 201
           D ++ NAL+  Y+R G   +  A K+F++MS+RD V  W S++ G  + G    A  LF 
Sbjct: 151 DIYVPNALIDCYSRCGGLGVRDAMKLFEKMSERDTVS-WNSMLGGLVKAGELRDARRLFD 210

Query: 202 MMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINIVLIYL 261
            M + ++   + TM+   + C +   A          +L  K   +++   S    ++  
Sbjct: 211 EMPQRDLISWN-TMLDGYARCREMSKAF---------ELFEKMPERNTVSWS---TMVMG 270

Query: 262 YGKWGMIEKSEEKFNEI-VDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCKPNH 321
           Y K G +E +   F+++ +  ++++ W  +I  Y + G   EA  L   MV  S  K + 
Sbjct: 271 YSKAGDMEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRLVDQMVA-SGLKFDA 330

Query: 322 VTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLERAKE 381
             ++++L+AC + G L LG  +H +LK    R  + SN  +  AL+DMY K G+L++A +
Sbjct: 331 AAVISILAACTESGLLSLGMRIHSILK----RSNLGSNAYVLNALLDMYAKCGNLKKAFD 390

Query: 382 VFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSHSGF 441
           VF+ +  KD++S+N M+ GL V+G G EA++LF++M+   IRP   TFI +L +C+H+G 
Sbjct: 391 VFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAVLCSCNHAGL 450

Query: 442 LEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWSSLL 501
           +++G   F  M   Y ++P +EHY C +DLL R G  ++A++VV TMP EPN  +W +LL
Sbjct: 451 IDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEPNVVIWGALL 510

Query: 502 RGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGIRK 561
             C +H++ ++A+ V   LV++DP +   Y + +N +A    W+ V+ +R  M+  G+ K
Sbjct: 511 GACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRSKMKSMGVEK 570

Query: 562 QPGQSWISIDGIVHEFFSATKSHPYVDLLYTL 588
             G S + ++  +HEF    KSHP  D +Y +
Sbjct: 571 PSGASSVELEDGIHEFTVFDKSHPKSDQIYQM 583

BLAST of Cla009208 vs. Swiss-Prot
Match: PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 313.9 bits (803), Expect = 3.7e-84
Identity = 172/439 (39.18%), Positives = 270/439 (61.50%), Query Frame = 1

Query: 144 ALLAVYA-RGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKALLLFVMMIKEN 203
           AL+  YA RG   + +A K+FDE+  +D+V  W ++I+GYA+ G  ++AL LF  M+K N
Sbjct: 205 ALIKGYASRGY--IENAQKLFDEIPVKDVVS-WNAMISGYAETGNYKEALELFKDMMKTN 264

Query: 204 MQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINIV--LIYLYGKW 263
           ++P++ TMV+V+SAC+       +   +EL + V+ +        ++ IV  LI LY K 
Sbjct: 265 VRPDESTMVTVVSACA-------QSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKC 324

Query: 264 GMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCKPNHVTMVT 323
           G +E +   F  +  K  ++ WN++I  Y       EAL LF+ M+ +    PN VTM++
Sbjct: 325 GELETACGLFERLPYK-DVISWNTLIGGYTHMNLYKEALLLFQEMLRSGET-PNDVTMLS 384

Query: 324 VLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLERAKEVFHQL 383
           +L ACA +G + +G W+H  +    R   + +   L T+LIDMY K G +E A +VF+ +
Sbjct: 385 ILPACAHLGAIDIGRWIHVYID--KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSI 444

Query: 384 INKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSHSGFLEQGR 443
           ++K + S+NAMI G A++G+ D +  LF++M+++ I+P   TF+GLLSACSHSG L+ GR
Sbjct: 445 LHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGR 504

Query: 444 QIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWSSLLRGCLL 503
            IF  MT  Y + P LEHY C IDLL  +G F++A E+++ M  EP+  +W SLL+ C +
Sbjct: 505 HIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKM 564

Query: 504 HSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGIRKQPGQS 563
           H   EL +  ++ L++++PEN   YV+ +N +A+  +W++V+  R  + +KG++K PG S
Sbjct: 565 HGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCS 624

Query: 564 WISIDGIVHEFFSATKSHP 580
            I ID +VHEF    K HP
Sbjct: 625 SIEIDSVVHEFIIGDKFHP 629


HSP 2 Score: 149.8 bits (377), Expect = 9.3e-35
Identity = 129/444 (29.05%), Positives = 203/444 (45.72%), Query Frame = 1

Query: 97  HFSLSPNDFTFSFL-----LKAFHRSSHALNLKQVHTHVLKMG-----YLGDSFISNALL 156
           HF  S +D  +  +     L   H      +L+ +H  ++K+G     Y     I   +L
Sbjct: 18  HFLPSSSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCIL 77

Query: 157 AVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKALLLFVMMIKENMQPE 216
           + +  GL    S  K   E +    +  W ++  G+A       AL L+V MI   + P 
Sbjct: 78  SPHFEGLPYAISVFKTIQEPN----LLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPN 137

Query: 217 DDTMVSVLSACSK------------------------FHIAEIEKWVVELR-QLVNKFDS 276
             T   VL +C+K                         H + I  +V   R +  +K   
Sbjct: 138 SYTFPFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFD 197

Query: 277 KSSCCDSINIV-LIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALA 336
           KS   D ++   LI  Y   G IE +++ F+EI  K  ++ WN+MI+ Y + G   EAL 
Sbjct: 198 KSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVK-DVVSWNAMISGYAETGNYKEALE 257

Query: 337 LFRLMVENSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATAL 396
           LF+ M++ ++ +P+  TMVTV+SACAQ G ++LG  VH  +   G      SN  +  AL
Sbjct: 258 LFKDMMK-TNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHG----FGSNLKIVNAL 317

Query: 397 IDMYCKSGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTT 456
           ID+Y K G LE A  +F +L  KDVIS+N +I G        EAL LF +M      P  
Sbjct: 318 IDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPND 377

Query: 457 GTFIGLLSACSHSGFLEQGRQIFIEMTTRYL-ILPSLEHYACYIDLLARAGHFEDALEVV 504
            T + +L AC+H G ++ GR I + +  R   +  +       ID+ A+ G  E A +V 
Sbjct: 378 VTMLSILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVF 437


HSP 3 Score: 124.4 bits (311), Expect = 4.2e-27
Identity = 103/376 (27.39%), Positives = 180/376 (47.87%), Query Frame = 1

Query: 14  SDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGL----RVFNQLIRPNI 73
           + L+   + N  L   H +VF    H+D +  T LI  Y     +    ++F+++   ++
Sbjct: 173 TSLISMYVQNGRLEDAH-KVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVKDV 232

Query: 74  FPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTH 133
              NA+I   AE  +   A  +FK +   ++ P++ T   ++ A  +S      +QVH  
Sbjct: 233 VSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQVHLW 292

Query: 134 VLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAE 193
           +   G+  +  I NAL+ +Y++   ++ +A  +F+ +  +D++  W +LI GY  M L +
Sbjct: 293 IDDHGFGSNLKIVNALIDLYSK-CGELETACGLFERLPYKDVIS-WNTLIGGYTHMNLYK 352

Query: 194 KALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSI 253
           +ALLLF  M++    P D TM+S+L AC+     +I +W+      ++K     +   S+
Sbjct: 353 EALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWI---HVYIDKRLKGVTNASSL 412

Query: 254 NIVLIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENS 313
              LI +Y K G IE + + FN I+ K SL  WN+MI  +  +G    +  LF  M    
Sbjct: 413 RTSLIDMYAKCGDIEAAHQVFNSILHK-SLSSWNAMIFGFAMHGRADASFDLFSRM-RKI 472

Query: 314 HCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLAT-----ALIDMY 373
             +P+ +T V +LSAC+  G L LG  +         R +    KM         +ID+ 
Sbjct: 473 GIQPDDITFVGLLSACSHSGMLDLGRHIF--------RTMTQDYKMTPKLEHYGCMIDLL 532

Query: 374 CKSGSLERAKEVFHQL 381
             SG  + A+E+ + +
Sbjct: 533 GHSGLFKEAEEMINMM 532

BLAST of Cla009208 vs. TrEMBL
Match: A0A0A0K4I3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G071580 PE=4 SV=1)

HSP 1 Score: 1099.7 bits (2843), Expect = 0.0e+00
Identity = 542/600 (90.33%), Positives = 570/600 (95.00%), Query Frame = 1

Query: 1   MRSLCVYPESINLSDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRV 60
           MR LCV PE I+LSDLLQGR+NNSHLRQIHARVFRLLKHQDNLIATRLIGHYP SVGLRV
Sbjct: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60

Query: 61  FNQLIRPNIFPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHA 120
           FNQLIRPNIFPCNAIIRVLAEHNSSF A SIFK LKH SLSPNDFTFSFLLKAFHRS +A
Sbjct: 61  FNQLIRPNIFPCNAIIRVLAEHNSSFFALSIFKYLKHLSLSPNDFTFSFLLKAFHRSCNA 120

Query: 121 LNLKQVHTHVLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIA 180
           LN+KQVHTHVLKMGY GDSFISN+LL VYARGLK+MASAHK+FDEMSDR+M CCWTSLIA
Sbjct: 121 LNVKQVHTHVLKMGYFGDSFISNSLLGVYARGLKEMASAHKLFDEMSDREMACCWTSLIA 180

Query: 181 GYAQMGLAEKALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFD 240
           GYAQMGLAEKA+LLF MM+KEN+QPEDDT+VSVLSACSK  IAEIEKWVVELRQLVNK D
Sbjct: 181 GYAQMGLAEKAMLLFFMMVKENIQPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240

Query: 241 SKSSCCDSINIVLIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALA 300
           SK SCCDSINIVLIYLYGKWGM+EKSEEKFNE+VDKRS+LVWNSMINAYFQNGFPVEAL 
Sbjct: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300

Query: 301 LFRLMVENSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATAL 360
           LFRLMVEN HCKPNHVTMVTV+SACAQIGDLQLG WVHEVL+ GGR+GIIASNKMLAT+L
Sbjct: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360

Query: 361 IDMYCKSGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTT 420
           IDMYCK GSLERAKEVFHQLINKDVI+FNAMIMGLAVN KGDEALKLFAQMQE++I P+T
Sbjct: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVS 480
           GTFIGLLSACSHSGFLEQGRQIFIEMTT YL+ PSLEHYACYIDLLARAGHF+DALEV+S
Sbjct: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 480

Query: 481 TMPFEPNNFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDD 540
           TMPFEPNNFVWSSLLRGCLLHS+FELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDD
Sbjct: 481 TMPFEPNNFVWSSLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDD 540

Query: 541 VSALRWFMREKGIRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKLVIP 600
           VSALRWFMREKG+ KQPGQSWISIDG VHEFFSATKSHPYVDLLY TLN LEKQMKLVIP
Sbjct: 541 VSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP 600

BLAST of Cla009208 vs. TrEMBL
Match: M5W238_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021613mg PE=4 SV=1)

HSP 1 Score: 755.7 bits (1950), Expect = 4.1e-215
Identity = 369/581 (63.51%), Positives = 464/581 (79.86%), Query Frame = 1

Query: 19  GRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRVFNQLIRPNIFPCNAIIRV 78
           GR++   L QIHA+VF++   QDNLIATRLIGHYP  + LRVF+QL +PNIFP NAIIRV
Sbjct: 60  GRISYPRLLQIHAQVFQVGAQQDNLIATRLIGHYPSHLALRVFHQLQKPNIFPFNAIIRV 119

Query: 79  LAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTHVLKMGYLGD 138
            AE      AFS+FKSLK  SLSPNDFTFSFLLKA  RS ++  +KQ+HTHV+KMG+L +
Sbjct: 120 FAEEGLFSDAFSLFKSLKQTSLSPNDFTFSFLLKACFRSQNSRYVKQIHTHVMKMGFLCN 179

Query: 139 SFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKALLLFVMM 198
           SF+  +LLAVYA+GLKD+ SA  VFDEM ++ +VCCWTSLIAGYA  G +E+ L LF+MM
Sbjct: 180 SFVCASLLAVYAKGLKDLGSARLVFDEMPEKSIVCCWTSLIAGYALSGQSEQVLRLFLMM 239

Query: 199 IKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINIVLIYLYG 258
           + EN++PEDDTMVSVLSACS   I +IEKWV  L ++V+  D+K   CDS+N  L+YLYG
Sbjct: 240 VDENLRPEDDTMVSVLSACSNLDIVDIEKWVTILSKVVSNVDAKKFGCDSVNTALVYLYG 299

Query: 259 KWGMIEKSEEKFNEIVD--KRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCKPNHV 318
           KWG +EKS ++F++I D  K+S+L WN+MI A+ QNGFP+E+L+LFR+MVE+   +PNHV
Sbjct: 300 KWGKVEKSRDRFDQISDNGKQSVLPWNAMIGAFVQNGFPMESLSLFRVMVEDPKYRPNHV 359

Query: 319 TMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLERAKEV 378
           TMV+VLSACAQIGDL LG WVHE LK  G +G+I SN++LATALIDMY K GSLERAKEV
Sbjct: 360 TMVSVLSACAQIGDLDLGRWVHEYLKSKGSKGVIGSNRILATALIDMYSKCGSLERAKEV 419

Query: 379 FHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSHSGFL 438
           F Q+++KD++SFNAMIMGLAVN +G+EAL+LF+++QE  ++P  GTF+G L ACSHSG  
Sbjct: 420 FDQMVSKDIVSFNAMIMGLAVNSEGEEALRLFSRIQEFGLQPNAGTFLGALCACSHSGLS 479

Query: 439 EQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWSSLLR 498
           E+GRQIF +MT+ + +   LEHYACY+DLLAR G  E+ALEVV++MPFEPN+FVW +LL 
Sbjct: 480 EEGRQIFNDMTSSFSVSSKLEHYACYVDLLARVGLVEEALEVVTSMPFEPNSFVWGALLG 539

Query: 499 GCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGIRKQ 558
           GCLLHS+ +LAQYVS KLV  DP+NS GY+M AN+FA+D +W DVSALRW MREKG+ KQ
Sbjct: 540 GCLLHSRVDLAQYVSNKLVRSDPDNSGGYIMLANAFASDRRWGDVSALRWVMREKGVNKQ 599

Query: 559 PGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKL 597
           PG SWISIDG+VHEF     SHP ++ +Y TL  L K+MK+
Sbjct: 600 PGCSWISIDGVVHEFLVGCPSHPQIESIYNTLVGLVKEMKI 640

BLAST of Cla009208 vs. TrEMBL
Match: F6H681_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0091g00370 PE=4 SV=1)

HSP 1 Score: 744.6 bits (1921), Expect = 9.5e-212
Identity = 363/584 (62.16%), Positives = 460/584 (78.77%), Query Frame = 1

Query: 16  LLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRVFNQLIRPNIFPCNAI 75
           +LQG +++SHL QIHA++FR+L HQDNL+ATRLIGHYP  + LRVF+QL+ PNIFP NAI
Sbjct: 1   MLQGHISHSHLLQIHAQIFRVLAHQDNLVATRLIGHYPSRLALRVFDQLLTPNIFPFNAI 60

Query: 76  IRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTHVLKMGY 135
           IRVL E +    AF +FK+L   SLSPNDFTFSFLLKA  RS+ A  +KQ HTHV+K+G+
Sbjct: 61  IRVLGEESLCSCAFFVFKALLQRSLSPNDFTFSFLLKACFRSNDAKYVKQAHTHVVKLGF 120

Query: 136 LGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKALLLF 195
           + DSFI N LL  YA G KDM S  KVFDEM DR MV CWTSLIAG AQ G  E+ L LF
Sbjct: 121 VSDSFICNGLLVAYAMGFKDMISGRKVFDEMPDRAMVRCWTSLIAGSAQSGQTEEVLRLF 180

Query: 196 VMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINIVLIY 255
            MM+KEN++PE+DT+VSVLSACSK    EIEKWV+ L + +N  D+ S   DS+N VL Y
Sbjct: 181 FMMVKENLRPENDTIVSVLSACSKLEAVEIEKWVMILSEFINDDDTGSFGRDSVNTVLAY 240

Query: 256 LYGKWGMIEKSEEKFNEIVD--KRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCKP 315
           LYGKWG +EK +E+F+EIV   KRS+L WN +I+AY QNG   EAL+LFR+M+E+ + +P
Sbjct: 241 LYGKWGKVEKCKERFDEIVGIGKRSVLPWNVIISAYVQNGCSFEALSLFRVMIEDLNLRP 300

Query: 316 NHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLERA 375
           NHVTMV+VLSACAQ+GDL LG W+H  +K  G + I+ SN  LATALIDMY K G+L +A
Sbjct: 301 NHVTMVSVLSACAQVGDLDLGKWIHGYVKSEGCKAIVESNTFLATALIDMYSKCGNLGKA 360

Query: 376 KEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSHS 435
           K+VF Q+++KDV+SFNAMIMGLA+NG+G+EAL+LF++MQE+ +RP +GTF+G+L ACSHS
Sbjct: 361 KDVFEQMVSKDVVSFNAMIMGLAINGEGEEALRLFSKMQELSLRPNSGTFLGVLCACSHS 420

Query: 436 GFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWSS 495
           G L+ GRQ+F++M   + + P LEHYACY+DLLAR G  E+A EVV++MPF PNNFVW +
Sbjct: 421 GLLDTGRQMFLDMIPHFSVPPELEHYACYVDLLARVGLLEEAFEVVASMPFVPNNFVWGA 480

Query: 496 LLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGI 555
           LL+GC LHS+ ELAQ VS+KLV+VDPENSAGYVM +N+ A+D QW +VS LRW MREKG+
Sbjct: 481 LLQGCRLHSRLELAQDVSQKLVKVDPENSAGYVMFSNALASDQQWGEVSGLRWLMREKGV 540

Query: 556 RKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKL 597
           RK PG SWIS++ +VHEF + + SHP +D +Y TLN L K+MK+
Sbjct: 541 RKHPGCSWISVNRVVHEFLAGSLSHPQIDSIYHTLNGLVKEMKV 584

BLAST of Cla009208 vs. TrEMBL
Match: A0A061E036_THECC (Pentatricopeptide repeat-containing protein OS=Theobroma cacao GN=TCM_007174 PE=4 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 1.5e-209
Identity = 366/591 (61.93%), Positives = 460/591 (77.83%), Query Frame = 1

Query: 12  NLSDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRVFNQLIRPNIFP 71
           NLS LLQGR+ +SHLRQIHAR+FRL  HQDNL+ATRLIGHYP S  LRVFNQL  PNIFP
Sbjct: 57  NLSLLLQGRILHSHLRQIHARIFRLNAHQDNLVATRLIGHYPSSFALRVFNQLHNPNIFP 116

Query: 72  CNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTHVL 131
            NAIIRVLAE+   FLA S F +L   SLSPND TFSFLLKA   S+ A  + Q+HT+++
Sbjct: 117 FNAIIRVLAENGLFFLACSFFNNLIQRSLSPNDLTFSFLLKACFLSNDAQYVNQIHTYII 176

Query: 132 KMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKA 191
           K+GYL D  + N LL+VYA+G KD+ASAHK+FDEM ++  V  WT+LIA YA+ G  E+ 
Sbjct: 177 KLGYLCDPTVCNGLLSVYAQGFKDVASAHKLFDEMPEKVSVTPWTNLIACYARSGRNEEV 236

Query: 192 LLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINI 251
           L LF  MI++N++PE+DTMVSVLSACS   I +IEKWV  L ++++  D+K    DS+NI
Sbjct: 237 LRLFCSMIEKNLRPENDTMVSVLSACSSAEIFDIEKWVTILSEIIHNSDNKIPNRDSVNI 296

Query: 252 VLIYLYGKWGMIEKSEEKFNEI--VDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENS 311
            LIYLYG+   +EKS E+FNEI  + K S++ WN+MI AY QNG P+EAL+LF LM+E+S
Sbjct: 297 ALIYLYGRLENVEKSRERFNEIYAIGKMSVIPWNAMIGAYVQNGCPMEALSLFHLMMEDS 356

Query: 312 HCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGS 371
           +C+PNHVTMV+VLSACAQ+GDL LG WVH+ L+  GR+G++ +N  LATALIDMY K G 
Sbjct: 357 NCRPNHVTMVSVLSACAQMGDLDLGKWVHQYLEYNGRKGVLETNTFLATALIDMYSKCGD 416

Query: 372 LERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSA 431
           LE AK VF Q+I+KDV+SFNAMIMGLA+NG+G+EA+ L +++QE+ + P  GTF+GLL A
Sbjct: 417 LEMAKRVFDQMISKDVVSFNAMIMGLAMNGEGEEAVSLLSKVQELGLHPNAGTFLGLLCA 476

Query: 432 CSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNF 491
           CSHSG  E+GRQIF+EM +R+ + P LEHYACYID+LAR G  E AL VV +MP+EPNNF
Sbjct: 477 CSHSGLSEEGRQIFLEMNSRFSVYPRLEHYACYIDILARVGLVEAALTVVDSMPYEPNNF 536

Query: 492 VWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMR 551
           VW +LL GC+LHS+ +LAQ V KKLVEVDP+NS GYVM AN+ A D +W+DVS LRW MR
Sbjct: 537 VWGALLGGCVLHSRADLAQKVYKKLVEVDPQNSGGYVMLANTLAVDHRWNDVSVLRWLMR 596

Query: 552 EKGIRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKLVIP 600
           EKG++KQPG SWISIDG+VHEF + + SHP ++ +Y TLN L   MK+  P
Sbjct: 597 EKGVKKQPGHSWISIDGVVHEFLAGSPSHPKMESIYHTLNGLVNVMKVTSP 647

BLAST of Cla009208 vs. TrEMBL
Match: A0A067LJI3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16282 PE=4 SV=1)

HSP 1 Score: 736.9 bits (1901), Expect = 2.0e-209
Identity = 357/593 (60.20%), Positives = 462/593 (77.91%), Query Frame = 1

Query: 10  SINLSDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRVFNQLIRPNI 69
           S  LS LLQGR+   HL QIHA+VFRL  HQDNLIATRLIGHYP    +R+FNQ+  PN+
Sbjct: 9   SATLSALLQGRIPIPHLLQIHAKVFRLDAHQDNLIATRLIGHYPSKFSIRLFNQIQNPNL 68

Query: 70  FPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTH 129
           FP NAIIRVLA       +F +F+ LK   L PND TFSF+LKA   S +   ++QVHTH
Sbjct: 69  FPFNAIIRVLAHEGDFHGSFLLFRRLKRQHLYPNDLTFSFILKACFGSKNVFYVEQVHTH 128

Query: 130 VLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAE 189
           + K+G++ D F+ NALLA+YA+G KD+ SA  +FDEM ++ +VCCWTSLIAG+AQ G AE
Sbjct: 129 IFKVGFITDPFVCNALLALYAKGFKDLVSARMLFDEMPEKGVVCCWTSLIAGFAQSGYAE 188

Query: 190 KALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSI 249
           +AL  F +M+KEN+ PEDDT+VSVLSACS   I +IEKW+  L +L+N+ DSK    DS+
Sbjct: 189 EALRFFRLMVKENLSPEDDTLVSVLSACSSLEIHQIEKWLTLLLELINEIDSKIR--DSV 248

Query: 250 NIVLIYLYGKWGMIEKSEEKFNEIVD--KRSLLVWNSMINAYFQNGFPVEALALFRLMVE 309
           N VL+YLYGKWG IEKS E+F++I D  KRS+L WNSMINAY QNG  +  L LFRLM+ 
Sbjct: 249 NNVLVYLYGKWGNIEKSRERFDDISDDGKRSVLPWNSMINAYVQNGDSLGGLNLFRLMIM 308

Query: 310 NSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKS 369
           +  C+PNHVTMV+VLSACAQIGDL+LG WVH+ +K  G++G++ SN++LATA IDMY K 
Sbjct: 309 DPTCRPNHVTMVSVLSACAQIGDLELGMWVHQYMKSRGQKGVLQSNRILATAFIDMYSKC 368

Query: 370 GSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLL 429
           GSL++AK+VF+Q+++KDV+SFNAMIMGLA+NG+G +A+ LF++MQE  + P  GTF+GLL
Sbjct: 369 GSLDKAKDVFNQMVSKDVVSFNAMIMGLAINGEGVKAVNLFSKMQEFGLHPNPGTFLGLL 428

Query: 430 SACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPN 489
            ACSHSG  ++G++IF++M++R+L+ P LEHYACYIDLLAR GH E+A +V ++MPF+PN
Sbjct: 429 WACSHSGLSDEGQKIFLDMSSRFLVRPKLEHYACYIDLLAREGHLEEAFKVTTSMPFKPN 488

Query: 490 NFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWF 549
           NFVW +LL GCLLH K +LA+ + K+LVEVDP NSAGYVM AN FA D +W+DVSALRWF
Sbjct: 489 NFVWGALLGGCLLHYKVDLAKIIYKRLVEVDPANSAGYVMLANIFAVDHKWNDVSALRWF 548

Query: 550 MREKGIRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKLVIP 600
           MREKG++KQPG SWI+++GIVHEF   + SHP ++ +Y  L+ L + MK   P
Sbjct: 549 MREKGVKKQPGCSWINVNGIVHEFLVGSPSHPQMESIYHILHGLVRDMKNANP 599
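The percentages in an HSP summary line are the identical (or positive) column counts divided by the aligned length. As a rough check, the following sketch parses such a line and recomputes both percentages for this match. The helper name `parse_hsp_stats` and the returned field names are hypothetical, and the pattern accepts spelling variants of the "Positives" label.

```python
import re

# Hypothetical pattern for one HSP summary line as printed on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the counts rather than copied from the text.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as reported in the line, for comparison.
        "reported": (float(ident_pct), float(pos_pct)),
    }


stats = parse_hsp_stats(
    "Identity = 357/593 (60.20%), Positives = 462/593 (77.91%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For this match the recomputed values agree with the reported ones: 357/593 rounds to 60.20% and 462/593 to 77.91%.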

BLAST of Cla009208 vs. NCBI nr
Match: gi|659110039|ref|XP_008455016.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like isoform X1 [Cucumis melo])

HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 548/600 (91.33%), Positives = 570/600 (95.00%), Query Frame = 1

Query: 1   MRSLCVYPESINLSDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRV 60
           MR L V PE INLSDLLQGR+NNSHLRQIHARVFRLLKHQDNLIATRLIGHYP SVGLRV
Sbjct: 9   MRCLFVNPEFINLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 68

Query: 61  FNQLIRPNIFPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHA 120
           FNQLIRPNIFPCNAIIRVLAEHN+SFLA SIFKSLKH SLSPNDFTFSFLLKAFHRS +A
Sbjct: 69  FNQLIRPNIFPCNAIIRVLAEHNTSFLALSIFKSLKHLSLSPNDFTFSFLLKAFHRSCNA 128

Query: 121 LNLKQVHTHVLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIA 180
           L++KQVHTHVLKMGY GDSFISNALL VYARGLKDMASAHKVFDEMSDR+M CCWTSLIA
Sbjct: 129 LDVKQVHTHVLKMGYFGDSFISNALLGVYARGLKDMASAHKVFDEMSDREMACCWTSLIA 188

Query: 181 GYAQMGLAEKALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFD 240
           GYAQMGLAEKA+L+FV MIKENMQPEDDTMVSVLSACSKF IAEIEKWVV LR+LVNKFD
Sbjct: 189 GYAQMGLAEKAMLIFVTMIKENMQPEDDTMVSVLSACSKFQIAEIEKWVVALRELVNKFD 248

Query: 241 SKSSCCDSINIVLIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALA 300
           SKSSCCDSINIVLIYLYGKWGM+EKSEEKFNEI+DK+S+LVWNSMINAYFQNGFPVEAL 
Sbjct: 249 SKSSCCDSINIVLIYLYGKWGMVEKSEEKFNEIIDKKSVLVWNSMINAYFQNGFPVEALT 308

Query: 301 LFRLMVENSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATAL 360
           LFRLMVEN HCKPNHVTMVTV+SACAQIGDLQLG WVHEVL+  GR+GIIASNKMLATAL
Sbjct: 309 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRSGRKGIIASNKMLATAL 368

Query: 361 IDMYCKSGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTT 420
           IDMYCK GSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQE+DIRP+T
Sbjct: 369 IDMYCKCGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEIDIRPST 428

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVS 480
           GTFIGLLSACSHSGFLEQG QIFIEMTT+YLI PSLEHYACYIDLLARAG FEDALEVVS
Sbjct: 429 GTFIGLLSACSHSGFLEQGHQIFIEMTTQYLISPSLEHYACYIDLLARAGRFEDALEVVS 488

Query: 481 TMPFEPNNFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDD 540
           TMPFEPNNFVWSSLLRGCLLHS FELAQYVSKKLVEVDPENSAGYVMQANSFA+D QWDD
Sbjct: 489 TMPFEPNNFVWSSLLRGCLLHSSFELAQYVSKKLVEVDPENSAGYVMQANSFASDRQWDD 548

Query: 541 VSALRWFMREKGIRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKLVIP 600
           VSALRWFMREKG+ KQPGQSWISIDG VHEFFSATKSHPYVDLLY TLN L+KQ KLVIP
Sbjct: 549 VSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYSTLNELDKQTKLVIP 608

BLAST of Cla009208 vs. NCBI nr
Match: gi|700188636|gb|KGN43869.1| (hypothetical protein Csa_7G071580 [Cucumis sativus])

HSP 1 Score: 1099.7 bits (2843), Expect = 0.0e+00
Identity = 542/600 (90.33%), Positives = 570/600 (95.00%), Query Frame = 1

Query: 1   MRSLCVYPESINLSDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRV 60
           MR LCV PE I+LSDLLQGR+NNSHLRQIHARVFRLLKHQDNLIATRLIGHYP SVGLRV
Sbjct: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60

Query: 61  FNQLIRPNIFPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHA 120
           FNQLIRPNIFPCNAIIRVLAEHNSSF A SIFK LKH SLSPNDFTFSFLLKAFHRS +A
Sbjct: 61  FNQLIRPNIFPCNAIIRVLAEHNSSFFALSIFKYLKHLSLSPNDFTFSFLLKAFHRSCNA 120

Query: 121 LNLKQVHTHVLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIA 180
           LN+KQVHTHVLKMGY GDSFISN+LL VYARGLK+MASAHK+FDEMSDR+M CCWTSLIA
Sbjct: 121 LNVKQVHTHVLKMGYFGDSFISNSLLGVYARGLKEMASAHKLFDEMSDREMACCWTSLIA 180

Query: 181 GYAQMGLAEKALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFD 240
           GYAQMGLAEKA+LLF MM+KEN+QPEDDT+VSVLSACSK  IAEIEKWVVELRQLVNK D
Sbjct: 181 GYAQMGLAEKAMLLFFMMVKENIQPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240

Query: 241 SKSSCCDSINIVLIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALA 300
           SK SCCDSINIVLIYLYGKWGM+EKSEEKFNE+VDKRS+LVWNSMINAYFQNGFPVEAL 
Sbjct: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300

Query: 301 LFRLMVENSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATAL 360
           LFRLMVEN HCKPNHVTMVTV+SACAQIGDLQLG WVHEVL+ GGR+GIIASNKMLAT+L
Sbjct: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360

Query: 361 IDMYCKSGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTT 420
           IDMYCK GSLERAKEVFHQLINKDVI+FNAMIMGLAVN KGDEALKLFAQMQE++I P+T
Sbjct: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVS 480
           GTFIGLLSACSHSGFLEQGRQIFIEMTT YL+ PSLEHYACYIDLLARAGHF+DALEV+S
Sbjct: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 480

Query: 481 TMPFEPNNFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDD 540
           TMPFEPNNFVWSSLLRGCLLHS+FELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDD
Sbjct: 481 TMPFEPNNFVWSSLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDD 540

Query: 541 VSALRWFMREKGIRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKLVIP 600
           VSALRWFMREKG+ KQPGQSWISIDG VHEFFSATKSHPYVDLLY TLN LEKQMKLVIP
Sbjct: 541 VSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP 600

BLAST of Cla009208 vs. NCBI nr
Match: gi|659110041|ref|XP_008455017.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like isoform X2 [Cucumis melo])

HSP 1 Score: 870.5 bits (2248), Expect = 1.7e-249
Identity = 429/468 (91.67%), Positives = 446/468 (95.30%), Query Frame = 1

Query: 133 MGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKAL 192
           MGY GDSFISNALL VYARGLKDMASAHKVFDEMSDR+M CCWTSLIAGYAQMGLAEKA+
Sbjct: 1   MGYFGDSFISNALLGVYARGLKDMASAHKVFDEMSDREMACCWTSLIAGYAQMGLAEKAM 60

Query: 193 LLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINIV 252
           L+FV MIKENMQPEDDTMVSVLSACSKF IAEIEKWVV LR+LVNKFDSKSSCCDSINIV
Sbjct: 61  LIFVTMIKENMQPEDDTMVSVLSACSKFQIAEIEKWVVALRELVNKFDSKSSCCDSINIV 120

Query: 253 LIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCK 312
           LIYLYGKWGM+EKSEEKFNEI+DK+S+LVWNSMINAYFQNGFPVEAL LFRLMVEN HCK
Sbjct: 121 LIYLYGKWGMVEKSEEKFNEIIDKKSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCK 180

Query: 313 PNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLER 372
           PNHVTMVTV+SACAQIGDLQLG WVHEVL+  GR+GIIASNKMLATALIDMYCK GSLER
Sbjct: 181 PNHVTMVTVISACAQIGDLQLGSWVHEVLQRSGRKGIIASNKMLATALIDMYCKCGSLER 240

Query: 373 AKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSH 432
           AKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQE+DIRP+TGTFIGLLSACSH
Sbjct: 241 AKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEIDIRPSTGTFIGLLSACSH 300

Query: 433 SGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWS 492
           SGFLEQG QIFIEMTT+YLI PSLEHYACYIDLLARAG FEDALEVVSTMPFEPNNFVWS
Sbjct: 301 SGFLEQGHQIFIEMTTQYLISPSLEHYACYIDLLARAGRFEDALEVVSTMPFEPNNFVWS 360

Query: 493 SLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKG 552
           SLLRGCLLHS FELAQYVSKKLVEVDPENSAGYVMQANSFA+D QWDDVSALRWFMREKG
Sbjct: 361 SLLRGCLLHSSFELAQYVSKKLVEVDPENSAGYVMQANSFASDRQWDDVSALRWFMREKG 420

Query: 553 IRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKLVIP 600
           + KQPGQSWISIDG VHEFFSATKSHPYVDLLY TLN L+KQ KLVIP
Sbjct: 421 VHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYSTLNELDKQTKLVIP 468

BLAST of Cla009208 vs. NCBI nr
Match: gi|778724922|ref|XP_011658883.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucumis sativus])

HSP 1 Score: 865.1 bits (2234), Expect = 6.9e-248
Identity = 423/468 (90.38%), Positives = 447/468 (95.51%), Query Frame = 1

Query: 133 MGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAEKAL 192
           MGY GDSFISN+LL VYARGLK+MASAHK+FDEMSDR+M CCWTSLIAGYAQMGLAEKA+
Sbjct: 1   MGYFGDSFISNSLLGVYARGLKEMASAHKLFDEMSDREMACCWTSLIAGYAQMGLAEKAM 60

Query: 193 LLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSINIV 252
           LLF MM+KEN+QPEDDT+VSVLSACSK  IAEIEKWVVELRQLVNK DSK SCCDSINIV
Sbjct: 61  LLFFMMVKENIQPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIV 120

Query: 253 LIYLYGKWGMIEKSEEKFNEIVDKRSLLVWNSMINAYFQNGFPVEALALFRLMVENSHCK 312
           LIYLYGKWGM+EKSEEKFNE+VDKRS+LVWNSMINAYFQNGFPVEAL LFRLMVEN HCK
Sbjct: 121 LIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCK 180

Query: 313 PNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKSGSLER 372
           PNHVTMVTV+SACAQIGDLQLG WVHEVL+ GGR+GIIASNKMLAT+LIDMYCK GSLER
Sbjct: 181 PNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLER 240

Query: 373 AKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLLSACSH 432
           AKEVFHQLINKDVI+FNAMIMGLAVN KGDEALKLFAQMQE++I P+TGTFIGLLSACSH
Sbjct: 241 AKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSH 300

Query: 433 SGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPNNFVWS 492
           SGFLEQGRQIFIEMTT YL+ PSLEHYACYIDLLARAGHF+DALEV+STMPFEPNNFVWS
Sbjct: 301 SGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWS 360

Query: 493 SLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKG 552
           SLLRGCLLHS+FELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKG
Sbjct: 361 SLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKG 420

Query: 553 IRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKLVIP 600
           + KQPGQSWISIDG VHEFFSATKSHPYVDLLY TLN LEKQMKLVIP
Sbjct: 421 VHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP 468

BLAST of Cla009208 vs. NCBI nr
Match: gi|645261674|ref|XP_008236408.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Prunus mume])

HSP 1 Score: 762.7 bits (1968), Expect = 4.8e-217
Identity = 372/590 (63.05%), Positives = 471/590 (79.83%), Query Frame = 1

Query: 10  SINLSDLLQGRMNNSHLRQIHARVFRLLKHQDNLIATRLIGHYPPSVGLRVFNQLIRPNI 69
           S +L+  LQGR++   L QIHA+VF++   QDNLIATRLIGHYP  + LRVF+QL +PNI
Sbjct: 38  STDLAASLQGRISYPRLLQIHAQVFQVGAQQDNLIATRLIGHYPSHLALRVFHQLQKPNI 97

Query: 70  FPCNAIIRVLAEHNSSFLAFSIFKSLKHFSLSPNDFTFSFLLKAFHRSSHALNLKQVHTH 129
           FP NAIIRV AE      AFS+FK LK  SLSPNDFTFSFLLKA  RS ++  +KQ+HTH
Sbjct: 98  FPFNAIIRVFAEEGLFSDAFSLFKILKQTSLSPNDFTFSFLLKACFRSENSRYVKQIHTH 157

Query: 130 VLKMGYLGDSFISNALLAVYARGLKDMASAHKVFDEMSDRDMVCCWTSLIAGYAQMGLAE 189
           V K+G+L +SF+  +LLAVYA+GLKD+ SAH VFDEM ++ +VCCWTSLIAGYA+ G +E
Sbjct: 158 VTKVGFLCNSFVCASLLAVYAKGLKDLGSAHLVFDEMPEKSIVCCWTSLIAGYARSGQSE 217

Query: 190 KALLLFVMMIKENMQPEDDTMVSVLSACSKFHIAEIEKWVVELRQLVNKFDSKSSCCDSI 249
           + L LF+MM+ EN++PEDDTMVSVLSACS   I ++EKWV  L ++V+  D+K   CDS+
Sbjct: 218 QVLRLFLMMVDENLRPEDDTMVSVLSACSNLDIVDVEKWVTILSEVVSNVDAKKFGCDSV 277

Query: 250 NIVLIYLYGKWGMIEKSEEKFNEIVD--KRSLLVWNSMINAYFQNGFPVEALALFRLMVE 309
           N  L+YLYGKWG +EKS ++F++I D  K+S+L WN+MI A+ QNGFP+E+L+LFR+MVE
Sbjct: 278 NTALVYLYGKWGKVEKSRDQFDQISDNGKQSVLPWNAMIGAFVQNGFPMESLSLFRVMVE 337

Query: 310 NSHCKPNHVTMVTVLSACAQIGDLQLGCWVHEVLKCGGRRGIIASNKMLATALIDMYCKS 369
           +   +PNHVTMV+VLSACAQIGDL LG WVHE LK  G +G+I SN++LATALIDMY K 
Sbjct: 338 DPKYRPNHVTMVSVLSACAQIGDLDLGRWVHEYLKSKGSKGVIGSNRILATALIDMYSKC 397

Query: 370 GSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEVDIRPTTGTFIGLL 429
           GSLERAKEVF Q+++KD++SFNAMIMGLAVN +G+EAL+LF+++Q+  ++P  GTF+G L
Sbjct: 398 GSLERAKEVFDQMVSKDIVSFNAMIMGLAVNSEGEEALRLFSRIQKFGLQPNAGTFLGAL 457

Query: 430 SACSHSGFLEQGRQIFIEMTTRYLILPSLEHYACYIDLLARAGHFEDALEVVSTMPFEPN 489
            ACSHSG  E+GRQIF +MT+ + + P LEHYACYIDLLAR G  E+ALEVV++MPFEPN
Sbjct: 458 CACSHSGLSEEGRQIFNDMTSSFSVSPKLEHYACYIDLLARVGLVEEALEVVTSMPFEPN 517

Query: 490 NFVWSSLLRGCLLHSKFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWF 549
           +FVW +LL GCLLHS+ +LAQYVS KLV  DP+NS GY+M AN+FA+D +W DVS LRWF
Sbjct: 518 SFVWGALLGGCLLHSRVDLAQYVSNKLVRSDPDNSGGYIMLANAFASDRRWGDVSVLRWF 577

Query: 550 MREKGIRKQPGQSWISIDGIVHEFFSATKSHPYVDLLY-TLNVLEKQMKL 597
           MREKG+ KQPG SWISIDG+VHEF     SHP ++ +Y TL  L K+MK+
Sbjct: 578 MREKGVTKQPGFSWISIDGVVHEFLVGCPSHPQIESIYNTLVGLVKEMKI 627

The following BLAST results are available for this feature:
Match Name    E-value   Identity (%)  Description
PP219_ARATH   3.4e-93   34.36         Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th...
PP175_ARATH   7.0e-91   34.16         Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP224_ARATH   3.9e-89   34.24         Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP261_ARATH   3.9e-89   33.22         Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana GN...
PPR21_ARATH   3.7e-84   39.18         Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...

Match Name         E-value    Identity (%)  Description
A0A0A0K4I3_CUCSA   0.0e+00    90.33         Uncharacterized protein OS=Cucumis sativus GN=Csa_7G071580 PE=4 SV=1
M5W238_PRUPE       4.1e-215   63.51         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021613mg PE=4 SV=1
F6H681_VITVI       9.5e-212   62.16         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0091g00370 PE=4 SV=...
A0A061E036_THECC   1.5e-209   61.93         Pentatricopeptide repeat-containing protein OS=Theobroma cacao GN=TCM_007174 PE=...
A0A067LJI3_JATCU   2.0e-209   60.20         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16282 PE=4 SV=1

Match Name                         E-value    Identity (%)  Description
gi|659110039|ref|XP_008455016.1|   0.0e+00    91.33         PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like isoform X1...
gi|700188636|gb|KGN43869.1|        0.0e+00    90.33         hypothetical protein Csa_7G071580 [Cucumis sativus]
gi|659110041|ref|XP_008455017.1|   1.7e-249   91.67         PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like isoform X2...
gi|778724922|ref|XP_011658883.1|   6.9e-248   90.38         PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucum...
gi|645261674|ref|XP_008236408.1|   4.8e-217   63.05         PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like [Prunus mu...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016787      hydrolase activity
molecular_function  GO:0005515      protein binding
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                          Sequence type in Unigene
WMU52548      watermelon EST collection version 2.0  transcribed_cluster
WMU55035      watermelon EST collection version 2.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Cla009208     Cla009208.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
WMU55035      WMU55035     transcribed_cluster
WMU52548      WMU52548     transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                        Source    Source Term       Source Description
    Alignment

IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 253..276  score: 0.49
    coord: 174..202  score: 6.

IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 383..431  score: 7.9E-10
    coord: 67..115   score: 3.5E-7
    coord: 281..326  score: 5.

IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 281..314  score: 5.2E-8
    coord: 358..384  score: 1.5E-4
    coord: 174..205  score: 2.6E-5
    coord: 386..419  score: 6.

IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 353..383  score: 8.659
    coord: 314..348  score: 6.073
    coord: 487..521  score: 6.095
    coord: 384..418  score: 11.871
    coord: 138..169  score: 7.684
    coord: 278..312  score: 9.778
    coord: 455..485  score: 6.96
    coord: 68..102   score: 7.322
    coord: 171..205  score: 9.613
    coord: 103..137  score: 5.503
    coord: 419..453  score: 7

IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 463..527  score: 5.0E-6
    coord: 263..427  score: 5.

IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like
    coord: 269..523  score: 9.4

None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 19..562   score: 1.3E

None       No IPR available                       PANTHER   PTHR24015:SF514   SUBFAMILY NOT NAMED
    coord: 19..562   score: 1.3E

The following gene(s) are orthologous to this gene:
Gene       Orthologue       Organism                     Block
Cla009208  Cla97C06G113610  Watermelon (97103) v2        wmwmbB406
Cla009208  ClCG06G004190    Watermelon (Charleston Gray) wcgwmB344
The following gene(s) are paralogous to this gene:

None