CmoCh16G004070 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G004070
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing family protein
Location: Cmo_Chr16 : 1870820 .. 1872859 (+)
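
As a quick sanity check on the location given above, the span length can be computed from the inclusive start and end coordinates. The following is a minimal sketch only: the `parse_location` helper and its regular expression are assumptions made for this illustration and are not part of the database's own tooling.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Cmo_Chr16 : 1870820 .. 1872859 (+)'.

    Hypothetical helper for illustration; returns (chromosome, start, end, strand).
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr16 : 1870820 .. 1872859 (+)")

# Coordinates are treated as 1-based and inclusive, so the length is end - start + 1.
length = end - start + 1

# Number of whole codons in the span (including a stop codon, if present).
codons = length // 3

print(chrom, strand, length, codons)
```

The codon count is only meaningful if the feature consists of coding sequence alone, which the identical gene, mRNA and CDS sequences listed for this feature suggest.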
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCAATGGCATTCGGTTATCGATCTTCATCCCAAACCCTAATCGCCTTCTCTTTCGGATCCTCCATTCTTACCTCGGTTCTTCTCACATCGACATTGCCCCTCCGCCATCATCCCCACCATTCAAATGCTCAATCTCGCCCCGTTCCCTCTCTGCAACTCTTCGCAACCTCCTGCAGCCGCTCTCTGCGCCGGATCCACCTCCGATTCTCTCTTATGCCTCGGTTTTCCAGTTCCTTACTGGCCAAAATCTGTTGAAATTGGGCCAACAAGTTCATGCCCATATGCTTCTCCGTGGCCTTGAGCCCACTGCACTTGTTGGTTCCAAGATGGTTGCGTTTTATGCGAGTTCTGGTGATATTGATTCATCTGTTGCGGTTTTCAATCGGATTAGTGAGCCTTCTTCTCTCTTGTTTAATTCTATGATTCGAGCCTATGCGCGATATGGGTTTGCGGAGAGAACGGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACAGGGGATTACTTTACTTTTCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGTTTGGATGGGGAAATGTGTTCATGGACTGGTTTTGAGAGCTGGGTTGGAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGTGAAATAAATGATGCGCGTAAGGTGTTTGATAAAATGACTGTTAGAGATGTTTCGTCTTGGAATGCTTTACTTGCTGGTTACATGAAGGGGGGGTTTATAGATGCTGCTGTGGCGATTTTTGAGAGAATGCCGTGGAGGAATATCGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGCTGAAAGAAGATTCAGGAGTAAGACCCAATTGGGTGACTATAATGAGTGTCCTCCCAGCTTGTGCACAATCATCGGCACTCGAACGTGGAAGGCGGATTCACGAGTTGGCTTGTCGGATGGGTTTGAATTCCAATGCTTCTGTGCTGATTGCCCTTACTGCAATGTATGCTAAATGTGGAAGCTTAGCTGATGCTCGCAACTGTTTCAATCGGCTTAATAGAAGTGAAAAGAGTTTGGTTGCTTGGAATACCATGATTACTGCTTATGCTTCGTATGGACATGGGCGGGAAGCCGTGTCAACCTTTCAGGAGATGATCGAAGCAGGCATACGGCCCGACGACATTACATTCACAGGATTGTTATCCGCTTGCAGCCATTCAGGTCTTGTTGACATTGGCTTGAATTACTTCAACTACATGAGCACCACATATTCGACGAATCCCAGAGCTGAGCATTATGCTTGTGTTGTCGATCTCTTAGGTCGGGCTGGAAGATTAGCTGAAGCAAGTAAACTTGTAGATGAAATGCCAATGCCAGCAGGACCGAGCATTTGGGGTTCGTTATTAGCTGCCTGCCGAAAATACCGCAATCTAGAAATGGCAGAAACTGCAGCAAGAAAGCTATTTGTCCTCGAACCCGAAAACACTGGCAACTATGTCCTGCTCTCAAACATGTACGCCGAAGCTGGAAGGTGGCAGGAAGTTGACAAGCTGAGAGCGATTCTGATATCCCAGGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGGTCAATGGCATAGCGCATATGTTTCTCGGTGGCGACACGTCCCACCCTCAAACCAAGGAAATCTACATGTTCTTGGAGGCATTGCCGGAGAAGATGAAGGCAGCTGGCTACACTCCTGATACTAGCTTTGTGTTGCATGATATCAGCGAGGAAGAGAAAGAATTTAACCTCATTGCACACAGTGAGAAGCTTGCGGTTGCATTCGGAATCCTCAACACTCCTTCCGAAACCGTTCTCCGAGTTACGAAGAACTTGAGAATCTGTGGGGATTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGACGGGAAGTCGTTGTTCGAGATGTGAATCGGTTCCATCA
CTTCAAAGCGGGTTCGTGTTCTTGTGGAGATTACTGGTGA

mRNA sequence

ATGCTCAATGGCATTCGGTTATCGATCTTCATCCCAAACCCTAATCGCCTTCTCTTTCGGATCCTCCATTCTTACCTCGGTTCTTCTCACATCGACATTGCCCCTCCGCCATCATCCCCACCATTCAAATGCTCAATCTCGCCCCGTTCCCTCTCTGCAACTCTTCGCAACCTCCTGCAGCCGCTCTCTGCGCCGGATCCACCTCCGATTCTCTCTTATGCCTCGGTTTTCCAGTTCCTTACTGGCCAAAATCTGTTGAAATTGGGCCAACAAGTTCATGCCCATATGCTTCTCCGTGGCCTTGAGCCCACTGCACTTGTTGGTTCCAAGATGGTTGCGTTTTATGCGAGTTCTGGTGATATTGATTCATCTGTTGCGGTTTTCAATCGGATTAGTGAGCCTTCTTCTCTCTTGTTTAATTCTATGATTCGAGCCTATGCGCGATATGGGTTTGCGGAGAGAACGGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACAGGGGATTACTTTACTTTTCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGTTTGGATGGGGAAATGTGTTCATGGACTGGTTTTGAGAGCTGGGTTGGAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGTGAAATAAATGATGCGCGTAAGGTGTTTGATAAAATGACTGTTAGAGATGTTTCGTCTTGGAATGCTTTACTTGCTGGTTACATGAAGGGGGGGTTTATAGATGCTGCTGTGGCGATTTTTGAGAGAATGCCGTGGAGGAATATCGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGCTGAAAGAAGATTCAGGAGTAAGACCCAATTGGGTGACTATAATGAGTGTCCTCCCAGCTTGTGCACAATCATCGGCACTCGAACGTGGAAGGCGGATTCACGAGTTGGCTTGTCGGATGGGTTTGAATTCCAATGCTTCTGTGCTGATTGCCCTTACTGCAATGTATGCTAAATGTGGAAGCTTAGCTGATGCTCGCAACTGTTTCAATCGGCTTAATAGAAGTGAAAAGAGTTTGGTTGCTTGGAATACCATGATTACTGCTTATGCTTCGTATGGACATGGGCGGGAAGCCGTGTCAACCTTTCAGGAGATGATCGAAGCAGGCATACGGCCCGACGACATTACATTCACAGGATTGTTATCCGCTTGCAGCCATTCAGGTCTTGTTGACATTGGCTTGAATTACTTCAACTACATGAGCACCACATATTCGACGAATCCCAGAGCTGAGCATTATGCTTGTGTTGTCGATCTCTTAGGTCGGGCTGGAAGATTAGCTGAAGCAAGTAAACTTGTAGATGAAATGCCAATGCCAGCAGGACCGAGCATTTGGGGTTCGTTATTAGCTGCCTGCCGAAAATACCGCAATCTAGAAATGGCAGAAACTGCAGCAAGAAAGCTATTTGTCCTCGAACCCGAAAACACTGGCAACTATGTCCTGCTCTCAAACATGTACGCCGAAGCTGGAAGGTGGCAGGAAGTTGACAAGCTGAGAGCGATTCTGATATCCCAGGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGGTCAATGGCATAGCGCATATGTTTCTCGGTGGCGACACGTCCCACCCTCAAACCAAGGAAATCTACATGTTCTTGGAGGCATTGCCGGAGAAGATGAAGGCAGCTGGCTACACTCCTGATACTAGCTTTGTGTTGCATGATATCAGCGAGGAAGAGAAAGAATTTAACCTCATTGCACACAGTGAGAAGCTTGCGGTTGCATTCGGAATCCTCAACACTCCTTCCGAAACCGTTCTCCGAGTTACGAAGAACTTGAGAATCTGTGGGGATTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGACGGGAAGTCGTTGTTCGAGATGTGAATCGGTTCCATCA
CTTCAAAGCGGGTTCGTGTTCTTGTGGAGATTACTGGTGA

Coding sequence (CDS)

ATGCTCAATGGCATTCGGTTATCGATCTTCATCCCAAACCCTAATCGCCTTCTCTTTCGGATCCTCCATTCTTACCTCGGTTCTTCTCACATCGACATTGCCCCTCCGCCATCATCCCCACCATTCAAATGCTCAATCTCGCCCCGTTCCCTCTCTGCAACTCTTCGCAACCTCCTGCAGCCGCTCTCTGCGCCGGATCCACCTCCGATTCTCTCTTATGCCTCGGTTTTCCAGTTCCTTACTGGCCAAAATCTGTTGAAATTGGGCCAACAAGTTCATGCCCATATGCTTCTCCGTGGCCTTGAGCCCACTGCACTTGTTGGTTCCAAGATGGTTGCGTTTTATGCGAGTTCTGGTGATATTGATTCATCTGTTGCGGTTTTCAATCGGATTAGTGAGCCTTCTTCTCTCTTGTTTAATTCTATGATTCGAGCCTATGCGCGATATGGGTTTGCGGAGAGAACGGTTGCCACTTATTTTTCTATGCATTCTTGGGGCTTTACAGGGGATTACTTTACTTTTCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGTTTGGATGGGGAAATGTGTTCATGGACTGGTTTTGAGAGCTGGGTTGGAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGTGAAATAAATGATGCGCGTAAGGTGTTTGATAAAATGACTGTTAGAGATGTTTCGTCTTGGAATGCTTTACTTGCTGGTTACATGAAGGGGGGGTTTATAGATGCTGCTGTGGCGATTTTTGAGAGAATGCCGTGGAGGAATATCGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCAGGCATTGAGTTTGTTTGATGAAATGCTGAAAGAAGATTCAGGAGTAAGACCCAATTGGGTGACTATAATGAGTGTCCTCCCAGCTTGTGCACAATCATCGGCACTCGAACGTGGAAGGCGGATTCACGAGTTGGCTTGTCGGATGGGTTTGAATTCCAATGCTTCTGTGCTGATTGCCCTTACTGCAATGTATGCTAAATGTGGAAGCTTAGCTGATGCTCGCAACTGTTTCAATCGGCTTAATAGAAGTGAAAAGAGTTTGGTTGCTTGGAATACCATGATTACTGCTTATGCTTCGTATGGACATGGGCGGGAAGCCGTGTCAACCTTTCAGGAGATGATCGAAGCAGGCATACGGCCCGACGACATTACATTCACAGGATTGTTATCCGCTTGCAGCCATTCAGGTCTTGTTGACATTGGCTTGAATTACTTCAACTACATGAGCACCACATATTCGACGAATCCCAGAGCTGAGCATTATGCTTGTGTTGTCGATCTCTTAGGTCGGGCTGGAAGATTAGCTGAAGCAAGTAAACTTGTAGATGAAATGCCAATGCCAGCAGGACCGAGCATTTGGGGTTCGTTATTAGCTGCCTGCCGAAAATACCGCAATCTAGAAATGGCAGAAACTGCAGCAAGAAAGCTATTTGTCCTCGAACCCGAAAACACTGGCAACTATGTCCTGCTCTCAAACATGTACGCCGAAGCTGGAAGGTGGCAGGAAGTTGACAAGCTGAGAGCGATTCTGATATCCCAGGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGGTCAATGGCATAGCGCATATGTTTCTCGGTGGCGACACGTCCCACCCTCAAACCAAGGAAATCTACATGTTCTTGGAGGCATTGCCGGAGAAGATGAAGGCAGCTGGCTACACTCCTGATACTAGCTTTGTGTTGCATGATATCAGCGAGGAAGAGAAAGAATTTAACCTCATTGCACACAGTGAGAAGCTTGCGGTTGCATTCGGAATCCTCAACACTCCTTCCGAAACCGTTCTCCGAGTTACGAAGAACTTGAGAATCTGTGGGGATTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGACGGGAAGTCGTTGTTCGAGATGTGAATCGGTTCCATCA
CTTCAAAGCGGGTTCGTGTTCTTGTGGAGATTACTGGTGA
BLAST of CmoCh16G004070 vs. Swiss-Prot
Match: PPR53_ARATH (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 497.3 bits (1279), Expect = 2.7e-139
Identity = 251/579 (43.35%), Positives = 356/579 (61.49%), Query Frame = 1

Query: 109 SKMVAFYASSGDIDSSVAVFNRIS----EPSSLLFNSMIRAYARYGFAERTVATYFSMHS 168
           S ++  YA  G ++  V + + +     E + + +N ++  + R G+ +  V  +  +H 
Sbjct: 186 SALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMFQKIHH 245

Query: 169 WGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEIND 228
            GF  D  T   VL S  D   + MG+ +HG V++ GL  D  V +++IDMYGK G +  
Sbjct: 246 LGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYG 305

Query: 229 ARKVFDKMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPWR----NIVSWTTMISGYSQ 288
              +F++  + +    NA + G  + G +D A+ +FE    +    N+VSWT++I+G +Q
Sbjct: 306 IISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCAQ 365

Query: 289 SGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNA 348
           +G   +AL LF EM  + +GV+PN VTI S+LPAC   +AL  GR  H  A R+ L  N 
Sbjct: 366 NGKDIEALELFREM--QVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNV 425

Query: 349 SVLIALTAMYAKCGSLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMI 408
            V  AL  MYAKCG +  ++  FN +    K+LV WN+++  ++ +G  +E +S F+ ++
Sbjct: 426 HVGSALIDMYAKCGRINLSQIVFNMM--PTKNLVCWNSLMNGFSMHGKAKEVMSIFESLM 485

Query: 409 EAGIRPDDITFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRL 468
              ++PD I+FT LLSAC   GL D G  YF  MS  Y   PR EHY+C+V+LLGRAG+L
Sbjct: 486 RTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKL 545

Query: 469 AEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMY 528
            EA  L+ EMP      +WG+LL +CR   N+++AE AA KLF LEPEN G YVLLSN+Y
Sbjct: 546 QEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIY 605

Query: 529 AEAGRWQEVDKLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALP 588
           A  G W EVD +R  + S G KK+PGCSWI+V    +  L GD SHPQ  +I   ++ + 
Sbjct: 606 AAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEIS 665

Query: 589 EKMKAAGYTPDTSFVLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICG 648
           ++M+ +G+ P+  F LHD+ E+E+E  L  HSEKLAV FG+LNTP  T L+V KNLRICG
Sbjct: 666 KEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICG 725

Query: 649 DCHTAMVFISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           DCH  + FIS   GRE+ +RD NRFHHFK G CSCGD+W
Sbjct: 726 DCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760
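
The Identity and Positives percentages in an HSP header are each a count of matching columns divided by the alignment length. The sketch below recomputes the two values reported for the PPR53_ARATH hit; the `percent` helper is a hypothetical name introduced for this example, and the counts are copied from the header above.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header: 251 identical and 356 positive columns
# out of 579 aligned columns.
identity = percent(251, 579)
positives = percent(356, 579)

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same calculation applies to every HSP header in this report; only the counts change.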

BLAST of CmoCh16G004070 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 490.7 bits (1262), Expect = 2.5e-137
Identity = 242/595 (40.67%), Positives = 361/595 (60.67%), Query Frame = 1

Query: 86  LKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGDIDSSVAVFNRISEPSSLLFNSMIRA 145
           L LGQ +H   +   +     V + ++  Y S GD+DS+  VF  I E   + +NSMI  
Sbjct: 147 LSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMING 206

Query: 146 YARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGLEFD 205
           + + G  ++ +  +  M S      + T   VL +   + ++  G+ V   +    +  +
Sbjct: 207 FVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVN 266

Query: 206 LYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPW 265
           L +A +++DMY KCG I DA+++FD M  +D  +W  +L GY      +AA  +   MP 
Sbjct: 267 LTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQ 326

Query: 266 RNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGR 325
           ++IV+W  +IS Y Q+G   +AL +F E L+    ++ N +T++S L ACAQ  ALE GR
Sbjct: 327 KDIVAWNALISAYEQNGKPNEALIVFHE-LQLQKNMKLNQITLVSTLSACAQVGALELGR 386

Query: 326 RIHELACRMGLNSNASVLIALTAMYAKCGSLADARNCFNRLNRSEKSLVAWNTMITAYAS 385
            IH    + G+  N  V  AL  MY+KCG L  +R  FN + +  + +  W+ MI   A 
Sbjct: 387 WIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEK--RDVFVWSAMIGGLAM 446

Query: 386 YGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAE 445
           +G G EAV  F +M EA ++P+ +TFT +  ACSH+GLVD   + F+ M + Y   P  +
Sbjct: 447 HGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEK 506

Query: 446 HYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVL 505
           HYAC+VD+LGR+G L +A K ++ MP+P   S+WG+LL AC+ + NL +AE A  +L  L
Sbjct: 507 HYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLEL 566

Query: 506 EPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTS 565
           EP N G +VLLSN+YA+ G+W+ V +LR  +   G KK PGCS IE++G+ H FL GD +
Sbjct: 567 EPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNA 626

Query: 566 HPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDISEEE-KEFNLIAHSEKLAVAFGILNT 625
           HP ++++Y  L  + EK+K+ GY P+ S VL  I EEE KE +L  HSEKLA+ +G+++T
Sbjct: 627 HPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLIST 686

Query: 626 PSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
            +  V+RV KNLR+CGDCH+    IS++Y RE++VRD  RFHHF+ G CSC D+W
Sbjct: 687 EAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CmoCh16G004070 vs. Swiss-Prot
Match: PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana GN=CRR2 PE=2 SV=1)

HSP 1 Score: 481.1 bits (1237), Expect = 2.0e-134
Identity = 246/629 (39.11%), Positives = 372/629 (59.14%), Query Frame = 1

Query: 55  LRNLLQPLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAF 114
           L+  ++ LS    P   +Y  +      ++ L    +VH H+L  G +    + +K++  
Sbjct: 62  LKQAIRVLSQESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGM 121

Query: 115 YASSGDIDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTF 174
           Y+  G +D +  VF++  + +  ++N++ RA    G  E  +  Y+ M+  G   D FT+
Sbjct: 122 YSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTY 181

Query: 175 PFVLKSSV----DLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFD 234
            +VLK+ V     +  +  GK +H  + R G    +Y+ T+L+DMY + G          
Sbjct: 182 TYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFG---------- 241

Query: 235 KMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSL 294
                                 +D A  +F  MP RN+VSW+ MI+ Y+++G A +AL  
Sbjct: 242 ---------------------CVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRT 301

Query: 295 FDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMY 354
           F EM++E     PN VT++SVL ACA  +ALE+G+ IH    R GL+S   V+ AL  MY
Sbjct: 302 FREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPVISALVTMY 361

Query: 355 AKCGSLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDIT 414
            +CG L   +  F+R++  ++ +V+WN++I++Y  +G+G++A+  F+EM+  G  P  +T
Sbjct: 362 GRCGKLEVGQRVFDRMH--DRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVT 421

Query: 415 FTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEM 474
           F  +L ACSH GLV+ G   F  M   +   P+ EHYAC+VDLLGRA RL EA+K+V +M
Sbjct: 422 FVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDM 481

Query: 475 PMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVD 534
               GP +WGSLL +CR + N+E+AE A+R+LF LEP+N GNYVLL+++YAEA  W EV 
Sbjct: 482 RTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVK 541

Query: 535 KLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTP 594
           +++ +L  +G +K PG  W+EV    + F+  D  +P  ++I+ FL  L E MK  GY P
Sbjct: 542 RVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMKEKGYIP 601

Query: 595 DTSFVLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFIS 654
            T  VL+++  EEKE  ++ HSEKLA+AFG++NT     +R+TKNLR+C DCH    FIS
Sbjct: 602 QTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHLFTKFIS 657

Query: 655 EIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           +   +E++VRDVNRFH FK G CSCGDYW
Sbjct: 662 KFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of CmoCh16G004070 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 479.9 bits (1234), Expect = 4.5e-134
Identity = 249/610 (40.82%), Positives = 358/610 (58.69%), Query Frame = 1

Query: 70  ILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGDIDSSVAVFN 129
           + +  SVF       L+ LG+ VH+  +           + ++  Y+  GD+DS+ AVF 
Sbjct: 296 LATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFR 355

Query: 130 RISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWM 189
            +S+ S + + SMI  YAR G A   V  +  M   G + D +T   VL        +  
Sbjct: 356 EMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE 415

Query: 190 GKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSWNALLAGYMK 249
           GK VH  +    L FD++V+ +L+DMY KCG + +A  VF +M V+D+ SWN        
Sbjct: 416 GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWN-------- 475

Query: 250 GGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIM 309
                                  T+I GYS++  A +ALSLF+ +L E+    P+  T+ 
Sbjct: 476 -----------------------TIIGGYSKNCYANEALSLFN-LLLEEKRFSPDERTVA 535

Query: 310 SVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADARNCFNRLNRS 369
            VLPACA  SA ++GR IH    R G  S+  V  +L  MYAKCG+L  A   F+ +  +
Sbjct: 536 CVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDI--A 595

Query: 370 EKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSHSGLVDIGLN 429
            K LV+W  MI  Y  +G G+EA++ F +M +AGI  D+I+F  LL ACSHSGLVD G  
Sbjct: 596 SKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWR 655

Query: 430 YFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKY 489
           +FN M       P  EHYAC+VD+L R G L +A + ++ MP+P   +IWG+LL  CR +
Sbjct: 656 FFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIH 715

Query: 490 RNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQGTKKSPGCSW 549
            ++++AE  A K+F LEPENTG YVL++N+YAEA +W++V +LR  +  +G +K+PGCSW
Sbjct: 716 HDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSW 775

Query: 550 IEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDISEEEKEFNLI 609
           IE+ G  ++F+ GD+S+P+T+ I  FL  +  +M   GY+P T + L D  E EKE  L 
Sbjct: 776 IEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALC 835

Query: 610 AHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFK 669
            HSEKLA+A GI+++    ++RVTKNLR+CGDCH    F+S++  RE+V+RD NRFH FK
Sbjct: 836 GHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFK 871

Query: 670 AGSCSCGDYW 680
            G CSC  +W
Sbjct: 896 DGHCSCRGFW 871

BLAST of CmoCh16G004070 vs. Swiss-Prot
Match: PPR57_ARATH (Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana GN=PCMP-H74 PE=2 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 2.2e-133
Identity = 236/571 (41.33%), Positives = 349/571 (61.12%), Query Frame = 1

Query: 111 MVAFYASSGDIDSSVAVFNRISEPSSLL-FNSMIRAYARYGFAERTVATYFSMHSWGFTG 170
           M+  Y  +G  D    +   + +   L+ +N+MI  Y   GF +  +     M S G   
Sbjct: 225 MMTGYVKNGYFDLGEELLEGMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIEL 284

Query: 171 DYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVF 230
           D FT+P V+++      + +GK VH  VLR   +F  +   SL+ +Y KCG+ ++AR +F
Sbjct: 285 DEFTYPSVIRACATAGLLQLGKQVHAYVLRRE-DFSFHFDNSLVSLYYKCGKFDEARAIF 344

Query: 231 DKMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALS 290
           +KM  +D+ SWNALL+GY+  G I  A  IF+ M  +NI+SW  MISG +++G  ++ L 
Sbjct: 345 EKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLK 404

Query: 291 LFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAM 350
           LF  M +E  G  P        + +CA   A   G++ H    ++G +S+ S   AL  M
Sbjct: 405 LFSCMKRE--GFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITM 464

Query: 351 YAKCGSLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDI 410
           YAKCG + +AR  F  +   +   V+WN +I A   +GHG EAV  ++EM++ GIRPD I
Sbjct: 465 YAKCGVVEEARQVFRTMPCLDS--VSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRI 524

Query: 411 TFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDE 470
           T   +L+ACSH+GLVD G  YF+ M T Y   P A+HYA ++DLL R+G+ ++A  +++ 
Sbjct: 525 TLLTVLTACSHAGLVDQGRKYFDSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIES 584

Query: 471 MPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEV 530
           +P      IW +LL+ CR + N+E+   AA KLF L PE+ G Y+LLSNM+A  G+W+EV
Sbjct: 585 LPFKPTAEIWEALLSGCRVHGNMELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEV 644

Query: 531 DKLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYT 590
            ++R ++  +G KK   CSWIE+    H FL  DTSHP+ + +Y++L+ L ++M+  GY 
Sbjct: 645 ARVRKLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHPEAEAVYIYLQDLGKEMRRLGYV 704

Query: 591 PDTSFVLHDI-SEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVF 650
           PDTSFVLHD+ S+  KE  L  HSEK+AVAFG++  P  T +R+ KNLR CGDCH    F
Sbjct: 705 PDTSFVLHDVESDGHKEDMLTTHSEKIAVAFGLMKLPPGTTIRIFKNLRTCGDCHNFFRF 764

Query: 651 ISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           +S +  R++++RD  RFHHF+ G CSCG++W
Sbjct: 765 LSWVVQRDIILRDRKRFHHFRNGECSCGNFW 790

BLAST of CmoCh16G004070 vs. TrEMBL
Match: A0A0A0KEZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G430650 PE=4 SV=1)

HSP 1 Score: 1252.7 bits (3240), Expect = 0.0e+00
Identity = 608/679 (89.54%), Positives = 642/679 (94.55%), Query Frame = 1

Query: 1   MLNGIRLSIFIPNPNRLLFRILHSYLGSSHIDIAPPPSSPPFKCSISPRSLSATLRNLLQ 60
           M NGIRLSI IP P+ LLFRILHSY GS+HID  PPPSSPPFKCSISP ++SATL+NLLQ
Sbjct: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGD 120
           PLSAP PPPILSYA VFQFLTG N+LKLG QVHAHMLLRGL+PTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSV+VFN I EPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSW 240
           SV+LLSVWMGKCVHGL+LR GL+FDLYVATSLI +YGKCGEINDA KVFD MT+RDVSSW
Sbjct: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240

Query: 241 NALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSG 300
           NALLAGY K G IDAA+AIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KEDSG
Sbjct: 241 NALLAGYTKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQ S LERGR+IHELACRMGLNSNASVLIALTAMYAKCGSL DAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSH 420
           NCF++LNR+EK+L+AWNTMITAYASYGHG +AVSTF+EMI+AGI+PDDITFTGLLS CSH
Sbjct: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 SGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWG 480
           SGLVD+GL YFN+MSTTYS NPR EHYACV DLLGRAGRLAEASKLV EMPMPAGPSIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQG 540
           SLLAACRK+RNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI+ SQG
Sbjct: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDIS 600
           TKKSPGCSWIE+NG AHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTS+VLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600

Query: 601 EEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 660
           EEEKEFNLIAHSEKLAVAFGILNTP+ETVLRVTKNLRICGDCHTAMVFISEIYGREV+VR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

Query: 661 DVNRFHHFKAGSCSCGDYW 680
           D+NRFHHFK G CSCGDYW
Sbjct: 661 DINRFHHFKGGCCSCGDYW 679

BLAST of CmoCh16G004070 vs. TrEMBL
Match: M5X3I7_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002838mg PE=4 SV=1)

HSP 1 Score: 912.9 bits (2358), Expect = 2.3e-262
Identity = 441/625 (70.56%), Positives = 513/625 (82.08%), Query Frame = 1

Query: 56  RNLLQPLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFY 115
           R LL+ L A DP  I  YA +FQ LT QNLLKLGQQVHA M LRGLEP A +G+KMVA Y
Sbjct: 4   RTLLKSLLAQDPTCISFYAPIFQSLTSQNLLKLGQQVHAQMALRGLEPNAFLGAKMVAMY 63

Query: 116 ASSGDIDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFP 175
           ASS ++DS+V +F+R++ PS+LL+NS+IRAY  YG++E+T+  Y  MH  G  GD FT+P
Sbjct: 64  ASSDNLDSAVNIFHRVNNPSTLLYNSIIRAYTLYGYSEKTMEIYGQMHRLGLKGDNFTYP 123

Query: 176 FVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVR 235
           FVLK   +L S+W+GKCVH L LR GL  D+YV TSLIDMY KCGE++DAR  FDKMTVR
Sbjct: 124 FVLKCCANLSSIWLGKCVHSLSLRIGLASDMYVGTSLIDMYVKCGEMSDARSSFDKMTVR 183

Query: 236 DVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEML 295
           DVSSWNAL+AGYMK G I  A  +F RMP +NIVSWT MISGY+Q+GLA+QAL LFDEML
Sbjct: 184 DVSSWNALIAGYMKDGEICFAEDLFRRMPCKNIVSWTAMISGYTQNGLAEQALVLFDEML 243

Query: 296 KEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGS 355
           ++DS V+PNWVTIMSVLPACA S+ALERGR+IH  A R GL+SN S+  AL AMYAKCGS
Sbjct: 244 RKDSEVKPNWVTIMSVLPACAHSAALERGRQIHNFASRTGLDSNTSIQTALLAMYAKCGS 303

Query: 356 LADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLL 415
           L+DAR CF R++++E SLVAWNTMITAYAS+G G EAVSTF++MI AG++PD+ITFTGLL
Sbjct: 304 LSDARQCFERVHQTENSLVAWNTMITAYASHGRGSEAVSTFEDMIGAGLQPDNITFTGLL 363

Query: 416 SACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAG 475
           S CSHSGLVD GL YFN M T YS  PR EHYACVVDLLGRAGRL EA  LV +MPM AG
Sbjct: 364 SGCSHSGLVDGGLKYFNCMKTIYSIEPRVEHYACVVDLLGRAGRLVEAIDLVSKMPMQAG 423

Query: 476 PSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI 535
           PSIWG+LL+ACRK+ NLE+AE AARKLF+LEP+N+GNYVLLSN+YA+AG W+EVD LRA+
Sbjct: 424 PSIWGALLSACRKHHNLEIAEIAARKLFILEPDNSGNYVLLSNIYADAGMWKEVDDLRAL 483

Query: 536 LISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIY-MFLEALPEKMKAAGYTPDTSF 595
           L SQG KK+PGCSWIEVNG AH+FLGGDT HPQ KEIY + LE LP K+KAAGY PDTSF
Sbjct: 484 LKSQGMKKNPGCSWIEVNGKAHLFLGGDTCHPQAKEIYEVLLEELPNKIKAAGYVPDTSF 543

Query: 596 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYG 655
           VLHD+SEEEKE NL  HSEKLA+AFG+LN     VLRVTKNLRICGDCHTA   IS IY 
Sbjct: 544 VLHDVSEEEKEHNLTTHSEKLAIAFGLLNASPGVVLRVTKNLRICGDCHTATKLISRIYE 603

Query: 656 REVVVRDVNRFHHFKAGSCSCGDYW 680
           RE++VRD+NRFHHF+ G CSCGDYW
Sbjct: 604 REIIVRDLNRFHHFRDGCCSCGDYW 628

BLAST of CmoCh16G004070 vs. TrEMBL
Match: K4B1Y4_SOLLC (Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1)

HSP 1 Score: 880.6 bits (2274), Expect = 1.3e-252
Identity = 421/625 (67.36%), Positives = 500/625 (80.00%), Query Frame = 1

Query: 55  LRNLLQPLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAF 114
           L+ +LQPL     PP  +YAS+FQFL G+N +KLGQQVHAHM +RG+ P  LV +KMVA 
Sbjct: 2   LKIILQPLYQNSFPPS-TYASIFQFLVGKNFVKLGQQVHAHMAVRGVSPNGLVAAKMVAM 61

Query: 115 YASSGDIDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTF 174
           YASSG+IDS+  +F+  +EPSSLL+N+MIRA   YG  +RT+  +F MHS GF GD FTF
Sbjct: 62  YASSGEIDSASYIFDSATEPSSLLYNAMIRALTLYGITKRTIEIFFQMHSLGFRGDNFTF 121

Query: 175 PFVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTV 234
           PFV KS  DL  VW GKCVH L+LR+G  FD+YV TSL+DMY KCG++ DARK+FD+M V
Sbjct: 122 PFVFKSCADLSDVWCGKCVHSLILRSGFVFDMYVGTSLVDMYVKCGDLIDARKLFDEMPV 181

Query: 235 RDVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM 294
           RDVS+WN L+AGYMK G    A  +FE MP RNIVSWT MISGY+Q+GLA ++L LFD+M
Sbjct: 182 RDVSAWNVLIAGYMKDGLFKDAEELFEEMPIRNIVSWTAMISGYAQNGLADESLQLFDKM 241

Query: 295 LKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCG 354
           L  DS VRPNWVT+MSVLPACA S+AL+RG++IH  A   GL  N SV  AL AMYAKCG
Sbjct: 242 LDPDSEVRPNWVTVMSVLPACAHSAALDRGKKIHSFAREAGLEKNPSVQTALIAMYAKCG 301

Query: 355 SLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGL 414
           SL DAR CF+++N  EK LVAWNTMITAYAS+G GREAVSTF++M+ AGI+PD ITFTGL
Sbjct: 302 SLVDARLCFDQINPREKKLVAWNTMITAYASHGFGREAVSTFEDMLRAGIQPDKITFTGL 361

Query: 415 LSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPA 474
           LS CSHSGLVD+GL YF+ MS  Y      +HYACVVDLLGRAGRL EA  L+ +MPM A
Sbjct: 362 LSGCSHSGLVDVGLRYFDCMSLVYFVEKGHDHYACVVDLLGRAGRLVEAYNLISQMPMAA 421

Query: 475 GPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRA 534
           GPSIWGSLLAA R +RNLE+AE AA+KLF+LEP+N+GNY++LSNMYAEAG W+EV  LR 
Sbjct: 422 GPSIWGSLLAAGRSHRNLEIAELAAKKLFILEPDNSGNYIVLSNMYAEAGMWEEVTHLRI 481

Query: 535 ILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSF 594
              S+   KSPGCSWIE +G AH+FLGGDTSHPQ ++IY+FLEALP K+KAAGY PDT+F
Sbjct: 482 QQKSRRIMKSPGCSWIEFDGKAHLFLGGDTSHPQAEQIYLFLEALPAKIKAAGYMPDTTF 541

Query: 595 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYG 654
            LHD+SEEEKE NL +HSE+LA+AFGILNT   TVLRVTKNLRICGDCHTA+  +S+IY 
Sbjct: 542 ALHDVSEEEKEQNLSSHSERLAIAFGILNTSPGTVLRVTKNLRICGDCHTAIKLVSKIYE 601

Query: 655 REVVVRDVNRFHHFKAGSCSCGDYW 680
           RE++VRDVNRFHHFK GSCSC DYW
Sbjct: 602 REIIVRDVNRFHHFKDGSCSCRDYW 625

BLAST of CmoCh16G004070 vs. TrEMBL
Match: W9QT12_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_002732 PE=4 SV=1)

HSP 1 Score: 874.4 bits (2258), Expect = 9.0e-251
Identity = 432/658 (65.65%), Positives = 522/658 (79.33%), Query Frame = 1

Query: 23  HSYLGSSHIDIAPPPSS-PPFKCSISPRSLSATLRNLLQPLSAPDPPPILSYASVFQFLT 82
           HS L   H D++ P    PP+       SL +TLR+L Q     DPP + SYA++FQ LT
Sbjct: 29  HSQL---HFDVSLPKHQIPPWL------SLVSTLRSLAQ-----DPPQVSSYAAIFQSLT 88

Query: 83  GQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGDIDSSVAVFNRISEPSSLLFNS 142
           G+NLL+LG+QVH+HM LR LEP A +G+KM+A YAS+GD+ S+VAVF RI  PS+LL NS
Sbjct: 89  GKNLLRLGRQVHSHMSLRALEPDAFLGAKMIAMYASAGDLRSAVAVFRRIKYPSALLCNS 148

Query: 143 MIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAG 202
           +IRAY+ + F ++T+  YF M S G   D+FT+PFVLKS  DL  V MG+  HGL LR G
Sbjct: 149 IIRAYSWHWFPKKTIGVYFRMRSLGLKADHFTYPFVLKSCADLSDVRMGRYAHGLSLRTG 208

Query: 203 LEFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSWNALLAGYMKGGFIDAAVAIFE 262
            E D YV TSLI+MY KCG I DARK+FD MTVRD+SSWNAL+AGYMK G I  A  +F 
Sbjct: 209 FEEDFYVGTSLINMYVKCGGIGDARKMFDVMTVRDISSWNALIAGYMKIGEIRLAEDLFG 268

Query: 263 RMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSAL 322
           RM  RNIVSWT MISGY+Q+GLA QAL LFD+ML++DSG++P WVTIMSVLPACA S+AL
Sbjct: 269 RMVRRNIVSWTAMISGYAQNGLAGQALVLFDKMLEDDSGIKPTWVTIMSVLPACAHSAAL 328

Query: 323 ERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADARNCFNRLNRSEKSLVAWNTMIT 382
           ERGR IH+LA R+GL+S+ SV  AL AMYA+CGSLA+A  CF+R+++ +K LV WNTMI+
Sbjct: 329 ERGREIHKLASRIGLDSDVSVQSALIAMYARCGSLAEACQCFDRIHQHKKDLVVWNTMIS 388

Query: 383 AYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSHSGLVDIGLNYFNYMSTTYSTN 442
           AYAS+G G E+VSTF++MI A I+PD I+FTGLLS CSHSGLVD+G+ YFN M T Y+  
Sbjct: 389 AYASHGRGLESVSTFEDMIRARIQPDIISFTGLLSGCSHSGLVDLGIKYFNRMKTMYNVE 448

Query: 443 PRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARK 502
           P  +H ACVVDLLGRAGRL EA +L+D+MPM AG S WG+LLAACRK+RNLE+AE AA+K
Sbjct: 449 PEVQHCACVVDLLGRAGRLVEAKELIDKMPMQAGASAWGALLAACRKHRNLELAEVAAKK 508

Query: 503 LFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQGTKKSPGCSWIEVNGIAHMFLG 562
           LFVLEP ++ NYV LSNMYAEAG W+EV  LR +L  +G +K+PGCSWIEVNG AHMFLG
Sbjct: 509 LFVLEPYSSANYVHLSNMYAEAGMWKEVANLRDLLKYRGIRKTPGCSWIEVNGKAHMFLG 568

Query: 563 GDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDISEEEKEFNLIAHSEKLAVAFGI 622
           GDTSHPQT+EIYMFLE+LPEKMK AGY PDTS VLHD+SEEEKE NL +HSEKLA+AFG+
Sbjct: 569 GDTSHPQTREIYMFLESLPEKMKQAGYVPDTSPVLHDLSEEEKEHNLTSHSEKLAIAFGL 628

Query: 623 LNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           LNT   T++RVTKNLRIC DCHTA  FIS+I+ RE++VRD+NRFHHF  GSCSCGDYW
Sbjct: 629 LNTSPSTIIRVTKNLRICVDCHTATKFISKIFRREIIVRDLNRFHHFTDGSCSCGDYW 672

BLAST of CmoCh16G004070 vs. TrEMBL
Match: A0A0D2SZE2_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G028200 PE=4 SV=1)

HSP 1 Score: 871.3 bits (2250), Expect = 7.6e-250
Identity = 430/657 (65.45%), Positives = 508/657 (77.32%), Query Frame = 1

Query: 24  SYLGSSHIDIAPPPSSPPFKCSI-SPRSLSATLRNLLQPLSAPDPPPILSYASVFQFLTG 83
           ++L + H  I P  +    KC+   P   ++TL  LLQP+S  +PPP LSYA +FQFLTG
Sbjct: 21  AFLSTIHPHIDPSQT----KCTTPKPFPYTSTLPTLLQPISDQNPPPHLSYAPLFQFLTG 80

Query: 84  QNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGDIDSSVAVFNRISEPSSLLFNSM 143
           QN LKLGQQ+HAHM L GL+P A +G+KMVA YASSGD++S+V VF +I +P+SLL+NS+
Sbjct: 81  QNFLKLGQQIHAHMTLHGLQPNAFLGAKMVAMYASSGDLESAVTVFRKIKDPTSLLYNSI 140

Query: 144 IRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGL 203
           IRAY   G+  +T+  Y  MHS    GD FTFPFVLKS  ++L VWMG+CVHG  LR GL
Sbjct: 141 IRAYTNNGYPLKTIDIYREMHSLRLKGDNFTFPFVLKSCANVLDVWMGECVHGQSLRFGL 200

Query: 204 EFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSWNALLAGYMKGGFIDAAVAIFER 263
           E D YV TSLID Y K GE+ DA KVFD MTVR VSSWNAL+AGYMK G I  A  +F  
Sbjct: 201 ELDAYVGTSLIDFYVKVGELRDANKVFDLMTVRAVSSWNALIAGYMKEGEIRVAEDLFRG 260

Query: 264 MPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALE 323
           MP RNIVSWT+MISGY+Q+GLA++ALSLFDEMLKEDS V+PNWVTIMSVLPACA S++ E
Sbjct: 261 MPCRNIVSWTSMISGYTQNGLAEEALSLFDEMLKEDSEVKPNWVTIMSVLPACAHSASFE 320

Query: 324 RGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADARNCFNRLNRSEKSLVAWNTMITA 383
           RGRRI+E   R+GL SN SV  AL AMYAKCGSL  AR CF+R+  +EK+L AWNTMITA
Sbjct: 321 RGRRINEYVNRIGLESNPSVQTALIAMYAKCGSLVSARCCFDRILENEKNLCAWNTMITA 380

Query: 384 YASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNP 443
           YAS+G G E+VSTF+ M+ AG+ PD ITFTGLLS CSHSG+V+ GL YFN M T YS  P
Sbjct: 381 YASHGQGLESVSTFENMVRAGVYPDAITFTGLLSGCSHSGIVEFGLRYFNSMQTKYSVEP 440

Query: 444 RAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKL 503
           R EHYACVVDLL RAGRL EA + + ++PM  GPSIWG+LLAACRK RNLE+AE AA++L
Sbjct: 441 RHEHYACVVDLLARAGRLVEAKEFIKKIPMQPGPSIWGALLAACRKSRNLEIAEIAAKEL 500

Query: 504 FVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQGTKKSPGCSWIEVNGIAHMFLGG 563
           FVLEPEN+ NY+LLSNMYAEAG W+EVDKLRA L  +G KK+PGCSWIE+ G AH+FL G
Sbjct: 501 FVLEPENSCNYILLSNMYAEAGMWKEVDKLRARLKCEGIKKNPGCSWIEIKGKAHLFLSG 560

Query: 564 DTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDISEEEKEFNLIAHSEKLAVAFGIL 623
           D SHPQ+KEIY  LEALPEK+KAAGY P+T FVLHDISEEEKE NLI H           
Sbjct: 561 DLSHPQSKEIYNLLEALPEKIKAAGYIPNTGFVLHDISEEEKEQNLIIH----------- 620

Query: 624 NTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
                 ++R+TKNLRICGDCHT + FIS+IY RE+VVRDVNRFHHF+ G+CSCGDYW
Sbjct: 621 ------IIRITKNLRICGDCHTVIKFISKIYEREIVVRDVNRFHHFRHGACSCGDYW 656
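
The two header lines of each HSP carry the bit score, raw score, E-value, and the identity and positive counts over the alignment length. As a minimal sketch (not part of the database's own tooling), the snippet below parses such a pair of lines and recomputes the percentages from the counts. The function name `parse_hsp` and the dictionary keys are hypothetical, and the pattern accepts both the "Postives" spelling emitted on this page and the correct "Positives".

```python
import re

# Patterns for the two HSP header lines shown on this page.
# "Posi?tives" tolerates the "Postives" spelling used in the raw output.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Return the numeric fields of one HSP header (hypothetical helper)."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    identical, align_len = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "positives": positives,
        "align_len": align_len,
        # Percentages are recomputed from the counts, not copied from the text.
        "pct_identity": round(100.0 * identical / align_len, 2),
        "pct_positives": round(100.0 * positives / align_len, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 871.3 bits (2250), Expect = 7.6e-250",
    "Identity = 430/657 (65.45%), Postives = 508/657 (77.32%), Query Frame = 1",
)
print(hsp["pct_identity"], hsp["pct_positives"])
```

For the Gossypium raimondii hit above, the recomputed values agree with the percentages printed in the header (430/657 and 508/657).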

BLAST of CmoCh16G004070 vs. TAIR10
Match: AT1G20230.1 (AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 497.3 bits (1279), Expect = 1.5e-140
Identity = 251/579 (43.35%), Positives = 356/579 (61.49%), Query Frame = 1

Query: 109 SKMVAFYASSGDIDSSVAVFNRIS----EPSSLLFNSMIRAYARYGFAERTVATYFSMHS 168
           S ++  YA  G ++  V + + +     E + + +N ++  + R G+ +  V  +  +H 
Sbjct: 186 SALLCAYARKGCLEEVVRILSEMESSGIEANIVSWNGILSGFNRSGYHKEAVVMFQKIHH 245

Query: 169 WGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEIND 228
            GF  D  T   VL S  D   + MG+ +HG V++ GL  D  V +++IDMYGK G +  
Sbjct: 246 LGFCPDQVTVSSVLPSVGDSEMLNMGRLIHGYVIKQGLLKDKCVISAMIDMYGKSGHVYG 305

Query: 229 ARKVFDKMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPWR----NIVSWTTMISGYSQ 288
              +F++  + +    NA + G  + G +D A+ +FE    +    N+VSWT++I+G +Q
Sbjct: 306 IISLFNQFEMMEAGVCNAYITGLSRNGLVDKALEMFELFKEQTMELNVVSWTSIIAGCAQ 365

Query: 289 SGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNA 348
           +G   +AL LF EM  + +GV+PN VTI S+LPAC   +AL  GR  H  A R+ L  N 
Sbjct: 366 NGKDIEALELFREM--QVAGVKPNHVTIPSMLPACGNIAALGHGRSTHGFAVRVHLLDNV 425

Query: 349 SVLIALTAMYAKCGSLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMI 408
            V  AL  MYAKCG +  ++  FN +    K+LV WN+++  ++ +G  +E +S F+ ++
Sbjct: 426 HVGSALIDMYAKCGRINLSQIVFNMM--PTKNLVCWNSLMNGFSMHGKAKEVMSIFESLM 485

Query: 409 EAGIRPDDITFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRL 468
              ++PD I+FT LLSAC   GL D G  YF  MS  Y   PR EHY+C+V+LLGRAG+L
Sbjct: 486 RTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMMSEEYGIKPRLEHYSCMVNLLGRAGKL 545

Query: 469 AEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMY 528
            EA  L+ EMP      +WG+LL +CR   N+++AE AA KLF LEPEN G YVLLSN+Y
Sbjct: 546 QEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDLAEIAAEKLFHLEPENPGTYVLLSNIY 605

Query: 529 AEAGRWQEVDKLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALP 588
           A  G W EVD +R  + S G KK+PGCSWI+V    +  L GD SHPQ  +I   ++ + 
Sbjct: 606 AAKGMWTEVDSIRNKMESLGLKKNPGCSWIQVKNRVYTLLAGDKSHPQIDQITEKMDEIS 665

Query: 589 EKMKAAGYTPDTSFVLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICG 648
           ++M+ +G+ P+  F LHD+ E+E+E  L  HSEKLAV FG+LNTP  T L+V KNLRICG
Sbjct: 666 KEMRKSGHRPNLDFALHDVEEQEQEQMLWGHSEKLAVVFGLLNTPDGTPLQVIKNLRICG 725

Query: 649 DCHTAMVFISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           DCH  + FIS   GRE+ +RD NRFHHFK G CSCGD+W
Sbjct: 726 DCHAVIKFISSYAGREIFIRDTNRFHHFKDGICSCGDFW 760

BLAST of CmoCh16G004070 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 490.7 bits (1262), Expect = 1.4e-138
Identity = 242/595 (40.67%), Positives = 361/595 (60.67%), Query Frame = 1

Query: 86  LKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGDIDSSVAVFNRISEPSSLLFNSMIRA 145
           L LGQ +H   +   +     V + ++  Y S GD+DS+  VF  I E   + +NSMI  
Sbjct: 147 LSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMING 206

Query: 146 YARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGLEFD 205
           + + G  ++ +  +  M S      + T   VL +   + ++  G+ V   +    +  +
Sbjct: 207 FVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVN 266

Query: 206 LYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPW 265
           L +A +++DMY KCG I DA+++FD M  +D  +W  +L GY      +AA  +   MP 
Sbjct: 267 LTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQ 326

Query: 266 RNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGR 325
           ++IV+W  +IS Y Q+G   +AL +F E L+    ++ N +T++S L ACAQ  ALE GR
Sbjct: 327 KDIVAWNALISAYEQNGKPNEALIVFHE-LQLQKNMKLNQITLVSTLSACAQVGALELGR 386

Query: 326 RIHELACRMGLNSNASVLIALTAMYAKCGSLADARNCFNRLNRSEKSLVAWNTMITAYAS 385
            IH    + G+  N  V  AL  MY+KCG L  +R  FN + +  + +  W+ MI   A 
Sbjct: 387 WIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEK--RDVFVWSAMIGGLAM 446

Query: 386 YGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAE 445
           +G G EAV  F +M EA ++P+ +TFT +  ACSH+GLVD   + F+ M + Y   P  +
Sbjct: 447 HGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEK 506

Query: 446 HYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVL 505
           HYAC+VD+LGR+G L +A K ++ MP+P   S+WG+LL AC+ + NL +AE A  +L  L
Sbjct: 507 HYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLEL 566

Query: 506 EPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTS 565
           EP N G +VLLSN+YA+ G+W+ V +LR  +   G KK PGCS IE++G+ H FL GD +
Sbjct: 567 EPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNA 626

Query: 566 HPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDISEEE-KEFNLIAHSEKLAVAFGILNT 625
           HP ++++Y  L  + EK+K+ GY P+ S VL  I EEE KE +L  HSEKLA+ +G+++T
Sbjct: 627 HPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLIST 686

Query: 626 PSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
            +  V+RV KNLR+CGDCH+    IS++Y RE++VRD  RFHHF+ G CSC D+W
Sbjct: 687 EAPKVIRVIKNLRVCGDCHSVAKLISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738

BLAST of CmoCh16G004070 vs. TAIR10
Match: AT3G46790.1 (AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 481.1 bits (1237), Expect = 1.1e-135
Identity = 246/629 (39.11%), Positives = 372/629 (59.14%), Query Frame = 1

Query: 55  LRNLLQPLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAF 114
           L+  ++ LS    P   +Y  +      ++ L    +VH H+L  G +    + +K++  
Sbjct: 62  LKQAIRVLSQESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGM 121

Query: 115 YASSGDIDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTF 174
           Y+  G +D +  VF++  + +  ++N++ RA    G  E  +  Y+ M+  G   D FT+
Sbjct: 122 YSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTY 181

Query: 175 PFVLKSSV----DLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFD 234
            +VLK+ V     +  +  GK +H  + R G    +Y+ T+L+DMY + G          
Sbjct: 182 TYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFG---------- 241

Query: 235 KMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSL 294
                                 +D A  +F  MP RN+VSW+ MI+ Y+++G A +AL  
Sbjct: 242 ---------------------CVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRT 301

Query: 295 FDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMY 354
           F EM++E     PN VT++SVL ACA  +ALE+G+ IH    R GL+S   V+ AL  MY
Sbjct: 302 FREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPVISALVTMY 361

Query: 355 AKCGSLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDIT 414
            +CG L   +  F+R++  ++ +V+WN++I++Y  +G+G++A+  F+EM+  G  P  +T
Sbjct: 362 GRCGKLEVGQRVFDRMH--DRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVT 421

Query: 415 FTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEM 474
           F  +L ACSH GLV+ G   F  M   +   P+ EHYAC+VDLLGRA RL EA+K+V +M
Sbjct: 422 FVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDM 481

Query: 475 PMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVD 534
               GP +WGSLL +CR + N+E+AE A+R+LF LEP+N GNYVLL+++YAEA  W EV 
Sbjct: 482 RTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVK 541

Query: 535 KLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTP 594
           +++ +L  +G +K PG  W+EV    + F+  D  +P  ++I+ FL  L E MK  GY P
Sbjct: 542 RVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMKEKGYIP 601

Query: 595 DTSFVLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFIS 654
            T  VL+++  EEKE  ++ HSEKLA+AFG++NT     +R+TKNLR+C DCH    FIS
Sbjct: 602 QTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHLFTKFIS 657

Query: 655 EIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           +   +E++VRDVNRFH FK G CSCGDYW
Sbjct: 662 KFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of CmoCh16G004070 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 479.9 bits (1234), Expect = 2.5e-135
Identity = 249/610 (40.82%), Positives = 358/610 (58.69%), Query Frame = 1

Query: 70  ILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGDIDSSVAVFN 129
           + +  SVF       L+ LG+ VH+  +           + ++  Y+  GD+DS+ AVF 
Sbjct: 296 LATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFR 355

Query: 130 RISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKSSVDLLSVWM 189
            +S+ S + + SMI  YAR G A   V  +  M   G + D +T   VL        +  
Sbjct: 356 EMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE 415

Query: 190 GKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSWNALLAGYMK 249
           GK VH  +    L FD++V+ +L+DMY KCG + +A  VF +M V+D+ SWN        
Sbjct: 416 GKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWN-------- 475

Query: 250 GGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSGVRPNWVTIM 309
                                  T+I GYS++  A +ALSLF+ +L E+    P+  T+ 
Sbjct: 476 -----------------------TIIGGYSKNCYANEALSLFN-LLLEEKRFSPDERTVA 535

Query: 310 SVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADARNCFNRLNRS 369
            VLPACA  SA ++GR IH    R G  S+  V  +L  MYAKCG+L  A   F+ +  +
Sbjct: 536 CVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDI--A 595

Query: 370 EKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSHSGLVDIGLN 429
            K LV+W  MI  Y  +G G+EA++ F +M +AGI  D+I+F  LL ACSHSGLVD G  
Sbjct: 596 SKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWR 655

Query: 430 YFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKY 489
           +FN M       P  EHYAC+VD+L R G L +A + ++ MP+P   +IWG+LL  CR +
Sbjct: 656 FFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIH 715

Query: 490 RNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQGTKKSPGCSW 549
            ++++AE  A K+F LEPENTG YVL++N+YAEA +W++V +LR  +  +G +K+PGCSW
Sbjct: 716 HDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCSW 775

Query: 550 IEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDISEEEKEFNLI 609
           IE+ G  ++F+ GD+S+P+T+ I  FL  +  +M   GY+P T + L D  E EKE  L 
Sbjct: 776 IEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAEEMEKEEALC 835

Query: 610 AHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFK 669
            HSEKLA+A GI+++    ++RVTKNLR+CGDCH    F+S++  RE+V+RD NRFH FK
Sbjct: 836 GHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLRDSNRFHQFK 871

Query: 670 AGSCSCGDYW 680
            G CSC  +W
Sbjct: 896 DGHCSCRGFW 871

BLAST of CmoCh16G004070 vs. TAIR10
Match: AT1G25360.1 (AT1G25360.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 477.6 bits (1228), Expect = 1.2e-134
Identity = 236/571 (41.33%), Positives = 349/571 (61.12%), Query Frame = 1

Query: 111 MVAFYASSGDIDSSVAVFNRISEPSSLL-FNSMIRAYARYGFAERTVATYFSMHSWGFTG 170
           M+  Y  +G  D    +   + +   L+ +N+MI  Y   GF +  +     M S G   
Sbjct: 225 MMTGYVKNGYFDLGEELLEGMDDNMKLVAYNAMISGYVNRGFYQEALEMVRRMVSSGIEL 284

Query: 171 DYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVF 230
           D FT+P V+++      + +GK VH  VLR   +F  +   SL+ +Y KCG+ ++AR +F
Sbjct: 285 DEFTYPSVIRACATAGLLQLGKQVHAYVLRRE-DFSFHFDNSLVSLYYKCGKFDEARAIF 344

Query: 231 DKMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALS 290
           +KM  +D+ SWNALL+GY+  G I  A  IF+ M  +NI+SW  MISG +++G  ++ L 
Sbjct: 345 EKMPAKDLVSWNALLSGYVSSGHIGEAKLIFKEMKEKNILSWMIMISGLAENGFGEEGLK 404

Query: 291 LFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAM 350
           LF  M +E  G  P        + +CA   A   G++ H    ++G +S+ S   AL  M
Sbjct: 405 LFSCMKRE--GFEPCDYAFSGAIKSCAVLGAYCNGQQYHAQLLKIGFDSSLSAGNALITM 464

Query: 351 YAKCGSLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDI 410
           YAKCG + +AR  F  +   +   V+WN +I A   +GHG EAV  ++EM++ GIRPD I
Sbjct: 465 YAKCGVVEEARQVFRTMPCLDS--VSWNALIAALGQHGHGAEAVDVYEEMLKKGIRPDRI 524

Query: 411 TFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDE 470
           T   +L+ACSH+GLVD G  YF+ M T Y   P A+HYA ++DLL R+G+ ++A  +++ 
Sbjct: 525 TLLTVLTACSHAGLVDQGRKYFDSMETVYRIPPGADHYARLIDLLCRSGKFSDAESVIES 584

Query: 471 MPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEV 530
           +P      IW +LL+ CR + N+E+   AA KLF L PE+ G Y+LLSNM+A  G+W+EV
Sbjct: 585 LPFKPTAEIWEALLSGCRVHGNMELGIIAADKLFGLIPEHDGTYMLLSNMHAATGQWEEV 644

Query: 531 DKLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYT 590
            ++R ++  +G KK   CSWIE+    H FL  DTSHP+ + +Y++L+ L ++M+  GY 
Sbjct: 645 ARVRKLMRDRGVKKEVACSWIEMETQVHTFLVDDTSHPEAEAVYIYLQDLGKEMRRLGYV 704

Query: 591 PDTSFVLHDI-SEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVF 650
           PDTSFVLHD+ S+  KE  L  HSEK+AVAFG++  P  T +R+ KNLR CGDCH    F
Sbjct: 705 PDTSFVLHDVESDGHKEDMLTTHSEKIAVAFGLMKLPPGTTIRIFKNLRTCGDCHNFFRF 764

Query: 651 ISEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           +S +  R++++RD  RFHHF+ G CSCG++W
Sbjct: 765 LSWVVQRDIILRDRKRFHHFRNGECSCGNFW 790

BLAST of CmoCh16G004070 vs. NCBI nr
Match: gi|449445033|ref|XP_004140278.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis sativus])

HSP 1 Score: 1252.7 bits (3240), Expect = 0.0e+00
Identity = 608/679 (89.54%), Positives = 642/679 (94.55%), Query Frame = 1

Query: 1   MLNGIRLSIFIPNPNRLLFRILHSYLGSSHIDIAPPPSSPPFKCSISPRSLSATLRNLLQ 60
           M NGIRLSI IP P+ LLFRILHSY GS+HID  PPPSSPPFKCSISP ++SATL+NLLQ
Sbjct: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGD 120
           PLSAP PPPILSYA VFQFLTG N+LKLG QVHAHMLLRGL+PTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSV+VFN I EPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSW 240
           SV+LLSVWMGKCVHGL+LR GL+FDLYVATSLI +YGKCGEINDA KVFD MT+RDVSSW
Sbjct: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240

Query: 241 NALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSG 300
           NALLAGY K G IDAA+AIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KEDSG
Sbjct: 241 NALLAGYTKSGCIDAALAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMVKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQ S LERGR+IHELACRMGLNSNASVLIALTAMYAKCGSL DAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSH 420
           NCF++LNR+EK+L+AWNTMITAYASYGHG +AVSTF+EMI+AGI+PDDITFTGLLS CSH
Sbjct: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 SGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWG 480
           SGLVD+GL YFN+MSTTYS NPR EHYACV DLLGRAGRLAEASKLV EMPMPAGPSIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQG 540
           SLLAACRK+RNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI+ SQG
Sbjct: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDIS 600
           TKKSPGCSWIE+NG AHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTS+VLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600

Query: 601 EEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 660
           EEEKEFNLIAHSEKLAVAFGILNTP+ETVLRVTKNLRICGDCHTAMVFISEIYGREV+VR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

Query: 661 DVNRFHHFKAGSCSCGDYW 680
           D+NRFHHFK G CSCGDYW
Sbjct: 661 DINRFHHFKGGCCSCGDYW 679

BLAST of CmoCh16G004070 vs. NCBI nr
Match: gi|659112126|ref|XP_008456075.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis melo])

HSP 1 Score: 1246.9 bits (3225), Expect = 0.0e+00
Identity = 606/679 (89.25%), Positives = 638/679 (93.96%), Query Frame = 1

Query: 1   MLNGIRLSIFIPNPNRLLFRILHSYLGSSHIDIAPPPSSPPFKCSISPRSLSATLRNLLQ 60
           M NGIRLSI IP P  LLFRILHSY GS+HI+  PPPSSP FKCSISP ++SATL+NLLQ
Sbjct: 1   MHNGIRLSISIPTPTLLLFRILHSYSGSAHIETVPPPSSPLFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGD 120
           PLSAP PPPILSYA VFQFLTG N+LKLG QVHAHMLLRGL+PTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSV+VFN I EPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSW 240
           S DLLSVWMGKCVHGL+LR GL  DLYVATSLID+YGKCGEIN+A KVFD MT+RDVSSW
Sbjct: 181 SADLLSVWMGKCVHGLILRIGLHCDLYVATSLIDLYGKCGEINEAGKVFDNMTIRDVSSW 240

Query: 241 NALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMLKEDSG 300
           NALLAGYMK G +DAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM+KEDSG
Sbjct: 241 NALLAGYMKSGCVDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEMMKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQ S LERG +IHELACRMGLNSNASVLIALTAMYAKCGSL DAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGTQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSH 420
           NCF++LNRSEK+L+AWNTMITAYASYGHG EAVSTF+EMI+AGI+PDDITFTGLLS CSH
Sbjct: 361 NCFDKLNRSEKNLIAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 SGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWG 480
           SGLVD+GL YFN+MSTTYS NPR EHYACV DLLGRAGRLAEASKLVDEMPMPAG SIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVDEMPMPAGASIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQG 540
           SLLAACRK+RNLEMAE AARKLFVLEPEN+GNYVLLSNMYAEAGRWQEVDKLRAI+ SQG
Sbjct: 481 SLLAACRKHRNLEMAEIAARKLFVLEPENSGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDIS 600
           TKKSPGCSWIE+NG AHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTS+VLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600

Query: 601 EEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 660
           EEEKEFNLIAHSEKLAVAFGILNTP+ETVLRVTKNLRICGDCHTAMVFISEIYGREV+VR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

Query: 661 DVNRFHHFKAGSCSCGDYW 680
           D+NRFHHFK GSCSCGDYW
Sbjct: 661 DINRFHHFKGGSCSCGDYW 679

BLAST of CmoCh16G004070 vs. NCBI nr
Match: gi|596016252|ref|XP_007218862.1| (hypothetical protein PRUPE_ppa002838mg [Prunus persica])

HSP 1 Score: 912.9 bits (2358), Expect = 3.3e-262
Identity = 441/625 (70.56%), Positives = 513/625 (82.08%), Query Frame = 1

Query: 56  RNLLQPLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFY 115
           R LL+ L A DP  I  YA +FQ LT QNLLKLGQQVHA M LRGLEP A +G+KMVA Y
Sbjct: 4   RTLLKSLLAQDPTCISFYAPIFQSLTSQNLLKLGQQVHAQMALRGLEPNAFLGAKMVAMY 63

Query: 116 ASSGDIDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFP 175
           ASS ++DS+V +F+R++ PS+LL+NS+IRAY  YG++E+T+  Y  MH  G  GD FT+P
Sbjct: 64  ASSDNLDSAVNIFHRVNNPSTLLYNSIIRAYTLYGYSEKTMEIYGQMHRLGLKGDNFTYP 123

Query: 176 FVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVR 235
           FVLK   +L S+W+GKCVH L LR GL  D+YV TSLIDMY KCGE++DAR  FDKMTVR
Sbjct: 124 FVLKCCANLSSIWLGKCVHSLSLRIGLASDMYVGTSLIDMYVKCGEMSDARSSFDKMTVR 183

Query: 236 DVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEML 295
           DVSSWNAL+AGYMK G I  A  +F RMP +NIVSWT MISGY+Q+GLA+QAL LFDEML
Sbjct: 184 DVSSWNALIAGYMKDGEICFAEDLFRRMPCKNIVSWTAMISGYTQNGLAEQALVLFDEML 243

Query: 296 KEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGS 355
           ++DS V+PNWVTIMSVLPACA S+ALERGR+IH  A R GL+SN S+  AL AMYAKCGS
Sbjct: 244 RKDSEVKPNWVTIMSVLPACAHSAALERGRQIHNFASRTGLDSNTSIQTALLAMYAKCGS 303

Query: 356 LADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLL 415
           L+DAR CF R++++E SLVAWNTMITAYAS+G G EAVSTF++MI AG++PD+ITFTGLL
Sbjct: 304 LSDARQCFERVHQTENSLVAWNTMITAYASHGRGSEAVSTFEDMIGAGLQPDNITFTGLL 363

Query: 416 SACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAG 475
           S CSHSGLVD GL YFN M T YS  PR EHYACVVDLLGRAGRL EA  LV +MPM AG
Sbjct: 364 SGCSHSGLVDGGLKYFNCMKTIYSIEPRVEHYACVVDLLGRAGRLVEAIDLVSKMPMQAG 423

Query: 476 PSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI 535
           PSIWG+LL+ACRK+ NLE+AE AARKLF+LEP+N+GNYVLLSN+YA+AG W+EVD LRA+
Sbjct: 424 PSIWGALLSACRKHHNLEIAEIAARKLFILEPDNSGNYVLLSNIYADAGMWKEVDDLRAL 483

Query: 536 LISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIY-MFLEALPEKMKAAGYTPDTSF 595
           L SQG KK+PGCSWIEVNG AH+FLGGDT HPQ KEIY + LE LP K+KAAGY PDTSF
Sbjct: 484 LKSQGMKKNPGCSWIEVNGKAHLFLGGDTCHPQAKEIYEVLLEELPNKIKAAGYVPDTSF 543

Query: 596 VLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYG 655
           VLHD+SEEEKE NL  HSEKLA+AFG+LN     VLRVTKNLRICGDCHTA   IS IY 
Sbjct: 544 VLHDVSEEEKEHNLTTHSEKLAIAFGLLNASPGVVLRVTKNLRICGDCHTATKLISRIYE 603

Query: 656 REVVVRDVNRFHHFKAGSCSCGDYW 680
           RE++VRD+NRFHHF+ G CSCGDYW
Sbjct: 604 REIIVRDLNRFHHFRDGCCSCGDYW 628

BLAST of CmoCh16G004070 vs. NCBI nr
Match: gi|720077886|ref|XP_010241184.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Nelumbo nucifera])

HSP 1 Score: 911.4 bits (2354), Expect = 9.6e-262
Identity = 435/630 (69.05%), Positives = 518/630 (82.22%), Query Frame = 1

Query: 50  SLSATLRNLLQPLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGS 109
           S   +LR LL+P+   +PP I+SYA +FQFLTG + LKLG+QVHAHM LRGL+P A +G+
Sbjct: 13  STQVSLRILLEPIKQ-NPPQIVSYAPIFQFLTGTHSLKLGKQVHAHMTLRGLQPNAFLGA 72

Query: 110 KMVAFYASSGDIDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTG 169
           KMVA YASSGDIDS+  VF+++S PSSLL+NS+IR Y R+G+ ERT+ TYF M+S G   
Sbjct: 73  KMVAMYASSGDIDSAETVFDQVSFPSSLLYNSIIRGYTRFGYYERTLKTYFIMNSQGLRP 132

Query: 170 DYFTFPFVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVF 229
           DYFTFPFVLKSS +L  +  GKCVHG  LR GLE+DLYV TSLIDMY KCGE+++A K+F
Sbjct: 133 DYFTFPFVLKSSAELSCLRTGKCVHGKSLRIGLEYDLYVGTSLIDMYVKCGELSNAHKLF 192

Query: 230 DKMTVRDVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALS 289
           D+M V+DVSSWNAL+AGYM+ G I  A A+F+ MP RNI+SWT MISGY+QSGLA +ALS
Sbjct: 193 DRMHVKDVSSWNALIAGYMRNGVIQIAEALFQSMPKRNIISWTAMISGYTQSGLADRALS 252

Query: 290 LFDEMLKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAM 349
           LF EML+ DS V+PNWVTIMSVLPACA S+ALE G++IH  A  +GL+ + SV  AL AM
Sbjct: 253 LFGEMLRVDSEVKPNWVTIMSVLPACAHSAALEYGKKIHSYASEIGLDKSFSVQTALIAM 312

Query: 350 YAKCGSLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDI 409
           YAKCGSL DA +CF R+   EKSL+ WNTMI AYAS+G G+EAVSTF+ MI+ G++PD I
Sbjct: 313 YAKCGSLIDACHCFERIPEKEKSLITWNTMIAAYASHGCGKEAVSTFRNMIKCGVQPDAI 372

Query: 410 TFTGLLSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDE 469
           TF GLLS+CSHSGLVD+GL YFN M+  YS +PRAEHYACVVDLL RAGR+ EA +L+D 
Sbjct: 373 TFLGLLSSCSHSGLVDVGLEYFNCMTRIYSVDPRAEHYACVVDLLARAGRIVEAKELIDR 432

Query: 470 MPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEV 529
           MPM A PSIWG+LLAACR + NLE+ E AA++LF+LEPEN+GNY+LLSNMYAE GRW+EV
Sbjct: 433 MPMQASPSIWGALLAACRNHGNLEIGEIAAKQLFILEPENSGNYILLSNMYAEVGRWEEV 492

Query: 530 DKLRAILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYT 589
           + LRA+L +QG KKSPGCSW E+NG  H+FLGGDTSHPQ KEIYM L  LP+K+KAAGY 
Sbjct: 493 NNLRALLKNQGVKKSPGCSWTEINGKCHLFLGGDTSHPQMKEIYMLLGDLPKKIKAAGYI 552

Query: 590 PDTSFVLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFI 649
           PDTSFVLHD+SEEEKE NL  HSEKLA+AFG+LNT   TV+ VTKNLRICGDCHTA+ FI
Sbjct: 553 PDTSFVLHDVSEEEKEHNLTMHSEKLAIAFGLLNTSPATVIXVTKNLRICGDCHTAIKFI 612

Query: 650 SEIYGREVVVRDVNRFHHFKAGSCSCGDYW 680
           S IYGRE+VVRDVNRFHHFK GSCSCGDYW
Sbjct: 613 SRIYGREIVVRDVNRFHHFKDGSCSCGDYW 641

BLAST of CmoCh16G004070 vs. NCBI nr
Match: gi|658042725|ref|XP_008356987.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Malus domestica])

HSP 1 Score: 909.4 bits (2349), Expect = 3.6e-261
Identity = 433/626 (69.17%), Positives = 515/626 (82.27%), Query Frame = 1

Query: 55  LRNLLQPLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAF 114
           +R LL+PL A DP  +  YA +FQ LTG+NLLKLGQQVHA M LRG EP A +G+KMVA 
Sbjct: 1   MRTLLKPLLAQDPRFVSFYAPIFQSLTGKNLLKLGQQVHAQMALRGFEPDAYLGAKMVAM 60

Query: 115 YASSGDIDSSVAVFNRISEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTF 174
           YASS D+DS+VA+F+R++ PS+LL+NS+IRAY  +GF+E T+  Y  MH  G   D FT+
Sbjct: 61  YASSDDLDSAVAIFHRVNNPSTLLYNSIIRAYTLHGFSEETMEIYGRMHCLGLKXDNFTY 120

Query: 175 PFVLKSSVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTV 234
           PFVLK   +L  +W+GKCVHGL L+ GLE D+YV TSLI+MY KC +++DAR++FDKMTV
Sbjct: 121 PFVLKCCAELSRIWIGKCVHGLSLKVGLESDMYVGTSLINMYVKCCDMSDARRLFDKMTV 180

Query: 235 RDVSSWNALLAGYMKGGFIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQQALSLFDEM 294
           RDVSSWNAL+AGYMK G I  A  +F +MP RNIVSWT MISGY+Q+GLA+QAL LFDEM
Sbjct: 181 RDVSSWNALIAGYMKDGEICLAEDLFGKMPGRNIVSWTAMISGYTQNGLAEQALFLFDEM 240

Query: 295 LKEDSGVRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCG 354
           LK+DS V+PNWVTIMSVLPACA S+ALERGR+IH  A R+GL SN S+  AL AMYAKCG
Sbjct: 241 LKKDSKVKPNWVTIMSVLPACAHSAALERGRKIHNFASRIGLESNVSIQTALLAMYAKCG 300

Query: 355 SLADARNCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGL 414
           SL DAR CF R+  ++ +LVAWNTMITAYAS+G G EAVSTF++MI AG++PD+ITFTGL
Sbjct: 301 SLLDARQCFERVRXTQNNLVAWNTMITAYASHGRGSEAVSTFEDMIVAGVQPDNITFTGL 360

Query: 415 LSACSHSGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPA 474
           LS CSHSGLVD+GL YF+YM   YS  P  EHYACVVDLLGRAGRLAEA  L+ +MPM A
Sbjct: 361 LSGCSHSGLVDVGLKYFDYMKRVYSVEPGVEHYACVVDLLGRAGRLAEAKDLIXKMPMQA 420

Query: 475 GPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRA 534
           GPSIWG++L+ACRK+ NLE+AE AAR LF+LEPEN+GNYV+LSN+YAEAG W+EVD LR 
Sbjct: 421 GPSIWGAMLSACRKHHNLEIAEIAARSLFILEPENSGNYVMLSNIYAEAGMWKEVDNLRV 480

Query: 535 ILISQGTKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMF-LEALPEKMKAAGYTPDTS 594
           +L +QG KK+PGCSW EVNG AH+FLGGDTSHPQ KEIY F L+ LP+K+KAAGY PDTS
Sbjct: 481 LLKAQGVKKNPGCSWTEVNGKAHLFLGGDTSHPQAKEIYEFLLDELPKKIKAAGYVPDTS 540

Query: 595 FVLHDISEEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIY 654
           FVLHD+SEEEKE +L  HSEKLA+AFG+LNT    VLRVTKNLRICGDCHTA   IS IY
Sbjct: 541 FVLHDVSEEEKEHSLTTHSEKLAIAFGLLNTSPGVVLRVTKNLRICGDCHTATKLISRIY 600

Query: 655 GREVVVRDVNRFHHFKAGSCSCGDYW 680
            RE++VRD+NRFHHFK G+CSCGDYW
Sbjct: 601 EREIIVRDLNRFHHFKDGNCSCGDYW 626

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PPR53_ARATH  2.7e-139  43.35     Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN...
PP175_ARATH  2.5e-137  40.67     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP265_ARATH  2.0e-134  39.11     Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
PP320_ARATH  4.5e-134  40.82     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PPR57_ARATH  2.2e-133  41.33     Pentatricopeptide repeat-containing protein At1g25360 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0KEZ1_CUCSA  0.0e+00   89.54     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G430650 PE=4 SV=1
M5X3I7_PRUPE      2.3e-262  70.56     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002838mg PE=4 SV=1
K4B1Y4_SOLLC      1.3e-252  67.36     Uncharacterized protein OS=Solanum lycopersicum PE=4 SV=1
W9QT12_9ROSA      9.0e-251  65.65     Uncharacterized protein OS=Morus notabilis GN=L484_002732 PE=4 SV=1
A0A0D2SZE2_GOSRA  7.6e-250  65.45     Uncharacterized protein OS=Gossypium raimondii GN=B456_008G028200 PE=4 SV=1

Match Name   E-value   Identity  Description
AT1G20230.1  1.5e-140  43.35     Pentatricopeptide repeat (PPR) superfamily protein
AT2G29760.1  1.4e-138  40.67     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G46790.1  1.1e-135  39.11     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1  2.5e-135  40.82     Pentatricopeptide repeat (PPR) superfamily protein
AT1G25360.1  1.2e-134  41.33     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity  Description
gi|449445033|ref|XP_004140278.1|  0.0e+00   89.54     PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis s...
gi|659112126|ref|XP_008456075.1|  0.0e+00   89.25     PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis m...
gi|596016252|ref|XP_007218862.1|  3.3e-262  70.56     hypothetical protein PRUPE_ppa002838mg [Prunus persica]
gi|720077886|ref|XP_010241184.1|  9.6e-262  69.05     PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Nelum...
gi|658042725|ref|XP_008356987.1|  3.6e-261  69.17     PREDICTED: putative pentatricopeptide repeat-containing protein At3g49142 [Malus...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf

Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G004070.1  CmoCh16G004070.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 210..235  score: 1.3E-4
    coord: 139..167  score: 0.07
    coord: 447..470  score:
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 267..316  score: 1.2E-10
    coord: 372..419  score: 2.8
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 269..304  score: 7.2E-7
    coord: 374..407  score: 2.1E-8
    coord: 239..263  score: 6.1E-4
    coord: 210..237  score: 7.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 69..103   score: 6.434
    coord: 267..301  score: 10.896
    coord: 509..543  score: 6.818
    coord: 443..477  score: 6.906
    coord: 407..437  score: 7.015
    coord: 240..266  score: 6.084
    coord: 372..406  score: 12.014
    coord: 339..369  score: 6.062
    coord: 135..169  score: 8.046
    coord: 304..338  score: 5.821
    coord: 205..239  score: 9
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 213..301  score: 1.4E-11
    coord: 340..408  score: 1.4E-11
    coord: 476..528  score: 1.4
IPR011990  Tetratricopeptide-like helical domain  unknown   SSF48452          TPR-like
    coord: 346..529  score: 4.27E-8
    coord: 266..307  score: 4.2
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 69..550   score:
None       No IPR available                       PANTHER   PTHR24015:SF728   SUBFAMILY NOT NAMED
    coord: 69..550   score:
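
Each alignment entry in the InterPro table is a `coord: start..end` range paired with a score. The following is a rough sketch of how those pairs could be extracted and ordered along the protein. It assumes the coordinates are 1-based, inclusive residue positions, which the page does not state, and `parse_hits` is a hypothetical helper name. The sample string reuses the first three TIGR00756 entries; entries whose score is cut off on the page are left out because their values cannot be recovered.

```python
import re

# One "coord: a..b score: x" pair; \s* tolerates line breaks or fused fields.
PAIR = re.compile(
    r"coord:\s*(\d+)\.\.(\d+)\s*score:\s*([\d.]+(?:[eE][+-]?\d+)?)"
)


def parse_hits(raw):
    """Return (start, end, score) tuples sorted by start (hypothetical helper)."""
    hits = [
        (int(start), int(end), float(score))
        for start, end, score in PAIR.findall(raw)
    ]
    return sorted(hits)


# First three TIGR00756 entries, as they appear in the raw page text.
raw = (
    "coord: 269..304\nscore: 7.2E-7"
    "coord: 374..407\nscore: 2.1E-8"
    "coord: 239..263\nscore: 6.1E-4"
)

hits = parse_hits(raw)
# Span length under the assumed 1-based inclusive convention.
lengths = [end - start + 1 for start, end, _ in hits]
for (start, end, score), length in zip(hits, lengths):
    print(f"{start}..{end}  length={length}  score={score:g}")
```

Sorting by start position makes it easier to see how the repeat hits tile the protein, for example to spot overlaps between calls from different source databases.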

The following gene(s) are paralogous to this gene:

None