CmoCh04G016450 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G016450
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cmo_Chr04 : 8395312 .. 8398989 (-)
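
The location field can be split into its parts with a short script. This is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumption: START and END are 1-based, inclusive coordinates,
    so the feature length is END - START + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cmo_Chr04 : 8395312 .. 8398989 (-)")
print(loc)
```

Under the inclusive-coordinate assumption the gene spans 3,678 bp on the minus strand of Cmo_Chr04.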
The following sequences are available for this feature:

Gene sequence (with intron)

TGCCCTTTGGGGAAATTACACAAATCTGCTCGCGCTATCTATGGCCGAAAACGGCAGCAACCCGAAACCAGCAGTGGCTGCCGCCCTTCAGAATTGCCGCGAAACCCATCGGTTCGCTAAAACCCTAACTCCCTTTCCTTACCACGTTTCATACTGCTTTCCGATCCCTCTAACTTATCCCTATCTTTGTTGCTTTTAATTTCACGTTGTCTTAGACGAGAAATCGGCAGGGTTTTCTTCCAATCCTGCTCCCATTTCCATGCTTAGCCGTATTCATCAATGGAAGCCATTACATTCTCTGAGGAAATGCGGGATACTCGCTTCTTTTAGTTCTGTGATCCTTGCTAGGCCTTCAGTTTCTGCCGCCCGCCTCGAAGCGGAATCTGTCACTCCCTCCTTCGTTCTGGGCCAGAACGACCCAGTTTGTGAGATTCTTACGGGTTTAAATTCCTTTGGGTTTAGAGCGTATGTTGGTGGATGTAACTTTCGAACTGTAGTTTCTACTTTGAGTGAAACTGTAGTGGACGGCGTTCTTGAGAGTTTGAATATTCAGAATCCTGATGTTGCTGTGGCATTTTTCTATTTGTTGAGAAATAAGTACGGATTTCGGCATTCCGGATTCTCCCAGCTTGCCGTTTCTCATATTCTAGCGGGTAAAGGAAGATTCAAGGAGTTGCATTGCGTTATAAAGCAATTGGTTGAGGAGCAAGGTATCAGTTCTTGGCATCTCTTTCTTCATGATTGATTGGCTGATATTAGAGAAAAGCAGAGAAGTCGGGCATTCTCCTTCTTAACGCCTAGTGTGTATATTGGCTGAACTAAGAGATTCATAGTTATCATGATAATGTGCATACTTAAGAGTATAAAAAAAAAAAAAAATGGTAGACTAATTCCCCACATTTTCTGCAGTTGAAAAGCACCGTGCTTTCTTTCATGTCTTCCCTGTTTCTTTGATGTATATATAAACCTCGTCTTTTAACGTACTTCTTTTAGACCATCCTTATTTTTCAGTTCATCAACCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCCCCCCCCCTTTTTTTTTGTTAACATTGGAATGTCTATTTTATGTTGTTTATGTCTCTCGATATTTACTCTCTGGTAGGGTCGGGTTCTGCATCTTCCTTTTGTGACCTGCTCTTGAACAAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATGCTGGCATTTGCGTATTCTAGACATGAAATGATCCATGATGCCCTCTTTGTCGTCGCAAAAATGAAGGATCTAAATTTACAAGCTTCAGTTCCAACTTATAACTCTCTATTGCATAACTTAAGGCACACTGATGTCATGTGGGATATATGCAATGAGATCAAAGCTAGTGGAGCTCCTCAGAGTGAATATACTACCTCAATACTTATACACGGCCTATGTGCGCAGTCCAAGTTACAAGATGCGATTTCATTCCTACAGGACAGCAATGAAGTAGTTGGACCTTCTATTGT
GTCTATCAATACCGTTATGTCAAAGTTTTGTAAAGTGGGGCTAGTAGATGTTGCAAGGTCATTTTTCTGTTTGATGGTCAAGAATGGACTTCTTCCTGATTCGTACAGTTATAATATTCTTATTCATGGGTTATGTGTAGCAGGTTCCATGGATGAAGCTCTGGAATTCACAGATGACATGGAAAAGCATGGTGTGGAGCCTGATGTAGTAACATACAACACACTTGCTAAAGGTTTTCTCTTGCTTGGTTTTATGAGTGGGGCCTGGAAAGTCGTCCAGAAAATGTTGCTAAAAGGTCTAAATCCGGATATCGTGACATATACAATACTGATATGTGGGCACTGTCAAATGGGAAATATTGAGGAAGCCCTTAAGCTGCGGCAAGAAACCCTTTCAAGGGGGTTTCAGTTGAACATCATTTCTTACAGTGTGCTACTTAGCTGTTTGTGTAAAGTTGGACGAATAGAAGAAGCATTGGCATTGCTCAATGAAATGGAAACTCTACGTTTGAAACCTGATCTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCTTACCAACTATATGAACAAATGCGTTTGAAGAGAAATTTTCCTAACTACTTTGCTCAACGTGCTGTACTTTTGGGTTTCTTTGAGAATGGAAATATTTCTGAGGCAAGAAGATATTTTGATGCTTTGACTCATATGGATCTGATAGAGGATATTATTTTGTATAATATTATGATTGATGGTTACGTAAGGCTTGGTGATATTTCTGAGGCTATGCAGCTATATTACAGAATGTTTGAAAGGGGGATTACTCCAACTGTTGTCACTTTCAACACTCTTGTCCATGGGTTTTGCAGAAATGGAGACCTAGTGGAGGCTCGAAAGATGTTCGAAATCATTAGGTTGAATGGATTGCTACCCAGTGTAGTAACTTATACTACCCTTATGAATGCGTACTGTGAAGCGGGAAACATGCAGGAAATGTTTGATCTACTCCATGAGATGGAAGCAAATGCTGTTGTTCCAACTCATATAACTTATACTGTACTAATCAAAGGACTCTGCAGACAGAATAAAATGCATGAAGCCCTCCAGTTACTTGAGTATATGTATGCAAAGGGTCTAATGCCAGATCAGATTACATATAATACTATTATCCAATGTTTTTGCAAGGCCAGAGACATTGCAAAAGCTTTCCAGGTTTATAATGAGATGTTGCTCCATAATCTTGATCCTACCCATGTAACTTATAATGTACTTATTAGTGGTCTTTGTGTATATGGTGACCTAAAGGATGCTGATCGAATGCTGGTTTCTATGGAGGATCAAAATATTAGCTTGACAAAAGTTGCTTATATGACAATTATTAAGGCACATTGTGCAAAGGGTCAGGTGTCTAAGGCATTAGGGTTCTTCAATCAAATGTTGGCTAAGAGTTTTGTCATTTCCATCAGAGATTATAGTGCTATCATTAATAGGCTGTGCAAAAGGGGTCTAATTACTGAAGCAAAGTACTTATTTGCTATGATGTTATCTGAAGGTGTAACGCCTGATTCTGAAATCTGCGAGACAATGCTTAATGCTTTCCATCAACATGGTGATAGCAGTTCAGCATTTGAATTTCTTGCTGTGATGGTTAAATCTGGCGTCATTTCACATTGA

mRNA sequence

TGCCCTTTGGGGAAATTACACAAATCTGCTCGCGCTATCTATGGCCGAAAACGGCAGCAACCCGAAACCAGCAGTGGCTGCCGCCCTTCAGAATTGCCGCGAAACCCATCGGTTCGCTAAAACCCTAACTCCCTTTCCTTACCACGTTTCATACTGCTTTCCGATCCCTCTAACTTATCCCTATCTTTGTTGCTTTTAATTTCACGTTGTCTTAGACGAGAAATCGGCAGGGTTTTCTTCCAATCCTGCTCCCATTTCCATGCTTAGCCGTATTCATCAATGGAAGCCATTACATTCTCTGAGGAAATGCGGGATACTCGCTTCTTTTAGTTCTGTGATCCTTGCTAGGCCTTCAGTTTCTGCCGCCCGCCTCGAAGCGGAATCTGTCACTCCCTCCTTCGTTCTGGGCCAGAACGACCCAGTTTGTGAGATTCTTACGGGTTTAAATTCCTTTGGGTTTAGAGCGTATGTTGGTGGATGTAACTTTCGAACTGTAGTTTCTACTTTGAGTGAAACTGTAGTGGACGGCGTTCTTGAGAGTTTGAATATTCAGAATCCTGATGTTGCTGTGGCATTTTTCTATTTGTTGAGAAATAAGTACGGATTTCGGCATTCCGGATTCTCCCAGCTTGCCGTTTCTCATATTCTAGCGGGTAAAGGAAGATTCAAGGAGTTGCATTGCGTTATAAAGCAATTGGTTGAGGAGCAAGGGTCGGGTTCTGCATCTTCCTTTTGTGACCTGCTCTTGAACAAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATGCTGGCATTTGCGTATTCTAGACATGAAATGATCCATGATGCCCTCTTTGTCGTCGCAAAAATGAAGGATCTAAATTTACAAGCTTCAGTTCCAACTTATAACTCTCTATTGCATAACTTAAGGCACACTGATGTCATGTGGGATATATGCAATGAGATCAAAGCTAGTGGAGCTCCTCAGAGTGAATATACTACCTCAATACTTATACACGGCCTATGTGCGCAGTCCAAGTTACAAGATGCGATTTCATTCCTACAGGACAGCAATGAAGTAGTTGGACCTTCTATTGTGTCTATCAATACCGTTATGTCAAAGTTTTGTAAAGTGGGGCTAGTAGATGTTGCAAGGTCATTTTTCTGTTTGATGGTCAAGAATGGACTTCTTCCTGATTCGTACAGTTATAATATTCTTATTCATGGGTTATGTGTAGCAGGTTCCATGGATGAAGCTCTGGAATTCACAGATGACATGGAAAAGCATGGTGTGGAGCCTGATGTAGTAACATACAACACACTTGCTAAAGGTTTTCTCTTGCTTGGTTTTATGAGTGGGGCCTGGAAAGTCGTCCAGAAAATGTTGCTAAAAGGTCTAAATCCGGATATCGTGACATATACAATACTGATATGTGGGCACTGTCAAATGGGAAATATTGAGGAAGCCCTTAAGCTGCGGCAAGAAACCCTTTCAAGGGGGTTTCAGTTGAACATCATTTCTTACAGTGTGCTACTTAGCTGTTTGTGTAAAGTTGGACGAATAGAAGAAGCATTGGCATTGCTCAATGAAATGGAAACTCTACGTTTGAAACCTGATCTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCTTACCAACTATATGAACAAATGCGTTTGAAGAGAAATTTTCCTAACTACTTTGCTCAACGTGCTGTACTTTTGGGTTTCTTTGAGAATGGAAATATTTCTGAGGCAAGAAGATATTTTGATGCTTTGACTCATATGGATCTGATAGAGGATATTATTTTGTATAATATTATGATTGATGGTTACGTAAGGCTTGGTGATATTTCTGAGGCTATGCAGCTATATTACAGAATGTTTGAAAGGGGGATTACTCCAACTGTTGTCACTTTCAACACTCTTGTCCATGGGTTTTGCAGAAATGGAGACCTAGTGGAGGCTCGAAAGATGTTCGAAATCATTA
GGTTGAATGGATTGCTACCCAGTGTAGTAACTTATACTACCCTTATGAATGCGTACTGTGAAGCGGGAAACATGCAGGAAATGTTTGATCTACTCCATGAGATGGAAGCAAATGCTGTTGTTCCAACTCATATAACTTATACTGTACTAATCAAAGGACTCTGCAGACAGAATAAAATGCATGAAGCCCTCCAGTTACTTGAGTATATGTATGCAAAGGGTCTAATGCCAGATCAGATTACATATAATACTATTATCCAATGTTTTTGCAAGGCCAGAGACATTGCAAAAGCTTTCCAGGTTTATAATGAGATGTTGCTCCATAATCTTGATCCTACCCATGTAACTTATAATGTACTTATTAGTGGTCTTTGTGTATATGGTGACCTAAAGGATGCTGATCGAATGCTGGTTTCTATGGAGGATCAAAATATTAGCTTGACAAAAGTTGCTTATATGACAATTATTAAGGCACATTGTGCAAAGGGTCAGGTGTCTAAGGCATTAGGGTTCTTCAATCAAATGTTGGCTAAGAGTTTTGTCATTTCCATCAGAGATTATAGTGCTATCATTAATAGGCTGTGCAAAAGGGGTCTAATTACTGAAGCAAAGTACTTATTTGCTATGATGTTATCTGAAGGTGTAACGCCTGATTCTGAAATCTGCGAGACAATGCTTAATGCTTTCCATCAACATGGTGATAGCAGTTCAGCATTTGAATTTCTTGCTGTGATGGTTAAATCTGGCGTCATTTCACATTGA

Coding sequence (CDS)

ATGCTTAGCCGTATTCATCAATGGAAGCCATTACATTCTCTGAGGAAATGCGGGATACTCGCTTCTTTTAGTTCTGTGATCCTTGCTAGGCCTTCAGTTTCTGCCGCCCGCCTCGAAGCGGAATCTGTCACTCCCTCCTTCGTTCTGGGCCAGAACGACCCAGTTTGTGAGATTCTTACGGGTTTAAATTCCTTTGGGTTTAGAGCGTATGTTGGTGGATGTAACTTTCGAACTGTAGTTTCTACTTTGAGTGAAACTGTAGTGGACGGCGTTCTTGAGAGTTTGAATATTCAGAATCCTGATGTTGCTGTGGCATTTTTCTATTTGTTGAGAAATAAGTACGGATTTCGGCATTCCGGATTCTCCCAGCTTGCCGTTTCTCATATTCTAGCGGGTAAAGGAAGATTCAAGGAGTTGCATTGCGTTATAAAGCAATTGGTTGAGGAGCAAGGGTCGGGTTCTGCATCTTCCTTTTGTGACCTGCTCTTGAACAAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATGCTGGCATTTGCGTATTCTAGACATGAAATGATCCATGATGCCCTCTTTGTCGTCGCAAAAATGAAGGATCTAAATTTACAAGCTTCAGTTCCAACTTATAACTCTCTATTGCATAACTTAAGGCACACTGATGTCATGTGGGATATATGCAATGAGATCAAAGCTAGTGGAGCTCCTCAGAGTGAATATACTACCTCAATACTTATACACGGCCTATGTGCGCAGTCCAAGTTACAAGATGCGATTTCATTCCTACAGGACAGCAATGAAGTAGTTGGACCTTCTATTGTGTCTATCAATACCGTTATGTCAAAGTTTTGTAAAGTGGGGCTAGTAGATGTTGCAAGGTCATTTTTCTGTTTGATGGTCAAGAATGGACTTCTTCCTGATTCGTACAGTTATAATATTCTTATTCATGGGTTATGTGTAGCAGGTTCCATGGATGAAGCTCTGGAATTCACAGATGACATGGAAAAGCATGGTGTGGAGCCTGATGTAGTAACATACAACACACTTGCTAAAGGTTTTCTCTTGCTTGGTTTTATGAGTGGGGCCTGGAAAGTCGTCCAGAAAATGTTGCTAAAAGGTCTAAATCCGGATATCGTGACATATACAATACTGATATGTGGGCACTGTCAAATGGGAAATATTGAGGAAGCCCTTAAGCTGCGGCAAGAAACCCTTTCAAGGGGGTTTCAGTTGAACATCATTTCTTACAGTGTGCTACTTAGCTGTTTGTGTAAAGTTGGACGAATAGAAGAAGCATTGGCATTGCTCAATGAAATGGAAACTCTACGTTTGAAACCTGATCTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCTTACCAACTATATGAACAAATGCGTTTGAAGAGAAATTTTCCTAACTACTTTGCTCAACGTGCTGTACTTTTGGGTTTCTTTGAGAATGGAAATATTTCTGAGGCAAGAAGATATTTTGATGCTTTGACTCATATGGATCTGATAGAGGATATTATTTTGTATAATATTATGATTGATGGTTACGTAAGGCTTGGTGATATTTCTGAGGCTATGCAGCTATATTACAGAATGTTTGAAAGGGGGATTACTCCAACTGTTGTCACTTTCAACACTCTTGTCCATGGGTTTTGCAGAAATGGAGACCTAGTGGAGGCTCGAAAGATGTTCGAAATCATTAGGTTGAATGGATTGCTACCCAGTGTAGTAACTTATACTACCCTTATGAATGCGTACTGTGAAGCGGGAAACATGCAGGAAATGTTTGATCTACTCCATGAGATGGAAGCAAATGCTGTTGTTCCAACTCATATAACTTATACTGTACTAATCAAAGGACTCTGCAGACAGAATAAAATGCATGAAGCCCTCCAGTTACTTGAGTATATGTATGCAAAGGGTCTAATGCCAGATCAGATTACATATAATACTATTATCCA
ATGTTTTTGCAAGGCCAGAGACATTGCAAAAGCTTTCCAGGTTTATAATGAGATGTTGCTCCATAATCTTGATCCTACCCATGTAACTTATAATGTACTTATTAGTGGTCTTTGTGTATATGGTGACCTAAAGGATGCTGATCGAATGCTGGTTTCTATGGAGGATCAAAATATTAGCTTGACAAAAGTTGCTTATATGACAATTATTAAGGCACATTGTGCAAAGGGTCAGGTGTCTAAGGCATTAGGGTTCTTCAATCAAATGTTGGCTAAGAGTTTTGTCATTTCCATCAGAGATTATAGTGCTATCATTAATAGGCTGTGCAAAAGGGGTCTAATTACTGAAGCAAAGTACTTATTTGCTATGATGTTATCTGAAGGTGTAACGCCTGATTCTGAAATCTGCGAGACAATGCTTAATGCTTTCCATCAACATGGTGATAGCAGTTCAGCATTTGAATTTCTTGCTGTGATGGTTAAATCTGGCGTCATTTCACATTGA
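
The coding sequence above can be translated to check it against the query residues shown in the BLAST alignments. This is a minimal sketch using the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative, and the record does not say which tool produced its protein translation.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest), as in NCBI translation table 1.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored; codons containing
    characters other than A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last five codons of the CDS listed above.
print(translate("ATGCTTAGCCGTATTCATCAATGG"))
print(translate("GTCATTTCACATTGA"))
```

The first call yields the N-terminal residues MLSRIHQW and the second the C-terminal residues VISH, matching the start and end of the query lines in the alignments.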

BLAST of CmoCh04G016450 vs. Swiss-Prot
Match: PPR41_ARATH (Putative pentatricopeptide repeat-containing protein At1g13630 OS=Arabidopsis thaliana GN=At1g13630 PE=2 SV=3)

HSP 1 Score: 786.6 bits (2030), Expect = 2.7e-226
Identity = 398/819 (48.60%), Positives = 563/819 (68.74%), Query Frame = 1

Query: 5   IHQWKPLHSLRKCGILASFSSVILARPSVSAARLEAESV-TPSFVLGQNDPVCEILTGLN 64
           I +W   +S +    L+ FSS++  + S S A+++ ES+ T +          EIL G+ 
Sbjct: 2   ICRWIAFNSSKVSRSLSPFSSLLFTKSSFSVAKMDDESLPTTNSTSDHRGFYKEILFGMK 61

Query: 65  SFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSGFSQ 124
             GFR ++ G +FR +VS L    V+ +++ L  ++ D++V FF  LR+ Y FRHS FS 
Sbjct: 62  KIGFREFLHGYHFRGLVSELRHVHVEEIMDELMSESSDLSVWFFKELRDIYAFRHSSFST 121

Query: 125 LAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAY 184
           L VSH+LAG+ RFKEL  +++QL++E+G+      C+LL N FR W+S G+VWDML F  
Sbjct: 122 LLVSHVLAGQRRFKELQVILEQLLQEEGT-----LCELLSNSFRKWESTGLVWDMLLFLS 181

Query: 185 SRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSEYTT 244
           SR  M+ D+L+++ KMKD NL  S  +YNS+L++ R TD MWD+  EIK     ++E+T 
Sbjct: 182 SRLRMVDDSLYILKKMKDQNLNVSTQSYNSVLYHFRETDKMWDVYKEIK----DKNEHTY 241

Query: 245 SILIHGLCAQSKLQDAISFLQDSN-EVVGPSIVSINTVMSKFCKVGLVDVARSFFCLMVK 304
           S ++ GLC Q KL+DA+ FL+ S  + +GPS+VS N++MS +CK+G VD+A+SFFC ++K
Sbjct: 242 STVVDGLCRQQKLEDAVLFLRTSEWKDIGPSVVSFNSIMSGYCKLGFVDMAKSFFCTVLK 301

Query: 305 NGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGFMSG 364
            GL+P  YS+NILI+GLC+ GS+ EALE   DM KHGVEPD VTYN LAKGF LLG +SG
Sbjct: 302 CGLVPSVYSHNILINGLCLVGSIAEALELASDMNKHGVEPDSVTYNILAKGFHLLGMISG 361

Query: 365 AWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLN-IISYSVLL 424
           AW+V++ ML KGL+PD++TYTIL+CG CQ+GNI+  L L ++ LSRGF+LN II  SV+L
Sbjct: 362 AWEVIRDMLDKGLSPDVITYTILLCGQCQLGNIDMGLVLLKDMLSRGFELNSIIPCSVML 421

Query: 425 SCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNF 484
           S LCK GRI+EAL+L N+M+   L PDL+ YSI+IHGLCK G    A  LY++M  KR  
Sbjct: 422 SGLCKTGRIDEALSLFNQMKADGLSPDLVAYSIVIHGLCKLGKFDMALWLYDEMCDKRIL 481

Query: 485 PNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAMQL 544
           PN     A+LLG  + G + EAR   D+L       DI+LYNI+IDGY + G I EA++L
Sbjct: 482 PNSRTHGALLLGLCQKGMLLEARSLLDSLISSGETLDIVLYNIVIDGYAKSGCIEEALEL 541

Query: 545 YYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAYCE 604
           +  + E GITP+V TFN+L++G+C+  ++ EARK+ ++I+L GL PSVV+YTTLM+AY  
Sbjct: 542 FKVVIETGITPSVATFNSLIYGYCKTQNIAEARKILDVIKLYGLAPSVVSYTTLMDAYAN 601

Query: 605 AGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCR----QNKMH--------EALQLLEY 664
            GN + + +L  EM+A  + PT++TY+V+ KGLCR    +N  H        +  Q L  
Sbjct: 602 CGNTKSIDELRREMKAEGIPPTNVTYSVIFKGLCRGWKHENCNHVLRERIFEKCKQGLRD 661

Query: 665 MYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGD 724
           M ++G+ PDQITYNTIIQ  C+ + ++ AF     M   NLD +  TYN+LI  LCVYG 
Sbjct: 662 MESEGIPPDQITYNTIIQYLCRVKHLSGAFVFLEIMKSRNLDASSATYNILIDSLCVYGY 721

Query: 725 LKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSA 784
           ++ AD  + S+++QN+SL+K AY T+IKAHC KG    A+  F+Q+L + F +SIRDYSA
Sbjct: 722 IRKADSFIYSLQEQNVSLSKFAYTTLIKAHCVKGDPEMAVKLFHQLLHRGFNVSIRDYSA 781

Query: 785 IINRLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNA 809
           +INRLC+R L+ E+K+ F +MLS+G++PD +ICE M+ +
Sbjct: 782 VINRLCRRHLVNESKFFFCLMLSQGISPDLDICEVMIKS 811

BLAST of CmoCh04G016450 vs. Swiss-Prot
Match: RF1_ORYSI (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1)

HSP 1 Score: 309.3 bits (791), Expect = 1.3e-82
Identity = 195/635 (30.71%), Positives = 320/635 (50.39%), Query Frame = 1

Query: 195 VVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSEYTTSILIHGLCAQS 254
           V+ K   ++  A  P    L  + R +D M  +   +   G   + ++ +IL+ GLC ++
Sbjct: 113 VIKKGFRVDAIAFTPLLKGLCADKRTSDAMDIVLRRMTELGCIPNVFSYNILLKGLCDEN 172

Query: 255 KLQDAISFLQDSNEVVG----PSIVSINTVMSKFCKVGLVDVARSFFCLMVKNGLLPDSY 314
           + Q+A+  L    +  G    P +VS  TV++ F K G  D A S +  M+  G+LPD  
Sbjct: 173 RSQEALELLHMMADDRGGGSPPDVVSYTTVINGFFKEGDSDKAYSTYHEMLDRGILPDVV 232

Query: 315 SYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGFMSGAWKVVQKM 374
           +YN +I  LC A +MD+A+E  + M K+GV PD +TYN++  G+   G    A   ++KM
Sbjct: 233 TYNSIIAALCKAQAMDKAMEVLNTMVKNGVMPDCMTYNSILHGYCSSGQPKEAIGFLKKM 292

Query: 375 LLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSVLLSCLCKVGRI 434
              G+ PD+VTY++L+   C+ G   EA K+      RG +  I +Y  LL      G +
Sbjct: 293 RSDGVEPDVVTYSLLMDYLCKNGRCMEARKIFDSMTKRGLKPEITTYGTLLQGYATKGAL 352

Query: 435 EEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFAQRAV 494
            E   LL+ M    + PD  V+SILI    K+G V +A  ++ +MR +   PN     AV
Sbjct: 353 VEMHGLLDLMVRNGIHPDHYVFSILICAYAKQGKVDQAMLVFSKMRQQGLNPNAVTYGAV 412

Query: 495 LLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAMQLYYRMFERGI 554
           +    ++G + +A  YF+ +    L    I+YN +I G         A +L   M +RGI
Sbjct: 413 IGILCKSGRVEDAMLYFEQMIDEGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGI 472

Query: 555 TPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFD 614
               + FN+++   C+ G ++E+ K+FE++   G+ P+V+TY TL+N YC AG M E   
Sbjct: 473 CLNTIFFNSIIDSHCKEGRVIESEKLFELMVRIGVKPNVITYNTLINGYCLAGKMDEAMK 532

Query: 615 LLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFC 674
           LL  M +  + P  +TY+ LI G C+ ++M +AL L + M + G+ PD ITYN I+Q   
Sbjct: 533 LLSGMVSVGLKPNTVTYSTLINGYCKISRMEDALVLFKEMESSGVSPDIITYNIILQGLF 592

Query: 675 KARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKV 734
           + R  A A ++Y  +          TYN+++ GLC      DA +M  ++   ++ L   
Sbjct: 593 QTRRTAAAKELYVRITESGTQIELSTYNIILHGLCKNKLTDDALQMFQNLCLMDLKLEAR 652

Query: 735 AYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMM 794
            +  +I A    G+  +A   F    +   V +   Y  +   +  +GL+ E   LF  M
Sbjct: 653 TFNIMIDALLKVGRNDEAKDLFVAFSSNGLVPNYWTYRLMAENIIGQGLLEELDQLFLSM 712

Query: 795 LSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVM 826
              G T DS +   ++    Q G+ + A  +L+++
Sbjct: 713 EDNGCTVDSGMLNFIVRELLQRGEITRAGTYLSMI 747

BLAST of CmoCh04G016450 vs. Swiss-Prot
Match: PP437_ARATH (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 298.5 bits (763), Expect = 2.3e-79
Identity = 184/640 (28.75%), Positives = 327/640 (51.09%), Query Frame = 1

Query: 170 DSNGVVWDMLAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNL---RHTDVMWD 229
           D N V +++L     + + + +A+ +   +   +L+  V TY +L++ L   +  ++  +
Sbjct: 259 DVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLE 318

Query: 230 ICNEIKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFC 289
           + +E+       SE   S L+ GL  + K+++A++ ++   +  V P++   N ++   C
Sbjct: 319 MMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLC 378

Query: 290 KVGLVDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVV 349
           K      A   F  M K GL P+  +Y+ILI   C  G +D AL F  +M   G++  V 
Sbjct: 379 KGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVY 438

Query: 350 TYNTLAKGFLLLGFMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQET 409
            YN+L  G    G +S A   + +M+ K L P +VTYT L+ G+C  G I +AL+L  E 
Sbjct: 439 PYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEM 498

Query: 410 LSRGFQLNIISYSVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFV 469
             +G   +I +++ LLS L + G I +A+ L NEM    +KP+ + Y+++I G C+EG +
Sbjct: 499 TGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDM 558

Query: 470 QRAYQLYEQMRLKRNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIM 529
            +A++  ++M  K   P+ ++ R ++ G    G  SEA+ + D L   +   + I Y  +
Sbjct: 559 SKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGL 618

Query: 530 IDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGL 589
           + G+ R G + EA+ +   M +RG+   +V +  L+ G  ++ D      + + +   GL
Sbjct: 619 LHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGL 678

Query: 590 LPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQ 649
            P  V YT++++A  + G+ +E F +   M     VP  +TYT +I GLC+   ++EA  
Sbjct: 679 KPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEV 738

Query: 650 LLEYMYAKGLMPDQITYNTIIQCFCKAR-DIAKAFQVYNEMLLHNLDPTHVTYNVLISGL 709
           L   M     +P+Q+TY   +    K   D+ KA +++N +L   L  T  TYN+LI G 
Sbjct: 739 LCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLLANT-ATYNMLIRGF 798

Query: 710 CVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISI 769
           C  G +++A  ++  M    +S   + Y T+I   C +  V KA+  +N M  K      
Sbjct: 799 CRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDR 858

Query: 770 RDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICET 805
             Y+ +I+  C  G + +A  L   ML +G+ P+++   T
Sbjct: 859 VAYNTLIHGCCVAGEMGKATELRNEMLRQGLIPNNKTSRT 897

BLAST of CmoCh04G016450 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 293.9 bits (751), Expect = 5.6e-78
Identity = 170/572 (29.72%), Positives = 296/572 (51.75%), Query Frame = 1

Query: 263 LQDSNEVVGPSIVSINTVMSKFCKVGLVDVARSFFCLMVKNGLLPDSYSYNILIHGLCVA 322
           LQ++ ++   +    + V+  + ++ L+D A S   L   +G +P   SYN ++     +
Sbjct: 123 LQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRS 182

Query: 323 G-SMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGFMSGAWKVVQKMLLKGLNPDIVT 382
             ++  A     +M +  V P+V TYN L +GF   G +  A  +  KM  KG  P++VT
Sbjct: 183 KRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVT 242

Query: 383 YTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSVLLSCLCKVGRIEEALALLNEME 442
           Y  LI G+C++  I++  KL +    +G + N+ISY+V+++ LC+ GR++E   +L EM 
Sbjct: 243 YNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMN 302

Query: 443 TLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFAQRAVLLGFFENGNIS 502
                 D + Y+ LI G CKEG   +A  ++ +M      P+     +++    + GN++
Sbjct: 303 RRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMN 362

Query: 503 EARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLV 562
            A  + D +    L  +   Y  ++DG+ + G ++EA ++   M + G +P+VVT+N L+
Sbjct: 363 RAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALI 422

Query: 563 HGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVV 622
           +G C  G + +A  + E ++  GL P VV+Y+T+++ +C + ++ E   +  EM    + 
Sbjct: 423 NGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIK 482

Query: 623 PTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQV 682
           P  ITY+ LI+G C Q +  EA  L E M   GL PD+ TY  +I  +C   D+ KA Q+
Sbjct: 483 PDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQL 542

Query: 683 YNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCA 742
           +NEM+   + P  VTY+VLI+GL      ++A R+L+ +  +    + V Y T+I+ +C+
Sbjct: 543 HNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIE-NCS 602

Query: 743 KGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEI 802
                               I  +   ++I   C +G++TEA  +F  ML +   PD   
Sbjct: 603 N-------------------IEFKSVVSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTA 662

Query: 803 CETMLNAFHQHGDSSSAFEFLAVMVKSGVISH 834
              M++   + GD   A+     MVKSG + H
Sbjct: 663 YNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLH 674

BLAST of CmoCh04G016450 vs. Swiss-Prot
Match: PP360_ARATH (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 293.1 bits (749), Expect = 9.5e-78
Identity = 171/617 (27.71%), Positives = 307/617 (49.76%), Query Frame = 1

Query: 116 FRHSGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVV 175
           F+H+  S  A+ HIL   GR  +    + +++   G  S     + L + F N  SN  V
Sbjct: 109 FKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGV-SRLEIVNSLDSTFSNCGSNDSV 168

Query: 176 WDMLAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRH---TDVMWDICNEIK 235
           +D+L   Y +   + +A      ++      S+   N+L+ +L      ++ W +  EI 
Sbjct: 169 FDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEIS 228

Query: 236 ASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFCKVGLVD 295
            SG   + YT +I+++ LC   K++   +FL    E  V P IV+ NT++S +   GL++
Sbjct: 229 RSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLME 288

Query: 296 VARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLA 355
            A      M   G  P  Y+YN +I+GLC  G  + A E   +M + G+ PD  TY +L 
Sbjct: 289 EAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLL 348

Query: 356 KGFLLLGFMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQ 415
                 G +    KV   M  + + PD+V ++ ++    + GN+++AL         G  
Sbjct: 349 MEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLI 408

Query: 416 LNIISYSVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQL 475
            + + Y++L+   C+ G I  A+ L NEM       D++ Y+ ++HGLCK   +  A +L
Sbjct: 409 PDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKL 468

Query: 476 YEQMRLKRNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVR 535
           + +M  +  FP+ +    ++ G  + GN+  A   F  +    +  D++ YN ++DG+ +
Sbjct: 469 FNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGK 528

Query: 536 LGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVT 595
           +GDI  A +++  M  + I PT ++++ LV+  C  G L EA ++++ +    + P+V+ 
Sbjct: 529 VGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMI 588

Query: 596 YTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMY 655
             +++  YC +GN  +    L +M +   VP  I+Y  LI G  R+  M +A  L++ M 
Sbjct: 589 CNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKME 648

Query: 656 AK--GLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGD 715
            +  GL+PD  TYN+I+  FC+   + +A  V  +M+   ++P   TY  +I+G     +
Sbjct: 649 EEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGFVSQDN 708

Query: 716 LKDADRMLVSMEDQNIS 727
           L +A R+   M  +  S
Sbjct: 709 LTEAFRIHDEMLQRGFS 724

BLAST of CmoCh04G016450 vs. TrEMBL
Match: A0A0A0L9A2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G642640 PE=4 SV=1)

HSP 1 Score: 1388.6 bits (3593), Expect = 0.0e+00
Identity = 687/835 (82.28%), Positives = 747/835 (89.46%), Query Frame = 1

Query: 1   MLSRIHQWKPLHSLRKCGILASFSSVILARPSVS--AARLEAESVTPSFVLGQNDPVCEI 60
           MLSR HQ KPLH      I AS SSVILARPSVS  AARLE  +VT SFV  QND V EI
Sbjct: 1   MLSRAHQCKPLH-----WIFASLSSVILARPSVSVSAARLEPATVTTSFVSDQNDSVREI 60

Query: 61  LTGLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRH 120
           L GLNS GFRAYVGGCNFRTVVSTLSETVVDGVL+ L    PDVAVAFFY L N+YGFRH
Sbjct: 61  LIGLNSLGFRAYVGGCNFRTVVSTLSETVVDGVLDRLRTLKPDVAVAFFYFLINEYGFRH 120

Query: 121 SGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDM 180
           S FSQ  VSHILAGKGRFKEL  VIK L+ +QG GSAS  CDLLL KFRNWDSNG+VWDM
Sbjct: 121 SIFSQFVVSHILAGKGRFKELDSVIKNLIVDQGLGSASIICDLLLEKFRNWDSNGLVWDM 180

Query: 181 LAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQ 240
           LAFAYSRHEMIHDALFV+AKMKDLN QASVPTYNSLLHN+RHTD+MWD+ NEIK SGAPQ
Sbjct: 181 LAFAYSRHEMIHDALFVIAKMKDLNFQASVPTYNSLLHNMRHTDIMWDVYNEIKVSGAPQ 240

Query: 241 SEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVGLVDVARSFFC 300
           SE TTSILIHGLC QSKL+DAISFL DSN+VVGPSIVSINT+MSKFCKVGL+DVARSFFC
Sbjct: 241 SECTTSILIHGLCEQSKLEDAISFLHDSNKVVGPSIVSINTIMSKFCKVGLIDVARSFFC 300

Query: 301 LMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLG 360
           LMVKNGLL DS+SYNIL+HGLCVAGSMDEAL FTDDMEKHGVEPDVVTYNTLAKGFLLLG
Sbjct: 301 LMVKNGLLHDSFSYNILLHGLCVAGSMDEALGFTDDMEKHGVEPDVVTYNTLAKGFLLLG 360

Query: 361 FMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYS 420
            MSGA KVVQKMLL+GLNPD+VTYT LICGHCQMGNIEEALKLRQETLSRGF+LN+I Y+
Sbjct: 361 LMSGARKVVQKMLLQGLNPDLVTYTTLICGHCQMGNIEEALKLRQETLSRGFKLNVIFYN 420

Query: 421 VLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLK 480
           +LLSCLCKVGRIEEAL L +EMETLRL+PD IVYSILIHGLCKEGFVQRAYQLYEQMRLK
Sbjct: 421 MLLSCLCKVGRIEEALTLFDEMETLRLEPDFIVYSILIHGLCKEGFVQRAYQLYEQMRLK 480

Query: 481 RNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEA 540
           R FP++FAQRAVLLG F+NGNISEAR YFD  T MDL+ED++LYNIMIDGYVRL  I+EA
Sbjct: 481 RKFPHHFAQRAVLLGLFKNGNISEARNYFDTWTRMDLMEDVVLYNIMIDGYVRLDGIAEA 540

Query: 541 MQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNA 600
           MQLYY+M ERGITP+VVTFNTL++GFCR GDL+EARKM E+IRL GL+PSVVTYTTLMNA
Sbjct: 541 MQLYYKMIERGITPSVVTFNTLINGFCRRGDLMEARKMLEVIRLKGLVPSVVTYTTLMNA 600

Query: 601 YCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPD 660
           YCE GNMQEMF  LHEMEANAVVPTH+TYTVLIKGLCRQNKMHE+LQLLEYMYAKGL+PD
Sbjct: 601 YCEVGNMQEMFHFLHEMEANAVVPTHVTYTVLIKGLCRQNKMHESLQLLEYMYAKGLLPD 660

Query: 661 QITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLV 720
            +TYNTIIQCFCK ++I KA Q+YN MLLHN DPT VTY VLI+ LC++GDLKD DRM+V
Sbjct: 661 SVTYNTIIQCFCKGKEITKALQLYNMMLLHNCDPTQVTYKVLINALCIFGDLKDVDRMVV 720

Query: 721 SMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRG 780
           S+ED+NI+L KV YMTIIKAHCAKGQVSKALG+FNQMLAK FVISIRDYSA+INRLCKRG
Sbjct: 721 SIEDRNITLKKVTYMTIIKAHCAKGQVSKALGYFNQMLAKGFVISIRDYSAVINRLCKRG 780

Query: 781 LITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSGVISH 834
           LITEAKY F MMLSEGVTPD EIC+T+LNAFHQ G++SS FEFLA++VKSG ISH
Sbjct: 781 LITEAKYFFVMMLSEGVTPDPEICKTVLNAFHQQGNNSSVFEFLAMVVKSGFISH 830

BLAST of CmoCh04G016450 vs. TrEMBL
Match: D7TA84_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g00630 PE=4 SV=1)

HSP 1 Score: 1055.0 bits (2727), Expect = 4.6e-305
Identity = 513/832 (61.66%), Positives = 656/832 (78.85%), Query Frame = 1

Query: 1   MLSRIHQWKPLHSLRKCGILASFSSVILARPSVSAARLEAESVTPSFVLGQNDPVCEILT 60
           ML+ I+ W+ L  LRK   L+  +S+   + SVSAA+L  ES   S     ND V +IL 
Sbjct: 1   MLNHIYPWRSL--LRKSLNLSPITSLGFTKHSVSAAKLHDESADASI---PNDAVRQILI 60

Query: 61  GLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSG 120
           GL SFG   ++ G +F+T+ S L+   VD +L SL + N D A+  F LLRN+YGFRHS 
Sbjct: 61  GLRSFGASKFLWGHHFQTLASVLNTHQVDQILLSLRVDNSDSALFLFDLLRNEYGFRHSR 120

Query: 121 FSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLA 180
            S   VSH++A KG+ KEL  V+ Q+VEE+GSGSA S C+LL N FR+WD N VVWDMLA
Sbjct: 121 VSWFIVSHVVARKGQSKELRRVLNQMVEEEGSGSAPSLCELLCNSFRDWDLNNVVWDMLA 180

Query: 181 FAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSE 240
            AYSR EM+HDALFV+AKMK LNLQ S+ TYNSLL+NLRHTD+MWD+ NEIKASG PQ+E
Sbjct: 181 CAYSRAEMVHDALFVLAKMKVLNLQVSIATYNSLLYNLRHTDIMWDVYNEIKASGVPQNE 240

Query: 241 YTTSILIHGLCAQSKLQDAISFLQDSN-EVVGPSIVSINTVMSKFCKVGLVDVARSFFCL 300
           YT  ILI GLC QS+LQDA++FL+++  E  GPS+VS N +MS FCK+G VDVA+SFFC+
Sbjct: 241 YTNPILIDGLCRQSRLQDAVTFLRETGGEEFGPSVVSFNALMSGFCKMGSVDVAKSFFCM 300

Query: 301 MVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGF 360
           M+K GLLPD YSYNIL+HGLCVAGSM+EALEFT+DME HGVEPD+VTYN LA GF +LG 
Sbjct: 301 MIKYGLLPDVYSYNILLHGLCVAGSMEEALEFTNDMENHGVEPDIVTYNILANGFRILGL 360

Query: 361 MSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSV 420
           +SGAWKVVQ+MLL GLNPD+VTYTILICGHCQMGNIEE+ KL+++ LS+G +L+I++Y+V
Sbjct: 361 ISGAWKVVQRMLLNGLNPDLVTYTILICGHCQMGNIEESFKLKEKMLSQGLKLSIVTYTV 420

Query: 421 LLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKR 480
           LLS LCK GRI+EA+ LL+EME + LKPDL+ YS         G V+ A +LYE+M  KR
Sbjct: 421 LLSSLCKSGRIDEAVILLHEMEVIGLKPDLLTYS--------RGAVEEAIELYEEMCSKR 480

Query: 481 NFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAM 540
            +PN F   A++ G FE G ISEA+ YFD++T  D+ E+IILYNIMIDGY +LG+I EA+
Sbjct: 481 IYPNSFVCSAIISGLFEKGAISEAQMYFDSVTKSDVAEEIILYNIMIDGYAKLGNIGEAV 540

Query: 541 QLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAY 600
           + Y ++ E+GI+PT+VTFN+L++GFC+ G L EA K+ + I+++GL+P+ VTYTTLMN Y
Sbjct: 541 RSYKQIIEKGISPTIVTFNSLIYGFCKKGKLAEAVKLLDTIKVHGLVPTSVTYTTLMNGY 600

Query: 601 CEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQ 660
           CE G+M  MFD+LHEMEA A+ PT ITYTV++KGLC++ ++HE++QLL+YMYA+GL PDQ
Sbjct: 601 CEEGDMHSMFDMLHEMEAKAIKPTQITYTVVVKGLCKEGRLHESVQLLKYMYARGLFPDQ 660

Query: 661 ITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVS 720
           ITYNT+IQ FCKA D+ KAFQ++N+ML H+L P+ VTYNVLI+GLCVYG+LKDADR+LV+
Sbjct: 661 ITYNTVIQSFCKAHDLQKAFQLHNQMLQHSLQPSPVTYNVLINGLCVYGNLKDADRLLVT 720

Query: 721 MEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGL 780
           ++DQ+I LTKVAY TIIKAHCAKG V  AL FF+QM+ + F +SIRDYSA+INRLCKR L
Sbjct: 721 LQDQSIRLTKVAYTTIIKAHCAKGDVQNALVFFHQMVERGFEVSIRDYSAVINRLCKRNL 780

Query: 781 ITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSGVI 832
           IT+AK+ F MML+ G+ PD +IC  MLNAFH+ GD +S FE  A+M+K G++
Sbjct: 781 ITDAKFFFCMMLTHGIPPDQDICLVMLNAFHRSGDPNSVFEIFAMMIKCGLL 819

BLAST of CmoCh04G016450 vs. TrEMBL
Match: A0A0D2RLR9_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G147900 PE=4 SV=1)

HSP 1 Score: 1017.3 bits (2629), Expect = 1.1e-293
Identity = 494/828 (59.66%), Positives = 640/828 (77.29%), Query Frame = 1

Query: 5   IHQWKPLHSLRKCGILASFSSVILARPSVSAARLEAESVTPSFVLGQNDPVCEILTGLNS 64
           +++WKP   L K  + +  SS+   +PSVS ARL  E   PS      DPV EIL+GL  
Sbjct: 2   LNKWKPFSFLAKPHVCSLLSSLTFFKPSVSVARLVEEE--PSLSHSPKDPVSEILSGLKK 61

Query: 65  FGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSGFSQL 124
            GFR ++ G  FR VV +L +  VD ++ SL +++PD AV FF L+RN+Y FRHS FS+ 
Sbjct: 62  MGFRRFLAGDYFRNVVLSLDQLQVDKIINSLRVESPDFAVVFFDLMRNEYWFRHSRFSRF 121

Query: 125 AVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYS 184
            V+H+LAG+ R KEL  V++Q+++E+GSGSA S C+LLLN FR+WD   +VWDMLAF YS
Sbjct: 122 VVAHVLAGQRRHKELRFVVEQMLKEEGSGSAPSLCELLLNGFRDWDQKSLVWDMLAFVYS 181

Query: 185 RHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSEYTTS 244
           R EM+HDAL+V+AKMKDL L+AS+ TYNSLL+NLRH  +MWD+ NEIK +GA QS+ T S
Sbjct: 182 RFEMVHDALYVLAKMKDLKLRASILTYNSLLYNLRHAYIMWDVYNEIKVAGATQSKQTNS 241

Query: 245 ILIHGLCAQSKLQDAISFLQDSN-EVVGPSIVSINTVMSKFCKVGLVDVARSFFCLMVKN 304
           I+I GLC+QSKLQDA+SFL+++  + +GPS+VS+NT+MS++CK+G  DVA+SFFC+M+K 
Sbjct: 242 IVIDGLCSQSKLQDAVSFLRETEAKGLGPSVVSLNTIMSRYCKLGFTDVAKSFFCMMLKY 301

Query: 305 GLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGFMSGA 364
           GLLPD YSYNILIHGLC+AGSM+EALEFT DMEKHGVEPD+VTYN L KGF LLG M GA
Sbjct: 302 GLLPDVYSYNILIHGLCIAGSMEEALEFTSDMEKHGVEPDIVTYNILMKGFDLLGQMGGA 361

Query: 365 WKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSVLLSC 424
           W V+Q+ML KGLNPD+VTY +LICGHCQ GN+EE LKL++E LSRGFQL+ +SYSVLLS 
Sbjct: 362 WMVIQRMLDKGLNPDVVTYMMLICGHCQNGNVEEGLKLQEEMLSRGFQLSALSYSVLLSS 421

Query: 425 LCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPN 484
           LCK+G++ EAL L  EME   ++PD I YSILIHGLCK+G VQ A  LY++M  K   PN
Sbjct: 422 LCKIGQVHEALVLFYEMENHGVEPDHITYSILIHGLCKQGEVQSALLLYKEMCSKSIPPN 481

Query: 485 YFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAMQLYY 544
             +  A+LL   +NG + EAR YFD+L   D   DI+LYNIMIDGYV+ G++ EA++LY 
Sbjct: 482 SHSAGAILLSLCKNGMVLEARMYFDSLVMNDSAHDIVLYNIMIDGYVKHGNLEEAVELYR 541

Query: 545 RMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAYCEAG 604
            + E+GITPT VTFN+L++GFC+  +  EAR++ E IRL GL P+ VTYTTLMNAYC+ G
Sbjct: 542 LITEKGITPTTVTFNSLIYGFCKRRNFTEARRLMETIRLLGLEPTAVTYTTLMNAYCKDG 601

Query: 605 NMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYN 664
           N++ M +LL EM AN + PTH+TYTV+IKGLC+Q K+HEA+QLLE M  KGL PDQ+TYN
Sbjct: 602 NLRCMMELLQEMHANCIRPTHVTYTVIIKGLCKQQKLHEAVQLLEDMRIKGLNPDQVTYN 661

Query: 665 TIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQ 724
           TIIQ FCKAR+I  AF++ NEM L+NL+PT VTY++LI+GLCVYG+LKDA+++L+S+ +Q
Sbjct: 662 TIIQYFCKARNIKTAFKLLNEMWLNNLEPTPVTYSILINGLCVYGNLKDANKLLISLHEQ 721

Query: 725 NISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEA 784
           NI LT+V Y  IIKAHC KG V  A  FF+ M+   F ISI+DY+A+INRL KR LITEA
Sbjct: 722 NIKLTRVGYTQIIKAHCVKGDVHCAFTFFHLMMEMGFEISIKDYTALINRLGKRCLITEA 781

Query: 785 KYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSGVI 832
           +  F++ML  G++PD EICE +LNA+ Q GD  S ++ LA+ +K+G++
Sbjct: 782 QQFFSIMLFHGISPDQEICEALLNAYQQCGDIISGYQMLALTIKAGLL 827
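
The percentages in each HSP summary line follow directly from the raw counts: matching (or similar) residues divided by the alignment length. The sketch below is only an illustration, assuming the summary-line layout used in this report; the helper name `parse_hsp_summary` is hypothetical and not part of any BLAST tool. It matches the second label loosely so that minor spelling variants of the label are tolerated.

```python
import re

# Assumed layout: "Identity = a/b (p%), <Pos... label> = c/d (q%), ..."
# The pattern and helper name are illustrative assumptions, not a BLAST API.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Recompute identity and positives percentages from the raw counts."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, positives, pos_length = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        # Percentages are rounded to two decimals, as in the report.
        "identity_pct": round(100 * identities / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }


summary = parse_hsp_summary(
    "Identity = 494/828 (59.66%), Positives = 640/828 (77.29%), Query Frame = 1"
)
print(summary["identity_pct"], summary["positives_pct"])
```

For the Gossypium raimondii hit above, 494 of 828 aligned positions are identical and 640 of 828 are identical or conservatively substituted, which reproduces the reported 59.66% and 77.29%.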

BLAST of CmoCh04G016450 vs. TrEMBL
Match: B9RLG0_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1466530 PE=4 SV=1)

HSP 1 Score: 995.3 bits (2572), Expect = 4.3e-287
Identity = 486/822 (59.12%), Positives = 630/822 (76.64%), Query Frame = 1

Query: 14  LRKCGILASFSSVILARPS-VSAAR----LEAESVTPSFVLGQNDPVCEILTGLNSFGFR 73
           L+   IL S SS++L++ S VS A     ++    TPS      DPV  IL+GL    F+
Sbjct: 15  LKSHQILVSLSSLVLSKSSSVSTAAASIVVDRPGTTPSVTPDPGDPVPVILSGLKYSVFK 74

Query: 74  AYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSGFSQLAVSH 133
            ++  C F+  +  L+ + VD ++E LN+++ D AV F+YLL N++GF+HS FS+L VSH
Sbjct: 75  RFMDQCLFKEKIFMLNHSQVDQIIEHLNVEDADSAVDFYYLLSNEFGFQHSRFSRLVVSH 134

Query: 134 ILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYSRHEM 193
           +LA K R  EL  V+ Q++  +GSGSA S C+LLL  FR+WDS+ VVWDMLA AYSR  M
Sbjct: 135 VLARKKRLNELRLVLDQMLLHEGSGSAPSLCELLLGSFRSWDSSNVVWDMLACAYSRSAM 194

Query: 194 IHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSEYTTSILIH 253
           +HDALFV+ KMKDLN   S+ TYNSLL+NLRH+++MWD+ NEIK SG PQSEYT+SI++ 
Sbjct: 195 VHDALFVLVKMKDLNFIVSIQTYNSLLYNLRHSNIMWDVYNEIKVSGTPQSEYTSSIVVD 254

Query: 254 GLCAQSKLQDAISFLQDSN-EVVGPSIVSINTVMSKFCKVGLVDVARSFFCLMVKNGLLP 313
           GLC QS+ QDA+ F QD+  +   PS+VS NT+MS++CK+G VDVA+SFFC+M+K+GLLP
Sbjct: 255 GLCRQSRFQDAVLFFQDTEGKEFQPSVVSFNTIMSRYCKLGFVDVAKSFFCMMLKHGLLP 314

Query: 314 DSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGFMSGAWKVV 373
           D+YSYNILIHGLC+AGSM EAL+  +DME HG+EPD+VTYN LAKGF LLG ++GAW ++
Sbjct: 315 DAYSYNILIHGLCIAGSMGEALDLKNDMENHGLEPDMVTYNILAKGFRLLGLINGAWNII 374

Query: 374 QKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSVLLSCLCKV 433
           QKML+KG NP++VTYT+LICGHCQ+GN+EEALKL +E +S GFQL+IIS +VLL  LCK 
Sbjct: 375 QKMLIKGPNPNLVTYTVLICGHCQIGNVEEALKLYKEMISHGFQLSIISSTVLLGSLCKS 434

Query: 434 GRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFAQ 493
            +++ A  L  EME   L+PDLI YS LIHGLCK+G VQ+A  LYE+M   R  PN    
Sbjct: 435 RQVDVAFKLFCEMEANGLRPDLITYSTLIHGLCKQGEVQQAILLYEKMCSNRIIPNSLIH 494

Query: 494 RAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAMQLYYRMFE 553
            A+L+G  E G IS+AR YFD L   +L  DIILYNIMIDGY++ G+  EA++LY ++ E
Sbjct: 495 GAILMGLCEKGKISQARMYFDYLITSNLSLDIILYNIMIDGYIKRGNTREAVKLYKQLGE 554

Query: 554 RGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAYCEAGNMQE 613
           +GI+PT+VTFN+L++GFC N  L +AR++ + I+L+GL P+ VTYTTLMN YCE GNMQ 
Sbjct: 555 KGISPTIVTFNSLMYGFCINRKLSQARRLLDTIKLHGLEPNAVTYTTLMNVYCEEGNMQS 614

Query: 614 MFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQ 673
           + +LL EM+A A+ PTHITYTV+IKGLC+Q K+ E+ QLLE M A GL PDQ++YNTIIQ
Sbjct: 615 LLELLSEMKAKAIGPTHITYTVVIKGLCKQWKLQESCQLLEDMDAVGLTPDQVSYNTIIQ 674

Query: 674 CFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISL 733
            FCKARD+ KAFQ+Y++MLLHNL+PT VTYN+LI+G CVYGDLKDAD +LVS++++ ++L
Sbjct: 675 AFCKARDMRKAFQLYDKMLLHNLEPTSVTYNILINGFCVYGDLKDADNLLVSLQNRKVNL 734

Query: 734 TKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLF 793
            K AY TIIKAHCAKG V KA+ +F QM+ K F +SIRDYSA+I RLCKR L+TEAKY F
Sbjct: 735 NKYAYTTIIKAHCAKGDVDKAVVYFRQMVEKGFEVSIRDYSAVIGRLCKRCLVTEAKYFF 794

Query: 794 AMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSG 830
            MMLS+GV PD ++ E +LNAFHQ G  +S FE LA M+KSG
Sbjct: 795 CMMLSDGVCPDQDLFEVLLNAFHQCGHLNSEFELLAEMIKSG 836

BLAST of CmoCh04G016450 vs. TrEMBL
Match: A0A068TYQ6_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00032443001 PE=4 SV=1)

HSP 1 Score: 908.3 bits (2346), Expect = 6.9e-261
Identity = 454/816 (55.64%), Positives = 599/816 (73.41%), Query Frame = 1

Query: 22  SFSSVILARPSVSAARLEAESVTPSFVLGQNDPVCEILTGLNSF------GFRAYVG-GC 81
           S +S+   RP  S A L A +      +  ND    I T L +F      GF   +G   
Sbjct: 16  SLTSLFFFRPIFSYATLAAVN---HLEVPPNDVSSAIFTRLINFNCGDKRGFAKRLGRDP 75

Query: 82  NFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSGFSQLAVSHILAGKG 141
            F T++S LS   VDG+LE L I+ P+ A+ FF+LL+N+Y F+HS  S ++++H+LA K 
Sbjct: 76  EFNTLISGLSAPEVDGILEKLRIKYPETALDFFFLLKNEYDFKHSRDSCISIAHVLARKE 135

Query: 142 RFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLAFAYSRHEMIHDALF 201
           RF+ L   + Q+V  +GSGSA S C+LL N FR  D +  VWDMLAFAYSR  M+HDALF
Sbjct: 136 RFRALKLHLLQMVHLEGSGSAPSLCELLSNGFRESDFSHTVWDMLAFAYSRSGMVHDALF 195

Query: 202 VVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSEYTTSILIHGLCAQS 261
           V+ KMKDLN+QAS+ T N LL+NLR TDVMWD+ + IKASG   S YT SI+I GLC QS
Sbjct: 196 VLFKMKDLNVQASIMTLNGLLYNLRLTDVMWDMNDVIKASGIRPSSYTNSIIIDGLCRQS 255

Query: 262 KLQDAISFLQDSN-EVVGPSIVSINTVMSKFCKVGLVDVARSFFCLMVKNGLLPDSYSYN 321
            +++A++F+Q++  E  GP IV +N +M+ FCK+G V+VA+SFFC+M K GLLPD+YSYN
Sbjct: 256 LVEEAVAFMQEAEKEESGPRIVWLNNLMTGFCKLGFVNVAKSFFCIMHKCGLLPDTYSYN 315

Query: 322 ILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGFMSGAWKVVQKMLLK 381
           ILI+GLC+AGSM+EALEFT DMEKHG+EPD+VTYNTLAKGF LLG MSGAWKV+  ML K
Sbjct: 316 ILINGLCIAGSMEEALEFTSDMEKHGLEPDIVTYNTLAKGFSLLGLMSGAWKVISLMLYK 375

Query: 382 GLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSVLLSCLCKVGRIEEA 441
           GLNP+++TYTILICGHCQ GNI+E  KLR+E LSRG QL  ISY V++SCLCK G + EA
Sbjct: 376 GLNPNLITYTILICGHCQTGNIKECFKLREEMLSRGMQLTNISYGVMISCLCKRGNVNEA 435

Query: 442 LALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFAQRAVLLG 501
           L+L +EM+T+ L+ D+++YSILIHGLCK+G +  A  LY++M L+R  PN F QR++LL 
Sbjct: 436 LSLFDEMKTIGLEADVVIYSILIHGLCKQGRLHHAIHLYKEMCLERVMPNLFTQRSILLA 495

Query: 502 FFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAMQLYYRMFERGITPT 561
             E G I EARRYFD L H DL+EDI L NIM+  Y ++G + EA+QLY  + E+GITPT
Sbjct: 496 LSEKGTIKEARRYFDTLMHCDLLEDIGLCNIMLYSYAKVGYMDEAIQLYRMILEKGITPT 555

Query: 562 VVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLH 621
           VVTFN++++GFC++  L +AR     I  +GL+PS VTYTTLMNA+CE  +MQ MF LL 
Sbjct: 556 VVTFNSVIYGFCKSRRLADARIWLNAIESHGLVPSAVTYTTLMNAFCEERDMQAMFKLLK 615

Query: 622 EMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKAR 681
           EMEA A+ PTH+TYTV+IKGLCRQ K+ EA+ +L+ M+AKG+ PD+I+YN IIQ  CK +
Sbjct: 616 EMEARAIEPTHVTYTVVIKGLCRQRKVKEAVGVLQDMFAKGVSPDEISYNIIIQSLCKTQ 675

Query: 682 DIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYM 741
           D+ +AFQ+++EMLL NL P HVTYN+LI+GLCV G+LKDA+++L S++DQ + LTKVAY 
Sbjct: 676 DMKRAFQLHDEMLLRNLQPNHVTYNILINGLCVRGNLKDAEKLLASLQDQKVRLTKVAYT 735

Query: 742 TIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSE 801
           T+IKA CAKG V KA+  F+QM+   + +S+RD SA++NRLCKR LI++AK    ++L  
Sbjct: 736 TLIKALCAKGNVHKAIVLFHQMVEMGYQVSVRDCSAVVNRLCKRHLISDAKAFLRLILQC 795

Query: 802 GVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSG 830
           G+  D +IC  + N  ++  D     + LA+MVK G
Sbjct: 796 GIALDQQICSVLRNNLYRIHDKDMMVQLLALMVKCG 828

BLAST of CmoCh04G016450 vs. TAIR10
Match: AT1G13630.1 (AT1G13630.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 727.2 bits (1876), Expect = 1.1e-209
Identity = 368/738 (49.86%), Positives = 511/738 (69.24%), Query Frame = 1

Query: 57  EILTGLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGF 116
           EIL G+   GFR ++ G +FR +VS L    V+ +++ L  ++ D++V FF  LR+ Y F
Sbjct: 21  EILFGMKKIGFREFLHGYHFRGLVSELRHVHVEEIMDELMSESSDLSVWFFKELRDIYAF 80

Query: 117 RHSGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVW 176
           RHS FS L VSH+LAG+ RFKEL  +++QL++E+G+             FR W+S G+VW
Sbjct: 81  RHSSFSTLLVSHVLAGQRRFKELQVILEQLLQEEGT-------------FRKWESTGLVW 140

Query: 177 DMLAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGA 236
           DML F  SR  M+ D+L+++ KMKD NL  S  +YNS+L++ R TD MWD+  EIK    
Sbjct: 141 DMLLFLSSRLRMVDDSLYILKKMKDQNLNVSTQSYNSVLYHFRETDKMWDVYKEIK---- 200

Query: 237 PQSEYTTSILIHGLCAQSKLQDAISFLQDSN-EVVGPSIVSINTVMSKFCKVGLVDVARS 296
            ++E+T S ++ GLC Q KL+DA+ FL+ S  + +GPS+VS N++MS +CK+G VD+A+S
Sbjct: 201 DKNEHTYSTVVDGLCRQQKLEDAVLFLRTSEWKDIGPSVVSFNSIMSGYCKLGFVDMAKS 260

Query: 297 FFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFL 356
           FFC ++K GL+P  YS+NILI+GLC+ GS+ EALE   DM KHGVEPD VTYN LAKGF 
Sbjct: 261 FFCTVLKCGLVPSVYSHNILINGLCLVGSIAEALELASDMNKHGVEPDSVTYNILAKGFH 320

Query: 357 LLGFMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLN-I 416
           LLG +SGAW+V++ ML KGL+PD++TYTIL+CG CQ+GNI+  L L ++ LSRGF+LN I
Sbjct: 321 LLGMISGAWEVIRDMLDKGLSPDVITYTILLCGQCQLGNIDMGLVLLKDMLSRGFELNSI 380

Query: 417 ISYSVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQ 476
           I  SV+LS LCK GRI+EAL+L N+M+   L PDL+ YSI+IHGLCK G    A  LY++
Sbjct: 381 IPCSVMLSGLCKTGRIDEALSLFNQMKADGLSPDLVAYSIVIHGLCKLGKFDMALWLYDE 440

Query: 477 MRLKRNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGD 536
           M  KR  PN     A+LLG  + G + EAR   D+L       DI+LYNI+IDGY + G 
Sbjct: 441 MCDKRILPNSRTHGALLLGLCQKGMLLEARSLLDSLISSGETLDIVLYNIVIDGYAKSGC 500

Query: 537 ISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTT 596
           I EA++L+  + E GITP+V TFN+L++G+C+  ++ EARK+ ++I+L GL PSVV+YTT
Sbjct: 501 IEEALELFKVVIETGITPSVATFNSLIYGYCKTQNIAEARKILDVIKLYGLAPSVVSYTT 560

Query: 597 LMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCR----QNKMH--------E 656
           LM+AY   GN + + +L  EM+A  + PT++TY+V+ KGLCR    +N  H        +
Sbjct: 561 LMDAYANCGNTKSIDELRREMKAEGIPPTNVTYSVIFKGLCRGWKHENCNHVLRERIFEK 620

Query: 657 ALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLIS 716
             Q L  M ++G+ PDQITYNTIIQ  C+ + ++ AF     M   NLD +  TYN+LI 
Sbjct: 621 CKQGLRDMESEGIPPDQITYNTIIQYLCRVKHLSGAFVFLEIMKSRNLDASSATYNILID 680

Query: 717 GLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVI 776
            LCVYG ++ AD  + S+++QN+SL+K AY T+IKAHC KG    A+  F+Q+L + F +
Sbjct: 681 SLCVYGYIRKADSFIYSLQEQNVSLSKFAYTTLIKAHCVKGDPEMAVKLFHQLLHRGFNV 740

Query: 777 SIRDYSAIINRLCKRGLI 781
           SIRDYSA+INRLC+R L+
Sbjct: 741 SIRDYSAVINRLCRRHLM 741

BLAST of CmoCh04G016450 vs. TAIR10
Match: AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 298.5 bits (763), Expect = 1.3e-80
Identity = 184/640 (28.75%), Positives = 327/640 (51.09%), Query Frame = 1

Query: 170 DSNGVVWDMLAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNL---RHTDVMWD 229
           D N V +++L     + + + +A+ +   +   +L+  V TY +L++ L   +  ++  +
Sbjct: 259 DVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLE 318

Query: 230 ICNEIKASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFC 289
           + +E+       SE   S L+ GL  + K+++A++ ++   +  V P++   N ++   C
Sbjct: 319 MMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLC 378

Query: 290 KVGLVDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVV 349
           K      A   F  M K GL P+  +Y+ILI   C  G +D AL F  +M   G++  V 
Sbjct: 379 KGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVY 438

Query: 350 TYNTLAKGFLLLGFMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQET 409
            YN+L  G    G +S A   + +M+ K L P +VTYT L+ G+C  G I +AL+L  E 
Sbjct: 439 PYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEM 498

Query: 410 LSRGFQLNIISYSVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFV 469
             +G   +I +++ LLS L + G I +A+ L NEM    +KP+ + Y+++I G C+EG +
Sbjct: 499 TGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDM 558

Query: 470 QRAYQLYEQMRLKRNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIM 529
            +A++  ++M  K   P+ ++ R ++ G    G  SEA+ + D L   +   + I Y  +
Sbjct: 559 SKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGL 618

Query: 530 IDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGL 589
           + G+ R G + EA+ +   M +RG+   +V +  L+ G  ++ D      + + +   GL
Sbjct: 619 LHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGL 678

Query: 590 LPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQ 649
            P  V YT++++A  + G+ +E F +   M     VP  +TYT +I GLC+   ++EA  
Sbjct: 679 KPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEV 738

Query: 650 LLEYMYAKGLMPDQITYNTIIQCFCKAR-DIAKAFQVYNEMLLHNLDPTHVTYNVLISGL 709
           L   M     +P+Q+TY   +    K   D+ KA +++N +L   L  T  TYN+LI G 
Sbjct: 739 LCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVELHNAILKGLLANT-ATYNMLIRGF 798

Query: 710 CVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISI 769
           C  G +++A  ++  M    +S   + Y T+I   C +  V KA+  +N M  K      
Sbjct: 799 CRQGRIEEASELITRMIGDGVSPDCITYTTMINELCRRNDVKKAIELWNSMTEKGIRPDR 858

Query: 770 RDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEICET 805
             Y+ +I+  C  G + +A  L   ML +G+ P+++   T
Sbjct: 859 VAYNTLIHGCCVAGEMGKATELRNEMLRQGLIPNNKTSRT 897

BLAST of CmoCh04G016450 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 293.9 bits (751), Expect = 3.1e-79
Identity = 170/572 (29.72%), Positives = 296/572 (51.75%), Query Frame = 1

Query: 263 LQDSNEVVGPSIVSINTVMSKFCKVGLVDVARSFFCLMVKNGLLPDSYSYNILIHGLCVA 322
           LQ++ ++   +    + V+  + ++ L+D A S   L   +G +P   SYN ++     +
Sbjct: 123 LQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRS 182

Query: 323 G-SMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGFMSGAWKVVQKMLLKGLNPDIVT 382
             ++  A     +M +  V P+V TYN L +GF   G +  A  +  KM  KG  P++VT
Sbjct: 183 KRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVT 242

Query: 383 YTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSVLLSCLCKVGRIEEALALLNEME 442
           Y  LI G+C++  I++  KL +    +G + N+ISY+V+++ LC+ GR++E   +L EM 
Sbjct: 243 YNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMN 302

Query: 443 TLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKRNFPNYFAQRAVLLGFFENGNIS 502
                 D + Y+ LI G CKEG   +A  ++ +M      P+     +++    + GN++
Sbjct: 303 RRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMN 362

Query: 503 EARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAMQLYYRMFERGITPTVVTFNTLV 562
            A  + D +    L  +   Y  ++DG+ + G ++EA ++   M + G +P+VVT+N L+
Sbjct: 363 RAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTYNALI 422

Query: 563 HGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAYCEAGNMQEMFDLLHEMEANAVV 622
           +G C  G + +A  + E ++  GL P VV+Y+T+++ +C + ++ E   +  EM    + 
Sbjct: 423 NGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIK 482

Query: 623 PTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQITYNTIIQCFCKARDIAKAFQV 682
           P  ITY+ LI+G C Q +  EA  L E M   GL PD+ TY  +I  +C   D+ KA Q+
Sbjct: 483 PDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQL 542

Query: 683 YNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVSMEDQNISLTKVAYMTIIKAHCA 742
           +NEM+   + P  VTY+VLI+GL      ++A R+L+ +  +    + V Y T+I+ +C+
Sbjct: 543 HNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIE-NCS 602

Query: 743 KGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGLITEAKYLFAMMLSEGVTPDSEI 802
                               I  +   ++I   C +G++TEA  +F  ML +   PD   
Sbjct: 603 N-------------------IEFKSVVSLIKGFCMKGMMTEADQVFESMLGKNHKPDGTA 662

Query: 803 CETMLNAFHQHGDSSSAFEFLAVMVKSGVISH 834
              M++   + GD   A+     MVKSG + H
Sbjct: 663 YNIMIHGHCRAGDIRKAYTLYKEMVKSGFLLH 674

BLAST of CmoCh04G016450 vs. TAIR10
Match: AT5G01110.1 (AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 293.1 bits (749), Expect = 5.3e-79
Identity = 171/617 (27.71%), Positives = 307/617 (49.76%), Query Frame = 1

Query: 116 FRHSGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVV 175
           F+H+  S  A+ HIL   GR  +    + +++   G  S     + L + F N  SN  V
Sbjct: 109 FKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRSGV-SRLEIVNSLDSTFSNCGSNDSV 168

Query: 176 WDMLAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRH---TDVMWDICNEIK 235
           +D+L   Y +   + +A      ++      S+   N+L+ +L      ++ W +  EI 
Sbjct: 169 FDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQEIS 228

Query: 236 ASGAPQSEYTTSILIHGLCAQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFCKVGLVD 295
            SG   + YT +I+++ LC   K++   +FL    E  V P IV+ NT++S +   GL++
Sbjct: 229 RSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKGLME 288

Query: 296 VARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLA 355
            A      M   G  P  Y+YN +I+GLC  G  + A E   +M + G+ PD  TY +L 
Sbjct: 289 EAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYRSLL 348

Query: 356 KGFLLLGFMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQ 415
                 G +    KV   M  + + PD+V ++ ++    + GN+++AL         G  
Sbjct: 349 MEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEAGLI 408

Query: 416 LNIISYSVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQL 475
            + + Y++L+   C+ G I  A+ L NEM       D++ Y+ ++HGLCK   +  A +L
Sbjct: 409 PDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEADKL 468

Query: 476 YEQMRLKRNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVR 535
           + +M  +  FP+ +    ++ G  + GN+  A   F  +    +  D++ YN ++DG+ +
Sbjct: 469 FNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDGFGK 528

Query: 536 LGDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVT 595
           +GDI  A +++  M  + I PT ++++ LV+  C  G L EA ++++ +    + P+V+ 
Sbjct: 529 VGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPTVMI 588

Query: 596 YTTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMY 655
             +++  YC +GN  +    L +M +   VP  I+Y  LI G  R+  M +A  L++ M 
Sbjct: 589 CNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSKAFGLVKKME 648

Query: 656 AK--GLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGD 715
            +  GL+PD  TYN+I+  FC+   + +A  V  +M+   ++P   TY  +I+G     +
Sbjct: 649 EEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPDRSTYTCMINGFVSQDN 708

Query: 716 LKDADRMLVSMEDQNIS 727
           L +A R+   M  +  S
Sbjct: 709 LTEAFRIHDEMLQRGFS 724

BLAST of CmoCh04G016450 vs. TAIR10
Match: AT1G62670.1 (AT1G62670.1 rna processing factor 2)

HSP 1 Score: 285.8 bits (730), Expect = 8.5e-77
Identity = 171/591 (28.93%), Positives = 300/591 (50.76%), Query Frame = 1

Query: 234 SGAPQSEYTTSILIHGLCAQSKLQDAISFLQDS-NEVVGPSIVSINTVMSKFCKVGLVDV 293
           SG    +Y   +  +GL ++ KL DA++   +       PSI+  + ++S   K+   DV
Sbjct: 41  SGKTSYDYREKLSRNGL-SELKLDDAVALFGEMVKSRPFPSIIEFSKLLSAIAKMNKFDV 100

Query: 294 ARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAK 353
             S    M   G+  + Y+Y+ILI+  C    +  AL     M K G EP++VT ++L  
Sbjct: 101 VISLGEQMQNLGIPHNHYTYSILINCFCRRSQLPLALAVLGKMMKLGYEPNIVTLSSLLN 160

Query: 354 GFLLLGFMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQL 413
           G+     +S A  +V +M + G  P+ VT+  LI G        EA+ L    +++G Q 
Sbjct: 161 GYCHSKRISEAVALVDQMFVTGYQPNTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQP 220

Query: 414 NIISYSVLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLY 473
           ++++Y V+++ LCK G  + A  LLN+ME  +L+P +++Y+ +I GLCK   +  A  L+
Sbjct: 221 DLVTYGVVVNGLCKRGDTDLAFNLLNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLF 280

Query: 474 EQMRLKRNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRL 533
           ++M  K   PN     +++      G  S+A R    +    +  D+  ++ +ID +V+ 
Sbjct: 281 KEMETKGIRPNVVTYSSLISCLCNYGRWSDASRLLSDMIERKINPDVFTFSALIDAFVKE 340

Query: 534 GDISEAMQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTY 593
           G + EA +LY  M +R I P++VT+++L++GFC +  L EA++MFE +      P VVTY
Sbjct: 341 GKLVEAEKLYDEMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTY 400

Query: 594 TTLMNAYCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYA 653
            TL+  +C+   ++E  ++  EM    +V   +TY +LI+GL +      A ++ + M +
Sbjct: 401 NTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVS 460

Query: 654 KGLMPDQITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKD 713
            G+ P+ +TYNT++   CK   + KA  V+  +    ++PT  TYN++I G+C  G ++D
Sbjct: 461 DGVPPNIMTYNTLLDGLCKNGKLEKAMVVFEYLQRSKMEPTIYTYNIMIEGMCKAGKVED 520

Query: 714 ADRMLVSMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIIN 773
              +  ++  + +    VAY T+I   C KG   +A   F +M     + +   Y+ +I 
Sbjct: 521 GWDLFCNLSLKGVKPDVVAYNTMISGFCRKGSKEEADALFKEMKEDGTLPNSGCYNTLIR 580

Query: 774 RLCKRGLITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLA 824
              + G    +  L   M S G   D+     + N  H      S  + L+
Sbjct: 581 ARLRDGDREASAELIKEMRSCGFAGDASTIGLVTNMLHDGRLDKSFLDMLS 630

BLAST of CmoCh04G016450 vs. NCBI nr
Match: gi|659130189|ref|XP_008465042.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucumis melo])

HSP 1 Score: 1438.3 bits (3722), Expect = 0.0e+00
Identity = 710/835 (85.03%), Positives = 765/835 (91.62%), Query Frame = 1

Query: 1   MLSRIHQWKPLHSLRKCGILASFSSVILARPSVS--AARLEAESVTPSFVLGQNDPVCEI 60
           MLSRIHQWKPLH +         SSVILARPSVS  AARLE  +VT SF   QND V EI
Sbjct: 1   MLSRIHQWKPLHWIFA-------SSVILARPSVSVSAARLEPATVTTSFFPDQNDSVREI 60

Query: 61  LTGLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRH 120
           LTGLNS GFRAYVGGCNFRTVVSTLSETVVDGVL+SL    PDVAVAFFYLL N+YGFRH
Sbjct: 61  LTGLNSLGFRAYVGGCNFRTVVSTLSETVVDGVLDSLRTLKPDVAVAFFYLLINEYGFRH 120

Query: 121 SGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDM 180
           S FSQ  VSHILAG+GRFKELH VIK L+EEQG GSAS+FCDLLLNKFRNWDSNGVVWDM
Sbjct: 121 SRFSQFVVSHILAGEGRFKELHSVIKHLIEEQGLGSASTFCDLLLNKFRNWDSNGVVWDM 180

Query: 181 LAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQ 240
           LAFAYSRHEMIHDALFV AKMKDLNLQASVPTYNSLLHNLRHTD++WD+ NEIK SGAPQ
Sbjct: 181 LAFAYSRHEMIHDALFVFAKMKDLNLQASVPTYNSLLHNLRHTDIIWDVYNEIKVSGAPQ 240

Query: 241 SEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVGLVDVARSFFC 300
           SEYTTSILIHGLC QSK++DAISFLQDSNEVVGPS VSINT+MSKFCKVGL+DVARSFFC
Sbjct: 241 SEYTTSILIHGLCEQSKIEDAISFLQDSNEVVGPSTVSINTIMSKFCKVGLIDVARSFFC 300

Query: 301 LMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLG 360
           L+VK+GLL DS+SYNIL+HGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLG
Sbjct: 301 LLVKSGLLHDSFSYNILVHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLG 360

Query: 361 FMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYS 420
            MSGA KVVQKMLL+GLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGF+LNIISYS
Sbjct: 361 LMSGARKVVQKMLLQGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYS 420

Query: 421 VLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLK 480
           VLLSCLCKVGRIEEAL L +EMETL LKPD IVYSILIHGLCKEGFVQRAYQLYEQM LK
Sbjct: 421 VLLSCLCKVGRIEEALTLFDEMETLHLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLK 480

Query: 481 RNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEA 540
           R FP+YFAQRAVLLG F+NGNISEAR+YFD L  MDLIED++LYNIMIDGYVRLGDI+EA
Sbjct: 481 RIFPHYFAQRAVLLGLFKNGNISEARKYFDTLNRMDLIEDVVLYNIMIDGYVRLGDIAEA 540

Query: 541 MQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNA 600
           MQLYY M ERGITP+VVTFNTL++GFCR GDL+EARKM ++IRL GL+PSVVTYTTLMNA
Sbjct: 541 MQLYYNMIERGITPSVVTFNTLINGFCRRGDLMEARKMLDVIRLKGLVPSVVTYTTLMNA 600

Query: 601 YCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPD 660
           YCE GNMQEMF  LHEMEANAVVPTH+TYTVLIKGLCRQNKMHE+LQLLEYMYAKGL+PD
Sbjct: 601 YCEVGNMQEMFHFLHEMEANAVVPTHVTYTVLIKGLCRQNKMHESLQLLEYMYAKGLVPD 660

Query: 661 QITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLV 720
            +TYNTIIQCFCK ++I KAFQ+YN+MLLHN DPTHVTYNVLI+GLC+YGDLKD DRM+V
Sbjct: 661 PVTYNTIIQCFCKGKEITKAFQLYNKMLLHNCDPTHVTYNVLINGLCIYGDLKDVDRMVV 720

Query: 721 SMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRG 780
           SMED+NI LTKVAYMTII+AHCAKGQVSKALG+FNQMLAK+FVISIRDYSA+INRLCKRG
Sbjct: 721 SMEDRNIILTKVAYMTIIQAHCAKGQVSKALGYFNQMLAKNFVISIRDYSAVINRLCKRG 780

Query: 781 LITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSGVISH 834
           LITEAKY F MMLSEG+TPD EICET+LNAFHQ GD+SS FEFLA++VKSG ISH
Sbjct: 781 LITEAKYFFVMMLSEGITPDPEICETVLNAFHQQGDNSSVFEFLAMVVKSGFISH 828

BLAST of CmoCh04G016450 vs. NCBI nr
Match: gi|449453449|ref|XP_004144470.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucumis sativus])

HSP 1 Score: 1388.6 bits (3593), Expect = 0.0e+00
Identity = 687/835 (82.28%), Positives = 747/835 (89.46%), Query Frame = 1

Query: 1   MLSRIHQWKPLHSLRKCGILASFSSVILARPSVS--AARLEAESVTPSFVLGQNDPVCEI 60
           MLSR HQ KPLH      I AS SSVILARPSVS  AARLE  +VT SFV  QND V EI
Sbjct: 1   MLSRAHQCKPLH-----WIFASLSSVILARPSVSVSAARLEPATVTTSFVSDQNDSVREI 60

Query: 61  LTGLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRH 120
           L GLNS GFRAYVGGCNFRTVVSTLSETVVDGVL+ L    PDVAVAFFY L N+YGFRH
Sbjct: 61  LIGLNSLGFRAYVGGCNFRTVVSTLSETVVDGVLDRLRTLKPDVAVAFFYFLINEYGFRH 120

Query: 121 SGFSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDM 180
           S FSQ  VSHILAGKGRFKEL  VIK L+ +QG GSAS  CDLLL KFRNWDSNG+VWDM
Sbjct: 121 SIFSQFVVSHILAGKGRFKELDSVIKNLIVDQGLGSASIICDLLLEKFRNWDSNGLVWDM 180

Query: 181 LAFAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQ 240
           LAFAYSRHEMIHDALFV+AKMKDLN QASVPTYNSLLHN+RHTD+MWD+ NEIK SGAPQ
Sbjct: 181 LAFAYSRHEMIHDALFVIAKMKDLNFQASVPTYNSLLHNMRHTDIMWDVYNEIKVSGAPQ 240

Query: 241 SEYTTSILIHGLCAQSKLQDAISFLQDSNEVVGPSIVSINTVMSKFCKVGLVDVARSFFC 300
           SE TTSILIHGLC QSKL+DAISFL DSN+VVGPSIVSINT+MSKFCKVGL+DVARSFFC
Sbjct: 241 SECTTSILIHGLCEQSKLEDAISFLHDSNKVVGPSIVSINTIMSKFCKVGLIDVARSFFC 300

Query: 301 LMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLG 360
           LMVKNGLL DS+SYNIL+HGLCVAGSMDEAL FTDDMEKHGVEPDVVTYNTLAKGFLLLG
Sbjct: 301 LMVKNGLLHDSFSYNILLHGLCVAGSMDEALGFTDDMEKHGVEPDVVTYNTLAKGFLLLG 360

Query: 361 FMSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYS 420
            MSGA KVVQKMLL+GLNPD+VTYT LICGHCQMGNIEEALKLRQETLSRGF+LN+I Y+
Sbjct: 361 LMSGARKVVQKMLLQGLNPDLVTYTTLICGHCQMGNIEEALKLRQETLSRGFKLNVIFYN 420

Query: 421 VLLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLK 480
           +LLSCLCKVGRIEEAL L +EMETLRL+PD IVYSILIHGLCKEGFVQRAYQLYEQMRLK
Sbjct: 421 MLLSCLCKVGRIEEALTLFDEMETLRLEPDFIVYSILIHGLCKEGFVQRAYQLYEQMRLK 480

Query: 481 RNFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEA 540
           R FP++FAQRAVLLG F+NGNISEAR YFD  T MDL+ED++LYNIMIDGYVRL  I+EA
Sbjct: 481 RKFPHHFAQRAVLLGLFKNGNISEARNYFDTWTRMDLMEDVVLYNIMIDGYVRLDGIAEA 540

Query: 541 MQLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNA 600
           MQLYY+M ERGITP+VVTFNTL++GFCR GDL+EARKM E+IRL GL+PSVVTYTTLMNA
Sbjct: 541 MQLYYKMIERGITPSVVTFNTLINGFCRRGDLMEARKMLEVIRLKGLVPSVVTYTTLMNA 600

Query: 601 YCEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPD 660
           YCE GNMQEMF  LHEMEANAVVPTH+TYTVLIKGLCRQNKMHE+LQLLEYMYAKGL+PD
Sbjct: 601 YCEVGNMQEMFHFLHEMEANAVVPTHVTYTVLIKGLCRQNKMHESLQLLEYMYAKGLLPD 660

Query: 661 QITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLV 720
            +TYNTIIQCFCK ++I KA Q+YN MLLHN DPT VTY VLI+ LC++GDLKD DRM+V
Sbjct: 661 SVTYNTIIQCFCKGKEITKALQLYNMMLLHNCDPTQVTYKVLINALCIFGDLKDVDRMVV 720

Query: 721 SMEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRG 780
           S+ED+NI+L KV YMTIIKAHCAKGQVSKALG+FNQMLAK FVISIRDYSA+INRLCKRG
Sbjct: 721 SIEDRNITLKKVTYMTIIKAHCAKGQVSKALGYFNQMLAKGFVISIRDYSAVINRLCKRG 780

Query: 781 LITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSGVISH 834
           LITEAKY F MMLSEGVTPD EIC+T+LNAFHQ G++SS FEFLA++VKSG ISH
Sbjct: 781 LITEAKYFFVMMLSEGVTPDPEICKTVLNAFHQQGNNSSVFEFLAMVVKSGFISH 830

BLAST of CmoCh04G016450 vs. NCBI nr
Match: gi|658009094|ref|XP_008339746.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 isoform X1 [Malus domestica])

HSP 1 Score: 1114.8 bits (2882), Expect = 0.0e+00
Identity = 539/830 (64.94%), Positives = 665/830 (80.12%), Query Frame = 1

Query: 1   MLSRIHQWKPLHSLRKCGILASFSSVILARPSVSAARLEAESVTPSFVLGQNDPVCEILT 60
           ML  IH+WKPLH L+K  ILA  SS+I  +PS SAA+LE E    + +    + V E++T
Sbjct: 1   MLHHIHKWKPLHFLQKFQILAPLSSLIFTKPSASAAKLEDELAAAAAIPNPRNTVSEVIT 60

Query: 61  GLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSG 120
           GL  FG R ++G C FRT+VS L++  VD ++ESL++++ D A  FF  LRN+ GFRHS 
Sbjct: 61  GLGIFGLRKFLGNCYFRTMVSKLNQPEVDLIIESLSLESSDSAFGFFKFLRNECGFRHSR 120

Query: 121 FSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLA 180
            S+  V H+LA   +F+EL  V+KQ+V+E+G GSA S C+LLL +FR+WDS+ VVWDMLA
Sbjct: 121 ISEFIVVHVLATNWQFQELRSVVKQMVDEEGPGSAPSLCELLLYRFRDWDSSSVVWDMLA 180

Query: 181 FAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSE 240
           FAYSR EM+HDAL V+AKMKDLNL+ S  TYN LLHNLRHTD+MW++ NEIK SG P+S+
Sbjct: 181 FAYSRSEMVHDALSVLAKMKDLNLKVSTSTYNCLLHNLRHTDIMWNVYNEIKDSGTPESD 240

Query: 241 YTTSILIHGLCAQSKLQDAISFLQDSNEV-VGPSIVSINTVMSKFCKVGLVDVARSFFCL 300
           YTTSILI GLC QS +QDA+SFL D+     GPS+VS NT+MS+FCK+G VDVA+SFFC+
Sbjct: 241 YTTSILIDGLCQQSSVQDAVSFLMDAERTETGPSVVSFNTIMSRFCKLGFVDVAKSFFCV 300

Query: 301 MVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGF 360
             K GL+PDSYSYNILIHGLCVAGS++EALEFT DME+HGV+PD VTYN L KGF LLG 
Sbjct: 301 XXKYGLVPDSYSYNILIHGLCVAGSLEEALEFTKDMERHGVQPDTVTYNILCKGFHLLGL 360

Query: 361 MSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSV 420
           MSGA KV+QKML+KGLNPD VTYTI+ICGHC +GNI+EALKL++E +SRGFQL++I YSV
Sbjct: 361 MSGARKVIQKMLVKGLNPDHVTYTIMICGHCHVGNIDEALKLQKEMISRGFQLSVIVYSV 420

Query: 421 LLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKR 480
           LLS +CK GR+E AL LL EME + L+PDLI YSILIHGLCK+G VQRA ++Y +M +KR
Sbjct: 421 LLSSMCKSGRVEXALRLLYEMEAVGLEPDLITYSILIHGLCKQGDVQRASEIYREMYMKR 480

Query: 481 NFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAM 540
             PNYFA RA+LLG  E G+I EAR+YFD LT   + EDI+LYNIM+DGYV+LG+++EA+
Sbjct: 481 IIPNYFAHRAILLGLREKGDIYEARKYFDHLTTRAVTEDIVLYNIMMDGYVKLGNVAEAI 540

Query: 541 QLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAY 600
           QLY ++ E+G+ P+ VTFNTL+HGFC+NG LVEAR+M + I L+GLLPS VTYTTLMNA 
Sbjct: 541 QLYKQIIEKGLNPSTVTFNTLIHGFCKNGKLVEARRMLDTIELHGLLPSPVTYTTLMNAN 600

Query: 601 CEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQ 660
           CE GN+  M +LL EMEA  V PTH++YTV+IKGLCRQ K  +A+ L+E MYAKGL PDQ
Sbjct: 601 CEQGNINGMXELLXEMEAKDVEPTHVSYTVVIKGLCRQGKRWDAVHLVEEMYAKGLSPDQ 660

Query: 661 ITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVS 720
           ITYNTII+CFCKA+D  KAFQ++NEML+HNL PT VTYN+LI+GLCVYGDL+DADR+LVS
Sbjct: 661 ITYNTIIKCFCKAQDFEKAFQLHNEMLMHNLAPTPVTYNLLINGLCVYGDLEDADRLLVS 720

Query: 721 MEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGL 780
           + D NI+LTKVAY T+IKAHCAKG V +A+  F+QM+ K F ISIRDYSA+INRLCKR  
Sbjct: 721 LNDSNINLTKVAYTTLIKAHCAKGDVYRAVALFHQMVEKGFEISIRDYSAVINRLCKRCW 780

Query: 781 ITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSG 830
           ITEAKY F MMLS+G++PD E+CE MLN F Q GD  SA E LA M+K G
Sbjct: 781 ITEAKYFFCMMLSDGISPDQELCEVMLNVFXQGGDFDSAAELLAEMIKFG 830

BLAST of CmoCh04G016450 vs. NCBI nr
Match: gi|694310974|ref|XP_009355583.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Pyrus x bretschneideri])

HSP 1 Score: 1112.8 bits (2877), Expect = 0.0e+00
Identity = 534/830 (64.34%), Positives = 671/830 (80.84%), Query Frame = 1

Query: 1   MLSRIHQWKPLHSLRKCGILASFSSVILARPSVSAARLEAESVTPSFVLGQNDPVCEILT 60
           ML  IH+WKPLH L+K  ILA  SS+I  +PS SAA+ + E    + +    + V E++T
Sbjct: 1   MLHHIHKWKPLHFLQKSQILAPRSSIIFTKPSASAAKFDDEPAAAAAIPNPRNTVSEVIT 60

Query: 61  GLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSG 120
           GL  FG R ++G   FRT+VS L++  VD ++ESL++++ D+A  FF  LRN+ GFRHS 
Sbjct: 61  GLGIFGLRKFLGNRYFRTMVSKLNQPEVDLIIESLSLESSDLAFGFFKFLRNECGFRHSR 120

Query: 121 FSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLA 180
            S+  V+H+LA   +F+EL  V+KQ+V+E+G GSA S C+L+L+ FR+WDS+ VVWDMLA
Sbjct: 121 ISEFIVAHVLATNRQFQELRSVVKQIVDEEGPGSAPSLCELILHGFRDWDSSNVVWDMLA 180

Query: 181 FAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSE 240
           FAYSR EM+HDAL V+AKMKDLNL+ S  TYN LLHNLRHTD+MW++ NEIK SG P+S+
Sbjct: 181 FAYSRSEMVHDALSVLAKMKDLNLKVSTSTYNCLLHNLRHTDIMWNVYNEIKDSGTPESD 240

Query: 241 YTTSILIHGLCAQSKLQDAISFLQDSNEVV-GPSIVSINTVMSKFCKVGLVDVARSFFCL 300
           YTTSILI GLC QS LQDA+SFL D+   V GPS+VS NT+MS+FCK+G VDVA+SFFC+
Sbjct: 241 YTTSILIDGLCQQSGLQDAVSFLMDAERTVNGPSVVSFNTIMSRFCKLGFVDVAKSFFCM 300

Query: 301 MVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGF 360
           M K GL+PDSYSYNILIHGLCVAGS++EALEFT DME+HGV+PD VTYN L KGF LLG 
Sbjct: 301 MFKYGLVPDSYSYNILIHGLCVAGSLEEALEFTKDMERHGVQPDTVTYNILCKGFHLLGL 360

Query: 361 MSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSV 420
           MSGA KV+QKML++GLNPD VTYTI+ICGHC +GNI+EALKLR+E +SRGFQL++I YSV
Sbjct: 361 MSGARKVIQKMLVRGLNPDHVTYTIMICGHCHVGNIDEALKLRKEMISRGFQLSVIVYSV 420

Query: 421 LLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKR 480
           LLS +CK GR+EEAL LL EME + L+PDLI YSILIHGLCK+G VQRA ++Y +M +KR
Sbjct: 421 LLSSMCKSGRVEEALRLLYEMEAVGLEPDLITYSILIHGLCKQGDVQRASEIYREMYMKR 480

Query: 481 NFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAM 540
             PNYFA RA+LLG  E G++ EAR+YFD LT   + EDI+LYNIM+DGYV+LG+++EA+
Sbjct: 481 IIPNYFAHRAILLGLREKGDLYEARKYFDHLTTRTVTEDIVLYNIMMDGYVKLGNVAEAI 540

Query: 541 QLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAY 600
           QLY ++ E+G+ P+ VTFNTL+HGFC+ G LVEAR++ + I L+GLLPS VTYTTLMNA 
Sbjct: 541 QLYKQIIEKGLNPSTVTFNTLIHGFCKTGKLVEARRILDTIELHGLLPSPVTYTTLMNAN 600

Query: 601 CEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQ 660
           CE GN+  M +LL EMEA  V PTH++YTVLIKGLCRQ K+ +A+ L+  MYAKGL PDQ
Sbjct: 601 CEQGNINGMLELLREMEAKDVEPTHVSYTVLIKGLCRQGKLWDAVHLVGEMYAKGLSPDQ 660

Query: 661 ITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVS 720
           ITYNT+I+CFCKA+D  KAFQ++NEML+HNL+PT VTYN+LI+GLCVYGDL+DADR+LVS
Sbjct: 661 ITYNTVIKCFCKAQDFEKAFQLHNEMLMHNLEPTPVTYNLLINGLCVYGDLEDADRLLVS 720

Query: 721 MEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGL 780
           + D NI+LTKVAY T+IKAHCAKG V +A+  F+QM+ K F ISIRDYSA+INRLCKR  
Sbjct: 721 LNDSNINLTKVAYSTLIKAHCAKGDVYRAVELFHQMVDKGFEISIRDYSAVINRLCKRCW 780

Query: 781 ITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSG 830
           +TEAKY F MMLS+G++PD E+CE MLNAF+Q G+ +SA E LA M+K G
Sbjct: 781 MTEAKYFFCMMLSDGISPDQELCEVMLNAFYQGGEFNSAAELLAEMIKFG 830

BLAST of CmoCh04G016450 vs. NCBI nr
Match: gi|359473479|ref|XP_002267299.2| (PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing protein At1g13630 [Vitis vinifera])

HSP 1 Score: 1078.5 bits (2788), Expect = 0.0e+00
Identity = 520/832 (62.50%), Positives = 664/832 (79.81%), Query Frame = 1

Query: 1   MLSRIHQWKPLHSLRKCGILASFSSVILARPSVSAARLEAESVTPSFVLGQNDPVCEILT 60
           ML+ I+ W+ L  LRK   L+  +S+   + SVSAA+L  ES   S     ND V +IL 
Sbjct: 1   MLNHIYPWRSL--LRKSLNLSPITSLGFTKHSVSAAKLHDESADASI---PNDAVRQILI 60

Query: 61  GLNSFGFRAYVGGCNFRTVVSTLSETVVDGVLESLNIQNPDVAVAFFYLLRNKYGFRHSG 120
           GL SFG   ++ G +F+T+ S L+   VD +L SL + N D A+  F LLRN+YGFRHS 
Sbjct: 61  GLRSFGASKFLWGHHFQTLASVLNTHQVDQILLSLRVDNSDSALFLFDLLRNEYGFRHSR 120

Query: 121 FSQLAVSHILAGKGRFKELHCVIKQLVEEQGSGSASSFCDLLLNKFRNWDSNGVVWDMLA 180
            S   VSH++A KG+ KEL  V+ Q+VEE+GSGSA S C+LL N FR+WD N VVWDMLA
Sbjct: 121 VSWFIVSHVVARKGQSKELRRVLNQMVEEEGSGSAPSLCELLCNSFRDWDLNNVVWDMLA 180

Query: 181 FAYSRHEMIHDALFVVAKMKDLNLQASVPTYNSLLHNLRHTDVMWDICNEIKASGAPQSE 240
            AYSR EM+HDALFV+AKMK LNLQ S+ TYNSLL+NLRHTD+MWD+ NEIKASG PQ+E
Sbjct: 181 CAYSRAEMVHDALFVLAKMKVLNLQVSIATYNSLLYNLRHTDIMWDVYNEIKASGVPQNE 240

Query: 241 YTTSILIHGLCAQSKLQDAISFLQDSN-EVVGPSIVSINTVMSKFCKVGLVDVARSFFCL 300
           YT  ILI GLC QS+LQDA++FL+++  E  GPS+VS N +MS FCK+G VDVA+SFFC+
Sbjct: 241 YTNPILIDGLCRQSRLQDAVTFLRETGGEEFGPSVVSFNALMSGFCKMGSVDVAKSFFCM 300

Query: 301 MVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDVVTYNTLAKGFLLLGF 360
           M+K GLLPD YSYNIL+HGLCVAGSM+EALEFT+DME HGVEPD+VTYN LA GF +LG 
Sbjct: 301 MIKYGLLPDVYSYNILLHGLCVAGSMEEALEFTNDMENHGVEPDIVTYNILANGFRILGL 360

Query: 361 MSGAWKVVQKMLLKGLNPDIVTYTILICGHCQMGNIEEALKLRQETLSRGFQLNIISYSV 420
           +SGAWKVVQ+MLL GLNPD+VTYTILICGHCQMGNIEE+ KL+++ LS+G +L+I++Y+V
Sbjct: 361 ISGAWKVVQRMLLNGLNPDLVTYTILICGHCQMGNIEESFKLKEKMLSQGLKLSIVTYTV 420

Query: 421 LLSCLCKVGRIEEALALLNEMETLRLKPDLIVYSILIHGLCKEGFVQRAYQLYEQMRLKR 480
           LLS LCK GRI+EA+ LL+EME + LKPDL+ YS+LIHGLCK G V+ A +LYE+M  KR
Sbjct: 421 LLSSLCKSGRIDEAVILLHEMEVIGLKPDLLTYSVLIHGLCKRGAVEEAIELYEEMCSKR 480

Query: 481 NFPNYFAQRAVLLGFFENGNISEARRYFDALTHMDLIEDIILYNIMIDGYVRLGDISEAM 540
            +PN F   A++ G FE G ISEA+ YFD++T  D+ E+IILYNIMIDGY +LG+I EA+
Sbjct: 481 IYPNSFVCSAIISGLFEKGAISEAQMYFDSVTKSDVAEEIILYNIMIDGYAKLGNIGEAV 540

Query: 541 QLYYRMFERGITPTVVTFNTLVHGFCRNGDLVEARKMFEIIRLNGLLPSVVTYTTLMNAY 600
           + Y ++ E+GI+PT+VTFN+L++GFC+ G L EA K+ + I+++GL+P+ VTYTTLMN Y
Sbjct: 541 RSYKQIIEKGISPTIVTFNSLIYGFCKKGKLAEAVKLLDTIKVHGLVPTSVTYTTLMNGY 600

Query: 601 CEAGNMQEMFDLLHEMEANAVVPTHITYTVLIKGLCRQNKMHEALQLLEYMYAKGLMPDQ 660
           CE G+M  MFD+LHEMEA A+ PT ITYTV++KGLC++ ++HE++QLL+YMYA+GL PDQ
Sbjct: 601 CEEGDMHSMFDMLHEMEAKAIKPTQITYTVVVKGLCKEGRLHESVQLLKYMYARGLFPDQ 660

Query: 661 ITYNTIIQCFCKARDIAKAFQVYNEMLLHNLDPTHVTYNVLISGLCVYGDLKDADRMLVS 720
           ITYNT+IQ FCKA D+ KAFQ++N+ML H+L P+ VTYNVLI+GLCVYG+LKDADR+LV+
Sbjct: 661 ITYNTVIQSFCKAHDLQKAFQLHNQMLQHSLQPSPVTYNVLINGLCVYGNLKDADRLLVT 720

Query: 721 MEDQNISLTKVAYMTIIKAHCAKGQVSKALGFFNQMLAKSFVISIRDYSAIINRLCKRGL 780
           ++DQ+I LTKVAY TIIKAHCAKG V  AL FF+QM+ + F +SIRDYSA+INRLCKR L
Sbjct: 721 LQDQSIRLTKVAYTTIIKAHCAKGDVQNALVFFHQMVERGFEVSIRDYSAVINRLCKRNL 780

Query: 781 ITEAKYLFAMMLSEGVTPDSEICETMLNAFHQHGDSSSAFEFLAVMVKSGVI 832
           IT+AK+ F MML+ G+ PD +IC  MLNAFH+ GD +S FE  A+M+K G++
Sbjct: 781 ITDAKFFFCMMLTHGIPPDQDICLVMLNAFHRSGDPNSVFEIFAMMIKCGLL 827
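
The identity and positives percentages in the HSP headers above are simply the match counts divided by the alignment length. A minimal Python sketch that reproduces them from the reported counts follows; the helper name `percent` and the dictionary layout are illustrative assumptions, not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the two HSP headers above (hypothetical layout).
hsps = {
    "XP_009355583.1": {"identical": 534, "positive": 671, "length": 830},
    "XP_002267299.2": {"identical": 520, "positive": 664, "length": 832},
}

for accession, h in hsps.items():
    identity = percent(h["identical"], h["length"])
    positives = percent(h["positive"], h["length"])
    print(f"{accession}: identity {identity:.2f}%, positives {positives:.2f}%")
```

Note that the alignment length (830 or 832) counts alignment columns, including gap positions, so it can differ from the length of either sequence.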

The following BLAST results are available for this feature:
Match Name     E-value    Identity  Description
PPR41_ARATH    2.7e-226   48.60     Putative pentatricopeptide repeat-containing protein At1g13630 OS=Arabidopsis th...
RF1_ORYSI      1.3e-82    30.71     Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1
PP437_ARATH    2.3e-79    28.75     Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
PP407_ARATH    5.6e-78    29.72     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PP360_ARATH    9.5e-78    27.71     Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana GN...
Match Name         E-value    Identity  Description
A0A0A0L9A2_CUCSA   0.0e+00    82.28     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G642640 PE=4 SV=1
D7TA84_VITVI       4.6e-305   61.66     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0010g00630 PE=4 SV=...
A0A0D2RLR9_GOSRA   1.1e-293   59.66     Uncharacterized protein OS=Gossypium raimondii GN=B456_005G147900 PE=4 SV=1
B9RLG0_RICCO       4.3e-287   59.12     Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
A0A068TYQ6_COFCA   6.9e-261   55.64     Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00032443001 PE=4 SV=1
Match Name     E-value    Identity  Description
AT1G13630.1    1.1e-209   49.86     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G59900.1    1.3e-80    28.75     Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1    3.1e-79    29.72     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G01110.1    5.3e-79    27.71     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G62670.1    8.5e-77    28.93     rna processing factor 2
Match Name                         E-value   Identity  Description
gi|659130189|ref|XP_008465042.1|   0.0e+00   85.03     PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucum...
gi|449453449|ref|XP_004144470.1|   0.0e+00   82.28     PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Cucum...
gi|658009094|ref|XP_008339746.1|   0.0e+00   64.94     PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 isofor...
gi|694310974|ref|XP_009355583.1|   0.0e+00   64.34     PREDICTED: putative pentatricopeptide repeat-containing protein At1g13630 [Pyrus...
gi|359473479|ref|XP_002267299.2|   0.0e+00   62.50     PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide repeat-containing pro...
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh04G016450.1   CmoCh04G016450.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term    IPR Description                         Source     Source Term        Source Description   Alignment
IPR002885   Pentatricopeptide repeat                PFAM       PF01535            PPR
    coord: 804..830   score: 0.31
    coord: 730..758   score: 2.2E-4
    coord: 767..795   score: 9.
IPR002885   Pentatricopeptide repeat                PFAM       PF12854            PPR_1
    coord: 373..402   score: 1.8
IPR002885   Pentatricopeptide repeat                PFAM       PF13041            PPR_2
    coord: 657..705   score: 3.0E-17
    coord: 413..461   score: 7.2E-14
    coord: 272..320   score: 2.7E-14
    coord: 518..566   score: 2.9E-15
    coord: 587..636   score: 1.6
IPR002885   Pentatricopeptide repeat                TIGRFAMs   TIGR00756          TIGR00756
    coord: 767..799   score: 4.2E-7
    coord: 695..727   score: 1.3E-6
    coord: 660..693   score: 2.6E-9
    coord: 555..588   score: 2.3E-7
    coord: 380..413   score: 2.7E-7
    coord: 450..482   score: 1.3E-6
    coord: 625..658   score: 6.1E-8
    coord: 521..553   score: 8.7E-10
    coord: 730..763   score: 1.6E-5
    coord: 275..309   score: 6.9E-7
    coord: 590..623   score: 3.3E-9
    coord: 310..344   score: 1.1E-9
    coord: 345..379   score: 8.4E-5
    coord: 415..448   score: 8.
IPR002885   Pentatricopeptide repeat                PROFILE    PS51375            PPR
    coord: 172..206   score: 7.574
    coord: 658..692   score: 12.419
    coord: 308..342   score: 13.592
    coord: 413..447   score: 12.463
    coord: 343..377   score: 10.665
    coord: 763..797   score: 10.622
    coord: 553..587   score: 12.167
    coord: 623..657   score: 12.079
    coord: 518..552   score: 13.548
    coord: 588..622   score: 12.474
    coord: 728..762   score: 9.317
    coord: 378..412   score: 11.4
    coord: 798..832   score: 8.572
    coord: 239..269   score: 6.127
    coord: 448..482   score: 11.29
    coord: 483..517   score: 5.908
    coord: 273..307   score: 10.326
    coord: 693..727   score: 10
IPR011990   Tetratricopeptide-like helical domain   GENE3D     G3DSA:1.25.40.10
    coord: 494..821   score: 7.7E-10
    coord: 320..443   score: 1.
IPR011990   Tetratricopeptide-like helical domain   unknown    SSF48452           TPR-like
    coord: 343..512   score: 3.5
None        No IPR available                        PANTHER    PTHR24015          FAMILY NOT NAMED
    coord: 215..769   score: 1.9E
None        No IPR available                        PANTHER    PTHR24015:SF309    SUBFAMILY NOT NAMED
    coord: 215..769   score: 1.9E
None        No IPR available                        unknown    SSF81901           HCP-like
    coord: 494..688   score: 7.06
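
To sanity-check the hit coordinates listed above, the span of each match can be computed as end - start + 1. The Python sketch below assumes the `coord:` values are 1-based, inclusive residue positions on the protein (the page does not state this explicitly); the function name `span_lengths` is hypothetical.

```python
import re

# Matches "coord: 172..206" wherever it occurs, even when fused to a preceding score.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text: str) -> list[tuple[int, int, int]]:
    """Return (start, end, length) for every coord pair, treating both ends as inclusive."""
    return [(int(a), int(b), int(b) - int(a) + 1) for a, b in COORD.findall(text)]


# A few PS51375 hits copied from the table above.
sample = "coord: 172..206 score: 7.574 coord: 658..692 score: 12.419 coord: 239..269 score: 6.127"

for start, end, length in span_lengths(sample):
    print(f"{start}..{end}: {length} residues")
```

Under that assumption most PS51375 hits span 35 residues, which matches the 35-residue pentatricopeptide repeat motif; the 239..269 hit is shorter at 31 residues.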

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                    Block
CmoCh04G016450   CmoCh04G002870   Cucurbita moschata (Rifu)   cmocmoB465