CmaCh14G020230 (gene) Cucurbita maxima (Rimu)

Name: CmaCh14G020230
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cma_Chr14 : 14138181 .. 14139986 (-)
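
As a quick consistency check on the Location field, the sketch below computes the feature length from its start and end positions. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; the function name feature_length is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field (Cma_Chr14, minus strand).
length = feature_length(14138181, 14139986)

# If the whole span were coding, it should divide evenly into codons.
codons, remainder = divmod(length, 3)
print(length, codons, remainder)
```

Under that assumption the span is 1806 bp, which divides into 602 codons with no remainder. That would fit a coding sequence without introns (601 residues plus a stop codon, if the whole span is coding).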
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAAATGTGCAGCGTCCTAACTCGAACCCCCTCTTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTATCAGATCTCCACAAGTGCACAGACCTCAACCAAGTGAAGCAACTCCACGCCCAAATCCTCAAATCCAATCTCCACCTCGACCTCTATGTTGTTCCCAAACTCATATCCGCTTTCTCTCTTTGCCGCCAAATGCCCCTCGCCACCAACGCTTTCAATCAAGTTCAATATCCAAATGGCCATTTGTACAACACTCTGATTCGAGCCCACGCCCAGAATTCACAACCTTCACAGGCCTTTTCCACTTTCTTCACCATGCAATTTGATGGATTATACCCTGATAATTTCACTTTCCCATTTCTTCTGAAAGCTTGTACTGGGAATGGGTGGTTGCCTGTTATTGAAATGGTACATGCCCAAATCGAGAAATTTGGTTTCATGTCGGATGTATTTGTGCCGAATTCTCTTATTGATTCATATTCCAAATGTGGTTCTGGTGGAATTTTGACAGCGAAGAAGTTGTTTGTGTCAATGGGAGATTGTAGGGATGTTGTGTCGTGGAATTTAATGATCTCTGGATTTGCTAAGGGTGGTTTGTACGAAGAAGCTCGGAAGGTGTTCGATAAAATGCCTATAAGGGATAGTATTAGTTGGAACACAATGTTGGATGGGTATGTTAAAGTTGGGAAAATGGATGATGCATTTAAATTGTTTGATGCAATGCCTGAGAGGAATGTTGTCTCTTGGTCGACAATGTTGTTGGGGTACTGCAAGGTAGGGGATATGGAGATGGCACAAATGTTGTTCAATAAAATGCCCACGAGGAATTTGGTTTCTTGGACCATAGTTATCTCTGGCTTTGCTGAGAAAGGGCTAGCCAAAGTGGCCATTGGCTTGTTTGATCAAATGGAAGAGGCTGGCGTGAAGTTAGACAATGGGGCAGTAATAAGTATATTGGCTGCTTGTGCTGAGTCTGGTTTGCTTGGGCTTGGTGAGAAAATACATGCTTCCATTAGGAACCACAATTTCAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAAAATGTGGTAGGCTAGATATTGCATACAATGTCTTTAATGACATACAAAACAAAGATGTTGTGTCTTGGAATGCAATGCTTCATGGACTGGCAATGCATGGGCACGGAGAGAAAGCGCTCGAGCTTTTCAAAAGAATGAAAGAAAAGGGCTTCTCCCCCGACAAAGTTACTATGATCGGAGTCTTGTGTGCTTGTTCGCATGCGGGATTGATCGACGATGGCATTCGATACTTCTCTTCAATGGAAAAGGACTACGCCCTAGTTCATGAAATCGAGCATTACGGTTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAAGAAGCCATCAGGCTCATTCGCACCATGCCAATGGAACCAAATGTCATCATCTGGGGCACCCTTTTAGGGGCGTGTCGTATGCATAATGCTGTCGAACTTGCAAGAGAGGTTCTCGATCATTTGGTTAAGTTGGAACCATCTGATCCGGGTAATTTATCCATGTTGTCGAACATATATGCTGCAGCAGGCGATTGGGACTGTGTTGCCGATGTGAGGTTGAGAATGCGGAGTATTGGAACTCAAAAACCATCGGGTGCTAGTTCCATCGAGGTCAATAATGAGGTTCATGAATTTACAGTGTTCGATCGATCACATCCGAAATCTGATAAAATATATCAGATGATTAACGGATTGCGCCGTGAACTTAAACAAGTTGCATGCTTTCCAAACACGTGTTAA

mRNA sequence

ATGCAAATGTGCAGCGTCCTAACTCGAACCCCCTCTTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTATCAGATCTCCACAAGTGCACAGACCTCAACCAAGTGAAGCAACTCCACGCCCAAATCCTCAAATCCAATCTCCACCTCGACCTCTATGTTGTTCCCAAACTCATATCCGCTTTCTCTCTTTGCCGCCAAATGCCCCTCGCCACCAACGCTTTCAATCAAGTTCAATATCCAAATGGCCATTTGTACAACACTCTGATTCGAGCCCACGCCCAGAATTCACAACCTTCACAGGCCTTTTCCACTTTCTTCACCATGCAATTTGATGGATTATACCCTGATAATTTCACTTTCCCATTTCTTCTGAAAGCTTGTACTGGGAATGGGTGGTTGCCTGTTATTGAAATGGTACATGCCCAAATCGAGAAATTTGGTTTCATGTCGGATGTATTTGTGCCGAATTCTCTTATTGATTCATATTCCAAATGTGGTTCTGGTGGAATTTTGACAGCGAAGAAGTTGTTTGTGTCAATGGGAGATTGTAGGGATGTTGTGTCGTGGAATTTAATGATCTCTGGATTTGCTAAGGGTGGTTTGTACGAAGAAGCTCGGAAGGTGTTCGATAAAATGCCTATAAGGGATAGTATTAGTTGGAACACAATGTTGGATGGGTATGTTAAAGTTGGGAAAATGGATGATGCATTTAAATTGTTTGATGCAATGCCTGAGAGGAATGTTGTCTCTTGGTCGACAATGTTGTTGGGGTACTGCAAGGTAGGGGATATGGAGATGGCACAAATGTTGTTCAATAAAATGCCCACGAGGAATTTGGTTTCTTGGACCATAGTTATCTCTGGCTTTGCTGAGAAAGGGCTAGCCAAAGTGGCCATTGGCTTGTTTGATCAAATGGAAGAGGCTGGCGTGAAGTTAGACAATGGGGCAGTAATAAGTATATTGGCTGCTTGTGCTGAGTCTGGTTTGCTTGGGCTTGGTGAGAAAATACATGCTTCCATTAGGAACCACAATTTCAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAAAATGTGGTAGGCTAGATATTGCATACAATGTCTTTAATGACATACAAAACAAAGATGTTGTGTCTTGGAATGCAATGCTTCATGGACTGGCAATGCATGGGCACGGAGAGAAAGCGCTCGAGCTTTTCAAAAGAATGAAAGAAAAGGGCTTCTCCCCCGACAAAGTTACTATGATCGGAGTCTTGTGTGCTTGTTCGCATGCGGGATTGATCGACGATGGCATTCGATACTTCTCTTCAATGGAAAAGGACTACGCCCTAGTTCATGAAATCGAGCATTACGGTTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAAGAAGCCATCAGGCTCATTCGCACCATGCCAATGGAACCAAATGTCATCATCTGGGGCACCCTTTTAGGGGCGTGTCGTATGCATAATGCTGTCGAACTTGCAAGAGAGGTTCTCGATCATTTGGTTAAGTTGGAACCATCTGATCCGGGTAATTTATCCATGTTGTCGAACATATATGCTGCAGCAGGCGATTGGGACTGTGTTGCCGATGTGAGGTTGAGAATGCGGAGTATTGGAACTCAAAAACCATCGGGTGCTAGTTCCATCGAGGTCAATAATGAGGTTCATGAATTTACAGTGTTCGATCGATCACATCCGAAATCTGATAAAATATATCAGATGATTAACGGATTGCGCCGTGAACTTAAACAAGTTGCATGCTTTCCAAACACGTGTTAA

Coding sequence (CDS)

ATGCAAATGTGCAGCGTCCTAACTCGAACCCCCTCTTGGTTTTCCACTCGAAAGCTCTTCGAGCAGAAGCTATCAGATCTCCACAAGTGCACAGACCTCAACCAAGTGAAGCAACTCCACGCCCAAATCCTCAAATCCAATCTCCACCTCGACCTCTATGTTGTTCCCAAACTCATATCCGCTTTCTCTCTTTGCCGCCAAATGCCCCTCGCCACCAACGCTTTCAATCAAGTTCAATATCCAAATGGCCATTTGTACAACACTCTGATTCGAGCCCACGCCCAGAATTCACAACCTTCACAGGCCTTTTCCACTTTCTTCACCATGCAATTTGATGGATTATACCCTGATAATTTCACTTTCCCATTTCTTCTGAAAGCTTGTACTGGGAATGGGTGGTTGCCTGTTATTGAAATGGTACATGCCCAAATCGAGAAATTTGGTTTCATGTCGGATGTATTTGTGCCGAATTCTCTTATTGATTCATATTCCAAATGTGGTTCTGGTGGAATTTTGACAGCGAAGAAGTTGTTTGTGTCAATGGGAGATTGTAGGGATGTTGTGTCGTGGAATTTAATGATCTCTGGATTTGCTAAGGGTGGTTTGTACGAAGAAGCTCGGAAGGTGTTCGATAAAATGCCTATAAGGGATAGTATTAGTTGGAACACAATGTTGGATGGGTATGTTAAAGTTGGGAAAATGGATGATGCATTTAAATTGTTTGATGCAATGCCTGAGAGGAATGTTGTCTCTTGGTCGACAATGTTGTTGGGGTACTGCAAGGTAGGGGATATGGAGATGGCACAAATGTTGTTCAATAAAATGCCCACGAGGAATTTGGTTTCTTGGACCATAGTTATCTCTGGCTTTGCTGAGAAAGGGCTAGCCAAAGTGGCCATTGGCTTGTTTGATCAAATGGAAGAGGCTGGCGTGAAGTTAGACAATGGGGCAGTAATAAGTATATTGGCTGCTTGTGCTGAGTCTGGTTTGCTTGGGCTTGGTGAGAAAATACATGCTTCCATTAGGAACCACAATTTCAAATGTACTACTGAAATCTCCAATGCTTTGGTTGATATGTATGCAAAATGTGGTAGGCTAGATATTGCATACAATGTCTTTAATGACATACAAAACAAAGATGTTGTGTCTTGGAATGCAATGCTTCATGGACTGGCAATGCATGGGCACGGAGAGAAAGCGCTCGAGCTTTTCAAAAGAATGAAAGAAAAGGGCTTCTCCCCCGACAAAGTTACTATGATCGGAGTCTTGTGTGCTTGTTCGCATGCGGGATTGATCGACGATGGCATTCGATACTTCTCTTCAATGGAAAAGGACTACGCCCTAGTTCATGAAATCGAGCATTACGGTTGCATGGTAGACCTTTTGGGTCGCAAGGGAAGGCTTGAAGAAGCCATCAGGCTCATTCGCACCATGCCAATGGAACCAAATGTCATCATCTGGGGCACCCTTTTAGGGGCGTGTCGTATGCATAATGCTGTCGAACTTGCAAGAGAGGTTCTCGATCATTTGGTTAAGTTGGAACCATCTGATCCGGGTAATTTATCCATGTTGTCGAACATATATGCTGCAGCAGGCGATTGGGACTGTGTTGCCGATGTGAGGTTGAGAATGCGGAGTATTGGAACTCAAAAACCATCGGGTGCTAGTTCCATCGAGGTCAATAATGAGGTTCATGAATTTACAGTGTTCGATCGATCACATCCGAAATCTGATAAAATATATCAGATGATTAACGGATTGCGCCGTGAACTTAAACAAGTTGCATGCTTTCCAAACACGTGTTAA

Protein sequence

MQMCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFDAMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFPNTC
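
The relationship between the coding sequence and the protein sequence listed here can be checked with a small translation routine. The following is a minimal sketch, assuming the standard genetic code and that the CDS is given in coding orientation starting at the first codon position; the names CODON_TABLE and translate are illustrative and not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed on this page.
print(translate("ATGCAAATGTGCAGCGTCCTAACTCGAACCCCC"))  # MQMCSVLTRTP
```

The printed peptide matches the first eleven residues of the protein sequence. Passing the full CDS string to translate should give the complete protein under the same assumptions.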
BLAST of CmaCh14G020230 vs. Swiss-Prot
Match: PP261_ARATH (Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana GN=PCMP-E27 PE=2 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 6.1e-228
Identity = 371/585 (63.42%), Positives = 464/585 (79.32%), Query Frame = 1
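
The percentages in a summary line like this one are the match counts divided by the alignment length. The sketch below parses such a line and recomputes both values; the regular expression and the function name parse_summary are assumptions for illustration, written against the line layout shown on this page rather than any official BLAST output specification.

```python
import re

# Matches "Identity = a/n (x%), Positives = b/n (y%)".
# The optional "i" also accepts a misspelled "Postives" label.
SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/\d+ \([\d.]+%\)"
)


def parse_summary(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError("not an identity summary line")
    identical, length, positives = (int(g) for g in match.groups())
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


stats = parse_summary(
    "Identity = 371/585 (63.42%), Positives = 464/585 (79.32%), Query Frame = 1"
)
print(stats)
```

For this hit the recomputed values are 63.42 and 79.32, in agreement with the percentages reported in the line itself.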

Query: 5   SVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSL 64
           S+  R PSW S+R++FE++L DL KC +LNQVKQLHAQI++ NLH DL++ PKLISA SL
Sbjct: 4   SLPVRAPSWVSSRRIFEERLQDLPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSL 63

Query: 65  CRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFL 124
           CRQ  LA   FNQVQ PN HL N+LIRAHAQNSQP QAF  F  MQ  GL+ DNFT+PFL
Sbjct: 64  CRQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFL 123

Query: 125 LKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDC 184
           LKAC+G  WLPV++M+H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M + 
Sbjct: 124 LKACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSE- 183

Query: 185 RDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFDAM 244
           RD VSWN M+ G  K G   +AR++FD+MP RD ISWNTMLDGY +  +M  AF+LF+ M
Sbjct: 184 RDTVSWNSMLGGLVKAGELRDARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKM 243

Query: 245 PERNVVSWSTMLLGYCKVGDMEMAQMLFNKMP--TRNLVSWTIVISGFAEKGLAKVAIGL 304
           PERN VSWSTM++GY K GDMEMA+++F+KMP   +N+V+WTI+I+G+AEKGL K A  L
Sbjct: 244 PERNTVSWSTMVMGYSKAGDMEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRL 303

Query: 305 FDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAK 364
            DQM  +G+K D  AVISILAAC ESGLL LG +IH+ ++  N      + NAL+DMYAK
Sbjct: 304 VDQMVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAK 363

Query: 365 CGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGV 424
           CG L  A++VFNDI  KD+VSWN MLHGL +HGHG++A+ELF RM+ +G  PDKVT I V
Sbjct: 364 CGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAV 423

Query: 425 LCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEP 484
           LC+C+HAGLID+GI YF SMEK Y LV ++EHYGC+VDLLGR GRL+EAI++++TMPMEP
Sbjct: 424 LCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEP 483

Query: 485 NVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRL 544
           NV+IWG LLGACRMHN V++A+EVLD+LVKL+P DPGN S+LSNIYAAA DW+ VAD+R 
Sbjct: 484 NVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRS 543

Query: 545 RMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGL 588
           +M+S+G +KPSGASS+E+ + +HEFTVFD+SHPKSD+IYQM+  L
Sbjct: 544 KMKSMGVEKPSGASSVELEDGIHEFTVFDKSHPKSDQIYQMLGSL 587

BLAST of CmaCh14G020230 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 3.0e-118
Identity = 225/625 (36.00%), Positives = 353/625 (56.48%), Query Frame = 1

Query: 11  PSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKL--ISAFSLCRQM 70
           P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL  ++A S    +
Sbjct: 21  PNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASL 80

Query: 71  PLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDG-LYPDNFTFPFLLKA 130
             A   F+++  PN   +NTLIRA+A    P  +   F  M  +   YP+ +TFPFL+KA
Sbjct: 81  EYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKA 140

Query: 131 CTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDV 190
                 L + + +H    K    SDVFV NSLI  Y  CG   + +A K+F ++ + +DV
Sbjct: 141 AAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGD--LDSACKVFTTIKE-KDV 200

Query: 191 VSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDG-------------------- 250
           VSWN MI+GF + G  ++A ++F KM   D  + +  + G                    
Sbjct: 201 VSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSY 260

Query: 251 -------------------YVKVGKMDDAFKLFDAMPERNVVSWSTMLLGYCKVGDMEMA 310
                              Y K G ++DA +LFDAM E++ V+W+TML GY    D E A
Sbjct: 261 IEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAA 320

Query: 311 QMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQME-EAGVKLDNGAVISILAACAE 370
           + + N MP +++V+W  +IS + + G    A+ +F +++ +  +KL+   ++S L+ACA+
Sbjct: 321 REVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQ 380

Query: 371 SGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCGRLDIAYNVFNDIQNKDVVSWNAM 430
            G L LG  IH+ I+ H  +    +++AL+ MY+KCG L+ +  VFN ++ +DV  W+AM
Sbjct: 381 VGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAM 440

Query: 431 LHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSHAGLIDDGIRYFSSMEKDYA 490
           + GLAMHG G +A+++F +M+E    P+ VT   V CACSH GL+D+    F  ME +Y 
Sbjct: 441 IGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYG 500

Query: 491 LVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWGTLLGACRMHNAVELAREVL 550
           +V E +HY C+VD+LGR G LE+A++ I  MP+ P+  +WG LLGAC++H  + LA    
Sbjct: 501 IVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMAC 560

Query: 551 DHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVNNEVHEF 593
             L++LEP + G   +LSNIYA  G W+ V+++R  MR  G +K  G SSIE++  +HEF
Sbjct: 561 TRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEF 620

BLAST of CmaCh14G020230 vs. Swiss-Prot
Match: PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 411.0 bits (1055), Expect = 2.2e-113
Identity = 212/532 (39.85%), Positives = 321/532 (60.34%), Query Frame = 1

Query: 71  ATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFLLKACTG 130
           A + F+++   N   +N L+ A+ QNS+  +A   F + +   L   N         C  
Sbjct: 176 ARSVFDRMPEKNDVSWNALLSAYVQNSKMEEACMLFKSRENWALVSWN---------CLL 235

Query: 131 NGWLPVIEMVHAQIEKFGFMS--DVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDVV 190
            G++   ++V A+ + F  M+  DV   N++I  Y++  SG I  A++LF      +DV 
Sbjct: 236 GGFVKKKKIVEAR-QFFDSMNVRDVVSWNTIITGYAQ--SGKIDEARQLF-DESPVQDVF 295

Query: 191 SWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFDAMPERN 250
           +W  M+SG+ +  + EEAR++FDKMP R+ +SWN ML GYV+  +M+ A +LFD MP RN
Sbjct: 296 TWTAMVSGYIQNRMVEEARELFDKMPERNEVSWNAMLAGYVQGERMEMAKELFDVMPCRN 355

Query: 251 VVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQMEE 310
           V +W+TM+ GY + G +  A+ LF+KMP R+ VSW  +I+G+++ G +  A+ LF QME 
Sbjct: 356 VSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQMER 415

Query: 311 AGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCGRLDI 370
            G +L+  +  S L+ CA+   L LG+++H  +    ++    + NAL+ MY KCG ++ 
Sbjct: 416 EGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEE 475

Query: 371 AYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSH 430
           A ++F ++  KD+VSWN M+ G + HG GE AL  F+ MK +G  PD  TM+ VL ACSH
Sbjct: 476 ANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSH 535

Query: 431 AGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWG 490
            GL+D G +YF +M +DY ++   +HY CMVDLLGR G LE+A  L++ MP EP+  IWG
Sbjct: 536 TGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWG 595

Query: 491 TLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIG 550
           TLLGA R+H   ELA    D +  +EP + G   +LSN+YA++G W  V  +R+RMR  G
Sbjct: 596 TLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKG 655

Query: 551 TQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFPNT 601
            +K  G S IE+ N+ H F+V D  HP+ D+I+  +  L   +K+      T
Sbjct: 656 VKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKT 694

BLAST of CmaCh14G020230 vs. Swiss-Prot
Match: PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 411.0 bits (1055), Expect = 2.2e-113
Identity = 227/579 (39.21%), Positives = 324/579 (55.96%), Query Frame = 1

Query: 30  CTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSLCRQMPLATNAFNQVQYPNGHLYNTL 89
           CT +N +KQ+H  ++  +LH D ++V  L+      RQ   +   F+  Q+PN  LYN+L
Sbjct: 24  CT-VNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTKYSYLLFSHTQFPNIFLYNSL 83

Query: 90  IRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFLLKACTGNGWLPVIEMVHAQIEKFGF 149
           I     N    +    F +++  GLY   FTFP +LKACT      +   +H+ + K GF
Sbjct: 84  INGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKACTRASSRKLGIDLHSLVVKCGF 143

Query: 150 MSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDVVSWNLMISGFAKGGLYEEARKV 209
             DV    SL+  YS  GSG +  A KLF  + D R VV+W  + SG+   G + EA  +
Sbjct: 144 NHDVAAMTSLLSIYS--GSGRLNDAHKLFDEIPD-RSVVTWTALFSGYTTSGRHREAIDL 203

Query: 210 FDKMPIR----DSISWNTMLDGYVKVGKMDDAFKLFDAMPE----RNVVSWSTMLLGYCK 269
           F KM       DS     +L   V VG +D    +   M E    +N    +T++  Y K
Sbjct: 204 FKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRTTLVNLYAK 263

Query: 270 VGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQMEEAGVKLDNGAVISI 329
            G ME A+ +F+ M  +++V+W+ +I G+A     K  I LF QM +  +K D  +++  
Sbjct: 264 CGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFSIVGF 323

Query: 330 LAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCGRLDIAYNVFNDIQNKDV 389
           L++CA  G L LGE   + I  H F     ++NAL+DMYAKCG +   + VF +++ KD+
Sbjct: 324 LSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDMYAKCGAMARGFEVFKEMKEKDI 383

Query: 390 VSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSHAGLIDDGIRYFSS 449
           V  NA + GLA +GH + +  +F + ++ G SPD  T +G+LC C HAGLI DG+R+F++
Sbjct: 384 VIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRFFNA 443

Query: 450 MEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWGTLLGACRMHNAVE 509
           +   YAL   +EHYGCMVDL GR G L++A RLI  MPM PN I+WG LL  CR+    +
Sbjct: 444 ISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRPNAIVWGALLSGCRLVKDTQ 503

Query: 510 LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVN 569
           LA  VL  L+ LEP + GN   LSNIY+  G WD  A+VR  M   G +K  G S IE+ 
Sbjct: 504 LAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPGYSWIELE 563

Query: 570 NEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFPNT 601
            +VHEF   D+SHP SDKIY  +  L  E++ +   P T
Sbjct: 564 GKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPTT 598

BLAST of CmaCh14G020230 vs. Swiss-Prot
Match: PP151_ARATH (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 1.2e-106
Identity = 216/628 (34.39%), Positives = 346/628 (55.10%), Query Frame = 1

Query: 12  SWFSTRKLFEQKLSDLH-KCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSLCRQMPL 71
           S FS     + +L D + KC  L   +Q+  ++ + N+    Y    +++  +    +  
Sbjct: 49  SGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNI----YTWNSVVTGLTKLGFLDE 108

Query: 72  ATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFLLKACTG 131
           A + F  +   +   +N+++   AQ+ +  +A   F  M  +G   + ++F  +L AC+G
Sbjct: 109 ADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSG 168

Query: 132 NGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDVVSW 191
              +     VH+ I K  F+SDV++ ++L+D YSKCG+  +  A+++F  MGD R+VVSW
Sbjct: 169 LNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGN--VNDAQRVFDEMGD-RNVVSW 228

Query: 192 NLMISGFAKGGLYEEARKVF----------DKMPIRDSIS-------------------- 251
           N +I+ F + G   EA  VF          D++ +   IS                    
Sbjct: 229 NSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVK 288

Query: 252 ----------WNTMLDGYVKVGKMDDAFKLFDAMPERNVVSWSTMLLGYCKVGDMEMAQM 311
                      N  +D Y K  ++ +A  +FD+MP RNV++ ++M+ GY      + A++
Sbjct: 289 NDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARL 348

Query: 312 LFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQMEEAGVKLDNGAVISILAACAESGL 371
           +F KM  RN+VSW  +I+G+ + G  + A+ LF  ++   V   + +  +IL ACA+   
Sbjct: 349 MFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAE 408

Query: 372 LGLGEKIHASIRNHNFKCTTE------ISNALVDMYAKCGRLDIAYNVFNDIQNKDVVSW 431
           L LG + H  +  H FK  +       + N+L+DMY KCG ++  Y VF  +  +D VSW
Sbjct: 409 LHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSW 468

Query: 432 NAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSHAGLIDDGIRYFSSMEK 491
           NAM+ G A +G+G +ALELF+ M E G  PD +TMIGVL AC HAG +++G  YFSSM +
Sbjct: 469 NAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTR 528

Query: 492 DYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWGTLLGACRMHNAVELAR 551
           D+ +    +HY CMVDLLGR G LEEA  +I  MPM+P+ +IWG+LL AC++H  + L +
Sbjct: 529 DFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGK 588

Query: 552 EVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVNNEV 593
            V + L+++EPS+ G   +LSN+YA  G W+ V +VR  MR  G  K  G S I++    
Sbjct: 589 YVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHD 648

BLAST of CmaCh14G020230 vs. TrEMBL
Match: A0A0A0L7H7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G122560 PE=4 SV=1)

HSP 1 Score: 1050.0 bits (2714), Expect = 1.1e-303
Identity = 508/584 (86.99%), Positives = 544/584 (93.15%), Query Frame = 1

Query: 1   MQMCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLIS 60
           MQMCSV  RTPSWFSTRKL EQKLSDLHKCT+LNQVKQLHAQILKSNLH+DL+VVPKLIS
Sbjct: 1   MQMCSVPIRTPSWFSTRKLLEQKLSDLHKCTNLNQVKQLHAQILKSNLHVDLFVVPKLIS 60

Query: 61  AFSLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFT 120
           AFSLCRQM LATNAFNQVQYPN HLYNT+IRAH+ NSQPSQAF+TFF MQ DG Y DNFT
Sbjct: 61  AFSLCRQMLLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGHYADNFT 120

Query: 121 FPFLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVS 180
           FPFLLK CTGN WLPVIE VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GI  AKKLFVS
Sbjct: 121 FPFLLKVCTGNVWLPVIESVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVS 180

Query: 181 MGDCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKL 240
           MG  RDVVSWN MISG AKGGLYEEARKVFD+MP +D ISWNTMLDGYVKVGKMDDAFKL
Sbjct: 181 MGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPEKDGISWNTMLDGYVKVGKMDDAFKL 240

Query: 241 FDAMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAI 300
           FD MPERNVVSWSTM+LGYCK GDMEMA+MLF+KMP +NLVSWTI++SGFAEKGLA+ AI
Sbjct: 241 FDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAI 300

Query: 301 GLFDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMY 360
            LFDQME+A +KLDNG V+SILAACAESGLLGLGEKIHASI+N+NFKCTTEISNALVDMY
Sbjct: 301 SLFDQMEKACLKLDNGTVMSILAACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMY 360

Query: 361 AKCGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMI 420
           AKCGRL+IAY+VFNDI+NKDVVSWNAML GLAMHGHG KALELFKRMKE+GFSP+KVTMI
Sbjct: 361 AKCGRLNIAYDVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPNKVTMI 420

Query: 421 GVLCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPM 480
           GVLCAC+HAGLIDDGIRYFS+ME+DY LV E+EHYGCMVDLLGRKGRLEEAIRLIR MPM
Sbjct: 421 GVLCACTHAGLIDDGIRYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPM 480

Query: 481 EPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADV 540
            PN IIWGTLLGACRMHNAVELAREVLDHLV+LEP+D GN SMLSNIYAAAGDW+CVA+ 
Sbjct: 481 APNAIIWGTLLGACRMHNAVELAREVLDHLVELEPTDSGNFSMLSNIYAAAGDWNCVANT 540

Query: 541 RLRMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMI 585
           RLRMRSIGT+KPSGASSIEVNNEVHEFTVFDRSHPKSD IYQ++
Sbjct: 541 RLRMRSIGTKKPSGASSIEVNNEVHEFTVFDRSHPKSDNIYQVL 584

BLAST of CmaCh14G020230 vs. TrEMBL
Match: M5XAE6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017633mg PE=4 SV=1)

HSP 1 Score: 875.5 bits (2261), Expect = 3.6e-251
Identity = 413/591 (69.88%), Positives = 496/591 (83.93%), Query Frame = 1

Query: 3   MCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAF 62
           MC V  R+PSW S R+L EQKLSDLH+CT+L+ +KQ+HAQILK+NLH DL+  PKLI+AF
Sbjct: 1   MC-VPVRSPSWVSRRRLLEQKLSDLHRCTNLSHIKQVHAQILKANLHQDLHTAPKLIAAF 60

Query: 63  SLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFP 122
           SLCRQM LA N FNQVQ PN HLYNTLIRAH QNSQ +QAF+TFF MQ +G+YPDNFT+P
Sbjct: 61  SLCRQMALAVNVFNQVQDPNVHLYNTLIRAHIQNSQTTQAFATFFDMQLNGVYPDNFTYP 120

Query: 123 FLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMG 182
           FLLKAC+G  W PV++M+H  IEKFGF  D+FVPNSLID+YSKCG  G+  AKK+F+ MG
Sbjct: 121 FLLKACSGRPWFPVVQMIHTSIEKFGFCLDIFVPNSLIDTYSKCGLLGVSEAKKMFMLMG 180

Query: 183 DCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFD 242
           + RD+VSWN MI G AK G   EAR++FD+MP +D++SWNT+LDGY K G+M++AF+LF+
Sbjct: 181 E-RDIVSWNSMIGGLAKTGELGEARRLFDEMPDKDAVSWNTILDGYAKAGQMNEAFELFE 240

Query: 243 AMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGL 302
            MP+RNVVSWST++ GY K GDM MA+M+F+KMP RNLV WTI+ISG+AEKGLAK AI L
Sbjct: 241 RMPQRNVVSWSTLVSGYSKAGDMGMARMMFDKMPFRNLVPWTIIISGYAEKGLAKEAIML 300

Query: 303 FDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAK 362
           +DQMEEAG+K DNGA+ISILAACAESGL+GLG K+HASI    FKC+T +SNAL+DMYAK
Sbjct: 301 YDQMEEAGLKPDNGAIISILAACAESGLIGLGRKVHASIERTRFKCSTPVSNALLDMYAK 360

Query: 363 CGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGV 422
           CG LD A  VF+ I  KD+VSWNAML GLAMHGHG+KAL+LF RM + GF PDKVT IGV
Sbjct: 361 CGMLDEASRVFHGIAKKDLVSWNAMLQGLAMHGHGDKALQLFSRMVKAGFLPDKVTFIGV 420

Query: 423 LCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEP 482
           LCAC+HAG +++G++ F +ME++Y +V EIEHYGCM+DLLGR G L EA RL+ +MPMEP
Sbjct: 421 LCACTHAGFVEEGLQAFHTMEREYGIVPEIEHYGCMIDLLGRGGCLREAFRLVHSMPMEP 480

Query: 483 NVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRL 542
           NV+IWGTLLGACRMHN  ELA+EVLDHLVKL+PSD GN SMLSNIYAAAGDW  VA+VRL
Sbjct: 481 NVVIWGTLLGACRMHNDPELAQEVLDHLVKLDPSDAGNFSMLSNIYAAAGDWANVANVRL 540

Query: 543 RMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQ 594
           +MR+ G QKPSGASSIEV +EVHEFTVFD+ HPKS +IYQMI  LR++ KQ
Sbjct: 541 QMRNTGVQKPSGASSIEVGDEVHEFTVFDKLHPKSGEIYQMIERLRQDFKQ 589

BLAST of CmaCh14G020230 vs. TrEMBL
Match: B9T3T5_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0169170 PE=4 SV=1)

HSP 1 Score: 867.1 bits (2239), Expect = 1.3e-248
Identity = 401/594 (67.51%), Positives = 495/594 (83.33%), Query Frame = 1

Query: 5   SVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSL 64
           S+ TR P+W STR+LFE+KL DLHKCTD N +K++HAQI+K NLH DLYV PKLISAFSL
Sbjct: 8   SLPTRAPTWVSTRRLFEEKLQDLHKCTDFNHIKEVHAQIIKRNLHNDLYVAPKLISAFSL 67

Query: 65  CRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFL 124
           C QM LA N FNQ+Q PN HLYNTLIRAH QNSQ  +AF+TFF MQ +GL+ DNFT+PFL
Sbjct: 68  CHQMNLAVNVFNQIQDPNVHLYNTLIRAHVQNSQSLKAFATFFDMQKNGLFADNFTYPFL 127

Query: 125 LKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDC 184
           LKAC G GWLP ++M+H  +EK+GF  D+FVPNSLIDSYSKCG  G+  A KLF+ MG+ 
Sbjct: 128 LKACNGKGWLPTVQMIHCHVEKYGFFGDLFVPNSLIDSYSKCGLLGVNYAMKLFMEMGE- 187

Query: 185 RDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFDAM 244
           +D+VSWN MI G  K G    ARK+FD+M  RD++SWNT+LDGYVK G+M  AF LF+ M
Sbjct: 188 KDLVSWNSMIGGLVKAGDLGRARKLFDEMAERDAVSWNTILDGYVKAGEMSQAFNLFEKM 247

Query: 245 PERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFD 304
           PERNVVSWSTM+ GYCK GDMEMA+MLF+KMP +NLV+WTI+ISGFAEKGLAK A  L++
Sbjct: 248 PERNVVSWSTMVSGYCKTGDMEMARMLFDKMPFKNLVTWTIIISGFAEKGLAKEATTLYN 307

Query: 305 QMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCG 364
           QME AG+K D+G +ISILAACAESGLL LG+K+HASI+    KC+  +SNALVDMYAKCG
Sbjct: 308 QMEAAGLKPDDGTLISILAACAESGLLVLGKKVHASIKKIRIKCSVNVSNALVDMYAKCG 367

Query: 365 RLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLC 424
           R+D A ++FN++  +D+VSWN ML GLAMHGHGEKA++LF +M+++GF PDKVT+I +LC
Sbjct: 368 RVDKALSIFNEMSMRDLVSWNCMLQGLAMHGHGEKAIQLFSKMQQEGFKPDKVTLIAILC 427

Query: 425 ACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNV 484
           AC+HAG +D G+ YF+SME+D+ +V  IEHYGCM+DLLGR GRLEEA RL+++MPMEPN 
Sbjct: 428 ACTHAGFVDQGLSYFNSMERDHGIVPHIEHYGCMIDLLGRGGRLEEAFRLVQSMPMEPND 487

Query: 485 IIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRM 544
           +IWGTLLGACR+HNAV LA +VLD L+ LE SDPGN SMLSNI+AAAGDW+ VA++RL+M
Sbjct: 488 VIWGTLLGACRVHNAVPLAEKVLDRLITLEQSDPGNYSMLSNIFAAAGDWNSVANMRLQM 547

Query: 545 RSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFP 599
           +S G QKPSGASSIE+++EVHEFTVFD+SHP++DKIYQ++  L ++LKQVA  P
Sbjct: 548 KSTGVQKPSGASSIELDDEVHEFTVFDKSHPETDKIYQILVKLGQDLKQVAYAP 600

BLAST of CmaCh14G020230 vs. TrEMBL
Match: B9H0F0_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s12430g PE=4 SV=2)

HSP 1 Score: 853.6 bits (2204), Expect = 1.5e-244
Identity = 401/594 (67.51%), Positives = 489/594 (82.32%), Query Frame = 1

Query: 5   SVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSL 64
           SV  R P+W STR++F+QKL DLHKCT+LN +KQ+HAQILK NLH DLYV PKLISAFSL
Sbjct: 8   SVPIRAPTWVSTRRIFQQKLQDLHKCTNLNHIKQVHAQILKQNLHQDLYVAPKLISAFSL 67

Query: 65  CRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFL 124
            ++M LA N F Q+  PN HLYNT IRA  QNS    AF TFF MQ +GL+ DNFT+PFL
Sbjct: 68  SQEMTLAINVFKQIPDPNVHLYNTFIRACVQNSHSLLAFETFFEMQRNGLFADNFTYPFL 127

Query: 125 LKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDC 184
           LKAC G  WLP+++M+H  +EK+GF  D+FVPNSLIDSY KCG  G+ +A +LF  M D 
Sbjct: 128 LKACDGQSWLPLVKMIHNHLEKYGFFQDLFVPNSLIDSYCKCGLLGVKSAMRLFKVM-DE 187

Query: 185 RDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFDAM 244
           RDVVSWN MI G  K G   EA K+FD+MP++D++SWNT+LDGYVK G+M+ AF LF++M
Sbjct: 188 RDVVSWNSMIRGLLKVGELSEACKLFDEMPMKDAVSWNTILDGYVKAGEMNKAFGLFESM 247

Query: 245 PERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFD 304
           PERNVVSWSTM+ GYCK GDMEMA+MLF++MP +NLVSWTI++SG+A KGLAK AI  F+
Sbjct: 248 PERNVVSWSTMVSGYCKAGDMEMARMLFDRMPVKNLVSWTIIVSGYAVKGLAKDAIRSFE 307

Query: 305 QMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCG 364
           QMEEAG+K D+G +ISILA+CAESGLLGLG+++H SI    +KC+  +SNALVDMYAKCG
Sbjct: 308 QMEEAGLKPDDGTIISILASCAESGLLGLGKRVHTSIERIRYKCSVNVSNALVDMYAKCG 367

Query: 365 RLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLC 424
           ++D A +VFN +  KD+VSWN ML GLAMHGHGEKAL+LF  M+++GF PDKVT++ VLC
Sbjct: 368 QVDRALSVFNGMSKKDLVSWNCMLQGLAMHGHGEKALQLFSIMRQEGFRPDKVTLVAVLC 427

Query: 425 ACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNV 484
           AC HAG +D+GIRYF++ME+DY +V  IEHYGCMVDLLGR GRL+EA RL+++MP+EPNV
Sbjct: 428 ACVHAGFVDEGIRYFNNMERDYGIVPHIEHYGCMVDLLGRGGRLKEAYRLVQSMPVEPNV 487

Query: 485 IIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRM 544
           +IWGTLLGACRMHNAV LA EVLD L KLEPSDPGN S+LSNI+A+AGDW  VA+VRL+M
Sbjct: 488 VIWGTLLGACRMHNAVGLAEEVLDCLFKLEPSDPGNYSLLSNIFASAGDWSSVANVRLQM 547

Query: 545 RSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFP 599
           ++ G QKPSGASSIEV++EVHEFTVFD+SHPKSDKIYQMIN L  +LK+V   P
Sbjct: 548 KNFGIQKPSGASSIEVDDEVHEFTVFDKSHPKSDKIYQMINRLGLDLKRVHVVP 600

BLAST of CmaCh14G020230 vs. TrEMBL
Match: W9QIJ8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_020960 PE=4 SV=1)

HSP 1 Score: 842.4 bits (2175), Expect = 3.4e-241
Identity = 392/598 (65.55%), Positives = 484/598 (80.94%), Query Frame = 1

Query: 1   MQMCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLIS 60
           MQMC V  R P W S RKL EQKL+ LHKC ++  +KQLHAQI+++NLHLD +V PKLI+
Sbjct: 1   MQMC-VPVRKPGWVSPRKLLEQKLTHLHKCANILHIKQLHAQIIRANLHLDPFVAPKLIA 60

Query: 61  AFSLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFT 120
           AFSLCRQ+ LA N F+Q+  PN H++N +IRAH QNSQ  QAF+ FF MQ +G+ PDN+T
Sbjct: 61  AFSLCRQIALAVNVFDQIPEPNVHVFNAMIRAHTQNSQGPQAFAAFFNMQSNGVSPDNYT 120

Query: 121 FPFLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVS 180
           + FLLKAC+G  WL V++MVH  IEK GF SD+FVPN+LIDSYSKCG  G+  AKKLF++
Sbjct: 121 YSFLLKACSGKAWLSVVQMVHTHIEKCGFCSDIFVPNALIDSYSKCGPVGVFAAKKLFLA 180

Query: 181 MGDCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKL 240
           MG  RDVVSWN M+ G  K G   EAR++FD+MP RD ++WNTMLDGYVK G+M +AF+ 
Sbjct: 181 MG-VRDVVSWNSMVGGLVKVGELGEARQLFDEMPERDLVTWNTMLDGYVKAGQMSEAFEF 240

Query: 241 FDAMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAI 300
           F  MPERNVVSWST++ GY + GDM+MA++LF+KMP +NLV+WTI++SG+A KG+A  A 
Sbjct: 241 FQRMPERNVVSWSTIVSGYSRAGDMDMARILFDKMPVKNLVTWTIIVSGYAAKGIANEAN 300

Query: 301 GLFDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMY 360
            L+DQME+AG+K D+G +IS+LAACAESGLLGLGEK+HAS+    ++C T +SNALVDMY
Sbjct: 301 SLYDQMEDAGLKFDDGTIISVLAACAESGLLGLGEKVHASMDRIRYRCCTPVSNALVDMY 360

Query: 361 AKCGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMI 420
           AKCG LD AY VFN I NKD+VSWNAM+HGLA+HGHGEKAL LF RMK++ FSPD VT +
Sbjct: 361 AKCGSLDKAYKVFNGIGNKDLVSWNAMIHGLAIHGHGEKALWLFGRMKKESFSPDAVTFV 420

Query: 421 GVLCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPM 480
           G+LC C+HAG + +G+ YF SME+DY +V ++EHYGC+VDLLGR GRLEEA RL+ +MP+
Sbjct: 421 GILCGCAHAGFVKEGLYYFHSMERDYGVVPQVEHYGCVVDLLGRGGRLEEAFRLVNSMPV 480

Query: 481 EPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADV 540
           EPN +IW TLLGACR+HNAV+LA EVLDHL KLE SDPGN SMLSNIYAAAGDW+ VA +
Sbjct: 481 EPNPVIWSTLLGACRVHNAVDLAAEVLDHLAKLESSDPGNFSMLSNIYAAAGDWESVATM 540

Query: 541 RLRMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFP 599
           R+RMRS G QK SGASSIEVN+EVHEF VF +SHP+SDKIYQ I  L ++LK+V   P
Sbjct: 541 RVRMRSTGIQKASGASSIEVNDEVHEFKVFHKSHPQSDKIYQTIGKLNQDLKRVEYVP 596

BLAST of CmaCh14G020230 vs. TAIR10
Match: AT3G29230.1 (AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 791.6 bits (2043), Expect = 3.5e-229
Identity = 371/585 (63.42%), Positives = 464/585 (79.32%), Query Frame = 1

Query: 5   SVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSL 64
           S+  R PSW S+R++FE++L DL KC +LNQVKQLHAQI++ NLH DL++ PKLISA SL
Sbjct: 4   SLPVRAPSWVSSRRIFEERLQDLPKCANLNQVKQLHAQIIRRNLHEDLHIAPKLISALSL 63

Query: 65  CRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFL 124
           CRQ  LA   FNQVQ PN HL N+LIRAHAQNSQP QAF  F  MQ  GL+ DNFT+PFL
Sbjct: 64  CRQTNLAVRVFNQVQEPNVHLCNSLIRAHAQNSQPYQAFFVFSEMQRFGLFADNFTYPFL 123

Query: 125 LKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDC 184
           LKAC+G  WLPV++M+H  IEK G  SD++VPN+LID YS+CG  G+  A KLF  M + 
Sbjct: 124 LKACSGQSWLPVVKMMHNHIEKLGLSSDIYVPNALIDCYSRCGGLGVRDAMKLFEKMSE- 183

Query: 185 RDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFDAM 244
           RD VSWN M+ G  K G   +AR++FD+MP RD ISWNTMLDGY +  +M  AF+LF+ M
Sbjct: 184 RDTVSWNSMLGGLVKAGELRDARRLFDEMPQRDLISWNTMLDGYARCREMSKAFELFEKM 243

Query: 245 PERNVVSWSTMLLGYCKVGDMEMAQMLFNKMP--TRNLVSWTIVISGFAEKGLAKVAIGL 304
           PERN VSWSTM++GY K GDMEMA+++F+KMP   +N+V+WTI+I+G+AEKGL K A  L
Sbjct: 244 PERNTVSWSTMVMGYSKAGDMEMARVMFDKMPLPAKNVVTWTIIIAGYAEKGLLKEADRL 303

Query: 305 FDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAK 364
            DQM  +G+K D  AVISILAAC ESGLL LG +IH+ ++  N      + NAL+DMYAK
Sbjct: 304 VDQMVASGLKFDAAAVISILAACTESGLLSLGMRIHSILKRSNLGSNAYVLNALLDMYAK 363

Query: 365 CGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGV 424
           CG L  A++VFNDI  KD+VSWN MLHGL +HGHG++A+ELF RM+ +G  PDKVT I V
Sbjct: 364 CGNLKKAFDVFNDIPKKDLVSWNTMLHGLGVHGHGKEAIELFSRMRREGIRPDKVTFIAV 423

Query: 425 LCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEP 484
           LC+C+HAGLID+GI YF SMEK Y LV ++EHYGC+VDLLGR GRL+EAI++++TMPMEP
Sbjct: 424 LCSCNHAGLIDEGIDYFYSMEKVYDLVPQVEHYGCLVDLLGRVGRLKEAIKVVQTMPMEP 483

Query: 485 NVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRL 544
           NV+IWG LLGACRMHN V++A+EVLD+LVKL+P DPGN S+LSNIYAAA DW+ VAD+R 
Sbjct: 484 NVVIWGALLGACRMHNEVDIAKEVLDNLVKLDPCDPGNYSLLSNIYAAAEDWEGVADIRS 543

Query: 545 RMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGL 588
           +M+S+G +KPSGASS+E+ + +HEFTVFD+SHPKSD+IYQM+  L
Sbjct: 544 KMKSMGVEKPSGASSVELEDGIHEFTVFDKSHPKSDQIYQMLGSL 587
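
The percentages in an HSP summary line follow directly from the residue counts (matches divided by alignment length). The sketch below recomputes them for the AT3G29230.1 hit above. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the label for positive matches is matched loosely so that spelling variants of the word are accepted.

```python
import re

# Pattern for the HSP summary line. "Posi?tives" is a loose match so that
# spelling variants of the label are accepted (an assumption of this sketch).
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Recompute (identity %, positives %) from the counts in a summary line."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    return identity, positives


line = "Identity = 371/585 (63.42%), Positives = 464/585 (79.32%), Query Frame = 1"
print(hsp_percentages(line))
```

The same function can be applied to the summary line of any other hit to check that the reported percentages agree with the counts.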

BLAST of CmaCh14G020230 vs. TAIR10
Match: AT2G29760.1 (AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 427.2 bits (1097), Expect = 1.7e-119
Identity = 225/625 (36.00%), Positives = 353/625 (56.48%), Query Frame = 1

Query: 11  PSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKL--ISAFSLCRQM 70
           P+  +T     + +S + +C  L Q+KQ H  ++++    D Y   KL  ++A S    +
Sbjct: 21  PNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASL 80

Query: 71  PLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDG-LYPDNFTFPFLLKA 130
             A   F+++  PN   +NTLIRA+A    P  +   F  M  +   YP+ +TFPFL+KA
Sbjct: 81  EYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKA 140

Query: 131 CTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDV 190
                 L + + +H    K    SDVFV NSLI  Y  CG   + +A K+F ++ + +DV
Sbjct: 141 AAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGD--LDSACKVFTTIKE-KDV 200

Query: 191 VSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDG-------------------- 250
           VSWN MI+GF + G  ++A ++F KM   D  + +  + G                    
Sbjct: 201 VSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSY 260

Query: 251 -------------------YVKVGKMDDAFKLFDAMPERNVVSWSTMLLGYCKVGDMEMA 310
                              Y K G ++DA +LFDAM E++ V+W+TML GY    D E A
Sbjct: 261 IEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAA 320

Query: 311 QMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQME-EAGVKLDNGAVISILAACAE 370
           + + N MP +++V+W  +IS + + G    A+ +F +++ +  +KL+   ++S L+ACA+
Sbjct: 321 REVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQ 380

Query: 371 SGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCGRLDIAYNVFNDIQNKDVVSWNAM 430
            G L LG  IH+ I+ H  +    +++AL+ MY+KCG L+ +  VFN ++ +DV  W+AM
Sbjct: 381 VGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAM 440

Query: 431 LHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSHAGLIDDGIRYFSSMEKDYA 490
           + GLAMHG G +A+++F +M+E    P+ VT   V CACSH GL+D+    F  ME +Y 
Sbjct: 441 IGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYG 500

Query: 491 LVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWGTLLGACRMHNAVELAREVL 550
           +V E +HY C+VD+LGR G LE+A++ I  MP+ P+  +WG LLGAC++H  + LA    
Sbjct: 501 IVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMAC 560

Query: 551 DHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVNNEVHEF 593
             L++LEP + G   +LSNIYA  G W+ V+++R  MR  G +K  G SSIE++  +HEF
Sbjct: 561 TRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEF 620

BLAST of CmaCh14G020230 vs. TAIR10
Match: AT4G02750.1 (AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 411.0 bits (1055), Expect = 1.3e-114
Identity = 212/532 (39.85%), Positives = 321/532 (60.34%), Query Frame = 1

Query: 71  ATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFLLKACTG 130
           A + F+++   N   +N L+ A+ QNS+  +A   F + +   L   N         C  
Sbjct: 176 ARSVFDRMPEKNDVSWNALLSAYVQNSKMEEACMLFKSRENWALVSWN---------CLL 235

Query: 131 NGWLPVIEMVHAQIEKFGFMS--DVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDVV 190
            G++   ++V A+ + F  M+  DV   N++I  Y++  SG I  A++LF      +DV 
Sbjct: 236 GGFVKKKKIVEAR-QFFDSMNVRDVVSWNTIITGYAQ--SGKIDEARQLF-DESPVQDVF 295

Query: 191 SWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFDAMPERN 250
           +W  M+SG+ +  + EEAR++FDKMP R+ +SWN ML GYV+  +M+ A +LFD MP RN
Sbjct: 296 TWTAMVSGYIQNRMVEEARELFDKMPERNEVSWNAMLAGYVQGERMEMAKELFDVMPCRN 355

Query: 251 VVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQMEE 310
           V +W+TM+ GY + G +  A+ LF+KMP R+ VSW  +I+G+++ G +  A+ LF QME 
Sbjct: 356 VSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQMER 415

Query: 311 AGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCGRLDI 370
            G +L+  +  S L+ CA+   L LG+++H  +    ++    + NAL+ MY KCG ++ 
Sbjct: 416 EGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEE 475

Query: 371 AYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSH 430
           A ++F ++  KD+VSWN M+ G + HG GE AL  F+ MK +G  PD  TM+ VL ACSH
Sbjct: 476 ANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSH 535

Query: 431 AGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWG 490
            GL+D G +YF +M +DY ++   +HY CMVDLLGR G LE+A  L++ MP EP+  IWG
Sbjct: 536 TGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWG 595

Query: 491 TLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIG 550
           TLLGA R+H   ELA    D +  +EP + G   +LSN+YA++G W  V  +R+RMR  G
Sbjct: 596 TLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKG 655

Query: 551 TQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFPNT 601
            +K  G S IE+ N+ H F+V D  HP+ D+I+  +  L   +K+      T
Sbjct: 656 VKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKT 694

BLAST of CmaCh14G020230 vs. TAIR10
Match: AT3G08820.1 (AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 411.0 bits (1055), Expect = 1.3e-114
Identity = 227/579 (39.21%), Positives = 324/579 (55.96%), Query Frame = 1

Query: 30  CTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSLCRQMPLATNAFNQVQYPNGHLYNTL 89
           CT +N +KQ+H  ++  +LH D ++V  L+      RQ   +   F+  Q+PN  LYN+L
Sbjct: 24  CT-VNHLKQIHVSLINHHLHHDTFLVNLLLKRTLFFRQTKYSYLLFSHTQFPNIFLYNSL 83

Query: 90  IRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFLLKACTGNGWLPVIEMVHAQIEKFGF 149
           I     N    +    F +++  GLY   FTFP +LKACT      +   +H+ + K GF
Sbjct: 84  INGFVNNHLFHETLDLFLSIRKHGLYLHGFTFPLVLKACTRASSRKLGIDLHSLVVKCGF 143

Query: 150 MSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDVVSWNLMISGFAKGGLYEEARKV 209
             DV    SL+  YS  GSG +  A KLF  + D R VV+W  + SG+   G + EA  +
Sbjct: 144 NHDVAAMTSLLSIYS--GSGRLNDAHKLFDEIPD-RSVVTWTALFSGYTTSGRHREAIDL 203

Query: 210 FDKMPIR----DSISWNTMLDGYVKVGKMDDAFKLFDAMPE----RNVVSWSTMLLGYCK 269
           F KM       DS     +L   V VG +D    +   M E    +N    +T++  Y K
Sbjct: 204 FKKMVEMGVKPDSYFIVQVLSACVHVGDLDSGEWIVKYMEEMEMQKNSFVRTTLVNLYAK 263

Query: 270 VGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQMEEAGVKLDNGAVISI 329
            G ME A+ +F+ M  +++V+W+ +I G+A     K  I LF QM +  +K D  +++  
Sbjct: 264 CGKMEKARSVFDSMVEKDIVTWSTMIQGYASNSFPKEGIELFLQMLQENLKPDQFSIVGF 323

Query: 330 LAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAKCGRLDIAYNVFNDIQNKDV 389
           L++CA  G L LGE   + I  H F     ++NAL+DMYAKCG +   + VF +++ KD+
Sbjct: 324 LSSCASLGALDLGEWGISLIDRHEFLTNLFMANALIDMYAKCGAMARGFEVFKEMKEKDI 383

Query: 390 VSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSHAGLIDDGIRYFSS 449
           V  NA + GLA +GH + +  +F + ++ G SPD  T +G+LC C HAGLI DG+R+F++
Sbjct: 384 VIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRFFNA 443

Query: 450 MEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWGTLLGACRMHNAVE 509
           +   YAL   +EHYGCMVDL GR G L++A RLI  MPM PN I+WG LL  CR+    +
Sbjct: 444 ISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRPNAIVWGALLSGCRLVKDTQ 503

Query: 510 LAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVN 569
           LA  VL  L+ LEP + GN   LSNIY+  G WD  A+VR  M   G +K  G S IE+ 
Sbjct: 504 LAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPGYSWIELE 563

Query: 570 NEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFPNT 601
            +VHEF   D+SHP SDKIY  +  L  E++ +   P T
Sbjct: 564 GKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLMGFVPTT 598

BLAST of CmaCh14G020230 vs. TAIR10
Match: AT2G13600.1 (AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 388.7 bits (997), Expect = 6.7e-108
Identity = 216/628 (34.39%), Positives = 346/628 (55.10%), Query Frame = 1

Query: 12  SWFSTRKLFEQKLSDLH-KCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAFSLCRQMPL 71
           S FS     + +L D + KC  L   +Q+  ++ + N+    Y    +++  +    +  
Sbjct: 49  SGFSNEIFIQNRLIDAYSKCGSLEDGRQVFDKMPQRNI----YTWNSVVTGLTKLGFLDE 108

Query: 72  ATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFPFLLKACTG 131
           A + F  +   +   +N+++   AQ+ +  +A   F  M  +G   + ++F  +L AC+G
Sbjct: 109 ADSLFRSMPERDQCTWNSMVSGFAQHDRCEEALCYFAMMHKEGFVLNEYSFASVLSACSG 168

Query: 132 NGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMGDCRDVVSW 191
              +     VH+ I K  F+SDV++ ++L+D YSKCG+  +  A+++F  MGD R+VVSW
Sbjct: 169 LNDMNKGVQVHSLIAKSPFLSDVYIGSALVDMYSKCGN--VNDAQRVFDEMGD-RNVVSW 228

Query: 192 NLMISGFAKGGLYEEARKVF----------DKMPIRDSIS-------------------- 251
           N +I+ F + G   EA  VF          D++ +   IS                    
Sbjct: 229 NSLITCFEQNGPAVEALDVFQMMLESRVEPDEVTLASVISACASLSAIKVGQEVHGRVVK 288

Query: 252 ----------WNTMLDGYVKVGKMDDAFKLFDAMPERNVVSWSTMLLGYCKVGDMEMAQM 311
                      N  +D Y K  ++ +A  +FD+MP RNV++ ++M+ GY      + A++
Sbjct: 289 NDKLRNDIILSNAFVDMYAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARL 348

Query: 312 LFNKMPTRNLVSWTIVISGFAEKGLAKVAIGLFDQMEEAGVKLDNGAVISILAACAESGL 371
           +F KM  RN+VSW  +I+G+ + G  + A+ LF  ++   V   + +  +IL ACA+   
Sbjct: 349 MFTKMAERNVVSWNALIAGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAE 408

Query: 372 LGLGEKIHASIRNHNFKCTTE------ISNALVDMYAKCGRLDIAYNVFNDIQNKDVVSW 431
           L LG + H  +  H FK  +       + N+L+DMY KCG ++  Y VF  +  +D VSW
Sbjct: 409 LHLGMQAHVHVLKHGFKFQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSW 468

Query: 432 NAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGVLCACSHAGLIDDGIRYFSSMEK 491
           NAM+ G A +G+G +ALELF+ M E G  PD +TMIGVL AC HAG +++G  YFSSM +
Sbjct: 469 NAMIIGFAQNGYGNEALELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTR 528

Query: 492 DYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEPNVIIWGTLLGACRMHNAVELAR 551
           D+ +    +HY CMVDLLGR G LEEA  +I  MPM+P+ +IWG+LL AC++H  + L +
Sbjct: 529 DFGVAPLRDHYTCMVDLLGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGK 588

Query: 552 EVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRLRMRSIGTQKPSGASSIEVNNEV 593
            V + L+++EPS+ G   +LSN+YA  G W+ V +VR  MR  G  K  G S I++    
Sbjct: 589 YVAEKLLEVEPSNSGPYVLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHD 648

BLAST of CmaCh14G020230 vs. NCBI nr
Match: gi|659075293|ref|XP_008438067.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g29230 [Cucumis melo])

HSP 1 Score: 1078.9 bits (2789), Expect = 0.0e+00
Identity = 519/601 (86.36%), Positives = 558/601 (92.85%), Query Frame = 1

Query: 1   MQMCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLIS 60
           MQMCSV  RTPSWFSTRKLFEQKL++LHKCTDLNQVKQLHAQILKSNLH+DL+VVPKLIS
Sbjct: 1   MQMCSVPIRTPSWFSTRKLFEQKLAELHKCTDLNQVKQLHAQILKSNLHVDLFVVPKLIS 60

Query: 61  AFSLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFT 120
           AFSLCRQM LATN FNQVQYPN HLYNT+IRAH+ NSQPSQAF+TFF MQ DG YPDNFT
Sbjct: 61  AFSLCRQMLLATNTFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGFYPDNFT 120

Query: 121 FPFLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVS 180
           FPFLLK CTGN WLPV+E VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GI  AKKLFVS
Sbjct: 121 FPFLLKVCTGNVWLPVVERVHAQIEKFGFMSDVFVPNSLIDSYSKCGSRGISAAKKLFVS 180

Query: 181 MGDCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKL 240
           MG  RDVVSWN MISG AKGGLYEEARKVFD+MP RD ISWNTMLDGYVKVGKMDDAFKL
Sbjct: 181 MGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPKRDGISWNTMLDGYVKVGKMDDAFKL 240

Query: 241 FDAMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAI 300
           FD MPERNVVSWSTM+LGYCK G MEMA+MLF+KMP +NLVSWTI++SGFAEKGLA+ AI
Sbjct: 241 FDEMPERNVVSWSTMVLGYCKAGGMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAI 300

Query: 301 GLFDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMY 360
            LFDQME+A +KLDNG +ISIL ACAESGLLGLGEKIHASI+N+NFKCTTEISNALVDMY
Sbjct: 301 DLFDQMEKACLKLDNGTIISILDACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMY 360

Query: 361 AKCGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMI 420
           AKCGRL+IAY+VF+DI+NKDVVSWNAML GLAMHGHG KALELFK+MKE+GFSP++VTMI
Sbjct: 361 AKCGRLNIAYDVFSDIKNKDVVSWNAMLQGLAMHGHGMKALELFKKMKEEGFSPNRVTMI 420

Query: 421 GVLCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPM 480
           GVLCAC+HAGLIDDGIRYFS+ME+DY LV E+EHYGCMVDLLGRKGRLEEAIRLIR MPM
Sbjct: 421 GVLCACTHAGLIDDGIRYFSTMERDYGLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPM 480

Query: 481 EPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADV 540
            PN IIWGTLLGACRMHNAVELAREVLDHLV+LEPSD GNLSMLSNIYAAAGDW+CVA+ 
Sbjct: 481 TPNAIIWGTLLGACRMHNAVELAREVLDHLVELEPSDSGNLSMLSNIYAAAGDWNCVANT 540

Query: 541 RLRMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQVACFPNT 600
           RLRMRSIGT+KPSGASSIEV+NEVHEFTVFDRSHPKSD IYQ+INGLRRELKQV CF N 
Sbjct: 541 RLRMRSIGTKKPSGASSIEVDNEVHEFTVFDRSHPKSDNIYQVINGLRRELKQVECFSNM 600

Query: 601 C 602
           C
Sbjct: 601 C 601

BLAST of CmaCh14G020230 vs. NCBI nr
Match: gi|700201399|gb|KGN56532.1| (hypothetical protein Csa_3G122560 [Cucumis sativus])

HSP 1 Score: 1050.0 bits (2714), Expect = 1.5e-303
Identity = 508/584 (86.99%), Positives = 544/584 (93.15%), Query Frame = 1

Query: 1   MQMCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLIS 60
           MQMCSV  RTPSWFSTRKL EQKLSDLHKCT+LNQVKQLHAQILKSNLH+DL+VVPKLIS
Sbjct: 1   MQMCSVPIRTPSWFSTRKLLEQKLSDLHKCTNLNQVKQLHAQILKSNLHVDLFVVPKLIS 60

Query: 61  AFSLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFT 120
           AFSLCRQM LATNAFNQVQYPN HLYNT+IRAH+ NSQPSQAF+TFF MQ DG Y DNFT
Sbjct: 61  AFSLCRQMLLATNAFNQVQYPNVHLYNTMIRAHSHNSQPSQAFATFFAMQRDGHYADNFT 120

Query: 121 FPFLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVS 180
           FPFLLK CTGN WLPVIE VHAQIEKFGFMSDVFVPNSLIDSYSKCGS GI  AKKLFVS
Sbjct: 121 FPFLLKVCTGNVWLPVIESVHAQIEKFGFMSDVFVPNSLIDSYSKCGSCGISAAKKLFVS 180

Query: 181 MGDCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKL 240
           MG  RDVVSWN MISG AKGGLYEEARKVFD+MP +D ISWNTMLDGYVKVGKMDDAFKL
Sbjct: 181 MGARRDVVSWNSMISGLAKGGLYEEARKVFDEMPEKDGISWNTMLDGYVKVGKMDDAFKL 240

Query: 241 FDAMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAI 300
           FD MPERNVVSWSTM+LGYCK GDMEMA+MLF+KMP +NLVSWTI++SGFAEKGLA+ AI
Sbjct: 241 FDEMPERNVVSWSTMVLGYCKAGDMEMARMLFDKMPVKNLVSWTIIVSGFAEKGLAREAI 300

Query: 301 GLFDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMY 360
            LFDQME+A +KLDNG V+SILAACAESGLLGLGEKIHASI+N+NFKCTTEISNALVDMY
Sbjct: 301 SLFDQMEKACLKLDNGTVMSILAACAESGLLGLGEKIHASIKNNNFKCTTEISNALVDMY 360

Query: 361 AKCGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMI 420
           AKCGRL+IAY+VFNDI+NKDVVSWNAML GLAMHGHG KALELFKRMKE+GFSP+KVTMI
Sbjct: 361 AKCGRLNIAYDVFNDIKNKDVVSWNAMLQGLAMHGHGVKALELFKRMKEEGFSPNKVTMI 420

Query: 421 GVLCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPM 480
           GVLCAC+HAGLIDDGIRYFS+ME+DY LV E+EHYGCMVDLLGRKGRLEEAIRLIR MPM
Sbjct: 421 GVLCACTHAGLIDDGIRYFSTMERDYTLVPEVEHYGCMVDLLGRKGRLEEAIRLIRNMPM 480

Query: 481 EPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADV 540
            PN IIWGTLLGACRMHNAVELAREVLDHLV+LEP+D GN SMLSNIYAAAGDW+CVA+ 
Sbjct: 481 APNAIIWGTLLGACRMHNAVELAREVLDHLVELEPTDSGNFSMLSNIYAAAGDWNCVANT 540

Query: 541 RLRMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMI 585
           RLRMRSIGT+KPSGASSIEVNNEVHEFTVFDRSHPKSD IYQ++
Sbjct: 541 RLRMRSIGTKKPSGASSIEVNNEVHEFTVFDRSHPKSDNIYQVL 584

BLAST of CmaCh14G020230 vs. NCBI nr
Match: gi|658012680|ref|XP_008341615.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g29230 [Malus domestica])

HSP 1 Score: 877.9 bits (2267), Expect = 1.0e-251
Identity = 412/594 (69.36%), Positives = 499/594 (84.01%), Query Frame = 1

Query: 1   MQMCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLIS 60
           MQMC V  R+PSW S R++ +QKLSDLH+CT+L+ VKQ+HAQILK++LH D++  PKLI+
Sbjct: 1   MQMC-VPVRSPSWVSRRRILDQKLSDLHRCTNLSHVKQVHAQILKADLHQDIHTAPKLIA 60

Query: 61  AFSLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFT 120
           AFSLCRQM LA N FNQ+ YPN HLYNTLIRAH QNSQ SQAF+ FF MQ +G+YPDNFT
Sbjct: 61  AFSLCRQMALAVNVFNQIDYPNVHLYNTLIRAHIQNSQTSQAFAAFFDMQINGVYPDNFT 120

Query: 121 FPFLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVS 180
           +PFLLKAC G  WLPV++M+HA IEKFGF  D+FVPNSLID+YSKCG  G+  AKKLFV 
Sbjct: 121 YPFLLKACLGRPWLPVVQMIHAHIEKFGFGLDIFVPNSLIDTYSKCGLIGVSEAKKLFVV 180

Query: 181 MGDCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKL 240
           M + RD VSWN MI G AK G   +AR++F++MP RDS+SWNT+LDGYVK G+M++AF+L
Sbjct: 181 MEE-RDTVSWNSMIGGLAKAGELSDARRLFEEMPGRDSVSWNTILDGYVKAGEMNEAFEL 240

Query: 241 FDAMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAI 300
           F+ MP+RNVVSWST++ GY K GDM MA+M+F++MP RNLV WTI+ISG+AEKGLAK AI
Sbjct: 241 FEKMPQRNVVSWSTLVSGYSKAGDMSMARMMFDRMPFRNLVPWTIIISGYAEKGLAKEAI 300

Query: 301 GLFDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMY 360
            L+DQMEEAG+K DNGAVISILAACAESGL+GLG K+HASI    FKC+T +SNAL+DMY
Sbjct: 301 MLYDQMEEAGLKPDNGAVISILAACAESGLIGLGRKVHASIERTQFKCSTPVSNALLDMY 360

Query: 361 AKCGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMI 420
           AKCG LD A+ VF+ I  KD+VSWNAML GLAMHGHGEKALELF RM + GF PDKVT I
Sbjct: 361 AKCGVLDEAFRVFDGIAKKDLVSWNAMLQGLAMHGHGEKALELFSRMVKDGFFPDKVTFI 420

Query: 421 GVLCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPM 480
           GVLCAC+H G +++G+  F++ME++Y +V E+EHYGCM+DLLGR G L+EA RL+ +MPM
Sbjct: 421 GVLCACTHIGFVEEGLHAFNTMEREYGIVPEVEHYGCMIDLLGRGGHLQEAFRLVHSMPM 480

Query: 481 EPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADV 540
           EPN +IWGTLLGACRMHN  ELA EVL+HLVKL+PS+PGN SMLSNIYAAAGDW  VA+V
Sbjct: 481 EPNAVIWGTLLGACRMHNDRELAEEVLNHLVKLDPSEPGNFSMLSNIYAAAGDWANVANV 540

Query: 541 RLRMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQV 595
           R++MRS G QKPSGASSIEV++EVHEFTVFD+ HPKSD++Y MI+ LR + KQ+
Sbjct: 541 RMQMRSTGVQKPSGASSIEVDDEVHEFTVFDKLHPKSDEVYGMIDRLRVDFKQL 592

BLAST of CmaCh14G020230 vs. NCBI nr
Match: gi|694361968|ref|XP_009360575.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g29230 [Pyrus x bretschneideri])

HSP 1 Score: 877.9 bits (2267), Expect = 1.0e-251
Identity = 411/594 (69.19%), Positives = 499/594 (84.01%), Query Frame = 1

Query: 1   MQMCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLIS 60
           MQMC V  R+PSW S R++ +QKLSDLH+CT+L+ +KQ+HAQILK++LH D++  PKLI+
Sbjct: 1   MQMC-VPVRSPSWVSRRRILDQKLSDLHRCTNLSHIKQVHAQILKADLHQDIHSAPKLIA 60

Query: 61  AFSLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFT 120
           AFSLCRQM LA N FNQ+ YPN HLYNTLIRAH QNSQ SQAF+ FF MQ +G+YPDNFT
Sbjct: 61  AFSLCRQMALAVNVFNQIDYPNVHLYNTLIRAHIQNSQTSQAFAAFFDMQINGVYPDNFT 120

Query: 121 FPFLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVS 180
           +PFLLKAC G  WLPV++M+HA IEKFGF  D+FVPNSLID+YSKCG  G+  AKKLFV 
Sbjct: 121 YPFLLKACVGRPWLPVVQMIHAHIEKFGFGLDIFVPNSLIDTYSKCGLIGVSEAKKLFVV 180

Query: 181 MGDCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKL 240
           M + RD VSWN MI G AK G   +AR++F++MP RDS+SWNT+LDGYVK G+M++AF+L
Sbjct: 181 MEE-RDTVSWNSMIGGLAKAGELSDARRLFEEMPERDSVSWNTILDGYVKAGEMNEAFEL 240

Query: 241 FDAMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAI 300
           F+ MP+RNVVSWST++ GY K GDM MA+M+F++MP RNLV WTI+ISG+AEKGLAK AI
Sbjct: 241 FEKMPQRNVVSWSTLVSGYSKAGDMSMARMMFDRMPFRNLVPWTIIISGYAEKGLAKEAI 300

Query: 301 GLFDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMY 360
            L+DQMEEAG+K DNGAVISILAACAESG++GLG K+HASI  + FKC+T +SNAL+DMY
Sbjct: 301 MLYDQMEEAGLKPDNGAVISILAACAESGMIGLGRKVHASIERNQFKCSTPVSNALLDMY 360

Query: 361 AKCGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMI 420
           AKCG LD A  VF+ I  KD+VSWNAML GLAMHGHGEKALELF RM + GF PDKVT I
Sbjct: 361 AKCGVLDEASRVFDGIAKKDLVSWNAMLQGLAMHGHGEKALELFSRMVKAGFFPDKVTFI 420

Query: 421 GVLCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPM 480
           GVLCAC+H G +++G+  F++MEK+Y +V E+EHYGCM+DLLGR GRL+EA RL+ +MPM
Sbjct: 421 GVLCACTHVGFVEEGLHAFNTMEKEYGIVPEVEHYGCMIDLLGRGGRLQEAFRLVHSMPM 480

Query: 481 EPNVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADV 540
           EPN +IWGTLLGACRMHN  ELA EVL+HLVKL+PS+PGN SM SNIYAAAGDW  VA+V
Sbjct: 481 EPNAVIWGTLLGACRMHNDRELAEEVLNHLVKLDPSEPGNFSMFSNIYAAAGDWANVANV 540

Query: 541 RLRMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQV 595
           R++MRS G QKPSGASSIEV++EVHEFTVFD+ HPKSD++Y MI+ LR + KQ+
Sbjct: 541 RMQMRSTGVQKPSGASSIEVDDEVHEFTVFDKLHPKSDEVYGMIDRLRVDFKQL 592

BLAST of CmaCh14G020230 vs. NCBI nr
Match: gi|595967381|ref|XP_007217279.1| (hypothetical protein PRUPE_ppa017633mg [Prunus persica])

HSP 1 Score: 875.5 bits (2261), Expect = 5.2e-251
Identity = 413/591 (69.88%), Positives = 496/591 (83.93%), Query Frame = 1

Query: 3   MCSVLTRTPSWFSTRKLFEQKLSDLHKCTDLNQVKQLHAQILKSNLHLDLYVVPKLISAF 62
           MC V  R+PSW S R+L EQKLSDLH+CT+L+ +KQ+HAQILK+NLH DL+  PKLI+AF
Sbjct: 1   MC-VPVRSPSWVSRRRLLEQKLSDLHRCTNLSHIKQVHAQILKANLHQDLHTAPKLIAAF 60

Query: 63  SLCRQMPLATNAFNQVQYPNGHLYNTLIRAHAQNSQPSQAFSTFFTMQFDGLYPDNFTFP 122
           SLCRQM LA N FNQVQ PN HLYNTLIRAH QNSQ +QAF+TFF MQ +G+YPDNFT+P
Sbjct: 61  SLCRQMALAVNVFNQVQDPNVHLYNTLIRAHIQNSQTTQAFATFFDMQLNGVYPDNFTYP 120

Query: 123 FLLKACTGNGWLPVIEMVHAQIEKFGFMSDVFVPNSLIDSYSKCGSGGILTAKKLFVSMG 182
           FLLKAC+G  W PV++M+H  IEKFGF  D+FVPNSLID+YSKCG  G+  AKK+F+ MG
Sbjct: 121 FLLKACSGRPWFPVVQMIHTSIEKFGFCLDIFVPNSLIDTYSKCGLLGVSEAKKMFMLMG 180

Query: 183 DCRDVVSWNLMISGFAKGGLYEEARKVFDKMPIRDSISWNTMLDGYVKVGKMDDAFKLFD 242
           + RD+VSWN MI G AK G   EAR++FD+MP +D++SWNT+LDGY K G+M++AF+LF+
Sbjct: 181 E-RDIVSWNSMIGGLAKTGELGEARRLFDEMPDKDAVSWNTILDGYAKAGQMNEAFELFE 240

Query: 243 AMPERNVVSWSTMLLGYCKVGDMEMAQMLFNKMPTRNLVSWTIVISGFAEKGLAKVAIGL 302
            MP+RNVVSWST++ GY K GDM MA+M+F+KMP RNLV WTI+ISG+AEKGLAK AI L
Sbjct: 241 RMPQRNVVSWSTLVSGYSKAGDMGMARMMFDKMPFRNLVPWTIIISGYAEKGLAKEAIML 300

Query: 303 FDQMEEAGVKLDNGAVISILAACAESGLLGLGEKIHASIRNHNFKCTTEISNALVDMYAK 362
           +DQMEEAG+K DNGA+ISILAACAESGL+GLG K+HASI    FKC+T +SNAL+DMYAK
Sbjct: 301 YDQMEEAGLKPDNGAIISILAACAESGLIGLGRKVHASIERTRFKCSTPVSNALLDMYAK 360

Query: 363 CGRLDIAYNVFNDIQNKDVVSWNAMLHGLAMHGHGEKALELFKRMKEKGFSPDKVTMIGV 422
           CG LD A  VF+ I  KD+VSWNAML GLAMHGHG+KAL+LF RM + GF PDKVT IGV
Sbjct: 361 CGMLDEASRVFHGIAKKDLVSWNAMLQGLAMHGHGDKALQLFSRMVKAGFLPDKVTFIGV 420

Query: 423 LCACSHAGLIDDGIRYFSSMEKDYALVHEIEHYGCMVDLLGRKGRLEEAIRLIRTMPMEP 482
           LCAC+HAG +++G++ F +ME++Y +V EIEHYGCM+DLLGR G L EA RL+ +MPMEP
Sbjct: 421 LCACTHAGFVEEGLQAFHTMEREYGIVPEIEHYGCMIDLLGRGGCLREAFRLVHSMPMEP 480

Query: 483 NVIIWGTLLGACRMHNAVELAREVLDHLVKLEPSDPGNLSMLSNIYAAAGDWDCVADVRL 542
           NV+IWGTLLGACRMHN  ELA+EVLDHLVKL+PSD GN SMLSNIYAAAGDW  VA+VRL
Sbjct: 481 NVVIWGTLLGACRMHNDPELAQEVLDHLVKLDPSDAGNFSMLSNIYAAAGDWANVANVRL 540

Query: 543 RMRSIGTQKPSGASSIEVNNEVHEFTVFDRSHPKSDKIYQMINGLRRELKQ 594
           +MR+ G QKPSGASSIEV +EVHEFTVFD+ HPKS +IYQMI  LR++ KQ
Sbjct: 541 QMRNTGVQKPSGASSIEVGDEVHEFTVFDKLHPKSGEIYQMIERLRQDFKQ 589

The following BLAST results are available for this feature:
Match Name     E-value     Identity   Description
PP261_ARATH    6.1e-228    63.42      Pentatricopeptide repeat-containing protein At3g29230 OS=Arabidopsis thaliana GN...
PP175_ARATH    3.0e-118    36.00      Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PP301_ARATH    2.2e-113    39.85      Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana GN...
PP219_ARATH    2.2e-113    39.21      Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th...
PP151_ARATH    1.2e-106    34.39      Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana GN...

Match Name          E-value     Identity   Description
A0A0A0L7H7_CUCSA    1.1e-303    86.99      Uncharacterized protein OS=Cucumis sativus GN=Csa_3G122560 PE=4 SV=1
M5XAE6_PRUPE        3.6e-251    69.88      Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa017633mg PE=4 SV=1
B9T3T5_RICCO        1.3e-248    67.51      Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
B9H0F0_POPTR        1.5e-244    67.51      Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0004s12430g PE=4 SV=2
W9QIJ8_9ROSA        3.4e-241    65.55      Uncharacterized protein OS=Morus notabilis GN=L484_020960 PE=4 SV=1

Match Name     E-value     Identity   Description
AT3G29230.1    3.5e-229    63.42      Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G29760.1    1.7e-119    36.00      Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G02750.1    1.3e-114    39.85      Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G08820.1    1.3e-114    39.21      Pentatricopeptide repeat (PPR) superfamily protein
AT2G13600.1    6.7e-108    34.39      Pentatricopeptide repeat (PPR) superfamily protein

Match Name                         E-value     Identity   Description
gi|659075293|ref|XP_008438067.1|   0.0e+00     86.36      PREDICTED: pentatricopeptide repeat-containing protein At3g29230 [Cucumis melo]
gi|700201399|gb|KGN56532.1|        1.5e-303    86.99      hypothetical protein Csa_3G122560 [Cucumis sativus]
gi|658012680|ref|XP_008341615.1|   1.0e-251    69.36      PREDICTED: pentatricopeptide repeat-containing protein At3g29230 [Malus domestic...
gi|694361968|ref|XP_009360575.1|   1.0e-251    69.19      PREDICTED: pentatricopeptide repeat-containing protein At3g29230 [Pyrus x bretsc...
gi|595967381|ref|XP_007217279.1|   5.2e-251    69.88      hypothetical protein PRUPE_ppa017633mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf

Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding

GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0008152        metabolic process
cellular_component    GO:0005575        cellular_component
molecular_function    GO:0005515        protein binding
molecular_function    GO:0008568        microtubule-severing ATPase activity
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh14G020230.1    CmaCh14G020230.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term     IPR Description                          Source      Source Term         Source Description    Alignment

IPR002885    Pentatricopeptide repeat                 PFAM        PF01535             PPR
    coord: 250..279    score: 2.9E-7
    coord: 281..311    score: 9.8E-6
    coord: 188..214    score: 4.3E-8
    coord: 454..479    score: 0.

IPR002885    Pentatricopeptide repeat                 PFAM        PF13041             PPR_2
    coord: 81..128     score: 1.9E-9
    coord: 217..249    score: 3.2E-9
    coord: 379..426    score: 6.4

IPR002885    Pentatricopeptide repeat                 TIGRFAMs    TIGR00756           TIGR00756
    coord: 85..117     score: 2.7E-5
    coord: 219..250    score: 1.8E-8
    coord: 382..415    score: 1.3E-10
    coord: 188..215    score: 5.2E-7
    coord: 281..314    score: 8.4E-6
    coord: 250..280    score: 2.

IPR002885    Pentatricopeptide repeat                 PROFILE     PS51375             PPR
    coord: 279..313    score: 10.622
    coord: 451..481    score: 7.892
    coord: 117..151    score: 6.204
    coord: 380..414    score: 13.91
    coord: 152..184    score: 5.919
    coord: 252..278    score: 6.675
    coord: 82..116     score: 9.427
    coord: 415..445    score: 6.939
    coord: 186..216    score: 10.95
    coord: 217..251    score: 12.726
    coord: 314..348    score: 5.689
    coord: 349..379    score: 7.848
    coord: 517..551    score: 6.138
    coord: 483..513    score: 6

IPR011990    Tetratricopeptide-like helical domain    GENE3D      G3DSA:1.25.40.10
    coord: 199..275    score: 1.2E-7
    coord: 443..535    score: 1.2E-7
    coord: 330..410    score: 1.

None         No IPR available                         PANTHER     PTHR24015           FAMILY NOT NAMED
    coord: 38..558     score: 1.5E
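
The PS51375 (PPR profile) matches above are listed out of positional order. As a rough illustration, the sketch below sorts them along the protein and reports the length of each match. The coordinates are copied from the PS51375 rows, the names `ppr_profile_hits` and `repeat_lengths` are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
# (start, end) residue coordinates of the PS51375 matches, copied from the
# InterPro annotation above, in the order they are listed there.
ppr_profile_hits = [
    (279, 313), (451, 481), (117, 151), (380, 414), (152, 184),
    (252, 278), (82, 116), (415, 445), (186, 216), (217, 251),
    (314, 348), (349, 379), (517, 551), (483, 513),
]


def repeat_lengths(hits):
    """Return (start, end, length) tuples ordered by start position.

    Length is end - start + 1, assuming inclusive coordinates.
    """
    return [(start, end, end - start + 1) for start, end in sorted(hits)]


repeats = repeat_lengths(ppr_profile_hits)
for start, end, length in repeats:
    print(f"{start:>3}..{end:<3}  {length} aa")
```

Sorted this way, the matches run from residue 82 to residue 551 without overlapping one another.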

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                   Block
CmaCh14G020230    CmaCh18G012430    Cucurbita maxima (Rimu)    cmacmaB253