CmoCh20G009190 (gene) Cucurbita moschata (Rifu)

Name: CmoCh20G009190
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing family protein
Location: Cmo_Chr20 : 4727571 .. 4729906 (+)
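The location field can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location string such as 'Cmo_Chr20 : 4727571 .. 4729906 (+)'
    into (sequence name, start, end, strand)."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Cmo_Chr20 : 4727571 .. 4729906 (+)")
length = span_length(start, end)
print(chrom, strand, length)
```

Under that inclusive-coordinate assumption the feature spans 2336 positions on the plus strand.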
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTAGTCTCCGGCTGATACAATTTAGACAATTTCGCAAATGCTTTGCCTTCTCTTCCACATTTAGCTCACTTCCAGATCCCCACGACCTTGGTAGTTGTCTTCACTTTTTTTTTTTTCAAAACAAAATTTATCATTCCAATCTCTTCTTCAATTTCATTCCCTCATTATCACCACTGGCAACTCGAACAATGCCTTCTTTGCCACAAAGCTCATGGCCTTTTATGCCTGTCATGGGCAACCTGCGTTCTCCACACAATTGTTTCGATTTGTTCATCCTAAGGACAAATTTCTTTGGAATTCCATTATCCAATCCCACTTCTCCAATGGTGATTACCTACAGGCATTTGATTTCTACCTCGAGATGCGAGCATCGAGTAGCCTGCCAAACCAATTTACAATTCCCATGGTGGTTTCCACTTGTGCGGAACTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTTGAAACTTGGGCTCTTTGTTGGTAATTCGGCTGTTGGTTCTTCTTTGATATACATGTATTCCAAATGTGGTAACGCAGAAAGTGCATCTCTCATGTTCAATGAAATTACTGTTAAGGATGTAGTTGCTTGGACTGCCCTTATAATTGGTTATGTCCAGAATAACGAGAGTGAGAAAGGTTTGAAATGTTTGTTTGAGATGCATAGGAATGGATGTACCCCAAATTATAGAACAATAGGAGGTGGGTTTCAAGCTTGTGTTGATTTGGATGCTTTAGTAGAGGGTAGATGCTTACATGGTTTGGCTTTAAAAAGTGGATTTCTCTGTTTTGAAGTCGTTAAATCTTCTATTCTCTCGATGTACTCGAGGTGTGGGTCACCTGAAGAAGCTTATCGTTGTTTTTTTAAATTGGAGCAAAAAGATCTCATCTCTTGGACATCAATTATTGCAGTTCACTCTAAACTCGGGTTAATGAGTGAATGTCTACATTTATTTTGGGAGATGCAGGCCAGTGGAATAATTCCAGATGACATCGTGATCAGTTGCATGCTTCTGGGTTTTGGTAATTTTGATAGAATCTCTGAAGGAAATGCCTTCCATGCTTGGATTCTGAAACAATGTTATGCAGTGAGTGGAATAACTCACAATGCATTACTCTCCATGTATTGTAAGTTCGGACTCTTACGTACGGCAGATAAGATCTTCCATAGTTTCCATAAAAGCAGTGAAGATTGGAACACAATGATATTAGGATACAGCAATATGGGGGAGAAAGAAAAGTGTATAGACTTTTTCAGGGAGATGCACCTCTTAGGCATAGAACCTGATTTGAATAGTTTAGTGTCAGTCATTTCTTCATGTTTACCAGTCCGAGCTGTGAATATTGGTCGGTCTGTCCACTGCTATGCAATTAAAAACTCGATCATTGACAATGTATCAATAGCCAACTCACTCTTGGACATGTATGGAAAAAGTGGTAATTTAACCGCCGCATGGAGGATATTTCATAGGACACAACAAAAGGATATTGTCTCATGGAATACGCTGATTTCGTCCTACAAGCAAAGTGGGCACCCTTCTGAAGCAATTGATTTATTCGATAAAATGATTAAAGAAAAGTTCAACCCCAACGTAGTTACGTGTGTAATAGCTCTTTCGGCATGTGCTCATCTTGCATCCTTAGATAAAGGTTTAAAAATTCACCAGTACATTAAGGAAAATGGATGTGAGACTGATATCACTGTTAGAACTGCATTGATTGATATGTATGCAAAATGTGGGGAGCTCGAGTCATCAAGAACATTGTTCAACTCAATGGAAGAGAGGGATGTTATTTTGTGTAATGTCATGATATCAAATTATGGGATGCATGGACATGTGGAATCTGCTATTGAAATCTTCCAACTAATGGAAGACTCAAACATTAAACCAAATGCACTTACCTTTCTTTCTCTTCTCTCAGCTTGTAATCATGCAGGGCATATTGTAGAAGGAAGGCGTCTCTTTGATGTAATGCATAAATATGGT
ATCAAACCTAGTCTTAAGCACTATGCTTCTATGGTAGATCTTCTTGGCAGGTCAGGTAGCCTTGAAGAAGCGGAGGCTCTTGTTTTATCAATGCCCATCACGCCTGATGGCACTGTGTGGGGCTCCTTGTTAAGTGCTTGTAAACTTCACAATGAATTTGAAATGGGTATAAGGATTGCCAGACATGCTATTGAGTCTGATCCAAAAAATGATGGGTATTATATAGTATTGTCTGATCTGTATGGTTGCTTGGGAAGGTGGGAGGAAGTGGAAAAAGTTCGTGGTATGATGAAGGAAAGAGGGGTGGAGAAGAGAGCTGGCTGGAGTGCCCTATGA

mRNA sequence

ATGCTTAGTCTCCGGCTGATACAATTTAGACAATTTCGCAAATGCTTTGCCTTCTCTTCCACATTTAGCTCACTTCCAGATCCCCACGACCTTGGTAGTTGTCTTCACTTTTTTTTTTTTCAAAACAAAATTTATCATTCCAATCTCTTCTTCAATTTCATTCCCTCATTATCACCACTGGCAACTCGAACAATGCCTTCTTTGCCACAAAGCTCATGGCCTTTTATGCCTGTCATGGGCAACCTGCGTTCTCCACACAATTGTTTCGATTTGTTCATCCTAAGGACAAATTTCTTTGGAATTCCATTATCCAATCCCACTTCTCCAATGGCATTTGATTTCTACCTCGAGATGCGAGCATCGAGTAGCCTGCCAAACCAATTTACAATTCCCATGGTGGTTTCCACTTGTGCGGAACTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTTGAAACTTGGGCTCTTTGTTGGTAATTCGGCTGTTGGTTCTTCTTTGATATACATGTATTCCAAATGTGGTAACGCAGAAAGTGCATCTCTCATGTTCAATGAAATTACTGTTAAGGATGTAGTTGCTTGGACTGCCCTTATAATTGGTTATGTCCAGAATAACGAGAGTGAGAAAGGTTTGAAATGTTTGTTTGAGATGCATAGGAATGGATGTACCCCAAATTATAGAACAATAGGAGGTGGGTTTCAAGCTTGTGTTGATTTGGATGCTTTAGTAGAGGGTAGATGCTTACATGGTTTGGCTTTAAAAAGTGGATTTCTCTGTTTTGAAGTCGTTAAATCTTCTATTCTCTCGATGTACTCGAGGTGTGGGTCACCTGAAGAAGCTTATCGTTGTTTTTTTAAATTGGAGCAAAAAGATCTCATCTCTTGGACATCAATTATTGCAGTTCACTCTAAACTCGGGTTAATGAGTGAATGTCTACATTTATTTTGGGAGATGCAGGCCAGTGGAATAATTCCAGATGACATCGTGATCAGTTGCATGCTTCTGGGTTTTGGTAATTTTGATAGAATCTCTGAAGGAAATGCCTTCCATGCTTGGATTCTGAAACAATGTTATGCAGTGAGTGGAATAACTCACAATGCATTACTCTCCATGTATTGTAAGTTCGGACTCTTACGTACGGCAGATAAGATCTTCCATAGTTTCCATAAAAGCAGTGAAGATTGGAACACAATGATATTAGGATACAGCAATATGGGGGAGAAAGAAAAGTGTATAGACTTTTTCAGGGAGATGCACCTCTTAGGCATAGAACCTGATTTGAATAGTTTAGTGTCAGTCATTTCTTCATGTTTACCAGTCCGAGCTGTGAATATTGGTCGGTCTGTCCACTGCTATGCAATTAAAAACTCGATCATTGACAATGTATCAATAGCCAACTCACTCTTGGACATGTATGGAAAAAGTGGTAATTTAACCGCCGCATGGAGGATATTTCATAGGACACAACAAAAGGATATTGTCTCATGGAATACGCTGATTTCGTCCTACAAGCAAAGTGGGCACCCTTCTGAAGCAATTGATTTATTCGATAAAATGATTAAAGAAAAGTTCAACCCCAACGTAGTTACGTGTGTAATAGCTCTTTCGGCATGTGCTCATCTTGCATCCTTAGATAAAGGTTTAAAAATTCACCAGTACATTAAGGAAAATGGATGTGAGACTGATATCACTGTTAGAACTGCATTGATTGATATGTATGCAAAATGTGGGGAGCTCGAGTCATCAAGAACATTGTTCAACTCAATGGAAGAGAGGGATGTTATTTTGTGTAATGTCATGATATCAAATTATGGGATGCATGGACATGTGGAATCTGCTATTGAAATCTTCCAACTAATGGAAGACTCAAACATTAAACCAAATGCACTTACCTTTCTTTCTCTTCTCTCAGCTTGTAATCATGCAGGGCATATTGTAGAAGGAAGGCGTCTCTTTGATGTAATGCATAAATATGGTATCAAACCTAGTCT
TAAGCACTATGCTTCTATGGTAGATCTTCTTGGCAGGTCAGGTAGCCTTGAAGAAGCGGAGGCTCTTGTTTTATCAATGCCCATCACGCCTGATGGCACTGTGTGGGGCTCCTTGTTAAGTGCTTGTAAACTTCACAATGAATTTGAAATGGGTATAAGGATTGCCAGACATGCTATTGAGTCTGATCCAAAAAATGATGGGTATTATATAGTATTGTCTGATCTGTATGGTTGCTTGGGAAGGTGGGAGGAAGTGGAAAAAGTTCGTGGTATGATGAAGGAAAGAGGGGTGGAGAAGAGAGCTGGCTGGAGTGCCCTATGA

Coding sequence (CDS)

ATGCTTAGTCTCCGGCTGATACAATTTAGACAATTTCGCAAATGCTTTGCCTTCTCTTCCACATTTAGCTCACTTCCAGATCCCCACGACCTTGGTAGTTGTCTTCACTTTTTTTTTTTTCAAAACAAAATTTATCATTCCAATCTCTTCTTCAATTTCATTCCCTCATTATCACCACTGGCAACTCGAACAATGCCTTCTTTGCCACAAAGCTCATGGCCTTTTATGCCTGTCATGGGCAACCTGCGTTCTCCACACAATTGTTTCGATTTGTTCATCCTAAGGACAAATTTCTTTGGAATTCCATTATCCAATCCCACTTCTCCAATGGCATTTGATTTCTACCTCGAGATGCGAGCATCGAGTAGCCTGCCAAACCAATTTACAATTCCCATGGTGGTTTCCACTTGTGCGGAACTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTTGAAACTTGGGCTCTTTGTTGGTAATTCGGCTGTTGGTTCTTCTTTGATATACATGTATTCCAAATGTGGTAACGCAGAAAGTGCATCTCTCATGTTCAATGAAATTACTGTTAAGGATGTAGTTGCTTGGACTGCCCTTATAATTGGTTATGTCCAGAATAACGAGAGTGAGAAAGGTTTGAAATGTTTGTTTGAGATGCATAGGAATGGATGTACCCCAAATTATAGAACAATAGGAGGTGGGTTTCAAGCTTGTGTTGATTTGGATGCTTTAGTAGAGGGTAGATGCTTACATGGTTTGGCTTTAAAAAGTGGATTTCTCTGTTTTGAAGTCGTTAAATCTTCTATTCTCTCGATGTACTCGAGGTGTGGGTCACCTGAAGAAGCTTATCGTTGTTTTTTTAAATTGGAGCAAAAAGATCTCATCTCTTGGACATCAATTATTGCAGTTCACTCTAAACTCGGGTTAATGAGTGAATGTCTACATTTATTTTGGGAGATGCAGGCCAGTGGAATAATTCCAGATGACATCGTGATCAGTTGCATGCTTCTGGGTTTTGGTAATTTTGATAGAATCTCTGAAGGAAATGCCTTCCATGCTTGGATTCTGAAACAATGTTATGCAGTGAGTGGAATAACTCACAATGCATTACTCTCCATGTATTGTAAGTTCGGACTCTTACGTACGGCAGATAAGATCTTCCATAGTTTCCATAAAAGCAGTGAAGATTGGAACACAATGATATTAGGATACAGCAATATGGGGGAGAAAGAAAAGTGTATAGACTTTTTCAGGGAGATGCACCTCTTAGGCATAGAACCTGATTTGAATAGTTTAGTGTCAGTCATTTCTTCATGTTTACCAGTCCGAGCTGTGAATATTGGTCGGTCTGTCCACTGCTATGCAATTAAAAACTCGATCATTGACAATGTATCAATAGCCAACTCACTCTTGGACATGTATGGAAAAAGTGGTAATTTAACCGCCGCATGGAGGATATTTCATAGGACACAACAAAAGGATATTGTCTCATGGAATACGCTGATTTCGTCCTACAAGCAAAGTGGGCACCCTTCTGAAGCAATTGATTTATTCGATAAAATGATTAAAGAAAAGTTCAACCCCAACGTAGTTACGTGTGTAATAGCTCTTTCGGCATGTGCTCATCTTGCATCCTTAGATAAAGGTTTAAAAATTCACCAGTACATTAAGGAAAATGGATGTGAGACTGATATCACTGTTAGAACTGCATTGATTGATATGTATGCAAAATGTGGGGAGCTCGAGTCATCAAGAACATTGTTCAACTCAATGGAAGAGAGGGATGTTATTTTGTGTAATGTCATGATATCAAATTATGGGATGCATGGACATGTGGAATCTGCTATTGAAATCTTCCAACTAATGGAAGACTCAAACATTAAACCAAATGCACTTACCTTTCTTTCTCTTCTCTCAGCTTGTAATCATGCAGGGCATATTGTAGAAGGAAGGCGTCTCTTTGATGTAATGCATAAATATGGTATCAAACCTAGTCT
TAAGCACTATGCTTCTATGGTAGATCTTCTTGGCAGGTCAGGTAGCCTTGAAGAAGCGGAGGCTCTTGTTTTATCAATGCCCATCACGCCTGATGGCACTGTGTGGGGCTCCTTGTTAAGTGCTTGTAAACTTCACAATGAATTTGAAATGGGTATAAGGATTGCCAGACATGCTATTGAGTCTGATCCAAAAAATGATGGGTATTATATAGTATTGTCTGATCTGTATGGTTGCTTGGGAAGGTGGGAGGAAGTGGAAAAAGTTCGTGGTATGATGAAGGAAAGAGGGGTGGAGAAGAGAGCTGGCTGGAGTGCCCTATGA

BLAST of CmoCh20G009190 vs. Swiss-Prot
Match: PP359_ARATH (Pentatricopeptide repeat-containing protein At4g39952, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E98 PE=2 SV=2)

HSP 1 Score: 695.7 bits (1794), Expect = 5.9e-199
Identity = 339/675 (50.22%), Positives = 461/675 (68.30%), Query Frame = 1

Query: 104 SNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNS 163
           SN     +  F+  M  S   P+ FT PMVVS CAEL+  + G  +HGL LK G F  N+
Sbjct: 102 SNGDYARSLCFFFSMLLSGQSPDHFTAPMVVSACAELLWFHVGTFVHGLVLKHGGFDRNT 161

Query: 164 AVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRN 223
           AVG+S +Y YSKCG  + A L+F+E+  +DVVAWTA+I G+VQN ESE GL  L +MH  
Sbjct: 162 AVGASFVYFYSKCGFLQDACLVFDEMPDRDVVAWTAIISGHVQNGESEGGLGYLCKMHSA 221

Query: 224 GCT---PNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSP 283
           G     PN RT+  GFQAC +L AL EGRCLHG A+K+G    + V+SS+ S YS+ G+P
Sbjct: 222 GSDVDKPNPRTLECGFQACSNLGALKEGRCLHGFAVKNGLASSKFVQSSMFSFYSKSGNP 281

Query: 284 EEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGF 343
            EAY  F +L  +D+ SWTSIIA  ++ G M E   +FWEMQ  G+ PD +VISC++   
Sbjct: 282 SEAYLSFRELGDEDMFSWTSIIASLARSGDMEESFDMFWEMQNKGMHPDGVVISCLINEL 341

Query: 344 GNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHK--SSEDW 403
           G    + +G AFH ++++ C+++     N+LLSMYCKF LL  A+K+F    +  + E W
Sbjct: 342 GKMMLVPQGKAFHGFVIRHCFSLDSTVCNSLLSMYCKFELLSVAEKLFCRISEEGNKEAW 401

Query: 404 NTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIK 463
           NTM+ GY  M    KCI+ FR++  LGIE D  S  SVISSC  + AV +G+S+HCY +K
Sbjct: 402 NTMLKGYGKMKCHVKCIELFRKIQNLGIEIDSASATSVISSCSHIGAVLLGKSLHCYVVK 461

Query: 464 NSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDL 523
            S+   +S+ NSL+D+YGK G+LT AWR+F      ++++WN +I+SY       +AI L
Sbjct: 462 TSLDLTISVVNSLIDLYGKMGDLTVAWRMFCEADT-NVITWNAMIASYVHCEQSEKAIAL 521

Query: 524 FDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAK 583
           FD+M+ E F P+ +T V  L AC +  SL++G  IH+YI E   E ++++  ALIDMYAK
Sbjct: 522 FDRMVSENFKPSSITLVTLLMACVNTGSLERGQMIHRYITETEHEMNLSLSAALIDMYAK 581

Query: 584 CGELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSL 643
           CG LE SR LF++  ++D +  NVMIS YGMHG VESAI +F  ME+S++KP   TFL+L
Sbjct: 582 CGHLEKSRELFDAGNQKDAVCWNVMISGYGMHGDVESAIALFDQMEESDVKPTGPTFLAL 641

Query: 644 LSACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPD 703
           LSAC HAG + +G++LF  MH+Y +KP+LKHY+ +VDLL RSG+LEEAE+ V+SMP +PD
Sbjct: 642 LSACTHAGLVEQGKKLFLKMHQYDVKPNLKHYSCLVDLLSRSGNLEEAESTVMSMPFSPD 701

Query: 704 GTVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGM 763
           G +WG+LLS+C  H EFEMGIR+A  A+ SDP+NDGYYI+L+++Y   G+WEE E+ R M
Sbjct: 702 GVIWGTLLSSCMTHGEFEMGIRMAERAVASDPQNDGYYIMLANMYSAAGKWEEAERAREM 761

Query: 764 MKERGVEKRAGWSAL 774
           M+E GV KRAG S +
Sbjct: 762 MRESGVGKRAGHSVV 775
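The percentages in an HSP summary line are simply the stated counts divided by the alignment length, so they can be rechecked mechanically. The sketch below is illustrative only: `check_hsp_stats` and the tolerance value are assumptions, not part of any BLAST tool, and the pattern accepts the positives label with or without its first "i" because some exports misspell it.

```python
import re

# Matches e.g. "Identity = 339/675 (50.22%), Positives = 461/675 (68.30%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def check_hsp_stats(line, tol=0.01):
    """Recompute identity/positives percentages from the raw counts and
    report whether they agree with the printed values within `tol`."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no identity/positives statistics found")
    id_n, id_d, id_pct, pos_n, pos_d, pos_pct = m.groups()
    identity = 100 * int(id_n) / int(id_d)
    positives = 100 * int(pos_n) / int(pos_d)
    return {
        "identity_pct": round(identity, 2),
        "positives_pct": round(positives, 2),
        "consistent": abs(identity - float(id_pct)) <= tol
        and abs(positives - float(pos_pct)) <= tol,
    }

result = check_hsp_stats(
    "Identity = 339/675 (50.22%), Positives = 461/675 (68.30%), Query Frame = 1"
)
print(result)
```

For the first Swiss-Prot hit, 339/675 and 461/675 reproduce the printed 50.22% and 68.30% to two decimal places.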

BLAST of CmoCh20G009190 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 3.8e-113
Identity = 219/646 (33.90%), Positives = 374/646 (57.89%), Query Frame = 1

Query: 129 TIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNE 188
           T+  V+  CA+   L  G  +       G FV +S +GS L  MY+ CG+ + AS +F+E
Sbjct: 96  TLCSVLQLCADSKSLKDGKEVDNFIRGNG-FVIDSNLGSKLSLMYTNCGDLKEASRVFDE 155

Query: 189 ITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVEG 248
           + ++  + W  L+    ++ +    +    +M  +G   +  T     ++   L ++  G
Sbjct: 156 VKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG 215

Query: 249 RCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKL 308
             LHG  LKSGF     V +S+++ Y +    + A + F ++ ++D+ISW SII  +   
Sbjct: 216 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSN 275

Query: 309 GLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITH 368
           GL  + L +F +M  SGI  D   I  +  G  +   IS G A H+  +K C++      
Sbjct: 276 GLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFC 335

Query: 369 NALLSMYCKFGLLRTADKIFHSFH-KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE 428
           N LL MY K G L +A  +F     +S   + +MI GY+  G   + +  F EM   GI 
Sbjct: 336 NTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 395

Query: 429 PDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRI 488
           PD+ ++ +V++ C   R ++ G+ VH +  +N +  ++ ++N+L+DMY K G++  A  +
Sbjct: 396 PDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 455

Query: 489 FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEK-FNPNVVTCVIALSACAHLAS 548
           F   + KDI+SWNT+I  Y ++ + +EA+ LF+ +++EK F+P+  T    L ACA L++
Sbjct: 456 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 515

Query: 549 LDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISN 608
            DKG +IH YI  NG  +D  V  +L+DMYAKCG L  +  LF+ +  +D++   VMI+ 
Sbjct: 516 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 575

Query: 609 YGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIVEGRRLFDVM-HKYGIKP 668
           YGMHG  + AI +F  M  + I+ + ++F+SLL AC+H+G + EG R F++M H+  I+P
Sbjct: 576 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 635

Query: 669 SLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHA 728
           +++HYA +VD+L R+G L +A   + +MPI PD T+WG+LL  C++H++ ++  ++A   
Sbjct: 636 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 695

Query: 729 IESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAGWS 772
            E +P+N GYY++++++Y    +WE+V+++R  + +RG+ K  G S
Sbjct: 696 FELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCS 740
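The Match lines in these reports follow the UniProt FASTA header convention, in which OS is the organism name, GN the gene name, PE the protein-existence level and SV the sequence version. A minimal sketch of pulling those fields apart is shown below; `parse_match_line` is a hypothetical helper written for this page layout, not an API of any BLAST or UniProt library.

```python
import re

def parse_match_line(line):
    """Parse 'Match: ID (description OS=... GN=... PE=... SV=...)' into a dict.

    Keys present in the result: 'id', 'description', plus whichever of
    OS/GN/PE/SV appear in the line (values are kept as strings).
    """
    m = re.match(r"Match:\s*(\S+)\s*\((.*)\)\s*$", line)
    if m is None:
        raise ValueError(f"not a Match line: {line!r}")
    entry_id, inner = m.group(1), m.group(2)
    # Everything before the first ' OS=' is the free-text protein description.
    hit = {"id": entry_id, "description": inner.split(" OS=")[0].strip()}
    # Each KEY=value field runs until the next KEY= or the end of the text.
    for key, value in re.findall(
        r"\b(OS|GN|PE|SV)=(.*?)(?=\s(?:OS|GN|PE|SV)=|$)", inner
    ):
        hit[key] = value.strip()
    return hit

hit = parse_match_line(
    "Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, "
    "chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)"
)
print(hit)
```

Keeping the values as strings avoids guessing at types; a caller that needs the PE or SV numbers can convert them explicitly.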

BLAST of CmoCh20G009190 vs. Swiss-Prot
Match: PP111_ARATH (Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E66 PE=3 SV=1)

HSP 1 Score: 409.5 bits (1051), Expect = 8.4e-113
Identity = 218/666 (32.73%), Positives = 379/666 (56.91%), Query Frame = 1

Query: 111 AFDFYLEMRASSSLPNQFTIPMVVSTCA-ELMMLNHGMNIHGLALKLGLFVGNSAVGSSL 170
           A D Y  + + ++  ++F  P V+  CA     L+ G  +HG  +K G+   ++ + +SL
Sbjct: 84  AIDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGGKVHGRIIKGGVD-DDAVIETSL 143

Query: 171 IYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNY 230
           + MY + GN   A  +F+ + V+D+VAW+ L+   ++N E  K L+    M  +G  P+ 
Sbjct: 144 LCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDA 203

Query: 231 RTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFK 290
            T+    + C +L  L   R +HG   +  F   E + +S+L+MYS+CG    + R F K
Sbjct: 204 VTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFEK 263

Query: 291 LEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEG 350
           + +K+ +SWT++I+ +++     + L  F EM  SGI P+ + +  +L   G    I EG
Sbjct: 264 IAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREG 323

Query: 351 NAFHAWILKQCYAVSGITHN-ALLSMYCKFGLLRTADKIFHSFH-KSSEDWNTMILGYSN 410
            + H + +++    +  + + AL+ +Y + G L   + +      ++   WN++I  Y++
Sbjct: 324 KSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCETVLRVVSDRNIVAWNSLISLYAH 383

Query: 411 MGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSI 470
            G   + +  FR+M    I+PD  +L S IS+C     V +G+ +H + I+  + D   +
Sbjct: 384 RGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLVPLGKQIHGHVIRTDVSDEF-V 443

Query: 471 ANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKF 530
            NSL+DMY KSG++ +A  +F++ + + +V+WN+++  + Q+G+  EAI LFD M     
Sbjct: 444 QNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYL 503

Query: 531 NPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELESSRT 590
             N VT +  + AC+ + SL+KG  +H  +  +G + D+   TALIDMYAKCG+L ++ T
Sbjct: 504 EMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGLK-DLFTDTALIDMYAKCGDLNAAET 563

Query: 591 LFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGH 650
           +F +M  R ++  + MI+ YGMHG + SAI  F  M +S  KPN + F+++LSAC H+G 
Sbjct: 564 VFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGS 623

Query: 651 IVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLS 710
           + EG+  F++M  +G+ P+ +H+A  +DLL RSG L+EA   +  MP   D +VWGSL++
Sbjct: 624 VEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSLVN 683

Query: 711 ACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKR 770
            C++H + ++   I     +    + GYY +LS++Y   G WEE  ++R  MK   ++K 
Sbjct: 684 GCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKV 743

Query: 771 AGWSAL 774
            G+SA+
Sbjct: 744 PGYSAI 746

BLAST of CmoCh20G009190 vs. Swiss-Prot
Match: PP210_ARATH (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 2.3e-110
Identity = 217/669 (32.44%), Positives = 376/669 (56.20%), Query Frame = 1

Query: 105 NPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSA 164
           N   P A +FY ++R S   P+++T P V+  CA L     G  ++   L +G F  +  
Sbjct: 84  NGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMG-FESDLF 143

Query: 165 VGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNG 224
           VG++L+ MYS+ G    A  +F+E+ V+D+V+W +LI GY  +   E+ L+   E+  + 
Sbjct: 144 VGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSW 203

Query: 225 CTPNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAY 284
             P+  T+     A  +L  + +G+ LHG ALKSG     VV + +++MY +   P +A 
Sbjct: 204 IVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYLKFRRPTDAR 263

Query: 285 RCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFD 344
           R F +++ +D +S+ ++I  + KL ++ E + +F E       PD + +S +L   G+  
Sbjct: 264 RVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVSSVLRACGHLR 323

Query: 345 RISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFH-KSSEDWNTMIL 404
            +S     + ++LK  + +     N L+ +Y K G + TA  +F+S   K +  WN++I 
Sbjct: 324 DLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECKDTVSWNSIIS 383

Query: 405 GYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIID 464
           GY   G+  + +  F+ M ++  + D  + + +IS    +  +  G+ +H   IK+ I  
Sbjct: 384 GYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHSNGIKSGICI 443

Query: 465 NVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMI 524
           ++S++N+L+DMY K G +  + +IF      D V+WNT+IS+  + G  +  + +  +M 
Sbjct: 444 DLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFATGLQVTTQMR 503

Query: 525 KEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELE 584
           K +  P++ T ++ L  CA LA+   G +IH  +   G E+++ +  ALI+MY+KCG LE
Sbjct: 504 KSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIEMYSKCGCLE 563

Query: 585 SSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACN 644
           +S  +F  M  RDV+    MI  YGM+G  E A+E F  ME S I P+++ F++++ AC+
Sbjct: 564 NSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIYACS 623

Query: 645 HAGHIVEGRRLFDVMH-KYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVW 704
           H+G + EG   F+ M   Y I P ++HYA +VDLL RS  + +AE  + +MPI PD ++W
Sbjct: 624 HSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAMPIKPDASIW 683

Query: 705 GSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKER 764
            S+L AC+   + E   R++R  IE +P + GY I+ S+ Y  L +W++V  +R  +K++
Sbjct: 684 ASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVSLIRKSLKDK 743

Query: 765 GVEKRAGWS 772
            + K  G+S
Sbjct: 744 HITKNPGYS 750

BLAST of CmoCh20G009190 vs. Swiss-Prot
Match: PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 400.6 bits (1028), Expect = 3.9e-110
Identity = 225/701 (32.10%), Positives = 383/701 (54.64%), Query Frame = 1

Query: 80  GNLRSPHNCFDLFILRTNF-----FGIPLSNPTSPMAFDFYLEMRASSSLPNQFTIPMVV 139
           G+L      FD    RT F      G  +SN     A   Y  MR         + P ++
Sbjct: 130 GSLDDAEKVFDEMPDRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALL 189

Query: 140 STCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVK-D 199
             CA+L  +  G  +H L +KLG +     + ++L+ MY+K  +  +A  +F+    K D
Sbjct: 190 KACAKLRDIRSGSELHSLLVKLG-YHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGD 249

Query: 200 VVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVEGRCLHG 259
            V W +++  Y  + +S + L+   EMH  G  PN  TI     AC        G+ +H 
Sbjct: 250 AVLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHA 309

Query: 260 LALKSGFLCFEV-VKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMS 319
             LKS     E+ V +++++MY+RCG   +A R   ++   D+++W S+I  + +  +  
Sbjct: 310 SVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYK 369

Query: 320 ECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALL 379
           E L  F +M A+G   D++ ++ ++   G    +  G   HA+++K  +  +    N L+
Sbjct: 370 EALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLI 429

Query: 380 SMYCKFGLLRTADKIFHSFH-KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLN 439
            MY K  L     + F   H K    W T+I GY+      + ++ FR++    +E D  
Sbjct: 430 DMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEM 489

Query: 440 SLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRT 499
            L S++ +   ++++ I + +HC+ ++  ++D V I N L+D+YGK  N+  A R+F   
Sbjct: 490 ILGSILRASSVLKSMLIVKEIHCHILRKGLLDTV-IQNELVDVYGKCRNMGYATRVFESI 549

Query: 500 QQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGL 559
           + KD+VSW ++ISS   +G+ SEA++LF +M++   + + V  +  LSA A L++L+KG 
Sbjct: 550 KGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGR 609

Query: 560 KIHQYIKENGCETDITVRTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISNYGMHG 619
           +IH Y+   G   + ++  A++DMYA CG+L+S++ +F+ +E + ++    MI+ YGMHG
Sbjct: 610 EIHCYLLRKGFCLEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHG 669

Query: 620 HVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIVEGRRLFDVM-HKYGIKPSLKHY 679
             ++A+E+F  M   N+ P+ ++FL+LL AC+HAG + EGR    +M H+Y ++P  +HY
Sbjct: 670 CGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHY 729

Query: 680 ASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHAIESDP 739
             +VD+LGR+  + EA   V  M   P   VW +LL+AC+ H+E E+G   A+  +E +P
Sbjct: 730 VCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEP 789

Query: 740 KNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAGWS 772
           KN G  +++S+++   GRW +VEKVR  MK  G+EK  G S
Sbjct: 790 KNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCS 828

BLAST of CmoCh20G009190 vs. TrEMBL
Match: A0A0A0LRH3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G439160 PE=4 SV=1)

HSP 1 Score: 1188.3 bits (3073), Expect = 0.0e+00
Identity = 600/784 (76.53%), Positives = 667/784 (85.08%), Query Frame = 1

Query: 1   MLSLRLIQFRQFRKCFAFSSTFSSLPDPHDLGSCLHFFF------FQNKI-YHSNLFFNF 60
           ML LRL    QF   FAFSSTF+SL D H   +CLH FF      FQ+ + +HS +    
Sbjct: 11  MLRLRL---SQFHIRFAFSSTFTSLSDSHYPNNCLHSFFSKPNLTFQSLLQFHSLIITTG 70

Query: 61  IPSLSPLATRTMPSLPQSSWPFMPVMGNLRSPHNCFDLF----ILRTNFFGIPLSNPTSP 120
             +    AT+ M        P      +L    +  D+F    I++++F     SN    
Sbjct: 71  NSNNVFFATKLMAFYAYHRKPAFST--HLFRLIHSKDIFLWNSIIQSHF-----SNGDYQ 130

Query: 121 MAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSL 180
            AFDFYL+MRASSSLPNQFT+PMVVSTCAELMM NHGMNIHGL  KLGLFVGNSA+GSS 
Sbjct: 131 RAFDFYLQMRASSSLPNQFTVPMVVSTCAELMMFNHGMNIHGLTSKLGLFVGNSAIGSSF 190

Query: 181 IYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNY 240
           IYMYSKCG+ ESAS+MF+EITVKDVV WTALI+GYVQNNES +GLKCLFEMHR G TPNY
Sbjct: 191 IYMYSKCGHVESASIMFSEITVKDVVTWTALIVGYVQNNESGRGLKCLFEMHRIGGTPNY 250

Query: 241 RTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFK 300
           +TIG GFQACVDLDALVEG+CLHGLALK+GFLCFEVVKS+ILSMYSRCGSPEEAYRCF K
Sbjct: 251 KTIGSGFQACVDLDALVEGKCLHGLALKNGFLCFEVVKSTILSMYSRCGSPEEAYRCFCK 310

Query: 301 LEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEG 360
           L+QKDLISWTSIIAVHSK GLMSECLHLFWEMQAS IIPD+IVISCML+GFGN DRI EG
Sbjct: 311 LDQKDLISWTSIIAVHSKFGLMSECLHLFWEMQASEIIPDEIVISCMLMGFGNSDRIFEG 370

Query: 361 NAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHKSSEDWNTMILGYSNMG 420
            AFHA ILKQC A+SGITHNALLSMYCKFG L TA+KIFHSFHKSSEDW+TMILGYSNMG
Sbjct: 371 KAFHARILKQCCALSGITHNALLSMYCKFGHLGTANKIFHSFHKSSEDWSTMILGYSNMG 430

Query: 421 EKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIAN 480
           +KEKCI F REM LLG EPDLNSLVSVISSC  V A+NIGRS+HCYAIKNSII+NVS+AN
Sbjct: 431 QKEKCISFLREMLLLGREPDLNSLVSVISSCSQVGAINIGRSIHCYAIKNSIIENVSVAN 490

Query: 481 SLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNP 540
           SL+DMYGKSG++TA WRIFHRT Q+D++SWNTLISSYKQSG  +EAI LFDKM+KEK  P
Sbjct: 491 SLMDMYGKSGHVTATWRIFHRTLQRDVISWNTLISSYKQSGILAEAIILFDKMVKEKVYP 550

Query: 541 NVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELESSRTLF 600
           N VTC+I LSACAHLASLD+G KIHQYIKENG E++IT+RTALIDMYAKCGELE+SR LF
Sbjct: 551 NKVTCIIVLSACAHLASLDEGEKIHQYIKENGFESNITIRTALIDMYAKCGELETSRKLF 610

Query: 601 NSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIV 660
           NS EERDVIL NVMISNYGMHGHVESA+EIFQLME+SNIKPNA TFLSLLSACNH GH++
Sbjct: 611 NSTEERDVILWNVMISNYGMHGHVESAMEIFQLMEESNIKPNAQTFLSLLSACNHTGHVL 670

Query: 661 EGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSAC 720
           EGR LFD M KYGI+PSLKHYAS++DLLGRSGSLE AEALVLSMPITPDGTVWGSLLSAC
Sbjct: 671 EGRHLFDRMQKYGIEPSLKHYASIIDLLGRSGSLEAAEALVLSMPITPDGTVWGSLLSAC 730

Query: 721 KLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAG 774
           K+HNEFE+G+R+AR+AIESDPKNDGYYI+LSDLY CLGRW+EVEKVR MMK+RGVEKRAG
Sbjct: 731 KIHNEFEVGVRLARYAIESDPKNDGYYIILSDLYSCLGRWDEVEKVRDMMKKRGVEKRAG 784

BLAST of CmoCh20G009190 vs. TrEMBL
Match: W9QNE1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_022141 PE=4 SV=1)

HSP 1 Score: 833.2 bits (2151), Expect = 2.6e-238
Identity = 408/672 (60.71%), Positives = 508/672 (75.60%), Query Frame = 1

Query: 104 SNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNS 163
           SN     A   +L MRAS  +PNQFT+PMVV +CA+LM+L+ G + HGL LKLGL  G++
Sbjct: 106 SNGDFQEALYLFLRMRASGFVPNQFTLPMVVGSCADLMLLDCGKSFHGLVLKLGLLSGDN 165

Query: 164 AVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRN 223
             GSS +YMY KCG    A  +F+EITV+DVV+WTAL+IGYVQN ESEKGL+CL EMHR+
Sbjct: 166 VAGSSFVYMYCKCGQMGDAYKVFDEITVRDVVSWTALVIGYVQNGESEKGLECLCEMHRS 225

Query: 224 GCT---PNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSP 283
           G     PN+RT+ GGFQAC ++ AL EGRCLHGL +K+G    E VKSSILSMYS+CG+P
Sbjct: 226 GGESERPNFRTLEGGFQACGNMGALAEGRCLHGLVVKTGLGSSEAVKSSILSMYSKCGTP 285

Query: 284 EEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGF 343
            EA   F ++  KDL+SW S+I V+++ GLM+ECL+LF EMQ  G+ PD+IVISCML GF
Sbjct: 286 VEARFSFCEVTNKDLLSWMSVIGVYTRFGLMNECLNLFQEMQIGGLFPDEIVISCMLWGF 345

Query: 344 GNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHK-SSEDWN 403
           GN   +  G AFHA I+++ Y +  + HN+LL MY KFGLL  A+K+F    + + E  +
Sbjct: 346 GNSMFVKPGKAFHALIIRRDYLLGEMVHNSLLFMYSKFGLLNIAEKLFSKMRQWTKESCS 405

Query: 404 TMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKN 463
           TMI GYS +G   KCI+ FREMHLLG+E + +SLVSVISSC  + A  +GRS+HCY IKN
Sbjct: 406 TMISGYSKIGHSAKCIELFREMHLLGVEVNSDSLVSVISSCCQLGATRLGRSLHCYVIKN 465

Query: 464 SIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLF 523
            I +NVS+ANSL+DMYGK G LT AWR+F R Q KD+V+WNT+IS Y   G   EAI LF
Sbjct: 466 FIDNNVSVANSLIDMYGKRGELTLAWRMFCRAQ-KDVVTWNTIISCYIHCGQFEEAIALF 525

Query: 524 DKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKC 583
           DKMI E   PN  T  + LS C+HLASLDKG K+H +IKE G E ++++ TAL+DMYAKC
Sbjct: 526 DKMISENLYPNSATLAMVLSTCSHLASLDKGEKVHHHIKERGLEINLSLGTALVDMYAKC 585

Query: 584 GELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLL 643
           G+LE SR LFNSM E+DVI  NVMIS YGMHG  ESAI+IFQ ME+S++ PN LTFL+LL
Sbjct: 586 GQLEQSRGLFNSMTEKDVISWNVMISGYGMHGDAESAIQIFQDMENSDVIPNELTFLALL 645

Query: 644 SACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDG 703
            ACNH+G + EG+ LF  M  Y +KP+LKHYA MVDLLGRSG+L+EAEALVLSMP++PDG
Sbjct: 646 LACNHSGLVEEGQNLFHKMQDYSMKPNLKHYACMVDLLGRSGNLQEAEALVLSMPVSPDG 705

Query: 704 TVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMM 763
            VWGSLLSAC  HN+ +MG+R+AR AIESDP NDGYY++LS++Y   GRWE+ E VR +M
Sbjct: 706 GVWGSLLSACIKHNQNDMGVRVARRAIESDPGNDGYYVMLSNMYSSSGRWEQAENVRKVM 765

Query: 764 KERGVEKRAGWS 772
           KERGV+K AGWS
Sbjct: 766 KERGVDKEAGWS 776

BLAST of CmoCh20G009190 vs. TrEMBL
Match: B9H4S5_POPTR (Pentatricopeptide repeat-containing family protein (Fragment) OS=Populus trichocarpa GN=POPTR_0005s07530g PE=4 SV=2)

HSP 1 Score: 808.5 bits (2087), Expect = 6.9e-231
Identity = 414/761 (54.40%), Positives = 528/761 (69.38%), Query Frame = 1

Query: 40  FQNKIYHSNLFFN-FIPSLSPLATRTMPSLPQS---------------SWPFMPVMGNLR 99
           FQ   + S+ + N  I S     T+T+ SL +S               S   + +  + R
Sbjct: 20  FQISYHSSSNYLNCHIDSFLSNQTQTLQSLHKSHALIITTGNANNVFISSKLISLYASFR 79

Query: 100 SPHNCFDLF--------ILRTNFFGIPLSNPTSPMAFDFYLEMRASSSLPNQFTIPMVVS 159
            PH+   +F         L  +      SN     AFDFY++MR  ++ PNQFTIPM+V+
Sbjct: 80  KPHSSTYVFDSTNQKDTFLWNSIIKSHFSNGNYFKAFDFYIQMRYDNTPPNQFTIPMIVA 139

Query: 160 TCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVV 219
           TCAEL+ L  G  IHGL  K GLF  NSAVGSS +YMY+KCG  E ASLMF+EI V+DVV
Sbjct: 140 TCAELLWLEEGKYIHGLVSKSGLFAENSAVGSSFVYMYAKCGVMEDASLMFDEIVVRDVV 199

Query: 220 AWTALIIGYVQNNESEKGLKCLFEMHR---NGCTPNYRTIGGGFQACVDLDALVEGRCLH 279
           +WTAL+IGYV N++SEKGL+CL EM R   +G   N RT+ GGFQAC +L A++ GRCLH
Sbjct: 200 SWTALVIGYVHNDDSEKGLECLCEMRRIGGDGEKVNSRTLEGGFQACGNLGAMIAGRCLH 259

Query: 280 GLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMS 339
           GLA+K+G  C +VV+SS+LSMYS+CG+ EEA+  F ++  KD+ SWTS+I V ++ G M+
Sbjct: 260 GLAVKTGLGCSQVVQSSLLSMYSKCGNVEEAHNSFCQVVDKDVFSWTSVIGVCARFGFMN 319

Query: 340 ECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALL 399
           ECL+LFW+MQ   + PD IV+SC+LLGFGN   + EG AFH  I+++ Y +    +NALL
Sbjct: 320 ECLNLFWDMQVDDVYPDGIVVSCILLGFGNSMMVREGKAFHGLIVRRNYVLDDTVNNALL 379

Query: 400 SMYCKFGLLRTADKIFHSFHKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNS 459
           SMYCKFG L                              EK  D        G+      
Sbjct: 380 SMYCKFGTL---------------------------NPAEKLFD--------GVHEWSKD 439

Query: 460 LVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQ 519
           LVSVISSC  +  +N+ RSVHCY IKNS+ ++VSIANSL+DMYGK GNL+ AW++F RTQ
Sbjct: 440 LVSVISSCSKLGLINLCRSVHCYIIKNSVDEDVSIANSLIDMYGKGGNLSIAWKMFCRTQ 499

Query: 520 QKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGLK 579
            +D+V+WNTLISSY  SGH +EAI LFD+MI EK NPN  T VI LSAC HL SL+KG  
Sbjct: 500 -RDVVTWNTLISSYTHSGHYAEAITLFDEMISEKLNPNSATLVIVLSACCHLPSLEKGKM 559

Query: 580 IHQYIKENGCETDITVRTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISNYGMHGH 639
           +HQYIKE G E ++++ TAL+DMYAKCG+LE SR LFNSM+E+DVI  NVMIS YG+HG 
Sbjct: 560 VHQYIKEGGFELNVSLGTALVDMYAKCGQLEQSRELFNSMKEKDVISWNVMISGYGLHGD 619

Query: 640 VESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIVEGRRLFDVMHKYGIKPSLKHYAS 699
             SA+E+FQ ME SN+KPNA+TFLSLLSAC HAG++ EG++LFD M  Y IKP+LKH+A 
Sbjct: 620 ANSAMEVFQQMEQSNVKPNAITFLSLLSACTHAGYVDEGKQLFDRMQYYSIKPNLKHFAC 679

Query: 700 MVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHAIESDPKN 759
           M DLLGRSG+L+EAE LV SMPI PDG VWG+LLSACK+HNE E+GIR+A+ AIESDP+N
Sbjct: 680 MADLLGRSGNLQEAEDLVQSMPICPDGGVWGTLLSACKIHNEIEIGIRVAKCAIESDPEN 739

Query: 760 DGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAGWSAL 774
           DGYYI+LS++YG +G+W+E E+ R +MKERG+ KRAGWSA+
Sbjct: 740 DGYYIMLSNMYGSMGKWDEAERARELMKERGIGKRAGWSAV 744

BLAST of CmoCh20G009190 vs. TrEMBL
Match: M5W549_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021864mg PE=4 SV=1)

HSP 1 Score: 801.2 bits (2068), Expect = 1.1e-228
Identity = 402/685 (58.69%), Positives = 500/685 (72.99%), Query Frame = 1

Query: 93  ILRTNFFGIPLSNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 152
           I++T+F     SN     A DF+ +MRA    P QFT+PMVV++CAELM+L HG N+HGL
Sbjct: 101 IIKTHF-----SNGDYSKALDFFFQMRALGFAPTQFTLPMVVASCAELMLLEHGNNVHGL 160

Query: 153 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 212
           ALKLGLF GNSAVGSS +YMYSKCG  E A  MF E TV+DVV WTALIIGYVQN+E EK
Sbjct: 161 ALKLGLFSGNSAVGSSFVYMYSKCGRMEDAYFMFEETTVRDVVCWTALIIGYVQNDEIEK 220

Query: 213 GLKCLFEMHRNGCT---PNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSS 272
           GL+CL EMHR G +   PN+RT+  G QAC DL  LVEG+CLHG  +KSG  C E VKS 
Sbjct: 221 GLECLCEMHRVGGSDERPNFRTLEVGLQACGDLGTLVEGKCLHGFVVKSGIGCSEAVKSL 280

Query: 273 ILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPD 332
           +LSMYSRCG P E+Y  F +++ KDL+SWTS+I V+++ GLM ECL LF  MQ S I PD
Sbjct: 281 LLSMYSRCGVPGESYLSFCEIKDKDLLSWTSVIGVYARSGLMDECLSLFQGMQVSDIFPD 340

Query: 333 DIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFH 392
           +IV++CML GF N   I+EG AF   ++++ YA+S + H+ALLSMYCKF LL  A+K+F 
Sbjct: 341 EIVVNCMLSGFKNSTTINEGKAFLGSVIRKNYALSQMVHSALLSMYCKFELLTRAEKLFF 400

Query: 393 SF-HKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNI 452
              H++ E  NTMI GY+ MG           +HL                     A+++
Sbjct: 401 GMQHQNKESCNTMICGYAKMG-----------LHL--------------------GAIHL 460

Query: 453 GRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQ 512
           GRS+HCY IK S+ +N+S+ANSLLDMYGKSG+L  A RIF  TQ +DI++WNT+ISSY  
Sbjct: 461 GRSLHCYLIKVSMDENISVANSLLDMYGKSGHLKIARRIFSGTQ-RDIITWNTMISSYTH 520

Query: 513 SGHPSEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITV 572
           +GH +EAI LF+KMI   F PN  T V  LSAC+HLASL +G KIH +IKE   E ++++
Sbjct: 521 AGHSAEAIALFEKMIAVNFKPNSATLVTVLSACSHLASLGEGEKIHSHIKERRLEINLSL 580

Query: 573 RTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNI 632
            TAL+DMYAKCG+LE SR LF+SMEERDVI  NVMIS Y  HGH E A+EIF+ ME+SNI
Sbjct: 581 ATALVDMYAKCGQLEKSRELFDSMEERDVISWNVMISGYATHGHAEPALEIFRKMENSNI 640

Query: 633 KPNALTFLSLLSACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEA 692
           KPN LTFL+LLSACNH+G + EG+ LF  M    +KP+LKHYA MVD+LGRSG+L+EA+ 
Sbjct: 641 KPNELTFLALLSACNHSGLVEEGKYLFGKMQDLSLKPNLKHYACMVDILGRSGNLQEAKD 700

Query: 693 LVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGR 752
           LVLSMPI PDG VWGSLLSACK+HNE E+G+R+ARHAIESDP+NDGYYI+LS+LY  +GR
Sbjct: 701 LVLSMPIPPDGGVWGSLLSACKIHNEIELGVRVARHAIESDPENDGYYIMLSNLYSSIGR 748

Query: 753 WEEVEKVRGMMKERGVEKRAGWSAL 774
           WEE   VR MM+++G+ K  GWS +
Sbjct: 761 WEEATNVRKMMEKQGIGKTQGWSVV 748

BLAST of CmoCh20G009190 vs. TrEMBL
Match: A0A061DJE7_THECC (Pentatricopeptide repeat (PPR) superfamily protein, putative OS=Theobroma cacao GN=TCM_001088 PE=4 SV=1)

HSP 1 Score: 797.0 bits (2057), Expect = 2.1e-227
Identity = 388/675 (57.48%), Positives = 505/675 (74.81%), Query Frame = 1

Query: 104 SNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNS 163
           SN     +F+++L+MR  ++ PN FTIPMV S CAEL     G  +HGL LK GLF  NS
Sbjct: 111 SNGNYAESFEYHLKMRLHNTPPNDFTIPMVASACAELRWEGCGKYVHGLTLKFGLFAENS 170

Query: 164 AVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRN 223
           AVGSS +YMY+KCG+   A L+F+EI VKDVVAWTAL+IGYVQN ESEK LK L +MHR 
Sbjct: 171 AVGSSFVYMYAKCGSMGDACLVFDEIIVKDVVAWTALVIGYVQNGESEKALKRLRDMHRV 230

Query: 224 GCT----PNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGS 283
           G      PN+RT+ GG QAC  L AL EG+CLHG  +K+G   + VV+SSILSMYSRCGS
Sbjct: 231 GGDGEKRPNFRTLEGGLQACGSLCALYEGKCLHGFVVKTGLGFYPVVQSSILSMYSRCGS 290

Query: 284 PEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLG 343
             ++Y  F ++  KD+ISWTSII V+++ G + ECL L  +MQ  G+  D I+IS ++LG
Sbjct: 291 VGDSYASFSEVVHKDIISWTSIIGVYARFGFLKECLDLISKMQVDGLCADGILISSIVLG 350

Query: 344 FGNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHK-SSEDW 403
           FGNF  + +G AFH  ++++ + +  I HNALLSMYCKFGLL  A+K+F      + E W
Sbjct: 351 FGNFMSVCDGKAFHGLLIRRNFLLDQIVHNALLSMYCKFGLLSIAEKLFGIIPNCNKESW 410

Query: 404 NTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIK 463
           N M+ GY   G++E+ I+ FREM  LGIE DLNS VSVI SC  + A+ IG S+HC  +K
Sbjct: 411 NIMVSGYCKNGQEEQSIELFREMQHLGIETDLNSFVSVIFSCSELGAIRIGHSLHCNIVK 470

Query: 464 NSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDL 523
           + ++DN++IANSL+DMYGK+GNLT AWRIF++TQ +DI++WNT++S+Y + GH SEAI L
Sbjct: 471 SYMVDNITIANSLIDMYGKNGNLTIAWRIFNQTQ-RDIITWNTMMSAYTRCGHFSEAIAL 530

Query: 524 FDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAK 583
           FD+MI     PN+ T +  LSAC+HLAS +KG  IH YIKE G E   ++ TALIDMYAK
Sbjct: 531 FDQMISGNLTPNLATLLTVLSACSHLASWEKGEIIHCYIKEEGYELCQSLATALIDMYAK 590

Query: 584 CGELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSL 643
           CG+LE+SR LFNSM+E+D +  NVMIS YGMHG  +SA+EI+Q ME SN+KPNALTFLSL
Sbjct: 591 CGQLENSRELFNSMKEKDAVSWNVMISGYGMHGDAKSALEIYQQMEKSNVKPNALTFLSL 650

Query: 644 LSACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPD 703
           L++C HAG + EG+ LF  M  + +KP+LKHYA MVDLLGRSG+L++AEALV+SMPI+PD
Sbjct: 651 LNSCAHAGLVEEGKFLFGRMEHFLLKPNLKHYACMVDLLGRSGNLQDAEALVMSMPISPD 710

Query: 704 GTVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGM 763
           G +WG+LL AC +HNE EMG+RIA+ A+ SDP+NDGYYI++S++   +G WEE E+ R +
Sbjct: 711 GGIWGALLCACVVHNEIEMGVRIAKCAVASDPENDGYYILISNMCSSMGWWEEAERTREI 770

Query: 764 MKERGVEKRAGWSAL 774
           MKERG+ K+AGWSA+
Sbjct: 771 MKERGIGKKAGWSAM 784

BLAST of CmoCh20G009190 vs. TAIR10
Match: AT4G39952.1 (AT4G39952.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 695.7 bits (1794), Expect = 3.3e-200
Identity = 339/675 (50.22%), Positives = 461/675 (68.30%), Query Frame = 1

Query: 104 SNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNS 163
           SN     +  F+  M  S   P+ FT PMVVS CAEL+  + G  +HGL LK G F  N+
Sbjct: 102 SNGDYARSLCFFFSMLLSGQSPDHFTAPMVVSACAELLWFHVGTFVHGLVLKHGGFDRNT 161

Query: 164 AVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRN 223
           AVG+S +Y YSKCG  + A L+F+E+  +DVVAWTA+I G+VQN ESE GL  L +MH  
Sbjct: 162 AVGASFVYFYSKCGFLQDACLVFDEMPDRDVVAWTAIISGHVQNGESEGGLGYLCKMHSA 221

Query: 224 GCT---PNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSP 283
           G     PN RT+  GFQAC +L AL EGRCLHG A+K+G    + V+SS+ S YS+ G+P
Sbjct: 222 GSDVDKPNPRTLECGFQACSNLGALKEGRCLHGFAVKNGLASSKFVQSSMFSFYSKSGNP 281

Query: 284 EEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGF 343
            EAY  F +L  +D+ SWTSIIA  ++ G M E   +FWEMQ  G+ PD +VISC++   
Sbjct: 282 SEAYLSFRELGDEDMFSWTSIIASLARSGDMEESFDMFWEMQNKGMHPDGVVISCLINEL 341

Query: 344 GNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHK--SSEDW 403
           G    + +G AFH ++++ C+++     N+LLSMYCKF LL  A+K+F    +  + E W
Sbjct: 342 GKMMLVPQGKAFHGFVIRHCFSLDSTVCNSLLSMYCKFELLSVAEKLFCRISEEGNKEAW 401

Query: 404 NTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIK 463
           NTM+ GY  M    KCI+ FR++  LGIE D  S  SVISSC  + AV +G+S+HCY +K
Sbjct: 402 NTMLKGYGKMKCHVKCIELFRKIQNLGIEIDSASATSVISSCSHIGAVLLGKSLHCYVVK 461

Query: 464 NSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDL 523
            S+   +S+ NSL+D+YGK G+LT AWR+F      ++++WN +I+SY       +AI L
Sbjct: 462 TSLDLTISVVNSLIDLYGKMGDLTVAWRMFCEADT-NVITWNAMIASYVHCEQSEKAIAL 521

Query: 524 FDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAK 583
           FD+M+ E F P+ +T V  L AC +  SL++G  IH+YI E   E ++++  ALIDMYAK
Sbjct: 522 FDRMVSENFKPSSITLVTLLMACVNTGSLERGQMIHRYITETEHEMNLSLSAALIDMYAK 581

Query: 584 CGELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSL 643
           CG LE SR LF++  ++D +  NVMIS YGMHG VESAI +F  ME+S++KP   TFL+L
Sbjct: 582 CGHLEKSRELFDAGNQKDAVCWNVMISGYGMHGDVESAIALFDQMEESDVKPTGPTFLAL 641

Query: 644 LSACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPD 703
           LSAC HAG + +G++LF  MH+Y +KP+LKHY+ +VDLL RSG+LEEAE+ V+SMP +PD
Sbjct: 642 LSACTHAGLVEQGKKLFLKMHQYDVKPNLKHYSCLVDLLSRSGNLEEAESTVMSMPFSPD 701

Query: 704 GTVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGM 763
           G +WG+LLS+C  H EFEMGIR+A  A+ SDP+NDGYYI+L+++Y   G+WEE E+ R M
Sbjct: 702 GVIWGTLLSSCMTHGEFEMGIRMAERAVASDPQNDGYYIMLANMYSAAGKWEEAERAREM 761

Query: 764 MKERGVEKRAGWSAL 774
           M+E GV KRAG S +
Sbjct: 762 MRESGVGKRAGHSVV 775

BLAST of CmoCh20G009190 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 410.6 bits (1054), Expect = 2.1e-114
Identity = 219/646 (33.90%), Positives = 374/646 (57.89%), Query Frame = 1

Query: 129 TIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNE 188
           T+  V+  CA+   L  G  +       G FV +S +GS L  MY+ CG+ + AS +F+E
Sbjct: 96  TLCSVLQLCADSKSLKDGKEVDNFIRGNG-FVIDSNLGSKLSLMYTNCGDLKEASRVFDE 155

Query: 189 ITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVEG 248
           + ++  + W  L+    ++ +    +    +M  +G   +  T     ++   L ++  G
Sbjct: 156 VKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG 215

Query: 249 RCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKL 308
             LHG  LKSGF     V +S+++ Y +    + A + F ++ ++D+ISW SII  +   
Sbjct: 216 EQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSN 275

Query: 309 GLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITH 368
           GL  + L +F +M  SGI  D   I  +  G  +   IS G A H+  +K C++      
Sbjct: 276 GLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFC 335

Query: 369 NALLSMYCKFGLLRTADKIFHSFH-KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIE 428
           N LL MY K G L +A  +F     +S   + +MI GY+  G   + +  F EM   GI 
Sbjct: 336 NTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGIS 395

Query: 429 PDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRI 488
           PD+ ++ +V++ C   R ++ G+ VH +  +N +  ++ ++N+L+DMY K G++  A  +
Sbjct: 396 PDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELV 455

Query: 489 FHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEK-FNPNVVTCVIALSACAHLAS 548
           F   + KDI+SWNT+I  Y ++ + +EA+ LF+ +++EK F+P+  T    L ACA L++
Sbjct: 456 FSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSA 515

Query: 549 LDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISN 608
            DKG +IH YI  NG  +D  V  +L+DMYAKCG L  +  LF+ +  +D++   VMI+ 
Sbjct: 516 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 575

Query: 609 YGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIVEGRRLFDVM-HKYGIKP 668
           YGMHG  + AI +F  M  + I+ + ++F+SLL AC+H+G + EG R F++M H+  I+P
Sbjct: 576 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 635

Query: 669 SLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHA 728
           +++HYA +VD+L R+G L +A   + +MPI PD T+WG+LL  C++H++ ++  ++A   
Sbjct: 636 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 695

Query: 729 IESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAGWS 772
            E +P+N GYY++++++Y    +WE+V+++R  + +RG+ K  G S
Sbjct: 696 FELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRGLRKNPGCS 740

BLAST of CmoCh20G009190 vs. TAIR10
Match: AT1G69350.1 (AT1G69350.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 409.5 bits (1051), Expect = 4.7e-114
Identity = 218/666 (32.73%), Positives = 379/666 (56.91%), Query Frame = 1

Query: 111 AFDFYLEMRASSSLPNQFTIPMVVSTCA-ELMMLNHGMNIHGLALKLGLFVGNSAVGSSL 170
           A D Y  + + ++  ++F  P V+  CA     L+ G  +HG  +K G+   ++ + +SL
Sbjct: 84  AIDLYHRLVSETTQISKFVFPSVLRACAGSREHLSVGGKVHGRIIKGGVD-DDAVIETSL 143

Query: 171 IYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNY 230
           + MY + GN   A  +F+ + V+D+VAW+ L+   ++N E  K L+    M  +G  P+ 
Sbjct: 144 LCMYGQTGNLSDAEKVFDGMPVRDLVAWSTLVSSCLENGEVVKALRMFKCMVDDGVEPDA 203

Query: 231 RTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFK 290
            T+    + C +L  L   R +HG   +  F   E + +S+L+MYS+CG    + R F K
Sbjct: 204 VTMISVVEGCAELGCLRIARSVHGQITRKMFDLDETLCNSLLTMYSKCGDLLSSERIFEK 263

Query: 291 LEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEG 350
           + +K+ +SWT++I+ +++     + L  F EM  SGI P+ + +  +L   G    I EG
Sbjct: 264 IAKKNAVSWTAMISSYNRGEFSEKALRSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREG 323

Query: 351 NAFHAWILKQCYAVSGITHN-ALLSMYCKFGLLRTADKIFHSFH-KSSEDWNTMILGYSN 410
            + H + +++    +  + + AL+ +Y + G L   + +      ++   WN++I  Y++
Sbjct: 324 KSVHGFAVRRELDPNYESLSLALVELYAECGKLSDCETVLRVVSDRNIVAWNSLISLYAH 383

Query: 411 MGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSI 470
            G   + +  FR+M    I+PD  +L S IS+C     V +G+ +H + I+  + D   +
Sbjct: 384 RGMVIQALGLFRQMVTQRIKPDAFTLASSISACENAGLVPLGKQIHGHVIRTDVSDEF-V 443

Query: 471 ANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKF 530
            NSL+DMY KSG++ +A  +F++ + + +V+WN+++  + Q+G+  EAI LFD M     
Sbjct: 444 QNSLIDMYSKSGSVDSASTVFNQIKHRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYL 503

Query: 531 NPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELESSRT 590
             N VT +  + AC+ + SL+KG  +H  +  +G + D+   TALIDMYAKCG+L ++ T
Sbjct: 504 EMNEVTFLAVIQACSSIGSLEKGKWVHHKLIISGLK-DLFTDTALIDMYAKCGDLNAAET 563

Query: 591 LFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGH 650
           +F +M  R ++  + MI+ YGMHG + SAI  F  M +S  KPN + F+++LSAC H+G 
Sbjct: 564 VFRAMSSRSIVSWSSMINAYGMHGRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGS 623

Query: 651 IVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLS 710
           + EG+  F++M  +G+ P+ +H+A  +DLL RSG L+EA   +  MP   D +VWGSL++
Sbjct: 624 VEEGKYYFNLMKSFGVSPNSEHFACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSLVN 683

Query: 711 ACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKR 770
            C++H + ++   I     +    + GYY +LS++Y   G WEE  ++R  MK   ++K 
Sbjct: 684 GCRIHQKMDIIKAIKNDLSDIVTDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKV 743

Query: 771 AGWSAL 774
            G+SA+
Sbjct: 744 PGYSAI 746

BLAST of CmoCh20G009190 vs. TAIR10
Match: AT3G03580.1 (AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 401.4 bits (1030), Expect = 1.3e-111
Identity = 217/669 (32.44%), Positives = 376/669 (56.20%), Query Frame = 1

Query: 105 NPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSA 164
           N   P A +FY ++R S   P+++T P V+  CA L     G  ++   L +G F  +  
Sbjct: 84  NGLFPEALEFYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMG-FESDLF 143

Query: 165 VGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNG 224
           VG++L+ MYS+ G    A  +F+E+ V+D+V+W +LI GY  +   E+ L+   E+  + 
Sbjct: 144 VGNALVDMYSRMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSW 203

Query: 225 CTPNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAY 284
             P+  T+     A  +L  + +G+ LHG ALKSG     VV + +++MY +   P +A 
Sbjct: 204 IVPDSFTVSSVLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYLKFRRPTDAR 263

Query: 285 RCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFD 344
           R F +++ +D +S+ ++I  + KL ++ E + +F E       PD + +S +L   G+  
Sbjct: 264 RVFDEMDVRDSVSYNTMICGYLKLEMVEESVRMFLE-NLDQFKPDLLTVSSVLRACGHLR 323

Query: 345 RISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFH-KSSEDWNTMIL 404
            +S     + ++LK  + +     N L+ +Y K G + TA  +F+S   K +  WN++I 
Sbjct: 324 DLSLAKYIYNYMLKAGFVLESTVRNILIDVYAKCGDMITARDVFNSMECKDTVSWNSIIS 383

Query: 405 GYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIID 464
           GY   G+  + +  F+ M ++  + D  + + +IS    +  +  G+ +H   IK+ I  
Sbjct: 384 GYIQSGDLMEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHSNGIKSGICI 443

Query: 465 NVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMI 524
           ++S++N+L+DMY K G +  + +IF      D V+WNT+IS+  + G  +  + +  +M 
Sbjct: 444 DLSVSNALIDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFATGLQVTTQMR 503

Query: 525 KEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELE 584
           K +  P++ T ++ L  CA LA+   G +IH  +   G E+++ +  ALI+MY+KCG LE
Sbjct: 504 KSEVVPDMATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIEMYSKCGCLE 563

Query: 585 SSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACN 644
           +S  +F  M  RDV+    MI  YGM+G  E A+E F  ME S I P+++ F++++ AC+
Sbjct: 564 NSSRVFERMSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIYACS 623

Query: 645 HAGHIVEGRRLFDVMH-KYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVW 704
           H+G + EG   F+ M   Y I P ++HYA +VDLL RS  + +AE  + +MPI PD ++W
Sbjct: 624 HSGLVDEGLACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAMPIKPDASIW 683

Query: 705 GSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKER 764
            S+L AC+   + E   R++R  IE +P + GY I+ S+ Y  L +W++V  +R  +K++
Sbjct: 684 ASVLRACRTSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVSLIRKSLKDK 743

Query: 765 GVEKRAGWS 772
            + K  G+S
Sbjct: 744 HITKNPGYS 750

BLAST of CmoCh20G009190 vs. TAIR10
Match: AT3G63370.1 (AT3G63370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 400.6 bits (1028), Expect = 2.2e-111
Identity = 225/701 (32.10%), Positives = 383/701 (54.64%), Query Frame = 1

Query: 80  GNLRSPHNCFDLFILRTNF-----FGIPLSNPTSPMAFDFYLEMRASSSLPNQFTIPMVV 139
           G+L      FD    RT F      G  +SN     A   Y  MR         + P ++
Sbjct: 130 GSLDDAEKVFDEMPDRTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALL 189

Query: 140 STCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVK-D 199
             CA+L  +  G  +H L +KLG +     + ++L+ MY+K  +  +A  +F+    K D
Sbjct: 190 KACAKLRDIRSGSELHSLLVKLG-YHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGD 249

Query: 200 VVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNYRTIGGGFQACVDLDALVEGRCLHG 259
            V W +++  Y  + +S + L+   EMH  G  PN  TI     AC        G+ +H 
Sbjct: 250 AVLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHA 309

Query: 260 LALKSGFLCFEV-VKSSILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMS 319
             LKS     E+ V +++++MY+RCG   +A R   ++   D+++W S+I  + +  +  
Sbjct: 310 SVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYK 369

Query: 320 ECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALL 379
           E L  F +M A+G   D++ ++ ++   G    +  G   HA+++K  +  +    N L+
Sbjct: 370 EALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIKHGWDSNLQVGNTLI 429

Query: 380 SMYCKFGLLRTADKIFHSFH-KSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLN 439
            MY K  L     + F   H K    W T+I GY+      + ++ FR++    +E D  
Sbjct: 430 DMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEM 489

Query: 440 SLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRT 499
            L S++ +   ++++ I + +HC+ ++  ++D V I N L+D+YGK  N+  A R+F   
Sbjct: 490 ILGSILRASSVLKSMLIVKEIHCHILRKGLLDTV-IQNELVDVYGKCRNMGYATRVFESI 549

Query: 500 QQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGL 559
           + KD+VSW ++ISS   +G+ SEA++LF +M++   + + V  +  LSA A L++L+KG 
Sbjct: 550 KGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGR 609

Query: 560 KIHQYIKENGCETDITVRTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISNYGMHG 619
           +IH Y+   G   + ++  A++DMYA CG+L+S++ +F+ +E + ++    MI+ YGMHG
Sbjct: 610 EIHCYLLRKGFCLEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHG 669

Query: 620 HVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIVEGRRLFDVM-HKYGIKPSLKHY 679
             ++A+E+F  M   N+ P+ ++FL+LL AC+HAG + EGR    +M H+Y ++P  +HY
Sbjct: 670 CGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHY 729

Query: 680 ASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHAIESDP 739
             +VD+LGR+  + EA   V  M   P   VW +LL+AC+ H+E E+G   A+  +E +P
Sbjct: 730 VCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEP 789

Query: 740 KNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAGWS 772
           KN G  +++S+++   GRW +VEKVR  MK  G+EK  G S
Sbjct: 790 KNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCS 828

BLAST of CmoCh20G009190 vs. NCBI nr
Match: gi|659118561|ref|XP_008459184.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial [Cucumis melo])

HSP 1 Score: 1196.8 bits (3095), Expect = 0.0e+00
Identity = 603/784 (76.91%), Positives = 666/784 (84.95%), Query Frame = 1

Query: 1   MLSLRLIQFRQFRKCFAFSSTFSSLPDPHDLGSCLHFFF------FQNKI-YHSNLFFNF 60
           ML LRL    QF   FAFSSTF+SLPDPH   +CLH FF      FQ+ + +HS +    
Sbjct: 56  MLRLRL---SQFHIRFAFSSTFTSLPDPHYPNNCLHSFFSKPSLTFQSLLQFHSLIITTG 115

Query: 61  IPSLSPLATRTMPSLPQSSWPFMPVMGNLRSPHNCFDLF----ILRTNFFGIPLSNPTSP 120
                  AT+ M        P      +L    +  D+F    I++++F     SN    
Sbjct: 116 NSDNVFFATKLMAFYASHRQPAFST--HLFRLIHSKDIFLWNSIIQSHF-----SNGDYQ 175

Query: 121 MAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSL 180
            AFDFYL+MRASSSLPNQFT+PMVVSTCAELMM NHGMNIHGL  KLGLFV NSA+GSS 
Sbjct: 176 RAFDFYLQMRASSSLPNQFTVPMVVSTCAELMMFNHGMNIHGLTSKLGLFVSNSAIGSSF 235

Query: 181 IYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNY 240
           IYMYSKCG+ ESASLMF+EITVKDVVAWTALI+GYVQNNES +GLKCLFEMHR G TPNY
Sbjct: 236 IYMYSKCGHVESASLMFSEITVKDVVAWTALIVGYVQNNESGRGLKCLFEMHRIGGTPNY 295

Query: 241 RTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFK 300
           +TIG GFQACVDLDALVEG+CLHGLALK+GFLCF+VVKS+ILSMYSRCGSPEEAYRCF K
Sbjct: 296 KTIGSGFQACVDLDALVEGKCLHGLALKNGFLCFKVVKSTILSMYSRCGSPEEAYRCFCK 355

Query: 301 LEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEG 360
           L+QKDLISWTSIIAVHSK GLMSECLHLFWEMQ S IIPD+IVISCML+GFGN  RI EG
Sbjct: 356 LDQKDLISWTSIIAVHSKFGLMSECLHLFWEMQDSEIIPDEIVISCMLMGFGNSGRIFEG 415

Query: 361 NAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHKSSEDWNTMILGYSNMG 420
            AFHAWILKQC A++GITHNALLSMYCKFG L TA+KIFHSFHKSSEDW+TMILGYSNMG
Sbjct: 416 KAFHAWILKQCCAMNGITHNALLSMYCKFGHLGTANKIFHSFHKSSEDWSTMILGYSNMG 475

Query: 421 EKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIAN 480
           +KE CI F REM LLG EPDLNSLVSVISSC  V A+NIGRS+HCYAIKNSII+NVSIAN
Sbjct: 476 QKENCISFLREMLLLGREPDLNSLVSVISSCSQVGAINIGRSIHCYAIKNSIIENVSIAN 535

Query: 481 SLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNP 540
           SL+DMYGKSG++TA WRIFHRTQQ+D++SWNTLISSYKQSG+ +EAI LFDKM+KEK  P
Sbjct: 536 SLMDMYGKSGHVTATWRIFHRTQQRDVISWNTLISSYKQSGNLAEAIILFDKMVKEKVYP 595

Query: 541 NVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELESSRTLF 600
           N VTCVI LS CAHLASLDKG KIHQYIKENG E++IT+RTALIDMYAKCGELE+SR LF
Sbjct: 596 NKVTCVIVLSVCAHLASLDKGEKIHQYIKENGFESNITIRTALIDMYAKCGELETSRKLF 655

Query: 601 NSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIV 660
           NS EERD IL NVMISNYGMHGHVESA+EIFQLME+SNIKPNA TFLSLLSACNH GH++
Sbjct: 656 NSTEERDAILWNVMISNYGMHGHVESAMEIFQLMEESNIKPNAQTFLSLLSACNHTGHVL 715

Query: 661 EGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSAC 720
           EGR LFD M KYGI+PSLKHYAS++DLLGRSGSLE AEALVLSMPITPDGTVWGSLLSAC
Sbjct: 716 EGRDLFDRMTKYGIEPSLKHYASVIDLLGRSGSLEAAEALVLSMPITPDGTVWGSLLSAC 775

Query: 721 KLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAG 774
           K+HNEFEMG+R+AR+AIESDPKNDGYYI+LSDLY CLGRW+EVEKVRGMMKERGVEKR G
Sbjct: 776 KIHNEFEMGVRLARYAIESDPKNDGYYIILSDLYSCLGRWDEVEKVRGMMKERGVEKRTG 829

BLAST of CmoCh20G009190 vs. NCBI nr
Match: gi|449460752|ref|XP_004148109.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial [Cucumis sativus])

HSP 1 Score: 1188.3 bits (3073), Expect = 0.0e+00
Identity = 600/784 (76.53%), Positives = 667/784 (85.08%), Query Frame = 1

Query: 1   MLSLRLIQFRQFRKCFAFSSTFSSLPDPHDLGSCLHFFF------FQNKI-YHSNLFFNF 60
           ML LRL    QF   FAFSSTF+SL D H   +CLH FF      FQ+ + +HS +    
Sbjct: 11  MLRLRL---SQFHIRFAFSSTFTSLSDSHYPNNCLHSFFSKPNLTFQSLLQFHSLIITTG 70

Query: 61  IPSLSPLATRTMPSLPQSSWPFMPVMGNLRSPHNCFDLF----ILRTNFFGIPLSNPTSP 120
             +    AT+ M        P      +L    +  D+F    I++++F     SN    
Sbjct: 71  NSNNVFFATKLMAFYAYHRKPAFST--HLFRLIHSKDIFLWNSIIQSHF-----SNGDYQ 130

Query: 121 MAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNSAVGSSL 180
            AFDFYL+MRASSSLPNQFT+PMVVSTCAELMM NHGMNIHGL  KLGLFVGNSA+GSS 
Sbjct: 131 RAFDFYLQMRASSSLPNQFTVPMVVSTCAELMMFNHGMNIHGLTSKLGLFVGNSAIGSSF 190

Query: 181 IYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHRNGCTPNY 240
           IYMYSKCG+ ESAS+MF+EITVKDVV WTALI+GYVQNNES +GLKCLFEMHR G TPNY
Sbjct: 191 IYMYSKCGHVESASIMFSEITVKDVVTWTALIVGYVQNNESGRGLKCLFEMHRIGGTPNY 250

Query: 241 RTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSPEEAYRCFFK 300
           +TIG GFQACVDLDALVEG+CLHGLALK+GFLCFEVVKS+ILSMYSRCGSPEEAYRCF K
Sbjct: 251 KTIGSGFQACVDLDALVEGKCLHGLALKNGFLCFEVVKSTILSMYSRCGSPEEAYRCFCK 310

Query: 301 LEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGFGNFDRISEG 360
           L+QKDLISWTSIIAVHSK GLMSECLHLFWEMQAS IIPD+IVISCML+GFGN DRI EG
Sbjct: 311 LDQKDLISWTSIIAVHSKFGLMSECLHLFWEMQASEIIPDEIVISCMLMGFGNSDRIFEG 370

Query: 361 NAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHKSSEDWNTMILGYSNMG 420
            AFHA ILKQC A+SGITHNALLSMYCKFG L TA+KIFHSFHKSSEDW+TMILGYSNMG
Sbjct: 371 KAFHARILKQCCALSGITHNALLSMYCKFGHLGTANKIFHSFHKSSEDWSTMILGYSNMG 430

Query: 421 EKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKNSIIDNVSIAN 480
           +KEKCI F REM LLG EPDLNSLVSVISSC  V A+NIGRS+HCYAIKNSII+NVS+AN
Sbjct: 431 QKEKCISFLREMLLLGREPDLNSLVSVISSCSQVGAINIGRSIHCYAIKNSIIENVSVAN 490

Query: 481 SLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLFDKMIKEKFNP 540
           SL+DMYGKSG++TA WRIFHRT Q+D++SWNTLISSYKQSG  +EAI LFDKM+KEK  P
Sbjct: 491 SLMDMYGKSGHVTATWRIFHRTLQRDVISWNTLISSYKQSGILAEAIILFDKMVKEKVYP 550

Query: 541 NVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKCGELESSRTLF 600
           N VTC+I LSACAHLASLD+G KIHQYIKENG E++IT+RTALIDMYAKCGELE+SR LF
Sbjct: 551 NKVTCIIVLSACAHLASLDEGEKIHQYIKENGFESNITIRTALIDMYAKCGELETSRKLF 610

Query: 601 NSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLLSACNHAGHIV 660
           NS EERDVIL NVMISNYGMHGHVESA+EIFQLME+SNIKPNA TFLSLLSACNH GH++
Sbjct: 611 NSTEERDVILWNVMISNYGMHGHVESAMEIFQLMEESNIKPNAQTFLSLLSACNHTGHVL 670

Query: 661 EGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDGTVWGSLLSAC 720
           EGR LFD M KYGI+PSLKHYAS++DLLGRSGSLE AEALVLSMPITPDGTVWGSLLSAC
Sbjct: 671 EGRHLFDRMQKYGIEPSLKHYASIIDLLGRSGSLEAAEALVLSMPITPDGTVWGSLLSAC 730

Query: 721 KLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMMKERGVEKRAG 774
           K+HNEFE+G+R+AR+AIESDPKNDGYYI+LSDLY CLGRW+EVEKVR MMK+RGVEKRAG
Sbjct: 731 KIHNEFEVGVRLARYAIESDPKNDGYYIILSDLYSCLGRWDEVEKVRDMMKKRGVEKRAG 784

BLAST of CmoCh20G009190 vs. NCBI nr
Match: gi|743861097|ref|XP_011031040.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial [Populus euphratica])

HSP 1 Score: 871.7 bits (2251), Expect = 9.6e-250
Identity = 418/674 (62.02%), Positives = 523/674 (77.60%), Query Frame = 1

Query: 104 SNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGLALKLGLFVGNS 163
           SN     AFDFY++MR  ++ PNQFTIPM+V+TCAEL+ L  G  IHGL  K G F  NS
Sbjct: 107 SNGNYFKAFDFYIQMRYDNTPPNQFTIPMIVATCAELLWLEEGKYIHGLVSKSGFFAENS 166

Query: 164 AVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEKGLKCLFEMHR- 223
           AVGSS +YMY+KCG  E ASLMF+EI V+DVV+WTAL+IGYV N++SEKGL+CL EMHR 
Sbjct: 167 AVGSSFVYMYAKCGVMEDASLMFDEIVVRDVVSWTALVIGYVHNDDSEKGLECLCEMHRI 226

Query: 224 --NGCTPNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSSILSMYSRCGSP 283
             +G   N RT+ GGFQAC +L A++ GRCLHGLA+K+G  C   V+SS+LSMYS+CG+ 
Sbjct: 227 GGDGEKVNSRTLEGGFQACGNLGAMIAGRCLHGLAVKTGLGCSHAVQSSLLSMYSKCGNV 286

Query: 284 EEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPDDIVISCMLLGF 343
           EEA++ F ++  KD+ SWTS+I V ++ G M+ECL+LFW+MQ   + PD IV+SC+LLGF
Sbjct: 287 EEAHKSFCQVVDKDVFSWTSVIGVCARFGFMNECLNLFWDMQVDDVYPDGIVVSCILLGF 346

Query: 344 GNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFHSFHK-SSEDWN 403
           GN   + EG AFH  I+++ Y +    +NALLSMYCKFG L  A+K+    H+ S E WN
Sbjct: 347 GNSMMVREGKAFHGLIVRRNYVLDDTVNNALLSMYCKFGTLNPAEKLLDGVHEWSKESWN 406

Query: 404 TMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNIGRSVHCYAIKN 463
           TM+ GY  MG + KCI+ FREM  LGIE D NSLVSVISSC  +  +N  RSVHCY IKN
Sbjct: 407 TMVFGYGKMGIEGKCIELFREMRDLGIEADSNSLVSVISSCSKLGLINPCRSVHCYIIKN 466

Query: 464 SIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQSGHPSEAIDLF 523
           S+ ++VSIANSL+DMYGK GNL+ AW++F RTQ +D+V+WNTLISSY  SGH +EAI LF
Sbjct: 467 SVDEDVSIANSLIDMYGKGGNLSIAWKMFCRTQ-RDVVTWNTLISSYTHSGHHAEAITLF 526

Query: 524 DKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITVRTALIDMYAKC 583
           D+MI EK NPN  T VI LSAC HL SL+KG  +HQYIKE G E ++++ TAL+DMYAKC
Sbjct: 527 DEMISEKLNPNSATLVIVLSACGHLPSLEKGKMVHQYIKEGGFELNVSLGTALVDMYAKC 586

Query: 584 GELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNIKPNALTFLSLL 643
           G+LE SR LFNSM+E+DVI  NVMIS YG+HG   SA+E+FQ ME SN+KPNA+TFLSLL
Sbjct: 587 GQLEQSRELFNSMKEKDVISWNVMISGYGLHGDANSAMEVFQQMEQSNVKPNAITFLSLL 646

Query: 644 SACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEALVLSMPITPDG 703
           SAC HAG++ EG++LFD M  Y IKP+LKH+A + DLLGRSG+L+EAE LV SMPI PD 
Sbjct: 647 SACTHAGYVDEGKQLFDRMQYYSIKPNLKHFACLADLLGRSGNLQEAEDLVQSMPICPDA 706

Query: 704 TVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGRWEEVEKVRGMM 763
            VWG+LLSACK+HNE E+GIR+A  AIESDP+NDGYYI+LS++YG +G+W+E E+ R +M
Sbjct: 707 GVWGTLLSACKIHNEIEIGIRVATCAIESDPENDGYYIMLSNMYGSMGKWDEAERTRELM 766

Query: 764 KERGVEKRAGWSAL 774
           KERGV KRAGWSA+
Sbjct: 767 KERGVGKRAGWSAV 779

BLAST of CmoCh20G009190 vs. NCBI nr
Match: gi|658021995|ref|XP_008346407.1| (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like [Malus domestica])

HSP 1 Score: 856.3 bits (2211), Expect = 4.2e-245
Identity = 421/685 (61.46%), Positives = 518/685 (75.62%), Query Frame = 1

Query: 93  ILRTNFFGIPLSNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 152
           I++T+F     SN     A  F+ +MRAS   PNQFT+PMVVS+CAELM+L+HG N+HGL
Sbjct: 103 IIKTHF-----SNGGYSKALVFFFQMRASGFAPNQFTLPMVVSSCAELMVLDHGNNVHGL 162

Query: 153 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 212
             KLGLF GNSAVGSS +YMYSKCG  E AS+MF+EITV+DVV WTALIIGYVQN+ESEK
Sbjct: 163 GKKLGLFAGNSAVGSSFVYMYSKCGRMEDASJMFDEITVRDVVCWTALIIGYVQNDESEK 222

Query: 213 GLKCLFEMHRNGCT---PNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSS 272
           GL+CL EMHR G     PN+RT+  G QAC DL ALVEGRCLHG  +K G  C   VKS 
Sbjct: 223 GLECLCEMHRIGGIGERPNFRTLEVGLQACGDLGALVEGRCLHGFVVKRGIGCSGAVKSL 282

Query: 273 ILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPD 332
           +LSMYSRCG PEE+Y  F  +E KD+ISWTS+I V+++ GLM  CL LFWEMQ S I PD
Sbjct: 283 LLSMYSRCGRPEESYLSFCDIENKDVISWTSVIGVYARSGLMDGCLSLFWEMQDSDIFPD 342

Query: 333 DIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFH 392
           +IV+SCML GF N   I+EG AF   + +Q YA S + H+ LLSMYCKF LL  A+K+F 
Sbjct: 343 EIVVSCMLSGFRNSTNINEGKAFLGLVTRQNYASSQVVHSELLSMYCKFELLTLAEKLFS 402

Query: 393 SF-HKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNI 452
              H++ E  NTMI GY  +G + KCI+ FR+M   GIE D NSLVSV+SSC  +  +++
Sbjct: 403 GMQHQNKESCNTMIYGYGKLGLRTKCIELFRKMRHQGIEADSNSLVSVVSSCFQMGTIHL 462

Query: 453 GRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQ 512
           G+S+HC+ IK  + +NVS+ANSL+DMYGKSG LT A RIF  TQ KDI++WN+LISSY  
Sbjct: 463 GQSLHCFIIKVCMDENVSVANSLIDMYGKSGYLTIARRIFSVTQ-KDIITWNSLISSYTH 522

Query: 513 SGHPSEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITV 572
           +GH  EAIDL+ KMI E F PN  T V  LSAC+HLASL++G+K+H +IKE     ++++
Sbjct: 523 NGHSFEAIDLYHKMIAENFMPNSATLVTVLSACSHLASLEEGIKVHCHIKERRIGNNLSL 582

Query: 573 RTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNI 632
            TAL+DMYAKCGELE SR LFNSME+RDVI  NVMIS Y  HGH ESAIE+F  MEDSN+
Sbjct: 583 STALVDMYAKCGELEKSRELFNSMEDRDVISWNVMISGYATHGHAESAIELFHEMEDSNV 642

Query: 633 KPNALTFLSLLSACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEA 692
            PN LTFL+LLSACNH+G + EG+ LF  M    + P+LKHYA MVD+LGRSG+L+EAE 
Sbjct: 643 IPNELTFLALLSACNHSGLVEEGKYLFRKMQDLSLNPNLKHYACMVDILGRSGNLQEAED 702

Query: 693 LVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGR 752
           LVLSMPI+PDG VWGSLL ACK+HNE E+G+R+ARHAI+SDP+NDGYY++LS+LYG +G+
Sbjct: 703 LVLSMPISPDGGVWGSLLGACKIHNEIELGVRVARHAIKSDPENDGYYVMLSNLYGSIGK 762

Query: 753 WEEVEKVRGMMKERGVEKRAGWSAL 774
           WEE   VR MM+E+GV    GWS +
Sbjct: 763 WEEAINVRKMMEEKGVGTTKGWSVV 781

BLAST of CmoCh20G009190 vs. NCBI nr
Match: gi|645219123|ref|XP_008234126.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial [Prunus mume])

HSP 1 Score: 849.7 bits (2194), Expect = 3.9e-243
Identity = 418/685 (61.02%), Positives = 519/685 (75.77%), Query Frame = 1

Query: 93  ILRTNFFGIPLSNPTSPMAFDFYLEMRASSSLPNQFTIPMVVSTCAELMMLNHGMNIHGL 152
           I++T+F     SN     A DF+ +MRA    P QFT+PMVV++CAELM+L HG N+HGL
Sbjct: 101 IIKTHF-----SNGDYSKALDFFFQMRALGFAPTQFTLPMVVASCAELMLLEHGNNVHGL 160

Query: 153 ALKLGLFVGNSAVGSSLIYMYSKCGNAESASLMFNEITVKDVVAWTALIIGYVQNNESEK 212
           A KLG+F GNSAVGSS +YMYSKCG  E A  MF E TV+DVV WTALIIGYVQN+ESEK
Sbjct: 161 ASKLGIFSGNSAVGSSFVYMYSKCGRMEDAYFMFEETTVRDVVCWTALIIGYVQNDESEK 220

Query: 213 GLKCLFEMHRNGCT---PNYRTIGGGFQACVDLDALVEGRCLHGLALKSGFLCFEVVKSS 272
           GL+CL EMHR G +   PN+RT+  G QAC DL  LVEG+CLHG  +KSG  C E VKS 
Sbjct: 221 GLECLCEMHRVGGSDERPNFRTLEVGLQACGDLGTLVEGKCLHGFVVKSGIGCSEAVKSL 280

Query: 273 ILSMYSRCGSPEEAYRCFFKLEQKDLISWTSIIAVHSKLGLMSECLHLFWEMQASGIIPD 332
           +LSMYSRCG+P E+Y  F +++ KDL+SWTS+I V+++ GLM ECL LF  MQ S I PD
Sbjct: 281 LLSMYSRCGAPGESYLSFCEIKDKDLLSWTSVIGVYARSGLMDECLSLFQGMQVSDIFPD 340

Query: 333 DIVISCMLLGFGNFDRISEGNAFHAWILKQCYAVSGITHNALLSMYCKFGLLRTADKIFH 392
            IV++CML GF N   I+EG AF   ++++ YA+S + H+ALLSMYCKF LL  A+K+F 
Sbjct: 341 KIVVNCMLSGFKNSTTINEGKAFLGSVIRKNYALSQMVHSALLSMYCKFELLTRAEKLFF 400

Query: 393 SF-HKSSEDWNTMILGYSNMGEKEKCIDFFREMHLLGIEPDLNSLVSVISSCLPVRAVNI 452
              H++ E  NTMI GY+ MG   KCI+ FR+M  LGIE D NSLVSVI SC  + A+++
Sbjct: 401 GMQHQNKESCNTMICGYAKMGLHVKCIELFRKMQHLGIEADSNSLVSVICSCFQLGAIHL 460

Query: 453 GRSVHCYAIKNSIIDNVSIANSLLDMYGKSGNLTAAWRIFHRTQQKDIVSWNTLISSYKQ 512
           GRS+HCY IK S+ +N+S+ANSLLDMYGKSG+L  A RIF  TQ +DI++WNT+ISSY  
Sbjct: 461 GRSLHCYLIKVSMDENISVANSLLDMYGKSGHLNIARRIFSGTQ-RDIITWNTMISSYTH 520

Query: 513 SGHPSEAIDLFDKMIKEKFNPNVVTCVIALSACAHLASLDKGLKIHQYIKENGCETDITV 572
           +GH +EAI LF KMI   F PN  T V  LSAC+HLASL +G KIH +IKE   E ++++
Sbjct: 521 AGHSAEAIALFKKMIAVNFKPNSATLVTVLSACSHLASLGEGEKIHSHIKERRLEINLSL 580

Query: 573 RTALIDMYAKCGELESSRTLFNSMEERDVILCNVMISNYGMHGHVESAIEIFQLMEDSNI 632
            TAL+DMYAKCG+LE SR LF+SMEERDVI  NVMIS Y MHGH E A+EIF+ ME+SN+
Sbjct: 581 ATALVDMYAKCGQLEKSRELFDSMEERDVISWNVMISGYAMHGHAEPALEIFRKMENSNV 640

Query: 633 KPNALTFLSLLSACNHAGHIVEGRRLFDVMHKYGIKPSLKHYASMVDLLGRSGSLEEAEA 692
           KPN LTFL+LLSACNH+G + EG+ LF  M    ++P+LKHYA MVD+LGRSG+L+EA+ 
Sbjct: 641 KPNELTFLALLSACNHSGLVEEGKYLFGKMQDLSLEPNLKHYACMVDILGRSGNLQEAKD 700

Query: 693 LVLSMPITPDGTVWGSLLSACKLHNEFEMGIRIARHAIESDPKNDGYYIVLSDLYGCLGR 752
           LVLSMPI PDG VWGSLLSACK+HNE E+G+R+ARHAIESDP+NDGYYI+LS+LY  +GR
Sbjct: 701 LVLSMPIPPDGGVWGSLLSACKIHNEIELGVRVARHAIESDPENDGYYIMLSNLYSSVGR 760

Query: 753 WEEVEKVRGMMKERGVEKRAGWSAL 774
           WEE   VR MM ++G+ K  GWS +
Sbjct: 761 WEEATNVRKMMDKKGIGKTQGWSVV 779
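
The Identity and Positives percentages in the HSP header are simple ratios over the alignment length, and the per-block counts can be recovered from the alignment rows. The sketch below is illustrative only: the helper names `percent` and `count_block` are hypothetical, and it assumes the usual BLAST midline convention (a letter for an identical pair, `+` for a conservative substitution, a space otherwise).

```python
def percent(part: int, total: int) -> float:
    """Return part/total as a percentage rounded to two decimals."""
    return round(100.0 * part / total, 2)


def count_block(query: str, midline: str, sbjct: str) -> tuple:
    """Count (identities, positives, columns) for one alignment block.

    Assumption: the midline uses '+' for conservative substitutions,
    so positives = identities + number of '+' characters.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # An identity is a column where both rows carry the same residue.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Header totals reported for the Prunus mume hit (418/685 and 519/685).
identity_pct = percent(418, 685)
positives_pct = percent(519, 685)

# Final block of the same alignment, copied from the listing above.
query = "WEEVEKVRGMMKERGVEKRAGWSAL"
midline = "WEE   VR MM ++G+ K  GWS +"
sbjct = "WEEATNVRKMMDKKGIGKTQGWSVV"
block_counts = count_block(query, midline, sbjct)
```

For this last block the counts come to 12 identities and 16 positives over 25 columns; summing such counts over all blocks of an HSP should reproduce the header totals.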

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
PP359_ARATH	5.9e-199	50.22	Pentatricopeptide repeat-containing protein At4g39952, mitochondrial OS=Arabidop...
PP320_ARATH	3.8e-113	33.90	Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PP111_ARATH	8.4e-113	32.73	Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS...
PP210_ARATH	2.3e-110	32.44	Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN...
PP296_ARATH	3.9e-110	32.10	Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...

Match Name	E-value	Identity	Description
A0A0A0LRH3_CUCSA	0.0e+00	76.53	Uncharacterized protein OS=Cucumis sativus GN=Csa_2G439160 PE=4 SV=1
W9QNE1_9ROSA	2.6e-238	60.71	Uncharacterized protein OS=Morus notabilis GN=L484_022141 PE=4 SV=1
B9H4S5_POPTR	6.9e-231	54.40	Pentatricopeptide repeat-containing family protein (Fragment) OS=Populus trichoc...
M5W549_PRUPE	1.1e-228	58.69	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa021864mg PE=4 SV=1
A0A061DJE7_THECC	2.1e-227	57.48	Pentatricopeptide repeat (PPR) superfamily protein, putative OS=Theobroma cacao ...

Match Name	E-value	Identity	Description
AT4G39952.1	3.3e-200	50.22	Pentatricopeptide repeat (PPR) superfamily protein
AT4G18750.1	2.1e-114	33.90	Pentatricopeptide repeat (PPR) superfamily protein
AT1G69350.1	4.7e-114	32.73	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G03580.1	1.3e-111	32.44	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G63370.1	2.2e-111	32.10	Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name	E-value	Identity	Description
gi|659118561|ref|XP_008459184.1|	0.0e+00	76.91	PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial ...
gi|449460752|ref|XP_004148109.1|	0.0e+00	76.53	PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial ...
gi|743861097|ref|XP_011031040.1|	9.6e-250	62.02	PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial ...
gi|658021995|ref|XP_008346407.1|	4.2e-245	61.46	PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g...
gi|645219123|ref|XP_008234126.1|	3.9e-243	61.02	PREDICTED: pentatricopeptide repeat-containing protein At4g39952, mitochondrial ...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CmoCh20G009190.1	CmoCh20G009190.1	mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR
	coord: 366..391, score: 0.0068
	coord: 670..693, score: 0.4
	coord: 398..426, score: 9.4E-5
	coord: 268..294, score: 0.044
	coord: 296..326, score: 2.3E-5
	coord: 738..764, score:
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2
	coord: 596..642, score: 1.1E-9
	coord: 494..542, score: 7.0E-13
	coord: 192..231, score: 3.
IPR002885	Pentatricopeptide repeat	TIGRFAMs	TIGR00756	TIGR00756
	coord: 366..394, score: 0.003
	coord: 634..666, score: 9.1E-5
	coord: 296..329, score: 2.0E-4
	coord: 398..429, score: 3.5E-6
	coord: 599..632, score: 6.6E-8
	coord: 570..597, score: 0.0014
	coord: 497..531, score: 7.8E-9
	coord: 195..228, score: 4.
IPR002885	Pentatricopeptide repeat	PROFILE	PS51375	PPR
	coord: 565..595, score: 8.353
	coord: 596..630, score: 11.29
	coord: 263..293, score: 6.423
	coord: 698..728, score: 5.404
	coord: 495..529, score: 12.814
	coord: 364..393, score: 5.952
	coord: 193..227, score: 10.709
	coord: 666..696, score: 6.774
	coord: 631..665, score: 10.348
	coord: 162..192, score: 6.127
	coord: 464..494, score: 8.057
	coord: 732..766, score: 7.837
	coord: 394..428, score: 10.545
	coord: 530..564, score: 8.046
	coord: 294..328, score:
IPR011990	Tetratricopeptide-like helical domain	GENE3D	G3DSA:1.25.40.10
	coord: 465..627, score: 1.3E-10
	coord: 696..752, score: 1.3
IPR011990	Tetratricopeptide-like helical domain	unknown	SSF48452	TPR-like
	coord: 705..752, score: 2.2E-7
	coord: 478..548, score: 2.
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED
	coord: 66..109, score: 1.4E-278
	coord: 465..773, score: 1.4E-278
	coord: 145..429, score: 1.4E