CmoCh03G001870 (gene) Cucurbita moschata (Rifu)

NameCmoCh03G001870
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCmo_Chr03 : 2937199 .. 2939998 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTTTGGAATTCCATCATCAAGTCCCAATTTGACTCAGGTTTGTTCTTATCTGCCATTATGTTGTATAAAAACATGAGGGAGGTGGGAGTTGAGCATGATGGGTTCACGTTTCCGATTCTTAACCATGTCGTTATGTCGATTTGGGTTGATGTAGTCTATGCGGGAATGGTTCATTGTGTTGGAATTCGAATGGGGTTTGGTTCTGATTTGTATTTCTGTAATACCATGATGGAGGTTTATGCGAAATGTGAGTGTTTGGGTCATGCACGCAAAGTGTTTGATGAAATGCCTAATAGAGACTTGGTCTCTTGGACGTCCATGATTTCTGCATATGTTAATGTCGGTGATATTGTTTGTGCTTTGAATCTTTTTGAGGGAATGAGGAGGGTGTTTGAGCCGAATTCGGTGACCATGATGGCAATGCTGCAAGCTTGTTGTGTTACTGAGGATTTGGTTCTGGGAAGGCTGATTCAATGTCTTGTGGTTAAGAATGGTTTATTGTTTGATGTAGGTCTGCAGAATTGGTTCTTACGAATGTATAGTCGACTAGGTGGGGAAGATGAATTTGTACGTTTTTTCTCTGAAATTGATTGCAAGAATGTTGTTTCTTGGGATATTTTGATATCTTTTTACTCCTCCGTGGGGGATATTGTGAAAGCTGTAGATATCTTCAAACAAATCATGGCTGGTGAAGTTCCACTCATCATTGAGACATTAACCATACTTATATCAGCAACAAAGACATCTGATTCCATGTGTCTGATCCTAGGTGAAAATCTACACTCTCTGGCAATTAAAACTGGTCTCTATGATAGCATTCTGCGGACTTCATTGTTGGATATGTATGCCAAGTTTGGGGAGTTGGACAATTCAACTAGGCTATTTAACGAAATTCCTAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAAAATGGACACTTTGATGAGGCAGTAGAGATCTTCAGCCAAATGCAAGCTGCTGGCTTGAAACCCAGCCTTGGAATTTTGAAACACTTAATTGATGCTTACGCCCATTTGGGTGCTCTGCAGCTGGGAAGAGGCATACATTGTTACCTCATCCGAATCTATGGATTGGAGATATGTAATACCCACTTAGAAACGTCTCTTATGAACATGTATGTAAGATGTGGAAGCATTGCTTCTGCTAGAAAATGTTTTGATTTGATCATAGTTAAAGATGTTGTGGCGTGGACGTCCATGATTGAGGGATATGGCTCTCACGGACAAGGTATCAATGCCCTCAATCTATATCATCATATGATGAGTGAAGAAGTGGCCCCAAATAGTGTCACGTTCTTGAGTCTGCTATCTGCTTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATCTTTTATTCAATGAGGTCAAGGTTCAATATTAAGCCTGATTTAGAGCATTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCGATTATATTGAGAATGACAAATCTCTGTGATGGTAGGATTTGGGGTGCTCTTATGGGTGCCTGCCGGGTGTATGGAGACAATAAAATCGCTATCTATGCTGCACACAGGCTTCTTGAATTAGAACCTGATAATGTAGGCTATTATACTCTGTTGAGCAATACACAGGCTAGTGTCGGGCAGTGGCATGAAGTTGAAAAACTACGTAGTGTTGTGTATGAGAAGGATTTTGTCAAGAAACCAGGTTGGAGCTTCGTTGAGTTAAATGGAACACTTCATGGGTTTGTTTCTGGAGATAGATCACACTGCAAGACCGATCAGATTTATGATTTATTGGTATATCTTAATAGGACAGAATAGGGAAAGGGGCCTGTGTGAGTTCCCACATCGGTTGGGGATGCTAGCGGTGGCTTTGGGCTGTTACAAATGGTATTCGAGCTAGACACTGAGCAATGTGCAAGCAAGGAGGTTGAGCCTTGAAGGGGAGTGCACGAGGCAGTGTGTAGCAAGGATGCTAGGCCCCGAAGGGGGGTGGATTGTGAGATCTCTCATCGGTTGGGGAGGAGAACGAAGCTTTCTTTATAAGAGTGTGAAAACCTCTCCCTAGCAAATGTGTTTTAAAAACCTTAAGAAGAAACTCGAAAGCGAAAGTTCAAAAAAACCAATATCTACTAACGTCCCCTTGAATGCTCCTTTCCCCGATACTGATCTTCCATGTTTCGTGTTCGTGTTCGTGTTCGTGTTCGTTGTTGTATCTAAGCTATGCAAATGTTGAGAGAAATTTGTTAAATGGTAGAAGTGAAAACCATGTTCAAATTTCACAGTAGCTACCTACCTAGGGTATTAATATCATGCAAGCAACTAAATGCTGTATGGTCAAATGGTTAACTGAAATGGTTTGGGGGCACTGAGGTTCTTGGCCTAATGCATAGCTTAGGAAGAAACTGGGTCGAAATGGTTGAGGATTATTGAGAAGAGTAGTCACATATCGTGAGTTTATAAGTAAGGAACATTATCTTCATTGATATAAAGCCTTTTGATGAAACCAGAAGGAAAGTCAAGAGAGTTTATGCTCAAAGTGAACAATATCATATCATACCATTGTGAAGGTCGTGATTCCTAAATCACTGAGCTGACCCCTACATCCTAGTTATTCTTCAATCTCCATCTTCAATCTCTATCTTCAATCTCTATCAATGGCAGGAACACCACACTCACAAAAGTATCCACTTTTGATCTTGTTCTAGCTCTATACCTCGTGTACTAATAGATCATTGTTTATGGACTTCTTTTGATCTTGTTCTAG

mRNA sequence

ATGCTTTGGAATTCCATCATCAAGTCCCAATTTGACTCAGGTTTGTTCTTATCTGCCATTATGTTGTATAAAAACATGAGGGAGGTGGGAGTTGAGCATGATGGGTTCACGTTTCCGATTCTTAACCATGTCGTTATGTCGATTTGGGTTGATGTAGTCTATGCGGGAATGGTTCATTGTGTTGGAATTCGAATGGGGTTTGGTTCTGATTTGTATTTCTGTAATACCATGATGGAGGTTTATGCGAAATGTGAGTGTTTGGGTCATGCACGCAAAGTGTTTGATGAAATGCCTAATAGAGACTTGGTCTCTTGGACGTCCATGATTTCTGCATATGTTAATGTCGGTGATATTGTTTGTGCTTTGAATCTTTTTGAGGGAATGAGGAGGGTGTTTGAGCCGAATTCGGTGACCATGATGGCAATGCTGCAAGCTTGTTGTGTTACTGAGGATTTGGTTCTGGGAAGGCTGATTCAATGTCTTGTGGTTAAGAATGGTTTATTGTTTGATGTAGGTCTGCAGAATTGGTTCTTACGAATGTATAGTCGACTAGGTGGGGAAGATGAATTTGTACGTTTTTTCTCTGAAATTGATTGCAAGAATGTTGTTTCTTGGGATATTTTGATATCTTTTTACTCCTCCGTGGGGGATATTGTGAAAGCTGTAGATATCTTCAAACAAATCATGGCTGGTGAAGTTCCACTCATCATTGAGACATTAACCATACTTATATCAGCAACAAAGACATCTGATTCCATGTGTCTGATCCTAGGTGAAAATCTACACTCTCTGGCAATTAAAACTGGTCTCTATGATAGCATTCTGCGGACTTCATTGTTGGATATGTATGCCAAGTTTGGGGAGTTGGACAATTCAACTAGGCTATTTAACGAAATTCCTAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAAAATGGACACTTTGATGAGGCAGTAGAGATCTTCAGCCAAATGCAAGCTGCTGGCTTGAAACCCAGCCTTGGAATTTTGAAACACTTAATTGATGCTTACGCCCATTTGGGTGCTCTGCAGCTGGGAAGAGGCATACATTGTTACCTCATCCGAATCTATGGATTGGAGATATGTAATACCCACTTAGAAACGTCTCTTATGAACATGTATGTAAGATGTGGAAGCATTGCTTCTGCTAGAAAATGTTTTGATTTGATCATAGTTAAAGATGTTGTGGCGTGGACGTCCATGATTGAGGGATATGGCTCTCACGGACAAGGTATCAATGCCCTCAATCTATATCATCATATGATGAGTGAAGAAGTGGCCCCAAATAGTGTCACGTTCTTGAGTCTGCTATCTGCTTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATCTTTTATTCAATGAGGTCAAGGTTCAATATTAAGCCTGATTTAGAGCATTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCGATTATATTGAGAATGACAAATCTCTGTGATGGTAGGATTTGGGGTGCTCTTATGGGTGCCTGCCGGGTGTATGGAGACAATAAAATCGCTATCTATGCTGCACACAGGCTTCTTGAATTAGAACCTGATAATGTAGGCTATTATACTCTGTTGAGCAATACACAGGCTAGTGTCGGGCAGTGGCATGAAGTTGAAAAACTACGTAGTGTTGTGTATGAGAAGGATTTTGTCAAGAAACCAGGTTGGAGCTTCGTTGAGTTAAATGGAACACTTCATGGGTTTGTTTCTGGAGATAGATCACACTGCAAGACCGATCAGATTTATGATTTATTGATCATTGTTTATGGACTTCTTTTGATCTTGTTCTAG

Coding sequence (CDS)

ATGCTTTGGAATTCCATCATCAAGTCCCAATTTGACTCAGGTTTGTTCTTATCTGCCATTATGTTGTATAAAAACATGAGGGAGGTGGGAGTTGAGCATGATGGGTTCACGTTTCCGATTCTTAACCATGTCGTTATGTCGATTTGGGTTGATGTAGTCTATGCGGGAATGGTTCATTGTGTTGGAATTCGAATGGGGTTTGGTTCTGATTTGTATTTCTGTAATACCATGATGGAGGTTTATGCGAAATGTGAGTGTTTGGGTCATGCACGCAAAGTGTTTGATGAAATGCCTAATAGAGACTTGGTCTCTTGGACGTCCATGATTTCTGCATATGTTAATGTCGGTGATATTGTTTGTGCTTTGAATCTTTTTGAGGGAATGAGGAGGGTGTTTGAGCCGAATTCGGTGACCATGATGGCAATGCTGCAAGCTTGTTGTGTTACTGAGGATTTGGTTCTGGGAAGGCTGATTCAATGTCTTGTGGTTAAGAATGGTTTATTGTTTGATGTAGGTCTGCAGAATTGGTTCTTACGAATGTATAGTCGACTAGGTGGGGAAGATGAATTTGTACGTTTTTTCTCTGAAATTGATTGCAAGAATGTTGTTTCTTGGGATATTTTGATATCTTTTTACTCCTCCGTGGGGGATATTGTGAAAGCTGTAGATATCTTCAAACAAATCATGGCTGGTGAAGTTCCACTCATCATTGAGACATTAACCATACTTATATCAGCAACAAAGACATCTGATTCCATGTGTCTGATCCTAGGTGAAAATCTACACTCTCTGGCAATTAAAACTGGTCTCTATGATAGCATTCTGCGGACTTCATTGTTGGATATGTATGCCAAGTTTGGGGAGTTGGACAATTCAACTAGGCTATTTAACGAAATTCCTAATAGGAGCATCATTACTTGGGGGGCCATGATGTCTAGTTTTATTCAAAATGGACACTTTGATGAGGCAGTAGAGATCTTCAGCCAAATGCAAGCTGCTGGCTTGAAACCCAGCCTTGGAATTTTGAAACACTTAATTGATGCTTACGCCCATTTGGGTGCTCTGCAGCTGGGAAGAGGCATACATTGTTACCTCATCCGAATCTATGGATTGGAGATATGTAATACCCACTTAGAAACGTCTCTTATGAACATGTATGTAAGATGTGGAAGCATTGCTTCTGCTAGAAAATGTTTTGATTTGATCATAGTTAAAGATGTTGTGGCGTGGACGTCCATGATTGAGGGATATGGCTCTCACGGACAAGGTATCAATGCCCTCAATCTATATCATCATATGATGAGTGAAGAAGTGGCCCCAAATAGTGTCACGTTCTTGAGTCTGCTATCTGCTTGTAGCCACTCTGGCCTTGTAAGTGAGGGCTGTGAAATCTTTTATTCAATGAGGTCAAGGTTCAATATTAAGCCTGATTTAGAGCATTACACTTGTTTTGTTGATCTTTTGAGTAGATCAACAAGAGTAAGAGAGGCCTTTGCGATTATATTGAGAATGACAAATCTCTGTGATGGTAGGATTTGGGGTGCTCTTATGGGTGCCTGCCGGGTGTATGGAGACAATAAAATCGCTATCTATGCTGCACACAGGCTTCTTGAATTAGAACCTGATAATGTAGGCTATTATACTCTGTTGAGCAATACACAGGCTAGTGTCGGGCAGTGGCATGAAGTTGAAAAACTACGTAGTGTTGTGTATGAGAAGGATTTTGTCAAGAAACCAGGTTGGAGCTTCGTTGAGTTAAATGGAACACTTCATGGGTTTGTTTCTGGAGATAGATCACACTGCAAGACCGATCAGATTTATGATTTATTGATCATTGTTTATGGACTTCTTTTGATCTTGTTCTAG
BLAST of CmoCh03G001870 vs. Swiss-Prot
Match: PP350_ARATH (Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H27 PE=3 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 1.1e-107
Identity = 216/617 (35.01%), Postives = 354/617 (57.37%), Query Frame = 1

Query: 2   LWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCV 61
           LWN +IK     GL++ A+  Y  M   GV+ D FT+P +   V  I   +     +H +
Sbjct: 97  LWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVIKSVAGI-SSLEEGKKIHAM 156

Query: 62  GIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCA 121
            I++GF SD+Y CN+++ +Y K  C   A KVF+EMP RD+VSW SMIS Y+ +GD   +
Sbjct: 157 VIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPERDIVSWNSMISGYLALGDGFSS 216

Query: 122 LNLFEGMRRV-FEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGL-LFDVGLQNWFLR 181
           L LF+ M +  F+P+  + M+ L AC       +G+ I C  V++ +   DV +    L 
Sbjct: 217 LMLFKEMLKCGFKPDRFSTMSALGACSHVYSPKMGKEIHCHAVRSRIETGDVMVMTSILD 276

Query: 182 MYSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMA--GEVPLII 241
           MYS+ G      R F+ +  +N+V+W+++I  Y+  G +  A   F+++    G  P +I
Sbjct: 277 MYSKYGEVSYAERIFNGMIQRNIVAWNVMIGCYARNGRVTDAFLCFQKMSEQNGLQPDVI 336

Query: 242 ETLTILISATKTSDSMCLILGENLHSLAIKTG-LYDSILRTSLLDMYAKFGELDNSTRLF 301
            ++ +L ++        ++ G  +H  A++ G L   +L T+L+DMY + G+L ++  +F
Sbjct: 337 TSINLLPASA-------ILEGRTIHGYAMRRGFLPHMVLETALIDMYGECGQLKSAEVIF 396

Query: 302 NEIPNRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQ 361
           + +  +++I+W +++++++QNG    A+E+F ++  + L P    +  ++ AYA   +L 
Sbjct: 397 DRMAEKNVISWNSIIAAYVQNGKNYSALELFQELWDSSLVPDSTTIASILPAYAESLSLS 456

Query: 362 LGRGIHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEG 421
            GR IH Y+++       NT +  SL++MY  CG +  ARKCF+ I++KDVV+W S+I  
Sbjct: 457 EGREIHAYIVK--SRYWSNTIILNSLVHMYAMCGDLEDARKCFNHILLKDVVSWNSIIMA 516

Query: 422 YGSHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKP 481
           Y  HG G  ++ L+  M++  V PN  TF SLL+ACS SG+V EG E F SM+  + I P
Sbjct: 517 YAVHGFGRISVWLFSEMIASRVNPNKSTFASLLAACSISGMVDEGWEYFESMKREYGIDP 576

Query: 482 DLEHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRL 541
            +EHY C +DL+ R+     A   +  M  +   RIWG+L+ A R + D  IA +AA ++
Sbjct: 577 GIEHYGCMLDLIGRTGNFSAAKRFLEEMPFVPTARIWGSLLNASRNHKDITIAEFAAEQI 636

Query: 542 LELEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSG 601
            ++E DN G Y LL N  A  G+W +V +++ ++  K   +    S VE  G  H F +G
Sbjct: 637 FKMEHDNTGCYVLLLNMYAEAGRWEDVNRIKLLMESKGISRTSSRSTVEAKGKSHVFTNG 696

Query: 602 DRSHCKTDQIYDLLIIV 614
           DRSH  T++IY++L +V
Sbjct: 697 DRSHVATNKIYEVLDVV 703

BLAST of CmoCh03G001870 vs. Swiss-Prot
Match: PP210_ARATH (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 355.9 bits (912), Expect = 8.9e-97
Identity = 202/617 (32.74%), Postives = 345/617 (55.92%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCVG 62
           WNS+I      G +  A+ +Y  ++   +  D FT   +     ++ V     G+ H   
Sbjct: 175 WNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGL-HGFA 234

Query: 63  IRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCAL 122
           ++ G  S +   N ++ +Y K      AR+VFDEM  RD VS+ +MI  Y+ +  +  ++
Sbjct: 235 LKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESV 294

Query: 123 NLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRMYS 182
            +F      F+P+ +T+ ++L+AC    DL L + I   ++K G + +  ++N  + +Y+
Sbjct: 295 RMFLENLDQFKPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYA 354

Query: 183 RLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETLTI 242
           + G        F+ ++CK+ VSW+ +IS Y   GD+++A+ +FK +M  E      T  +
Sbjct: 355 KCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLM 414

Query: 243 LIS-ATKTSDSMCLILGENLHSLAIKTGL-YDSILRTSLLDMYAKFGELDNSTRLFNEIP 302
           LIS +T+ +D   L  G+ LHS  IK+G+  D  +  +L+DMYAK GE+ +S ++F+ + 
Sbjct: 415 LISVSTRLAD---LKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMG 474

Query: 303 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 362
               +TW  ++S+ ++ G F   +++ +QM+ + + P +      +   A L A +LG+ 
Sbjct: 475 TGDTVTWNTVISACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKE 534

Query: 363 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 422
           IHC L+R +G E     +  +L+ MY +CG + ++ + F+ +  +DVV WT MI  YG +
Sbjct: 535 IHCCLLR-FGYE-SELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMY 594

Query: 423 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 482
           G+G  AL  +  M    + P+SV F++++ ACSHSGLV EG   F  M++ + I P +EH
Sbjct: 595 GEGEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEH 654

Query: 483 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 542
           Y C VDLLSRS ++ +A   I  M    D  IW +++ ACR  GD + A   + R++EL 
Sbjct: 655 YACVVDLLSRSQKISKAEEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELN 714

Query: 543 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDRSH 602
           PD+ GY  L SN  A++ +W +V  +R  + +K   K PG+S++E+   +H F SGD S 
Sbjct: 715 PDDPGYSILASNAYAALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSA 774

Query: 603 CKTDQIYDLLIIVYGLL 618
            +++ IY  L I+Y L+
Sbjct: 775 PQSEAIYKSLEILYSLM 785

BLAST of CmoCh03G001870 vs. Swiss-Prot
Match: PP205_ARATH (Putative pentatricopeptide repeat-containing protein At3g01580 OS=Arabidopsis thaliana GN=PCMP-E87 PE=3 SV=2)

HSP 1 Score: 350.9 bits (899), Expect = 2.9e-95
Identity = 210/615 (34.15%), Postives = 333/615 (54.15%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVH-CV 62
           WN+++KS      +   +  + +M     + D FT P+       +  +V Y  M+H  V
Sbjct: 28  WNTLLKSLSREKQWEEVLYHFSHMFRDEEKPDNFTLPVALKACGELR-EVNYGEMIHGFV 87

Query: 63  GIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCA 122
              +  GSDLY  ++++ +Y KC  +  A ++FDE+   D+V+W+SM+S +   G    A
Sbjct: 88  KKDVTLGSDLYVGSSLIYMYIKCGRMIEALRMFDELEKPDIVTWSSMVSGFEKNGSPYQA 147

Query: 123 LNLFEGMRRVFE--PNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLR 182
           +  F  M    +  P+ VT++ ++ AC    +  LGR +   V++ G   D+ L N  L 
Sbjct: 148 VEFFRRMVMASDVTPDRVTLITLVSACTKLSNSRLGRCVHGFVIRRGFSNDLSLVNSLLN 207

Query: 183 MYSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMA-GEVPLIIE 242
            Y++     E V  F  I  K+V+SW  +I+ Y   G   +A+ +F  +M  G  P +  
Sbjct: 208 CYAKSRAFKEAVNLFKMIAEKDVISWSTVIACYVQNGAAAEALLVFNDMMDDGTEPNVAT 267

Query: 243 TLTILISATKTSDSMCLILGENLHSLAIKTGLYDSI-LRTSLLDMYAKFGELDNSTRLFN 302
            L +L +     D   L  G   H LAI+ GL   + + T+L+DMY K    + +  +F+
Sbjct: 268 VLCVLQACAAAHD---LEQGRKTHELAIRKGLETEVKVSTALVDMYMKCFSPEEAYAVFS 327

Query: 303 EIPNRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAG-LKPSLGILKHLIDAYAHLGALQ 362
            IP + +++W A++S F  NG    ++E FS M      +P   ++  ++ + + LG L+
Sbjct: 328 RIPRKDVVSWVALISGFTLNGMAHRSIEEFSIMLLENNTRPDAILMVKVLGSCSELGFLE 387

Query: 363 LGRGIHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEG 422
             +  H Y+I+ YG +  N  +  SL+ +Y RCGS+ +A K F+ I +KD V WTS+I G
Sbjct: 388 QAKCFHSYVIK-YGFD-SNPFIGASLVELYSRCGSLGNASKVFNGIALKDTVVWTSLITG 447

Query: 423 YGSHGQGINALNLYHHMM-SEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIK 482
           YG HG+G  AL  ++HM+ S EV PN VTFLS+LSACSH+GL+ EG  IF  M + + + 
Sbjct: 448 YGIHGKGTKALETFNHMVKSSEVKPNEVTFLSILSACSHAGLIHEGLRIFKLMVNDYRLA 507

Query: 483 PDLEHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHR 542
           P+LEHY   VDLL R   +  A  I  RM      +I G L+GACR++ + ++A   A +
Sbjct: 508 PNLEHYAVLVDLLGRVGDLDTAIEITKRMPFSPTPQILGTLLGACRIHQNGEMAETVAKK 567

Query: 543 LLELEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVS 602
           L ELE ++ GYY L+SN     G+W  VEKLR+ V ++   K    S +E+   +H FV+
Sbjct: 568 LFELESNHAGYYMLMSNVYGVKGEWENVEKLRNSVKQRGIKKGLAESLIEIRRKVHRFVA 627

Query: 603 GDRSHCKTDQIYDLL 611
            D  H + + +Y LL
Sbjct: 628 DDELHPEKEPVYGLL 636

BLAST of CmoCh03G001870 vs. Swiss-Prot
Match: PP111_ARATH (Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS=Arabidopsis thaliana GN=PCMP-E66 PE=3 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 5.4e-94
Identity = 193/610 (31.64%), Postives = 346/610 (56.72%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCVG 62
           W++++ S  ++G  + A+ ++K M + GVE D  T   +      +   +  A  VH   
Sbjct: 170 WSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGC-LRIARSVHGQI 229

Query: 63  IRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCAL 122
            R  F  D   CN+++ +Y+KC  L  + ++F+++  ++ VSWT+MIS+Y        AL
Sbjct: 230 TRKMFDLDETLCNSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYNRGEFSEKAL 289

Query: 123 NLFEGM-RRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDV-GLQNWFLRM 182
             F  M +   EPN VT+ ++L +C +   +  G+ +    V+  L  +   L    + +
Sbjct: 290 RSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVEL 349

Query: 183 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 242
           Y+  G   +       +  +N+V+W+ LIS Y+  G +++A+ +F+Q++   +     TL
Sbjct: 350 YAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTL 409

Query: 243 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 302
              ISA + +    + LG+ +H   I+T + D  ++ SL+DMY+K G +D+++ +FN+I 
Sbjct: 410 ASSISACENAG--LVPLGKQIHGHVIRTDVSDEFVQNSLIDMYSKSGSVDSASTVFNQIK 469

Query: 303 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 362
           +RS++TW +M+  F QNG+  EA+ +F  M  + L+ +      +I A + +G+L+ G+ 
Sbjct: 470 HRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKW 529

Query: 363 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 422
           +H  LI I GL+  +   +T+L++MY +CG + +A   F  +  + +V+W+SMI  YG H
Sbjct: 530 VHHKLI-ISGLK--DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMH 589

Query: 423 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 482
           G+  +A++ ++ M+     PN V F+++LSAC HSG V EG + ++++   F + P+ EH
Sbjct: 590 GRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEG-KYYFNLMKSFGVSPNSEH 649

Query: 483 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 542
           + CF+DLLSRS  ++EA+  I  M  L D  +WG+L+  CR++    I     + L ++ 
Sbjct: 650 FACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIV 709

Query: 543 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDRSH 602
            D+ GYYTLLSN  A  G+W E  +LRS +   +  K PG+S +E++  +  F +G+ + 
Sbjct: 710 TDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGEENR 769

Query: 603 CKTDQIYDLL 611
            +TD+IY  L
Sbjct: 770 IQTDEIYRFL 772

BLAST of CmoCh03G001870 vs. Swiss-Prot
Match: PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN=DYW9 PE=2 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 2.0e-93
Identity = 189/557 (33.93%), Postives = 315/557 (56.55%), Query Frame = 1

Query: 57  MVHCVGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVG 116
           ++H   +  G  S+L   + ++++Y K   +  ARKVFD MP +D + W +MIS Y    
Sbjct: 140 VIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNTMISGYRKNE 199

Query: 117 DIVCALNLFEGM--RRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQ 176
             V ++ +F  +        ++ T++ +L A    ++L LG  I  L  K G      + 
Sbjct: 200 MYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVL 259

Query: 177 NWFLRMYSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVP 236
             F+ +YS+ G        F E    ++V+++ +I  Y+S G+   ++ +FK++M     
Sbjct: 260 TGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGAR 319

Query: 237 LIIETLTILISATKTSDSMCLILGENLHSLAIKTG-LYDSILRTSLLDMYAKFGELDNST 296
           L   TL   +S    S  + LI    +H   +K+  L  + + T+L  +Y+K  E++++ 
Sbjct: 320 LRSSTL---VSLVPVSGHLMLIYA--IHGYCLKSNFLSHASVSTALTTVYSKLNEIESAR 379

Query: 297 RLFNEIPNRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLG 356
           +LF+E P +S+ +W AM+S + QNG  ++A+ +F +MQ +   P+   +  ++ A A LG
Sbjct: 380 KLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSACAQLG 439

Query: 357 ALQLGRGIHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSM 416
           AL LG+ +H  L+R    E  + ++ T+L+ MY +CGSIA AR+ FDL+  K+ V W +M
Sbjct: 440 ALSLGKWVHD-LVRSTDFE-SSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTM 499

Query: 417 IEGYGSHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFN 476
           I GYG HGQG  ALN+++ M++  + P  VTFL +L ACSH+GLV EG EIF SM  R+ 
Sbjct: 500 ISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYG 559

Query: 477 IKPDLEHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAA 536
            +P ++HY C VD+L R+  ++ A   I  M+      +W  L+GACR++ D  +A   +
Sbjct: 560 FEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVS 619

Query: 537 HRLLELEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGF 596
            +L EL+PDNVGY+ LLSN  ++   + +   +R    ++   K PG++ +E+  T H F
Sbjct: 620 EKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVF 679

Query: 597 VSGDRSHCKTDQIYDLL 611
            SGD+SH +  +IY+ L
Sbjct: 680 TSGDQSHPQVKEIYEKL 689

BLAST of CmoCh03G001870 vs. TrEMBL
Match: M5VVK0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002924mg PE=4 SV=1)

HSP 1 Score: 695.3 bits (1793), Expect = 6.9e-197
Identity = 351/613 (57.26%), Postives = 448/613 (73.08%), Query Frame = 1

Query: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60
           ML N IIKS  DSGL  SA++LYK M E+GV HD FTFPI+N  V+ +  D  Y+GMVHC
Sbjct: 1   MLSNLIIKSHVDSGLLGSALLLYKKMLELGVSHDCFTFPIVNRAVLLLGSDATYSGMVHC 60

Query: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVC 120
           V I+MGFG D+Y  NTM++ Y KC  L +ARK+FDEM  RDLV+WTSMIS YV+ G++ C
Sbjct: 61  VAIQMGFGMDVYVGNTMIDAYVKCGRLDYARKLFDEMRQRDLVTWTSMISGYVSEGNVAC 120

Query: 121 ALNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180
             +LF  MRR  EPN+VTM+ MLQ CC  E  V G  +    +K+GLL D  +QN   +M
Sbjct: 121 GFSLFSEMRRELEPNAVTMLVMLQGCCDIEISVYGEPLHGYGIKSGLLNDGSVQNSIFKM 180

Query: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240
           Y++LG  D+   FF E+D ++VVSW+I ISFYS  GD+VK  D+F + M GEV    ETL
Sbjct: 181 YAKLGTVDQVEDFFGELDRRDVVSWNIRISFYSWRGDVVKVRDLFHE-MQGEVAPSNETL 240

Query: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300
           T++ISA        L  GE+LH LA K+GL D +L+TSLLD YAK GEL NS +LF EIP
Sbjct: 241 TLVISAVTKHG--ILSQGESLHCLATKSGLCDDVLQTSLLDFYAKCGELGNSDKLFREIP 300

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360
           +R+ ITWGAMM  FI NG+F+EAV +F +MQA G++P   IL+ L+DA+A++GAL+LG+G
Sbjct: 301 HRNSITWGAMMFGFILNGYFNEAVGLFGRMQAEGVEPGAEILRSLVDAFANIGALKLGKG 360

Query: 361 IHCYLIRIYGLEI--CNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 420
           IH  +IR    E+  CNTHLETSL+NMYVRCGSI+ AR CF  ++++D+VAWTSMIEGYG
Sbjct: 361 IHGCIIRKSFCEVKKCNTHLETSLINMYVRCGSISMARVCFSRMLIRDIVAWTSMIEGYG 420

Query: 421 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 480
           SHG G+ AL L+  M+ E   PNSVT LSLLSACSHSGLV+EGCE F SM+ +F I+PDL
Sbjct: 421 SHGLGLEALKLFDLMIREGTKPNSVTLLSLLSACSHSGLVTEGCEAFCSMKWKFGIEPDL 480

Query: 481 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLE 540
           +HYT  VDLL RS +++EA  +I++M    D RIWGAL+   R+YG   +  +AA RLLE
Sbjct: 481 DHYTSIVDLLGRSGKLKEALVVIMKMVIFPDSRIWGALLSGSRIYGRRDVGEFAAQRLLE 540

Query: 541 LEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVEL-NGTLHGFVSGD 600
           LEPDNVGYYTLLSN QASVG+W EVE++R V+ E+D  KKPGWS +E   G ++GFVSGD
Sbjct: 541 LEPDNVGYYTLLSNAQASVGEWDEVEEIRRVMKERDLKKKPGWSCIEAEEGRIYGFVSGD 600

Query: 601 RSHCKTDQIYDLL 611
           RSH + + +Y++L
Sbjct: 601 RSHHQMEAVYEVL 610

BLAST of CmoCh03G001870 vs. TrEMBL
Match: A0A0L9TUF7_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan02g028600 PE=4 SV=1)

HSP 1 Score: 637.1 bits (1642), Expect = 2.2e-179
Identity = 328/612 (53.59%), Postives = 429/612 (70.10%), Query Frame = 1

Query: 2   LWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCV 61
           +WN I++S  D GLF S + +YK MR+ GV HD FTFP+LN  + S+  DVVY  M+HCV
Sbjct: 1   MWNLIMRSHVDLGLFHSVLSVYKKMRQKGVPHDTFTFPLLNRALSSMRADVVYGKMIHCV 60

Query: 62  GIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCA 121
             +MG   DLYFCNTM++VY KC C+  AR++FDE+  RD+VSWT MI+ YV+   +  A
Sbjct: 61  ATKMGLDGDLYFCNTMIDVYVKCGCIACARRMFDEISLRDVVSWTLMIAGYVSQRLVSVA 120

Query: 122 LNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRMY 181
             LF  MR   EPNSVT++ MLQA C +  L  G  I    +K+GLL D  ++N  LRMY
Sbjct: 121 FRLFNKMRMELEPNSVTLIVMLQAPCASIKLSEGTQIHGYALKSGLLMDWSVKNSVLRMY 180

Query: 182 SRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLI-IETL 241
              GG  E    F E++ K+VVSW+ILISFYSS GD  +   + K + + EV +  IETL
Sbjct: 181 GSKGGTREVELLFGEVNMKDVVSWNILISFYSSEGDATRVAGLLKAMQSLEVHVWNIETL 240

Query: 242 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 301
           T++ SA   S S  L  GE +H L +KTG  D +  TSLLD YAK G+L+ S  LF+EI 
Sbjct: 241 TLVTSAFAKSGS--LSEGEGVHCLVVKTGFSDDVWLTSLLDFYAKCGKLETSVLLFSEIH 300

Query: 302 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 361
           ++S ITW AMMS FIQNG F EA+ +F +MQA        I ++L+DAYA+LGAL+LG+ 
Sbjct: 301 SKSKITWCAMMSGFIQNGSFMEAIVLFQRMQAEDFNVVPEIWRNLLDAYANLGALKLGKE 360

Query: 362 IHCYLIR-IYGLEICNT-HLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 421
           +H YLI+ ++   I N+ HLETS++NMY+R GS++SA+ CFD++ VKDVVAWT+MI+G G
Sbjct: 361 VHGYLIKNLFNGAIENSVHLETSILNMYLRGGSMSSAKTCFDMMSVKDVVAWTTMIDGLG 420

Query: 422 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 481
           SHG G +AL  ++ M+ + V PNSVTFLSLLSACSHSGLVSEGC I++SM+  F I+P L
Sbjct: 421 SHGFGFDALKYFNLMIEQRVQPNSVTFLSLLSACSHSGLVSEGCNIYHSMKWGFGIEPTL 480

Query: 482 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLE 541
           +H+TC VDL  R   ++EA AII +M  L D +IW AL+ A RVYG+ K   YAA RLLE
Sbjct: 481 DHHTCIVDLFGRCGMLKEALAIIFKMVILPDSKIWSALLAASRVYGNKKFGEYAAQRLLE 540

Query: 542 LEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDR 601
           LEPDN GYYTLLSN +ASVG+W EVEKLR  + E+D  KKPGWS +E+ G++ GFVSGD+
Sbjct: 541 LEPDNAGYYTLLSNVKASVGRWEEVEKLRRDMRERDLKKKPGWSCIEVAGSIRGFVSGDK 600

Query: 602 SHCKTDQIYDLL 611
           SH + ++IY+ L
Sbjct: 601 SHPEAEEIYEAL 610

BLAST of CmoCh03G001870 vs. TrEMBL
Match: A0A0S3SU84_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G334900 PE=4 SV=1)

HSP 1 Score: 637.1 bits (1642), Expect = 2.2e-179
Identity = 328/612 (53.59%), Postives = 429/612 (70.10%), Query Frame = 1

Query: 2   LWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCV 61
           +WN I++S  D GLF S + +YK MR+ GV HD FTFP+LN  + S+  DVVY  M+HCV
Sbjct: 1   MWNLIMRSHVDLGLFHSVLSVYKKMRQKGVPHDTFTFPLLNRALSSMRADVVYGKMIHCV 60

Query: 62  GIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCA 121
             +MG   DLYFCNTM++VY KC C+  AR++FDE+  RD+VSWT MI+ YV+   +  A
Sbjct: 61  ATKMGLDGDLYFCNTMIDVYVKCGCIACARRMFDEISLRDVVSWTLMIAGYVSQRLVSVA 120

Query: 122 LNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRMY 181
             LF  MR   EPNSVT++ MLQA C +  L  G  I    +K+GLL D  ++N  LRMY
Sbjct: 121 FRLFNKMRMELEPNSVTLIVMLQAPCASIKLSEGTQIHGYALKSGLLMDWSVKNSVLRMY 180

Query: 182 SRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLI-IETL 241
              GG  E    F E++ K+VVSW+ILISFYSS GD  +   + K + + EV +  IETL
Sbjct: 181 GSKGGTREVELLFGEVNMKDVVSWNILISFYSSEGDATRVAGLLKAMQSLEVHVWNIETL 240

Query: 242 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 301
           T++ SA   S S  L  GE +H L +KTG  D +  TSLLD YAK G+L+ S  LF+EI 
Sbjct: 241 TLVTSAFAKSGS--LSEGEGVHCLVVKTGFSDDVWLTSLLDFYAKCGKLETSVLLFSEIH 300

Query: 302 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 361
           ++S ITW AMMS FIQNG F EA+ +F +MQA        I ++L+DAYA+LGAL+LG+ 
Sbjct: 301 SKSKITWCAMMSGFIQNGSFMEAIVLFQRMQAEDFNVVPEIWRNLLDAYANLGALKLGKE 360

Query: 362 IHCYLIR-IYGLEICNT-HLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 421
           +H YLI+ ++   I N+ HLETS++NMY+R GS++SA+ CFD++ VKDVVAWT+MI+G G
Sbjct: 361 VHGYLIKNLFNGAIENSVHLETSILNMYLRGGSMSSAKTCFDMMSVKDVVAWTTMIDGLG 420

Query: 422 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 481
           SHG G +AL  ++ M+ + V PNSVTFLSLLSACSHSGLVSEGC I++SM+  F I+P L
Sbjct: 421 SHGFGFDALKYFNLMIEQRVQPNSVTFLSLLSACSHSGLVSEGCNIYHSMKWGFGIEPTL 480

Query: 482 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLE 541
           +H+TC VDL  R   ++EA AII +M  L D +IW AL+ A RVYG+ K   YAA RLLE
Sbjct: 481 DHHTCIVDLFGRCGMLKEALAIIFKMVILPDSKIWSALLAASRVYGNKKFGEYAAQRLLE 540

Query: 542 LEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDR 601
           LEPDN GYYTLLSN +ASVG+W EVEKLR  + E+D  KKPGWS +E+ G++ GFVSGD+
Sbjct: 541 LEPDNAGYYTLLSNVKASVGRWEEVEKLRRDMRERDLKKKPGWSCIEVAGSIRGFVSGDK 600

Query: 602 SHCKTDQIYDLL 611
           SH + ++IY+ L
Sbjct: 601 SHPEAEEIYEAL 610

BLAST of CmoCh03G001870 vs. TrEMBL
Match: B9T607_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0160700 PE=4 SV=1)

HSP 1 Score: 634.4 bits (1635), Expect = 1.4e-178
Identity = 323/595 (54.29%), Postives = 414/595 (69.58%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCVG 62
           WN I+++  D GL   A++LYK MRE GV+ D FTFP +N  VMS+  DV+   MVHC  
Sbjct: 96  WNLIMRTHLDFGLVTEALLLYKKMRESGVKTDAFTFPTINRAVMSLKSDVLLGKMVHCDA 155

Query: 63  IRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCAL 122
           +++GFG DLYFCNTM+EVYA+C C+ + R +FDEM  RDLVSWTSMIS YV+ G++  A 
Sbjct: 156 MKLGFGYDLYFCNTMIEVYARCGCVYYGRVMFDEMSPRDLVSWTSMISGYVSEGNVFSAF 215

Query: 123 NLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRMYS 182
            LF  MR   EPNSVT++ ML+ C   ++   GR + C ++KNGLL    +QN  LRMYS
Sbjct: 216 ELFNKMRLEMEPNSVTLIVMLKGCYAYDNFSEGRQLHCYIIKNGLLIYGSVQNSILRMYS 275

Query: 183 RLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETLTI 242
             G   E    F EI  ++V+SW+ LI FY+  GD  + V  F Q M GEV L  ETLT+
Sbjct: 276 ITGSAKEVESLFVEIYRRDVISWNTLIGFYALRGDAEEMVCGFNQ-MRGEVALSSETLTL 335

Query: 243 LISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIPNR 302
           +IS      +  L+ GE LHS +IK GL D +L  SLLD YAK GEL NS +LF EIP R
Sbjct: 336 VISVFAKIGN--LVEGEKLHSFSIKVGLCDDVLLASLLDFYAKCGELRNSVQLFGEIPCR 395

Query: 303 SIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRGIH 362
           S  TW  MMS  IQNG+FDEA+ +F QMQA+G++    IL  L+DA +HLG+LQL + IH
Sbjct: 396 SSSTWKLMMSGCIQNGYFDEAIHLFRQMQASGVQLQAQILGSLVDACSHLGSLQLCKEIH 455

Query: 363 CYLIR--IYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 422
            YL R   Y LE  N HL TS++NMY+RCGSI+SAR+ F+ ++ KD + WTSMIEGYG H
Sbjct: 456 GYLTRNFFYILEGDNIHLGTSILNMYIRCGSISSAREYFNRMVAKDNITWTSMIEGYGIH 515

Query: 423 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 482
           G  I AL L++ M+ E V PN VTFLSLLSACSHSGL+ +GCE+F SM+  F ++PDL+H
Sbjct: 516 GMAIEALKLFNQMLVERVLPNRVTFLSLLSACSHSGLIRQGCELFLSMKWVFGMEPDLDH 575

Query: 483 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 542
           YTC VDLL R  +++EA A+I+RM  + D RIWGAL+ +CRV+GD K+  +AA RLLE+E
Sbjct: 576 YTCMVDLLGRCGKIKEALAMIIRMVVVADSRIWGALVASCRVHGDKKVGEFAAQRLLEME 635

Query: 543 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVS 596
            DNVGYYTLLSN QA VG+W EVE++R V++EKD  K PGWS +   G  + F+S
Sbjct: 636 SDNVGYYTLLSNIQAMVGKWDEVEQVRKVIHEKDLRKTPGWSCIVGKGRNYCFIS 687

BLAST of CmoCh03G001870 vs. TrEMBL
Match: A0A061FB60_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_033552 PE=4 SV=1)

HSP 1 Score: 625.2 bits (1611), Expect = 8.7e-176
Identity = 317/612 (51.80%), Postives = 426/612 (69.61%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCVG 62
           WN IIKS  D G    A+ LY+ MR+ GV+HD FTFPI+N  V SI  D  +A ++HCV 
Sbjct: 61  WNLIIKSHVDFGYIEKALFLYRKMRKEGVKHDRFTFPIINRAVRSINADAEFAKLIHCVA 120

Query: 63  IRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCAL 122
           ++MGFG DLYF NTM+E+Y KC C  +A K+FDEM  RDLV+WTSMIS     G++  A 
Sbjct: 121 VKMGFGFDLYFGNTMVEIYGKCGCFSNAYKMFDEMFERDLVTWTSMISGCFYEGNVAEAF 180

Query: 123 NLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRMYS 182
            LF+ MR   EPN+VT++ +LQ C      + G+     V+K+G+L D  + N  L+MY+
Sbjct: 181 TLFKKMRLEMEPNAVTVIVLLQGCSRWGSFIGGKQTHGYVIKSGVLADGSVLNSVLKMYT 240

Query: 183 RLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETLTI 242
            +G  +E   FF EI  +++VSW+ LIS+YS  GD+ +  D F ++   EV + +ETLT+
Sbjct: 241 TMGSVEEVETFFREIFQRDIVSWNTLISYYSLRGDVGEVADRFCKMQV-EVKVSMETLTL 300

Query: 243 LISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIPNR 302
           +ISA   S +  L  GE LH  A+K GL+D +L+TSLLD YAK G L NS +LF  I +R
Sbjct: 301 VISAFAKSGN--LSQGEILHCCALKLGLHDDVLQTSLLDFYAKCGLLKNSIQLFKGISSR 360

Query: 303 SIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRGIH 362
           + I W AM+S +IQNG F EA+ +F +MQAAGL P+  IL +++ A AH+GAL++G+ +H
Sbjct: 361 NSIAWSAMLSGYIQNGFFKEAIVLFKEMQAAGLHPTPEILGNIVHACAHVGALEVGKEMH 420

Query: 363 CYLIRIY----GLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 422
            Y I+        E     LETS++NMY+R GSI+SAR CF+ ++VKD+VAWTSMIEGYG
Sbjct: 421 GYSIKNMFHSPKKEGTYLELETSILNMYIRNGSISSARACFNRMLVKDIVAWTSMIEGYG 480

Query: 423 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 482
            HG G +AL L+  M+ E   PN VTFLSLLSACSHSGLVSEGC +FYSM+ RF+I+PDL
Sbjct: 481 IHGLGSDALKLFDQMVEEGATPNCVTFLSLLSACSHSGLVSEGCYVFYSMKWRFSIEPDL 540

Query: 483 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLE 542
           +HYTC VDLL R+ +++EA A I++M    D RIWGAL+   RV+G  K+  YAA RLLE
Sbjct: 541 DHYTCMVDLLGRAGKLKEALATIMKMLAFPDSRIWGALLAGSRVHGHKKVGEYAAQRLLE 600

Query: 543 LEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDR 602
           LE DNVGY+TLLSN QAS GQW EVE++R  ++EK+  K+PGWS++  N  +H FV GD+
Sbjct: 601 LESDNVGYHTLLSNVQASTGQWAEVEEVRRAMFEKNLKKQPGWSYIAENKHIHCFVCGDK 660

Query: 603 SHCKTDQIYDLL 611
           SH + ++IY++L
Sbjct: 661 SHNQVEEIYEVL 669

BLAST of CmoCh03G001870 vs. TAIR10
Match: AT4G35130.1 (AT4G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 392.1 bits (1006), Expect = 6.3e-109
Identity = 216/617 (35.01%), Postives = 354/617 (57.37%), Query Frame = 1

Query: 2   LWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCV 61
           LWN +IK     GL++ A+  Y  M   GV+ D FT+P +   V  I   +     +H +
Sbjct: 97  LWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVIKSVAGI-SSLEEGKKIHAM 156

Query: 62  GIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCA 121
            I++GF SD+Y CN+++ +Y K  C   A KVF+EMP RD+VSW SMIS Y+ +GD   +
Sbjct: 157 VIKLGFVSDVYVCNSLISLYMKLGCAWDAEKVFEEMPERDIVSWNSMISGYLALGDGFSS 216

Query: 122 LNLFEGMRRV-FEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGL-LFDVGLQNWFLR 181
           L LF+ M +  F+P+  + M+ L AC       +G+ I C  V++ +   DV +    L 
Sbjct: 217 LMLFKEMLKCGFKPDRFSTMSALGACSHVYSPKMGKEIHCHAVRSRIETGDVMVMTSILD 276

Query: 182 MYSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMA--GEVPLII 241
           MYS+ G      R F+ +  +N+V+W+++I  Y+  G +  A   F+++    G  P +I
Sbjct: 277 MYSKYGEVSYAERIFNGMIQRNIVAWNVMIGCYARNGRVTDAFLCFQKMSEQNGLQPDVI 336

Query: 242 ETLTILISATKTSDSMCLILGENLHSLAIKTG-LYDSILRTSLLDMYAKFGELDNSTRLF 301
            ++ +L ++        ++ G  +H  A++ G L   +L T+L+DMY + G+L ++  +F
Sbjct: 337 TSINLLPASA-------ILEGRTIHGYAMRRGFLPHMVLETALIDMYGECGQLKSAEVIF 396

Query: 302 NEIPNRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQ 361
           + +  +++I+W +++++++QNG    A+E+F ++  + L P    +  ++ AYA   +L 
Sbjct: 397 DRMAEKNVISWNSIIAAYVQNGKNYSALELFQELWDSSLVPDSTTIASILPAYAESLSLS 456

Query: 362 LGRGIHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEG 421
            GR IH Y+++       NT +  SL++MY  CG +  ARKCF+ I++KDVV+W S+I  
Sbjct: 457 EGREIHAYIVK--SRYWSNTIILNSLVHMYAMCGDLEDARKCFNHILLKDVVSWNSIIMA 516

Query: 422 YGSHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKP 481
           Y  HG G  ++ L+  M++  V PN  TF SLL+ACS SG+V EG E F SM+  + I P
Sbjct: 517 YAVHGFGRISVWLFSEMIASRVNPNKSTFASLLAACSISGMVDEGWEYFESMKREYGIDP 576

Query: 482 DLEHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRL 541
            +EHY C +DL+ R+     A   +  M  +   RIWG+L+ A R + D  IA +AA ++
Sbjct: 577 GIEHYGCMLDLIGRTGNFSAAKRFLEEMPFVPTARIWGSLLNASRNHKDITIAEFAAEQI 636

Query: 542 LELEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSG 601
            ++E DN G Y LL N  A  G+W +V +++ ++  K   +    S VE  G  H F +G
Sbjct: 637 FKMEHDNTGCYVLLLNMYAEAGRWEDVNRIKLLMESKGISRTSSRSTVEAKGKSHVFTNG 696

Query: 602 DRSHCKTDQIYDLLIIV 614
           DRSH  T++IY++L +V
Sbjct: 697 DRSHVATNKIYEVLDVV 703

BLAST of CmoCh03G001870 vs. TAIR10
Match: AT3G03580.1 (AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 355.9 bits (912), Expect = 5.0e-98
Identity = 202/617 (32.74%), Postives = 345/617 (55.92%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCVG 62
           WNS+I      G +  A+ +Y  ++   +  D FT   +     ++ V     G+ H   
Sbjct: 175 WNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSSVLPAFGNLLVVKQGQGL-HGFA 234

Query: 63  IRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCAL 122
           ++ G  S +   N ++ +Y K      AR+VFDEM  RD VS+ +MI  Y+ +  +  ++
Sbjct: 235 LKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFDEMDVRDSVSYNTMICGYLKLEMVEESV 294

Query: 123 NLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRMYS 182
            +F      F+P+ +T+ ++L+AC    DL L + I   ++K G + +  ++N  + +Y+
Sbjct: 295 RMFLENLDQFKPDLLTVSSVLRACGHLRDLSLAKYIYNYMLKAGFVLESTVRNILIDVYA 354

Query: 183 RLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETLTI 242
           + G        F+ ++CK+ VSW+ +IS Y   GD+++A+ +FK +M  E      T  +
Sbjct: 355 KCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDLMEAMKLFKMMMIMEEQADHITYLM 414

Query: 243 LIS-ATKTSDSMCLILGENLHSLAIKTGL-YDSILRTSLLDMYAKFGELDNSTRLFNEIP 302
           LIS +T+ +D   L  G+ LHS  IK+G+  D  +  +L+DMYAK GE+ +S ++F+ + 
Sbjct: 415 LISVSTRLAD---LKFGKGLHSNGIKSGICIDLSVSNALIDMYAKCGEVGDSLKIFSSMG 474

Query: 303 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 362
               +TW  ++S+ ++ G F   +++ +QM+ + + P +      +   A L A +LG+ 
Sbjct: 475 TGDTVTWNTVISACVRFGDFATGLQVTTQMRKSEVVPDMATFLVTLPMCASLAAKRLGKE 534

Query: 363 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 422
           IHC L+R +G E     +  +L+ MY +CG + ++ + F+ +  +DVV WT MI  YG +
Sbjct: 535 IHCCLLR-FGYE-SELQIGNALIEMYSKCGCLENSSRVFERMSRRDVVTWTGMIYAYGMY 594

Query: 423 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 482
           G+G  AL  +  M    + P+SV F++++ ACSHSGLV EG   F  M++ + I P +EH
Sbjct: 595 GEGEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEGLACFEKMKTHYKIDPMIEH 654

Query: 483 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 542
           Y C VDLLSRS ++ +A   I  M    D  IW +++ ACR  GD + A   + R++EL 
Sbjct: 655 YACVVDLLSRSQKISKAEEFIQAMPIKPDASIWASVLRACRTSGDMETAERVSRRIIELN 714

Query: 543 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDRSH 602
           PD+ GY  L SN  A++ +W +V  +R  + +K   K PG+S++E+   +H F SGD S 
Sbjct: 715 PDDPGYSILASNAYAALRKWDKVSLIRKSLKDKHITKNPGYSWIEVGKNVHVFSSGDDSA 774

Query: 603 CKTDQIYDLLIIVYGLL 618
            +++ IY  L I+Y L+
Sbjct: 775 PQSEAIYKSLEILYSLM 785

BLAST of CmoCh03G001870 vs. TAIR10
Match: AT3G01580.1 (AT3G01580.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 350.9 bits (899), Expect = 1.6e-96
Identity = 210/615 (34.15%), Postives = 333/615 (54.15%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVH-CV 62
           WN+++KS      +   +  + +M     + D FT P+       +  +V Y  M+H  V
Sbjct: 28  WNTLLKSLSREKQWEEVLYHFSHMFRDEEKPDNFTLPVALKACGELR-EVNYGEMIHGFV 87

Query: 63  GIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCA 122
              +  GSDLY  ++++ +Y KC  +  A ++FDE+   D+V+W+SM+S +   G    A
Sbjct: 88  KKDVTLGSDLYVGSSLIYMYIKCGRMIEALRMFDELEKPDIVTWSSMVSGFEKNGSPYQA 147

Query: 123 LNLFEGMRRVFE--PNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLR 182
           +  F  M    +  P+ VT++ ++ AC    +  LGR +   V++ G   D+ L N  L 
Sbjct: 148 VEFFRRMVMASDVTPDRVTLITLVSACTKLSNSRLGRCVHGFVIRRGFSNDLSLVNSLLN 207

Query: 183 MYSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMA-GEVPLIIE 242
            Y++     E V  F  I  K+V+SW  +I+ Y   G   +A+ +F  +M  G  P +  
Sbjct: 208 CYAKSRAFKEAVNLFKMIAEKDVISWSTVIACYVQNGAAAEALLVFNDMMDDGTEPNVAT 267

Query: 243 TLTILISATKTSDSMCLILGENLHSLAIKTGLYDSI-LRTSLLDMYAKFGELDNSTRLFN 302
            L +L +     D   L  G   H LAI+ GL   + + T+L+DMY K    + +  +F+
Sbjct: 268 VLCVLQACAAAHD---LEQGRKTHELAIRKGLETEVKVSTALVDMYMKCFSPEEAYAVFS 327

Query: 303 EIPNRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAG-LKPSLGILKHLIDAYAHLGALQ 362
            IP + +++W A++S F  NG    ++E FS M      +P   ++  ++ + + LG L+
Sbjct: 328 RIPRKDVVSWVALISGFTLNGMAHRSIEEFSIMLLENNTRPDAILMVKVLGSCSELGFLE 387

Query: 363 LGRGIHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEG 422
             +  H Y+I+ YG +  N  +  SL+ +Y RCGS+ +A K F+ I +KD V WTS+I G
Sbjct: 388 QAKCFHSYVIK-YGFD-SNPFIGASLVELYSRCGSLGNASKVFNGIALKDTVVWTSLITG 447

Query: 423 YGSHGQGINALNLYHHMM-SEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIK 482
           YG HG+G  AL  ++HM+ S EV PN VTFLS+LSACSH+GL+ EG  IF  M + + + 
Sbjct: 448 YGIHGKGTKALETFNHMVKSSEVKPNEVTFLSILSACSHAGLIHEGLRIFKLMVNDYRLA 507

Query: 483 PDLEHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHR 542
           P+LEHY   VDLL R   +  A  I  RM      +I G L+GACR++ + ++A   A +
Sbjct: 508 PNLEHYAVLVDLLGRVGDLDTAIEITKRMPFSPTPQILGTLLGACRIHQNGEMAETVAKK 567

Query: 543 LLELEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVS 602
           L ELE ++ GYY L+SN     G+W  VEKLR+ V ++   K    S +E+   +H FV+
Sbjct: 568 LFELESNHAGYYMLMSNVYGVKGEWENVEKLRNSVKQRGIKKGLAESLIEIRRKVHRFVA 627

Query: 603 GDRSHCKTDQIYDLL 611
            D  H + + +Y LL
Sbjct: 628 DDELHPEKEPVYGLL 636

BLAST of CmoCh03G001870 vs. TAIR10
Match: AT1G69350.1 (AT1G69350.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 346.7 bits (888), Expect = 3.0e-95
Identity = 193/610 (31.64%), Postives = 346/610 (56.72%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCVG 62
           W++++ S  ++G  + A+ ++K M + GVE D  T   +      +   +  A  VH   
Sbjct: 170 WSTLVSSCLENGEVVKALRMFKCMVDDGVEPDAVTMISVVEGCAELGC-LRIARSVHGQI 229

Query: 63  IRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCAL 122
            R  F  D   CN+++ +Y+KC  L  + ++F+++  ++ VSWT+MIS+Y        AL
Sbjct: 230 TRKMFDLDETLCNSLLTMYSKCGDLLSSERIFEKIAKKNAVSWTAMISSYNRGEFSEKAL 289

Query: 123 NLFEGM-RRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDV-GLQNWFLRM 182
             F  M +   EPN VT+ ++L +C +   +  G+ +    V+  L  +   L    + +
Sbjct: 290 RSFSEMIKSGIEPNLVTLYSVLSSCGLIGLIREGKSVHGFAVRRELDPNYESLSLALVEL 349

Query: 183 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 242
           Y+  G   +       +  +N+V+W+ LIS Y+  G +++A+ +F+Q++   +     TL
Sbjct: 350 YAECGKLSDCETVLRVVSDRNIVAWNSLISLYAHRGMVIQALGLFRQMVTQRIKPDAFTL 409

Query: 243 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 302
              ISA + +    + LG+ +H   I+T + D  ++ SL+DMY+K G +D+++ +FN+I 
Sbjct: 410 ASSISACENAG--LVPLGKQIHGHVIRTDVSDEFVQNSLIDMYSKSGSVDSASTVFNQIK 469

Query: 303 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 362
           +RS++TW +M+  F QNG+  EA+ +F  M  + L+ +      +I A + +G+L+ G+ 
Sbjct: 470 HRSVVTWNSMLCGFSQNGNSVEAISLFDYMYHSYLEMNEVTFLAVIQACSSIGSLEKGKW 529

Query: 363 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 422
           +H  LI I GL+  +   +T+L++MY +CG + +A   F  +  + +V+W+SMI  YG H
Sbjct: 530 VHHKLI-ISGLK--DLFTDTALIDMYAKCGDLNAAETVFRAMSSRSIVSWSSMINAYGMH 589

Query: 423 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 482
           G+  +A++ ++ M+     PN V F+++LSAC HSG V EG + ++++   F + P+ EH
Sbjct: 590 GRIGSAISTFNQMVESGTKPNEVVFMNVLSACGHSGSVEEG-KYYFNLMKSFGVSPNSEH 649

Query: 483 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 542
           + CF+DLLSRS  ++EA+  I  M  L D  +WG+L+  CR++    I     + L ++ 
Sbjct: 650 FACFIDLLSRSGDLKEAYRTIKEMPFLADASVWGSLVNGCRIHQKMDIIKAIKNDLSDIV 709

Query: 543 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDRSH 602
            D+ GYYTLLSN  A  G+W E  +LRS +   +  K PG+S +E++  +  F +G+ + 
Sbjct: 710 TDDTGYYTLLSNIYAEEGEWEEFRRLRSAMKSSNLKKVPGYSAIEIDQKVFRFGAGEENR 769

Query: 603 CKTDQIYDLL 611
            +TD+IY  L
Sbjct: 770 IQTDEIYRFL 772

BLAST of CmoCh03G001870 vs. TAIR10
Match: AT4G30700.1 (AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 344.7 bits (883), Expect = 1.2e-94
Identity = 189/557 (33.93%), Postives = 315/557 (56.55%), Query Frame = 1

Query: 57  MVHCVGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVG 116
           ++H   +  G  S+L   + ++++Y K   +  ARKVFD MP +D + W +MIS Y    
Sbjct: 140 VIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNTMISGYRKNE 199

Query: 117 DIVCALNLFEGM--RRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQ 176
             V ++ +F  +        ++ T++ +L A    ++L LG  I  L  K G      + 
Sbjct: 200 MYVESIQVFRDLINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVL 259

Query: 177 NWFLRMYSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVP 236
             F+ +YS+ G        F E    ++V+++ +I  Y+S G+   ++ +FK++M     
Sbjct: 260 TGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGAR 319

Query: 237 LIIETLTILISATKTSDSMCLILGENLHSLAIKTG-LYDSILRTSLLDMYAKFGELDNST 296
           L   TL   +S    S  + LI    +H   +K+  L  + + T+L  +Y+K  E++++ 
Sbjct: 320 LRSSTL---VSLVPVSGHLMLIYA--IHGYCLKSNFLSHASVSTALTTVYSKLNEIESAR 379

Query: 297 RLFNEIPNRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLG 356
           +LF+E P +S+ +W AM+S + QNG  ++A+ +F +MQ +   P+   +  ++ A A LG
Sbjct: 380 KLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCILSACAQLG 439

Query: 357 ALQLGRGIHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSM 416
           AL LG+ +H  L+R    E  + ++ T+L+ MY +CGSIA AR+ FDL+  K+ V W +M
Sbjct: 440 ALSLGKWVHD-LVRSTDFE-SSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTM 499

Query: 417 IEGYGSHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFN 476
           I GYG HGQG  ALN+++ M++  + P  VTFL +L ACSH+GLV EG EIF SM  R+ 
Sbjct: 500 ISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYG 559

Query: 477 IKPDLEHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAA 536
            +P ++HY C VD+L R+  ++ A   I  M+      +W  L+GACR++ D  +A   +
Sbjct: 560 FEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVS 619

Query: 537 HRLLELEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGF 596
            +L EL+PDNVGY+ LLSN  ++   + +   +R    ++   K PG++ +E+  T H F
Sbjct: 620 EKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVF 679

Query: 597 VSGDRSHCKTDQIYDLL 611
            SGD+SH +  +IY+ L
Sbjct: 680 TSGDQSHPQVKEIYEKL 689

BLAST of CmoCh03G001870 vs. NCBI nr
Match: gi|659115504|ref|XP_008457590.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like [Cucumis melo])

HSP 1 Score: 1060.1 bits (2740), Expect = 1.5e-306
Identity = 515/614 (83.88%), Postives = 560/614 (91.21%), Query Frame = 1

Query: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60
           MLWN++IKS FDSGLF SA++LYKNMREV VEHDGFT PI+N V++SIWVDVVY GMVHC
Sbjct: 1   MLWNNVIKSHFDSGLFHSALLLYKNMREVRVEHDGFTLPIVNQVILSIWVDVVYGGMVHC 60

Query: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVC 120
           VGIRMGF SDLYFCNTMMEVY KC CL  AR VFDEMPNRDLVSWTSMISAYV  GD+ C
Sbjct: 61  VGIRMGFSSDLYFCNTMMEVYGKCGCLVSARDVFDEMPNRDLVSWTSMISAYVKGGDVFC 120

Query: 121 ALNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180
           AL++FEGMRR  EPNSVT++ MLQACC T++LVLGRL+QC VVKNGLLFD GLQN FLRM
Sbjct: 121 ALDIFEGMRRELEPNSVTVIVMLQACCATQNLVLGRLLQCYVVKNGLLFDTGLQNSFLRM 180

Query: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240
           YSRLGGEDE V FFSEID KNVVSW+IL+SFYSS+GDIVK VDI  +IM GEVPL IETL
Sbjct: 181 YSRLGGEDEVVAFFSEIDFKNVVSWNILMSFYSSMGDIVKVVDILNKIM-GEVPLSIETL 240

Query: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300
           TILIS   TSDS CLILGENLHSLAIK+GLYD IL TSLLDMYAKFGEL+NSTRLF EIP
Sbjct: 241 TILISGIATSDSGCLILGENLHSLAIKSGLYDDILCTSLLDMYAKFGELENSTRLFKEIP 300

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360
           NRSIITWGAMMSSFIQNGHFD+AV+IF QMQ AGLKPS+GILKHLIDAYA+LGALQLG+ 
Sbjct: 301 NRSIITWGAMMSSFIQNGHFDDAVDIFKQMQVAGLKPSVGILKHLIDAYAYLGALQLGKA 360

Query: 361 IHCYLIRIYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYGSH 420
           IHC+LIRIYGL +CNT LETS++NMYVRCGSIASARKCFDLI++KDVVAWTSMIEGYG+H
Sbjct: 361 IHCHLIRIYGLVVCNTRLETSVLNMYVRCGSIASARKCFDLILIKDVVAWTSMIEGYGAH 420

Query: 421 GQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDLEH 480
           G GI+ALNL+H M SEEV PN+VTFLSLLSACSHSGLVSEGC IFYSMRSRFNIKPDLEH
Sbjct: 421 GLGIDALNLFHQMTSEEVTPNNVTFLSLLSACSHSGLVSEGCGIFYSMRSRFNIKPDLEH 480

Query: 481 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLELE 540
           YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIA YAAHRLLELE
Sbjct: 481 YTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIANYAAHRLLELE 540

Query: 541 PDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDRSH 600
           PDNVGYYTLLSN+QASVGQWHE EKLRS+VYEK+  KKPGWSF+ELNGT+HGFVSGDRSH
Sbjct: 541 PDNVGYYTLLSNSQASVGQWHEAEKLRSLVYEKNLAKKPGWSFIELNGTIHGFVSGDRSH 600

Query: 601 CKTDQIYDLLIIVY 615
            K ++IYDLL+ +Y
Sbjct: 601 YKANEIYDLLVYIY 613

BLAST of CmoCh03G001870 vs. NCBI nr
Match: gi|694428826|ref|XP_009341960.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like [Pyrus x bretschneideri])

HSP 1 Score: 718.0 bits (1852), Expect = 1.4e-203
Identity = 357/612 (58.33%), Postives = 454/612 (74.18%), Query Frame = 1

Query: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60
           MLWN ++KS  + GL  SA++LYK MRE+GV HD FTFPI+N VVM +  +V YAGMVHC
Sbjct: 112 MLWNLMMKSHVECGLVDSALLLYKKMRELGVSHDCFTFPIVNRVVMLLGGEVGYAGMVHC 171

Query: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVC 120
           V I+MGFG D+YF NTM++ Y KC  + HAR +FDEM  RDLVSWTSMIS YV+ G++ C
Sbjct: 172 VAIQMGFGMDVYFGNTMIDFYVKCGAIDHARMLFDEMCQRDLVSWTSMISGYVSEGNVAC 231

Query: 121 ALNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180
            L+LF  MR   EPNSVTM+ MLQ CC TE  + G      V+KNGLL+D  +QN  LRM
Sbjct: 232 GLSLFNEMRLELEPNSVTMLIMLQGCCGTESAICGSQFHGYVIKNGLLYDASVQNSILRM 291

Query: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240
           Y++LG  +E   FFSE+D ++VVSW+I IS +SS GD+ K  ++F   M G+V   +ETL
Sbjct: 292 YAKLGTINEVEGFFSELDRRDVVSWNICISIFSSRGDVAKVRELFND-MQGKVAPGVETL 351

Query: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300
           T++ISA        L  GE+LH LAIK GL D +L+TSLLD+YAK GEL  S RLF EIP
Sbjct: 352 TLVISALAKHG--ILSQGESLHCLAIKRGLCDHVLQTSLLDLYAKCGELGISDRLFREIP 411

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360
           +R+ ITWGAMM  FIQNG F+EAV +  +MQA+G +P   IL+ L+DA+A+LGAL+LG+ 
Sbjct: 412 HRNTITWGAMMFGFIQNGWFNEAVGLLREMQASGPEPRAEILRSLVDAFANLGALKLGKQ 471

Query: 361 IHCYLIR--IYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 420
           IH Y+IR  +Y  +   THLETS++NMY+RCGS+++AR CFD ++VKD+V WTSMIEGYG
Sbjct: 472 IHGYIIRKSLYEGDESYTHLETSIINMYIRCGSLSAARVCFDRMLVKDIVTWTSMIEGYG 531

Query: 421 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 480
           SHG G  AL L+  M+ E + PNSVTF+SLLSACSHSGLV+EGC+ FYSM+ +F I+PDL
Sbjct: 532 SHGLGFEALKLFDLMIREGIRPNSVTFISLLSACSHSGLVTEGCDAFYSMKWKFGIEPDL 591

Query: 481 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLE 540
           +HYT  VDLL RS +++EA A+I++M    D RIWGAL+  CR+Y    +  YAA RLLE
Sbjct: 592 DHYTSIVDLLGRSGKLKEALAVIMKMMTFPDSRIWGALLSGCRIYSLRDVGEYAAQRLLE 651

Query: 541 LEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDR 600
           LEPDN GYYTLLSNTQASVGQW EVE+ R V+ E D  K PGWS +E  G ++GFVSGDR
Sbjct: 652 LEPDNAGYYTLLSNTQASVGQWDEVEETRRVMSEMDLKKMPGWSCIEAEGRIYGFVSGDR 711

Query: 601 SHCKTDQIYDLL 611
           SH + ++IY++L
Sbjct: 712 SHHQVEEIYEVL 720

BLAST of CmoCh03G001870 vs. NCBI nr
Match: gi|645271147|ref|XP_008240777.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g19191, mitochondrial-like [Prunus mume])

HSP 1 Score: 703.7 bits (1815), Expect = 2.8e-199
Identity = 357/613 (58.24%), Postives = 450/613 (73.41%), Query Frame = 1

Query: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60
           MLWN IIKS  DSGL  SA++LYK M ++GV HD FTFPI+N  V+ +  D  Y+GMVHC
Sbjct: 102 MLWNLIIKSHVDSGLLGSALLLYKKMLQLGVSHDCFTFPIVNRAVLLLGSDATYSGMVHC 161

Query: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVC 120
           V I+MGFG DLY  NTM++VY KC  L +ARK+FDEM  RDLVSWTSMIS YV+ G++ C
Sbjct: 162 VAIQMGFGMDLYVGNTMIDVYVKCGRLDYARKLFDEMRQRDLVSWTSMISGYVSEGNVAC 221

Query: 121 ALNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180
             +LF  MRR  EPN+VTM+ MLQ CC  E  V G  +    +K+GLL D  +QN   RM
Sbjct: 222 GFSLFSEMRRELEPNAVTMLVMLQGCCDIEISVYGEPLHGYGIKSGLLGDGSVQNSIFRM 281

Query: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240
           Y++LG  D+   FF ++D ++VVSW+I ISFYS  GD+VK  D+F + M GEV    ETL
Sbjct: 282 YAKLGTVDQVEDFFGQLDRRDVVSWNIRISFYSWRGDVVKVRDLFHE-MQGEVAPSSETL 341

Query: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300
           T++ISA        L  GE+LH LA K+GL D IL+TSLLD YAK GEL NS +LF EIP
Sbjct: 342 TLVISALTKHG--ILSQGESLHCLATKSGLCDDILQTSLLDFYAKCGELGNSDKLFREIP 401

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360
           +R+ ITWGAMM  FIQNG+F+EAV +F +MQA G++P   IL+ L+DA+A+LGAL+LG+G
Sbjct: 402 HRNSITWGAMMFGFIQNGYFNEAVRLFGRMQAEGVEPGAEILRGLVDAFANLGALKLGKG 461

Query: 361 IHCYLIRIYGLEI--CNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 420
           IH  +IR    E   CNTHLETSL+NMY+RCGSI++AR CF  ++++DVVAWTSMIEGYG
Sbjct: 462 IHGCIIRKSFCEAKKCNTHLETSLINMYIRCGSISTARVCFSRMLIRDVVAWTSMIEGYG 521

Query: 421 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 480
           SHG G+ AL L+  M+ E   PNSVT LSLLSACSHSGLV+EGCE F SM+ +F I+PDL
Sbjct: 522 SHGLGLEALKLFDLMIREGTKPNSVTLLSLLSACSHSGLVTEGCEAFCSMKWKFGIEPDL 581

Query: 481 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLE 540
           +HYT  VDLL RS +++EA  +I++M    D RIWGAL+   R+YG   +  +AA RLLE
Sbjct: 582 DHYTSIVDLLGRSGKLKEALVVIMKMVIFPDSRIWGALLSGSRIYGRRDVGEFAAQRLLE 641

Query: 541 LEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVEL-NGTLHGFVSGD 600
           LEPDNVGY TLLSN QASVG+W EVE++R V+ E+D  KKPGWS +E   G ++GFVSGD
Sbjct: 642 LEPDNVGYCTLLSNAQASVGEWDEVEEIRRVMKERDLKKKPGWSCIEAEEGRIYGFVSGD 701

Query: 601 RSHCKTDQIYDLL 611
           RSH + + IY++L
Sbjct: 702 RSHHQMEAIYEVL 711

BLAST of CmoCh03G001870 vs. NCBI nr
Match: gi|1009159814|ref|XP_015898019.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic-like, partial [Ziziphus jujuba])

HSP 1 Score: 701.0 bits (1808), Expect = 1.8e-198
Identity = 355/613 (57.91%), Postives = 457/613 (74.55%), Query Frame = 1

Query: 3   WNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHCVG 62
           WN IIKS  + G   SA +LY+ M E+GV HD FTFPI+N  +  + +DV+YAGMVHC+ 
Sbjct: 121 WNLIIKSHVEFGHLESAFLLYRKMHELGVAHDVFTFPIVNKALSLLRIDVLYAGMVHCLA 180

Query: 63  IRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVCAL 122
            +MGF  D+YF NTM+E+Y KC C+ +ARK+FDEM +RDLVSWT+MIS YV+ G+ +CAL
Sbjct: 181 NQMGFVLDVYFGNTMIELYVKCGCVYYARKLFDEMCHRDLVSWTAMISGYVSEGNFICAL 240

Query: 123 NLFEGMRRV-FEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRMY 182
           N F  MR +  EPN+VTMM +LQ CC T   + GR + C + KNGLL D  LQN  L+MY
Sbjct: 241 NFFREMRMLDLEPNAVTMMVVLQGCCGTGSSIYGRQLHCYLFKNGLLMDGSLQNSILKMY 300

Query: 183 SRLGGEDEFVRFFSEIDCK-NVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 242
           ++LG  +E   F  E+D + +VV W++LISFYSSVGD VKA+ +F + M  EV   IETL
Sbjct: 301 TKLGTINEVESFSREVDRRRDVVYWNVLISFYSSVGDAVKAIGMFNK-MRLEVETSIETL 360

Query: 243 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 302
           T +ISA   S +  L  GE LH LAIK+G  D +L+TSLLD+YAK GEL  S RLF EI 
Sbjct: 361 TSVISAVGKSGN--LFQGEKLHCLAIKSGHLDDVLQTSLLDLYAKCGELGKSERLFKEIR 420

Query: 303 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 362
           +R+ ITW A+MS FIQNG+F+EAVE+F QMQA  L+PS   L++L+DAY +LGALQLG+ 
Sbjct: 421 HRNNITWSAIMSGFIQNGYFNEAVELFHQMQATDLEPSSENLRNLVDAYTNLGALQLGKR 480

Query: 363 IHCYLIR--IYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 422
           +H +LIR   +  E+CNTHLETSL+NMY+RCGSI+SAR  F+ +++KDVV WTSMIEGYG
Sbjct: 481 VHGFLIRNIFHRSEVCNTHLETSLLNMYIRCGSISSARVYFNKMLIKDVVTWTSMIEGYG 540

Query: 423 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 482
           SHG G+ AL ++  M+ E +APN VTFLSLLSACSHSGLV EGCE+F SM+ +F I PDL
Sbjct: 541 SHGLGVEALRIFDLMIEERIAPNRVTFLSLLSACSHSGLVIEGCEVFSSMKWKFGIDPDL 600

Query: 483 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYG-DNKIAIYAAHRLL 542
           +HYTC VDLL R  +++EA  II+++  L D RIWGAL  A RV+    ++  YAA +LL
Sbjct: 601 DHYTCMVDLLGRYGKLKEALVIIMKLIALPDSRIWGALFSASRVHHIHRELGEYAAQKLL 660

Query: 543 ELEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGD 602
           ELE DN+GYYTLLSN QAS+GQW+EVE++R V+ EK+  KKPGWS +E  G ++GFVSGD
Sbjct: 661 ELELDNIGYYTLLSNAQASIGQWNEVEEIRRVMKEKEMKKKPGWSCIENKGRVYGFVSGD 720

Query: 603 RSHCKTDQIYDLL 611
           RSH +T++IY +L
Sbjct: 721 RSHHQTEEIYGVL 730

BLAST of CmoCh03G001870 vs. NCBI nr
Match: gi|764608911|ref|XP_004305697.2| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580, partial [Fragaria vesca subsp. vesca])

HSP 1 Score: 695.7 bits (1794), Expect = 7.5e-197
Identity = 348/612 (56.86%), Postives = 454/612 (74.18%), Query Frame = 1

Query: 1   MLWNSIIKSQFDSGLFLSAIMLYKNMREVGVEHDGFTFPILNHVVMSIWVDVVYAGMVHC 60
           M WN +IK+  + G   SA++LY+ MRE+GV HDGFTFPI+N  V+ I   V YAG++H 
Sbjct: 112 MQWNLLIKTHIEWGRLDSALLLYRKMRELGVPHDGFTFPIVNKAVLMIGDGVRYAGVLHS 171

Query: 61  VGIRMGFGSDLYFCNTMMEVYAKCECLGHARKVFDEMPNRDLVSWTSMISAYVNVGDIVC 120
           + I+MGFG DLYF NTM+E+Y KC C  +ARK+FDEM +RDL++WTSMIS YV+ G++  
Sbjct: 172 LAIQMGFGLDLYFGNTMIELYVKCGCFSYARKLFDEMCDRDLITWTSMISGYVSQGNVTS 231

Query: 121 ALNLFEGMRRVFEPNSVTMMAMLQACCVTEDLVLGRLIQCLVVKNGLLFDVGLQNWFLRM 180
             +LF  MR   EPNSVTM+AMLQ  C  E  + GR +   V+KNGLL    ++N  LRM
Sbjct: 232 GFSLFNEMRMELEPNSVTMLAMLQGGCCFETSIYGRQLHAYVIKNGLLSHGAVENSILRM 291

Query: 181 YSRLGGEDEFVRFFSEIDCKNVVSWDILISFYSSVGDIVKAVDIFKQIMAGEVPLIIETL 240
           Y++LG  +E   FF E+D ++VV+W+I IS+Y+S GD+VK  D+FK+ M GEV   IETL
Sbjct: 292 YAKLGTGEEVEDFFRELDRRDVVTWNICISYYTSRGDVVKVRDLFKE-MQGEVAPSIETL 351

Query: 241 TILISATKTSDSMCLILGENLHSLAIKTGLYDSILRTSLLDMYAKFGELDNSTRLFNEIP 300
           TI+ISA        L  GE+LH LAIK+GL D +L+TSLLD YAK  +L+++ +LF EI 
Sbjct: 352 TIVISALAIHG--ILSQGESLHGLAIKSGLRDDVLQTSLLDFYAKCAKLESADKLFREIR 411

Query: 301 NRSIITWGAMMSSFIQNGHFDEAVEIFSQMQAAGLKPSLGILKHLIDAYAHLGALQLGRG 360
           +R+ IT GAMM   IQNG+  EAV +F Q+QAAGL P   IL++LIDA+A+LGAL+LG+ 
Sbjct: 412 DRNCITCGAMMFGLIQNGYIYEAVGVFRQIQAAGLDPGAEILRNLIDAFANLGALKLGQA 471

Query: 361 IHCYLIR--IYGLEICNTHLETSLMNMYVRCGSIASARKCFDLIIVKDVVAWTSMIEGYG 420
           +H  +IR   YG E C+THLETS++NMY+RCGSI++AR CF+ +++KDVVAWTSMIEGYG
Sbjct: 472 VHGCIIRKSFYGTEECHTHLETSVINMYIRCGSISTARVCFNGMVLKDVVAWTSMIEGYG 531

Query: 421 SHGQGINALNLYHHMMSEEVAPNSVTFLSLLSACSHSGLVSEGCEIFYSMRSRFNIKPDL 480
           SHG G  AL L+  M  + + PNSVTFLSLLSACSHSGLV+EGC  FYSM+ R+ I+PDL
Sbjct: 532 SHGLGFEALKLFDLMTRQGIKPNSVTFLSLLSACSHSGLVTEGCSAFYSMKWRYGIEPDL 591

Query: 481 EHYTCFVDLLSRSTRVREAFAIILRMTNLCDGRIWGALMGACRVYGDNKIAIYAAHRLLE 540
           +HYTC VDLL+R  +++EA A+IL+M    D RI+GAL+   R+YG+ ++  YAA RLLE
Sbjct: 592 DHYTCLVDLLARCGKLKEALAVILKMLAFPDSRIFGALLSGSRIYGNIELGQYAAQRLLE 651

Query: 541 LEPDNVGYYTLLSNTQASVGQWHEVEKLRSVVYEKDFVKKPGWSFVELNGTLHGFVSGDR 600
           LEPDNVGY+TLLSNTQASV QW EVE++R  + E D  KKPGWS +E  G +HGFVSGD 
Sbjct: 652 LEPDNVGYFTLLSNTQASVRQWDEVEEIRRTMKENDLKKKPGWSCIEAKGLIHGFVSGDN 711

Query: 601 SHCKTDQIYDLL 611
           SH   ++IY++L
Sbjct: 712 SHHHIEEIYEVL 720

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP350_ARATH1.1e-10735.01Pentatricopeptide repeat-containing protein At4g35130, chloroplastic OS=Arabidop... [more]
PP210_ARATH8.9e-9732.74Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana GN... [more]
PP205_ARATH2.9e-9534.15Putative pentatricopeptide repeat-containing protein At3g01580 OS=Arabidopsis th... [more]
PP111_ARATH5.4e-9431.64Putative pentatricopeptide repeat-containing protein At1g69350, mitochondrial OS... [more]
PP341_ARATH2.0e-9333.93Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
M5VVK0_PRUPE6.9e-19757.26Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002924mg PE=4 SV=1[more]
A0A0L9TUF7_PHAAN2.2e-17953.59Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan02g028600 PE=4 SV=1[more]
A0A0S3SU84_PHAAN2.2e-17953.59Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.08G334900 PE=... [more]
B9T607_RICCO1.4e-17854.29Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
A0A061FB60_THECC8.7e-17651.80Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_... [more]
Match NameE-valueIdentityDescription
AT4G35130.16.3e-10935.01 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G03580.15.0e-9832.74 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G01580.11.6e-9634.15 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G69350.13.0e-9531.64 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G30700.11.2e-9433.93 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659115504|ref|XP_008457590.1|1.5e-30683.88PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic-... [more]
gi|694428826|ref|XP_009341960.1|1.4e-20358.33PREDICTED: pentatricopeptide repeat-containing protein At1g15510, chloroplastic-... [more]
gi|645271147|ref|XP_008240777.1|2.8e-19958.24PREDICTED: pentatricopeptide repeat-containing protein At4g19191, mitochondrial-... [more]
gi|1009159814|ref|XP_015898019.1|1.8e-19857.91PREDICTED: pentatricopeptide repeat-containing protein At4g35130, chloroplastic-... [more]
gi|764608911|ref|XP_004305697.2|7.5e-19756.86PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580, parti... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh03G001870.1CmoCh03G001870.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 277..302
score: 0.36coord: 203..228
score: 0.032coord: 74..100
score: 9.4E-4coord: 3..31
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 303..338
score: 2.9E-7coord: 101..147
score: 1.1E-7coord: 405..453
score: 1.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 74..102
score: 1.6E-4coord: 103..130
score: 2.0E-4coord: 203..230
score: 0.0022coord: 408..441
score: 4.8E-5coord: 443..477
score: 1.0E-4coord: 305..338
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 105..131
score: 5.437coord: 135..169
score: 5.174coord: 441..471
score: 8.638coord: 272..302
score: 7.235coord: 375..405
score: 5.996coord: 201..235
score: 8.627coord: 406..440
score: 10.523coord: 1..33
score: 7.772coord: 543..577
score: 5.24coord: 70..104
score: 9.262coord: 303..337
score: 12.726coord: 477..507
score: 6
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 517..563
score: 3.5E-8coord: 300..449
score: 3.5E-8coord: 200..232
score: 3.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 272..333
score: 3.8E-5coord: 513..564
score: 3.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 2..236
score: 8.3E-235coord: 273..584
score: 8.3E

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CmoCh03G001870Watermelon (97103) v2cmowmbB647
CmoCh03G001870Cucumber (Gy14) v1cgycmoB0937
CmoCh03G001870Cucurbita pepo (Zucchini)cmocpeB606
CmoCh03G001870Bottle gourd (USVL1VR-Ls)cmolsiB600
CmoCh03G001870Melon (DHL92) v3.6.1cmomedB680