MC03g0739 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC03g0739
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein DOT4
Location: MC03: 14185168 .. 14187798 (-)
RNA-Seq Expression: MC03g0739
Synteny: MC03g0739
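The Location field can be turned into numbers with a few lines of Python. The sketch below is illustrative only: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(location: str):
    """Split 'CHR: START .. END (STRAND)' into (chrom, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("MC03: 14185168 .. 14187798 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 2631 bp on the minus strand of MC03. That figure equals 3 × 877, which would be consistent with the 877-residue protein reported in the homology section if the gene is a single uninterrupted reading frame listed without its stop codon.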
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTATTGGTGGCAAAATCTCCTCCAACCTTCTGGTTATCTCCCACCGGGCACGATCGCCATGGTTTAGTGAACCTGAAATTCTCGCATTCCTTCGTCTTTGCCAAACCAAAATCAAAATTTTCCTTTTCGAATTCGGCCTATGCTTGTACGCAGATTTACCCTTCACCATCGCAAACGAAAAGCTATCTCGATATTGAACTTGATAACTCCGCCAGAATTGTCGAGTTTTGTGAAGTGGGTGATCTGAAAAATGCTATGGAGCTTCTCTGCAGCTCCGGAAATGCCAACCTTGACTTGGAAACTTACTGCTCCGTCTTGCAGCTTTGTGCTGAACGAAAATCGATTCGATATGGGAAAAGGGTTCATTCAATAATTGAATCTAATGGGGTTGTGATGGACGGAATCTTGGGGGCGAAACTAGTTTTTATGTATGTAAAATGTGGGGATCTAAAAGAAGCGAGGATGATTTTTGATAAACTATCGGAACAGAAGGTTTTCCTCTGGAACCTCATGATTAGTGAGTATGCGGGAAATGGTAACTATGTGGAGAGTGTAAATTTGTTCAAGCGAATGATGGAGTTGGGGATAAAACCTAATTCTTATACATTTTCTAGTGTTTTAAAATGTCTCGCTGCAGTTGCACGTGTAGAACAAGGCAGGCTGGTTCATGGGTTTATCTGCAAGCTGGGGTTTTCATCCTATAATACAGTCGTTAACTCGCTAATTTCTTTCTACTTCGTGAGTAAAAAGGTTAGAAGTGCACAGAAGCTGTTTGATGAATTGAGTGACCGAGATGTGATATCATGGAACTCTATGATCAGTGGCTATGTTAAGAATGGTTTAGAAGACAAGGGAATTGAGATTTTCATAAAGATGTTAGATTTCAGTGTTGATGTTGATTTGGCTACAATGGTCAATGTGCTTGTGGCTTGTGCAAATACGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCTTATGCAATAAAGGCGGCTTCTTCTCTTGACAGAGAAGTTATGTTCAAGAATACTTTACTGGACATGTACTCAAAATGTGGAGATTTGAACAGTGCCATCCGGGTTTTTGAGAAAATGGATGAGAAAACCGTTGTATCTTGGACTTCGATGATTGCAGGCTATGTACGTGAAGGTCTATCCGATGGTGCAATTGAGTTGTTTGATGAAATGAAAAGCAGAGGCGTCGTCCCGGATGTTTATGCTGTGACAAGCATTCTTCATGCTTGTGCTATCAATGGCAACCTGAGTAGTGGGAAGATTGTACACAACTACATCAGGAAAAACAACTTGGAAACTAACTCGTTTGTTAGTAATGCTCTTATGGACATGTATGCCAAATGTGGCAGCATGAAGGACGCGCAGAGTGTTTTTTCTCACATGAAAGGGAAGGATGTAATATCATGGAATACTATGATTGGAGGTTACTCAAAGAACCATCTTCCAAATGAAGCTCTTAACTTGTTTGCAGAGATGCAAAGAGAATCAAAGCCTGATGGCACAACGGTGGCATGCATCCTTCCAGCCTGTGCGAGCCTTGCAGCGTTGGATAGAGGTAGAGAAATCCATGGATATGCATTAAGAAATGGATACTCTAAAGACAAATATGTTGTCAATGCACTTGTTGATATGTATGTAAAGTGTGGGCTATTAGTTCTTGCACGGTCACTCTTCGATATGATTCTCGATAAGGACCTTGTCTCATGGACAGTGATGATAGCAGGATATGGCATGCATGGTTTTGGTAATGAAGCTGTCGATATATTTAATCAGATGAGGATTTCTGGAGTTGAGCCTGATGAAGTATCCTTCATTTCAATTCTTTATGCCTGTAGCCATTCTGGATTGCTTGATGAAGGATGGAAATTTTTCAATATTATGAAGAAGGAATGTCGAATTGAACCCAAGTTGGAGCACTATGCTTGTATGGTGGATCTTCTTGCCCGAACCGGGAATCTGGTGAAGGCTCATAAATTCATCAAAAC
AATGCCGATCGAACCAGATGCAACAATTTGGGGTGCGTTGTTGTGCGGATGCAGGATACACCATGATGTCAAAGTAGCAGAGAAAGTTGCAGAACGGATCTTTGAGCTAGAACCAGAAAACACAGGATATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAGAAGTGGGAAGAAGTTCAAAAGTTAAGGAAGAAAATCGGACAACGTGGTTTGAAGAAAAATCCAGGCTGCAGTTGGATAGAGATCAAGGGCAAAGTCAATATCTTTGTTGCTGGAGATTGCTCCAAACCCCAAGCCAAGAAGATAGAGCTACTTCTGAAAAAACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCAAAAACAAGGTATGCCTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGCCCTCTGTGGGCACAGTGAGAAGCTAGCCATGGCTTTCGGTATGCTGAATCTCCCACCCGGCAAGACTATACGGGTGACTAAAAATCTCCGAGTTTGCGGTGACTGTCATGAGATGGCCAAGTTCATGTCGAAGAATACCACGAGAGAAATCGTTTTGAGAGATTCGAATCGTTTTCATCATTTCAAAGATGGATATTGTTCTTGTAGAGGTTACTGG

mRNA sequence

ATGCTATTGGTGGCAAAATCTCCTCCAACCTTCTGGTTATCTCCCACCGGGCACGATCGCCATGGTTTAGTGAACCTGAAATTCTCGCATTCCTTCGTCTTTGCCAAACCAAAATCAAAATTTTCCTTTTCGAATTCGGCCTATGCTTGTACGCAGATTTACCCTTCACCATCGCAAACGAAAAGCTATCTCGATATTGAACTTGATAACTCCGCCAGAATTGTCGAGTTTTGTGAAGTGGGTGATCTGAAAAATGCTATGGAGCTTCTCTGCAGCTCCGGAAATGCCAACCTTGACTTGGAAACTTACTGCTCCGTCTTGCAGCTTTGTGCTGAACGAAAATCGATTCGATATGGGAAAAGGGTTCATTCAATAATTGAATCTAATGGGGTTGTGATGGACGGAATCTTGGGGGCGAAACTAGTTTTTATGTATGTAAAATGTGGGGATCTAAAAGAAGCGAGGATGATTTTTGATAAACTATCGGAACAGAAGGTTTTCCTCTGGAACCTCATGATTAGTGAGTATGCGGGAAATGGTAACTATGTGGAGAGTGTAAATTTGTTCAAGCGAATGATGGAGTTGGGGATAAAACCTAATTCTTATACATTTTCTAGTGTTTTAAAATGTCTCGCTGCAGTTGCACGTGTAGAACAAGGCAGGCTGGTTCATGGGTTTATCTGCAAGCTGGGGTTTTCATCCTATAATACAGTCGTTAACTCGCTAATTTCTTTCTACTTCGTGAGTAAAAAGGTTAGAAGTGCACAGAAGCTGTTTGATGAATTGAGTGACCGAGATGTGATATCATGGAACTCTATGATCAGTGGCTATGTTAAGAATGGTTTAGAAGACAAGGGAATTGAGATTTTCATAAAGATGTTAGATTTCAGTGTTGATGTTGATTTGGCTACAATGGTCAATGTGCTTGTGGCTTGTGCAAATACGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCTTATGCAATAAAGGCGGCTTCTTCTCTTGACAGAGAAGTTATGTTCAAGAATACTTTACTGGACATGTACTCAAAATGTGGAGATTTGAACAGTGCCATCCGGGTTTTTGAGAAAATGGATGAGAAAACCGTTGTATCTTGGACTTCGATGATTGCAGGCTATGTACGTGAAGGTCTATCCGATGGTGCAATTGAGTTGTTTGATGAAATGAAAAGCAGAGGCGTCGTCCCGGATGTTTATGCTGTGACAAGCATTCTTCATGCTTGTGCTATCAATGGCAACCTGAGTAGTGGGAAGATTGTACACAACTACATCAGGAAAAACAACTTGGAAACTAACTCGTTTGTTAGTAATGCTCTTATGGACATGTATGCCAAATGTGGCAGCATGAAGGACGCGCAGAGTGTTTTTTCTCACATGAAAGGGAAGGATGTAATATCATGGAATACTATGATTGGAGGTTACTCAAAGAACCATCTTCCAAATGAAGCTCTTAACTTGTTTGCAGAGATGCAAAGAGAATCAAAGCCTGATGGCACAACGGTGGCATGCATCCTTCCAGCCTGTGCGAGCCTTGCAGCGTTGGATAGAGGTAGAGAAATCCATGGATATGCATTAAGAAATGGATACTCTAAAGACAAATATGTTGTCAATGCACTTGTTGATATGTATGTAAAGTGTGGGCTATTAGTTCTTGCACGGTCACTCTTCGATATGATTCTCGATAAGGACCTTGTCTCATGGACAGTGATGATAGCAGGATATGGCATGCATGGTTTTGGTAATGAAGCTGTCGATATATTTAATCAGATGAGGATTTCTGGAGTTGAGCCTGATGAAGTATCCTTCATTTCAATTCTTTATGCCTGTAGCCATTCTGGATTGCTTGATGAAGGATGGAAATTTTTCAATATTATGAAGAAGGAATGTCGAATTGAACCCAAGTTGGAGCACTATGCTTGTATGGTGGATCTTCTTGCCCGAACCGGGAATCTGGTGAAGGCTCATAAATTCATCAAAAC
AATGCCGATCGAACCAGATGCAACAATTTGGGGTGCGTTGTTGTGCGGATGCAGGATACACCATGATGTCAAAGTAGCAGAGAAAGTTGCAGAACGGATCTTTGAGCTAGAACCAGAAAACACAGGATATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAGAAGTGGGAAGAAGTTCAAAAGTTAAGGAAGAAAATCGGACAACGTGGTTTGAAGAAAAATCCAGGCTGCAGTTGGATAGAGATCAAGGGCAAAGTCAATATCTTTGTTGCTGGAGATTGCTCCAAACCCCAAGCCAAGAAGATAGAGCTACTTCTGAAAAAACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCAAAAACAAGGTATGCCTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGCCCTCTGTGGGCACAGTGAGAAGCTAGCCATGGCTTTCGGTATGCTGAATCTCCCACCCGGCAAGACTATACGGGTGACTAAAAATCTCCGAGTTTGCGGTGACTGTCATGAGATGGCCAAGTTCATGTCGAAGAATACCACGAGAGAAATCGTTTTGAGAGATTCGAATCGTTTTCATCATTTCAAAGATGGATATTGTTCTTGTAGAGGTTACTGG

Coding sequence (CDS)

ATGCTATTGGTGGCAAAATCTCCTCCAACCTTCTGGTTATCTCCCACCGGGCACGATCGCCATGGTTTAGTGAACCTGAAATTCTCGCATTCCTTCGTCTTTGCCAAACCAAAATCAAAATTTTCCTTTTCGAATTCGGCCTATGCTTGTACGCAGATTTACCCTTCACCATCGCAAACGAAAAGCTATCTCGATATTGAACTTGATAACTCCGCCAGAATTGTCGAGTTTTGTGAAGTGGGTGATCTGAAAAATGCTATGGAGCTTCTCTGCAGCTCCGGAAATGCCAACCTTGACTTGGAAACTTACTGCTCCGTCTTGCAGCTTTGTGCTGAACGAAAATCGATTCGATATGGGAAAAGGGTTCATTCAATAATTGAATCTAATGGGGTTGTGATGGACGGAATCTTGGGGGCGAAACTAGTTTTTATGTATGTAAAATGTGGGGATCTAAAAGAAGCGAGGATGATTTTTGATAAACTATCGGAACAGAAGGTTTTCCTCTGGAACCTCATGATTAGTGAGTATGCGGGAAATGGTAACTATGTGGAGAGTGTAAATTTGTTCAAGCGAATGATGGAGTTGGGGATAAAACCTAATTCTTATACATTTTCTAGTGTTTTAAAATGTCTCGCTGCAGTTGCACGTGTAGAACAAGGCAGGCTGGTTCATGGGTTTATCTGCAAGCTGGGGTTTTCATCCTATAATACAGTCGTTAACTCGCTAATTTCTTTCTACTTCGTGAGTAAAAAGGTTAGAAGTGCACAGAAGCTGTTTGATGAATTGAGTGACCGAGATGTGATATCATGGAACTCTATGATCAGTGGCTATGTTAAGAATGGTTTAGAAGACAAGGGAATTGAGATTTTCATAAAGATGTTAGATTTCAGTGTTGATGTTGATTTGGCTACAATGGTCAATGTGCTTGTGGCTTGTGCAAATACGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCTTATGCAATAAAGGCGGCTTCTTCTCTTGACAGAGAAGTTATGTTCAAGAATACTTTACTGGACATGTACTCAAAATGTGGAGATTTGAACAGTGCCATCCGGGTTTTTGAGAAAATGGATGAGAAAACCGTTGTATCTTGGACTTCGATGATTGCAGGCTATGTACGTGAAGGTCTATCCGATGGTGCAATTGAGTTGTTTGATGAAATGAAAAGCAGAGGCGTCGTCCCGGATGTTTATGCTGTGACAAGCATTCTTCATGCTTGTGCTATCAATGGCAACCTGAGTAGTGGGAAGATTGTACACAACTACATCAGGAAAAACAACTTGGAAACTAACTCGTTTGTTAGTAATGCTCTTATGGACATGTATGCCAAATGTGGCAGCATGAAGGACGCGCAGAGTGTTTTTTCTCACATGAAAGGGAAGGATGTAATATCATGGAATACTATGATTGGAGGTTACTCAAAGAACCATCTTCCAAATGAAGCTCTTAACTTGTTTGCAGAGATGCAAAGAGAATCAAAGCCTGATGGCACAACGGTGGCATGCATCCTTCCAGCCTGTGCGAGCCTTGCAGCGTTGGATAGAGGTAGAGAAATCCATGGATATGCATTAAGAAATGGATACTCTAAAGACAAATATGTTGTCAATGCACTTGTTGATATGTATGTAAAGTGTGGGCTATTAGTTCTTGCACGGTCACTCTTCGATATGATTCTCGATAAGGACCTTGTCTCATGGACAGTGATGATAGCAGGATATGGCATGCATGGTTTTGGTAATGAAGCTGTCGATATATTTAATCAGATGAGGATTTCTGGAGTTGAGCCTGATGAAGTATCCTTCATTTCAATTCTTTATGCCTGTAGCCATTCTGGATTGCTTGATGAAGGATGGAAATTTTTCAATATTATGAAGAAGGAATGTCGAATTGAACCCAAGTTGGAGCACTATGCTTGTATGGTGGATCTTCTTGCCCGAACCGGGAATCTGGTGAAGGCTCATAAATTCATCAAAAC
AATGCCGATCGAACCAGATGCAACAATTTGGGGTGCGTTGTTGTGCGGATGCAGGATACACCATGATGTCAAAGTAGCAGAGAAAGTTGCAGAACGGATCTTTGAGCTAGAACCAGAAAACACAGGATATTATGTACTTTTGGCAAACATCTATGCAGAGGCAGAGAAGTGGGAAGAAGTTCAAAAGTTAAGGAAGAAAATCGGACAACGTGGTTTGAAGAAAAATCCAGGCTGCAGTTGGATAGAGATCAAGGGCAAAGTCAATATCTTTGTTGCTGGAGATTGCTCCAAACCCCAAGCCAAGAAGATAGAGCTACTTCTGAAAAAACTAAGAAGCAAGATGAAGGAAGAAGGTTACTCTCCAAAAACAAGGTATGCCTTGTTAAATGCAGATGAAAGGGAGAAGGAAGTAGCCCTCTGTGGGCACAGTGAGAAGCTAGCCATGGCTTTCGGTATGCTGAATCTCCCACCCGGCAAGACTATACGGGTGACTAAAAATCTCCGAGTTTGCGGTGACTGTCATGAGATGGCCAAGTTCATGTCGAAGAATACCACGAGAGAAATCGTTTTGAGAGATTCGAATCGTTTTCATCATTTCAAAGATGGATATTGTTCTTGTAGAGGTTACTGG

Protein sequence

MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQTKSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGKRVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW
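One way to check that the coding sequence and the protein sequence agree is to translate the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the helper name `translate` is hypothetical, and only the first 30 and last 9 nucleotides of the CDS listed above are used as sample input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGCTATTGGTGGCAAAATCTCCTCCAACC"  # first 30 nt of the CDS above
cds_end = "GGTTACTGG"                         # last 9 nt of the CDS above
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to MLLVAKSPPT and GYW, matching the first ten and last three residues of the protein sequence above. Passing the full CDS to the same function would allow a complete comparison.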
Homology
BLAST of MC03g0739 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 510/811 (62.89%), Positives = 655/811 (80.76%), Query Frame = 0

Query: 69  DNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGKRVHSIIES 128
           D + ++  FCE G+L+NA++LLC SG  ++D  T CSVLQLCA+ KS++ GK V + I  
Sbjct: 63  DANTQLRRFCESGNLENAVKLLCVSGKWDIDPRTLCSVLQLCADSKSLKDGKEVDNFIRG 122

Query: 129 NGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVESVNL 188
           NG V+D  LG+KL  MY  CGDLKEA  +FD++  +K   WN++++E A +G++  S+ L
Sbjct: 123 NGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGL 182

Query: 189 FKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLISFYFV 248
           FK+MM  G++ +SYTFS V K  +++  V  G  +HGFI K GF   N+V NSL++FY  
Sbjct: 183 FKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLK 242

Query: 249 SKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLATMVNV 308
           +++V SA+K+FDE+++RDVISWNS+I+GYV NGL +KG+ +F++ML   +++DLAT+V+V
Sbjct: 243 NQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSV 302

Query: 309 LVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEKMDEK 368
              CA++  + LG+A+HS  +KA  S  RE  F NTLLDMYSKCGDL+SA  VF +M ++
Sbjct: 303 FAGCADSRLISLGRAVHSIGVKACFS--REDRFCNTLLDMYSKCGDLDSAKAVFREMSDR 362

Query: 369 TVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSGKIVH 428
           +VVS+TSMIAGY REGL+  A++LF+EM+  G+ PDVY VT++L+ CA    L  GK VH
Sbjct: 363 SVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVH 422

Query: 429 NYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKNHLPN 488
            +I++N+L  + FVSNALMDMYAKCGSM++A+ VFS M+ KD+ISWNT+IGGYSKN   N
Sbjct: 423 EWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYAN 482

Query: 489 EALNLFAEMQRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNAL 548
           EAL+LF  +  E +  PD  TVAC+LPACASL+A D+GREIHGY +RNGY  D++V N+L
Sbjct: 483 EALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSL 542

Query: 549 VDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVEPDE 608
           VDMY KCG L+LA  LFD I  KDLVSWTVMIAGYGMHGFG EA+ +FNQMR +G+E DE
Sbjct: 543 VDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADE 602

Query: 609 VSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHKFIK 668
           +SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+LARTG+L+KA++FI+
Sbjct: 603 ISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIE 662

Query: 669 TMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEKWEE 728
            MPI PDATIWGALLCGCRIHHDVK+AEKVAE++FELEPENTGYYVL+ANIYAEAEKWE+
Sbjct: 663 NMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQ 722

Query: 729 VQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGY 788
           V++LRK+IGQRGL+KNPGCSWIEIKG+VNIFVAGD S P+ + IE  L+K+R++M EEGY
Sbjct: 723 VKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGY 782

Query: 789 SPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKF 848
           SP T+YAL++A+E EKE ALCGHSEKLAMA G+++   GK IRVTKNLRVCGDCHEMAKF
Sbjct: 783 SPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKF 842

Query: 849 MSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 878
           MSK T REIVLRDSNRFH FKDG+CSCRG+W
Sbjct: 843 MSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871
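The percentages on each HSP summary line are the residue counts divided by the alignment length. The sketch below parses one such line and recomputes both values. It is not part of the BLAST tooling: the function name `hsp_percentages` is hypothetical, and the pattern is written to accept either spelling of the "Positives" label.

```python
import re

STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line: str) -> dict:
    """Recompute identity and positives percentages from an HSP summary line."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "identity_reported": float(ident_pct),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "positives_reported": float(pos_pct),
    }

stats = hsp_percentages(
    "Identity = 510/811 (62.89%), Positives = 655/811 (80.76%), Query Frame = 0"
)
print(stats)
```

For the DOT4 hit, 510 identical residues over an 811-column alignment gives 62.89%, and 655 positive-scoring columns gives 80.76%, in agreement with the reported values.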

BLAST of MC03g0739 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 626.3 bits (1614), Expect = 5.1e-178
Identity = 321/807 (39.78%), Positives = 497/807 (61.59%), Query Frame = 0

Query: 70  NSARIVEFCEVGDLKNAMELLCSSGNAN--LDLETYCSVLQLCAERKSIRYGKRVHSIIE 129
           +++++   C  G L+ AM+LL S       +D + + ++++LC  +++   G +V+SI  
Sbjct: 62  SNSQLHGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIAL 121

Query: 130 SNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVESVN 189
           S+   +   LG   + M+V+ G+L +A  +F K+SE+ +F WN+++  YA  G + E++ 
Sbjct: 122 SSMSSLGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMC 181

Query: 190 LFKRMMEL-GIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLISFY 249
           L+ RM+ + G+KP+ YTF  VL+    +  + +G+ VH  + + G+     VVN+LI+ Y
Sbjct: 182 LYHRMLWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMY 241

Query: 250 FVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLATMV 309
                V+SA+ LFD +  RD+ISWN+MISGY +NG+  +G+E+F  M   SVD DL T+ 
Sbjct: 242 VKCGDVKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLT 301

Query: 310 NVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEKMD 369
           +V+ AC   G   LG+ +H+Y I    ++D  V   N+L  MY   G    A ++F +M+
Sbjct: 302 SVISACELLGDRRLGRDIHAYVITTGFAVDISVC--NSLTQMYLNAGSWREAEKLFSRME 361

Query: 370 EKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSGKI 429
            K +VSWT+MI+GY    L D AI+ +  M    V PD   V ++L ACA  G+L +G  
Sbjct: 362 RKDIVSWTTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVE 421

Query: 430 VHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKNHL 489
           +H    K  L +   V+N L++MY+KC  +  A  +F ++  K+VISW ++I G   N+ 
Sbjct: 422 LHKLAIKARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNR 481

Query: 490 PNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNAL 549
             EAL    +M+   +P+  T+   L ACA + AL  G+EIH + LR G   D ++ NAL
Sbjct: 482 CFEALIFLRQMKMTLQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNAL 541

Query: 550 VDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVEPDE 609
           +DMYV+CG +  A S F+    KD+ SW +++ GY   G G+  V++F++M  S V PDE
Sbjct: 542 LDMYVRCGRMNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDE 601

Query: 610 VSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHKFIK 669
           ++FIS+L  CS S ++ +G  +F+ M ++  + P L+HYAC+VDLL R G L +AHKFI+
Sbjct: 602 ITFISLLCGCSKSQMVRQGLMYFSKM-EDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQ 661

Query: 670 TMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEKWEE 729
            MP+ PD  +WGALL  CRIHH + + E  A+ IFEL+ ++ GYY+LL N+YA+  KW E
Sbjct: 662 KMPVTPDPAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWRE 721

Query: 730 VQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGY 789
           V K+R+ + + GL  + GCSW+E+KGKV+ F++ D   PQ K+I  +L+    KM E G 
Sbjct: 722 VAKVRRMMKENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVGL 781

Query: 790 SPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKF 849
           +  +  + ++  E  ++   CGHSE+ A+AFG++N  PG  I VTKNL +C +CH+  KF
Sbjct: 782 TKISESSSMDETEISRDEIFCGHSERKAIAFGLINTVPGMPIWVTKNLSMCENCHDTVKF 841

Query: 850 MSKNTTREIVLRDSNRFHHFKDGYCSC 874
           +SK   REI +RD+  FHHFKDG CSC
Sbjct: 842 ISKTVRREISVRDAEHFHHFKDGECSC 864

BLAST of MC03g0739 vs. ExPASy Swiss-Prot
Match: Q9M1V3 (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 607.4 bits (1565), Expect = 2.5e-172
Identity = 320/784 (40.82%), Positives = 476/784 (60.71%), Query Frame = 0

Query: 98  LDLETYCSVLQLCAERKSIRYGKRVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMI 157
           L L ++ ++L+ CA+ + IR G  +HS++   G    G +   LV MY K  DL  AR +
Sbjct: 180 LGLSSFPALLKACAKLRDIRSGSELHSLLVKLGYHSTGFIVNALVSMYAKNDDLSAARRL 239

Query: 158 FDKLSEQ-KVFLWNLMISEYAGNGNYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVAR 217
           FD   E+    LWN ++S Y+ +G  +E++ LF+ M   G  PNSYT  S L      + 
Sbjct: 240 FDGFQEKGDAVLWNSILSSYSTSGKSLETLELFREMHMTGPAPNSYTIVSALTACDGFSY 299

Query: 218 VEQGRLVHGFICKLG-FSSYNTVVNSLISFYFVSKKVRSAQKLFDELSDRDVISWNSMIS 277
            + G+ +H  + K    SS   V N+LI+ Y    K+  A+++  ++++ DV++WNS+I 
Sbjct: 300 AKLGKEIHASVLKSSTHSSELYVCNALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIK 359

Query: 278 GYVKNGLEDKGIEIFIKMLDFSVDVDLATMVNVLVACANTGTLLLGKALHSYAIKAASSL 337
           GYV+N +  + +E F  M+      D  +M +++ A      LL G  LH+Y IK     
Sbjct: 360 GYVQNLMYKEALEFFSDMIAAGHKSDEVSMTSIIAASGRLSNLLAGMELHAYVIK--HGW 419

Query: 338 DREVMFKNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDE 397
           D  +   NTL+DMYSKC       R F +M +K ++SWT++IAGY +      A+ELF +
Sbjct: 420 DSNLQVGNTLIDMYSKCNLTCYMGRAFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRD 479

Query: 398 MKSRGVVPDVYAVTSILHACAINGNLSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGS 457
           +  + +  D   + SIL A ++  ++   K +H +I +  L  ++ + N L+D+Y KC +
Sbjct: 480 VAKKRMEIDEMILGSILRASSVLKSMLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRN 539

Query: 458 MKDAQSVFSHMKGKDVISWNTMIGGYSKNHLPNEALNLFAEM-QRESKPDGTTVACILPA 517
           M  A  VF  +KGKDV+SW +MI   + N   +EA+ LF  M +     D   + CIL A
Sbjct: 540 MGYATRVFESIKGKDVVSWTSMISSSALNGNESEAVELFRRMVETGLSADSVALLCILSA 599

Query: 518 CASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSW 577
            ASL+AL++GREIH Y LR G+  +  +  A+VDMY  CG L  A+++FD I  K L+ +
Sbjct: 600 AASLSALNKGREIHCYLLRKGFCLEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQY 659

Query: 578 TVMIAGYGMHGFGNEAVDIFNQMRISGVEPDEVSFISILYACSHSGLLDEGWKFFNIMKK 637
           T MI  YGMHG G  AV++F++MR   V PD +SF+++LYACSH+GLLDEG  F  IM+ 
Sbjct: 660 TSMINAYGMHGCGKAAVELFDKMRHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEH 719

Query: 638 ECRIEPKLEHYACMVDLLARTGNLVKAHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAE 697
           E  +EP  EHY C+VD+L R   +V+A +F+K M  EP A +W ALL  CR H + ++ E
Sbjct: 720 EYELEPWPEHYVCLVDMLGRANCVVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGE 779

Query: 698 KVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKV 757
             A+R+ ELEP+N G  VL++N++AE  +W +V+K+R K+   G++K+PGCSWIE+ GKV
Sbjct: 780 IAAQRLLELEPKNPGNLVLVSNVFAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKV 839

Query: 758 NIFVAGDCSKPQAKKIELLLKKLRSKMKEE-GYSPKTRYALLNADEREKEVALCGHSEKL 817
           + F A D S P++K+I   L ++  K++ E GY   T++ L N DE EK   L GHSE++
Sbjct: 840 HKFTARDKSHPESKEIYEKLSEVTRKLEREVGYVADTKFVLHNVDEGEKVQMLHGHSERI 899

Query: 818 AMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSC 877
           A+A+G+L  P    +R+TKNLRVC DCH   K +SK   R+IV+RD+NRFHHF+ G CSC
Sbjct: 900 AIAYGLLRTPDRACLRITKNLRVCRDCHTFCKLVSKLFRRDIVMRDANRFHHFESGLCSC 959

BLAST of MC03g0739 vs. ExPASy Swiss-Prot
Match: Q9SS60 (Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H23 PE=2 SV=1)

HSP 1 Score: 605.9 bits (1561), Expect = 7.2e-172
Identity = 339/853 (39.74%), Positives = 502/853 (58.85%), Query Frame = 0

Query: 29  SHSFVFAKPKSKFSFSNSAYACTQIYPSPSQTKSYLDIELDNSARIVEFCEVGDLKNAME 88
           S  F   K   K+S      +   ++   S  K   ++ L NS  I  F + G    A+E
Sbjct: 37  SSDFFSGKLIDKYSHFREPASSLSVFRRVSPAK---NVYLWNSI-IRAFSKNGLFPEALE 96

Query: 89  LL--CSSGNANLDLETYCSVLQLCAERKSIRYGKRVHSIIESNGVVMDGILGAKLVFMYV 148
                     + D  T+ SV++ CA       G  V+  I   G   D  +G  LV MY 
Sbjct: 97  FYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMGFESDLFVGNALVDMYS 156

Query: 149 KCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVESVNLFKRMMELGIKPNSYTFSS 208
           + G L  AR +FD++  + +  WN +IS Y+ +G Y E++ ++  +    I P+S+T SS
Sbjct: 157 RMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSS 216

Query: 209 VLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLISFYFVSKKVRSAQKLFDELSDRD 268
           VL     +  V+QG+ +HGF  K G +S   V N L++ Y   ++   A+++FDE+  RD
Sbjct: 217 VLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFDEMDVRD 276

Query: 269 VISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLATMVNVLVACANTGTLLLGKALHS 328
            +S+N+MI GY+K  + ++ + +F++ LD     DL T+ +VL AC +   L L K +++
Sbjct: 277 SVSYNTMICGYLKLEMVEESVRMFLENLD-QFKPDLLTVSSVLRACGHLRDLSLAKYIYN 336

Query: 329 YAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLS 388
           Y +KA   L+  V  +N L+D+Y+KCGD+ +A  VF  M+ K  VSW S+I+GY++ G  
Sbjct: 337 YMLKAGFVLESTV--RNILIDVYAKCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDL 396

Query: 389 DGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSGKIVHNYIRKNNLETNSFVSNAL 448
             A++LF  M       D      ++       +L  GK +H+   K+ +  +  VSNAL
Sbjct: 397 MEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHSNGIKSGICIDLSVSNAL 456

Query: 449 MDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKNHLPNEALNLFAEMQR-ESKPDG 508
           +DMYAKCG + D+  +FS M   D ++WNT+I    +       L +  +M++ E  PD 
Sbjct: 457 IDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFATGLQVTTQMRKSEVVPDM 516

Query: 509 TTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDM 568
            T    LP CASLAA   G+EIH   LR GY  +  + NAL++MY KCG L  +  +F+ 
Sbjct: 517 ATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIEMYSKCGCLENSSRVFER 576

Query: 569 ILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVEPDEVSFISILYACSHSGLLDEG 628
           +  +D+V+WT MI  YGM+G G +A++ F  M  SG+ PD V FI+I+YACSHSGL+DEG
Sbjct: 577 MSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEG 636

Query: 629 WKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHKFIKTMPIEPDATIWGALLCGCR 688
              F  MK   +I+P +EHYAC+VDLL+R+  + KA +FI+ MPI+PDA+IW ++L  CR
Sbjct: 637 LACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAMPIKPDASIWASVLRACR 696

Query: 689 IHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGC 748
              D++ AE+V+ RI EL P++ GY +L +N YA   KW++V  +RK +  + + KNPG 
Sbjct: 697 TSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVSLIRKSLKDKHITKNPGY 756

Query: 749 SWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLN-ADEREKEV 808
           SWIE+   V++F +GD S PQ++ I   L+ L S M +EGY P  R    N  +E EK  
Sbjct: 757 SWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILYSLMAKEGYIPDPREVSQNLEEEEEKRR 816

Query: 809 ALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKNTTREIVLRDSNRFH 868
            +CGHSE+LA+AFG+LN  PG  ++V KNLRVCGDCHE+ K +SK   REI++RD+NRFH
Sbjct: 817 LICGHSERLAIAFGLLNTEPGTPLQVMKNLRVCGDCHEVTKLISKIVGREILVRDANRFH 876

Query: 869 HFKDGYCSCRGYW 878
            FKDG CSC+  W
Sbjct: 877 LFKDGTCSCKDRW 882

BLAST of MC03g0739 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 605.5 bits (1560), Expect = 9.4e-172
Identity = 334/816 (40.93%), Positives = 480/816 (58.82%), Query Frame = 0

Query: 114 KSIRYGKRVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKL--SEQKVFLWNL 173
           K+I   K +H  + S G++    L + L+  Y+  G L  A  +  +   S+  V+ WN 
Sbjct: 39  KTISQVKLIHQKLLSFGILTLN-LTSHLISTYISVGCLSHAVSLLRRFPPSDAGVYHWNS 98

Query: 174 MISEYAGNGNYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLG 233
           +I  Y  NG   + + LF  M  L   P++YTF  V K    ++ V  G   H      G
Sbjct: 99  LIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHALSLVTG 158

Query: 234 FSSYNTVVNSLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFI 293
           F S   V N+L++ Y   + +  A+K+FDE+S  DV+SWNS+I  Y K G     +E+F 
Sbjct: 159 FISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKVALEMFS 218

Query: 294 KML-DFSVDVDLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYS 353
           +M  +F    D  T+VNVL  CA+ GT  LGK LH +A+   S + + +   N L+DMY+
Sbjct: 219 RMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAV--TSEMIQNMFVGNCLVDMYA 278

Query: 354 KCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMK------------- 413
           KCG ++ A  VF  M  K VVSW +M+AGY + G  + A+ LF++M+             
Sbjct: 279 KCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVVTWSA 338

Query: 414 ----------------------SRGVVPDVYAVTSILHACAINGNLSSGKIVHNY----- 473
                                 S G+ P+   + S+L  CA  G L  GK +H Y     
Sbjct: 339 AISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYAIKYP 398

Query: 474 --IRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHM--KGKDVISWNTMIGGYSKNHL 533
             +RKN     + V N L+DMYAKC  +  A+++F  +  K +DV++W  MIGGYS++  
Sbjct: 399 IDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYSQHGD 458

Query: 534 PNEALNLFAEMQRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGYSK-DKYV 593
            N+AL L +EM  E   ++P+  T++C L ACASLAAL  G++IH YALRN  +    +V
Sbjct: 459 ANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVPLFV 518

Query: 594 VNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGV 653
            N L+DMY KCG +  AR +FD ++ K+ V+WT ++ GYGMHG+G EA+ IF++MR  G 
Sbjct: 519 SNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRRIGF 578

Query: 654 EPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAH 713
           + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A 
Sbjct: 579 KLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLNAAL 638

Query: 714 KFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAE 773
           + I+ MP+EP   +W A L  CRIH  V++ E  AE+I EL   + G Y LL+N+YA A 
Sbjct: 639 RLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYANAG 698

Query: 774 KWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGKVNIFVAGDCSKPQAKKIELLLKKLRSKM 833
           +W++V ++R  +  +G+KK PGCSW+E IKG    FV GD + P AK+I  +L     ++
Sbjct: 699 RWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFV-GDKTHPHAKEIYQVLLDHMQRI 758

Query: 834 KEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCH 878
           K+ GY P+T +AL + D+ EK+  L  HSEKLA+A+G+L  P G  IR+TKNLRVCGDCH
Sbjct: 759 KDIGYVPETGFALHDVDDEEKDDLLFEHSEKLALAYGILTTPQGAAIRITKNLRVCGDCH 818

BLAST of MC03g0739 vs. NCBI nr
Match: XP_022139839.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic [Momordica charantia])

HSP 1 Score: 1763 bits (4567), Expect = 0.0
Identity = 877/877 (100.00%), Positives = 877/877 (100.00%), Query Frame = 0

Query: 1   MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT 60
           MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT
Sbjct: 1   MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT 60

Query: 61  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 120
           KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK
Sbjct: 61  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 120

Query: 121 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 180
           RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG
Sbjct: 121 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 180

Query: 181 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 240
           NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN
Sbjct: 181 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 240

Query: 241 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 300
           SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV
Sbjct: 241 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 300

Query: 301 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 360
           DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR
Sbjct: 301 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 360

Query: 361 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 420
           VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN
Sbjct: 361 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 420

Query: 421 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 480
           LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG
Sbjct: 421 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 480

Query: 481 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 540
           YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK
Sbjct: 481 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 540

Query: 541 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 600
           YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS
Sbjct: 541 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 600

Query: 601 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 660
           GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK
Sbjct: 601 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 660

Query: 661 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 720
           AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 661 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 720

Query: 721 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 780
           AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK
Sbjct: 721 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 780

Query: 781 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 840
           MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 781 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 840

Query: 841 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW
Sbjct: 841 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877

BLAST of MC03g0739 vs. NCBI nr
Match: XP_038893908.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic [Benincasa hispida])

HSP 1 Score: 1572 bits (4071), Expect = 0.0
Identity = 774/879 (88.05%), Positives = 829/879 (94.31%), Query Frame = 0

Query: 1   MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT 60
           MLLVAK P TFWLSP G+D  GL++LKF  SFVF KP SKFSFSNSA+ACT+ Y    +T
Sbjct: 1   MLLVAKPPTTFWLSPVGYDHRGLLSLKFRQSFVFVKPNSKFSFSNSAHACTEGYTPALET 60

Query: 61  K--SYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRY 120
           K  SY+D+ELDNS +IVEFCE+GDLKNAMELLC S N+  DL+TYCS+LQLCAE+KSIR 
Sbjct: 61  KRKSYIDVELDNSHKIVEFCEMGDLKNAMELLCGSQNSYFDLDTYCSILQLCAEQKSIRD 120

Query: 121 GKRVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAG 180
           G+RVHSIIESNGV++DGILG KLVFMYVKCGDLKE R+IFDKLSE KVFLWNLMISEY+G
Sbjct: 121 GRRVHSIIESNGVMIDGILGVKLVFMYVKCGDLKEGRVIFDKLSENKVFLWNLMISEYSG 180

Query: 181 NGNYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTV 240
           NGNY ES+NLFK+M+ELGIKPNSYTFSSVLKCLAAVARVE+GR VHG ICKLGF+SYNTV
Sbjct: 181 NGNYGESINLFKQMLELGIKPNSYTFSSVLKCLAAVARVEEGRQVHGLICKLGFNSYNTV 240

Query: 241 VNSLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSV 300
           VNSLISFYFVS+KVR AQKLFDEL+DRDVISWNSMISGYVKNGLEDKGIEIFIKML FS+
Sbjct: 241 VNSLISFYFVSRKVRDAQKLFDELTDRDVISWNSMISGYVKNGLEDKGIEIFIKMLAFSI 300

Query: 301 DVDLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSA 360
           D DLATMVNVLVACAN GTLLLGKALHSY IKAA+ L++EVMF NTLLDMYSKCG LNSA
Sbjct: 301 DFDLATMVNVLVACANMGTLLLGKALHSYTIKAAA-LEKEVMFNNTLLDMYSKCGALNSA 360

Query: 361 IRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAIN 420
           IRVFE+MDEKTVVSWTSMI GYVREGLSDGAI+LFDEMKS+G++PDVYAVTSILHACAIN
Sbjct: 361 IRVFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSKGILPDVYAVTSILHACAIN 420

Query: 421 GNLSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMI 480
           GNL+SGKIVHNYIR+N LETNSFVSNALMDMYAK GSMKDA  VFSHMK KDVISWNTMI
Sbjct: 421 GNLNSGKIVHNYIRENYLETNSFVSNALMDMYAKSGSMKDAHDVFSHMKRKDVISWNTMI 480

Query: 481 GGYSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSK 540
           GGYSKN LPNEALNLFAEMQRE KPD TTVACILPACASLAALDRGREIHGYALRNGYSK
Sbjct: 481 GGYSKNRLPNEALNLFAEMQRELKPDSTTVACILPACASLAALDRGREIHGYALRNGYSK 540

Query: 541 DKYVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMR 600
           DKYVVNALVDMYVKCGLLVLARSLFDMI +KDLVSWTVMIAGYGMHGFG+EA++ FNQMR
Sbjct: 541 DKYVVNALVDMYVKCGLLVLARSLFDMIFNKDLVSWTVMIAGYGMHGFGSEAINTFNQMR 600

Query: 601 ISGVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNL 660
           I+G+EPDEVSFISILYACSHSGLLDEGWKF+NIMKKEC+IEP LEHYACMVDLLARTGNL
Sbjct: 601 IAGIEPDEVSFISILYACSHSGLLDEGWKFYNIMKKECQIEPNLEHYACMVDLLARTGNL 660

Query: 661 VKAHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIY 720
           VKAHKFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAE+IFELEPENTGYYVLLANIY
Sbjct: 661 VKAHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAEQIFELEPENTGYYVLLANIY 720

Query: 721 AEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLR 780
           AEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLK+LR
Sbjct: 721 AEAEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLR 780

Query: 781 SKMKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCG 840
           SKMKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCG
Sbjct: 781 SKMKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCG 840

Query: 841 DCHEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           DCHEMAKFMSK+ +REI+LRDS+RFH+FKDG CSCRGYW
Sbjct: 841 DCHEMAKFMSKSASREIILRDSSRFHYFKDGNCSCRGYW 878

BLAST of MC03g0739 vs. NCBI nr
Match: XP_022927496.1 (pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1558 bits (4033), Expect = 0.0
Identity = 762/877 (86.89%), Positives = 825/877 (94.07%), Query Frame = 0

Query: 2   LLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIY-PSPSQT 61
           LLVAK+PPTFWLS  G+D  GLVNLKF  S  F KP S+ SFSNSA+A T+ Y P+  + 
Sbjct: 3   LLVAKAPPTFWLSSAGYDHRGLVNLKFRQSSAFVKPNSQSSFSNSAHASTESYTPTALEA 62

Query: 62  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 121
           K+Y+D+EL+NS +IV+FCEVGDLKNA+ELLCSS N+NLDL+TYC +LQLCAE+KSIR G+
Sbjct: 63  KNYIDVELNNSRKIVKFCEVGDLKNAIELLCSSQNSNLDLDTYCVILQLCAEQKSIRDGR 122

Query: 122 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 181
           RVHSIIESN VV+DGILGAKLVFMYVKCGDL+E RMIFDKLSE+KVFLWNLMISEY+G+G
Sbjct: 123 RVHSIIESNEVVIDGILGAKLVFMYVKCGDLREGRMIFDKLSEKKVFLWNLMISEYSGSG 182

Query: 182 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 241
           NY ES+NLFKRM+ELGI PNSYTFSSVLKC AAVARVE+G  VHG ICKLGF+SYN VVN
Sbjct: 183 NYGESINLFKRMLELGINPNSYTFSSVLKCFAAVARVEEGMQVHGLICKLGFTSYNAVVN 242

Query: 242 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 301
           SLISFYFV +KVRSA+KLFDE+SDRDVISWNSMISGYVKNGLED+GIEIF++ML FSVDV
Sbjct: 243 SLISFYFVGRKVRSARKLFDEMSDRDVISWNSMISGYVKNGLEDRGIEIFLRMLVFSVDV 302

Query: 302 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 361
           DLATMVNVLVACAN GTL LGK LHSY+IKAA++LDR+VMF NTLLDMYSKCGDLNSAIR
Sbjct: 303 DLATMVNVLVACANMGTLSLGKTLHSYSIKAAAALDRDVMFNNTLLDMYSKCGDLNSAIR 362

Query: 362 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 421
           VFE+MDEKTVVSWTS+IAGYVREGLSDGAIELF+EMKSRGV+PDVYAV SILHACA NGN
Sbjct: 363 VFERMDEKTVVSWTSIIAGYVREGLSDGAIELFNEMKSRGVLPDVYAVASILHACATNGN 422

Query: 422 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 481
           L+SGK +HNYIR+NNLETNSFVSNALMDMYAKCGSM+DA  VFSHMK KDVISWNTMIGG
Sbjct: 423 LNSGKSLHNYIRENNLETNSFVSNALMDMYAKCGSMRDAADVFSHMKRKDVISWNTMIGG 482

Query: 482 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 541
           YSKN LPNEAL+LFAEMQRESKPDGTTVACILPACASLAALD+GREIHGYALRNGYSKDK
Sbjct: 483 YSKNRLPNEALSLFAEMQRESKPDGTTVACILPACASLAALDKGREIHGYALRNGYSKDK 542

Query: 542 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 601
           +V NALVDMYVKCGLLVLARSLFDMIL+KDLVSWTVMIAGYGMHG+G+EAV  FNQMRI+
Sbjct: 543 FVANALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGYGSEAVSAFNQMRIA 602

Query: 602 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 661
           G+EPDEVSFISILYACSHSGLLDEGW FFNIMKKEC+IEP LEHYACMVDLLARTGNL +
Sbjct: 603 GIEPDEVSFISILYACSHSGLLDEGWNFFNIMKKECQIEPNLEHYACMVDLLARTGNLAR 662

Query: 662 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 721
           AHKFIKTMPI+PDATIWGALLCGCRIHHDVK+AEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 663 AHKFIKTMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAE 722

Query: 722 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 781
           AEKWEEVQKLR +IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL +LRSK
Sbjct: 723 AEKWEEVQKLRTRIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLTRLRSK 782

Query: 782 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 841
           MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 783 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 842

Query: 842 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMA+FMSK+T REIVLRDS+RFHHFKDGYCSCRGYW
Sbjct: 843 HEMARFMSKDTKREIVLRDSSRFHHFKDGYCSCRGYW 879
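
Note: the percentages on the summary line of each hit are the match counts divided by the alignment length (for the hit above, 762/877 and 825/877). A minimal Python sketch for recomputing them is given below. It assumes the "label = count/length (percent%)" layout used on this page; the function name parse_hsp_stats is hypothetical and not part of any BLAST tool.

```python
import re

# Matches "<label> = <count>/<length> (<percent>%)". The label is captured
# as written, so the sketch does not depend on how a label is spelled.
STAT_RE = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def parse_hsp_stats(line):
    """Return (label, count, length, reported_pct, recomputed_pct) tuples."""
    stats = []
    for label, count, length, reported in STAT_RE.findall(line):
        # Recompute the percentage from the raw counts, to two decimals.
        recomputed = round(100 * int(count) / int(length), 2)
        stats.append((label, int(count), int(length), float(reported), recomputed))
    return stats


# Summary line of the Cucurbita moschata hit (XP_022927496.1).
example = "Identity = 762/877 (86.89%), Positives = 825/877 (94.07%), Query Frame = 0"
for label, count, length, reported, recomputed in parse_hsp_stats(example):
    print(f"{label}: {count}/{length}, reported {reported}%, recomputed {recomputed}%")
```

Fields without a count/length pair, such as "Query Frame = 0", are skipped by the pattern.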

BLAST of MC03g0739 vs. NCBI nr
Match: KAG7019566.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1556 bits (4028), Expect = 0.0
Identity = 760/877 (86.66%), Positives = 825/877 (94.07%), Query Frame = 0

Query: 2   LLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIY-PSPSQT 61
           LLVAK+PPTFWLS  G+D  GLVNLKF  S  F KP S+ SFSNSA+A T+ Y P+  + 
Sbjct: 61  LLVAKAPPTFWLSSAGYDHRGLVNLKFRQSSAFVKPNSQSSFSNSAHAFTESYTPTALEA 120

Query: 62  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 121
           K+Y+D+EL+NS +IV+FCEVGDLKNA+ELLCSS N+NLDL+TYC +LQLCAE+KSIR G+
Sbjct: 121 KNYIDVELNNSRKIVKFCEVGDLKNAIELLCSSQNSNLDLDTYCVILQLCAEQKSIRDGR 180

Query: 122 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 181
           RVHSIIESN VV+DGILGAKL+FMYVKCGDL+E RMIFDKLSE+KVFLWNLMISEY+G+G
Sbjct: 181 RVHSIIESNEVVIDGILGAKLIFMYVKCGDLREGRMIFDKLSEKKVFLWNLMISEYSGSG 240

Query: 182 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 241
           NY ES+NLFKRM+ELGI PNSYTFSSVLKC AAVARVE+G  VHG ICKLGF+SYN VVN
Sbjct: 241 NYGESINLFKRMLELGINPNSYTFSSVLKCFAAVARVEEGMQVHGLICKLGFTSYNAVVN 300

Query: 242 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 301
           SLISFYFV +KVRSA+KLFDE+SDRDVISWNSMISGYVKNGLED+GIEIF++ML FSVDV
Sbjct: 301 SLISFYFVGRKVRSARKLFDEMSDRDVISWNSMISGYVKNGLEDRGIEIFLRMLVFSVDV 360

Query: 302 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 361
           DLATMVNVLVACAN GTL LGK LHSY+IKAA++LDR+VMF NTLLDMYSKCGDLNSAIR
Sbjct: 361 DLATMVNVLVACANMGTLSLGKTLHSYSIKAAAALDRDVMFNNTLLDMYSKCGDLNSAIR 420

Query: 362 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 421
           VFE+MDEKTVVSWTSMIAGYVREGLSDGAIELF+EMKSRGV+PDVYAV SILHACA NGN
Sbjct: 421 VFERMDEKTVVSWTSMIAGYVREGLSDGAIELFNEMKSRGVLPDVYAVASILHACATNGN 480

Query: 422 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 481
           L+SGK +HNYIR+NNLETNSFVSNALMDMYAKCGSM+DA  VFSHMK KDVISWNTMIGG
Sbjct: 481 LNSGKSLHNYIRENNLETNSFVSNALMDMYAKCGSMRDAADVFSHMKRKDVISWNTMIGG 540

Query: 482 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 541
           YSKN LPNEAL+LFAEMQRESKPDGTTVACILPACASLAALD+GREIHGYALRNGYSKDK
Sbjct: 541 YSKNRLPNEALSLFAEMQRESKPDGTTVACILPACASLAALDKGREIHGYALRNGYSKDK 600

Query: 542 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 601
           +V NALVDMYVKCGLLVLARSLFDMIL+KDLVSWTVMIAGYGMHG+G+EAV  FNQMRI+
Sbjct: 601 FVANALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGYGSEAVSAFNQMRIA 660

Query: 602 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 661
           G+EPDEVSFISILYACSHSGLLDEGW FFNIMKKEC+IEP LEHYACMVDLLARTGNL +
Sbjct: 661 GIEPDEVSFISILYACSHSGLLDEGWNFFNIMKKECQIEPNLEHYACMVDLLARTGNLAR 720

Query: 662 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 721
           AHKFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 721 AHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAE 780

Query: 722 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 781
           AEKWEEVQKLR +IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL +LR+K
Sbjct: 781 AEKWEEVQKLRTRIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLTRLRNK 840

Query: 782 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 841
           MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 841 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 900

Query: 842 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMA+FMSK+T REIVLRDS+RFHHFKDGYCSCRGYW
Sbjct: 901 HEMARFMSKDTKREIVLRDSSRFHHFKDGYCSCRGYW 937

BLAST of MC03g0739 vs. NCBI nr
Match: KAG6583948.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1555 bits (4025), Expect = 0.0
Identity = 761/877 (86.77%), Positives = 824/877 (93.96%), Query Frame = 0

Query: 2   LLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIY-PSPSQT 61
           LLVAK+PPTFWLS  G+D  GLVNLKF  S  F KP S+ SFSNSA+A T+ Y P+  + 
Sbjct: 3   LLVAKAPPTFWLSSAGYDHRGLVNLKFRQSSAFVKPNSQSSFSNSAHASTESYTPTALEA 62

Query: 62  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 121
           K+Y+D+EL+NS +IV+FCEVGDLKNA+ELLCSS N+NLDL+TYC +LQLCAE+KSIR G+
Sbjct: 63  KNYIDVELNNSRKIVKFCEVGDLKNAIELLCSSQNSNLDLDTYCVILQLCAEQKSIRDGR 122

Query: 122 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 181
           RVHSIIESN VV+DGILGAKLVFMYVKCGDL+E RMIFDKLSE+KVFLWNLMISEY+G+G
Sbjct: 123 RVHSIIESNEVVIDGILGAKLVFMYVKCGDLREGRMIFDKLSEKKVFLWNLMISEYSGSG 182

Query: 182 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 241
           NY ES+NLFKRM+ELGI PNSYTFSSVLKC AAVARVE+G  VHG ICKLGF+SYN VVN
Sbjct: 183 NYGESINLFKRMLELGINPNSYTFSSVLKCFAAVARVEEGMQVHGLICKLGFTSYNAVVN 242

Query: 242 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 301
           SLISFYFV +KVRSA+KLFDE+SDRDVISWNSMISGYVKNGLED+GIEIF++ML FSVDV
Sbjct: 243 SLISFYFVGRKVRSARKLFDEMSDRDVISWNSMISGYVKNGLEDRGIEIFLRMLVFSVDV 302

Query: 302 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 361
           DLATMVNVLVACAN GTL LGK LHSY+IKAA++LDR+VMF NTLLDMYSKCGDLNSAIR
Sbjct: 303 DLATMVNVLVACANMGTLSLGKTLHSYSIKAAAALDRDVMFNNTLLDMYSKCGDLNSAIR 362

Query: 362 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 421
           VFE+MDEKTVVSWTSMIAGYVREGLSDGAIELF+EMKSRGV+PDVYAV SILHACA NGN
Sbjct: 363 VFERMDEKTVVSWTSMIAGYVREGLSDGAIELFNEMKSRGVLPDVYAVASILHACATNGN 422

Query: 422 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 481
           L+SGK +HNYIR+NNLETNSFVSNALMDMYAKCGSM+DA  VFSHMK KDVISWNTMIGG
Sbjct: 423 LNSGKSLHNYIRENNLETNSFVSNALMDMYAKCGSMRDAADVFSHMKRKDVISWNTMIGG 482

Query: 482 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 541
           YSKN LPNEAL+LFAEMQRESKPDGTTVACILPACASLAALD+GREIHGYALRNGYSKDK
Sbjct: 483 YSKNRLPNEALSLFAEMQRESKPDGTTVACILPACASLAALDKGREIHGYALRNGYSKDK 542

Query: 542 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 601
           +V NALVDMYVKCGLLVLARSLF MIL+KDLVSWTVMIAGYGMHG+G+EAV  FNQMRI+
Sbjct: 543 FVANALVDMYVKCGLLVLARSLFAMILNKDLVSWTVMIAGYGMHGYGSEAVSAFNQMRIA 602

Query: 602 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 661
           G+EPDEVSFISILYACSHSGLLDEGW FFNIMKKEC+IEP LEHYACMVDLLARTGNL +
Sbjct: 603 GIEPDEVSFISILYACSHSGLLDEGWNFFNIMKKECQIEPNLEHYACMVDLLARTGNLAR 662

Query: 662 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 721
           AHKFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 663 AHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAE 722

Query: 722 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 781
           AEKWEEVQKLR +IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL +LRSK
Sbjct: 723 AEKWEEVQKLRTRIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLTRLRSK 782

Query: 782 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 841
           MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 783 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 842

Query: 842 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMA+FMSK+T REIVLRDS+RFHHFKDGYCSCRGYW
Sbjct: 843 HEMARFMSKDTKREIVLRDSSRFHHFKDGYCSCRGYW 879

BLAST of MC03g0739 vs. ExPASy TrEMBL
Match: A0A6J1CE34 (pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111010653 PE=3 SV=1)

HSP 1 Score: 1763 bits (4567), Expect = 0.0
Identity = 877/877 (100.00%), Positives = 877/877 (100.00%), Query Frame = 0

Query: 1   MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT 60
           MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT
Sbjct: 1   MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT 60

Query: 61  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 120
           KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK
Sbjct: 61  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 120

Query: 121 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 180
           RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG
Sbjct: 121 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 180

Query: 181 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 240
           NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN
Sbjct: 181 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 240

Query: 241 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 300
           SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV
Sbjct: 241 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 300

Query: 301 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 360
           DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR
Sbjct: 301 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 360

Query: 361 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 420
           VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN
Sbjct: 361 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 420

Query: 421 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 480
           LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG
Sbjct: 421 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 480

Query: 481 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 540
           YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK
Sbjct: 481 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 540

Query: 541 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 600
           YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS
Sbjct: 541 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 600

Query: 601 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 660
           GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK
Sbjct: 601 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 660

Query: 661 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 720
           AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 661 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 720

Query: 721 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 780
           AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK
Sbjct: 721 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 780

Query: 781 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 840
           MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 781 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 840

Query: 841 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW
Sbjct: 841 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877

BLAST of MC03g0739 vs. ExPASy TrEMBL
Match: A0A6J1EHU9 (pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111434304 PE=3 SV=1)

HSP 1 Score: 1558 bits (4033), Expect = 0.0
Identity = 762/877 (86.89%), Positives = 825/877 (94.07%), Query Frame = 0

Query: 2   LLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIY-PSPSQT 61
           LLVAK+PPTFWLS  G+D  GLVNLKF  S  F KP S+ SFSNSA+A T+ Y P+  + 
Sbjct: 3   LLVAKAPPTFWLSSAGYDHRGLVNLKFRQSSAFVKPNSQSSFSNSAHASTESYTPTALEA 62

Query: 62  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 121
           K+Y+D+EL+NS +IV+FCEVGDLKNA+ELLCSS N+NLDL+TYC +LQLCAE+KSIR G+
Sbjct: 63  KNYIDVELNNSRKIVKFCEVGDLKNAIELLCSSQNSNLDLDTYCVILQLCAEQKSIRDGR 122

Query: 122 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 181
           RVHSIIESN VV+DGILGAKLVFMYVKCGDL+E RMIFDKLSE+KVFLWNLMISEY+G+G
Sbjct: 123 RVHSIIESNEVVIDGILGAKLVFMYVKCGDLREGRMIFDKLSEKKVFLWNLMISEYSGSG 182

Query: 182 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 241
           NY ES+NLFKRM+ELGI PNSYTFSSVLKC AAVARVE+G  VHG ICKLGF+SYN VVN
Sbjct: 183 NYGESINLFKRMLELGINPNSYTFSSVLKCFAAVARVEEGMQVHGLICKLGFTSYNAVVN 242

Query: 242 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 301
           SLISFYFV +KVRSA+KLFDE+SDRDVISWNSMISGYVKNGLED+GIEIF++ML FSVDV
Sbjct: 243 SLISFYFVGRKVRSARKLFDEMSDRDVISWNSMISGYVKNGLEDRGIEIFLRMLVFSVDV 302

Query: 302 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 361
           DLATMVNVLVACAN GTL LGK LHSY+IKAA++LDR+VMF NTLLDMYSKCGDLNSAIR
Sbjct: 303 DLATMVNVLVACANMGTLSLGKTLHSYSIKAAAALDRDVMFNNTLLDMYSKCGDLNSAIR 362

Query: 362 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 421
           VFE+MDEKTVVSWTS+IAGYVREGLSDGAIELF+EMKSRGV+PDVYAV SILHACA NGN
Sbjct: 363 VFERMDEKTVVSWTSIIAGYVREGLSDGAIELFNEMKSRGVLPDVYAVASILHACATNGN 422

Query: 422 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 481
           L+SGK +HNYIR+NNLETNSFVSNALMDMYAKCGSM+DA  VFSHMK KDVISWNTMIGG
Sbjct: 423 LNSGKSLHNYIRENNLETNSFVSNALMDMYAKCGSMRDAADVFSHMKRKDVISWNTMIGG 482

Query: 482 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 541
           YSKN LPNEAL+LFAEMQRESKPDGTTVACILPACASLAALD+GREIHGYALRNGYSKDK
Sbjct: 483 YSKNRLPNEALSLFAEMQRESKPDGTTVACILPACASLAALDKGREIHGYALRNGYSKDK 542

Query: 542 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 601
           +V NALVDMYVKCGLLVLARSLFDMIL+KDLVSWTVMIAGYGMHG+G+EAV  FNQMRI+
Sbjct: 543 FVANALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGYGSEAVSAFNQMRIA 602

Query: 602 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 661
           G+EPDEVSFISILYACSHSGLLDEGW FFNIMKKEC+IEP LEHYACMVDLLARTGNL +
Sbjct: 603 GIEPDEVSFISILYACSHSGLLDEGWNFFNIMKKECQIEPNLEHYACMVDLLARTGNLAR 662

Query: 662 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 721
           AHKFIKTMPI+PDATIWGALLCGCRIHHDVK+AEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 663 AHKFIKTMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAE 722

Query: 722 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 781
           AEKWEEVQKLR +IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL +LRSK
Sbjct: 723 AEKWEEVQKLRTRIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLTRLRSK 782

Query: 782 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 841
           MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 783 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 842

Query: 842 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMA+FMSK+T REIVLRDS+RFHHFKDGYCSCRGYW
Sbjct: 843 HEMARFMSKDTKREIVLRDSSRFHHFKDGYCSCRGYW 879

BLAST of MC03g0739 vs. ExPASy TrEMBL
Match: A0A6J1KNK7 (pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111495126 PE=3 SV=1)

HSP 1 Score: 1550 bits (4012), Expect = 0.0
Identity = 758/876 (86.53%), Positives = 824/876 (94.06%), Query Frame = 0

Query: 2   LLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIY-PSPSQT 61
           LLVAK+PPTFWLS  G+D  GLVNLKF  S  F KP S+ SFSNSAYA T+ Y P+  + 
Sbjct: 3   LLVAKAPPTFWLSSAGYDHRGLVNLKFRQSSAFVKPNSQSSFSNSAYASTESYTPAALEA 62

Query: 62  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 121
           K+Y+D EL+NS +IV+FCEVGDLKNA+ELLCSS N+NLDL+TYC +LQLCAE+KSIR G+
Sbjct: 63  KNYIDAELNNSRKIVKFCEVGDLKNAIELLCSSQNSNLDLDTYCIILQLCAEQKSIRDGR 122

Query: 122 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 181
           RVHSIIESN VV+DGILGAKLVFMYVKCGDL+E RMIFDKLSE+KVFLWNLMISEY+G+G
Sbjct: 123 RVHSIIESNEVVIDGILGAKLVFMYVKCGDLREGRMIFDKLSEKKVFLWNLMISEYSGSG 182

Query: 182 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 241
           NY ES+NLFKRM+ELGI PNSYTFSSVLKC AAV RVE+GR VHG ICKLGF+SYN VVN
Sbjct: 183 NYGESINLFKRMLELGINPNSYTFSSVLKCFAAVVRVEEGRQVHGLICKLGFTSYNAVVN 242

Query: 242 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 301
           SLISFYFV +KVRSA+KLFDE+SDRDVISWNSMISGYVKNGLED+GIEIF++ML FSVDV
Sbjct: 243 SLISFYFVGRKVRSARKLFDEMSDRDVISWNSMISGYVKNGLEDRGIEIFLRMLVFSVDV 302

Query: 302 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 361
           DLATMVNVLVACAN GTL LGK LHSY+IKAA++LDR+VMF NTLLDMYSKCGDLNSAIR
Sbjct: 303 DLATMVNVLVACANMGTLSLGKTLHSYSIKAAAALDRDVMFNNTLLDMYSKCGDLNSAIR 362

Query: 362 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 421
           VFE+MDEKTVVSWTSMIAGYVREGLSDGAIELF+EMKSRGV+PDVYAV SILHACAINGN
Sbjct: 363 VFERMDEKTVVSWTSMIAGYVREGLSDGAIELFNEMKSRGVLPDVYAVASILHACAINGN 422

Query: 422 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 481
           L+SGK +HNYI++NNLETNSFVSNALMDMYAKCGSMKDA  VFSH+K KDVISWNTMIGG
Sbjct: 423 LNSGKSLHNYIKENNLETNSFVSNALMDMYAKCGSMKDAADVFSHVKRKDVISWNTMIGG 482

Query: 482 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 541
           YSKN LPNEAL+LFAEMQRESKPDGTTVACILPACASLAALD+GREIHGYALRNGYS+DK
Sbjct: 483 YSKNRLPNEALSLFAEMQRESKPDGTTVACILPACASLAALDKGREIHGYALRNGYSRDK 542

Query: 542 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 601
           +V NALVDMYVKCGLLVLARSLFDMIL+KDLVSWTVMIAGYGMHG+G+EAV+ FNQMRI+
Sbjct: 543 FVANALVDMYVKCGLLVLARSLFDMILNKDLVSWTVMIAGYGMHGYGSEAVNAFNQMRIA 602

Query: 602 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 661
           G+EPDEVSFISILYACSHSGLLDEGW FF IMKKEC+IEPKLEHYACMVDLLARTGNL +
Sbjct: 603 GIEPDEVSFISILYACSHSGLLDEGWNFFTIMKKECQIEPKLEHYACMVDLLARTGNLAR 662

Query: 662 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 721
           AHKFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERIFELEPENTGYYVL+ANIYAE
Sbjct: 663 AHKFIETMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLMANIYAE 722

Query: 722 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 781
           AEKWEEVQKLR +IG+ GLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLL +LRSK
Sbjct: 723 AEKWEEVQKLRTRIGRHGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLTRLRSK 782

Query: 782 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 841
           MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 783 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 842

Query: 842 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGY 876
           HEMA+FMSK+T REIVLRDS+RFHHFKDGYCSCRGY
Sbjct: 843 HEMARFMSKDTKREIVLRDSSRFHHFKDGYCSCRGY 878

BLAST of MC03g0739 vs. ExPASy TrEMBL
Match: A0A5A7UPC9 (Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold96G002420 PE=3 SV=1)

HSP 1 Score: 1545 bits (4000), Expect = 0.0
Identity = 759/877 (86.55%), Positives = 817/877 (93.16%), Query Frame = 0

Query: 1   MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT 60
           MLL AK+P TFWLSP GHD  G VNLKF  SF+FAKP SK SFS+ AYA         +T
Sbjct: 2   MLLTAKAPVTFWLSPAGHDHRGSVNLKFRQSFLFAKPNSKLSFSSLAYAPAM------ET 61

Query: 61  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 120
           KSY+D+ELD+S +IVEFCEVGDLKNAMELLCSS N+N DL+ +CS+LQLCAERKSIR G+
Sbjct: 62  KSYMDVELDSSRKIVEFCEVGDLKNAMELLCSSQNSNFDLDAFCSILQLCAERKSIRDGR 121

Query: 121 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 180
           RVHSIIES+GV++DGILG KLVFMYVKCGDLKE RMIFDKLSE KVF+WNLMISEY GNG
Sbjct: 122 RVHSIIESSGVIIDGILGVKLVFMYVKCGDLKEGRMIFDKLSESKVFIWNLMISEYLGNG 181

Query: 181 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 240
           NY ES+NLFK+M+ELGIKPNSYTFSSVLKC AAVA VE+GR VHG I KLG++SYNTVVN
Sbjct: 182 NYGESINLFKQMLELGIKPNSYTFSSVLKCFAAVASVEEGRQVHGLIYKLGYNSYNTVVN 241

Query: 241 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 300
           SLISFYFV +KVR AQKLFDEL+DRDVISWNSMISGYVKNGL+D+GIEIFIKML F V++
Sbjct: 242 SLISFYFVGRKVRCAQKLFDELTDRDVISWNSMISGYVKNGLDDRGIEIFIKMLVFGVEI 301

Query: 301 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 360
           DLATMVNVLVACANTGTLL GK LHSY+IKAA+ LDREV F NTLLDMYSKCGDLNSAIR
Sbjct: 302 DLATMVNVLVACANTGTLLFGKVLHSYSIKAAA-LDREVRFNNTLLDMYSKCGDLNSAIR 361

Query: 361 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 420
           VFE+MDEKTVVSWTSMI GYVREGLSDGAI+LFDEMKSRGVVPDVYAVTSILHACAINGN
Sbjct: 362 VFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGN 421

Query: 421 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 480
           L SG+IVH+YIR+NNLETNSFVSNAL DMYAKCGSMKDA  VFSHMK KDVISWNTMIGG
Sbjct: 422 LKSGQIVHDYIRENNLETNSFVSNALTDMYAKCGSMKDAHDVFSHMKKKDVISWNTMIGG 481

Query: 481 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 540
           YSKN LPNEAL LFAEMQ ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYS+DK
Sbjct: 482 YSKNRLPNEALTLFAEMQSESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSEDK 541

Query: 541 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 600
           YVVNAL+DMYVKCGLLVLARS FDMIL+KDLVSWTVMIAGYGMHGFG+EA++ FNQMR++
Sbjct: 542 YVVNALLDMYVKCGLLVLARSFFDMILNKDLVSWTVMIAGYGMHGFGSEAINTFNQMRMT 601

Query: 601 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 660
           G++PDEVSFISILYACSHSGLLDEGWK F+IMKKEC+IEP LEHYACMVDLLARTGNLVK
Sbjct: 602 GIKPDEVSFISILYACSHSGLLDEGWKIFSIMKKECQIEPNLEHYACMVDLLARTGNLVK 661

Query: 661 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 720
           AHKFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 662 AHKFIQTMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAE 721

Query: 721 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 780
           AEKWEEVQKLRK+IGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLK+LRSK
Sbjct: 722 AEKWEEVQKLRKRIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSK 781

Query: 781 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 840
           MKEEGYSPKT YALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 782 MKEEGYSPKTTYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 841

Query: 841 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMAKFMSK+ +REI+LRDS+RFHHFKDG CSCRG+W
Sbjct: 842 HEMAKFMSKSVSREIILRDSSRFHHFKDGSCSCRGFW 871

BLAST of MC03g0739 vs. ExPASy TrEMBL
Match: A0A1S3B857 (pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487046 PE=3 SV=1)

HSP 1 Score: 1544 bits (3997), Expect = 0.0
Identity = 758/877 (86.43%), Positives = 817/877 (93.16%), Query Frame = 0

Query: 1   MLLVAKSPPTFWLSPTGHDRHGLVNLKFSHSFVFAKPKSKFSFSNSAYACTQIYPSPSQT 60
           MLL AK+P TFWLSP GHD  G VNLKF  SF+FAKP SK SFS+ AYA         +T
Sbjct: 2   MLLTAKAPVTFWLSPAGHDHRGSVNLKFRQSFLFAKPNSKLSFSSLAYAPAM------ET 61

Query: 61  KSYLDIELDNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGK 120
           KSY+D+ELD+S +IVEFCEVGDLKNAMELLCSS N+N DL+ +CS+LQLCAERKSIR G+
Sbjct: 62  KSYMDVELDSSRKIVEFCEVGDLKNAMELLCSSQNSNFDLDAFCSILQLCAERKSIRDGR 121

Query: 121 RVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNG 180
           RVHSIIES+GV++DGILG KLVFMYVKCGDLKE RMIFDKLSE KVF+WNLMISEY GNG
Sbjct: 122 RVHSIIESSGVIIDGILGVKLVFMYVKCGDLKEGRMIFDKLSESKVFIWNLMISEYLGNG 181

Query: 181 NYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVN 240
           NY ES+NLFK+M+ELGIKPNSYTFSSVLKC AAVA VE+GR VHG I KLG++SYNTVVN
Sbjct: 182 NYGESINLFKQMLELGIKPNSYTFSSVLKCFAAVASVEEGRQVHGLIYKLGYNSYNTVVN 241

Query: 241 SLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDV 300
           SLISFYFV +KVR AQKLFDEL+DRDVISWNSMISGYVKNGL+D+GIEIFIKML F V++
Sbjct: 242 SLISFYFVGRKVRCAQKLFDELTDRDVISWNSMISGYVKNGLDDRGIEIFIKMLVFGVEI 301

Query: 301 DLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIR 360
           DLATMVNVLVACANTGTLL GK LHSY+IKAA+ LDREV F NTLLDMYSKCGDLNSAIR
Sbjct: 302 DLATMVNVLVACANTGTLLFGKVLHSYSIKAAA-LDREVRFNNTLLDMYSKCGDLNSAIR 361

Query: 361 VFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGN 420
           VFE+MDEKTVVSWTSMI GYVREGLSDGAI+LFDEMKSRGVVPDVYAVTSILHACAINGN
Sbjct: 362 VFERMDEKTVVSWTSMITGYVREGLSDGAIKLFDEMKSRGVVPDVYAVTSILHACAINGN 421

Query: 421 LSSGKIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGG 480
           L SG+IVH+YIR+NNLETNSFVSNAL DMYAKCGSMKDA  VFSHMK KDVISWNTMIGG
Sbjct: 422 LKSGQIVHDYIRENNLETNSFVSNALTDMYAKCGSMKDAHDVFSHMKKKDVISWNTMIGG 481

Query: 481 YSKNHLPNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDK 540
           YSKN LPNEAL LFAEMQ ESKPDGTTVACILPACASLAALDRGREIHGYALRNGYS+DK
Sbjct: 482 YSKNRLPNEALTLFAEMQSESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSEDK 541

Query: 541 YVVNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRIS 600
           YVVNAL+DMYVKCGLLVLARS FDMIL+KDLVSWTVMIAGYGMHGFG+EA++ FNQMR++
Sbjct: 542 YVVNALLDMYVKCGLLVLARSFFDMILNKDLVSWTVMIAGYGMHGFGSEAINTFNQMRMT 601

Query: 601 GVEPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVK 660
           G++PDEVSFISILYACSHSGLLDEGWK F+IMKKEC+IEP LEHYACMVDLLARTGNLVK
Sbjct: 602 GIKPDEVSFISILYACSHSGLLDEGWKIFSIMKKECQIEPNLEHYACMVDLLARTGNLVK 661

Query: 661 AHKFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAE 720
           AHKFI+TMPI+PDATIWGALLCGCRIHHDVK+AEKVAERIFELEPENTGYYVLLANIYAE
Sbjct: 662 AHKFIQTMPIKPDATIWGALLCGCRIHHDVKLAEKVAERIFELEPENTGYYVLLANIYAE 721

Query: 721 AEKWEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSK 780
           AEKWEEVQKLRK+IGQRGLKKNPGC+WIEIKGKVNIFVAGDCSKPQAKKIELLLK+LRSK
Sbjct: 722 AEKWEEVQKLRKRIGQRGLKKNPGCNWIEIKGKVNIFVAGDCSKPQAKKIELLLKRLRSK 781

Query: 781 MKEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 840
           MKEEGYSPKT YALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC
Sbjct: 782 MKEEGYSPKTTYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDC 841

Query: 841 HEMAKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 877
           HEMAKFMSK+ +REI+LRDS+RFHHFKDG CSCRG+W
Sbjct: 842 HEMAKFMSKSVSREIILRDSSRFHHFKDGSCSCRGFW 871

BLAST of MC03g0739 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 510/811 (62.89%), Positives = 655/811 (80.76%), Query Frame = 0

Query: 69  DNSARIVEFCEVGDLKNAMELLCSSGNANLDLETYCSVLQLCAERKSIRYGKRVHSIIES 128
           D + ++  FCE G+L+NA++LLC SG  ++D  T CSVLQLCA+ KS++ GK V + I  
Sbjct: 63  DANTQLRRFCESGNLENAVKLLCVSGKWDIDPRTLCSVLQLCADSKSLKDGKEVDNFIRG 122

Query: 129 NGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVESVNL 188
           NG V+D  LG+KL  MY  CGDLKEA  +FD++  +K   WN++++E A +G++  S+ L
Sbjct: 123 NGFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGL 182

Query: 189 FKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLISFYFV 248
           FK+MM  G++ +SYTFS V K  +++  V  G  +HGFI K GF   N+V NSL++FY  
Sbjct: 183 FKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLK 242

Query: 249 SKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLATMVNV 308
           +++V SA+K+FDE+++RDVISWNS+I+GYV NGL +KG+ +F++ML   +++DLAT+V+V
Sbjct: 243 NQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSV 302

Query: 309 LVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEKMDEK 368
              CA++  + LG+A+HS  +KA  S  RE  F NTLLDMYSKCGDL+SA  VF +M ++
Sbjct: 303 FAGCADSRLISLGRAVHSIGVKACFS--REDRFCNTLLDMYSKCGDLDSAKAVFREMSDR 362

Query: 369 TVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSGKIVH 428
           +VVS+TSMIAGY REGL+  A++LF+EM+  G+ PDVY VT++L+ CA    L  GK VH
Sbjct: 363 SVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVH 422

Query: 429 NYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKNHLPN 488
            +I++N+L  + FVSNALMDMYAKCGSM++A+ VFS M+ KD+ISWNT+IGGYSKN   N
Sbjct: 423 EWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYAN 482

Query: 489 EALNLFAEMQRESK--PDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNAL 548
           EAL+LF  +  E +  PD  TVAC+LPACASL+A D+GREIHGY +RNGY  D++V N+L
Sbjct: 483 EALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSL 542

Query: 549 VDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVEPDE 608
           VDMY KCG L+LA  LFD I  KDLVSWTVMIAGYGMHGFG EA+ +FNQMR +G+E DE
Sbjct: 543 VDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADE 602

Query: 609 VSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHKFIK 668
           +SF+S+LYACSHSGL+DEGW+FFNIM+ EC+IEP +EHYAC+VD+LARTG+L+KA++FI+
Sbjct: 603 ISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIE 662

Query: 669 TMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEKWEE 728
            MPI PDATIWGALLCGCRIHHDVK+AEKVAE++FELEPENTGYYVL+ANIYAEAEKWE+
Sbjct: 663 NMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQ 722

Query: 729 VQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGY 788
           V++LRK+IGQRGL+KNPGCSWIEIKG+VNIFVAGD S P+ + IE  L+K+R++M EEGY
Sbjct: 723 VKRLRKRIGQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGY 782

Query: 789 SPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKF 848
           SP T+YAL++A+E EKE ALCGHSEKLAMA G+++   GK IRVTKNLRVCGDCHEMAKF
Sbjct: 783 SPLTKYALIDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKF 842

Query: 849 MSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 878
           MSK T REIVLRDSNRFH FKDG+CSCRG+W
Sbjct: 843 MSKLTRREIVLRDSNRFHQFKDGHCSCRGFW 871

BLAST of MC03g0739 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 626.3 bits (1614), Expect = 3.7e-179
Identity = 321/807 (39.78%), Positives = 497/807 (61.59%), Query Frame = 0

Query: 70  NSARIVEFCEVGDLKNAMELLCSSGNAN--LDLETYCSVLQLCAERKSIRYGKRVHSIIE 129
           +++++   C  G L+ AM+LL S       +D + + ++++LC  +++   G +V+SI  
Sbjct: 62  SNSQLHGLCANGKLEEAMKLLNSMQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIAL 121

Query: 130 SNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVESVN 189
           S+   +   LG   + M+V+ G+L +A  +F K+SE+ +F WN+++  YA  G + E++ 
Sbjct: 122 SSMSSLGVELGNAFLAMFVRFGNLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMC 181

Query: 190 LFKRMMEL-GIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLISFY 249
           L+ RM+ + G+KP+ YTF  VL+    +  + +G+ VH  + + G+     VVN+LI+ Y
Sbjct: 182 LYHRMLWVGGVKPDVYTFPCVLRTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMY 241

Query: 250 FVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLATMV 309
                V+SA+ LFD +  RD+ISWN+MISGY +NG+  +G+E+F  M   SVD DL T+ 
Sbjct: 242 VKCGDVKSARLLFDRMPRRDIISWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLT 301

Query: 310 NVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEKMD 369
           +V+ AC   G   LG+ +H+Y I    ++D  V   N+L  MY   G    A ++F +M+
Sbjct: 302 SVISACELLGDRRLGRDIHAYVITTGFAVDISVC--NSLTQMYLNAGSWREAEKLFSRME 361

Query: 370 EKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSGKI 429
            K +VSWT+MI+GY    L D AI+ +  M    V PD   V ++L ACA  G+L +G  
Sbjct: 362 RKDIVSWTTMISGYEYNFLPDKAIDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVE 421

Query: 430 VHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKNHL 489
           +H    K  L +   V+N L++MY+KC  +  A  +F ++  K+VISW ++I G   N+ 
Sbjct: 422 LHKLAIKARLISYVIVANNLINMYSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNR 481

Query: 490 PNEALNLFAEMQRESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNAL 549
             EAL    +M+   +P+  T+   L ACA + AL  G+EIH + LR G   D ++ NAL
Sbjct: 482 CFEALIFLRQMKMTLQPNAITLTAALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNAL 541

Query: 550 VDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVEPDE 609
           +DMYV+CG +  A S F+    KD+ SW +++ GY   G G+  V++F++M  S V PDE
Sbjct: 542 LDMYVRCGRMNTAWSQFNS-QKKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDE 601

Query: 610 VSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHKFIK 669
           ++FIS+L  CS S ++ +G  +F+ M ++  + P L+HYAC+VDLL R G L +AHKFI+
Sbjct: 602 ITFISLLCGCSKSQMVRQGLMYFSKM-EDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQ 661

Query: 670 TMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEKWEE 729
            MP+ PD  +WGALL  CRIHH + + E  A+ IFEL+ ++ GYY+LL N+YA+  KW E
Sbjct: 662 KMPVTPDPAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWRE 721

Query: 730 VQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGY 789
           V K+R+ + + GL  + GCSW+E+KGKV+ F++ D   PQ K+I  +L+    KM E G 
Sbjct: 722 VAKVRRMMKENGLTVDAGCSWVEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVGL 781

Query: 790 SPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKF 849
           +  +  + ++  E  ++   CGHSE+ A+AFG++N  PG  I VTKNL +C +CH+  KF
Sbjct: 782 TKISESSSMDETEISRDEIFCGHSERKAIAFGLINTVPGMPIWVTKNLSMCENCHDTVKF 841

Query: 850 MSKNTTREIVLRDSNRFHHFKDGYCSC 874
           +SK   REI +RD+  FHHFKDG CSC
Sbjct: 842 ISKTVRREISVRDAEHFHHFKDGECSC 864

BLAST of MC03g0739 vs. TAIR 10
Match: AT3G03580.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 605.9 bits (1561), Expect = 5.1e-173
Identity = 339/853 (39.74%), Positives = 502/853 (58.85%), Query Frame = 0

Query: 29  SHSFVFAKPKSKFSFSNSAYACTQIYPSPSQTKSYLDIELDNSARIVEFCEVGDLKNAME 88
           S  F   K   K+S      +   ++   S  K   ++ L NS  I  F + G    A+E
Sbjct: 37  SSDFFSGKLIDKYSHFREPASSLSVFRRVSPAK---NVYLWNSI-IRAFSKNGLFPEALE 96

Query: 89  LL--CSSGNANLDLETYCSVLQLCAERKSIRYGKRVHSIIESNGVVMDGILGAKLVFMYV 148
                     + D  T+ SV++ CA       G  V+  I   G   D  +G  LV MY 
Sbjct: 97  FYGKLRESKVSPDKYTFPSVIKACAGLFDAEMGDLVYEQILDMGFESDLFVGNALVDMYS 156

Query: 149 KCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVESVNLFKRMMELGIKPNSYTFSS 208
           + G L  AR +FD++  + +  WN +IS Y+ +G Y E++ ++  +    I P+S+T SS
Sbjct: 157 RMGLLTRARQVFDEMPVRDLVSWNSLISGYSSHGYYEEALEIYHELKNSWIVPDSFTVSS 216

Query: 209 VLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLISFYFVSKKVRSAQKLFDELSDRD 268
           VL     +  V+QG+ +HGF  K G +S   V N L++ Y   ++   A+++FDE+  RD
Sbjct: 217 VLPAFGNLLVVKQGQGLHGFALKSGVNSVVVVNNGLVAMYLKFRRPTDARRVFDEMDVRD 276

Query: 269 VISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLATMVNVLVACANTGTLLLGKALHS 328
            +S+N+MI GY+K  + ++ + +F++ LD     DL T+ +VL AC +   L L K +++
Sbjct: 277 SVSYNTMICGYLKLEMVEESVRMFLENLD-QFKPDLLTVSSVLRACGHLRDLSLAKYIYN 336

Query: 329 YAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLS 388
           Y +KA   L+  V  +N L+D+Y+KCGD+ +A  VF  M+ K  VSW S+I+GY++ G  
Sbjct: 337 YMLKAGFVLESTV--RNILIDVYAKCGDMITARDVFNSMECKDTVSWNSIISGYIQSGDL 396

Query: 389 DGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSGKIVHNYIRKNNLETNSFVSNAL 448
             A++LF  M       D      ++       +L  GK +H+   K+ +  +  VSNAL
Sbjct: 397 MEAMKLFKMMMIMEEQADHITYLMLISVSTRLADLKFGKGLHSNGIKSGICIDLSVSNAL 456

Query: 449 MDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKNHLPNEALNLFAEMQR-ESKPDG 508
           +DMYAKCG + D+  +FS M   D ++WNT+I    +       L +  +M++ E  PD 
Sbjct: 457 IDMYAKCGEVGDSLKIFSSMGTGDTVTWNTVISACVRFGDFATGLQVTTQMRKSEVVPDM 516

Query: 509 TTVACILPACASLAALDRGREIHGYALRNGYSKDKYVVNALVDMYVKCGLLVLARSLFDM 568
            T    LP CASLAA   G+EIH   LR GY  +  + NAL++MY KCG L  +  +F+ 
Sbjct: 517 ATFLVTLPMCASLAAKRLGKEIHCCLLRFGYESELQIGNALIEMYSKCGCLENSSRVFER 576

Query: 569 ILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVEPDEVSFISILYACSHSGLLDEG 628
           +  +D+V+WT MI  YGM+G G +A++ F  M  SG+ PD V FI+I+YACSHSGL+DEG
Sbjct: 577 MSRRDVVTWTGMIYAYGMYGEGEKALETFADMEKSGIVPDSVVFIAIIYACSHSGLVDEG 636

Query: 629 WKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHKFIKTMPIEPDATIWGALLCGCR 688
              F  MK   +I+P +EHYAC+VDLL+R+  + KA +FI+ MPI+PDA+IW ++L  CR
Sbjct: 637 LACFEKMKTHYKIDPMIEHYACVVDLLSRSQKISKAEEFIQAMPIKPDASIWASVLRACR 696

Query: 689 IHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEKWEEVQKLRKKIGQRGLKKNPGC 748
              D++ AE+V+ RI EL P++ GY +L +N YA   KW++V  +RK +  + + KNPG 
Sbjct: 697 TSGDMETAERVSRRIIELNPDDPGYSILASNAYAALRKWDKVSLIRKSLKDKHITKNPGY 756

Query: 749 SWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKEEGYSPKTRYALLN-ADEREKEV 808
           SWIE+   V++F +GD S PQ++ I   L+ L S M +EGY P  R    N  +E EK  
Sbjct: 757 SWIEVGKNVHVFSSGDDSAPQSEAIYKSLEILYSLMAKEGYIPDPREVSQNLEEEEEKRR 816

Query: 809 ALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEMAKFMSKNTTREIVLRDSNRFH 868
            +CGHSE+LA+AFG+LN  PG  ++V KNLRVCGDCHE+ K +SK   REI++RD+NRFH
Sbjct: 817 LICGHSERLAIAFGLLNTEPGTPLQVMKNLRVCGDCHEVTKLISKIVGREILVRDANRFH 876

Query: 869 HFKDGYCSCRGYW 878
            FKDG CSC+  W
Sbjct: 877 LFKDGTCSCKDRW 882

BLAST of MC03g0739 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 605.5 bits (1560), Expect = 6.7e-173
Identity = 334/816 (40.93%), Positives = 480/816 (58.82%), Query Frame = 0

Query: 114 KSIRYGKRVHSIIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKL--SEQKVFLWNL 173
           K+I   K +H  + S G++    L + L+  Y+  G L  A  +  +   S+  V+ WN 
Sbjct: 39  KTISQVKLIHQKLLSFGILTLN-LTSHLISTYISVGCLSHAVSLLRRFPPSDAGVYHWNS 98

Query: 174 MISEYAGNGNYVESVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLG 233
           +I  Y  NG   + + LF  M  L   P++YTF  V K    ++ V  G   H      G
Sbjct: 99  LIRSYGDNGCANKCLYLFGLMHSLSWTPDNYTFPFVFKACGEISSVRCGESAHALSLVTG 158

Query: 234 FSSYNTVVNSLISFYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFI 293
           F S   V N+L++ Y   + +  A+K+FDE+S  DV+SWNS+I  Y K G     +E+F 
Sbjct: 159 FISNVFVGNALVAMYSRCRSLSDARKVFDEMSVWDVVSWNSIIESYAKLGKPKVALEMFS 218

Query: 294 KML-DFSVDVDLATMVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYS 353
           +M  +F    D  T+VNVL  CA+ GT  LGK LH +A+   S + + +   N L+DMY+
Sbjct: 219 RMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLHCFAV--TSEMIQNMFVGNCLVDMYA 278

Query: 354 KCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAIELFDEMK------------- 413
           KCG ++ A  VF  M  K VVSW +M+AGY + G  + A+ LF++M+             
Sbjct: 279 KCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFEDAVRLFEKMQEEKIKMDVVTWSA 338

Query: 414 ----------------------SRGVVPDVYAVTSILHACAINGNLSSGKIVHNY----- 473
                                 S G+ P+   + S+L  CA  G L  GK +H Y     
Sbjct: 339 AISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVLSGCASVGALMHGKEIHCYAIKYP 398

Query: 474 --IRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHM--KGKDVISWNTMIGGYSKNHL 533
             +RKN     + V N L+DMYAKC  +  A+++F  +  K +DV++W  MIGGYS++  
Sbjct: 399 IDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDSLSPKERDVVTWTVMIGGYSQHGD 458

Query: 534 PNEALNLFAEMQRE---SKPDGTTVACILPACASLAALDRGREIHGYALRNGYSK-DKYV 593
            N+AL L +EM  E   ++P+  T++C L ACASLAAL  G++IH YALRN  +    +V
Sbjct: 459 ANKALELLSEMFEEDCQTRPNAFTISCALVACASLAALRIGKQIHAYALRNQQNAVPLFV 518

Query: 594 VNALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGV 653
            N L+DMY KCG +  AR +FD ++ K+ V+WT ++ GYGMHG+G EA+ IF++MR  G 
Sbjct: 519 SNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMTGYGMHGYGEEALGIFDEMRRIGF 578

Query: 654 EPDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAH 713
           + D V+ + +LYACSHSG++D+G ++FN MK    + P  EHYAC+VDLL R G L  A 
Sbjct: 579 KLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVSPGPEHYACLVDLLGRAGRLNAAL 638

Query: 714 KFIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAE 773
           + I+ MP+EP   +W A L  CRIH  V++ E  AE+I EL   + G Y LL+N+YA A 
Sbjct: 639 RLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEKITELASNHDGSYTLLSNLYANAG 698

Query: 774 KWEEVQKLRKKIGQRGLKKNPGCSWIE-IKGKVNIFVAGDCSKPQAKKIELLLKKLRSKM 833
           +W++V ++R  +  +G+KK PGCSW+E IKG    FV GD + P AK+I  +L     ++
Sbjct: 699 RWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFV-GDKTHPHAKEIYQVLLDHMQRI 758

Query: 834 KEEGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCH 878
           K+ GY P+T +AL + D+ EK+  L  HSEKLA+A+G+L  P G  IR+TKNLRVCGDCH
Sbjct: 759 KDIGYVPETGFALHDVDDEEKDDLLFEHSEKLALAYGILTTPQGAAIRITKNLRVCGDCH 818

BLAST of MC03g0739 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 602.1 bits (1551), Expect = 7.4e-172
Identity = 314/814 (38.57%), Positives = 487/814 (59.83%), Query Frame = 0

Query: 71  SARIVEFCEVGDLKNAMEL---LCSSGN---ANLDLETYCSVLQLCAERKSIRYGKRVHS 130
           S+++V+F  V  + N         S  N   AN+       +L+ C+  K +R   ++  
Sbjct: 2   SSQLVQFSTVPQIPNPPSRHRHFLSERNYIPANVYEHPAALLLERCSSLKELR---QILP 61

Query: 131 IIESNGVVMDGILGAKLVFMYVKCGDLKEARMIFDKLSEQKVFLWNLMISEYAGNGNYVE 190
           ++  NG+  +     KLV ++ + G + EA  +F+ +  +   L++ M+  +A   +  +
Sbjct: 62  LVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTMLKGFAKVSDLDK 121

Query: 191 SVNLFKRMMELGIKPNSYTFSSVLKCLAAVARVEQGRLVHGFICKLGFSSYNTVVNSLIS 250
           ++  F RM    ++P  Y F+ +LK     A +  G+ +HG + K GFS     +  L +
Sbjct: 122 ALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSGFSLDLFAMTGLEN 181

Query: 251 FYFVSKKVRSAQKLFDELSDRDVISWNSMISGYVKNGLEDKGIEIFIKMLDFSVDVDLAT 310
            Y   ++V  A+K+FD + +RD++SWN++++GY +NG+    +E+   M + ++     T
Sbjct: 182 MYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVKSMCEENLKPSFIT 241

Query: 311 MVNVLVACANTGTLLLGKALHSYAIKAASSLDREVMFKNTLLDMYSKCGDLNSAIRVFEK 370
           +V+VL A +    + +GK +H YA++  S  D  V     L+DMY+KCG L +A ++F+ 
Sbjct: 242 IVSVLPAVSALRLISVGKEIHGYAMR--SGFDSLVNISTALVDMYAKCGSLETARQLFDG 301

Query: 371 MDEKTVVSWTSMIAGYVREGLSDGAIELFDEMKSRGVVPDVYAVTSILHACAINGNLSSG 430
           M E+ VVSW SMI  YV+      A+ +F +M   GV P   +V   LHACA  G+L  G
Sbjct: 302 MLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERG 361

Query: 431 KIVHNYIRKNNLETNSFVSNALMDMYAKCGSMKDAQSVFSHMKGKDVISWNTMIGGYSKN 490
           + +H    +  L+ N  V N+L+ MY KC  +  A S+F  ++ + ++SWN MI G+++N
Sbjct: 362 RFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQN 421

Query: 491 HLPNEALNLFAEMQ-RESKPDGTTVACILPACASLAALDRGREIHGYALRNGYSKDKYVV 550
             P +ALN F++M+ R  KPD  T   ++ A A L+     + IHG  +R+   K+ +V 
Sbjct: 422 GRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVT 481

Query: 551 NALVDMYVKCGLLVLARSLFDMILDKDLVSWTVMIAGYGMHGFGNEAVDIFNQMRISGVE 610
            ALVDMY KCG +++AR +FDM+ ++ + +W  MI GYG HGFG  A+++F +M+   ++
Sbjct: 482 TALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIK 541

Query: 611 PDEVSFISILYACSHSGLLDEGWKFFNIMKKECRIEPKLEHYACMVDLLARTGNLVKAHK 670
           P+ V+F+S++ ACSHSGL++ G K F +MK+   IE  ++HY  MVDLL R G L +A  
Sbjct: 542 PNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWD 601

Query: 671 FIKTMPIEPDATIWGALLCGCRIHHDVKVAEKVAERIFELEPENTGYYVLLANIYAEAEK 730
           FI  MP++P   ++GA+L  C+IH +V  AEK AER+FEL P++ GY+VLLANIY  A  
Sbjct: 602 FIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASM 661

Query: 731 WEEVQKLRKKIGQRGLKKNPGCSWIEIKGKVNIFVAGDCSKPQAKKIELLLKKLRSKMKE 790
           WE+V ++R  + ++GL+K PGCS +EIK +V+ F +G  + P +KKI   L+KL   +KE
Sbjct: 662 WEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKE 721

Query: 791 EGYSPKTRYALLNADEREKEVALCGHSEKLAMAFGMLNLPPGKTIRVTKNLRVCGDCHEM 850
            GY P T   +L  +   KE  L  HSEKLA++FG+LN   G TI V KNLRVC DCH  
Sbjct: 722 AGYVPDTN-LVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNA 781

Query: 851 AKFMSKNTTREIVLRDSNRFHHFKDGYCSCRGYW 878
            K++S  T REIV+RD  RFHHFK+G CSC  YW
Sbjct: 782 TKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9SN39 | 0.0e+00 | 62.89 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q9M9E2 | 5.1e-178 | 39.78 | Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
Q9M1V3 | 2.5e-172 | 40.82 | Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop...
Q9SS60 | 7.2e-172 | 39.74 | Pentatricopeptide repeat-containing protein At3g03580 OS=Arabidopsis thaliana OX...
Q9LFL5 | 9.4e-172 | 40.93 | Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX...
Match Name | E-value | Identity | Description
XP_022139839.1 | 0.0 | 100.00 | pentatricopeptide repeat-containing protein DOT4, chloroplastic [Momordica chara...
XP_038893908.1 | 0.0 | 88.05 | pentatricopeptide repeat-containing protein DOT4, chloroplastic [Benincasa hispi...
XP_022927496.1 | 0.0 | 86.89 | pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita mosch...
KAG7019566.1 | 0.0 | 86.66 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
KAG6583948.1 | 0.0 | 86.77 | Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
Match Name | E-value | Identity | Description
A0A6J1CE34 | 0.0 | 100.00 | pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Momordica cha...
A0A6J1EHU9 | 0.0 | 86.89 | pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Cucurbita mos...
A0A6J1KNK7 | 0.0 | 86.53 | pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Cucurbita max...
A0A5A7UPC9 | 0.0 | 86.55 | Pentatricopeptide repeat-containing protein DOT4 OS=Cucumis melo var. makuwa OX=...
A0A1S3B857 | 0.0 | 86.43 | pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Cucumis melo ...
Match Name | E-value | Identity | Description
AT4G18750.1 | 0.0e+00 | 62.89 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G15510.1 | 3.7e-179 | 39.78 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G03580.1 | 5.1e-173 | 39.74 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G16860.1 | 6.7e-173 | 40.93 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1 | 7.4e-172 | 38.57 | Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR032867 | DYW domain | PFAM | PF14432 | DYW_deaminase | coord: 743..867, e-value: 9.5E-40, score: 135.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 168..201, e-value: 2.6E-6, score: 25.2; coord: 472..500, e-value: 4.8E-6, score: 24.4; coord: 343..367, e-value: 0.0011, score: 17.0; coord: 268..300, e-value: 1.5E-5, score: 22.9; coord: 572..606, e-value: 1.7E-7, score: 29.0; coord: 371..405, e-value: 1.7E-10, score: 38.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 143..163, e-value: 0.69, score: 10.3; coord: 268..296, e-value: 1.9E-7, score: 30.9; coord: 239..266, e-value: 0.21, score: 11.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 368..416, e-value: 3.7E-14, score: 52.6; coord: 569..616, e-value: 5.4E-9, score: 36.1; coord: 166..212, e-value: 1.4E-10, score: 41.2; coord: 469..516, e-value: 3.6E-9, score: 36.7
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 470..504, score: 10.731171
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 570..604, score: 11.640958
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 165..199, score: 11.213468
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 605..635, score: 8.6266
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 369..403, score: 13.274192
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 266..300, score: 10.742131
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 439..469, score: 8.889672
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 69..223, e-value: 2.8E-24, score: 88.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 594..821, e-value: 1.9E-25, score: 92.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 526..593, e-value: 1.1E-6, score: 30.2
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 323..422, e-value: 6.5E-25, score: 89.5; coord: 423..525, e-value: 2.7E-22, score: 81.0; coord: 224..322, e-value: 3.2E-15, score: 57.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | SUPERFAMILY | 48452 | TPR-like | coord: 141..730
None | No IPR available | PANTHER | PTHR47928:SF149 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN DOT4, CHLOROPLASTIC | coord: 98..193; coord: 187..848
None | No IPR available | PANTHER | PTHR47928 | REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED | coord: 98..193; coord: 187..848

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC03g0739.1 | MC03g0739.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
molecular_function | GO:0008270 | zinc ion binding