CmoCh16G002770 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G002770
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing-like protein
Location: Cmo_Chr16 : 1242417 .. 1245108 (+)
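
The location string gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state explicitly), the genomic span could be computed as below. The function name parse_location is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cmo_Chr16 : 1242417 .. 1245108 (+)")

# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 2692 bp on the plus strand of Cmo_Chr16.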
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, three_prime_UTR
ATGAGGTATCCGTCTCTCTATGTGCATATGCTGTACTGATTATTTCATTTCTATGATGTGAATTTTTGTTTCAATTCAACATGTTCAAGCATGGGTAAACCTTTTGTTGACAAAAAATATATTTTCCTGTCTATTATTCTCAGATTTCTTCCATGTAAATGGAGACGGATTTCCTTGTTTAGGCCTTCATTCCAAGCTTGTTGCCCTTTGTATTCTGCAACCACAACTGCTCCCACTCCCAAGTATTACTTGGATGAAGTTGAAATTGAGAAAAAGGAAATTGATTTCAACCGACTATTCCTTGTCTGCAAAAAAGTACACCTTGCTAAGCGACTTCATGCACTACTTGTGGTGTCTGGAAAGGTTCAGAGCATCTTTCTTTCTGCTAAACTCATCAATCTCTATGCTTTTCTTGGTGATGTATCGTTCGCTCGCCGTACTTTTGACCAAATTCAGGCAAAAGATGTCTACACATGGAATTCTATGATATCTGCTTATGCTCGAATTGGTCACTTCCATGAAGCTGTAGATTGTTTCCATGAATTTATGTCAACTTCTATCCTTCAGCCTGATTATTACACATTTCCTCCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGCTCTAAAATTGGGTTTTGAATGTGACGTATTCATTGCTGCTTCTTTGATTCATTTTTATTCTCGGTTTGGCTTTGTCAATTTAGCTCGTAACTTGTTTGATAGCTTGATGATTCGAGATATCGGTACTTGGAATGCTATGATTTCAGGGTTTTGTCTTAATGGTAAAGTTGTAGAAGCATTGGAAGTCTTTGATGAGATGCGATTCAAGAGTGTAACTATGGATTCTGTAACATTTTCAAGTCTACTTCCTATTTGTGCACAGTTGGACGATATAATAAGCGGTGTCCTAATTCATGTCTATGCCATCAAGCTCGGGTTGGAATTTGACTTGTTTGTCTGTAATGCATTGATAAACATGTATGCCAAATTTGGTGAACTGGGAAGTGCAGAAACCATTTTCAACCAAATCGAAGCGAAGGATATTGTATCGTGGAACTCTTTGATTGCTGCATTCGAGCAGAATAAAGAGCCAGTGGTGGCTCTTGGATTGTACAAAAAGATGCACGCTACTGGGGCGGTACCCGACTTGTTGACACTGGTGAGTTTGGCTTCTGTTGCTGCTGAACTTGGCAATTTCTTAAGTAGTAGATCTATTCATGGATTTGTTACAAGGAAAGGTTGGTTTCTACAAGATGTTGTCATTGGTAATGCAATTATAGACATGTATGCTAAATTGGGGTATATAGATTCAGCACGAAAAGTTTTTGAAGAACTTCCTGTCAAAGATGTGGTCTCATGGAACACTTTGATAACAGGTTATTCTCAAAACGGTTTAGCGAATGAGGCAATCGATGTGTATCATTTGATGAACGATTATAGTGATGCAGTGCCGAACCAGGGCACTTGGGTGAGCATTCTGACAGCATATTCCCAGATAGGAGCATTGAAACAAGGTATGAAAACACATGGTCTGCTGATCAAGAACTTTCTATACTTTGATATCTTTGTGGGTACTTGTCTTATTGATATGTATGGAAAATGTGGAAGATTAGCTGATGCGTTGTCTTTATTTTATGAAATACCTCACAAAAGTTCGGTTTCGTGGAATGCCATCATATCGTGTCATGGCCTCCATGGATACGGTTTAAAAGCTGTCGAGTTATTTAGGGAAATGCAAACTGAAGGAGTGAAGCCTGACCACATTACTTTTGTATCTCTATTATCTGCTTGTAGTCATTCTGGTTTGGTTGATGAGGGTCAGTGGTGCTTCCAATTGATGGGAGAGTTGTATGGTATAAGGCCTAGCTTGAAGCATTATGGCTGCATGGTCGATTTGTTCGGTAGGGCAGGCCATCTCAAAAAAGCTTATGATTTTGTTAAAACTATGCCGATACAACCCGATGCATCCGTGTGGGG
GGCGCTTCTTGGTGCTTGTAGGATACATGAGAATGTAGAGTTGGCTAGAACTGTCTCGGATCACTTGTTGGAGGTTGAGTCGAAAAACGTTGGCTACTATGTTTTGTTGTCGAATATTTATGCGAAACTTGGACAGTGGGACGGAGTTGACGAAGTGCGATCATTAGCTCGAGACAGGGGATTGAGGAAGACTCCTGGTTGGAGCTCAATTGAAATAGACAAGAAAATTGATGTCTTTTACACAGGCAACCGAACACATCCAAGATGTGAGGAGATATATGATGAACTGAGGGATCTAACTGCTAAAATGAAGAGTCTTGGCTATGTTGCAAACTATAACTTTGTATTGCAGGATGTGGAGGATGATGAAAAGGAGAACATTCTTATCAGTCATAGCGAGCGGTTGGCAATGGCATTCGGGATCATCAGCACGCCACCGAAAACAACTCTTCAGATCTTTAAGAACTTACGGGTTTGTGGAGACTGTCATAACGCTACCAAGTTCATATCTAAAATTACTGAAAGAGAGATAATCGTTAGAGATTCAAACCGATTCCATCATTTCAAAGATGGAGTCTGTTCTTGTGGTGATTATTGGTGACATGTTCGGAAAGAACATAAAATAAACATGATTCACTTCATTCGTTTTACGTTTGACAAACATGAATGGACACGTATTGATTGATTAGGAG

mRNA sequence

ATGAGATTTCTTCCATGTAAATGGAGACGGATTTCCTTGTTTAGGCCTTCATTCCAAGCTTGTTGCCCTTTGTATTCTGCAACCACAACTGCTCCCACTCCCAAGTATTACTTGGATGAAGTTGAAATTGAGAAAAAGGAAATTGATTTCAACCGACTATTCCTTGTCTGCAAAAAAGTACACCTTGCTAAGCGACTTCATGCACTACTTGTGGTGTCTGGAAAGGTTCAGAGCATCTTTCTTTCTGCTAAACTCATCAATCTCTATGCTTTTCTTGGTGATGTATCGTTCGCTCGCCGTACTTTTGACCAAATTCAGGCAAAAGATGTCTACACATGGAATTCTATGATATCTGCTTATGCTCGAATTGGTCACTTCCATGAAGCTGTAGATTGTTTCCATGAATTTATGTCAACTTCTATCCTTCAGCCTGATTATTACACATTTCCTCCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGCTCTAAAATTGGGTTTTGAATGTGACGTATTCATTGCTGCTTCTTTGATTCATTTTTATTCTCGGTTTGGCTTTGTCAATTTAGCTCGTAACTTGTTTGATAGCTTGATGATTCGAGATATCGGTACTTGGAATGCTATGATTTCAGGGTTTTGTCTTAATGGTAAAGTTGTAGAAGCATTGGAAGTCTTTGATGAGATGCGATTCAAGAGTGTAACTATGGATTCTGTAACATTTTCAAGTCTACTTCCTATTTGTGCACAGTTGGACGATATAATAAGCGGTGTCCTAATTCATGTCTATGCCATCAAGCTCGGGTTGGAATTTGACTTGTTTGTCTGTAATGCATTGATAAACATGTATGCCAAATTTGGTGAACTGGGAAGTGCAGAAACCATTTTCAACCAAATCGAAGCGAAGGATATTGTATCGTGGAACTCTTTGATTGCTGCATTCGAGCAGAATAAAGAGCCAGTGGTGGCTCTTGGATTGTACAAAAAGATGCACGCTACTGGGGCGGTACCCGACTTGTTGACACTGGTGAGTTTGGCTTCTGTTGCTGCTGAACTTGGCAATTTCTTAAGTAGTAGATCTATTCATGGATTTGTTACAAGGAAAGGTTGGTTTCTACAAGATGTTGTCATTGGTAATGCAATTATAGACATGTATGCTAAATTGGGGTATATAGATTCAGCACGAAAAGTTTTTGAAGAACTTCCTGTCAAAGATGTGGTCTCATGGAACACTTTGATAACAGGTTATTCTCAAAACGGTTTAGCGAATGAGGCAATCGATGTGTATCATTTGATGAACGATTATAGTGATGCAGTGCCGAACCAGGGCACTTGGGTGAGCATTCTGACAGCATATTCCCAGATAGGAGCATTGAAACAAGGTATGAAAACACATGGTCTGCTGATCAAGAACTTTCTATACTTTGATATCTTTGTGGGTACTTGTCTTATTGATATGTATGGAAAATGTGGAAGATTAGCTGATGCGTTGTCTTTATTTTATGAAATACCTCACAAAAGTTCGGTTTCGTGGAATGCCATCATATCGTGTCATGGCCTCCATGGATACGGTTTAAAAGCTGTCGAGTTATTTAGGGAAATGCAAACTGAAGGAGTGAAGCCTGACCACATTACTTTTGTATCTCTATTATCTGCTTGTAGTCATTCTGGTTTGGTTGATGAGGGTCAGTGGTGCTTCCAATTGATGGGAGAGTTGTATGGTATAAGGCCTAGCTTGAAGCATTATGGCTGCATGGTCGATTTGTTCGGTAGGGCAGGCCATCTCAAAAAAGCTTATGATTTTGTTAAAACTATGCCGATACAACCCGATGCATCCGTGTGGGGGGCGCTTCTTGGTGCTTGTAGGATACATGAGAATGTAGAGTTGGCTAGAACTGTCTCGGATCACTTGTTGGAGGTTGAGTCGAAAAACGTTGGCTACTATGTTTTGTTGTCGAATATTTATGCGAAACTTGGACAGTG
GGACGGAGTTGACGAAGTGCGATCATTAGCTCGAGACAGGGGATTGAGGAAGACTCCTGGTTGGAGCTCAATTGAAATAGACAAGAAAATTGATGTCTTTTACACAGGCAACCGAACACATCCAAGATGTGAGGAGATATATGATGAACTGAGGGATCTAACTGCTAAAATGAAGAGTCTTGGCTATGTTGCAAACTATAACTTTGTATTGCAGGATGTGGAGGATGATGAAAAGGAGAACATTCTTATCAGTCATAGCGAGCGGTTGGCAATGGCATTCGGGATCATCAGCACGCCACCGAAAACAACTCTTCAGATCTTTAAGAACTTACGGGTTTGTGGAGACTGTCATAACGCTACCAAGTTCATATCTAAAATTACTGAAAGAGAGATAATCGTTAGAGATTCAAACCGATTCCATCATTTCAAAGATGGAGTCTGTTCTTGTGGTGATTATTGGTGACATGTTCGGAAAGAACATAAAATAAACATGATTCACTTCATTCGTTTTACGTTTGACAAACATGAATGGACACGTATTGATTGATTAGGAG
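
The gene sequence and the mRNA sequence share their first five bases and then diverge, which is what a spliced-out intron near the 5' end would look like. The sketch below shows one way to locate such an intron by matching the mRNA against the gene from both ends. It assumes exactly one intron inside the compared fragment and exact matches in the exons; the fragments are truncated from the sequences above, and the function names are hypothetical.

```python
def common_prefix_len(a, b):
    """Number of leading characters shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

def find_single_intron(gene, mrna):
    """Return (start, end) so that gene[start:end] is the removed intron.

    Assumes a single intron and otherwise identical sequences.
    """
    p = common_prefix_len(gene, mrna)
    s = common_prefix_len(gene[::-1], mrna[::-1])
    # Bases shared by the exon and intron ends can make the two matches
    # overlap; keep the longest prefix and shorten the suffix to fit.
    s = min(s, len(mrna) - p)
    if p + s != len(mrna) or p + s >= len(gene):
        raise ValueError("sequences do not fit a single-intron model")
    return p, len(gene) - s

# 5' fragment of the gene sequence (truncated for illustration).
gene_5p = (
    "ATGAGGTATCCGTCTCTCTATGTGCATATGCTGTACTGATTATTTCATTTCTATGATGTG"
    "AATTTTTGTTTCAATTCAACATGTTCAAGCATGGGTAAACCTTTTGTTGACAAAAAATAT"
    "ATTTTCCTGTCTATTATTCTCAGATTTCTTCCATGTAAATGGAGACGG"
)
# First 30 bases of the mRNA sequence.
mrna_5p = "ATGAGATTTCTTCCATGTAAATGGAGACGG"

start, end = find_single_intron(gene_5p, mrna_5p)
intron = gene_5p[start:end]
print(start, end, intron[:2], intron[-2:])
```

With these fragments the inferred intron begins with GT and ends with AG, the canonical splice-site dinucleotides. The exact boundary is ambiguous by a base or two because the exon and intron ends share bases; the sketch simply takes the longest matching prefix as the first exon.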

Coding sequence (CDS)

ATGAGATTTCTTCCATGTAAATGGAGACGGATTTCCTTGTTTAGGCCTTCATTCCAAGCTTGTTGCCCTTTGTATTCTGCAACCACAACTGCTCCCACTCCCAAGTATTACTTGGATGAAGTTGAAATTGAGAAAAAGGAAATTGATTTCAACCGACTATTCCTTGTCTGCAAAAAAGTACACCTTGCTAAGCGACTTCATGCACTACTTGTGGTGTCTGGAAAGGTTCAGAGCATCTTTCTTTCTGCTAAACTCATCAATCTCTATGCTTTTCTTGGTGATGTATCGTTCGCTCGCCGTACTTTTGACCAAATTCAGGCAAAAGATGTCTACACATGGAATTCTATGATATCTGCTTATGCTCGAATTGGTCACTTCCATGAAGCTGTAGATTGTTTCCATGAATTTATGTCAACTTCTATCCTTCAGCCTGATTATTACACATTTCCTCCTGTTATAAGGGCATGTGGAAATCTAGATGATGGGAAGAAGATACATTGCTTGGCTCTAAAATTGGGTTTTGAATGTGACGTATTCATTGCTGCTTCTTTGATTCATTTTTATTCTCGGTTTGGCTTTGTCAATTTAGCTCGTAACTTGTTTGATAGCTTGATGATTCGAGATATCGGTACTTGGAATGCTATGATTTCAGGGTTTTGTCTTAATGGTAAAGTTGTAGAAGCATTGGAAGTCTTTGATGAGATGCGATTCAAGAGTGTAACTATGGATTCTGTAACATTTTCAAGTCTACTTCCTATTTGTGCACAGTTGGACGATATAATAAGCGGTGTCCTAATTCATGTCTATGCCATCAAGCTCGGGTTGGAATTTGACTTGTTTGTCTGTAATGCATTGATAAACATGTATGCCAAATTTGGTGAACTGGGAAGTGCAGAAACCATTTTCAACCAAATCGAAGCGAAGGATATTGTATCGTGGAACTCTTTGATTGCTGCATTCGAGCAGAATAAAGAGCCAGTGGTGGCTCTTGGATTGTACAAAAAGATGCACGCTACTGGGGCGGTACCCGACTTGTTGACACTGGTGAGTTTGGCTTCTGTTGCTGCTGAACTTGGCAATTTCTTAAGTAGTAGATCTATTCATGGATTTGTTACAAGGAAAGGTTGGTTTCTACAAGATGTTGTCATTGGTAATGCAATTATAGACATGTATGCTAAATTGGGGTATATAGATTCAGCACGAAAAGTTTTTGAAGAACTTCCTGTCAAAGATGTGGTCTCATGGAACACTTTGATAACAGGTTATTCTCAAAACGGTTTAGCGAATGAGGCAATCGATGTGTATCATTTGATGAACGATTATAGTGATGCAGTGCCGAACCAGGGCACTTGGGTGAGCATTCTGACAGCATATTCCCAGATAGGAGCATTGAAACAAGGTATGAAAACACATGGTCTGCTGATCAAGAACTTTCTATACTTTGATATCTTTGTGGGTACTTGTCTTATTGATATGTATGGAAAATGTGGAAGATTAGCTGATGCGTTGTCTTTATTTTATGAAATACCTCACAAAAGTTCGGTTTCGTGGAATGCCATCATATCGTGTCATGGCCTCCATGGATACGGTTTAAAAGCTGTCGAGTTATTTAGGGAAATGCAAACTGAAGGAGTGAAGCCTGACCACATTACTTTTGTATCTCTATTATCTGCTTGTAGTCATTCTGGTTTGGTTGATGAGGGTCAGTGGTGCTTCCAATTGATGGGAGAGTTGTATGGTATAAGGCCTAGCTTGAAGCATTATGGCTGCATGGTCGATTTGTTCGGTAGGGCAGGCCATCTCAAAAAAGCTTATGATTTTGTTAAAACTATGCCGATACAACCCGATGCATCCGTGTGGGGGGCGCTTCTTGGTGCTTGTAGGATACATGAGAATGTAGAGTTGGCTAGAACTGTCTCGGATCACTTGTTGGAGGTTGAGTCGAAAAACGTTGGCTACTATGTTTTGTTGTCGAATATTTATGCGAAACTTGGACAGTG
GGACGGAGTTGACGAAGTGCGATCATTAGCTCGAGACAGGGGATTGAGGAAGACTCCTGGTTGGAGCTCAATTGAAATAGACAAGAAAATTGATGTCTTTTACACAGGCAACCGAACACATCCAAGATGTGAGGAGATATATGATGAACTGAGGGATCTAACTGCTAAAATGAAGAGTCTTGGCTATGTTGCAAACTATAACTTTGTATTGCAGGATGTGGAGGATGATGAAAAGGAGAACATTCTTATCAGTCATAGCGAGCGGTTGGCAATGGCATTCGGGATCATCAGCACGCCACCGAAAACAACTCTTCAGATCTTTAAGAACTTACGGGTTTGTGGAGACTGTCATAACGCTACCAAGTTCATATCTAAAATTACTGAAAGAGAGATAATCGTTAGAGATTCAAACCGATTCCATCATTTCAAAGATGGAGTCTGTTCTTGTGGTGATTATTGGTGA
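
The CDS can be checked against the protein used as the BLAST query by translating it. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the helper name translate and the two short test fragments, copied from the start and end of the CDS above, are illustrative only.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, to_stop=True):
    """Translate a DNA coding sequence codon by codon.

    Trailing bases that do not fill a codon are ignored. When to_stop
    is true, translation ends at the first stop codon.
    """
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)

first_codons = "ATGAGATTTCTTCCATGTAAATGGAGACGG"  # first 30 bases of the CDS
last_codons = "TGTGGTGATTATTGGTGA"              # last 18 bases of the CDS

print(translate(first_codons))
print(translate(last_codons))
```

The two fragments translate to MRFLPCKWRR and CGDYW, which agree with the first and last residues of the Query lines in the alignments on this page.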
BLAST of CmoCh16G002770 vs. Swiss-Prot
Match: PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 1023.8 bits (2646), Expect = 1.0e-297
Identity = 479/778 (61.57%), Positives = 608/778 (78.15%), Query Frame = 1

Query: 44  EKKEID-FNRLFLVCKKVHLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTF 103
           E KEID  + LF  C  +  AK LHA LVVS ++Q++ +SAKL+NLY +LG+V+ AR TF
Sbjct: 50  ESKEIDDVHTLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTF 109

Query: 104 DQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDG 163
           D IQ +DVY WN MIS Y R G+  E + CF  FM +S L PDY TFP V++AC  + DG
Sbjct: 110 DHIQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRTVIDG 169

Query: 164 KKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLN 223
            KIHCLALK GF  DV++AASLIH YSR+  V  AR LFD + +RD+G+WNAMISG+C +
Sbjct: 170 NKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQS 229

Query: 224 GKVVEALEVFDEMRFKSVTMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVC 283
           G   EAL + + +R     MDSVT  SLL  C +  D   GV IH Y+IK GLE +LFV 
Sbjct: 230 GNAKEALTLSNGLR----AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVS 289

Query: 284 NALINMYAKFGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAV 343
           N LI++YA+FG L   + +F+++  +D++SWNS+I A+E N++P+ A+ L+++M  +   
Sbjct: 290 NKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQ 349

Query: 344 PDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARK 403
           PD LTL+SLAS+ ++LG+  + RS+ GF  RKGWFL+D+ IGNA++ MYAKLG +DSAR 
Sbjct: 350 PDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARA 409

Query: 404 VFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIG 463
           VF  LP  DV+SWNT+I+GY+QNG A+EAI++Y++M +  +   NQGTWVS+L A SQ G
Sbjct: 410 VFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAG 469

Query: 464 ALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIIS 523
           AL+QGMK HG L+KN LY D+FV T L DMYGKCGRL DALSLFY+IP  +SV WN +I+
Sbjct: 470 ALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIA 529

Query: 524 CHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIR 583
           CHG HG+G KAV LF+EM  EGVKPDHITFV+LLSACSHSGLVDEGQWCF++M   YGI 
Sbjct: 530 CHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGIT 589

Query: 584 PSLKHYGCMVDLFGRAGHLKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDH 643
           PSLKHYGCMVD++GRAG L+ A  F+K+M +QPDAS+WGALL ACR+H NV+L +  S+H
Sbjct: 590 PSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEH 649

Query: 644 LLEVESKNVGYYVLLSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYT 703
           L EVE ++VGY+VLLSN+YA  G+W+GVDE+RS+A  +GLRKTPGWSS+E+D K++VFYT
Sbjct: 650 LFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYT 709

Query: 704 GNRTHPRCEEIYDELRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGI 763
           GN+THP  EE+Y EL  L AK+K +GYV ++ FVLQDVEDDEKE+IL+SHSERLA+AF +
Sbjct: 710 GNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMSHSERLAIAFAL 769

Query: 764 ISTPPKTTLQIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           I+TP KTT++IFKNLRVCGDCH+ TKFISKITEREIIVRDSNRFHHFK+GVCSCGDYW
Sbjct: 770 IATPAKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823
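
The percentages in each HSP summary line are the identical (or positive-scoring) columns divided by the alignment length. A minimal sketch of recomputing them from the fractions, assuming the summary line keeps the layout shown on this page; the function name hsp_percentages is hypothetical.

```python
import re

def hsp_percentages(stats_line):
    """Recompute identity and positives percentages from their fractions."""
    # The second label is matched loosely to tolerate spelling variants.
    m = re.search(r"Identity = (\d+)/(\d+).*?Pos\w* = (\d+)/(\d+)", stats_line)
    if m is None:
        raise ValueError("no identity/positives fractions found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

# Summary line of the first Swiss-Prot hit (PP348_ARATH).
line = "Identity = 479/778 (61.57%), Positives = 608/778 (78.15%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For the PP348_ARATH hit this reproduces the reported 61.57% identity and 78.15% positives over 778 aligned columns.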

BLAST of CmoCh16G002770 vs. Swiss-Prot
Match: PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN=DYW9 PE=2 SV=1)

HSP 1 Score: 600.1 bits (1546), Expect = 3.6e-170
Identity = 315/764 (41.23%), Positives = 466/764 (60.99%), Query Frame = 1

Query: 61  HLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAY 120
           HLA+  HA +++ G    I L  KL    + LG + +AR  F  +Q  DV+ +N ++  +
Sbjct: 35  HLAQT-HAQIILHGFRNDISLLTKLTQRLSDLGAIYYARDIFLSVQRPDVFLFNVLMRGF 94

Query: 121 ARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDD---GKKIHCLALKLGFECD 180
           +     H ++  F     ++ L+P+  T+   I A     D   G+ IH  A+  G + +
Sbjct: 95  SVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISAASGFRDDRAGRVIHGQAVVDGCDSE 154

Query: 181 VFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRF 240
           + + ++++  Y +F  V  AR +FD +  +D   WN MISG+  N   VE+++VF ++  
Sbjct: 155 LLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNTMISGYRKNEMYVESIQVFRDLIN 214

Query: 241 KSVT-MDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELG 300
           +S T +D+ T   +LP  A+L ++  G+ IH  A K G     +V    I++Y+K G++ 
Sbjct: 215 ESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIK 274

Query: 301 SAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAA 360
               +F +    DIV++N++I  +  N E  ++L L+K++  +GA     TLVSL  V+ 
Sbjct: 275 MGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVS- 334

Query: 361 ELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWN 420
             G+ +   +IHG+  +   FL    +  A+  +Y+KL  I+SARK+F+E P K + SWN
Sbjct: 335 --GHLMLIYAIHGYCLKSN-FLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWN 394

Query: 421 TLITGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIK 480
            +I+GY+QNGL  +AI ++  M   S+  PN  T   IL+A +Q+GAL  G   H L+  
Sbjct: 395 AMISGYTQNGLTEDAISLFREMQK-SEFSPNPVTITCILSACAQLGALSLGKWVHDLVRS 454

Query: 481 NFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVEL 540
                 I+V T LI MY KCG +A+A  LF  +  K+ V+WN +IS +GLHG G +A+ +
Sbjct: 455 TDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNI 514

Query: 541 FREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFG 600
           F EM   G+ P  +TF+ +L ACSH+GLV EG   F  M   YG  PS+KHY CMVD+ G
Sbjct: 515 FYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILG 574

Query: 601 RAGHLKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVL 660
           RAGHL++A  F++ M I+P +SVW  LLGACRIH++  LARTVS+ L E++  NVGY+VL
Sbjct: 575 RAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVL 634

Query: 661 LSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDE 720
           LSNI++    +     VR  A+ R L K PG++ IEI +   VF +G+++HP+ +EIY++
Sbjct: 635 LSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEK 694

Query: 721 LRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKN 780
           L  L  KM+  GY       L DVE++E+E ++  HSERLA+AFG+I+T P T ++I KN
Sbjct: 695 LEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKN 754

Query: 781 LRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           LRVC DCH  TK ISKITER I+VRD+NRFHHFKDGVCSCGDYW
Sbjct: 755 LRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of CmoCh16G002770 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 589.3 bits (1518), Expect = 6.3e-167
Identity = 299/767 (38.98%), Positives = 469/767 (61.15%), Query Frame = 1

Query: 57  CKKVHLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSM 116
           C  +   +++  L+  +G  Q  F   KL++L+   G V  A R F+ I +K    +++M
Sbjct: 47  CSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTM 106

Query: 117 ISAYARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGN---LDDGKKIHCLALKLG 176
           +  +A++    +A+  F   M    ++P  Y F  +++ CG+   L  GK+IH L +K G
Sbjct: 107 LKGFAKVSDLDKALQFFVR-MRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSG 166

Query: 177 FECDVFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFD 236
           F  D+F    L + Y++   VN AR +FD +  RD+ +WN +++G+  NG    ALE+  
Sbjct: 167 FSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVK 226

Query: 237 EMRFKSVTMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFG 296
            M  +++    +T  S+LP  + L  I  G  IH YA++ G +  + +  AL++MYAK G
Sbjct: 227 SMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCG 286

Query: 297 ELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLAS 356
            L +A  +F+ +  +++VSWNS+I A+ QN+ P  A+ +++KM   G  P  ++++    
Sbjct: 287 SLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALH 346

Query: 357 VAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVV 416
             A+LG+    R IH      G   ++V + N++I MY K   +D+A  +F +L  + +V
Sbjct: 347 ACADLGDLERGRFIHKLSVELG-LDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLV 406

Query: 417 SWNTLITGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGL 476
           SWN +I G++QNG   +A++ +  M   +   P+  T+VS++TA +++         HG+
Sbjct: 407 SWNAMILGFAQNGRPIDALNYFSQMRSRT-VKPDTFTYVSVITAIAELSITHHAKWIHGV 466

Query: 477 LIKNFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKA 536
           ++++ L  ++FV T L+DMY KCG +  A  +F  +  +   +WNA+I  +G HG+G  A
Sbjct: 467 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 526

Query: 537 VELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVD 596
           +ELF EMQ   +KP+ +TF+S++SACSHSGLV+ G  CF +M E Y I  S+ HYG MVD
Sbjct: 527 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 586

Query: 597 LFGRAGHLKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGY 656
           L GRAG L +A+DF+  MP++P  +V+GA+LGAC+IH+NV  A   ++ L E+   + GY
Sbjct: 587 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 646

Query: 657 YVLLSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEI 716
           +VLL+NIY     W+ V +VR     +GLRKTPG S +EI  ++  F++G+  HP  ++I
Sbjct: 647 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 706

Query: 717 YDELRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQI 776
           Y  L  L   +K  GYV + N VL  VE+D KE +L +HSE+LA++FG+++T   TT+ +
Sbjct: 707 YAFLEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHV 766

Query: 777 FKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
            KNLRVC DCHNATK+IS +T REI+VRD  RFHHFK+G CSCGDYW
Sbjct: 767 RKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CmoCh16G002770 vs. Swiss-Prot
Match: PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 588.6 bits (1516), Expect = 1.1e-166
Identity = 306/781 (39.18%), Positives = 471/781 (60.31%), Query Frame = 1

Query: 60  VHLAKRLHALLVVSGK-VQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMIS 119
           + L K++HA +   G  V S+ ++  L+NLY   GD     + FD+I  ++  +WNS+IS
Sbjct: 113 MELGKQIHAHVYKFGYGVDSVTVANTLVNLYRKCGDFGAVYKVFDRISERNQVSWNSLIS 172

Query: 120 AYARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDD------GKKIHCLALKL 179
           +      +  A++ F   +  ++ +P  +T   V+ AC NL        GK++H   L+ 
Sbjct: 173 SLCSFEKWEMALEAFRCMLDENV-EPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRK 232

Query: 180 GFECDVFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVF 239
           G E + FI  +L+  Y + G +  ++ L  S   RD+ TWN ++S  C N +++EALE  
Sbjct: 233 G-ELNSFIINTLVAMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYL 292

Query: 240 DEMRFKSVTMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLG-LEFDLFVCNALINMYAK 299
            EM  + V  D  T SS+LP C+ L+ + +G  +H YA+K G L+ + FV +AL++MY  
Sbjct: 293 REMVLEGVEPDEFTISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCN 352

Query: 300 FGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMH-ATGAVPDLLTLVS 359
             ++ S   +F+ +  + I  WN++IA + QN+    AL L+  M  + G + +  T+  
Sbjct: 353 CKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAG 412

Query: 360 LASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVK 419
           +       G F    +IHGFV ++G   +D  + N ++DMY++LG ID A ++F ++  +
Sbjct: 413 VVPACVRSGAFSRKEAIHGFVVKRG-LDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDR 472

Query: 420 DVVSWNTLITGYSQNGLANEAIDVYHLMNDYSDAV----------PNQGTWVSILTAYSQ 479
           D+V+WNT+ITGY  +    +A+ + H M +    V          PN  T ++IL + + 
Sbjct: 473 DLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAA 532

Query: 480 IGALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAI 539
           + AL +G + H   IKN L  D+ VG+ L+DMY KCG L  +  +F +IP K+ ++WN I
Sbjct: 533 LSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVI 592

Query: 540 ISCHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYG 599
           I  +G+HG G +A++L R M  +GVKP+ +TF+S+ +ACSHSG+VDEG   F +M   YG
Sbjct: 593 IMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYG 652

Query: 600 IRPSLKHYGCMVDLFGRAGHLKKAYDFVKTMPIQPD-ASVWGALLGACRIHENVELARTV 659
           + PS  HY C+VDL GRAG +K+AY  +  MP   + A  W +LLGA RIH N+E+    
Sbjct: 653 VEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIA 712

Query: 660 SDHLLEVESKNVGYYVLLSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDV 719
           + +L+++E     +YVLL+NIY+  G WD   EVR   +++G+RK PG S IE   ++  
Sbjct: 713 AQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHK 772

Query: 720 FYTGNRTHPRCEEIYDELRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMA 779
           F  G+ +HP+ E++   L  L  +M+  GYV + + VL +VE+DEKE +L  HSE+LA+A
Sbjct: 773 FVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIA 832

Query: 780 FGIISTPPKTTLQIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDY 821
           FGI++T P T +++ KNLRVC DCH ATKFISKI +REII+RD  RFH FK+G CSCGDY
Sbjct: 833 FGILNTSPGTIIRVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDY 890

BLAST of CmoCh16G002770 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 582.8 bits (1501), Expect = 5.9e-165
Identity = 293/743 (39.43%), Positives = 448/743 (60.30%), Query Frame = 1

Query: 81  LSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTS 140
           L +KL  +Y   GD+  A R FD+++ +    WN +++  A+ G F  ++  F + MS+ 
Sbjct: 131 LGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSG 190

Query: 141 ILQPDYYTFPPVIRACGNLDD---GKKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLA 200
           + + D YTF  V ++  +L     G+++H   LK GF     +  SL+ FY +   V+ A
Sbjct: 191 V-EMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSA 250

Query: 201 RNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFSSLLPICAQL 260
           R +FD +  RD+ +WN++I+G+  NG   + L VF +M    + +D  T  S+   CA  
Sbjct: 251 RKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADS 310

Query: 261 DDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAKDIVSWNSLI 320
             I  G  +H   +K     +   CN L++MY+K G+L SA+ +F ++  + +VS+ S+I
Sbjct: 311 RLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMI 370

Query: 321 AAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWF 380
           A + +      A+ L+++M   G  PD+ T+ ++ +  A        + +H ++      
Sbjct: 371 AGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLG 430

Query: 381 LQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHL 440
             D+ + NA++DMYAK G +  A  VF E+ VKD++SWNT+I GYS+N  ANEA+ +++L
Sbjct: 431 F-DIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 490

Query: 441 MNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCG 500
           + +     P++ T   +L A + + A  +G + HG +++N  + D  V   L+DMY KCG
Sbjct: 491 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 550

Query: 501 RLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLS 560
            L  A  LF +I  K  VSW  +I+ +G+HG+G +A+ LF +M+  G++ D I+FVSLL 
Sbjct: 551 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLY 610

Query: 561 ACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHLKKAYDFVKTMPIQPDA 620
           ACSHSGLVDEG   F +M     I P+++HY C+VD+  R G L KAY F++ MPI PDA
Sbjct: 611 ACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDA 670

Query: 621 SVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIYAKLGQWDGVDEVRSLA 680
           ++WGALL  CRIH +V+LA  V++ + E+E +N GYYVL++NIYA+  +W+ V  +R   
Sbjct: 671 TIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRI 730

Query: 681 RDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLTAKMKSLGYVANYNFVL 740
             RGLRK PG S IEI  ++++F  G+ ++P  E I   LR + A+M   GY     + L
Sbjct: 731 GQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYAL 790

Query: 741 QDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCGDCHNATKFISKITERE 800
            D E+ EKE  L  HSE+LAMA GIIS+     +++ KNLRVCGDCH   KF+SK+T RE
Sbjct: 791 IDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRRE 850

Query: 801 IIVRDSNRFHHFKDGVCSCGDYW 821
           I++RDSNRFH FKDG CSC  +W
Sbjct: 851 IVLRDSNRFHQFKDGHCSCRGFW 871

BLAST of CmoCh16G002770 vs. TrEMBL
Match: A0A0A0L0N9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G107430 PE=4 SV=1)

HSP 1 Score: 1476.1 bits (3820), Expect = 0.0e+00
Identity = 710/820 (86.59%), Positives = 765/820 (93.29%), Query Frame = 1

Query: 1   MRFLPCKWRRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKV 60
           +RFL CKWRR+SLF+PSFQAC  LYSAT     PKY LD VE EK+EIDFNR+FL C KV
Sbjct: 25  LRFLQCKWRRVSLFKPSFQACS-LYSATAA---PKY-LDGVENEKREIDFNRIFLYCTKV 84

Query: 61  HLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAY 120
           HLAK+LHALLVVSGK QSIFLSAKLIN YAFLGD+  AR TFDQIQ KDVYTWNSMISAY
Sbjct: 85  HLAKQLHALLVVSGKTQSIFLSAKLINRYAFLGDIPHARLTFDQIQTKDVYTWNSMISAY 144

Query: 121 ARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFI 180
           ARIGHFH AVDCF+EF+STS LQ D+YTFPPVIRACGNLDDG+K+HCL LKLGFECDV+I
Sbjct: 145 ARIGHFHAAVDCFNEFLSTSFLQSDHYTFPPVIRACGNLDDGRKVHCLVLKLGFECDVYI 204

Query: 181 AASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSV 240
           AAS IHFYSRFGFV+LA NLFD++MIRDIGTWNAMISGF LNGKV EALEVFDEMRFKSV
Sbjct: 205 AASFIHFYSRFGFVSLACNLFDNMMIRDIGTWNAMISGFYLNGKVAEALEVFDEMRFKSV 264

Query: 241 TMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAET 300
           +MDSVT SSLLPIC QLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGEL SAET
Sbjct: 265 SMDSVTISSLLPICVQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAET 324

Query: 301 IFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGN 360
           IFNQ++ +DIVSWNSL+AAFEQNK+PV+ALG+Y KMH+ G VPDLLTLVSLASVAAELGN
Sbjct: 325 IFNQMKVRDIVSWNSLLAAFEQNKKPVIALGVYNKMHSIGVVPDLLTLVSLASVAAELGN 384

Query: 361 FLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLIT 420
           FLSSRSIHGFVTR+ WFL D+ +GNAIIDMYAKLG+IDSARKVFE LPVKDV+SWN+LIT
Sbjct: 385 FLSSRSIHGFVTRRCWFLHDIALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLIT 444

Query: 421 GYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLY 480
           GYSQNGLANEAIDVY  M  YS AVPNQGTWVSILTA+SQ+GALKQGMK HG LIKNFLY
Sbjct: 445 GYSQNGLANEAIDVYSSMRYYSGAVPNQGTWVSILTAHSQLGALKQGMKAHGQLIKNFLY 504

Query: 481 FDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREM 540
           FDIFV TCL+DMYGKCG+LADALSLFYE+PH+SSVSWNAIISCHGLHGYGLKAV+LF+EM
Sbjct: 505 FDIFVSTCLVDMYGKCGKLADALSLFYEVPHQSSVSWNAIISCHGLHGYGLKAVKLFKEM 564

Query: 541 QTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGH 600
           Q+EGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E YGIRPSLKHYGCMVDLFGRAGH
Sbjct: 565 QSEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMQETYGIRPSLKHYGCMVDLFGRAGH 624

Query: 601 LKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNI 660
           L+KA++FVK MP++PD SVWGALLGACRIHENVEL RTVSDHLL+VES+NVGYYVLLSNI
Sbjct: 625 LEKAFNFVKNMPVRPDVSVWGALLGACRIHENVELVRTVSDHLLKVESENVGYYVLLSNI 684

Query: 661 YAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDL 720
           YAKLG W+GVDEVRSLARDRGL+KTPGWSSIE+DKKIDVFYTGN+THP+CEEIY ELR+L
Sbjct: 685 YAKLGHWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTGNQTHPKCEEIYSELRNL 744

Query: 721 TAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVC 780
           TAKMKS+GYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPPKTTLQIFKNLRVC
Sbjct: 745 TAKMKSIGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPKTTLQIFKNLRVC 804

Query: 781 GDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           GDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
Sbjct: 805 GDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 839

BLAST of CmoCh16G002770 vs. TrEMBL
Match: A0A061DZS3_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TCM_007072 PE=4 SV=1)

HSP 1 Score: 1199.9 bits (3103), Expect = 0.0e+00
Identity = 559/812 (68.84%), Positives = 674/812 (83.00%), Query Frame = 1

Query: 9   RRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVHLAKRLHA 68
           R IS   P  Q  CPL+SA   A + +   +  E   K IDFN LF  C ++HLAKRLHA
Sbjct: 11  RHISKIFPLLQVRCPLFSAA--ANSLQGTSNGCEDNDKSIDFNHLFKSCTQLHLAKRLHA 70

Query: 69  LLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHE 128
           L++VSGK QSIF+SAKL+NLYA+L DVSF+RRTFDQI  KDVYTWNSM+SAY R G F E
Sbjct: 71  LVLVSGKAQSIFISAKLVNLYAYLCDVSFSRRTFDQINEKDVYTWNSMVSAYVRSGRFQE 130

Query: 129 AVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFY 188
           AVDCF++F STS L+PD+YTFPPV++AC NL DG ++HCL LKLGFE DVF+ ASL+H Y
Sbjct: 131 AVDCFYQFFSTSGLRPDFYTFPPVLKACKNLPDGMRMHCLVLKLGFEWDVFVTASLVHMY 190

Query: 189 SRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFS 248
           +RF  V  AR LFD + +RD+G+WNAMISG+C NG   EALEV +EMR + V MD VT +
Sbjct: 191 TRFRIVGSARKLFDDMPVRDMGSWNAMISGYCQNGNAAEALEVLNEMRLERVMMDPVTIA 250

Query: 249 SLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAK 308
           S+LPICAQLDDI+ G LIH+YAIK GLEFDLFV NALINMYAKFG+L  A+ +F+ +  +
Sbjct: 251 SILPICAQLDDILYGRLIHLYAIKSGLEFDLFVSNALINMYAKFGKLEHAQKVFDHMVVR 310

Query: 309 DIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIH 368
           D+VSWNS+IAA+EQN +P +ALGL+  M   G  PD LTLVSL+S+ A+L +    +S+H
Sbjct: 311 DLVSWNSIIAAYEQNDDPHMALGLFYNMKLIGINPDYLTLVSLSSIVAQLSDSRKGKSVH 370

Query: 369 GFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLA 428
           GFV R+GWFL+DV+ GN+++DMYAKLG +DSA  VF  LPVKDVVSWNTLITGY+QNGLA
Sbjct: 371 GFVMRRGWFLKDVISGNSVVDMYAKLGIMDSAHAVFYVLPVKDVVSWNTLITGYAQNGLA 430

Query: 429 NEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTC 488
            EAI+ Y +M +  +  PNQ TWVSIL AYS +GAL+QGM+ HG LIKN  Y DIFVGTC
Sbjct: 431 GEAIEAYGMMQECKEITPNQATWVSILPAYSNVGALQQGMRVHGRLIKNSFYLDIFVGTC 490

Query: 489 LIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPD 548
           LIDMYGKCG+L DA+SLF+E+P  +SV WNAIISCHG+HG+  KA++LFREM+ EGVKPD
Sbjct: 491 LIDMYGKCGKLDDAMSLFFEVPKMTSVPWNAIISCHGIHGHAEKALKLFREMREEGVKPD 550

Query: 549 HITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHLKKAYDFV 608
           H+TFVSLLSACSHSGLVDEGQWCF +M E YGI P LKHYGCMVDLFGRAGHL+ AY+F+
Sbjct: 551 HVTFVSLLSACSHSGLVDEGQWCFHVMQEEYGIEPILKHYGCMVDLFGRAGHLEMAYNFI 610

Query: 609 KTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIYAKLGQWD 668
           K +P++PDASVWGALLGACRIH N++L    SD L EV+S NVGYYVLLSNIYA +G+W+
Sbjct: 611 KNLPVKPDASVWGALLGACRIHGNIDLGTFASDRLFEVDSDNVGYYVLLSNIYANIGKWE 670

Query: 669 GVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLTAKMKSLG 728
           GVD+VR++ARD+GLRKTPGWSSIE+  K+DVFYTGNR+HP+CEEI+ ELR LTAKMKSLG
Sbjct: 671 GVDKVRAVARDKGLRKTPGWSSIEVSNKVDVFYTGNRSHPKCEEIFKELRSLTAKMKSLG 730

Query: 729 YVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCGDCHNATK 788
           YV +Y+FVLQDVE+DEKE+IL+SHSERLA+A+GIIS+PPK+ ++IFKNLRVCGDCHNATK
Sbjct: 731 YVPDYSFVLQDVEEDEKEHILMSHSERLAIAYGIISSPPKSPIRIFKNLRVCGDCHNATK 790

Query: 789 FISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           FIS+IT+REIIVRDSNRFHHFKDG+CSCGDYW
Sbjct: 791 FISQITDREIIVRDSNRFHHFKDGICSCGDYW 820

BLAST of CmoCh16G002770 vs. TrEMBL
Match: F6HBK0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0088g01130 PE=4 SV=1)

HSP 1 Score: 1178.7 bits (3048), Expect = 0.0e+00
Identity = 550/810 (67.90%), Positives = 673/810 (83.09%), Query Frame = 1

Query: 11  ISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVHLAKRLHALL 70
           IS F P  +    L+SA T++P    Y   +E + +EIDFN LF  C K  LAKRLHALL
Sbjct: 16  ISKFLPLLRRHYQLFSAATSSPHFSSY--GLENQNEEIDFNSLFDSCTKTLLAKRLHALL 75

Query: 71  VVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHEAV 130
           VVSGK+QS F+S +L+NLYA LGDVS +R TFDQIQ KDVYTWNSMISAY R GHF EA+
Sbjct: 76  VVSGKIQSNFISIRLVNLYASLGDVSLSRGTFDQIQRKDVYTWNSMISAYVRNGHFREAI 135

Query: 131 DCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFYSR 190
           DCF++ +  +  Q D+YTFPPV++AC  L DG+KIHC   KLGF+ DVF+AASLIH YSR
Sbjct: 136 DCFYQLLLVTKFQADFYTFPPVLKACQTLVDGRKIHCWVFKLGFQWDVFVAASLIHMYSR 195

Query: 191 FGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFSSL 250
           FGFV +AR+LFD +  RD+G+WNAMISG   NG   +AL+V DEMR + + MDSVT +S+
Sbjct: 196 FGFVGIARSLFDDMPFRDMGSWNAMISGLIQNGNAAQALDVLDEMRLEGINMDSVTVASI 255

Query: 251 LPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAKDI 310
           LP+CAQL DI +  LIH+Y IK GLEF+LFV NALINMYAKFG LG A+ +F Q+  +D+
Sbjct: 256 LPVCAQLGDISTATLIHLYVIKHGLEFELFVSNALINMYAKFGNLGDAQKVFQQMFLRDV 315

Query: 311 VSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIHGF 370
           VSWNS+IAA+EQN +PV A G + KM   G  PDLLTLVSLAS+AA+  ++ +SRS+HGF
Sbjct: 316 VSWNSIIAAYEQNDDPVTARGFFFKMQLNGLEPDLLTLVSLASIAAQSRDYKNSRSVHGF 375

Query: 371 VTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLANE 430
           + R+GW ++ VVIGNA++DMYAKLG IDSA KVF  +PVKDVVSWNTLI+GY+QNGLA+E
Sbjct: 376 IMRRGWLMEAVVIGNAVMDMYAKLGVIDSAHKVFNLIPVKDVVSWNTLISGYTQNGLASE 435

Query: 431 AIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTCLI 490
           AI+VY +M +  +   NQGTWVSIL AY+ +GAL+QGM+ HG LIK  L+ D+FVGTCLI
Sbjct: 436 AIEVYRMMEECREIKLNQGTWVSILAAYAHVGALQQGMRIHGHLIKTNLHLDVFVGTCLI 495

Query: 491 DMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPDHI 550
           D+YGKCGRL DA+ LFY++P +SSV WNAIISCHG+HG+G KA++LFREMQ EGVKPDH+
Sbjct: 496 DLYGKCGRLVDAMCLFYQVPRESSVPWNAIISCHGIHGHGEKALKLFREMQDEGVKPDHV 555

Query: 551 TFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHLKKAYDFVKT 610
           TF+SLLSACSHSGLVDEG+W F LM E YGI+PSLKHYGCMVDL GRAG L+ AYDF+K 
Sbjct: 556 TFISLLSACSHSGLVDEGKWFFHLMQE-YGIKPSLKHYGCMVDLLGRAGFLEMAYDFIKD 615

Query: 611 MPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIYAKLGQWDGV 670
           MP+ PDAS+WGALLGACRIH N+EL +  SD L EV+S+NVGYYVLLSNIYA +G+W+GV
Sbjct: 616 MPLHPDASIWGALLGACRIHGNIELGKFASDRLFEVDSENVGYYVLLSNIYANVGKWEGV 675

Query: 671 DEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLTAKMKSLGYV 730
           D+VRSLAR+RGL+KTPGWSSIE+++++D+FYTGN++HP+C+EIY ELR LTAKMKSLGY+
Sbjct: 676 DKVRSLARERGLKKTPGWSSIEVNRRVDIFYTGNQSHPKCKEIYAELRILTAKMKSLGYI 735

Query: 731 ANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCGDCHNATKFI 790
            +Y+FVLQDVE+DEKE+IL SHSERLA+AFGIISTPPK+ ++IFKNLRVCGDCHNATKFI
Sbjct: 736 PDYSFVLQDVEEDEKEHILTSHSERLAIAFGIISTPPKSAIRIFKNLRVCGDCHNATKFI 795

Query: 791 SKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           S+ITEREI+VRDS RFHHFK+G+CSCGDYW
Sbjct: 796 SRITEREIVVRDSKRFHHFKNGICSCGDYW 822

BLAST of CmoCh16G002770 vs. TrEMBL
Match: A0A0D2RKK6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_005G237500 PE=4 SV=1)

HSP 1 Score: 1172.9 bits (3033), Expect = 0.0e+00
Identity = 551/812 (67.86%), Positives = 670/812 (82.51%), Query Frame = 1

Query: 9   RRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVHLAKRLHA 68
           R IS   P FQA   L+S +  A          E   K IDF+ LF  C ++HLAK LHA
Sbjct: 11  RHISESLPLFQARRTLFSTSVNA----LQRTSDEDGDKRIDFDHLFKSCNRLHLAKLLHA 70

Query: 69  LLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHE 128
           L+VV+GK +SIF SAKL+N+YA+LGDVSF+RRTFDQI  KDVYTWNSM+SAY R GHF E
Sbjct: 71  LVVVAGKARSIFFSAKLVNVYAYLGDVSFSRRTFDQIPNKDVYTWNSMVSAYVRTGHFRE 130

Query: 129 AVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFY 188
           AVDCF++F  TS L+PD+YTF PV++AC N  DG +IHCL LKLGFE DVF+ ASL+H Y
Sbjct: 131 AVDCFYQFFLTSGLRPDFYTFAPVLKACKNPLDGMRIHCLVLKLGFEWDVFVTASLVHMY 190

Query: 189 SRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFS 248
           +RF  +  AR LFD + +RD+G+WNAMISG+C N    EAL+V +EMR + V MD VT  
Sbjct: 191 TRFRALGNARKLFDDMPVRDMGSWNAMISGYCQNSNAAEALDVLNEMRSEGVLMDPVTIV 250

Query: 249 SLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAK 308
           S+LPICAQLDDI++G+ IHVY+IK GLE+DLFV NALINMYAKFGEL +A+ + + +  +
Sbjct: 251 SILPICAQLDDILNGMSIHVYSIKRGLEYDLFVSNALINMYAKFGELANAQKVLDNMVVR 310

Query: 309 DIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIH 368
           D+VSWNS+IAA+EQN +P  AL L+  M  TG  PD LTLVS+ S+ A+LG+  + +S+H
Sbjct: 311 DVVSWNSIIAAYEQNDDPNRALALFYDMQLTGISPDYLTLVSVTSIVAQLGDSWNGKSVH 370

Query: 369 GFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLA 428
           GFV R+GW L+DV+ GN+++DMY+KLG + SAR VFE LPVKDVVSWNTLITGY+QNGLA
Sbjct: 371 GFVMRRGWILKDVISGNSVVDMYSKLGDMSSARAVFESLPVKDVVSWNTLITGYTQNGLA 430

Query: 429 NEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTC 488
           +EAI+V+ +M    + VPNQ TWVSIL AYS IGAL+QGM+ HGLL+K+ LY DIFVGTC
Sbjct: 431 SEAIEVFDMMQ--KEIVPNQATWVSILPAYSNIGALRQGMRVHGLLVKSSLYLDIFVGTC 490

Query: 489 LIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPD 548
           LIDMYGKCG+L DA+SLFYE+P  +SV WNAIISCHG+HG+  KA++LFREM+ E VKPD
Sbjct: 491 LIDMYGKCGKLDDAMSLFYEVPKMTSVPWNAIISCHGIHGHAEKALKLFREMREERVKPD 550

Query: 549 HITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHLKKAYDFV 608
           H+TFVSLLSACSHSGLV+EGQWCF +M E YGI P LKHYGCMVD+FGRAGHL+KAY+F+
Sbjct: 551 HVTFVSLLSACSHSGLVEEGQWCFNVMREEYGIEPILKHYGCMVDMFGRAGHLEKAYNFI 610

Query: 609 KTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIYAKLGQWD 668
           K MP++PDASVWGALLGACRIH N++L    S+ L EV+S+NVGYYVL+SNIYA +G+W+
Sbjct: 611 KDMPVKPDASVWGALLGACRIHGNIDLGAFASERLFEVDSENVGYYVLMSNIYANIGKWE 670

Query: 669 GVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLTAKMKSLG 728
           GVD+VR+LARD GLRKTPGWSSIE + K+DVFYTGN++HP+CEEIY ELR+L AKMKSLG
Sbjct: 671 GVDKVRTLARDMGLRKTPGWSSIEANNKVDVFYTGNQSHPKCEEIYKELRNLNAKMKSLG 730

Query: 729 YVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCGDCHNATK 788
           +V +Y+FVLQDVE+DEKE+IL+SHSERLA+AFGIISTPPKT ++IFKNLRVCGDCHNATK
Sbjct: 731 HVPDYSFVLQDVEEDEKEHILMSHSERLAIAFGIISTPPKTPIRIFKNLRVCGDCHNATK 790

Query: 789 FISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           +ISKITEREIIVRDSNRFHHFKDGVCSC DYW
Sbjct: 791 YISKITEREIIVRDSNRFHHFKDGVCSCRDYW 816

BLAST of CmoCh16G002770 vs. TrEMBL
Match: A0A067F8D6_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003439mg PE=4 SV=1)

HSP 1 Score: 1158.7 bits (2996), Expect = 0.0e+00
Identity = 539/815 (66.13%), Positives = 670/815 (82.21%), Query Frame = 1

Query: 6   CKWRRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVHLAKR 65
           CK RR+    P  QA  PL+SA   A + +   D +E E +EIDF+ LF  C K+H  KR
Sbjct: 8   CKDRRLCKLLPLLQAHRPLFSAA--ANSLQISPDCLENESREIDFDDLFQSCTKLHHVKR 67

Query: 66  LHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGH 125
           LHALLVVSGK++++F S KL+N YA LGD+SF+R TFD I  ++VYTWNSMIS Y R G 
Sbjct: 68  LHALLVVSGKIKTVFSSTKLVNFYANLGDLSFSRHTFDHISYRNVYTWNSMISVYVRCGR 127

Query: 126 FHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLI 185
             EAVDCF++F  TS L+PD+YTFPPV++AC NL DGKKIHC  LKLGFE DVF+AASL+
Sbjct: 128 LSEAVDCFYQFTLTSGLRPDFYTFPPVLKACRNLVDGKKIHCSVLKLGFEWDVFVAASLL 187

Query: 186 HFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSV 245
           H Y RFG  N+AR LFD + +RD G+WNAMISG+C +G  VEAL++ DEMR + V+MD +
Sbjct: 188 HMYCRFGLANVARKLFDDMPVRDSGSWNAMISGYCQSGNAVEALDILDEMRLEGVSMDPI 247

Query: 246 TFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQI 305
           T +S+LP+CA+ D+I+SG+LIH+Y +K GLEF+LFV N LINMYAKFG +  A  +F+Q+
Sbjct: 248 TVASILPVCARSDNILSGLLIHLYIVKHGLEFNLFVSNNLINMYAKFGMMRHALRVFDQM 307

Query: 306 EAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSR 365
             +D+VSWNS+IAA+EQ+ +P+ A G +  M   G  PDLLTLVSL S+ A+L +  +SR
Sbjct: 308 MERDVVSWNSIIAAYEQSNDPITAHGFFTTMQQAGIQPDLLTLVSLTSIVAQLNDCRNSR 367

Query: 366 SIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQN 425
           S+HGF+ R+GWF++DV+IGNA++DMYAKLG I+SA  VFE LPVKDV+SWNTLITGY+QN
Sbjct: 368 SVHGFIMRRGWFMEDVIIGNAVVDMYAKLGIINSACAVFEGLPVKDVISWNTLITGYAQN 427

Query: 426 GLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFV 485
           GLA+EAI+V+ +M + ++  PNQGT+VSIL AYS +GAL+QG+K H  +IKN L FD+FV
Sbjct: 428 GLASEAIEVFQMMEECNEINPNQGTYVSILPAYSHVGALRQGIKIHARVIKNCLCFDVFV 487

Query: 486 GTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGV 545
            TCL+DMYGKCGR+ DA+SLFY++P  SSV WNAIISCHG+HG G KA+  FR+M  EGV
Sbjct: 488 ATCLVDMYGKCGRIDDAMSLFYQVPRSSSVPWNAIISCHGIHGQGDKALNFFRQMLDEGV 547

Query: 546 KPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHLKKAY 605
           +PDHITFVSLL+ACSHSGLV EGQ  F +M E +GI+P LKHYGCMVDLFGRAGHL  A+
Sbjct: 548 RPDHITFVSLLTACSHSGLVSEGQRYFHMMQEEFGIKPHLKHYGCMVDLFGRAGHLGMAH 607

Query: 606 DFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIYAKLG 665
           +F++ MP++PDAS+WGALLGACRIH N+EL    SD L EV+S+NVGYYVL+SNIYA +G
Sbjct: 608 NFIQNMPVRPDASIWGALLGACRIHGNMELGAVASDRLFEVDSENVGYYVLMSNIYANVG 667

Query: 666 QWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLTAKMK 725
           +W+GVDEVRSLARDRGL+KTPGWSSIE++ K+D+FYTGNRTHP+ E+IYDELR+LTAKMK
Sbjct: 668 KWEGVDEVRSLARDRGLKKTPGWSSIEVNNKVDIFYTGNRTHPKYEKIYDELRNLTAKMK 727

Query: 726 SLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCGDCHN 785
           SLGYV + +FVLQDVE+DEKE+IL SHSERLA+AFGIIS+PPK+ +QIFKNLRVCGDCHN
Sbjct: 728 SLGYVPDKSFVLQDVEEDEKEHILTSHSERLAIAFGIISSPPKSPIQIFKNLRVCGDCHN 787

Query: 786 ATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
            TKFIS+ITEREIIVRDSNRFHHFKDG+CSCGDYW
Sbjct: 788 WTKFISQITEREIIVRDSNRFHHFKDGICSCGDYW 820

BLAST of CmoCh16G002770 vs. TAIR10
Match: AT4G33990.1 (AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 1023.8 bits (2646), Expect = 5.6e-299
Identity = 479/778 (61.57%), Positives = 608/778 (78.15%), Query Frame = 1

Query: 44  EKKEID-FNRLFLVCKKVHLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTF 103
           E KEID  + LF  C  +  AK LHA LVVS ++Q++ +SAKL+NLY +LG+V+ AR TF
Sbjct: 50  ESKEIDDVHTLFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTF 109

Query: 104 DQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDG 163
           D IQ +DVY WN MIS Y R G+  E + CF  FM +S L PDY TFP V++AC  + DG
Sbjct: 110 DHIQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRTVIDG 169

Query: 164 KKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLN 223
            KIHCLALK GF  DV++AASLIH YSR+  V  AR LFD + +RD+G+WNAMISG+C +
Sbjct: 170 NKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQS 229

Query: 224 GKVVEALEVFDEMRFKSVTMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVC 283
           G   EAL + + +R     MDSVT  SLL  C +  D   GV IH Y+IK GLE +LFV 
Sbjct: 230 GNAKEALTLSNGLR----AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVS 289

Query: 284 NALINMYAKFGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAV 343
           N LI++YA+FG L   + +F+++  +D++SWNS+I A+E N++P+ A+ L+++M  +   
Sbjct: 290 NKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQ 349

Query: 344 PDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARK 403
           PD LTL+SLAS+ ++LG+  + RS+ GF  RKGWFL+D+ IGNA++ MYAKLG +DSAR 
Sbjct: 350 PDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYAKLGLVDSARA 409

Query: 404 VFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIG 463
           VF  LP  DV+SWNT+I+GY+QNG A+EAI++Y++M +  +   NQGTWVS+L A SQ G
Sbjct: 410 VFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAG 469

Query: 464 ALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIIS 523
           AL+QGMK HG L+KN LY D+FV T L DMYGKCGRL DALSLFY+IP  +SV WN +I+
Sbjct: 470 ALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIA 529

Query: 524 CHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIR 583
           CHG HG+G KAV LF+EM  EGVKPDHITFV+LLSACSHSGLVDEGQWCF++M   YGI 
Sbjct: 530 CHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGIT 589

Query: 584 PSLKHYGCMVDLFGRAGHLKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDH 643
           PSLKHYGCMVD++GRAG L+ A  F+K+M +QPDAS+WGALL ACR+H NV+L +  S+H
Sbjct: 590 PSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEH 649

Query: 644 LLEVESKNVGYYVLLSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYT 703
           L EVE ++VGY+VLLSN+YA  G+W+GVDE+RS+A  +GLRKTPGWSS+E+D K++VFYT
Sbjct: 650 LFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYT 709

Query: 704 GNRTHPRCEEIYDELRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGI 763
           GN+THP  EE+Y EL  L AK+K +GYV ++ FVLQDVEDDEKE+IL+SHSERLA+AF +
Sbjct: 710 GNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMSHSERLAIAFAL 769

Query: 764 ISTPPKTTLQIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           I+TP KTT++IFKNLRVCGDCH+ TKFISKITEREIIVRDSNRFHHFK+GVCSCGDYW
Sbjct: 770 IATPAKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823

BLAST of CmoCh16G002770 vs. TAIR10
Match: AT4G30700.1 (AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 600.1 bits (1546), Expect = 2.0e-171
Identity = 315/764 (41.23%), Positives = 466/764 (60.99%), Query Frame = 1

Query: 61  HLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAY 120
           HLA+  HA +++ G    I L  KL    + LG + +AR  F  +Q  DV+ +N ++  +
Sbjct: 35  HLAQT-HAQIILHGFRNDISLLTKLTQRLSDLGAIYYARDIFLSVQRPDVFLFNVLMRGF 94

Query: 121 ARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDD---GKKIHCLALKLGFECD 180
           +     H ++  F     ++ L+P+  T+   I A     D   G+ IH  A+  G + +
Sbjct: 95  SVNESPHSSLSVFAHLRKSTDLKPNSSTYAFAISAASGFRDDRAGRVIHGQAVVDGCDSE 154

Query: 181 VFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRF 240
           + + ++++  Y +F  V  AR +FD +  +D   WN MISG+  N   VE+++VF ++  
Sbjct: 155 LLLGSNIVKMYFKFWRVEDARKVFDRMPEKDTILWNTMISGYRKNEMYVESIQVFRDLIN 214

Query: 241 KSVT-MDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELG 300
           +S T +D+ T   +LP  A+L ++  G+ IH  A K G     +V    I++Y+K G++ 
Sbjct: 215 ESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIK 274

Query: 301 SAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAA 360
               +F +    DIV++N++I  +  N E  ++L L+K++  +GA     TLVSL  V+ 
Sbjct: 275 MGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVS- 334

Query: 361 ELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWN 420
             G+ +   +IHG+  +   FL    +  A+  +Y+KL  I+SARK+F+E P K + SWN
Sbjct: 335 --GHLMLIYAIHGYCLKSN-FLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWN 394

Query: 421 TLITGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIK 480
            +I+GY+QNGL  +AI ++  M   S+  PN  T   IL+A +Q+GAL  G   H L+  
Sbjct: 395 AMISGYTQNGLTEDAISLFREMQK-SEFSPNPVTITCILSACAQLGALSLGKWVHDLVRS 454

Query: 481 NFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVEL 540
                 I+V T LI MY KCG +A+A  LF  +  K+ V+WN +IS +GLHG G +A+ +
Sbjct: 455 TDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNI 514

Query: 541 FREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFG 600
           F EM   G+ P  +TF+ +L ACSH+GLV EG   F  M   YG  PS+KHY CMVD+ G
Sbjct: 515 FYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILG 574

Query: 601 RAGHLKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVL 660
           RAGHL++A  F++ M I+P +SVW  LLGACRIH++  LARTVS+ L E++  NVGY+VL
Sbjct: 575 RAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVL 634

Query: 661 LSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDE 720
           LSNI++    +     VR  A+ R L K PG++ IEI +   VF +G+++HP+ +EIY++
Sbjct: 635 LSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEK 694

Query: 721 LRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKN 780
           L  L  KM+  GY       L DVE++E+E ++  HSERLA+AFG+I+T P T ++I KN
Sbjct: 695 LEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKN 754

Query: 781 LRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           LRVC DCH  TK ISKITER I+VRD+NRFHHFKDGVCSCGDYW
Sbjct: 755 LRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of CmoCh16G002770 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 589.3 bits (1518), Expect = 3.5e-168
Identity = 299/767 (38.98%), Positives = 469/767 (61.15%), Query Frame = 1

Query: 57  CKKVHLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSM 116
           C  +   +++  L+  +G  Q  F   KL++L+   G V  A R F+ I +K    +++M
Sbjct: 47  CSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYGSVDEAARVFEPIDSKLNVLYHTM 106

Query: 117 ISAYARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGN---LDDGKKIHCLALKLG 176
           +  +A++    +A+  F   M    ++P  Y F  +++ CG+   L  GK+IH L +K G
Sbjct: 107 LKGFAKVSDLDKALQFFVR-MRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVKSG 166

Query: 177 FECDVFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFD 236
           F  D+F    L + Y++   VN AR +FD +  RD+ +WN +++G+  NG    ALE+  
Sbjct: 167 FSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEMVK 226

Query: 237 EMRFKSVTMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFG 296
            M  +++    +T  S+LP  + L  I  G  IH YA++ G +  + +  AL++MYAK G
Sbjct: 227 SMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYAKCG 286

Query: 297 ELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLAS 356
            L +A  +F+ +  +++VSWNS+I A+ QN+ P  A+ +++KM   G  P  ++++    
Sbjct: 287 SLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMGALH 346

Query: 357 VAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVV 416
             A+LG+    R IH      G   ++V + N++I MY K   +D+A  +F +L  + +V
Sbjct: 347 ACADLGDLERGRFIHKLSVELG-LDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTLV 406

Query: 417 SWNTLITGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGL 476
           SWN +I G++QNG   +A++ +  M   +   P+  T+VS++TA +++         HG+
Sbjct: 407 SWNAMILGFAQNGRPIDALNYFSQMRSRT-VKPDTFTYVSVITAIAELSITHHAKWIHGV 466

Query: 477 LIKNFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKA 536
           ++++ L  ++FV T L+DMY KCG +  A  +F  +  +   +WNA+I  +G HG+G  A
Sbjct: 467 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 526

Query: 537 VELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVD 596
           +ELF EMQ   +KP+ +TF+S++SACSHSGLV+ G  CF +M E Y I  S+ HYG MVD
Sbjct: 527 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 586

Query: 597 LFGRAGHLKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGY 656
           L GRAG L +A+DF+  MP++P  +V+GA+LGAC+IH+NV  A   ++ L E+   + GY
Sbjct: 587 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 646

Query: 657 YVLLSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEI 716
           +VLL+NIY     W+ V +VR     +GLRKTPG S +EI  ++  F++G+  HP  ++I
Sbjct: 647 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 706

Query: 717 YDELRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQI 776
           Y  L  L   +K  GYV + N VL  VE+D KE +L +HSE+LA++FG+++T   TT+ +
Sbjct: 707 YAFLEKLICHIKEAGYVPDTNLVL-GVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHV 766

Query: 777 FKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
            KNLRVC DCHNATK+IS +T REI+VRD  RFHHFK+G CSCGDYW
Sbjct: 767 RKNLRVCADCHNATKYISLVTGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of CmoCh16G002770 vs. TAIR10
Match: AT3G57430.1 (AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 588.6 bits (1516), Expect = 6.1e-168
Identity = 306/781 (39.18%), Positives = 471/781 (60.31%), Query Frame = 1

Query: 60  VHLAKRLHALLVVSGK-VQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMIS 119
           + L K++HA +   G  V S+ ++  L+NLY   GD     + FD+I  ++  +WNS+IS
Sbjct: 113 MELGKQIHAHVYKFGYGVDSVTVANTLVNLYRKCGDFGAVYKVFDRISERNQVSWNSLIS 172

Query: 120 AYARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDD------GKKIHCLALKL 179
           +      +  A++ F   +  ++ +P  +T   V+ AC NL        GK++H   L+ 
Sbjct: 173 SLCSFEKWEMALEAFRCMLDENV-EPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRK 232

Query: 180 GFECDVFIAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVF 239
           G E + FI  +L+  Y + G +  ++ L  S   RD+ TWN ++S  C N +++EALE  
Sbjct: 233 G-ELNSFIINTLVAMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYL 292

Query: 240 DEMRFKSVTMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLG-LEFDLFVCNALINMYAK 299
            EM  + V  D  T SS+LP C+ L+ + +G  +H YA+K G L+ + FV +AL++MY  
Sbjct: 293 REMVLEGVEPDEFTISSVLPACSHLEMLRTGKELHAYALKNGSLDENSFVGSALVDMYCN 352

Query: 300 FGELGSAETIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMH-ATGAVPDLLTLVS 359
             ++ S   +F+ +  + I  WN++IA + QN+    AL L+  M  + G + +  T+  
Sbjct: 353 CKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAG 412

Query: 360 LASVAAELGNFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVK 419
           +       G F    +IHGFV ++G   +D  + N ++DMY++LG ID A ++F ++  +
Sbjct: 413 VVPACVRSGAFSRKEAIHGFVVKRG-LDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDR 472

Query: 420 DVVSWNTLITGYSQNGLANEAIDVYHLMNDYSDAV----------PNQGTWVSILTAYSQ 479
           D+V+WNT+ITGY  +    +A+ + H M +    V          PN  T ++IL + + 
Sbjct: 473 DLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGASRVSLKPNSITLMTILPSCAA 532

Query: 480 IGALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAI 539
           + AL +G + H   IKN L  D+ VG+ L+DMY KCG L  +  +F +IP K+ ++WN I
Sbjct: 533 LSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVI 592

Query: 540 ISCHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYG 599
           I  +G+HG G +A++L R M  +GVKP+ +TF+S+ +ACSHSG+VDEG   F +M   YG
Sbjct: 593 IMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYG 652

Query: 600 IRPSLKHYGCMVDLFGRAGHLKKAYDFVKTMPIQPD-ASVWGALLGACRIHENVELARTV 659
           + PS  HY C+VDL GRAG +K+AY  +  MP   + A  W +LLGA RIH N+E+    
Sbjct: 653 VEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIA 712

Query: 660 SDHLLEVESKNVGYYVLLSNIYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDV 719
           + +L+++E     +YVLL+NIY+  G WD   EVR   +++G+RK PG S IE   ++  
Sbjct: 713 AQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHK 772

Query: 720 FYTGNRTHPRCEEIYDELRDLTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMA 779
           F  G+ +HP+ E++   L  L  +M+  GYV + + VL +VE+DEKE +L  HSE+LA+A
Sbjct: 773 FVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIA 832

Query: 780 FGIISTPPKTTLQIFKNLRVCGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDY 821
           FGI++T P T +++ KNLRVC DCH ATKFISKI +REII+RD  RFH FK+G CSCGDY
Sbjct: 833 FGILNTSPGTIIRVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCSCGDY 890

BLAST of CmoCh16G002770 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 582.8 bits (1501), Expect = 3.3e-166
Identity = 293/743 (39.43%), Positives = 448/743 (60.30%), Query Frame = 1

Query: 81  LSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHEAVDCFHEFMSTS 140
           L +KL  +Y   GD+  A R FD+++ +    WN +++  A+ G F  ++  F + MS+ 
Sbjct: 131 LGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLFKKMMSSG 190

Query: 141 ILQPDYYTFPPVIRACGNLDD---GKKIHCLALKLGFECDVFIAASLIHFYSRFGFVNLA 200
           + + D YTF  V ++  +L     G+++H   LK GF     +  SL+ FY +   V+ A
Sbjct: 191 V-EMDSYTFSCVSKSFSSLRSVHGGEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSA 250

Query: 201 RNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFSSLLPICAQL 260
           R +FD +  RD+ +WN++I+G+  NG   + L VF +M    + +D  T  S+   CA  
Sbjct: 251 RKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADS 310

Query: 261 DDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAKDIVSWNSLI 320
             I  G  +H   +K     +   CN L++MY+K G+L SA+ +F ++  + +VS+ S+I
Sbjct: 311 RLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMI 370

Query: 321 AAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIHGFVTRKGWF 380
           A + +      A+ L+++M   G  PD+ T+ ++ +  A        + +H ++      
Sbjct: 371 AGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLG 430

Query: 381 LQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLANEAIDVYHL 440
             D+ + NA++DMYAK G +  A  VF E+ VKD++SWNT+I GYS+N  ANEA+ +++L
Sbjct: 431 F-DIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNL 490

Query: 441 MNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTCLIDMYGKCG 500
           + +     P++ T   +L A + + A  +G + HG +++N  + D  V   L+DMY KCG
Sbjct: 491 LLEEKRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCG 550

Query: 501 RLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPDHITFVSLLS 560
            L  A  LF +I  K  VSW  +I+ +G+HG+G +A+ LF +M+  G++ D I+FVSLL 
Sbjct: 551 ALLLAHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLY 610

Query: 561 ACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHLKKAYDFVKTMPIQPDA 620
           ACSHSGLVDEG   F +M     I P+++HY C+VD+  R G L KAY F++ MPI PDA
Sbjct: 611 ACSHSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDA 670

Query: 621 SVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIYAKLGQWDGVDEVRSLA 680
           ++WGALL  CRIH +V+LA  V++ + E+E +N GYYVL++NIYA+  +W+ V  +R   
Sbjct: 671 TIWGALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRI 730

Query: 681 RDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLTAKMKSLGYVANYNFVL 740
             RGLRK PG S IEI  ++++F  G+ ++P  E I   LR + A+M   GY     + L
Sbjct: 731 GQRGLRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYAL 790

Query: 741 QDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCGDCHNATKFISKITERE 800
            D E+ EKE  L  HSE+LAMA GIIS+     +++ KNLRVCGDCH   KF+SK+T RE
Sbjct: 791 IDAEEMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRRE 850

Query: 801 IIVRDSNRFHHFKDGVCSCGDYW 821
           I++RDSNRFH FKDG CSC  +W
Sbjct: 851 IVLRDSNRFHQFKDGHCSCRGFW 871

BLAST of CmoCh16G002770 vs. NCBI nr
Match: gi|659111236|ref|XP_008455647.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g33990 [Cucumis melo])

HSP 1 Score: 1490.3 bits (3857), Expect = 0.0e+00
Identity = 715/820 (87.20%), Positives = 763/820 (93.05%), Query Frame = 1

Query: 1   MRFLPCKWRRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKV 60
           MRFL CKWR++SLF+PSFQACC LYSATT    PKYYLD VE EK+EIDFNRLFL C KV
Sbjct: 1   MRFLQCKWRQVSLFKPSFQACCSLYSATTA---PKYYLDGVENEKREIDFNRLFLFCTKV 60

Query: 61  HLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAY 120
           HLAK+LH LLVVSGK QSIFLSAKLIN YAFLGD+S AR TFDQIQ KDVYTWNSMISAY
Sbjct: 61  HLAKQLHGLLVVSGKTQSIFLSAKLINRYAFLGDISHARLTFDQIQTKDVYTWNSMISAY 120

Query: 121 ARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFI 180
           ARIGHFH A+DCF+EF+STSILQ D+YTFPPVIRACGNLDDG+KIHCL LKLGFECDV+I
Sbjct: 121 ARIGHFHAAIDCFNEFLSTSILQSDHYTFPPVIRACGNLDDGRKIHCLVLKLGFECDVYI 180

Query: 181 AASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSV 240
           AAS IHFYSRFGFV+LA NLFD++MIRDIGTWNAMISGFCLN KV EALEVFDEMR KSV
Sbjct: 181 AASFIHFYSRFGFVSLACNLFDNMMIRDIGTWNAMISGFCLNDKVAEALEVFDEMRLKSV 240

Query: 241 TMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAET 300
           TMDSVT SSLLPICAQLDDII GVLIHVYAIKLGLEFDLFVCNALINMYAKFGEL SAET
Sbjct: 241 TMDSVTISSLLPICAQLDDIIWGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAET 300

Query: 301 IFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGN 360
           IFNQ++ +DIVSWNSL+AAFEQNK+PV+ALG+Y KMH+ G VPDLLTLVSLASV AELGN
Sbjct: 301 IFNQMKVRDIVSWNSLLAAFEQNKKPVIALGVYNKMHSIGIVPDLLTLVSLASVIAELGN 360

Query: 361 FLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLIT 420
           FLSSRSIHGFVTR+ WFL D+ +GNAIIDMYAKLG+IDSARKVFE LPVKDV+SWN+LIT
Sbjct: 361 FLSSRSIHGFVTRRCWFLHDIALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLIT 420

Query: 421 GYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLY 480
           GYSQNGLANEAIDVY  M DYS+AVPNQGTWVSILTA SQ+GALKQGMKTHG LIKNFLY
Sbjct: 421 GYSQNGLANEAIDVYCSMRDYSNAVPNQGTWVSILTALSQLGALKQGMKTHGQLIKNFLY 480

Query: 481 FDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREM 540
           FDIFV TCLIDMYGKCGRLADALSLFYE+PHKSSVSWNAIISCHGLHGYGLKAV+LF+EM
Sbjct: 481 FDIFVSTCLIDMYGKCGRLADALSLFYEVPHKSSVSWNAIISCHGLHGYGLKAVKLFKEM 540

Query: 541 QTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGH 600
           Q+EGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM   Y IRPSLKHYGCMVDLFGRAGH
Sbjct: 541 QSEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMEGTYAIRPSLKHYGCMVDLFGRAGH 600

Query: 601 LKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNI 660
           L+KAY+FVK MP+QPD SVWGALLGACRIHENVEL RTVSDHLL+VESKNVGYYVLLSNI
Sbjct: 601 LEKAYNFVKNMPVQPDVSVWGALLGACRIHENVELVRTVSDHLLKVESKNVGYYVLLSNI 660

Query: 661 YAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDL 720
           YAK GQW+G D VRS AR+RGL+KTPGWSSIE+DKKIDVFYTGN+THP+CEEIY ELR+L
Sbjct: 661 YAKFGQWEGADVVRSKARERGLKKTPGWSSIEVDKKIDVFYTGNQTHPKCEEIYSELRNL 720

Query: 721 TAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVC 780
           TAKMKS+GYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPPKTTLQIFKNLRVC
Sbjct: 721 TAKMKSIGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPKTTLQIFKNLRVC 780

Query: 781 GDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           GDCHNATKFISKITEREIIVRDSNRFHHFKDG CSCGDYW
Sbjct: 781 GDCHNATKFISKITEREIIVRDSNRFHHFKDGACSCGDYW 817

BLAST of CmoCh16G002770 vs. NCBI nr
Match: gi|700198543|gb|KGN53701.1| (hypothetical protein Csa_4G107430 [Cucumis sativus])

HSP 1 Score: 1476.1 bits (3820), Expect = 0.0e+00
Identity = 710/820 (86.59%), Positives = 765/820 (93.29%), Query Frame = 1

Query: 1   MRFLPCKWRRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKV 60
           +RFL CKWRR+SLF+PSFQAC  LYSAT     PKY LD VE EK+EIDFNR+FL C KV
Sbjct: 25  LRFLQCKWRRVSLFKPSFQACS-LYSATAA---PKY-LDGVENEKREIDFNRIFLYCTKV 84

Query: 61  HLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAY 120
           HLAK+LHALLVVSGK QSIFLSAKLIN YAFLGD+  AR TFDQIQ KDVYTWNSMISAY
Sbjct: 85  HLAKQLHALLVVSGKTQSIFLSAKLINRYAFLGDIPHARLTFDQIQTKDVYTWNSMISAY 144

Query: 121 ARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFI 180
           ARIGHFH AVDCF+EF+STS LQ D+YTFPPVIRACGNLDDG+K+HCL LKLGFECDV+I
Sbjct: 145 ARIGHFHAAVDCFNEFLSTSFLQSDHYTFPPVIRACGNLDDGRKVHCLVLKLGFECDVYI 204

Query: 181 AASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSV 240
           AAS IHFYSRFGFV+LA NLFD++MIRDIGTWNAMISGF LNGKV EALEVFDEMRFKSV
Sbjct: 205 AASFIHFYSRFGFVSLACNLFDNMMIRDIGTWNAMISGFYLNGKVAEALEVFDEMRFKSV 264

Query: 241 TMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAET 300
           +MDSVT SSLLPIC QLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGEL SAET
Sbjct: 265 SMDSVTISSLLPICVQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAET 324

Query: 301 IFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGN 360
           IFNQ++ +DIVSWNSL+AAFEQNK+PV+ALG+Y KMH+ G VPDLLTLVSLASVAAELGN
Sbjct: 325 IFNQMKVRDIVSWNSLLAAFEQNKKPVIALGVYNKMHSIGVVPDLLTLVSLASVAAELGN 384

Query: 361 FLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLIT 420
           FLSSRSIHGFVTR+ WFL D+ +GNAIIDMYAKLG+IDSARKVFE LPVKDV+SWN+LIT
Sbjct: 385 FLSSRSIHGFVTRRCWFLHDIALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLIT 444

Query: 421 GYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLY 480
           GYSQNGLANEAIDVY  M  YS AVPNQGTWVSILTA+SQ+GALKQGMK HG LIKNFLY
Sbjct: 445 GYSQNGLANEAIDVYSSMRYYSGAVPNQGTWVSILTAHSQLGALKQGMKAHGQLIKNFLY 504

Query: 481 FDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREM 540
           FDIFV TCL+DMYGKCG+LADALSLFYE+PH+SSVSWNAIISCHGLHGYGLKAV+LF+EM
Sbjct: 505 FDIFVSTCLVDMYGKCGKLADALSLFYEVPHQSSVSWNAIISCHGLHGYGLKAVKLFKEM 564

Query: 541 QTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGH 600
           Q+EGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E YGIRPSLKHYGCMVDLFGRAGH
Sbjct: 565 QSEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMQETYGIRPSLKHYGCMVDLFGRAGH 624

Query: 601 LKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNI 660
           L+KA++FVK MP++PD SVWGALLGACRIHENVEL RTVSDHLL+VES+NVGYYVLLSNI
Sbjct: 625 LEKAFNFVKNMPVRPDVSVWGALLGACRIHENVELVRTVSDHLLKVESENVGYYVLLSNI 684

Query: 661 YAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDL 720
           YAKLG W+GVDEVRSLARDRGL+KTPGWSSIE+DKKIDVFYTGN+THP+CEEIY ELR+L
Sbjct: 685 YAKLGHWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTGNQTHPKCEEIYSELRNL 744

Query: 721 TAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVC 780
           TAKMKS+GYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPPKTTLQIFKNLRVC
Sbjct: 745 TAKMKSIGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPKTTLQIFKNLRVC 804

Query: 781 GDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           GDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
Sbjct: 805 GDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 839

BLAST of CmoCh16G002770 vs. NCBI nr
Match: gi|449439005|ref|XP_004137278.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g33990 [Cucumis sativus])

HSP 1 Score: 1475.3 bits (3818), Expect = 0.0e+00
Identity = 710/819 (86.69%), Positives = 764/819 (93.28%), Query Frame = 1

Query: 2   RFLPCKWRRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVH 61
           RFL CKWRR+SLF+PSFQAC  LYSAT     PKY LD VE EK+EIDFNR+FL C KVH
Sbjct: 3   RFLQCKWRRVSLFKPSFQACS-LYSATAA---PKY-LDGVENEKREIDFNRIFLYCTKVH 62

Query: 62  LAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYA 121
           LAK+LHALLVVSGK QSIFLSAKLIN YAFLGD+  AR TFDQIQ KDVYTWNSMISAYA
Sbjct: 63  LAKQLHALLVVSGKTQSIFLSAKLINRYAFLGDIPHARLTFDQIQTKDVYTWNSMISAYA 122

Query: 122 RIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIA 181
           RIGHFH AVDCF+EF+STS LQ D+YTFPPVIRACGNLDDG+K+HCL LKLGFECDV+IA
Sbjct: 123 RIGHFHAAVDCFNEFLSTSFLQSDHYTFPPVIRACGNLDDGRKVHCLVLKLGFECDVYIA 182

Query: 182 ASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVT 241
           AS IHFYSRFGFV+LA NLFD++MIRDIGTWNAMISGF LNGKV EALEVFDEMRFKSV+
Sbjct: 183 ASFIHFYSRFGFVSLACNLFDNMMIRDIGTWNAMISGFYLNGKVAEALEVFDEMRFKSVS 242

Query: 242 MDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETI 301
           MDSVT SSLLPIC QLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGEL SAETI
Sbjct: 243 MDSVTISSLLPICVQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELRSAETI 302

Query: 302 FNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNF 361
           FNQ++ +DIVSWNSL+AAFEQNK+PV+ALG+Y KMH+ G VPDLLTLVSLASVAAELGNF
Sbjct: 303 FNQMKVRDIVSWNSLLAAFEQNKKPVIALGVYNKMHSIGVVPDLLTLVSLASVAAELGNF 362

Query: 362 LSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITG 421
           LSSRSIHGFVTR+ WFL D+ +GNAIIDMYAKLG+IDSARKVFE LPVKDV+SWN+LITG
Sbjct: 363 LSSRSIHGFVTRRCWFLHDIALGNAIIDMYAKLGFIDSARKVFEGLPVKDVISWNSLITG 422

Query: 422 YSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYF 481
           YSQNGLANEAIDVY  M  YS AVPNQGTWVSILTA+SQ+GALKQGMK HG LIKNFLYF
Sbjct: 423 YSQNGLANEAIDVYSSMRYYSGAVPNQGTWVSILTAHSQLGALKQGMKAHGQLIKNFLYF 482

Query: 482 DIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQ 541
           DIFV TCL+DMYGKCG+LADALSLFYE+PH+SSVSWNAIISCHGLHGYGLKAV+LF+EMQ
Sbjct: 483 DIFVSTCLVDMYGKCGKLADALSLFYEVPHQSSVSWNAIISCHGLHGYGLKAVKLFKEMQ 542

Query: 542 TEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHL 601
           +EGVKPDHITFVSLLSACSHSGLVDEGQWCFQLM E YGIRPSLKHYGCMVDLFGRAGHL
Sbjct: 543 SEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMQETYGIRPSLKHYGCMVDLFGRAGHL 602

Query: 602 KKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIY 661
           +KA++FVK MP++PD SVWGALLGACRIHENVEL RTVSDHLL+VES+NVGYYVLLSNIY
Sbjct: 603 EKAFNFVKNMPVRPDVSVWGALLGACRIHENVELVRTVSDHLLKVESENVGYYVLLSNIY 662

Query: 662 AKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLT 721
           AKLG W+GVDEVRSLARDRGL+KTPGWSSIE+DKKIDVFYTGN+THP+CEEIY ELR+LT
Sbjct: 663 AKLGHWEGVDEVRSLARDRGLKKTPGWSSIEVDKKIDVFYTGNQTHPKCEEIYSELRNLT 722

Query: 722 AKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCG 781
           AKMKS+GYV +YNFVLQDVEDDEKENIL SHSERLAMAFGIISTPPKTTLQIFKNLRVCG
Sbjct: 723 AKMKSIGYVPDYNFVLQDVEDDEKENILTSHSERLAMAFGIISTPPKTTLQIFKNLRVCG 782

Query: 782 DCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           DCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW
Sbjct: 783 DCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 816

BLAST of CmoCh16G002770 vs. NCBI nr
Match: gi|1009150234|ref|XP_015892910.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g33990 [Ziziphus jujuba])

HSP 1 Score: 1204.9 bits (3116), Expect = 0.0e+00
Identity = 568/821 (69.18%), Positives = 680/821 (82.83%), Query Frame = 1

Query: 1   MRFLP-CKWRRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKK 60
           MR +P CK  +I  F PS QA C  +SA T   T +   D  E E K+IDF+ LF  C  
Sbjct: 1   MRLVPACKNLQIFKFLPSVQAHCSFFSAVTN--TLQVPADGFENENKKIDFDMLFPSCTT 60

Query: 61  VHLAKRLHALLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISA 120
           VHLAK LH+LLVVSG+V++IFLSAKL+NLYA+L DVSF+RRTFDQI  KD+YTWNSM+SA
Sbjct: 61  VHLAKCLHSLLVVSGRVENIFLSAKLVNLYAYLDDVSFSRRTFDQIPKKDIYTWNSMVSA 120

Query: 121 YARIGHFHEAVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVF 180
           Y R G F +A++CF+ F+ TS L+PD+YTFPPV++ACGNL DGKKIHC   KLGFE DVF
Sbjct: 121 YVRSGRFQQAIECFYHFLLTSDLRPDFYTFPPVLKACGNLVDGKKIHCWVQKLGFESDVF 180

Query: 181 IAASLIHFYSRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKS 240
           +AASLIH YSRFG + +AR LF+ + IRD G+WNAMISGFC NG   EAL+V +EMR   
Sbjct: 181 VAASLIHMYSRFGHLVIARKLFNEMPIRDTGSWNAMISGFCQNGNAAEALDVMNEMRLDG 240

Query: 241 VTMDSVTFSSLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAE 300
           V MD VT SSLL +CAQ +D++SG+LIH+Y IK GLEFD+FVCNALINMYAKF  +  A 
Sbjct: 241 VKMDPVTVSSLLTVCAQSNDMLSGMLIHLYVIKHGLEFDVFVCNALINMYAKFCIVDHAR 300

Query: 301 TIFNQIEAKDIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELG 360
            +F+Q++ +D+VSWNS+IAA+EQN EP+ A   YKK+   G   D LTL+SLAS+ A+L 
Sbjct: 301 KVFDQMKIRDVVSWNSIIAAYEQNDEPITAFEFYKKLQQNGIQSDSLTLLSLASIIAQLT 360

Query: 361 NFLSSRSIHGFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLI 420
           +   SRS+HGF+ R+GW +QDV  GNA++DMYAKLG IDSAR VFE LPVKDV+SWNTLI
Sbjct: 361 DDRKSRSVHGFILRRGWLMQDVATGNAVVDMYAKLGSIDSARTVFEGLPVKDVISWNTLI 420

Query: 421 TGYSQNGLANEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFL 480
           TGY+QNGLA+EA++VY +M + +D +PNQGTWVS+L AYS +GAL+QGM+ HG ++KN L
Sbjct: 421 TGYAQNGLASEAVEVYDMMKERTDIIPNQGTWVSVLPAYSHLGALQQGMRIHGRVMKNCL 480

Query: 481 YFDIFVGTCLIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFRE 540
           Y D+FVGTCLIDMYGKCGRL DA+ LFYE+P KSSV WNAIISCHG+HG+G KA+ELF+ 
Sbjct: 481 YMDVFVGTCLIDMYGKCGRLDDAMLLFYEVPRKSSVPWNAIISCHGIHGHGDKALELFKN 540

Query: 541 MQTEGVKPDHITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAG 600
           M  E VKPDH+TFVSLLSACSHSGLV EGQ  F  M + YGI+PSLKHYGCMVDLFGRAG
Sbjct: 541 MLVEEVKPDHVTFVSLLSACSHSGLVGEGQRYFDAMQKEYGIKPSLKHYGCMVDLFGRAG 600

Query: 601 HLKKAYDFVKTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSN 660
           HL+ AY+F+K MP+QPDAS+WGALLGACRIH NVEL +  SD L EVE++NVGYYVLLSN
Sbjct: 601 HLEMAYNFIKNMPVQPDASIWGALLGACRIHGNVELCKFASDSLFEVETENVGYYVLLSN 660

Query: 661 IYAKLGQWDGVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRD 720
           IYA  G+W+GVD+VRSLARD+GLRKTPGWSSIE + K+DVFYTGN++HP CEEIY ELR 
Sbjct: 661 IYANFGKWEGVDKVRSLARDKGLRKTPGWSSIEANNKVDVFYTGNQSHPNCEEIYTELRF 720

Query: 721 LTAKMKSLGYVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRV 780
           LTAKMKSLGY+ +Y+FVLQDVE+DEKE+IL SHSERLA+AFGIISTPPKT ++IFKNLRV
Sbjct: 721 LTAKMKSLGYIPDYSFVLQDVEEDEKEHILTSHSERLAIAFGIISTPPKTPIRIFKNLRV 780

Query: 781 CGDCHNATKFISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           CGDCHNATK+ISKITEREIIVRD+NRFHHFKDG+CSCGDYW
Sbjct: 781 CGDCHNATKYISKITEREIIVRDANRFHHFKDGICSCGDYW 819

BLAST of CmoCh16G002770 vs. NCBI nr
Match: gi|590686638|ref|XP_007042438.1| (Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao])

HSP 1 Score: 1199.9 bits (3103), Expect = 0.0e+00
Identity = 559/812 (68.84%), Positives = 674/812 (83.00%), Query Frame = 1

Query: 9   RRISLFRPSFQACCPLYSATTTAPTPKYYLDEVEIEKKEIDFNRLFLVCKKVHLAKRLHA 68
           R IS   P  Q  CPL+SA   A + +   +  E   K IDFN LF  C ++HLAKRLHA
Sbjct: 11  RHISKIFPLLQVRCPLFSAA--ANSLQGTSNGCEDNDKSIDFNHLFKSCTQLHLAKRLHA 70

Query: 69  LLVVSGKVQSIFLSAKLINLYAFLGDVSFARRTFDQIQAKDVYTWNSMISAYARIGHFHE 128
           L++VSGK QSIF+SAKL+NLYA+L DVSF+RRTFDQI  KDVYTWNSM+SAY R G F E
Sbjct: 71  LVLVSGKAQSIFISAKLVNLYAYLCDVSFSRRTFDQINEKDVYTWNSMVSAYVRSGRFQE 130

Query: 129 AVDCFHEFMSTSILQPDYYTFPPVIRACGNLDDGKKIHCLALKLGFECDVFIAASLIHFY 188
           AVDCF++F STS L+PD+YTFPPV++AC NL DG ++HCL LKLGFE DVF+ ASL+H Y
Sbjct: 131 AVDCFYQFFSTSGLRPDFYTFPPVLKACKNLPDGMRMHCLVLKLGFEWDVFVTASLVHMY 190

Query: 189 SRFGFVNLARNLFDSLMIRDIGTWNAMISGFCLNGKVVEALEVFDEMRFKSVTMDSVTFS 248
           +RF  V  AR LFD + +RD+G+WNAMISG+C NG   EALEV +EMR + V MD VT +
Sbjct: 191 TRFRIVGSARKLFDDMPVRDMGSWNAMISGYCQNGNAAEALEVLNEMRLERVMMDPVTIA 250

Query: 249 SLLPICAQLDDIISGVLIHVYAIKLGLEFDLFVCNALINMYAKFGELGSAETIFNQIEAK 308
           S+LPICAQLDDI+ G LIH+YAIK GLEFDLFV NALINMYAKFG+L  A+ +F+ +  +
Sbjct: 251 SILPICAQLDDILYGRLIHLYAIKSGLEFDLFVSNALINMYAKFGKLEHAQKVFDHMVVR 310

Query: 309 DIVSWNSLIAAFEQNKEPVVALGLYKKMHATGAVPDLLTLVSLASVAAELGNFLSSRSIH 368
           D+VSWNS+IAA+EQN +P +ALGL+  M   G  PD LTLVSL+S+ A+L +    +S+H
Sbjct: 311 DLVSWNSIIAAYEQNDDPHMALGLFYNMKLIGINPDYLTLVSLSSIVAQLSDSRKGKSVH 370

Query: 369 GFVTRKGWFLQDVVIGNAIIDMYAKLGYIDSARKVFEELPVKDVVSWNTLITGYSQNGLA 428
           GFV R+GWFL+DV+ GN+++DMYAKLG +DSA  VF  LPVKDVVSWNTLITGY+QNGLA
Sbjct: 371 GFVMRRGWFLKDVISGNSVVDMYAKLGIMDSAHAVFYVLPVKDVVSWNTLITGYAQNGLA 430

Query: 429 NEAIDVYHLMNDYSDAVPNQGTWVSILTAYSQIGALKQGMKTHGLLIKNFLYFDIFVGTC 488
            EAI+ Y +M +  +  PNQ TWVSIL AYS +GAL+QGM+ HG LIKN  Y DIFVGTC
Sbjct: 431 GEAIEAYGMMQECKEITPNQATWVSILPAYSNVGALQQGMRVHGRLIKNSFYLDIFVGTC 490

Query: 489 LIDMYGKCGRLADALSLFYEIPHKSSVSWNAIISCHGLHGYGLKAVELFREMQTEGVKPD 548
           LIDMYGKCG+L DA+SLF+E+P  +SV WNAIISCHG+HG+  KA++LFREM+ EGVKPD
Sbjct: 491 LIDMYGKCGKLDDAMSLFFEVPKMTSVPWNAIISCHGIHGHAEKALKLFREMREEGVKPD 550

Query: 549 HITFVSLLSACSHSGLVDEGQWCFQLMGELYGIRPSLKHYGCMVDLFGRAGHLKKAYDFV 608
           H+TFVSLLSACSHSGLVDEGQWCF +M E YGI P LKHYGCMVDLFGRAGHL+ AY+F+
Sbjct: 551 HVTFVSLLSACSHSGLVDEGQWCFHVMQEEYGIEPILKHYGCMVDLFGRAGHLEMAYNFI 610

Query: 609 KTMPIQPDASVWGALLGACRIHENVELARTVSDHLLEVESKNVGYYVLLSNIYAKLGQWD 668
           K +P++PDASVWGALLGACRIH N++L    SD L EV+S NVGYYVLLSNIYA +G+W+
Sbjct: 611 KNLPVKPDASVWGALLGACRIHGNIDLGTFASDRLFEVDSDNVGYYVLLSNIYANIGKWE 670

Query: 669 GVDEVRSLARDRGLRKTPGWSSIEIDKKIDVFYTGNRTHPRCEEIYDELRDLTAKMKSLG 728
           GVD+VR++ARD+GLRKTPGWSSIE+  K+DVFYTGNR+HP+CEEI+ ELR LTAKMKSLG
Sbjct: 671 GVDKVRAVARDKGLRKTPGWSSIEVSNKVDVFYTGNRSHPKCEEIFKELRSLTAKMKSLG 730

Query: 729 YVANYNFVLQDVEDDEKENILISHSERLAMAFGIISTPPKTTLQIFKNLRVCGDCHNATK 788
           YV +Y+FVLQDVE+DEKE+IL+SHSERLA+A+GIIS+PPK+ ++IFKNLRVCGDCHNATK
Sbjct: 731 YVPDYSFVLQDVEEDEKEHILMSHSERLAIAYGIISSPPKSPIRIFKNLRVCGDCHNATK 790

Query: 789 FISKITEREIIVRDSNRFHHFKDGVCSCGDYW 821
           FIS+IT+REIIVRDSNRFHHFKDG+CSCGDYW
Sbjct: 791 FISQITDREIIVRDSNRFHHFKDGICSCGDYW 820

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
PP348_ARATH   1.0e-297   61.57         Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN...
PP341_ARATH   3.6e-170   41.23         Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN...
PPR32_ARATH   6.3e-167   38.98         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP285_ARATH   1.1e-166   39.18         Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
PP320_ARATH   5.9e-165   39.43         Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Match Name         E-value    Identity (%)  Description
A0A0A0L0N9_CUCSA   0.0e+00    86.59         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G107430 PE=4 SV=1
A0A061DZS3_THECC   0.0e+00    68.84         Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TC...
F6HBK0_VITVI       0.0e+00    67.90         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0088g01130 PE=4 SV=...
A0A0D2RKK6_GOSRA   0.0e+00    67.86         Uncharacterized protein OS=Gossypium raimondii GN=B456_005G237500 PE=4 SV=1
A0A067F8D6_CITSI   0.0e+00    66.13         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003439mg PE=4 SV=1
Match Name    E-value    Identity (%)  Description
AT4G33990.1   5.6e-299   61.57         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G30700.1   2.0e-171   41.23         Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1   3.5e-168   38.98         Pentatricopeptide repeat (PPR) superfamily protein
AT3G57430.1   6.1e-168   39.18         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1   3.3e-166   39.43         Pentatricopeptide repeat (PPR) superfamily protein
Match Name                          E-value    Identity (%)  Description
gi|659111236|ref|XP_008455647.1|    0.0e+00    87.20         PREDICTED: pentatricopeptide repeat-containing protein At4g33990 [Cucumis melo]
gi|700198543|gb|KGN53701.1|         0.0e+00    86.59         hypothetical protein Csa_4G107430 [Cucumis sativus]
gi|449439005|ref|XP_004137278.1|    0.0e+00    86.69         PREDICTED: pentatricopeptide repeat-containing protein At4g33990 [Cucumis sativu...
gi|1009150234|ref|XP_015892910.1|   0.0e+00    69.18         PREDICTED: pentatricopeptide repeat-containing protein At4g33990 [Ziziphus jujub...
gi|590686638|ref|XP_007042438.1|    0.0e+00    68.84         Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0008270   zinc ion binding
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh16G002770.1   CmoCh16G002770.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term    IPR Description                         Source     Source Term        Source Description   Alignment
IPR002885   Pentatricopeptide repeat                PFAM       PF01535            PPR
    coord: 588..611   score: 0.51
    coord: 311..337   score: 0.0018
    coord: 281..309   score: 0.0068
    coord: 487..509   score: 0.
IPR002885   Pentatricopeptide repeat                PFAM       PF13041            PPR_2
    coord: 108..156   score: 4.5E-8
    coord: 513..560   score: 1.5E-10
    coord: 410..458   score: 9.9E-8
    coord: 208..252   score: 8.2
IPR002885   Pentatricopeptide repeat                TIGRFAMs   TIGR00756          TIGR00756
    coord: 281..311   score: 0.0015
    coord: 111..141   score: 2.5E-7
    coord: 311..344   score: 1.2E-4
    coord: 211..243   score: 4.6E-9
    coord: 413..439   score: 1.2E-5
    coord: 515..548   score: 1.
IPR002885   Pentatricopeptide repeat                PROFILE    PS51375            PPR
    coord: 243..277   score: 5.821
    coord: 584..614   score: 6.686
    coord: 208..242   score: 12.211
    coord: 447..477   score: 6.029
    coord: 482..512   score: 7.728
    coord: 548..578   score: 7.607
    coord: 78..108    score: 5.229
    coord: 650..684   score: 6.818
    coord: 109..144   score: 11.115
    coord: 411..441   score: 9.821
    coord: 380..410   score: 8.122
    coord: 309..343   score: 10.073
    coord: 278..308   score: 8.385
    coord: 177..207   score: 6.182
    coord: 513..547   score: 10
IPR011990   Tetratricopeptide-like helical domain   GENE3D     G3DSA:1.25.40.10
    coord: 307..442   score: 1.0E-9
    coord: 86..140    score: 1.0E-9
    coord: 614..668   score: 1.
None        No IPR available                        PANTHER    PTHR24015          FAMILY NOT NAMED
    coord: 65..344    score: 0.0
    coord: 381..691   score:
None        No IPR available                        PANTHER    PTHR24015:SF631    SUBFAMILY NOT NAMED
    coord: 381..691   score: 0.0
    coord: 65..344    score:

The following gene(s) are paralogous to this gene:

None