CmoCh04G022650 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G022650
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCmo_Chr04 : 16915995 .. 16918358 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGCTTTCCGTTGAAGTTCTTGCATTTGCTTCACTCTTCTCAGCGATGTTACTTTTCTTTCGCAGTCTTTTCCACGTTAGTCGCAGAGCCTCTTACCGAGTAATCTCTCTATCTTTGAATTCCTCGCATCCGGGTTGCCTTTCTTTCAATGTATTTAATGGCCCATCATCGTTAACGTCAATGAATGGCTACTACATTTCTTGCCCCTTTTTCTGGTTCACTAGCTTTCTTTGTATATTTCGGCTCCCTTTTGTTAGTTACTCGATTACAAATGATTCTTTTGAACTTTTAGACATTGGTTCCCTTCGTAAAATTATACAACAAGACCTCTGGAATGATCCTAAGATTGTTGTTTTATTTGATTCAGCACTAGCGCCCATTTGGGTTTCTAAGATTTTAGTTGAATTGAAAGAAGATCCAAAATTAGCCCTTAAGTTCTTCAAATGGGCTGGAACCCATATTGGTTTCCGCCATACCACAGAGTCTTACTGCATTATAGTTCACATGCTGTTTCGTGCGAGAATGTACACAAATGCCCATGATATTATGAAAGAAATGGTTTTGAAGAGCCGTACTGACTTGATTTTACCCGTTTGTAATGTATTTGATATTTTATGGTCGACTAGGAACTTTTGTGTGTCAGGAACAGGAGTCTTTGACGTTTTGTTTAGTGTTTTGGTAGAGTTGGGTCTGCTTGAGGAAGCTAATGAATGTTTCTCAAAAATGAGGAAGTTTAGGACTCTTCCCAAAGCACGTTCTTGCAATTTTCTCTTGCATAGATTATCAAAGTCAGGGAATGGACAGTTGGTGAGGAAATTTTTCCATGACATGATTGGGGCTGGTATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATCACTTGTGCAAAGAAGGGGATTTGGAAAATGCTAGAAGTTTGTTTGTGCAAATGAGGACGATGGGCTTTTCTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAAAAGAATCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGCTTTGATCAATTGTTTCTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTGAGATGAAGAACATTGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATTAAACTTTTTGTTGATATGAGAAGAGTTGGGCTTTTACCTAATGAATTCACATACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTAACCGAAGCATGGAAGTTGTCCAATGATATGTTGCAAGCAGGAGTTAATTTAAACATAGTCACCTATACAGCTCTAATGGATGGCCTTTGTGAGGATGGAAGAATGATGGAGGCAGAAGAAGTGTTCAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAGGTTTACACTGCTTTGGTTCATGGCTATATTAAGGCGGAGAAAATGGAAGATGCTTTGGAAATATTGAAGCAAATGACCGAATGTGGCATCAAACCAGATTTAGTACTCTATGGCACTATTATTTGGGGTCTCTGTAATCAAAACAAACTTGAAGAAACTAAGCTTATTATTAAAGAAATGAAAAGTCGGGGTATCCGTGCAAATCCTGTTATATATACAACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCAGATGCATTGGATCTTCTTCAGGAGATGCAGGAAGTAGGTGTTGAGGCAACCGTTGTAACCTACTGTGTATTAATTGATGGCTTGTGCAAAACAGGTATGGTGGAAGTGGCAGTTGATTATTTTGGTAGAATGTCTGATTTTGGTGTACAGCCTAATGTTGCAGTTTATACGGCCCTCATTGATGGTCTTTGTAAAATTAATTGCATTGAATCTGCCGAAAAGTTGTTTGAGGAAATGCAATGTAGGGGTATGACTCCGGATAAAACAGCTTTCACTGCTCTAATTGATGGCAACTTAAAGCTTGGAAATCTTCAGGAAACTTTGAATTTGATTAGCAAAATGACAGAATTAGTTATTGAGTTTGATTTGCATGCTTATACGACCTTGGTTTCGGGATTTTCTCAATGTGGTGAGCTGCACCAAGCGAGGAAGTTCTTTAATGAGATGATTGAGAAGGGCATACTTCCCGATGAAATTTTATGCATATGTCTATTGAAGGAGTATAACAAGCTTGGACATTTGGATGAAGCCATCAAATTGAAGAACGAAATGCAAAGGAGGGGTTTAATTACTGAAAAGTGCAGCCATGAAGTTCCCAGTCTAAAAACTTGA

mRNA sequence

ATGAAGCTTTCCGTTGAAGTTCTTGCATTTGCTTCACTCTTCTCAGCGATGTTACTTTTCTTTCGCAGTCTTTTCCACGTTAGTCGCAGAGCCTCTTACCGAGTAATCTCTCTATCTTTGAATTCCTCGCATCCGGGTTGCCTTTCTTTCAATGTATTTAATGGCCCATCATCGTTAACGTCAATGAATGGCTACTACATTTCTTGCCCCTTTTTCTGGTTCACTAGCTTTCTTTGTATATTTCGGCTCCCTTTTGTTAGTTACTCGATTACAAATGATTCTTTTGAACTTTTAGACATTGGTTCCCTTCGTAAAATTATACAACAAGACCTCTGGAATGATCCTAAGATTGTTGTTTTATTTGATTCAGCACTAGCGCCCATTTGGGTTTCTAAGATTTTAGTTGAATTGAAAGAAGATCCAAAATTAGCCCTTAAGTTCTTCAAATGGGCTGGAACCCATATTGGTTTCCGCCATACCACAGAGTCTTACTGCATTATAGTTCACATGCTGTTTCGTGCGAGAATGTACACAAATGCCCATGATATTATGAAAGAAATGGTTTTGAAGAGCCGTACTGACTTGATTTTACCCGTTTGTAATGTATTTGATATTTTATGGTCGACTAGGAACTTTTGTGTGTCAGGAACAGGAGTCTTTGACGTTTTGTTTAGTGTTTTGGTAGAGTTGGGTCTGCTTGAGGAAGCTAATGAATGTTTCTCAAAAATGAGGAAGTTTAGGACTCTTCCCAAAGCACGTTCTTGCAATTTTCTCTTGCATAGATTATCAAAGTCAGGGAATGGACAGTTGGTGAGGAAATTTTTCCATGACATGATTGGGGCTGGTATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATCACTTGTGCAAAGAAGGGGATTTGGAAAATGCTAGAAGTTTGTTTGTGCAAATGAGGACGATGGGCTTTTCTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAAAAGAATCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGCTTTGATCAATTGTTTCTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTGAGATGAAGAACATTGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATTAAACTTTTTGTTGATATGAGAAGAGTTGGGCTTTTACCTAATGAATTCACATACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTAACCGAAGCATGGAAGTTGTCCAATGATATGTTGCAAGCAGGAGTTAATTTAAACATAGTCACCTATACAGCTCTAATGGATGGCCTTTGTGAGGATGGAAGAATGATGGAGGCAGAAGAAGTGTTCAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAGGTTTACACTGCTTTGGTTCATGGCTATATTAAGGCGGAGAAAATGGAAGATGCTTTGGAAATATTGAAGCAAATGACCGAATGTGGCATCAAACCAGATTTAGTACTCTATGGCACTATTATTTGGGGTCTCTGTAATCAAAACAAACTTGAAGAAACTAAGCTTATTATTAAAGAAATGAAAAGTCGGGGTATCCGTGCAAATCCTGTTATATATACAACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCAGATGCATTGGATCTTCTTCAGGAGATGCAGGAAGTAGGTGTTGAGGCAACCGTTGTAACCTACTGTGTATTAATTGATGGCTTGTGCAAAACAGGTATGGTGGAAGTGGCAGTTGATTATTTTGGTAGAATGTCTGATTTTGGTGTACAGCCTAATGTTGCAGTTTATACGGCCCTCATTGATGGTCTTTGTAAAATTAATTGCATTGAATCTGCCGAAAAGTTGTTTGAGGAAATGCAATGTAGGGGTATGACTCCGGATAAAACAGCTTTCACTGCTCTAATTGATGGCAACTTAAAGCTTGGAAATCTTCAGGAAACTTTGAATTTGATTAGCAAAATGACAGAATTAGTTATTGAGTTTGATTTGCATGCTTATACGACCTTGGTTTCGGGATTTTCTCAATGTGGTGAGCTGCACCAAGCGAGGAAGTTCTTTAATGAGATGATTGAGAAGGGCATACTTCCCGATGAAATTTTATGCATATGTCTATTGAAGGAGTATAACAAGCTTGGACATTTGGATGAAGCCATCAAATTGAAGAACGAAATGCAAAGGAGGGGTTTAATTACTGAAAAGTGCAGCCATGAAGTTCCCAGTCTAAAAACTTGA

Coding sequence (CDS)

ATGAAGCTTTCCGTTGAAGTTCTTGCATTTGCTTCACTCTTCTCAGCGATGTTACTTTTCTTTCGCAGTCTTTTCCACGTTAGTCGCAGAGCCTCTTACCGAGTAATCTCTCTATCTTTGAATTCCTCGCATCCGGGTTGCCTTTCTTTCAATGTATTTAATGGCCCATCATCGTTAACGTCAATGAATGGCTACTACATTTCTTGCCCCTTTTTCTGGTTCACTAGCTTTCTTTGTATATTTCGGCTCCCTTTTGTTAGTTACTCGATTACAAATGATTCTTTTGAACTTTTAGACATTGGTTCCCTTCGTAAAATTATACAACAAGACCTCTGGAATGATCCTAAGATTGTTGTTTTATTTGATTCAGCACTAGCGCCCATTTGGGTTTCTAAGATTTTAGTTGAATTGAAAGAAGATCCAAAATTAGCCCTTAAGTTCTTCAAATGGGCTGGAACCCATATTGGTTTCCGCCATACCACAGAGTCTTACTGCATTATAGTTCACATGCTGTTTCGTGCGAGAATGTACACAAATGCCCATGATATTATGAAAGAAATGGTTTTGAAGAGCCGTACTGACTTGATTTTACCCGTTTGTAATGTATTTGATATTTTATGGTCGACTAGGAACTTTTGTGTGTCAGGAACAGGAGTCTTTGACGTTTTGTTTAGTGTTTTGGTAGAGTTGGGTCTGCTTGAGGAAGCTAATGAATGTTTCTCAAAAATGAGGAAGTTTAGGACTCTTCCCAAAGCACGTTCTTGCAATTTTCTCTTGCATAGATTATCAAAGTCAGGGAATGGACAGTTGGTGAGGAAATTTTTCCATGACATGATTGGGGCTGGTATTGCACCTTCAGTTTTTACCTACAATGTAATGATAGATCACTTGTGCAAAGAAGGGGATTTGGAAAATGCTAGAAGTTTGTTTGTGCAAATGAGGACGATGGGCTTTTCTCCAGATGTTGTCACATATAATTCTTTGATTGATGGCTATGGCAAGGTTGGTTTATTAAAAGAATCTGTGTATTTATTTAATGAAATGAAGGATGTAGGTTGTGTTCCTGATGTAATTACCTATAATGCTTTGATCAATTGTTTCTGCAAGTTTGAGAAGATGCCTCAAGCTTTTGAGTATCTCTCTGAGATGAAGAACATTGGGTTAAAACCAAATGTTGTAACCTATAGCACATTGATTGATGCTTTTTGCAAGGAGGGAATGATGCAAGGTGCCATTAAACTTTTTGTTGATATGAGAAGAGTTGGGCTTTTACCTAATGAATTCACATACACTTCTCTGATTGATGCCAATTGTAAGGCAGGTAATTTAACCGAAGCATGGAAGTTGTCCAATGATATGTTGCAAGCAGGAGTTAATTTAAACATAGTCACCTATACAGCTCTAATGGATGGCCTTTGTGAGGATGGAAGAATGATGGAGGCAGAAGAAGTGTTCAGGGCAATGCTGAAAGATGGAATATCTCCCAACCAGCAGGTTTACACTGCTTTGGTTCATGGCTATATTAAGGCGGAGAAAATGGAAGATGCTTTGGAAATATTGAAGCAAATGACCGAATGTGGCATCAAACCAGATTTAGTACTCTATGGCACTATTATTTGGGGTCTCTGTAATCAAAACAAACTTGAAGAAACTAAGCTTATTATTAAAGAAATGAAAAGTCGGGGTATCCGTGCAAATCCTGTTATATATACAACAATTATAGATGCTTATTTTAAGGCTGGAAAAAGCTCAGATGCATTGGATCTTCTTCAGGAGATGCAGGAAGTAGGTGTTGAGGCAACCGTTGTAACCTACTGTGTATTAATTGATGGCTTGTGCAAAACAGGTATGGTGGAAGTGGCAGTTGATTATTTTGGTAGAATGTCTGATTTTGGTGTACAGCCTAATGTTGCAGTTTATACGGCCCTCATTGATGGTCTTTGTAAAATTAATTGCATTGAATCTGCCGAAAAGTTGTTTGAGGAAATGCAATGTAGGGGTATGACTCCGGATAAAACAGCTTTCACTGCTCTAATTGATGGCAACTTAAAGCTTGGAAATCTTCAGGAAACTTTGAATTTGATTAGCAAAATGACAGAATTAGTTATTGAGTTTGATTTGCATGCTTATACGACCTTGGTTTCGGGATTTTCTCAATGTGGTGAGCTGCACCAAGCGAGGAAGTTCTTTAATGAGATGATTGAGAAGGGCATACTTCCCGATGAAATTTTATGCATATGTCTATTGAAGGAGTATAACAAGCTTGGACATTTGGATGAAGCCATCAAATTGAAGAACGAAATGCAAAGGAGGGGTTTAATTACTGAAAAGTGCAGCCATGAAGTTCCCAGTCTAAAAACTTGA
BLAST of CmoCh04G022650 vs. Swiss-Prot
Match: PP143_ARATH (Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis thaliana GN=At2g02150 PE=3 SV=1)

HSP 1 Score: 894.4 bits (2310), Expect = 8.8e-259
Identity = 441/773 (57.05%), Postives = 569/773 (73.61%), Query Frame = 1

Query: 17  MLLFFRSLFHVSRRASYRVISLSLNSSH---PGCLSFNVFNGPSSLTSMNGYYISCPFFW 76
           M    R+  HV+RR    V   S + S    P C         SS +     +ISCPF W
Sbjct: 1   MFCSLRNFLHVNRRFPRHVSPSSSSLSQIQSPLCFPL------SSPSPSQSSFISCPFVW 60

Query: 77  FTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVLFDSALAPIWVSKI 136
           FTSFLCI R PFV+ S T+   E  D   +RK++  DLW+DP +  LFD  LAPIWV ++
Sbjct: 61  FTSFLCIIRYPFVTKSGTSTYSEDFDRDWIRKVVHNDLWDDPGLEKLFDLTLAPIWVPRV 120

Query: 137 LVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRT 196
           LVELKEDPKLA KFFKW+ T  GF+H+ ESYCI+ H+LF ARMY +A+ ++KEMVL S+ 
Sbjct: 121 LVELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVL-SKA 180

Query: 197 DLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKAR 256
           D     C+VFD+LWSTRN CV G GVFD LFSVL++LG+LEEA +CFSKM++FR  PK R
Sbjct: 181 D-----CDVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTR 240

Query: 257 SCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQM 316
           SCN LLHR +K G    V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M
Sbjct: 241 SCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEM 300

Query: 317 RTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKM 376
           +  G  PD VTYNS+IDG+GKVG L ++V  F EMKD+ C PDVITYNALINCFCKF K+
Sbjct: 301 KFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKL 360

Query: 377 PQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSL 436
           P   E+  EMK  GLKPNVV+YSTL+DAFCKEGMMQ AIK +VDMRRVGL+PNE+TYTSL
Sbjct: 361 PIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSL 420

Query: 437 IDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGI 496
           IDANCK GNL++A++L N+MLQ GV  N+VTYTAL+DGLC+  RM EAEE+F  M   G+
Sbjct: 421 IDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGV 480

Query: 497 SPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKL 556
            PN   Y AL+HG++KA+ M+ ALE+L ++   GIKPDL+LYGT IWGLC+  K+E  K+
Sbjct: 481 IPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKV 540

Query: 557 IIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLC 616
           ++ EMK  GI+AN +IYTT++DAYFK+G  ++ L LL EM+E+ +E TVVT+CVLIDGLC
Sbjct: 541 VMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLC 600

Query: 617 KTGMVEVAVDYFGRMS-DFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDK 676
           K  +V  AVDYF R+S DFG+Q N A++TA+IDGLCK N +E+A  LFE+M  +G+ PD+
Sbjct: 601 KNKLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDR 660

Query: 677 TAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNE 736
           TA+T+L+DGN K GN+ E L L  KM E+ ++ DL AYT+LV G S C +L +AR F  E
Sbjct: 661 TAYTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEE 720

Query: 737 MIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHEVPSL 786
           MI +GI PDE+LCI +LK++ +LG +DEA++L++ + +  L+T    + +P++
Sbjct: 721 MIGEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNALPNM 761

BLAST of CmoCh04G022650 vs. Swiss-Prot
Match: PP141_ARATH (Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN=At2g01740 PE=3 SV=1)

HSP 1 Score: 384.0 bits (985), Expect = 3.9e-105
Identity = 206/562 (36.65%), Postives = 324/562 (57.65%), Query Frame = 1

Query: 232 LLEEANECFSKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYN 291
           ++ EA +  S++RK   LP   +CN  +H+L  S  G L  KF   ++  G  P   ++N
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 292 VMIDHLCKEGDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDV 351
            ++  +CK G ++ A  +   M   G  PDV++YNSLIDG+ + G ++ +  +   ++  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 352 G---CVPDVITYNALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMM 411
               C PD++++N+L N F K + + + F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLKC-CSPNVVTYSTWIDTFCKSGEL 180

Query: 412 QGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTAL 471
           Q A+K F  M+R  L PN  T+T LID  CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 472 MDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGI 531
           +DG C+ G M  AEE++  M++D + PN  VYT ++ G+ +    ++A++ L +M   G+
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 532 KPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALD 591
           + D+  YG II GLC   KL+E   I+++M+   +  + VI+TT+++AYFK+G+   A++
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 592 LLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLC 651
           +  ++ E G E  VV    +IDG+ K G +  A+ YF        + N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYF-----CIEKANDVMYTVLIDALC 420

Query: 652 KINCIESAEKLFEEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLH 711
           K       E+LF ++   G+ PDK  +T+ I G  K GNL +   L ++M +  +  DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 712 AYTTLVSGFSQCGELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEM 771
           AYTTL+ G +  G + +AR+ F+EM+  GI PD  +   L++ Y K G++  A  L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

Query: 772 QRRGLIT--------EKCSHEV 783
           QRRGL+T        ++C +EV
Sbjct: 541 QRRGLVTAVSDADCSKQCGNEV 556

BLAST of CmoCh04G022650 vs. Swiss-Prot
Match: PPR12_ARATH (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 368.6 bits (945), Expect = 1.7e-100
Identity = 200/616 (32.47%), Postives = 335/616 (54.38%), Query Frame = 1

Query: 128 IWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEM 187
           IWV   L+++K D +L L FF WA +        ES CI++H+   ++    A  ++   
Sbjct: 91  IWV---LMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 188 VLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFR 247
             + + ++       FD+L  T     S   VFDV F VLV+ GLL EA   F KM  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 248 TLPKARSCNFLLHRLSKSGNGQLVRKF-FHDMIGAGIAPSVFTYNVMIDHLCKEGDLENA 307
            +    SCN  L RLSK           F +    G+  +V +YN++I  +C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 308 RSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINC 367
             L + M   G++PDV++Y+++++GY + G L +   L   MK  G  P+   Y ++I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 368 FCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPN 427
            C+  K+ +A E  SEM   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 428 EFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFR 487
             TYT++I   C+ G++ EA KL ++M   G+  + VT+T L++G C+ G M +A  V  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 488 AMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQN 547
            M++ G SPN   YT L+ G  K   ++ A E+L +M + G++P++  Y +I+ GLC   
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 548 KLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYC 607
            +EE   ++ E ++ G+ A+ V YTT++DAY K+G+   A ++L+EM   G++ T+VT+ 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 608 VLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCR 667
           VL++G C  GM+E        M   G+ PN   + +L+   C  N +++A  ++++M  R
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 668 GMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQA 727
           G+ PD   +  L+ G+ K  N++E   L  +M        +  Y+ L+ GF +  +  +A
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 728 RKFFNEMIEKGILPDE 743
           R+ F++M  +G+  D+
Sbjct: 691 REVFDQMRREGLAADK 701

BLAST of CmoCh04G022650 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 347.8 bits (891), Expect = 3.1e-94
Identity = 205/649 (31.59%), Postives = 330/649 (50.85%), Query Frame = 1

Query: 127 PIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKE 186
           P   S +L++ + D  L LKF  WA  H  F  T    CI +H+L + ++Y  A  + ++
Sbjct: 48  PEAASNLLLKSQNDQALILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAED 107

Query: 187 MVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKF 246
           +  K+  D    +  VF  L  T + C S + VFD++      L L+++A       +  
Sbjct: 108 VAAKTLDDEYASL--VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAH 167

Query: 247 RTLPKARSCNFLLHRLSKSG-NGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLEN 306
             +P   S N +L    +S  N       F +M+ + ++P+VFTYN++I   C  G+++ 
Sbjct: 168 GFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDV 227

Query: 307 ARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALIN 366
           A +LF +M T G  P+VVTYN+LIDGY K+  + +   L   M   G  P++I+YN +IN
Sbjct: 228 ALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVIN 287

Query: 367 CFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLP 426
             C+  +M +    L+EM   G   + VTY+TLI  +CKEG    A+ +  +M R GL P
Sbjct: 288 GLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTP 347

Query: 427 NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVF 486
           +  TYTSLI + CKAGN+  A +  + M   G+  N  TYT L+DG  + G M EA  V 
Sbjct: 348 SVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVL 407

Query: 487 RAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQ 546
           R M  +G SP+   Y AL++G+    KMEDA+ +L+ M E G+ PD+V Y T++ G C  
Sbjct: 408 REMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRS 467

Query: 547 NKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTY 606
             ++E   + +EM  +GI+ + + Y+++I  + +  ++ +A DL +EM  VG+     TY
Sbjct: 468 YDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTY 527

Query: 607 CVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQC 666
             LI+  C  G +E A+     M + GV P+V  Y+ LI+GL K +    A++L  ++  
Sbjct: 528 TALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFY 587

Query: 667 RGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQ 726
               P    +  LI                    E     +  +  +L+ GF   G + +
Sbjct: 588 EESVPSDVTYHTLI--------------------ENCSNIEFKSVVSLIKGFCMKGMMTE 647

Query: 727 ARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLI 775
           A + F  M+ K   PD      ++  + + G + +A  L  EM + G +
Sbjct: 648 ADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFL 672

BLAST of CmoCh04G022650 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 341.7 bits (875), Expect = 2.2e-92
Identity = 202/633 (31.91%), Postives = 318/633 (50.24%), Query Frame = 1

Query: 142 KLALKFFKWAGTHIGFR--HTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLILPV 201
           KLALKF KW     G    H  +  CI  H+L RARMY  A  I+KE+ L S        
Sbjct: 51  KLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF--- 110

Query: 202 CNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCNFLL 261
             VF  L +T   C S   V+D+L  V +  G+++++ E F  M  +   P   +CN +L
Sbjct: 111 --VFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAIL 170

Query: 262 HRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQMRTMGFS 321
             + KSG    V  F  +M+   I P V T+N++I+ LC EG  E +  L  +M   G++
Sbjct: 171 GSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYA 230

Query: 322 PDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQAFEY 381
           P +VTYN+++  Y K G  K ++ L + MK  G   DV TYN LI+  C+  ++ + +  
Sbjct: 231 PTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLL 290

Query: 382 LSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCK 441
           L +M+   + PN VTY+TLI+ F  EG +  A +L  +M   GL PN  T+ +LID +  
Sbjct: 291 LRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHIS 350

Query: 442 AGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQV 501
            GN  EA K+   M   G+  + V+Y  L+DGLC++     A   +  M ++G+   +  
Sbjct: 351 EGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRIT 410

Query: 502 YTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMK 561
           YT ++ G  K   +++A+ +L +M++ GI PD+V Y  +I G C   + +  K I+  + 
Sbjct: 411 YTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIY 470

Query: 562 SRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVE 621
             G+  N +IY+T+I    + G   +A+ + + M   G      T+ VL+  LCK G V 
Sbjct: 471 RVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVA 530

Query: 622 VAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTAFTALI 681
            A ++   M+  G+ PN   +  LI+G         A  +F+EM   G  P    + +L+
Sbjct: 531 EAEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLL 590

Query: 682 DGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGIL 741
            G  K G+L+E    +  +  +    D   Y TL++   + G L +A   F EM+++ IL
Sbjct: 591 KGLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSIL 650

Query: 742 PDEILCICLLKEYNKLGHLDEAIKLKNEMQRRG 773
           PD      L+    + G    AI    E + RG
Sbjct: 651 PDSYTYTSLISGLCRKGKTVIAILFAKEAEARG 678

BLAST of CmoCh04G022650 vs. TrEMBL
Match: A0A061E9Z5_THECC (Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_011095 PE=4 SV=1)

HSP 1 Score: 1005.7 bits (2599), Expect = 3.0e-290
Identity = 496/785 (63.18%), Postives = 615/785 (78.34%), Query Frame = 1

Query: 7   VLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLTSM---N 66
           + A  S F+ ML+  RSLFH++RR     I + +  SHP    F +F     L      N
Sbjct: 4   ISAALSFFTKMLVSLRSLFHINRR-----IPVCVRVSHP----FPLFQNSRPLNFFPPSN 63

Query: 67  GYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIG--SLRKIIQQDLWNDPKIVVLF 126
              I CPF   TSF  + + PF +   +N    L D    S+ KIIQQD WNDPKIV LF
Sbjct: 64  NSIIVCPFILLTSFFYMMKFPFGTKCNSNTHIFLDDFNRESICKIIQQDQWNDPKIVTLF 123

Query: 127 DSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAH 186
           DS+LAPIWVSKILV LK++PKLALKFFKWA TH GF HT+ESYCI+VH+LF  RMY++A 
Sbjct: 124 DSSLAPIWVSKILVGLKQEPKLALKFFKWAKTHKGFGHTSESYCILVHILFYGRMYSDAS 183

Query: 187 DIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFS 246
            I+KE +L  R  ++LP C+ FD+LWSTRN C  G GVFD LFSVLV+LG+LEEA++CFS
Sbjct: 184 AILKEFILL-RQRVVLPGCDFFDVLWSTRNVCRYGFGVFDALFSVLVDLGMLEEASQCFS 243

Query: 247 KMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEG 306
           KM+++R LPK RSCN LLHRLSK+G     R+FF +MIG G+APSVFTYN++ID++CKEG
Sbjct: 244 KMKRYRVLPKVRSCNALLHRLSKTGRRDQSRRFFAEMIGVGVAPSVFTYNILIDYMCKEG 303

Query: 307 DLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYN 366
           +L+ AR LF QM+ +G +PD+VTYNSLIDGYGKVGLL E ++LF EMK V C PD+ITYN
Sbjct: 304 ELDTARMLFGQMKQIGLTPDIVTYNSLIDGYGKVGLLDEVIFLFEEMKSVECAPDIITYN 363

Query: 367 ALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRV 426
           ALINCFCKF++MPQAFE+  EM+N GLKPNVVTYSTLIDAFCKEGMMQ  IK  VDMRRV
Sbjct: 364 ALINCFCKFQRMPQAFEFFREMRNKGLKPNVVTYSTLIDAFCKEGMMQQGIKFLVDMRRV 423

Query: 427 GLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEA 486
           GLLPN FTYTSLIDA CKAG+LTEA KL+N+MLQ  V+LNIVTYT ++DGLCE GR  EA
Sbjct: 424 GLLPNVFTYTSLIDATCKAGSLTEALKLANEMLQENVDLNIVTYTTIIDGLCEAGRTKEA 483

Query: 487 EEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWG 546
           EE+FRAMLK  + PN  +YTAL HGY+K +KME AL +LK+M E  IKPDL+LYGTIIWG
Sbjct: 484 EEIFRAMLKAALKPNVHIYTALAHGYMKVKKMEHALNLLKEMKEKSIKPDLLLYGTIIWG 543

Query: 547 LCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEAT 606
           LCNQ+K+EETK+++ EMK   + +NPVIYTT++D+YFKAGK+++AL+LL+EM ++G+E T
Sbjct: 544 LCNQDKIEETKVVMSEMKESRLSSNPVIYTTVMDSYFKAGKTAEALNLLEEMSDLGIEVT 603

Query: 607 VVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFE 666
           VVT+CVL+DGLCKTG+V  A++YF RMS+F +QPNVA YT LIDGLCK N I++A+ +F+
Sbjct: 604 VVTFCVLVDGLCKTGLVLEAINYFNRMSEFNLQPNVAAYTVLIDGLCKNNFIQAAKNMFD 663

Query: 667 EMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCG 726
           EM  + + PDKTA+TALIDGNLK GN QE LNL ++M E+ IE DL AYT+LV GF QCG
Sbjct: 664 EMLSKNLVPDKTAYTALIDGNLKHGNFQEALNLQNEMIEMGIELDLPAYTSLVWGFCQCG 723

Query: 727 ELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHE 786
           +L QARKF +EMI K ILPDEILCI +L++Y +LGH+DEAI+L+NEM +RGLIT    + 
Sbjct: 724 QLQQARKFLDEMIRKHILPDEILCIGVLRKYYELGHVDEAIELQNEMAKRGLITSPIHYA 778

BLAST of CmoCh04G022650 vs. TrEMBL
Match: W9SE38_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000446 PE=4 SV=1)

HSP 1 Score: 993.0 bits (2566), Expect = 2.0e-286
Identity = 491/778 (63.11%), Postives = 615/778 (79.05%), Query Frame = 1

Query: 14  FSAMLLFFRSLFHVSRRASYRVISLSLNSSHP-GC---LSFNVFNGPSSLTSMNGYYISC 73
           F+ MLLF R+LFH SRRAS RV   S +  +P  C    S     G SS    N   ++C
Sbjct: 19  FTKMLLFLRNLFHTSRRASTRVSPFSPSIPYPHNCDLLPSLRSVYGKSS----NSCIVAC 78

Query: 74  PFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVLFDSALAPIW 133
           P  WFTSFL + R PF S S  + S E+LD   LR+I++QD W+DPKIV LFDSA+API 
Sbjct: 79  PLAWFTSFLFLVRFPFYSKSSASFSLEVLDREQLRRIVEQDQWHDPKIVNLFDSAIAPIL 138

Query: 134 VSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVL 193
           VS+ LVELKE P LALK FKW     GFRHT ESYCI+VH+LF ARM+ +A+ +++E+V 
Sbjct: 139 VSRFLVELKEYPFLALKLFKWVRNRTGFRHTAESYCILVHILFYARMFFDANGVLRELVS 198

Query: 194 KSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTL 253
            +R   +LP C+VFD+LWSTRN CV G GVFD LFSVLVELG+LEEAN+CF KMRKF  L
Sbjct: 199 SNR---VLPGCDVFDVLWSTRNVCVPGFGVFDALFSVLVELGMLEEANQCFLKMRKFHVL 258

Query: 254 PKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSL 313
           PK RSCN  LHRLSK G   + RKFF DM+ AGIAPSVFTYN+MI++LCKEGD++ ARSL
Sbjct: 259 PKPRSCNAFLHRLSKLGKVDMSRKFFKDMVAAGIAPSVFTYNIMINYLCKEGDMDEARSL 318

Query: 314 FVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCK 373
           F +M+  G  PD+VTYNSLIDG+GKVG + E++ +F +MKDVGC PD+IT+NALINCF K
Sbjct: 319 FEEMKHRGLIPDIVTYNSLIDGFGKVGNMDEAICIFEKMKDVGCEPDIITFNALINCFGK 378

Query: 374 FEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFT 433
            +++P+A E+L E++N GLKPNVVTYSTLIDAFCKEGMM+ A+K FVDMRRVGL PNE+T
Sbjct: 379 SQRLPRALEFLHELRNHGLKPNVVTYSTLIDAFCKEGMMREALKFFVDMRRVGLFPNEYT 438

Query: 434 YTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAML 493
           YTSL+DANCKAGNLTEA KL+N+MLQAG+NLNIV Y+AL++ LCEDGRM EAE+VF  ML
Sbjct: 439 YTSLVDANCKAGNLTEALKLTNEMLQAGINLNIVGYSALLNCLCEDGRMKEAEKVFMEML 498

Query: 494 KDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLE 553
           K G++PN QVY++LVHGY+KA+K E A + LK+M E  IKPDL+LYGTIIWGLC+QNKLE
Sbjct: 499 KAGVTPNLQVYSSLVHGYVKAKKTEKAFQTLKEMEEKKIKPDLLLYGTIIWGLCSQNKLE 558

Query: 554 ETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLI 613
           E++L++ EM+SRG+ AN  IYTT++DAYFKAGK+++AL LLQEM   G+E  VVTYC LI
Sbjct: 559 ESELVVNEMRSRGLNANHFIYTTLMDAYFKAGKTTEALLLLQEMHYYGIEVNVVTYCALI 618

Query: 614 DGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMT 673
           DGLCK G+VE A DYF RM   G+QPNVAVYTALIDGLCK N IE+A+KLF+EM  +G++
Sbjct: 619 DGLCKRGLVEEATDYFDRMVSIGLQPNVAVYTALIDGLCKNNRIEAAKKLFDEMLEKGIS 678

Query: 674 PDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKF 733
           PD+TA+T LIDGNLK G+LQE L L ++M E+ +E DL+AYT+L+ GFSQ G++ QA+ +
Sbjct: 679 PDRTAYTTLIDGNLKHGHLQEALTLKNRMIEMGMELDLYAYTSLIWGFSQFGQVQQAKTW 738

Query: 734 FNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHEVPSLKT 788
            +EMI KGILPDEILC+CLL++Y +LG++ EA +L++E+ +RGLI   C++ VP   T
Sbjct: 739 LDEMIGKGILPDEILCVCLLRKYYELGNVVEADELRDELVKRGLIKGACTYAVPEAGT 789

BLAST of CmoCh04G022650 vs. TrEMBL
Match: W9S012_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000854 PE=4 SV=1)

HSP 1 Score: 988.8 bits (2555), Expect = 3.8e-285
Identity = 490/778 (62.98%), Postives = 614/778 (78.92%), Query Frame = 1

Query: 14  FSAMLLFFRSLFHVSRRASYRVISLSLNSSHP-GC---LSFNVFNGPSSLTSMNGYYISC 73
           F+ MLLF R+LF  SRRAS RV   S +  +P  C    S     G SS    N   ++C
Sbjct: 19  FTKMLLFLRNLFLTSRRASTRVSPFSPSIPYPHNCDLLPSLRSVYGKSS----NSCIVAC 78

Query: 74  PFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVLFDSALAPIW 133
           P  WFTSFL + R PF S S  + S E+LD   LR+I++QD W+DPKIV LFDSA+API 
Sbjct: 79  PLAWFTSFLFLVRFPFYSKSSASFSLEVLDREQLRRIVEQDQWHDPKIVNLFDSAIAPIL 138

Query: 134 VSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVL 193
           VS+ LVELKE P LALK FKW     GFRHT ESYCI+VH+LF ARM+ +A+ +++E+V 
Sbjct: 139 VSRFLVELKEYPFLALKLFKWVRNRTGFRHTAESYCILVHILFYARMFFDANGVLRELVS 198

Query: 194 KSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTL 253
            +R   +LP C+VFD+LWSTRN CV G GVFD LFSVLVELG+LEEAN+CF KMRKF  L
Sbjct: 199 SNR---VLPGCDVFDVLWSTRNVCVPGFGVFDALFSVLVELGMLEEANQCFLKMRKFHVL 258

Query: 254 PKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSL 313
           PK RSCN  LHRLSK G   + RKFF DM+ AGIAPSVFTYN+MI++LCKEGD++ ARSL
Sbjct: 259 PKPRSCNAFLHRLSKLGKVDMSRKFFKDMVAAGIAPSVFTYNIMINYLCKEGDMDEARSL 318

Query: 314 FVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCK 373
           F +M+  G  PD+VTYNSLIDG+GKVG + E++ +F +MKDVGC PD+IT+NALINCF K
Sbjct: 319 FEEMKHRGLIPDIVTYNSLIDGFGKVGNMDEAICIFEKMKDVGCEPDIITFNALINCFGK 378

Query: 374 FEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFT 433
            +++P+A E+L E++N GLKPNVVTYSTLIDAFCKEGMM+ A+K FVDMRRVGL PNE+T
Sbjct: 379 SQRLPRALEFLHELRNHGLKPNVVTYSTLIDAFCKEGMMREALKFFVDMRRVGLFPNEYT 438

Query: 434 YTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAML 493
           YTSL+DANCKAGNLTEA KL+N+MLQAG+NLNIV Y+AL++ LCEDGRM EAE+VF  ML
Sbjct: 439 YTSLVDANCKAGNLTEALKLTNEMLQAGINLNIVGYSALLNCLCEDGRMKEAEKVFMEML 498

Query: 494 KDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLE 553
           K G++PN QVY++LVHGY+KA+K E A + LK+M E  IKPDL+LYGTIIWGLC+QNKLE
Sbjct: 499 KAGVTPNLQVYSSLVHGYVKAKKTEKAFQTLKEMEEKKIKPDLLLYGTIIWGLCSQNKLE 558

Query: 554 ETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLI 613
           E++L++ EM+SRG+ AN  IYTT++DAYFKAGK+++AL LLQEM   G+E  VVTYC LI
Sbjct: 559 ESELVVNEMRSRGLNANHFIYTTLMDAYFKAGKTTEALLLLQEMHYYGIEVNVVTYCALI 618

Query: 614 DGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMT 673
           DGLCK G+VE A DYF RM   G+QPNVAVYTALIDGLCK N IE+A+KLF+EM  +G++
Sbjct: 619 DGLCKRGLVEEATDYFDRMVSIGLQPNVAVYTALIDGLCKNNRIEAAKKLFDEMLEKGIS 678

Query: 674 PDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKF 733
           PD+TA+T LIDGNLK G+LQE L L ++M E+ +E DL+AYT+L+ GFSQ G++ QA+ +
Sbjct: 679 PDRTAYTTLIDGNLKHGHLQEALTLKNRMIEMGMELDLYAYTSLIWGFSQFGQVQQAKTW 738

Query: 734 FNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHEVPSLKT 788
            +EMI KGILPDEILC+CLL++Y +LG++ EA +L++E+ +RGLI   C++ VP   T
Sbjct: 739 LDEMIGKGILPDEILCVCLLRKYYELGNVVEADELRDELVKRGLIKGACTYAVPEAGT 789

BLAST of CmoCh04G022650 vs. TrEMBL
Match: B9RY36_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_0814140 PE=4 SV=1)

HSP 1 Score: 947.6 bits (2448), Expect = 9.7e-273
Identity = 479/747 (64.12%), Postives = 590/747 (78.98%), Query Frame = 1

Query: 27  VSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLTSMNGYYISCPFFWFTSFLCIFRLPFV 86
           + RR  +RV S  L+S+    L F VF+ PS + S +G    CP    T FLCI R PF 
Sbjct: 1   MGRRFPHRV-SPPLSSNPNAHLPF-VFSSPSLVPS-HGSLSYCPLMLLTGFLCILRFPF- 60

Query: 87  SYSITNDSF-ELLDIGSLRKIIQQDLWNDPKIVVLFDSALAPIWVSKILVELKEDPKLAL 146
              IT  SF   LD  S+ KIIQQD WNDPK V   DS+L PIWVS++LVELK+DPKLAL
Sbjct: 61  ---ITQSSFLGQLDKASIIKIIQQDQWNDPKFVRFIDSSLGPIWVSRVLVELKQDPKLAL 120

Query: 147 KFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLILPVCNVFDI 206
           KFF+WA T  GF  TTESYC++VH+LF ARMY +A+  +KE++   R   ILP  +VF++
Sbjct: 121 KFFRWAKTKFGFCLTTESYCLLVHILFYARMYFDANFFLKELISSRR---ILPGFDVFEV 180

Query: 207 LWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCNFLLHRLSKS 266
           LWSTRN CV G GVFD LFSV +ELG+LEEA +CFS+M +FR  PKARSCN  L+RL+K+
Sbjct: 181 LWSTRNVCVPGFGVFDALFSVFIELGMLEEAGQCFSRMTRFRVFPKARSCNAFLYRLAKT 240

Query: 267 GNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQMRTMGFSPDVVTY 326
           G G L  KFF DM+GAGIA SVFTYN+MI ++CKEGD+  A+SLF QM+ MG +PD+VTY
Sbjct: 241 GKGDLSNKFFRDMVGAGIAQSVFTYNIMIGYMCKEGDMVTAKSLFHQMKQMGLTPDIVTY 300

Query: 327 NSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQAFEYLSEMKN 386
           NSLIDGYGK+GLL ES  LF EMKDVGC PDVITYNALINCFCK+E+MP+AF +L EMKN
Sbjct: 301 NSLIDGYGKLGLLDESFCLFEEMKDVGCEPDVITYNALINCFCKYEQMPKAFHFLHEMKN 360

Query: 387 IGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTE 446
            GLKPNVVTYSTLIDA CKE M+Q AIK  +DMRRVGL PNEFTYTSLIDANCKAG L++
Sbjct: 361 SGLKPNVVTYSTLIDALCKEHMLQQAIKFLLDMRRVGLSPNEFTYTSLIDANCKAGYLSD 420

Query: 447 AWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTALVH 506
           A KL+++MLQ  V  N+VTYT L+DGLC++GRMMEAE++FRAM+K G++PN + YTALVH
Sbjct: 421 ALKLADEMLQVQVGFNVVTYTTLLDGLCKEGRMMEAEDLFRAMIKAGVTPNLKTYTALVH 480

Query: 507 GYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRGIRA 566
           G+IK +++E+ALE+LK++ E  IKPDL+LYGTIIWGLC+QNKLEE + ++ EMK+ GIRA
Sbjct: 481 GHIKNKRVENALELLKEIKEKKIKPDLLLYGTIIWGLCSQNKLEECEFVMSEMKACGIRA 540

Query: 567 NPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAVDYF 626
           N VIYT  +DAYFK GK+ +AL+LLQEM ++GVE T+VT+CVLIDGLCK G+VE A+DYF
Sbjct: 541 NSVIYTIRMDAYFKTGKTVEALNLLQEMCDLGVEVTIVTFCVLIDGLCKKGLVEEAIDYF 600

Query: 627 GRMSDFGVQP-NVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTAFTALIDGNLK 686
            RM+DF +QP NVAV TALIDGLCK N IE+A+KLF+EMQ + M PDK A+TALIDGNLK
Sbjct: 601 ARMADFNLQPNNVAVCTALIDGLCKNNYIEAAKKLFDEMQDKNMVPDKIAYTALIDGNLK 660

Query: 687 LGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGILPDEIL 746
             + QE LN+ S+M+EL +E DLHAYT+LV G SQ   + QAR F NEMI KGI+PDEIL
Sbjct: 661 HKDFQEALNIRSRMSELGMELDLHAYTSLVWGLSQGNLVQQARMFLNEMIGKGIVPDEIL 720

Query: 747 CICLLKEYNKLGHLDEAIKLKNEMQRR 772
           CI LL++Y +LG +DEAI+L +E+ ++
Sbjct: 721 CIRLLRKYYELGSIDEAIELHDELLKK 737

BLAST of CmoCh04G022650 vs. TrEMBL
Match: K7L5N5_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_08G090700 PE=4 SV=1)

HSP 1 Score: 897.1 bits (2317), Expect = 1.5e-257
Identity = 457/771 (59.27%), Postives = 574/771 (74.45%), Query Frame = 1

Query: 17  MLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLTSMNGYYISCPFFWFTS 76
           MLLF R+   +  RAS RV S     S P    F +F  PSSL+S N  +   P  WFTS
Sbjct: 1   MLLFARN---IGGRASLRVSSFH---SSPLQNPFPLFLTPSSLSSQNSIFAR-PVIWFTS 60

Query: 77  FLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVLFDSALAPIWVSKILVE 136
           FLC+ R PFVS      SF+ +   S+R  +QQD    P    L DSALAPIWVSK LV+
Sbjct: 61  FLCVIRYPFVS----KPSFDDIASESMRSFLQQD---GPH---LSDSALAPIWVSKALVK 120

Query: 137 LKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLI 196
           LK DPK ALKFFK AG   GFRH  ESYC++ H+LF    Y +A  ++KE +L  R    
Sbjct: 121 LKGDPKSALKFFKEAGARAGFRHAAESYCVLAHILFCGMFYLDARSVIKEWILLGRE--- 180

Query: 197 LPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCN 256
            P C+ FD+LWSTRN C  G GVFD LF+VLV+LG+LEEA +CF KM KFR LPK RSCN
Sbjct: 181 FPGCDFFDMLWSTRNVCRPGFGVFDTLFNVLVDLGMLEEARQCFWKMNKFRVLPKVRSCN 240

Query: 257 FLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQMRTM 316
            LLHRLSKS  G L   FF DM+ AG++PSVFTYN++I  L +EGDLE ARSLF +M+  
Sbjct: 241 ELLHRLSKSSKGGLALSFFKDMVVAGLSPSVFTYNMVIGCLAREGDLEAARSLFEEMKAK 300

Query: 317 GFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQA 376
           G  PD+VTYNSLIDGYGKVG+L  +V +F EMKD GC PDVITYN+LINCFCKFE++PQA
Sbjct: 301 GLRPDIVTYNSLIDGYGKVGMLTGAVSVFEEMKDAGCEPDVITYNSLINCFCKFERIPQA 360

Query: 377 FEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDA 436
           FEYL  MK  GL+PNVVTYSTLIDAFCK GM+  A K FVDM RVGL PNEFTYTSLIDA
Sbjct: 361 FEYLHGMKQRGLQPNVVTYSTLIDAFCKAGMLLEANKFFVDMIRVGLQPNEFTYTSLIDA 420

Query: 437 NCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPN 496
           NCK G+L EA+KL ++M QAGVNLNIVTYTAL+DGLCEDGRM EAEE+F A+LK G + N
Sbjct: 421 NCKIGDLNEAFKLESEMQQAGVNLNIVTYTALLDGLCEDGRMREAEELFGALLKAGWTLN 480

Query: 497 QQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIK 556
           QQ+YT+L HGYIKA+ ME A++IL++M +  +KPDL+LYGT IWGLC QN++E++  +I+
Sbjct: 481 QQIYTSLFHGYIKAKMMEKAMDILEEMNKKNLKPDLLLYGTKIWGLCRQNEIEDSMAVIR 540

Query: 557 EMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTG 616
           EM   G+ AN  IYTT+IDAYFK GK+++A++LLQEMQ++G++ TVVTY VLIDGLCK G
Sbjct: 541 EMMDCGLTANSYIYTTLIDAYFKVGKTTEAVNLLQEMQDLGIKITVVTYGVLIDGLCKIG 600

Query: 617 MVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTAFT 676
           +V+ AV YF  M+  G+QPN+ +YTALIDGLCK +C+E A+ LF EM  +G++PDK  +T
Sbjct: 601 LVQQAVRYFDHMTRNGLQPNIMIYTALIDGLCKNDCLEEAKNLFNEMLDKGISPDKLVYT 660

Query: 677 ALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEK 736
           +LIDGN+K GN  E L+L ++M E+ +E DL AYT+L+ GFS+ G++  A+   +EM+ K
Sbjct: 661 SLIDGNMKHGNPGEALSLRNRMVEIGMELDLCAYTSLIWGFSRYGQVQLAKSLLDEMLRK 720

Query: 737 GILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHEVPSLKT 788
           GI+PD++LCICLL++Y +LG ++EA+ L ++M RRGLI+      VPS+ T
Sbjct: 721 GIIPDQVLCICLLRKYYELGDINEALALHDDMARRGLISGTIDITVPSVHT 751

BLAST of CmoCh04G022650 vs. TAIR10
Match: AT2G02150.1 (AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 894.4 bits (2310), Expect = 4.9e-260
Identity = 441/773 (57.05%), Postives = 569/773 (73.61%), Query Frame = 1

Query: 17  MLLFFRSLFHVSRRASYRVISLSLNSSH---PGCLSFNVFNGPSSLTSMNGYYISCPFFW 76
           M    R+  HV+RR    V   S + S    P C         SS +     +ISCPF W
Sbjct: 1   MFCSLRNFLHVNRRFPRHVSPSSSSLSQIQSPLCFPL------SSPSPSQSSFISCPFVW 60

Query: 77  FTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVLFDSALAPIWVSKI 136
           FTSFLCI R PFV+ S T+   E  D   +RK++  DLW+DP +  LFD  LAPIWV ++
Sbjct: 61  FTSFLCIIRYPFVTKSGTSTYSEDFDRDWIRKVVHNDLWDDPGLEKLFDLTLAPIWVPRV 120

Query: 137 LVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRT 196
           LVELKEDPKLA KFFKW+ T  GF+H+ ESYCI+ H+LF ARMY +A+ ++KEMVL S+ 
Sbjct: 121 LVELKEDPKLAFKFFKWSMTRNGFKHSVESYCIVAHILFCARMYYDANSVLKEMVL-SKA 180

Query: 197 DLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKAR 256
           D     C+VFD+LWSTRN CV G GVFD LFSVL++LG+LEEA +CFSKM++FR  PK R
Sbjct: 181 D-----CDVFDVLWSTRNVCVPGFGVFDALFSVLIDLGMLEEAIQCFSKMKRFRVFPKTR 240

Query: 257 SCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQM 316
           SCN LLHR +K G    V++FF DMIGAG  P+VFTYN+MID +CKEGD+E AR LF +M
Sbjct: 241 SCNGLLHRFAKLGKTDDVKRFFKDMIGAGARPTVFTYNIMIDCMCKEGDVEAARGLFEEM 300

Query: 317 RTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKM 376
           +  G  PD VTYNS+IDG+GKVG L ++V  F EMKD+ C PDVITYNALINCFCKF K+
Sbjct: 301 KFRGLVPDTVTYNSMIDGFGKVGRLDDTVCFFEEMKDMCCEPDVITYNALINCFCKFGKL 360

Query: 377 PQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSL 436
           P   E+  EMK  GLKPNVV+YSTL+DAFCKEGMMQ AIK +VDMRRVGL+PNE+TYTSL
Sbjct: 361 PIGLEFYREMKGNGLKPNVVSYSTLVDAFCKEGMMQQAIKFYVDMRRVGLVPNEYTYTSL 420

Query: 437 IDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGI 496
           IDANCK GNL++A++L N+MLQ GV  N+VTYTAL+DGLC+  RM EAEE+F  M   G+
Sbjct: 421 IDANCKIGNLSDAFRLGNEMLQVGVEWNVVTYTALIDGLCDAERMKEAEELFGKMDTAGV 480

Query: 497 SPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKL 556
            PN   Y AL+HG++KA+ M+ ALE+L ++   GIKPDL+LYGT IWGLC+  K+E  K+
Sbjct: 481 IPNLASYNALIHGFVKAKNMDRALELLNELKGRGIKPDLLLYGTFIWGLCSLEKIEAAKV 540

Query: 557 IIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLC 616
           ++ EMK  GI+AN +IYTT++DAYFK+G  ++ L LL EM+E+ +E TVVT+CVLIDGLC
Sbjct: 541 VMNEMKECGIKANSLIYTTLMDAYFKSGNPTEGLHLLDEMKELDIEVTVVTFCVLIDGLC 600

Query: 617 KTGMVEVAVDYFGRMS-DFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDK 676
           K  +V  AVDYF R+S DFG+Q N A++TA+IDGLCK N +E+A  LFE+M  +G+ PD+
Sbjct: 601 KNKLVSKAVDYFNRISNDFGLQANAAIFTAMIDGLCKDNQVEAATTLFEQMVQKGLVPDR 660

Query: 677 TAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNE 736
           TA+T+L+DGN K GN+ E L L  KM E+ ++ DL AYT+LV G S C +L +AR F  E
Sbjct: 661 TAYTSLMDGNFKQGNVLEALALRDKMAEIGMKLDLLAYTSLVWGLSHCNQLQKARSFLEE 720

Query: 737 MIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHEVPSL 786
           MI +GI PDE+LCI +LK++ +LG +DEA++L++ + +  L+T    + +P++
Sbjct: 721 MIGEGIHPDEVLCISVLKKHYELGCIDEAVELQSYLMKHQLLTSDNDNALPNM 761

BLAST of CmoCh04G022650 vs. TAIR10
Match: AT2G01740.1 (AT2G01740.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 384.0 bits (985), Expect = 2.2e-106
Identity = 206/562 (36.65%), Postives = 324/562 (57.65%), Query Frame = 1

Query: 232 LLEEANECFSKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYN 291
           ++ EA +  S++RK   LP   +CN  +H+L  S  G L  KF   ++  G  P   ++N
Sbjct: 1   MVREALQFLSRLRKSSNLPDPFTCNKHIHQLINSNCGILSLKFLAYLVSRGYTPHRSSFN 60

Query: 292 VMIDHLCKEGDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDV 351
            ++  +CK G ++ A  +   M   G  PDV++YNSLIDG+ + G ++ +  +   ++  
Sbjct: 61  SVVSFVCKLGQVKFAEDIVHSMPRFGCEPDVISYNSLIDGHCRNGDIRSASLVLESLRAS 120

Query: 352 G---CVPDVITYNALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMM 411
               C PD++++N+L N F K + + + F Y+  M      PNVVTYST ID FCK G +
Sbjct: 121 HGFICKPDIVSFNSLFNGFSKMKMLDEVFVYMGVMLKC-CSPNVVTYSTWIDTFCKSGEL 180

Query: 412 QGAIKLFVDMRRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTAL 471
           Q A+K F  M+R  L PN  T+T LID  CKAG+L  A  L  +M +  ++LN+VTYTAL
Sbjct: 181 QLALKSFHSMKRDALSPNVVTFTCLIDGYCKAGDLEVAVSLYKEMRRVRMSLNVVTYTAL 240

Query: 472 MDGLCEDGRMMEAEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGI 531
           +DG C+ G M  AEE++  M++D + PN  VYT ++ G+ +    ++A++ L +M   G+
Sbjct: 241 IDGFCKKGEMQRAEEMYSRMVEDRVEPNSLVYTTIIDGFFQRGDSDNAMKFLAKMLNQGM 300

Query: 532 KPDLVLYGTIIWGLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALD 591
           + D+  YG II GLC   KL+E   I+++M+   +  + VI+TT+++AYFK+G+   A++
Sbjct: 301 RLDITAYGVIISGLCGNGKLKEATEIVEDMEKSDLVPDMVIFTTMMNAYFKSGRMKAAVN 360

Query: 592 LLQEMQEVGVEATVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLC 651
           +  ++ E G E  VV    +IDG+ K G +  A+ YF        + N  +YT LID LC
Sbjct: 361 MYHKLIERGFEPDVVALSTMIDGIAKNGQLHEAIVYF-----CIEKANDVMYTVLIDALC 420

Query: 652 KINCIESAEKLFEEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLH 711
           K       E+LF ++   G+ PDK  +T+ I G  K GNL +   L ++M +  +  DL 
Sbjct: 421 KEGDFIEVERLFSKISEAGLVPDKFMYTSWIAGLCKQGNLVDAFKLKTRMVQEGLLLDLL 480

Query: 712 AYTTLVSGFSQCGELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEM 771
           AYTTL+ G +  G + +AR+ F+EM+  GI PD  +   L++ Y K G++  A  L  +M
Sbjct: 481 AYTTLIYGLASKGLMVEARQVFDEMLNSGISPDSAVFDLLIRAYEKEGNMAAASDLLLDM 540

Query: 772 QRRGLIT--------EKCSHEV 783
           QRRGL+T        ++C +EV
Sbjct: 541 QRRGLVTAVSDADCSKQCGNEV 556

BLAST of CmoCh04G022650 vs. TAIR10
Match: AT1G05670.1 (AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 368.6 bits (945), Expect = 9.4e-102
Identity = 200/616 (32.47%), Postives = 335/616 (54.38%), Query Frame = 1

Query: 128 IWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEM 187
           IWV   L+++K D +L L FF WA +        ES CI++H+   ++    A  ++   
Sbjct: 91  IWV---LMKIKCDYRLVLDFFDWARSRRD--SNLESLCIVIHLAVASKDLKVAQSLISSF 150

Query: 188 VLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFR 247
             + + ++       FD+L  T     S   VFDV F VLV+ GLL EA   F KM  + 
Sbjct: 151 WERPKLNVTDSFVQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKMLNYG 210

Query: 248 TLPKARSCNFLLHRLSKSGNGQLVRKF-FHDMIGAGIAPSVFTYNVMIDHLCKEGDLENA 307
            +    SCN  L RLSK           F +    G+  +V +YN++I  +C+ G ++ A
Sbjct: 211 LVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEA 270

Query: 308 RSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINC 367
             L + M   G++PDV++Y+++++GY + G L +   L   MK  G  P+   Y ++I  
Sbjct: 271 HHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGL 330

Query: 368 FCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPN 427
            C+  K+ +A E  SEM   G+ P+ V Y+TLID FCK G ++ A K F +M    + P+
Sbjct: 331 LCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPD 390

Query: 428 EFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFR 487
             TYT++I   C+ G++ EA KL ++M   G+  + VT+T L++G C+ G M +A  V  
Sbjct: 391 VLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHN 450

Query: 488 AMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQN 547
            M++ G SPN   YT L+ G  K   ++ A E+L +M + G++P++  Y +I+ GLC   
Sbjct: 451 HMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSG 510

Query: 548 KLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYC 607
            +EE   ++ E ++ G+ A+ V YTT++DAY K+G+   A ++L+EM   G++ T+VT+ 
Sbjct: 511 NIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFN 570

Query: 608 VLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCR 667
           VL++G C  GM+E        M   G+ PN   + +L+   C  N +++A  ++++M  R
Sbjct: 571 VLMNGFCLHGMLEDGEKLLNWMLAKGIAPNATTFNSLVKQYCIRNNLKAATAIYKDMCSR 630

Query: 668 GMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQA 727
           G+ PD   +  L+ G+ K  N++E   L  +M        +  Y+ L+ GF +  +  +A
Sbjct: 631 GVGPDGKTYENLVKGHCKARNMKEAWFLFQEMKGKGFSVSVSTYSVLIKGFLKRKKFLEA 690

Query: 728 RKFFNEMIEKGILPDE 743
           R+ F++M  +G+  D+
Sbjct: 691 REVFDQMRREGLAADK 701

BLAST of CmoCh04G022650 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 347.8 bits (891), Expect = 1.7e-95
Identity = 205/649 (31.59%), Postives = 330/649 (50.85%), Query Frame = 1

Query: 127 PIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKE 186
           P   S +L++ + D  L LKF  WA  H  F  T    CI +H+L + ++Y  A  + ++
Sbjct: 48  PEAASNLLLKSQNDQALILKFLNWANPHQFF--TLRCKCITLHILTKFKLYKTAQILAED 107

Query: 187 MVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKF 246
           +  K+  D    +  VF  L  T + C S + VFD++      L L+++A       +  
Sbjct: 108 VAAKTLDDEYASL--VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAH 167

Query: 247 RTLPKARSCNFLLHRLSKSG-NGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLEN 306
             +P   S N +L    +S  N       F +M+ + ++P+VFTYN++I   C  G+++ 
Sbjct: 168 GFMPGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDV 227

Query: 307 ARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALIN 366
           A +LF +M T G  P+VVTYN+LIDGY K+  + +   L   M   G  P++I+YN +IN
Sbjct: 228 ALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVIN 287

Query: 367 CFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLP 426
             C+  +M +    L+EM   G   + VTY+TLI  +CKEG    A+ +  +M R GL P
Sbjct: 288 GLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTP 347

Query: 427 NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVF 486
           +  TYTSLI + CKAGN+  A +  + M   G+  N  TYT L+DG  + G M EA  V 
Sbjct: 348 SVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVL 407

Query: 487 RAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQ 546
           R M  +G SP+   Y AL++G+    KMEDA+ +L+ M E G+ PD+V Y T++ G C  
Sbjct: 408 REMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRS 467

Query: 547 NKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTY 606
             ++E   + +EM  +GI+ + + Y+++I  + +  ++ +A DL +EM  VG+     TY
Sbjct: 468 YDVDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTY 527

Query: 607 CVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQC 666
             LI+  C  G +E A+     M + GV P+V  Y+ LI+GL K +    A++L  ++  
Sbjct: 528 TALINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFY 587

Query: 667 RGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQ 726
               P    +  LI                    E     +  +  +L+ GF   G + +
Sbjct: 588 EESVPSDVTYHTLI--------------------ENCSNIEFKSVVSLIKGFCMKGMMTE 647

Query: 727 ARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLI 775
           A + F  M+ K   PD      ++  + + G + +A  L  EM + G +
Sbjct: 648 ADQVFESMLGKNHKPDGTAYNIMIHGHCRAGDIRKAYTLYKEMVKSGFL 672

BLAST of CmoCh04G022650 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 341.7 bits (875), Expect = 1.2e-93
Identity = 202/633 (31.91%), Postives = 318/633 (50.24%), Query Frame = 1

Query: 142 KLALKFFKWAGTHIGFR--HTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTDLILPV 201
           KLALKF KW     G    H  +  CI  H+L RARMY  A  I+KE+ L S        
Sbjct: 91  KLALKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMSGKSSF--- 150

Query: 202 CNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARSCNFLL 261
             VF  L +T   C S   V+D+L  V +  G+++++ E F  M  +   P   +CN +L
Sbjct: 151 --VFGALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAIL 210

Query: 262 HRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQMRTMGFS 321
             + KSG    V  F  +M+   I P V T+N++I+ LC EG  E +  L  +M   G++
Sbjct: 211 GSVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYA 270

Query: 322 PDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMPQAFEY 381
           P +VTYN+++  Y K G  K ++ L + MK  G   DV TYN LI+  C+  ++ + +  
Sbjct: 271 PTIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLL 330

Query: 382 LSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLIDANCK 441
           L +M+   + PN VTY+TLI+ F  EG +  A +L  +M   GL PN  T+ +LID +  
Sbjct: 331 LRDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHIS 390

Query: 442 AGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGISPNQQV 501
            GN  EA K+   M   G+  + V+Y  L+DGLC++     A   +  M ++G+   +  
Sbjct: 391 EGNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRIT 450

Query: 502 YTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLIIKEMK 561
           YT ++ G  K   +++A+ +L +M++ GI PD+V Y  +I G C   + +  K I+  + 
Sbjct: 451 YTGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIY 510

Query: 562 SRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCKTGMVE 621
             G+  N +IY+T+I    + G   +A+ + + M   G      T+ VL+  LCK G V 
Sbjct: 511 RVGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVA 570

Query: 622 VAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTAFTALI 681
            A ++   M+  G+ PN   +  LI+G         A  +F+EM   G  P    + +L+
Sbjct: 571 EAEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLL 630

Query: 682 DGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMIEKGIL 741
            G  K G+L+E    +  +  +    D   Y TL++   + G L +A   F EM+++ IL
Sbjct: 631 KGLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSIL 690

Query: 742 PDEILCICLLKEYNKLGHLDEAIKLKNEMQRRG 773
           PD      L+    + G    AI    E + RG
Sbjct: 691 PDSYTYTSLISGLCRKGKTVIAILFAKEAEARG 718

BLAST of CmoCh04G022650 vs. NCBI nr
Match: gi|449463537|ref|XP_004149490.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucumis sativus])

HSP 1 Score: 1340.5 bits (3468), Expect = 0.0e+00
Identity = 660/784 (84.18%), Postives = 719/784 (91.71%), Query Frame = 1

Query: 1   MKLSVEVL-AFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSL 60
           MKLS E+L AFASL SAMLLFFR+LFHVSRRAS+RVISLS NSSHP  LSFNVFN  SSL
Sbjct: 2   MKLSAELLLAFASLLSAMLLFFRTLFHVSRRASFRVISLSSNSSHPDSLSFNVFNPSSSL 61

Query: 61  TSMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVV 120
           TS+N Y IS PFFWFTSFLCIFRLPFVSYS  N+SF+ LDIGSLRKIIQQDLWNDPKIVV
Sbjct: 62  TSINAYCISRPFFWFTSFLCIFRLPFVSYSNANNSFQYLDIGSLRKIIQQDLWNDPKIVV 121

Query: 121 LFDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTN 180
           LFDSALAPIWVSKIL+ L+EDPKLALKFFKWAG+ +GFRHTTESYCIIVH++FRARMYT+
Sbjct: 122 LFDSALAPIWVSKILLGLREDPKLALKFFKWAGSQVGFRHTTESYCIIVHLVFRARMYTD 181

Query: 181 AHDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANEC 240
           AHD +KE+++ SR D+  PVCN+FD+LWSTRN CVSG+GVFDVLFSV VELGLLEEANEC
Sbjct: 182 AHDTVKEVIMNSRMDMGFPVCNIFDMLWSTRNICVSGSGVFDVLFSVFVELGLLEEANEC 241

Query: 241 FSKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCK 300
           FS+MR FRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGIAPSVFTYNVMID+LCK
Sbjct: 242 FSRMRNFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLCK 301

Query: 301 EGDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVIT 360
           EGDLEN+R LFVQMR MG SPDVVTYNSLIDGYGKVG L+E   LFNEMKDVGCVPD+IT
Sbjct: 302 EGDLENSRRLFVQMREMGLSPDVVTYNSLIDGYGKVGSLEEVASLFNEMKDVGCVPDIIT 361

Query: 361 YNALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR 420
           YN LINC+CKFEKMP+AFEY SEMKN GLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR
Sbjct: 362 YNGLINCYCKFEKMPRAFEYFSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMR 421

Query: 421 RVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMM 480
           R GLLPNEFTYTSLIDANCKAGNLTEAWKL NDMLQAGV LNIVTYTAL+DGLC+ GRM+
Sbjct: 422 RTGLLPNEFTYTSLIDANCKAGNLTEAWKLLNDMLQAGVKLNIVTYTALLDGLCKAGRMI 481

Query: 481 EAEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTII 540
           EAEEVFR+MLKDGISPNQQVYTALVHGYIKAE+MEDA++ILKQMTEC IKPDL+LYG+II
Sbjct: 482 EAEEVFRSMLKDGISPNQQVYTALVHGYIKAERMEDAMKILKQMTECNIKPDLILYGSII 541

Query: 541 WGLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVE 600
           WG C+Q KLEETKLI++EMKSRGI ANPVI TTIIDAYFKAGKSSDAL+  QEMQ+VGVE
Sbjct: 542 WGHCSQRKLEETKLILEEMKSRGISANPVISTTIIDAYFKAGKSSDALNFFQEMQDVGVE 601

Query: 601 ATVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKL 660
           AT+VTYCVLIDGLCK G+VE+AVDYF RM   G+QPNVAVYT+LIDGLCK NCIESA+KL
Sbjct: 602 ATIVTYCVLIDGLCKAGIVELAVDYFCRMLSLGLQPNVAVYTSLIDGLCKNNCIESAKKL 661

Query: 661 FEEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQ 720
           F+EMQCRGMTPD TAFTALIDGNLK GNLQE L LIS+MTEL IEFDLH YT+LVSGFSQ
Sbjct: 662 FDEMQCRGMTPDITAFTALIDGNLKHGNLQEALVLISRMTELAIEFDLHVYTSLVSGFSQ 721

Query: 721 CGELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCS 780
           CGELHQARKFFNEMIEKGILP+E+LCICLL+EY K G LDEAI+LKNEM+R GLITE  +
Sbjct: 722 CGELHQARKFFNEMIEKGILPEEVLCICLLREYYKRGQLDEAIELKNEMERMGLITESAT 781

Query: 781 HEVP 784
            + P
Sbjct: 782 MQFP 785

BLAST of CmoCh04G022650 vs. NCBI nr
Match: gi|659072656|ref|XP_008466646.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucumis melo])

HSP 1 Score: 1335.9 bits (3456), Expect = 0.0e+00
Identity = 657/785 (83.69%), Postives = 719/785 (91.59%), Query Frame = 1

Query: 1   MKLSVEVL--AFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSS 60
           MKLSVE+L  AF SL SAMLLFFR+LFHVSRRAS+RVISLS NSSHP  LSFNVFN  SS
Sbjct: 2   MKLSVELLLLAFPSLLSAMLLFFRTLFHVSRRASFRVISLSSNSSHPDSLSFNVFNPSSS 61

Query: 61  LTSMNGYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIV 120
           LTS+N Y IS PFFWFTSFLCIFRLPFVSYS  N+S E LDIGSLRKIIQQDLWNDPKIV
Sbjct: 62  LTSINAYRISRPFFWFTSFLCIFRLPFVSYSNANNSIEFLDIGSLRKIIQQDLWNDPKIV 121

Query: 121 VLFDSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYT 180
           VLFDSALAPIWVS+ILV LKEDPKLALKFFKWAG+ +GFRHTTESYCIIVH++FRARMYT
Sbjct: 122 VLFDSALAPIWVSRILVGLKEDPKLALKFFKWAGSQVGFRHTTESYCIIVHLVFRARMYT 181

Query: 181 NAHDIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANE 240
           +AHD +KE+++K+R D+  PVCN+FD+LWSTRN CVSG+GVFDVLFSV VELGLLEEANE
Sbjct: 182 DAHDTVKEVIMKNRIDMGFPVCNIFDMLWSTRNICVSGSGVFDVLFSVFVELGLLEEANE 241

Query: 241 CFSKMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLC 300
           CFS+MR FRTLPKARSCNFLLHRLSKSGNGQLVRKFF+DMIGAGIAPSVFTYNVMID+LC
Sbjct: 242 CFSRMRNFRTLPKARSCNFLLHRLSKSGNGQLVRKFFNDMIGAGIAPSVFTYNVMIDYLC 301

Query: 301 KEGDLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVI 360
           KEGDLENAR LFVQMR MG SPDVVTYNSLIDGYGKVG L+E+V  FNEMKDVGCVPD+I
Sbjct: 302 KEGDLENARRLFVQMREMGLSPDVVTYNSLIDGYGKVGSLEEAVSFFNEMKDVGCVPDII 361

Query: 361 TYNALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDM 420
           TYN LINC+CKFEKMP+AFEY SEMKN GLKPNVVTYSTLIDAFCKEGMMQGA+KLFVDM
Sbjct: 362 TYNGLINCYCKFEKMPRAFEYFSEMKNNGLKPNVVTYSTLIDAFCKEGMMQGAVKLFVDM 421

Query: 421 RRVGLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRM 480
           +R GLLPNEFTYTSLIDANCKAGNLTEAWKL NDMLQAGV LNIVTYTAL+DGLCEDGRM
Sbjct: 422 KRAGLLPNEFTYTSLIDANCKAGNLTEAWKLLNDMLQAGVKLNIVTYTALVDGLCEDGRM 481

Query: 481 MEAEEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTI 540
           +EAEEVFR+MLKDGISPNQQVYTALVHGYIKAE+MEDA++ILKQM EC IKPDL+LYG++
Sbjct: 482 IEAEEVFRSMLKDGISPNQQVYTALVHGYIKAERMEDAMKILKQMKECNIKPDLILYGSV 541

Query: 541 IWGLCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGV 600
           IWGLC+Q+KLEETKLI+KEMKSRGI ANPVIYTTIIDAYFKAGKSSDA++L QEMQ+VGV
Sbjct: 542 IWGLCSQSKLEETKLILKEMKSRGISANPVIYTTIIDAYFKAGKSSDAINLFQEMQDVGV 601

Query: 601 EATVVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEK 660
           EATVVTYCVLIDGLCK G+VE+AVDYF RM   G+QPNVAVYT+LIDGL K NCI+SA K
Sbjct: 602 EATVVTYCVLIDGLCKAGIVELAVDYFCRMFSLGLQPNVAVYTSLIDGLSKTNCIKSANK 661

Query: 661 LFEEMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFS 720
           LF+EMQCRGMTPD TAFTALIDGNLK GNLQE L  IS+MTEL IEFDLH YT+LV+GFS
Sbjct: 662 LFDEMQCRGMTPDITAFTALIDGNLKHGNLQEALVFISRMTELAIEFDLHFYTSLVAGFS 721

Query: 721 QCGELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKC 780
           +CGEL QARKFFNEMI+KGILP+E+LCICLL+EY K G LDEAI+LKNEMQ  GLITE  
Sbjct: 722 KCGELRQARKFFNEMIKKGILPEEVLCICLLREYCKRGQLDEAIELKNEMQGMGLITESA 781

Query: 781 SHEVP 784
           + + P
Sbjct: 782 AMQFP 786

BLAST of CmoCh04G022650 vs. NCBI nr
Match: gi|657995268|ref|XP_008389961.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Malus domestica])

HSP 1 Score: 1010.7 bits (2612), Expect = 1.3e-291
Identity = 503/782 (64.32%), Postives = 615/782 (78.64%), Query Frame = 1

Query: 10  FASLFSAMLLFFRSLFHVSRRASYRVIS-LSLNSSHPGCLSFNVFNGPSSLTSMNGY--- 69
           F S FS MLLF R+LF    RAS    S +S  SS P   S   F   SSLTS + +   
Sbjct: 18  FISFFSEMLLFLRNLFRTGCRASSSASSRVSXLSSIPQYPSNCRFINLSSLTSSSSHATS 77

Query: 70  YISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVLFDSAL 129
            I+CPF WFT FLCIFR PFV+ S  +   E L+  SL +I+Q D W+DP+IV LFDSAL
Sbjct: 78  LIACPFVWFTGFLCIFRFPFVTKSQPSSFPESLNTDSLSRIVQHDYWDDPRIVNLFDSAL 137

Query: 130 APIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMK 189
           APIWVS+ LVELK DPKLALK FKWA T IGFRHTTESYCI+VH+LF ARMY +AH++++
Sbjct: 138 APIWVSRFLVELKGDPKLALKLFKWAKTQIGFRHTTESYCILVHILFFARMYVDAHEVLR 197

Query: 190 EMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRK 249
           E+VL SR    LP C+VFD+LW TRN C  G GVFD LF VLVE+G+LEEA+ECF +M+K
Sbjct: 198 ELVLLSRA---LPGCDVFDVLWWTRNVCRVGFGVFDALFGVLVEVGMLEEASECFLRMKK 257

Query: 250 FRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLEN 309
           FR LPK RSCN LLHRLSK G G L RKFF DM+GAGI PSVFTYN+MI ++CKEGDL+ 
Sbjct: 258 FRVLPKVRSCNALLHRLSKPGKGNLSRKFFKDMLGAGINPSVFTYNIMIGYMCKEGDLDT 317

Query: 310 ARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALIN 369
           A  LF QM+ MG +PDVVTYNSLIDGYGKVGLL +SV +F EMKD  C PD IT+N+LIN
Sbjct: 318 ASCLFAQMKRMGLTPDVVTYNSLIDGYGKVGLLDDSVCIFEEMKDADCEPDTITFNSLIN 377

Query: 370 CFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLP 429
           C CKF++MPQA  +L EM N GLKPNV+TYSTLIDAFCKEGMMQ A+K+F+DM+RVGLLP
Sbjct: 378 CCCKFDRMPQALNFLREMNNNGLKPNVITYSTLIDAFCKEGMMQEAVKIFMDMKRVGLLP 437

Query: 430 NEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVF 489
           NEFTYTSLIDANCK GNL+EA KL ++MLQAG++ NIVTYTAL+DGLCEDGRM EAEEVF
Sbjct: 438 NEFTYTSLIDANCKXGNLSEALKLKSEMLQAGISWNIVTYTALLDGLCEDGRMDEAEEVF 497

Query: 490 RAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQ 549
           R + K GI PNQQ+ TAL+HGYIKA+K+E+A+EI  ++   G KPDL+LYGTIIWGLC+Q
Sbjct: 498 REVQKSGIIPNQQICTALLHGYIKAKKIENAMEIWNEIKGKGFKPDLLLYGTIIWGLCSQ 557

Query: 550 NKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTY 609
           NKLEE++L++KEM   G+ AN  IYTT++DAY+KAGK+  AL+L+QEM++ G E TVVTY
Sbjct: 558 NKLEESELVLKEMXGYGLTANHFIYTTLMDAYYKAGKTEAALNLVQEMRDNGXELTVVTY 617

Query: 610 CVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQC 669
           C LIDGLCK G+ + A  +F  M D G+QPNVAV+TALIDGLCK NCIE+A++LF EM  
Sbjct: 618 CALIDGLCKKGLFQEATSHFRTMPDLGLQPNVAVFTALIDGLCKNNCIEAAKELFXEMXD 677

Query: 670 RGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQ 729
           +G+ PDK A+T L+DGNLK GNL+E L++ ++M E+ +E DL+AYT+L+ G S+ G++ Q
Sbjct: 678 KGLIPDKAAYTTLMDGNLKHGNLEEALSIQNRMREIGMELDLYAYTSLIWGLSEFGQVKQ 737

Query: 730 ARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHEVPSL 788
           A+   +EMI KGILPDEILCI LL++Y KLG+LDEAI+L+ EM  RGLI+  C H +P+ 
Sbjct: 738 AKMLLDEMIGKGILPDEILCISLLRKYYKLGNLDEAIELQIEMVNRGLISGTCDHVIPNA 796

BLAST of CmoCh04G022650 vs. NCBI nr
Match: gi|590697037|ref|XP_007045328.1| (Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 1005.7 bits (2599), Expect = 4.3e-290
Identity = 496/785 (63.18%), Postives = 615/785 (78.34%), Query Frame = 1

Query: 7   VLAFASLFSAMLLFFRSLFHVSRRASYRVISLSLNSSHPGCLSFNVFNGPSSLTSM---N 66
           + A  S F+ ML+  RSLFH++RR     I + +  SHP    F +F     L      N
Sbjct: 4   ISAALSFFTKMLVSLRSLFHINRR-----IPVCVRVSHP----FPLFQNSRPLNFFPPSN 63

Query: 67  GYYISCPFFWFTSFLCIFRLPFVSYSITNDSFELLDIG--SLRKIIQQDLWNDPKIVVLF 126
              I CPF   TSF  + + PF +   +N    L D    S+ KIIQQD WNDPKIV LF
Sbjct: 64  NSIIVCPFILLTSFFYMMKFPFGTKCNSNTHIFLDDFNRESICKIIQQDQWNDPKIVTLF 123

Query: 127 DSALAPIWVSKILVELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAH 186
           DS+LAPIWVSKILV LK++PKLALKFFKWA TH GF HT+ESYCI+VH+LF  RMY++A 
Sbjct: 124 DSSLAPIWVSKILVGLKQEPKLALKFFKWAKTHKGFGHTSESYCILVHILFYGRMYSDAS 183

Query: 187 DIMKEMVLKSRTDLILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFS 246
            I+KE +L  R  ++LP C+ FD+LWSTRN C  G GVFD LFSVLV+LG+LEEA++CFS
Sbjct: 184 AILKEFILL-RQRVVLPGCDFFDVLWSTRNVCRYGFGVFDALFSVLVDLGMLEEASQCFS 243

Query: 247 KMRKFRTLPKARSCNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEG 306
           KM+++R LPK RSCN LLHRLSK+G     R+FF +MIG G+APSVFTYN++ID++CKEG
Sbjct: 244 KMKRYRVLPKVRSCNALLHRLSKTGRRDQSRRFFAEMIGVGVAPSVFTYNILIDYMCKEG 303

Query: 307 DLENARSLFVQMRTMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYN 366
           +L+ AR LF QM+ +G +PD+VTYNSLIDGYGKVGLL E ++LF EMK V C PD+ITYN
Sbjct: 304 ELDTARMLFGQMKQIGLTPDIVTYNSLIDGYGKVGLLDEVIFLFEEMKSVECAPDIITYN 363

Query: 367 ALINCFCKFEKMPQAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRV 426
           ALINCFCKF++MPQAFE+  EM+N GLKPNVVTYSTLIDAFCKEGMMQ  IK  VDMRRV
Sbjct: 364 ALINCFCKFQRMPQAFEFFREMRNKGLKPNVVTYSTLIDAFCKEGMMQQGIKFLVDMRRV 423

Query: 427 GLLPNEFTYTSLIDANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEA 486
           GLLPN FTYTSLIDA CKAG+LTEA KL+N+MLQ  V+LNIVTYT ++DGLCE GR  EA
Sbjct: 424 GLLPNVFTYTSLIDATCKAGSLTEALKLANEMLQENVDLNIVTYTTIIDGLCEAGRTKEA 483

Query: 487 EEVFRAMLKDGISPNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWG 546
           EE+FRAMLK  + PN  +YTAL HGY+K +KME AL +LK+M E  IKPDL+LYGTIIWG
Sbjct: 484 EEIFRAMLKAALKPNVHIYTALAHGYMKVKKMEHALNLLKEMKEKSIKPDLLLYGTIIWG 543

Query: 547 LCNQNKLEETKLIIKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEAT 606
           LCNQ+K+EETK+++ EMK   + +NPVIYTT++D+YFKAGK+++AL+LL+EM ++G+E T
Sbjct: 544 LCNQDKIEETKVVMSEMKESRLSSNPVIYTTVMDSYFKAGKTAEALNLLEEMSDLGIEVT 603

Query: 607 VVTYCVLIDGLCKTGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFE 666
           VVT+CVL+DGLCKTG+V  A++YF RMS+F +QPNVA YT LIDGLCK N I++A+ +F+
Sbjct: 604 VVTFCVLVDGLCKTGLVLEAINYFNRMSEFNLQPNVAAYTVLIDGLCKNNFIQAAKNMFD 663

Query: 667 EMQCRGMTPDKTAFTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCG 726
           EM  + + PDKTA+TALIDGNLK GN QE LNL ++M E+ IE DL AYT+LV GF QCG
Sbjct: 664 EMLSKNLVPDKTAYTALIDGNLKHGNFQEALNLQNEMIEMGIELDLPAYTSLVWGFCQCG 723

Query: 727 ELHQARKFFNEMIEKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHE 786
           +L QARKF +EMI K ILPDEILCI +L++Y +LGH+DEAI+L+NEM +RGLIT    + 
Sbjct: 724 QLQQARKFLDEMIRKHILPDEILCIGVLRKYYELGHVDEAIELQNEMAKRGLITSPIHYA 778

BLAST of CmoCh04G022650 vs. NCBI nr
Match: gi|645229248|ref|XP_008221377.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Prunus mume])

HSP 1 Score: 1001.5 bits (2588), Expect = 8.2e-289
Identity = 496/773 (64.17%), Postives = 607/773 (78.53%), Query Frame = 1

Query: 17  MLLFFRSLFHVSRRASY-RVISLSLNSSHPG-CLSFNVFNGPSSLTSMNGYYISCPFFWF 76
           ML+F R+L  +  RAS+ RV  LS    H   CL  NV +   SL+S +G  I+CP  WF
Sbjct: 1   MLIFLRNLLQMGCRASFHRVSPLSSIPQHSSNCLFINVSS--LSLSSSHGSLIACPLVWF 60

Query: 77  TSFLCIFRLPFVSYSITNDSFELLDIGSLRKIIQQDLWNDPKIVVLFDSALAPIWVSKIL 136
           TSFLCI R PFV+ S  N   + L+  SLR IIQ D W+DP+IV LF SALAPIW SK L
Sbjct: 61  TSFLCITRFPFVTKSNPNSFRDNLNTESLRIIIQHDYWDDPRIVNLFGSALAPIWASKFL 120

Query: 137 VELKEDPKLALKFFKWAGTHIGFRHTTESYCIIVHMLFRARMYTNAHDIMKEMVLKSRTD 196
           VEL+ DPKLALK F+W+ T IGF HTTESYCI+VH+LF ARMY +AH+I+KE+V   R  
Sbjct: 121 VELRGDPKLALKLFRWSKTRIGFCHTTESYCILVHILFYARMYFDAHEILKELVSLRRVS 180

Query: 197 LILPVCNVFDILWSTRNFCVSGTGVFDVLFSVLVELGLLEEANECFSKMRKFRTLPKARS 256
           L    C+VFD+LWSTRN C  G GVFD LFSVLVE G+LE+A+ECF +M+KFR LPK RS
Sbjct: 181 L---GCDVFDVLWSTRNVCRLGFGVFDALFSVLVEFGMLEKASECFLRMKKFRVLPKVRS 240

Query: 257 CNFLLHRLSKSGNGQLVRKFFHDMIGAGIAPSVFTYNVMIDHLCKEGDLENARSLFVQMR 316
           CN LL RLSKSG G   RKFF DM+GAGI PSVFTYN+MI +LCKEGDL+ A  LF QM+
Sbjct: 241 CNALLQRLSKSGKGNFSRKFFKDMLGAGITPSVFTYNIMIGYLCKEGDLDTASCLFAQMK 300

Query: 317 TMGFSPDVVTYNSLIDGYGKVGLLKESVYLFNEMKDVGCVPDVITYNALINCFCKFEKMP 376
            MG +PD+VTYNSLIDGYGKVG+L  S  +F EMKD GC PDVIT+N+LINC CKF+KMP
Sbjct: 301 RMGLTPDIVTYNSLIDGYGKVGILDNSFCIFEEMKDAGCEPDVITFNSLINCCCKFDKMP 360

Query: 377 QAFEYLSEMKNIGLKPNVVTYSTLIDAFCKEGMMQGAIKLFVDMRRVGLLPNEFTYTSLI 436
           +A  +L EM N GLKPNV+TYSTLIDAFCKEGMMQ A+K+F+DM+RVGL PNEFTYTSLI
Sbjct: 361 EALNFLREMNNKGLKPNVITYSTLIDAFCKEGMMQEAVKIFMDMKRVGLSPNEFTYTSLI 420

Query: 437 DANCKAGNLTEAWKLSNDMLQAGVNLNIVTYTALMDGLCEDGRMMEAEEVFRAMLKDGIS 496
           DANCKAGNL+EA KL  +M Q G++LNIVTYTAL+DGLC+DGRM +AEEVFR +L+ GIS
Sbjct: 421 DANCKAGNLSEALKLKKEMFQEGISLNIVTYTALLDGLCQDGRMEDAEEVFREVLETGIS 480

Query: 497 PNQQVYTALVHGYIKAEKMEDALEILKQMTECGIKPDLVLYGTIIWGLCNQNKLEETKLI 556
           PNQQ+ TALVHGYIKA++ME+A+EI K++   G KPDL+LYGTIIWGLC+QNKLEE++L+
Sbjct: 481 PNQQICTALVHGYIKAKRMENAMEIWKEIKGKGFKPDLLLYGTIIWGLCSQNKLEESELV 540

Query: 557 IKEMKSRGIRANPVIYTTIIDAYFKAGKSSDALDLLQEMQEVGVEATVVTYCVLIDGLCK 616
             EMK  G   N  IYTT++DAYFKAGK+ +AL+LLQEM + G+E TVVTYC LIDGLCK
Sbjct: 541 FSEMKGCGSTPNHFIYTTLMDAYFKAGKTKEALNLLQEMLDNGIEFTVVTYCALIDGLCK 600

Query: 617 TGMVEVAVDYFGRMSDFGVQPNVAVYTALIDGLCKINCIESAEKLFEEMQCRGMTPDKTA 676
            G+++ A++YF RM D G++PNVAV+TALIDG CK NCIE+A++LF EM  +GM PDK A
Sbjct: 601 KGLLQEAINYFRRMPDIGLEPNVAVFTALIDGHCKNNCIEAAKELFNEMLDKGMIPDKAA 660

Query: 677 FTALIDGNLKLGNLQETLNLISKMTELVIEFDLHAYTTLVSGFSQCGELHQARKFFNEMI 736
           ++ LIDGNLK GNLQE L++  +M E+ +E DL+AYT+L+ G S  G++ QA+   +EMI
Sbjct: 661 YSTLIDGNLKHGNLQEALSVEKRMREMGMELDLYAYTSLIWGLSHFGQVQQAKILLDEMI 720

Query: 737 EKGILPDEILCICLLKEYNKLGHLDEAIKLKNEMQRRGLITEKCSHEVPSLKT 788
            KGILPDEILCICLLK+Y +LG+LDEA +L+ EM  +GLIT  C + VP+ +T
Sbjct: 721 GKGILPDEILCICLLKKYYELGYLDEAFELQTEMVNKGLITGTCDYAVPNART 768

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP143_ARATH8.8e-25957.05Putative pentatricopeptide repeat-containing protein At2g02150 OS=Arabidopsis th... [more]
PP141_ARATH3.9e-10536.65Pentatricopeptide repeat-containing protein At2g01740 OS=Arabidopsis thaliana GN... [more]
PPR12_ARATH1.7e-10032.47Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
PP407_ARATH3.1e-9431.59Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN... [more]
PP432_ARATH2.2e-9231.91Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A061E9Z5_THECC3.0e-29063.18Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cac... [more]
W9SE38_9ROSA2.0e-28663.11Uncharacterized protein OS=Morus notabilis GN=L484_000446 PE=4 SV=1[more]
W9S012_9ROSA3.8e-28562.98Uncharacterized protein OS=Morus notabilis GN=L484_000854 PE=4 SV=1[more]
B9RY36_RICCO9.7e-27364.12Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
K7L5N5_SOYBN1.5e-25759.27Uncharacterized protein OS=Glycine max GN=GLYMA_08G090700 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT2G02150.14.9e-26057.05 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G01740.12.2e-10636.65 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G05670.19.4e-10232.47 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT5G39710.11.7e-9531.59 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G55840.11.2e-9331.91 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449463537|ref|XP_004149490.1|0.0e+0084.18PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucum... [more]
gi|659072656|ref|XP_008466646.1|0.0e+0083.69PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Cucum... [more]
gi|657995268|ref|XP_008389961.1|1.3e-29164.32PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Malus... [more]
gi|590697037|ref|XP_007045328.1|4.3e-29063.18Pentatricopeptide repeat-containing protein, putative isoform 1 [Theobroma cacao... [more]
gi|645229248|ref|XP_008221377.1|8.2e-28964.17PREDICTED: putative pentatricopeptide repeat-containing protein At2g02150 [Prunu... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G022650.1CmoCh04G022650.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 221..246
score: 0.099coord: 533..563
score: 0.0026coord: 748..773
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 254..299
score: 2.0E-10coord: 706..742
score: 5.9E-8coord: 390..439
score: 5.1E-19coord: 635..681
score: 1.6E-13coord: 566..614
score: 6.3E-16coord: 461..503
score: 1.9E-11coord: 320..369
score: 2.1
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 358..392
score: 1.1E-5coord: 288..322
score: 2.7E-10coord: 499..531
score: 2.6E-8coord: 534..566
score: 0.0013coord: 709..742
score: 2.2E-8coord: 254..286
score: 0.0021coord: 428..461
score: 1.6E-5coord: 323..357
score: 6.0E-11coord: 463..496
score: 3.6E-10coord: 639..671
score: 3.8E-9coord: 568..601
score: 4.0E-8coord: 603..637
score: 1.2E-9coord: 393..427
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 741..775
score: 9.339coord: 566..600
score: 11.783coord: 356..390
score: 12.562coord: 496..530
score: 12.211coord: 601..635
score: 12.047coord: 426..460
score: 11.323coord: 636..670
score: 12.408coord: 160..190
score: 6.697coord: 391..425
score: 13.23coord: 461..495
score: 13.713coord: 321..355
score: 13.614coord: 251..285
score: 8.714coord: 706..740
score: 12.781coord: 216..250
score: 7.892coord: 286..320
score: 13.329coord: 671..705
score: 7.465coord: 531..565
score: 10
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 563..663
score: 4.5E-10coord: 358..526
score: 4.5
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 203..782
score: 1.4E
NoneNo IPR availablePANTHERPTHR24015:SF329SUBFAMILY NOT NAMEDcoord: 203..782
score: 1.4E
NoneNo IPR availableunknownSSF81901HCP-likecoord: 281..388
score: 7.32E-6coord: 425..536
score: 7.32E-6coord: 670..763
score: 3.66E-5coord: 499..598
score: 3.6

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
CmoCh04G022650CmoCh15G009000Cucurbita moschata (Rifu)cmocmoB262
The following block(s) are covering this gene:
GeneOrganismBlock
CmoCh04G022650Cucumber (Chinese Long) v3cmocucB0850
CmoCh04G022650Watermelon (97103) v2cmowmbB676
CmoCh04G022650Wax gourdcmowgoB0823
CmoCh04G022650Wild cucumber (PI 183967)cmocpiB727
CmoCh04G022650Cucumber (Chinese Long) v2cmocuB719
CmoCh04G022650Melon (DHL92) v3.5.1cmomeB625
CmoCh04G022650Watermelon (Charleston Gray)cmowcgB636
CmoCh04G022650Watermelon (97103) v1cmowmB706
CmoCh04G022650Bottle gourd (USVL1VR-Ls)cmolsiB610