CmoCh04G021980 (gene) Cucurbita moschata (Rifu)

NameCmoCh04G021980
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCmo_Chr04 : 16324477 .. 16326129 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTTTTCGTCAATTTCTCAGTTCGCTTCGTGGATTATCTCGAAGAACAAGTTCTTTCCACTCGCTACCATCATCTTCTTCTTACACTGGTTCATTCAGATCAGTTTCGCGTTCACCAGCAATTACCAGAAACCAAACCCCAATCTGTACTTCGTCCCGTTGTAATGATCATTCGGCTGGGGTTATTCCGAATCGCAGTTACGCCTCTCACCATTTCAGCGATCATGGTACAGAGCACAGCAAACAGGATTCGGACGCCGACGAAATTTCAATCATGGCAAGTGCTGAGATTGCCCAGGATGCTGAAAAAATCTGTAAGTTGCTTACGAAAAGCCCTAGTTCTTGCATTGAATCATTGCTTGATGGTGCTTCAATCGAGGTGTCGCCGGCTCTGGTTGTCGAGGTGCTGAAGAAGATGAGCAATGCGGGACTTCTTGCGCTGTCGTTTTTCAGGTGGGCCGAGAAGCAGAAAGGCTTCAAACACACAACGGAGAGCTACAACTCGTTAATCGAATCCCTCGGTAAGATCAAACAGTTCAATGTGATTTGGAATTTGGTGAATGATATGAAACGAAAAGGGATTTTAAGTAGGGAAACATTTGCTTTAATTTCTCGGAGATATGCTCGAGCTAGAAAGGTTAAAGAAGCAATCGAGGCATTTGAGAAGATGGAGAAGTTTGGATTCCAACTGGGAATATCAGATTTCAACAGACTAATCGACACCCTGAGCAAATCGAGAAACGTTGGGCATGCACAAGAGGTGTTTGATAAAATGAAGCACAGAAGATTCAAGCCTGATATCAAGTCTTACACAATTCTATTAGAAGGATGGGGTCAGGAGCAGAATTTGTTGAGGTTGAATGAGGTTTATAGGGAGATGAGAGACGATGGGTTCGAACCGGACGTCGTGACGTTCGGTATAGTTATCAATGCACATTGCAAGGCAAAGAAGTATGATGAAGCTATTCAGTTGTTTCACACAATGAAAGCTAAGAATGTCAAGCCATCACCTCATGTGTTCTGTACCTTAATCAATGGTTTGGGCTCTGAGAAAAGATTGAATGAGGCTCTAGATTTTTTCAAACAATCAAAGTCGAGTGGCTATGCTCCAGAGGCACCGACTTATAATGCCGTGGTGGGGGCTTACTGCTGGTCGATGAAGATGGCTGATGCATATAAGACGGTTAACGACATGAAAAAACTAGGCATCGGTCCAAATTCGAGGACTTATGACATCATATTACATCATTTGATAAAGGCTGGGAGATCAAAAGAAGCTTATTCTGTTTTCGAGAGAATGAGTAGGGAGCCAGGGTGTGAACCAGCTTTGAGTACATATGAAATCATGGTGAGAATGTTATGCAATAAGGAGCGAGTAGACATGGCGATTCGGATTTGGGATCAAATGAAGGCCAGAGGAGTTCTTCCGGGAATGCATATGTTTTCAACATTGATTAACAGCTTGTGCCACGAGAACAAGTTGGAGTGTGCCTGCAAATACTTTGAAGAGATGCTGGATTTGGGTATTCGGCCGCCAGCAACAATGTTTAGCAATCTGAAACAGGCTCTTCTTGATGAGGGCAGACAGGATACAGCTTTACTTCTGGTAGAGAAACTCGATAGACTAAGAAAGGCACCATTGCACGGTTGA

mRNA sequence

ATGGGTTTTCGTCAATTTCTCAGTTCGCTTCGTGGATTATCTCGAAGAACAAGTTCTTTCCACTCGCTACCATCATCTTCTTCTTACACTGGTTCATTCAGATCAGTTTCGCGTTCACCAGCAATTACCAGAAACCAAACCCCAATCTGTACTTCGTCCCGTTGTAATGATCATTCGGCTGGGGTTATTCCGAATCGCAGTTACGCCTCTCACCATTTCAGCGATCATGGTACAGAGCACAGCAAACAGGATTCGGACGCCGACGAAATTTCAATCATGGCAAGTGCTGAGATTGCCCAGGATGCTGAAAAAATCTGTAAGTTGCTTACGAAAAGCCCTAGTTCTTGCATTGAATCATTGCTTGATGGTGCTTCAATCGAGGTGTCGCCGGCTCTGGTTGTCGAGGTGCTGAAGAAGATGAGCAATGCGGGACTTCTTGCGCTGTCGTTTTTCAGGTGGGCCGAGAAGCAGAAAGGCTTCAAACACACAACGGAGAGCTACAACTCGTTAATCGAATCCCTCGGTAAGATCAAACAGTTCAATGTGATTTGGAATTTGGTGAATGATATGAAACGAAAAGGGATTTTAAGTAGGGAAACATTTGCTTTAATTTCTCGGAGATATGCTCGAGCTAGAAAGGTTAAAGAAGCAATCGAGGCATTTGAGAAGATGGAGAAGTTTGGATTCCAACTGGGAATATCAGATTTCAACAGACTAATCGACACCCTGAGCAAATCGAGAAACGTTGGGCATGCACAAGAGGTGTTTGATAAAATGAAGCACAGAAGATTCAAGCCTGATATCAAGTCTTACACAATTCTATTAGAAGGATGGGGTCAGGAGCAGAATTTGTTGAGGTTGAATGAGGTTTATAGGGAGATGAGAGACGATGGGTTCGAACCGGACGTCGTGACGTTCGGTATAGTTATCAATGCACATTGCAAGGCAAAGAAGTATGATGAAGCTATTCAGTTGTTTCACACAATGAAAGCTAAGAATGTCAAGCCATCACCTCATGTGTTCTGTACCTTAATCAATGGTTTGGGCTCTGAGAAAAGATTGAATGAGGCTCTAGATTTTTTCAAACAATCAAAGTCGAGTGGCTATGCTCCAGAGGCACCGACTTATAATGCCGTGGTGGGGGCTTACTGCTGGTCGATGAAGATGGCTGATGCATATAAGACGGTTAACGACATGAAAAAACTAGGCATCGGTCCAAATTCGAGGACTTATGACATCATATTACATCATTTGATAAAGGCTGGGAGATCAAAAGAAGCTTATTCTGTTTTCGAGAGAATGAGTAGGGAGCCAGGGTGTGAACCAGCTTTGAGTACATATGAAATCATGGTGAGAATGTTATGCAATAAGGAGCGAGTAGACATGGCGATTCGGATTTGGGATCAAATGAAGGCCAGAGGAGTTCTTCCGGGAATGCATATGTTTTCAACATTGATTAACAGCTTGTGCCACGAGAACAAGTTGGAGTGTGCCTGCAAATACTTTGAAGAGATGCTGGATTTGGGTATTCGGCCGCCAGCAACAATGTTTAGCAATCTGAAACAGGCTCTTCTTGATGAGGGCAGACAGGATACAGCTTTACTTCTGGTAGAGAAACTCGATAGACTAAGAAAGGCACCATTGCACGGTTGA

Coding sequence (CDS)

ATGGGTTTTCGTCAATTTCTCAGTTCGCTTCGTGGATTATCTCGAAGAACAAGTTCTTTCCACTCGCTACCATCATCTTCTTCTTACACTGGTTCATTCAGATCAGTTTCGCGTTCACCAGCAATTACCAGAAACCAAACCCCAATCTGTACTTCGTCCCGTTGTAATGATCATTCGGCTGGGGTTATTCCGAATCGCAGTTACGCCTCTCACCATTTCAGCGATCATGGTACAGAGCACAGCAAACAGGATTCGGACGCCGACGAAATTTCAATCATGGCAAGTGCTGAGATTGCCCAGGATGCTGAAAAAATCTGTAAGTTGCTTACGAAAAGCCCTAGTTCTTGCATTGAATCATTGCTTGATGGTGCTTCAATCGAGGTGTCGCCGGCTCTGGTTGTCGAGGTGCTGAAGAAGATGAGCAATGCGGGACTTCTTGCGCTGTCGTTTTTCAGGTGGGCCGAGAAGCAGAAAGGCTTCAAACACACAACGGAGAGCTACAACTCGTTAATCGAATCCCTCGGTAAGATCAAACAGTTCAATGTGATTTGGAATTTGGTGAATGATATGAAACGAAAAGGGATTTTAAGTAGGGAAACATTTGCTTTAATTTCTCGGAGATATGCTCGAGCTAGAAAGGTTAAAGAAGCAATCGAGGCATTTGAGAAGATGGAGAAGTTTGGATTCCAACTGGGAATATCAGATTTCAACAGACTAATCGACACCCTGAGCAAATCGAGAAACGTTGGGCATGCACAAGAGGTGTTTGATAAAATGAAGCACAGAAGATTCAAGCCTGATATCAAGTCTTACACAATTCTATTAGAAGGATGGGGTCAGGAGCAGAATTTGTTGAGGTTGAATGAGGTTTATAGGGAGATGAGAGACGATGGGTTCGAACCGGACGTCGTGACGTTCGGTATAGTTATCAATGCACATTGCAAGGCAAAGAAGTATGATGAAGCTATTCAGTTGTTTCACACAATGAAAGCTAAGAATGTCAAGCCATCACCTCATGTGTTCTGTACCTTAATCAATGGTTTGGGCTCTGAGAAAAGATTGAATGAGGCTCTAGATTTTTTCAAACAATCAAAGTCGAGTGGCTATGCTCCAGAGGCACCGACTTATAATGCCGTGGTGGGGGCTTACTGCTGGTCGATGAAGATGGCTGATGCATATAAGACGGTTAACGACATGAAAAAACTAGGCATCGGTCCAAATTCGAGGACTTATGACATCATATTACATCATTTGATAAAGGCTGGGAGATCAAAAGAAGCTTATTCTGTTTTCGAGAGAATGAGTAGGGAGCCAGGGTGTGAACCAGCTTTGAGTACATATGAAATCATGGTGAGAATGTTATGCAATAAGGAGCGAGTAGACATGGCGATTCGGATTTGGGATCAAATGAAGGCCAGAGGAGTTCTTCCGGGAATGCATATGTTTTCAACATTGATTAACAGCTTGTGCCACGAGAACAAGTTGGAGTGTGCCTGCAAATACTTTGAAGAGATGCTGGATTTGGGTATTCGGCCGCCAGCAACAATGTTTAGCAATCTGAAACAGGCTCTTCTTGATGAGGGCAGACAGGATACAGCTTTACTTCTGGTAGAGAAACTCGATAGACTAAGAAAGGCACCATTGCACGGTTGA
BLAST of CmoCh04G021980 vs. Swiss-Prot
Match: PP112_ARATH (Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidopsis thaliana GN=At1g71060 PE=2 SV=1)

HSP 1 Score: 678.7 bits (1750), Expect = 5.3e-194
Identity = 333/483 (68.94%), Postives = 404/483 (83.64%), Query Frame = 1

Query: 68  YASHHFSDHGTEHSKQDSDADEISIMASAEIAQDAEKICKLLTKSPSSCIESLLDGASIE 127
           Y S H S   T+ S  D+             +QDAE+ICK+LTK   S +E+LL+ AS++
Sbjct: 45  YGSFHASSVETQVSANDA-------------SQDAERICKILTKFTDSKVETLLNEASVK 104

Query: 128 VSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFNVIWNLV 187
           +SPAL+ EVLKK+SNAG+LALS F+WAE QKGFKHTT +YN+LIESLGKIKQF +IW+LV
Sbjct: 105 LSPALIEEVLKKLSNAGVLALSVFKWAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLV 164

Query: 188 NDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLIDTLSKSR 247
           +DMK K +LS+ETFALISRRYARARKVKEAI AF KME+FGF++  SDFNR++DTLSKSR
Sbjct: 165 DDMKAKKLLSKETFALISRRYARARKVKEAIGAFHKMEEFGFKMESSDFNRMLDTLSKSR 224

Query: 248 NVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEPDVVTFG 307
           NVG AQ+VFDKMK +RF+PDIKSYTILLEGWGQE NLLR++EV REM+D+GFEPDVV +G
Sbjct: 225 NVGDAQKVFDKMKKKRFEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDVVAYG 284

Query: 308 IVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFFKQSKSS 367
           I+INAHCKAKKY+EAI+ F+ M+ +N KPSPH+FC+LINGLGSEK+LN+AL+FF++SKSS
Sbjct: 285 IIINAHCKAKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSS 344

Query: 368 GYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKAGRSKEA 427
           G+  EAPTYNA+VGAYCWS +M DAYKTV++M+  G+GPN+RTYDIILHHLI+  RSKEA
Sbjct: 345 GFPLEAPTYNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEA 404

Query: 428 YSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHMFSTLIN 487
           Y V++ MS    CEP +STYEIMVRM CNKER+DMAI+IWD+MK +GVLPGMHMFS+LI 
Sbjct: 405 YEVYQTMS----CEPTVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLIT 464

Query: 488 SLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALLLVEKLDRLRKAP 547
           +LCHENKL+ AC+YF EMLD+GIRPP  MFS LKQ LLDEGR+D    LV K+DRLRK  
Sbjct: 465 ALCHENKLDEACEYFNEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKMDRLRKTQ 510

Query: 548 LHG 551
           L G
Sbjct: 525 LVG 510

BLAST of CmoCh04G021980 vs. Swiss-Prot
Match: PP129_ARATH (Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidopsis thaliana GN=At1g77360 PE=2 SV=2)

HSP 1 Score: 394.8 bits (1013), Expect = 1.5e-108
Identity = 190/452 (42.04%), Postives = 293/452 (64.82%), Query Frame = 1

Query: 97  EIAQDAEKICKLLTKSPSSCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEK 156
           ++A  A+ I K+L  SP   ++S LD + + VS  +V +VL +  NAGLL   FF+W+EK
Sbjct: 67  DVADVAKNISKVLMSSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEK 126

Query: 157 QKGFKHTTESYNSLIESLGKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKE 216
           Q+ ++H+  +Y+ +IES  KI+Q+ ++W+L+N M++K +L+ ETF ++ R+YARA+KV E
Sbjct: 127 QRHYEHSVRAYHMMIESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDE 186

Query: 217 AIEAFEKMEKFGFQLGISDFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLE 276
           AI AF  MEK+     +  FN L+  L KS+NV  AQEVF+ M+ R F PD K+Y+ILLE
Sbjct: 187 AIYAFNVMEKYDLPPNLVAFNGLLSALCKSKNVRKAQEVFENMRDR-FTPDSKTYSILLE 246

Query: 277 GWGQEQNLLRLNEVYREMRDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKP 336
           GWG+E NL +  EV+REM D G  PD+VT+ I+++  CKA + DEA+ +  +M     KP
Sbjct: 247 GWGKEPNLPKAREVFREMIDAGCHPDIVTYSIMVDILCKAGRVDEALGIVRSMDPSICKP 306

Query: 337 SPHVFCTLINGLGSEKRLNEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTV 396
           +  ++  L++  G+E RL EA+D F + + SG   +   +N+++GA+C + +M + Y+ +
Sbjct: 307 TTFIYSVLVHTYGTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVL 366

Query: 397 NDMKKLGIGPNSRTYDIILHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCN 456
            +MK  G+ PNS++ +IIL HLI+ G   EA+ VF +M +   CEP   TY ++++M C 
Sbjct: 367 KEMKSKGVTPNSKSCNIILRHLIERGEKDEAFDVFRKMIKV--CEPDADTYTMVIKMFCE 426

Query: 457 KERVDMAIRIWDQMKARGVLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPPATM 516
           K+ ++ A ++W  M+ +GV P MH FS LIN LC E   + AC   EEM+++GIRP    
Sbjct: 427 KKEMETADKVWKYMRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVT 486

Query: 517 FSNLKQALLDEGRQDTALLLVEKLDRLRKAPL 549
           F  L+Q L+ E R+D    L EK++ L   PL
Sbjct: 487 FGRLRQLLIKEEREDVLKFLNEKMNVLVNEPL 515

BLAST of CmoCh04G021980 vs. Swiss-Prot
Match: PP293_ARATH (Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidopsis thaliana GN=At3g62470 PE=2 SV=1)

HSP 1 Score: 260.4 bits (664), Expect = 4.5e-68
Identity = 132/452 (29.20%), Postives = 259/452 (57.30%), Query Frame = 1

Query: 67  SYASHHFSDHGTEHSKQDSDADE---ISIMASAEIAQDAEKICKLLTK--SPSSCIESLL 126
           S  S + SD   E  + + D DE   +S + S+   ++ E++CK++ +  +    +E++L
Sbjct: 93  SSGSSNVSDGCDEEVESECDNDEETGVSCVESSTNPEEVERVCKVIDELFALDRNMEAVL 152

Query: 127 DGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFN 186
           D   +++S  L+VEVL++  +A   A  FF WA +++GF H + +YNS++  L K +QF 
Sbjct: 153 DEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHDSRTYNSMMSILAKTRQFE 212

Query: 187 VIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLID 246
            + +++ +M  KG+L+ ETF +  + +A A++ K+A+  FE M+K+ F++G+   N L+D
Sbjct: 213 TMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVETINCLLD 272

Query: 247 TLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEP 306
           +L +++    AQ +FDK+K  RF P++ +YT+LL GW + +NL+    ++ +M D G +P
Sbjct: 273 SLGRAKLGKEAQVLFDKLK-ERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMIDQGLKP 332

Query: 307 DVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFF 366
           D+V   +++    +++K  +AI+LFH MK+K   P+   +  +I     +  +  A+++F
Sbjct: 333 DIVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMETAIEYF 392

Query: 367 KQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKA 426
                SG  P+A  Y  ++  +    K+   Y+ + +M++ G  P+ +TY+ ++  +   
Sbjct: 393 DDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIKLMANQ 452

Query: 427 GRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHM 486
              + A  ++ +M +    EP++ T+ ++++        +M   +W++M  +G+ P  + 
Sbjct: 453 KMPEHATRIYNKMIQNE-IEPSIHTFNMIMKSYFMARNYEMGRAVWEEMIKKGICPDDNS 512

Query: 487 FSTLINSLCHENKLECACKYFEEMLDLGIRPP 514
           ++ LI  L  E K   AC+Y EEMLD G++ P
Sbjct: 513 YTVLIRGLIGEGKSREACRYLEEMLDKGMKTP 542

BLAST of CmoCh04G021980 vs. Swiss-Prot
Match: PP294_ARATH (Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidopsis thaliana GN=At3g62540 PE=2 SV=1)

HSP 1 Score: 259.2 bits (661), Expect = 1.0e-67
Identity = 132/452 (29.20%), Postives = 257/452 (56.86%), Query Frame = 1

Query: 67  SYASHHFSDHGTEHSKQDSDADE---ISIMASAEIAQDAEKICKLLTK--SPSSCIESLL 126
           S  S + SD   E  + + D DE   +S + S+   ++ E++CK++ +  +    +E++L
Sbjct: 93  SSGSSNVSDGCDEEVESECDNDEETGVSCVESSTNPEEVERVCKVIDELFALDRNMEAVL 152

Query: 127 DGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFN 186
           D   +++S  L+VEVL++  +A   A  FF WA +++GF H + +YNS++  L K +QF 
Sbjct: 153 DEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHASRTYNSMMSILAKTRQFE 212

Query: 187 VIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLID 246
            + +++ +M  KG+L+ ETF +  + +A A++ K+A+  FE M+K+ F++G+   N L+D
Sbjct: 213 TMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVETINCLLD 272

Query: 247 TLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEP 306
           +L +++    AQ +FDK+K  RF P++ +YT+LL GW + +NL+    ++ +M D G +P
Sbjct: 273 SLGRAKLGKEAQVLFDKLK-ERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMIDHGLKP 332

Query: 307 DVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFF 366
           D+V   +++    ++ K  +AI+LFH MK+K   P+   +  +I     +  +  A+++F
Sbjct: 333 DIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMETAIEYF 392

Query: 367 KQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKA 426
                SG  P+A  Y  ++  +    K+   Y+ + +M++ G  P+ +TY+ ++  +   
Sbjct: 393 DDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIKLMANQ 452

Query: 427 GRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHM 486
              +    ++ +M +    EP++ T+ ++++        +M   +WD+M  +G+ P  + 
Sbjct: 453 KMPEHGTRIYNKMIQNE-IEPSIHTFNMIMKSYFVARNYEMGRAVWDEMIKKGICPDDNS 512

Query: 487 FSTLINSLCHENKLECACKYFEEMLDLGIRPP 514
           ++ LI  L  E K   AC+Y EEMLD G++ P
Sbjct: 513 YTVLIRGLISEGKSREACRYLEEMLDKGMKTP 542

BLAST of CmoCh04G021980 vs. Swiss-Prot
Match: PP382_ARATH (Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidopsis thaliana GN=At5g14820 PE=2 SV=1)

HSP 1 Score: 258.5 bits (659), Expect = 1.7e-67
Identity = 138/473 (29.18%), Postives = 265/473 (56.03%), Query Frame = 1

Query: 46  QTPICTSSRCNDHSAGVIPNRSYASHHFSDHGTEHSKQDSDADE---ISIMASAEIAQDA 105
           Q P+  S +  D S G     S  S + SD   E  + + D DE   +S + S+   ++ 
Sbjct: 72  QIPLPHSVQLLDASLGC-RGFSSGSSNVSDGCDEEVESECDNDEETGVSCVESSTNPEEV 131

Query: 106 EKICKLLTK--SPSSCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGF 165
           E++CK++ +  +    +E++LD   +++S  L+VEVL++  +A   A  FF WA +++GF
Sbjct: 132 ERVCKVIDELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGF 191

Query: 166 KHTTESYNSLIESLGKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEA 225
            H + +YNS++  L K +QF  + +++ +M  KG+L+ ETF +  + +A A++ K+A+  
Sbjct: 192 AHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGI 251

Query: 226 FEKMEKFGFQLGISDFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQ 285
           FE M+K+ F++G+   N L+D+L +++    AQ +FDK+K  RF P++ +YT+LL GW +
Sbjct: 252 FELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLK-ERFTPNMMTYTVLLNGWCR 311

Query: 286 EQNLLRLNEVYREMRDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHV 345
            +NL+    ++ +M D G +PD+V   +++    ++ K  +AI+LFH MK+K   P+   
Sbjct: 312 VRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRS 371

Query: 346 FCTLINGLGSEKRLNEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMK 405
           +  +I     +  +  A+++F     SG  P+A  Y  ++  +    K+   Y+ + +M+
Sbjct: 372 YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQ 431

Query: 406 KLGIGPNSRTYDIILHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERV 465
           + G  P+ +TY+ ++  +      +    ++ +M +    EP++ T+ ++++        
Sbjct: 432 EKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNE-IEPSIHTFNMIMKSYFVARNY 491

Query: 466 DMAIRIWDQMKARGVLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPP 514
           +M   +WD+M  +G+ P  + ++ LI  L  E K   AC+Y EEMLD G++ P
Sbjct: 492 EMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGMKTP 541

BLAST of CmoCh04G021980 vs. TrEMBL
Match: A0A0A0KUY5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G571490 PE=4 SV=1)

HSP 1 Score: 873.2 bits (2255), Expect = 1.6e-250
Identity = 433/550 (78.73%), Postives = 490/550 (89.09%), Query Frame = 1

Query: 1   MGFRQFLSSLRGLSRRTSSFHSLPSSSSYTGSFRSVSRSPAITRNQTPICTSSRCNDHSA 60
           MGFRQ LSSL+ L  RTSSF           S RSVS S  IT+NQT IC SS CN+HS 
Sbjct: 1   MGFRQSLSSLQRLFPRTSSFQY--------ASARSVSCSKTITKNQTSICISSHCNEHST 60

Query: 61  GVIPNRSYASHHFSDHGTEHSKQDSDADEISIMASAEIAQDAEKICKLLTKSPSSCIESL 120
            +IPNR+Y SH  + H  EHSKQD  A + SI+ S EIAQDAEK CKL++K+P+SCIESL
Sbjct: 61  LLIPNRNYTSHRSTHHRIEHSKQDLKASQKSIVESDEIAQDAEKFCKLISKNPNSCIESL 120

Query: 121 LDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQF 180
           LDGA +E+SPAL+VEVLKK+SNAG LALSFFRWAEKQKGFKHTTESYN LIE+LGKIKQF
Sbjct: 121 LDGAPMELSPALIVEVLKKLSNAGFLALSFFRWAEKQKGFKHTTESYNLLIEALGKIKQF 180

Query: 181 NVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLI 240
           NVIWNLV+DMKRKGILSRETFALI+RRYARARKVKEA+E+FEKMEKFGFQ+G+SDFNRL+
Sbjct: 181 NVIWNLVSDMKRKGILSRETFALITRRYARARKVKEAVESFEKMEKFGFQMGVSDFNRLL 240

Query: 241 DTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFE 300
           DTL KSRNV  AQEVFDKMKH RFKPDIKSYTILLEGWGQ+QNLL+LNEVYREMRD+GFE
Sbjct: 241 DTLCKSRNVKKAQEVFDKMKHGRFKPDIKSYTILLEGWGQDQNLLKLNEVYREMRDEGFE 300

Query: 301 PDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDF 360
           PDVVTFGI+INAHCKA+KYDEAI+LFH M+AKN+KPSPHVFCTLINGLGSEKRL EAL+F
Sbjct: 301 PDVVTFGILINAHCKARKYDEAIRLFHEMEAKNIKPSPHVFCTLINGLGSEKRLKEALEF 360

Query: 361 FKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIK 420
           F+Q K SG+APEAPTYNAVVGAYCWSMKMA AY+ V++M+K G+GPNSRTYDIILHHLIK
Sbjct: 361 FEQLKLSGFAPEAPTYNAVVGAYCWSMKMAYAYRMVDEMRKSGVGPNSRTYDIILHHLIK 420

Query: 421 AGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMH 480
             +SKEAYSVF+RMSREPGCEP LSTY+IM+RM CN+ERVDMAI+IWD+MKA+GVLPGMH
Sbjct: 421 GRKSKEAYSVFQRMSREPGCEPTLSTYDIMIRMFCNEERVDMAIQIWDEMKAKGVLPGMH 480

Query: 481 MFSTLINSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALLLVEKL 540
           +FSTLINSLCHE+KLE AC YF+EMLD+GIRPPATMFSNLKQALLD+GR+DTALL+ EK+
Sbjct: 481 LFSTLINSLCHEHKLEDACTYFQEMLDVGIRPPATMFSNLKQALLDDGRKDTALLMAEKI 540

Query: 541 DRLRKAPLHG 551
            +LRKAPL G
Sbjct: 541 KKLRKAPLVG 542

BLAST of CmoCh04G021980 vs. TrEMBL
Match: F6HFJ5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g04690 PE=4 SV=1)

HSP 1 Score: 705.7 bits (1820), Expect = 4.5e-200
Identity = 340/495 (68.69%), Postives = 417/495 (84.24%), Query Frame = 1

Query: 56  NDHSAGVIPNRSYASHHFSDHGTEHSKQDSDADEISIMASAEIAQDAEKICKLLTKSPSS 115
           N++S G  P     S H      +   + +DA       +  IAQD  K+CKLL    +S
Sbjct: 43  NENSCGFSPFSFCRSIHGGSIRNQACTETTDA------TNHHIAQDTGKLCKLLCTHSNS 102

Query: 116 CIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLG 175
            IESLL+GAS++VSP LV+EVLKK+SN+G++ALSFFRWAEKQKGFK++TE+YN+LIE+LG
Sbjct: 103 SIESLLNGASVDVSPTLVLEVLKKLSNSGVIALSFFRWAEKQKGFKYSTENYNALIEALG 162

Query: 176 KIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISD 235
           KIKQF +IWNLVNDM+ KG+L++ETFALISRRYARARKVKEA+E FEKMEKFG Q  +SD
Sbjct: 163 KIKQFKMIWNLVNDMRSKGLLTQETFALISRRYARARKVKEAVETFEKMEKFGLQPVLSD 222

Query: 236 FNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMR 295
           FNRL+D L KSR+V  AQEVFDKMK R+F+PDIKSYTILLEGWGQEQNLLRL+EVYREM+
Sbjct: 223 FNRLLDALCKSRHVERAQEVFDKMKDRKFRPDIKSYTILLEGWGQEQNLLRLDEVYREMK 282

Query: 296 DDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLN 355
           D+GFEPD VT+GI+INAHCKA++YD A++LFH M+A    P+PH++CTLINGLGSE+RL 
Sbjct: 283 DEGFEPDAVTYGILINAHCKARRYDAAVELFHKMEANKCMPTPHIYCTLINGLGSERRLT 342

Query: 356 EALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIIL 415
           EAL FF++SK+SG+ PEAPTYNAVVG+YC SM+M DAY+ V++M+K G+GP +RTYDIIL
Sbjct: 343 EALQFFERSKASGFTPEAPTYNAVVGSYCQSMRMDDAYRIVDEMRKCGVGPQTRTYDIIL 402

Query: 416 HHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGV 475
           HHLIKA R+KEAY VF+ MS EPGCEP++STYEI+VRM CN+ERVDMA+R+WD+MKA+GV
Sbjct: 403 HHLIKARRTKEAYRVFQGMSSEPGCEPSVSTYEIVVRMFCNEERVDMALRVWDEMKAKGV 462

Query: 476 LPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALL 535
           LPGMHMFSTLINSLC+ENKL+ ACKYF EMLD+GIRPPA MFSNLKQ LLDEG+QD  L+
Sbjct: 463 LPGMHMFSTLINSLCYENKLDEACKYFHEMLDMGIRPPAAMFSNLKQTLLDEGKQDMVLI 522

Query: 536 LVEKLDRLRKAPLHG 551
           L +KLD++R   + G
Sbjct: 523 LAQKLDKIRTTQVIG 531

BLAST of CmoCh04G021980 vs. TrEMBL
Match: A0A061E7F9_THECC (Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cacao GN=TCM_010824 PE=4 SV=1)

HSP 1 Score: 695.3 bits (1793), Expect = 6.1e-197
Identity = 332/457 (72.65%), Postives = 401/457 (87.75%), Query Frame = 1

Query: 94  ASAEIAQDAEKICKLLTKSPSSCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRW 153
           A  ++ +DA KICKLL+      ++ LL+ ASIEVSP+LV EVLK++SNAG++A+SFF W
Sbjct: 86  AEPKLEEDAAKICKLLSSRSDIHVDKLLENASIEVSPSLVAEVLKRLSNAGVIAMSFFTW 145

Query: 154 AEKQKGFKHTTESYNSLIESLGKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARK 213
           AEKQKGFK+ TESYN+LIE+LGKIKQF +IWNL+NDMK   +LS++TFALISRRYARARK
Sbjct: 146 AEKQKGFKYNTESYNALIEALGKIKQFKLIWNLLNDMKSSKLLSKDTFALISRRYARARK 205

Query: 214 VKEAIEAFEKMEKFGFQLGISDFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTI 273
           V+EAIEAFE+ME+FGF+L  SDFNRLIDTLSKSR+V  A +VFDKMK RRF PDIKSYTI
Sbjct: 206 VEEAIEAFERMEEFGFKLDTSDFNRLIDTLSKSRHVEKANKVFDKMKKRRFVPDIKSYTI 265

Query: 274 LLEGWGQEQNLLRLNEVYREMRDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKN 333
           LLEGWG+E NLLRL+EVYREM+D+GFEPDVVT+GI+INA+CKAKKY+ A++LFH M+AKN
Sbjct: 266 LLEGWGKEHNLLRLDEVYREMKDEGFEPDVVTYGILINAYCKAKKYNRAVELFHEMEAKN 325

Query: 334 VKPSPHVFCTLINGLGSEKRLNEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAY 393
            KPSPHVFCTLINGLGSEKRL+EAL+FF++SKS G+APEAPTYN++VGAYCWSM+M DA+
Sbjct: 326 CKPSPHVFCTLINGLGSEKRLSEALEFFERSKSCGFAPEAPTYNSLVGAYCWSMQMDDAF 385

Query: 394 KTVNDMKKLGIGPNSRTYDIILHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRM 453
           + + +M++  +GPNSRTYDIILHHLIKA R KEAY VF++M+ EPGC P +STYEI+VRM
Sbjct: 386 RVIGEMRRNLVGPNSRTYDIILHHLIKARRMKEAYLVFQKMTSEPGCVPTVSTYEIIVRM 445

Query: 454 LCNKERVDMAIRIWDQMKARGVLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPP 513
            CN+E+VDMA+ +W QMKA GVLPGMHMFS LINSLCH +KL+ ACKYF+EMLD GIRPP
Sbjct: 446 FCNEEQVDMAMLVWAQMKAEGVLPGMHMFSDLINSLCHNSKLDDACKYFQEMLDAGIRPP 505

Query: 514 ATMFSNLKQALLDEGRQDTALLLVEKLDRLRKAPLHG 551
           A MFSNLKQALLDEG++DTAL L  K+D+LRK PL G
Sbjct: 506 AKMFSNLKQALLDEGKKDTALNLARKIDKLRKMPLVG 542

BLAST of CmoCh04G021980 vs. TrEMBL
Match: B9SYJ4_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1288600 PE=4 SV=1)

HSP 1 Score: 686.4 bits (1770), Expect = 2.8e-194
Identity = 327/482 (67.84%), Postives = 415/482 (86.10%), Query Frame = 1

Query: 66  RSYASHHFSDHGTEHSKQDSDADEISIMASAEIAQDAEKICKLLTKSPSSCIESLLDGAS 125
           R+  S H +   + H+    D ++   + + +I +D   IC+LL+K+P+S IE LL GAS
Sbjct: 53  RALLSPHLNFQRSIHA----DLEQNPEIFTDKIIEDTRNICRLLSKNPNSSIEKLLLGAS 112

Query: 126 IEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFNVIWN 185
            +VSPALV+EVLK++SNAG LALSFF+WAEKQKGF + TESYN+LI+SLGKIKQFN+IWN
Sbjct: 113 FKVSPALVLEVLKRLSNAGALALSFFKWAEKQKGFMYNTESYNALIDSLGKIKQFNMIWN 172

Query: 186 LVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLIDTLSK 245
           LVNDMKRKG+L++ETFALISRRYAR+ KVKEA+  FEKMEKFG ++  +DFNRL+DTL K
Sbjct: 173 LVNDMKRKGVLTKETFALISRRYARSGKVKEAMNTFEKMEKFGLKIESTDFNRLLDTLIK 232

Query: 246 SRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEPDVVT 305
           SR V  AQ VFDKMK RRF PDIKSYTILLEGWGQE+NLL+L+EVYREM+D+GFEPDVVT
Sbjct: 233 SRQVLSAQNVFDKMKIRRFVPDIKSYTILLEGWGQEKNLLKLDEVYREMKDEGFEPDVVT 292

Query: 306 FGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFFKQSK 365
           +GI+INA+CK +KYD+AI+LF  M++KN +PSPH+FCTLINGLGS +RL+EAL+FF++SK
Sbjct: 293 YGILINAYCKVRKYDDAIELFREMESKNCQPSPHIFCTLINGLGSVRRLSEALEFFRRSK 352

Query: 366 SSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKAGRSK 425
           +SG+APE PTYNAVVGAYCWSM++ DAY+ V++M+K G+GPNSRTYDIILHHLIKA ++ 
Sbjct: 353 ASGFAPETPTYNAVVGAYCWSMRIDDAYRMVDEMRKSGVGPNSRTYDIILHHLIKAEKTT 412

Query: 426 EAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHMFSTL 485
           EA+SVFE+MS E  CEP +STY+I+VRM CN ++V+ AI++WD+MKA+GV PGMHMFS L
Sbjct: 413 EAFSVFEKMSSEEECEPTVSTYDIIVRMFCNMDKVESAIKVWDRMKAKGVHPGMHMFSIL 472

Query: 486 INSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALLLVEKLDRLRK 545
           INSLCHENKL+ ACKYF+EMLD+GIRPPA +FS+LKQAL+++GR+DT +LL +K+D+LRK
Sbjct: 473 INSLCHENKLDIACKYFQEMLDVGIRPPAALFSHLKQALIEDGRKDTVVLLAQKIDKLRK 530

Query: 546 AP 548
            P
Sbjct: 533 TP 530

BLAST of CmoCh04G021980 vs. TrEMBL
Match: B9HW85_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s12490g PE=4 SV=2)

HSP 1 Score: 685.3 bits (1767), Expect = 6.3e-194
Identity = 334/483 (69.15%), Postives = 415/483 (85.92%), Query Frame = 1

Query: 72  HFSDHGTEHSKQD-----SDADEISIMASAE-IAQDAEKICKLLTKSPSSCIESLLDGAS 131
           HF  H + H+        +DA   +    A+ I +DAE ICKLL+K+P+S +E+LL+ AS
Sbjct: 5   HFIVHKSTHTDLQQTVVLTDASNQTPQVLADKIVEDAENICKLLSKNPNSSVEALLNKAS 64

Query: 132 IEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFNVIWN 191
           +EVSP+LV E LKK+SNAG LALSFFRWAEKQKGF+++TESY++LIESLGKIKQFNVIWN
Sbjct: 65  MEVSPSLVFEALKKLSNAGALALSFFRWAEKQKGFQYSTESYHALIESLGKIKQFNVIWN 124

Query: 192 LVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLIDTLSK 251
           LV DMK+KG+L++ETFALISRRYARARKVKEA++AF KMEKFG ++  SD NRL+DTL K
Sbjct: 125 LVTDMKQKGLLNKETFALISRRYARARKVKEAVDAFMKMEKFGLKIESSDVNRLLDTLCK 184

Query: 252 SRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEPDVVT 311
           SR V  AQ VFDKM  R F  DIKSYTILLEGWGQE+NL RL EVY EM+D+GFEPDVVT
Sbjct: 185 SRQVERAQLVFDKMNKRGFVADIKSYTILLEGWGQEKNLSRLMEVYNEMKDEGFEPDVVT 244

Query: 312 FGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFFKQSK 371
           +GI+INAHCK+++YD+AI+LFH M+AKN KPSPH++CTLINGLG+EKRL+EAL+FF+ SK
Sbjct: 245 YGILINAHCKSRRYDDAIELFHEMEAKNCKPSPHIYCTLINGLGAEKRLSEALEFFELSK 304

Query: 372 SSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKAGRSK 431
           +SG+ PEAPTYNAVVGAYCWS +M D  +T+++M+K G+GP++RTYDIILHHLI+AG++K
Sbjct: 305 ASGFVPEAPTYNAVVGAYCWSERMDDVQRTIDEMRKGGVGPSARTYDIILHHLIRAGKTK 364

Query: 432 EAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHMFSTL 491
            AYSVF++MS E GCEP++STYEI+VRM CN++RVDMAI++WDQMKA+G+LP MHMFSTL
Sbjct: 365 IAYSVFQKMSCE-GCEPSVSTYEIIVRMFCNEDRVDMAIKVWDQMKAKGILPVMHMFSTL 424

Query: 492 INSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALLLVEKLDRLRK 549
           INSLCHE+KL+ AC YF+EMLD+GIRPPA +FSNLKQ LLDEG++DT ++   KLD+LRK
Sbjct: 425 INSLCHESKLDEACMYFQEMLDVGIRPPAQLFSNLKQNLLDEGKKDTVVVFERKLDKLRK 484

BLAST of CmoCh04G021980 vs. TAIR10
Match: AT1G71060.1 (AT1G71060.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 678.7 bits (1750), Expect = 3.0e-195
Identity = 333/483 (68.94%), Postives = 404/483 (83.64%), Query Frame = 1

Query: 68  YASHHFSDHGTEHSKQDSDADEISIMASAEIAQDAEKICKLLTKSPSSCIESLLDGASIE 127
           Y S H S   T+ S  D+             +QDAE+ICK+LTK   S +E+LL+ AS++
Sbjct: 45  YGSFHASSVETQVSANDA-------------SQDAERICKILTKFTDSKVETLLNEASVK 104

Query: 128 VSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFNVIWNLV 187
           +SPAL+ EVLKK+SNAG+LALS F+WAE QKGFKHTT +YN+LIESLGKIKQF +IW+LV
Sbjct: 105 LSPALIEEVLKKLSNAGVLALSVFKWAENQKGFKHTTSNYNALIESLGKIKQFKLIWSLV 164

Query: 188 NDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLIDTLSKSR 247
           +DMK K +LS+ETFALISRRYARARKVKEAI AF KME+FGF++  SDFNR++DTLSKSR
Sbjct: 165 DDMKAKKLLSKETFALISRRYARARKVKEAIGAFHKMEEFGFKMESSDFNRMLDTLSKSR 224

Query: 248 NVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEPDVVTFG 307
           NVG AQ+VFDKMK +RF+PDIKSYTILLEGWGQE NLLR++EV REM+D+GFEPDVV +G
Sbjct: 225 NVGDAQKVFDKMKKKRFEPDIKSYTILLEGWGQELNLLRVDEVNREMKDEGFEPDVVAYG 284

Query: 308 IVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFFKQSKSS 367
           I+INAHCKAKKY+EAI+ F+ M+ +N KPSPH+FC+LINGLGSEK+LN+AL+FF++SKSS
Sbjct: 285 IIINAHCKAKKYEEAIRFFNEMEQRNCKPSPHIFCSLINGLGSEKKLNDALEFFERSKSS 344

Query: 368 GYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKAGRSKEA 427
           G+  EAPTYNA+VGAYCWS +M DAYKTV++M+  G+GPN+RTYDIILHHLI+  RSKEA
Sbjct: 345 GFPLEAPTYNALVGAYCWSQRMEDAYKTVDEMRLKGVGPNARTYDIILHHLIRMQRSKEA 404

Query: 428 YSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHMFSTLIN 487
           Y V++ MS    CEP +STYEIMVRM CNKER+DMAI+IWD+MK +GVLPGMHMFS+LI 
Sbjct: 405 YEVYQTMS----CEPTVSTYEIMVRMFCNKERLDMAIKIWDEMKGKGVLPGMHMFSSLIT 464

Query: 488 SLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALLLVEKLDRLRKAP 547
           +LCHENKL+ AC+YF EMLD+GIRPP  MFS LKQ LLDEGR+D    LV K+DRLRK  
Sbjct: 465 ALCHENKLDEACEYFNEMLDVGIRPPGHMFSRLKQTLLDEGRKDKVTDLVVKMDRLRKTQ 510

Query: 548 LHG 551
           L G
Sbjct: 525 LVG 510

BLAST of CmoCh04G021980 vs. TAIR10
Match: AT1G77360.1 (AT1G77360.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 394.8 bits (1013), Expect = 8.6e-110
Identity = 190/452 (42.04%), Postives = 293/452 (64.82%), Query Frame = 1

Query: 97  EIAQDAEKICKLLTKSPSSCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEK 156
           ++A  A+ I K+L  SP   ++S LD + + VS  +V +VL +  NAGLL   FF+W+EK
Sbjct: 67  DVADVAKNISKVLMSSPQLVLDSALDQSGLRVSQEVVEDVLNRFRNAGLLTYRFFQWSEK 126

Query: 157 QKGFKHTTESYNSLIESLGKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKE 216
           Q+ ++H+  +Y+ +IES  KI+Q+ ++W+L+N M++K +L+ ETF ++ R+YARA+KV E
Sbjct: 127 QRHYEHSVRAYHMMIESTAKIRQYKLMWDLINAMRKKKMLNVETFCIVMRKYARAQKVDE 186

Query: 217 AIEAFEKMEKFGFQLGISDFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLE 276
           AI AF  MEK+     +  FN L+  L KS+NV  AQEVF+ M+ R F PD K+Y+ILLE
Sbjct: 187 AIYAFNVMEKYDLPPNLVAFNGLLSALCKSKNVRKAQEVFENMRDR-FTPDSKTYSILLE 246

Query: 277 GWGQEQNLLRLNEVYREMRDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKP 336
           GWG+E NL +  EV+REM D G  PD+VT+ I+++  CKA + DEA+ +  +M     KP
Sbjct: 247 GWGKEPNLPKAREVFREMIDAGCHPDIVTYSIMVDILCKAGRVDEALGIVRSMDPSICKP 306

Query: 337 SPHVFCTLINGLGSEKRLNEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTV 396
           +  ++  L++  G+E RL EA+D F + + SG   +   +N+++GA+C + +M + Y+ +
Sbjct: 307 TTFIYSVLVHTYGTENRLEEAVDTFLEMERSGMKADVAVFNSLIGAFCKANRMKNVYRVL 366

Query: 397 NDMKKLGIGPNSRTYDIILHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCN 456
            +MK  G+ PNS++ +IIL HLI+ G   EA+ VF +M +   CEP   TY ++++M C 
Sbjct: 367 KEMKSKGVTPNSKSCNIILRHLIERGEKDEAFDVFRKMIKV--CEPDADTYTMVIKMFCE 426

Query: 457 KERVDMAIRIWDQMKARGVLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPPATM 516
           K+ ++ A ++W  M+ +GV P MH FS LIN LC E   + AC   EEM+++GIRP    
Sbjct: 427 KKEMETADKVWKYMRKKGVFPSMHTFSVLINGLCEERTTQKACVLLEEMIEMGIRPSGVT 486

Query: 517 FSNLKQALLDEGRQDTALLLVEKLDRLRKAPL 549
           F  L+Q L+ E R+D    L EK++ L   PL
Sbjct: 487 FGRLRQLLIKEEREDVLKFLNEKMNVLVNEPL 515

BLAST of CmoCh04G021980 vs. TAIR10
Match: AT3G62470.1 (AT3G62470.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 260.4 bits (664), Expect = 2.5e-69
Identity = 132/452 (29.20%), Postives = 259/452 (57.30%), Query Frame = 1

Query: 67  SYASHHFSDHGTEHSKQDSDADE---ISIMASAEIAQDAEKICKLLTK--SPSSCIESLL 126
           S  S + SD   E  + + D DE   +S + S+   ++ E++CK++ +  +    +E++L
Sbjct: 93  SSGSSNVSDGCDEEVESECDNDEETGVSCVESSTNPEEVERVCKVIDELFALDRNMEAVL 152

Query: 127 DGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFN 186
           D   +++S  L+VEVL++  +A   A  FF WA +++GF H + +YNS++  L K +QF 
Sbjct: 153 DEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHDSRTYNSMMSILAKTRQFE 212

Query: 187 VIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLID 246
            + +++ +M  KG+L+ ETF +  + +A A++ K+A+  FE M+K+ F++G+   N L+D
Sbjct: 213 TMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVETINCLLD 272

Query: 247 TLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEP 306
           +L +++    AQ +FDK+K  RF P++ +YT+LL GW + +NL+    ++ +M D G +P
Sbjct: 273 SLGRAKLGKEAQVLFDKLK-ERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMIDQGLKP 332

Query: 307 DVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFF 366
           D+V   +++    +++K  +AI+LFH MK+K   P+   +  +I     +  +  A+++F
Sbjct: 333 DIVAHNVMLEGLLRSRKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMETAIEYF 392

Query: 367 KQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKA 426
                SG  P+A  Y  ++  +    K+   Y+ + +M++ G  P+ +TY+ ++  +   
Sbjct: 393 DDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIKLMANQ 452

Query: 427 GRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHM 486
              + A  ++ +M +    EP++ T+ ++++        +M   +W++M  +G+ P  + 
Sbjct: 453 KMPEHATRIYNKMIQNE-IEPSIHTFNMIMKSYFMARNYEMGRAVWEEMIKKGICPDDNS 512

Query: 487 FSTLINSLCHENKLECACKYFEEMLDLGIRPP 514
           ++ LI  L  E K   AC+Y EEMLD G++ P
Sbjct: 513 YTVLIRGLIGEGKSREACRYLEEMLDKGMKTP 542

BLAST of CmoCh04G021980 vs. TAIR10
Match: AT3G62540.1 (AT3G62540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 259.2 bits (661), Expect = 5.6e-69
Identity = 132/452 (29.20%), Postives = 257/452 (56.86%), Query Frame = 1

Query: 67  SYASHHFSDHGTEHSKQDSDADE---ISIMASAEIAQDAEKICKLLTK--SPSSCIESLL 126
           S  S + SD   E  + + D DE   +S + S+   ++ E++CK++ +  +    +E++L
Sbjct: 93  SSGSSNVSDGCDEEVESECDNDEETGVSCVESSTNPEEVERVCKVIDELFALDRNMEAVL 152

Query: 127 DGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQFN 186
           D   +++S  L+VEVL++  +A   A  FF WA +++GF H + +YNS++  L K +QF 
Sbjct: 153 DEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGFAHASRTYNSMMSILAKTRQFE 212

Query: 187 VIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLID 246
            + +++ +M  KG+L+ ETF +  + +A A++ K+A+  FE M+K+ F++G+   N L+D
Sbjct: 213 TMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGIFELMKKYKFKIGVETINCLLD 272

Query: 247 TLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFEP 306
           +L +++    AQ +FDK+K  RF P++ +YT+LL GW + +NL+    ++ +M D G +P
Sbjct: 273 SLGRAKLGKEAQVLFDKLK-ERFTPNMMTYTVLLNGWCRVRNLIEAARIWNDMIDHGLKP 332

Query: 307 DVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDFF 366
           D+V   +++    ++ K  +AI+LFH MK+K   P+   +  +I     +  +  A+++F
Sbjct: 333 DIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRSYTIMIRDFCKQSSMETAIEYF 392

Query: 367 KQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIKA 426
                SG  P+A  Y  ++  +    K+   Y+ + +M++ G  P+ +TY+ ++  +   
Sbjct: 393 DDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQEKGHPPDGKTYNALIKLMANQ 452

Query: 427 GRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMHM 486
              +    ++ +M +    EP++ T+ ++++        +M   +WD+M  +G+ P  + 
Sbjct: 453 KMPEHGTRIYNKMIQNE-IEPSIHTFNMIMKSYFVARNYEMGRAVWDEMIKKGICPDDNS 512

Query: 487 FSTLINSLCHENKLECACKYFEEMLDLGIRPP 514
           ++ LI  L  E K   AC+Y EEMLD G++ P
Sbjct: 513 YTVLIRGLISEGKSREACRYLEEMLDKGMKTP 542

BLAST of CmoCh04G021980 vs. TAIR10
Match: AT5G14820.1 (AT5G14820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 258.5 bits (659), Expect = 9.6e-69
Identity = 138/473 (29.18%), Postives = 265/473 (56.03%), Query Frame = 1

Query: 46  QTPICTSSRCNDHSAGVIPNRSYASHHFSDHGTEHSKQDSDADE---ISIMASAEIAQDA 105
           Q P+  S +  D S G     S  S + SD   E  + + D DE   +S + S+   ++ 
Sbjct: 72  QIPLPHSVQLLDASLGC-RGFSSGSSNVSDGCDEEVESECDNDEETGVSCVESSTNPEEV 131

Query: 106 EKICKLLTK--SPSSCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGF 165
           E++CK++ +  +    +E++LD   +++S  L+VEVL++  +A   A  FF WA +++GF
Sbjct: 132 ERVCKVIDELFALDRNMEAVLDEMKLDLSHDLIVEVLERFRHARKPAFRFFCWAAERQGF 191

Query: 166 KHTTESYNSLIESLGKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEA 225
            H + +YNS++  L K +QF  + +++ +M  KG+L+ ETF +  + +A A++ K+A+  
Sbjct: 192 AHDSRTYNSMMSILAKTRQFETMVSVLEEMGTKGLLTMETFTIAMKAFAAAKERKKAVGI 251

Query: 226 FEKMEKFGFQLGISDFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQ 285
           FE M+K+ F++G+   N L+D+L +++    AQ +FDK+K  RF P++ +YT+LL GW +
Sbjct: 252 FELMKKYKFKIGVETINCLLDSLGRAKLGKEAQVLFDKLK-ERFTPNMMTYTVLLNGWCR 311

Query: 286 EQNLLRLNEVYREMRDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHV 345
            +NL+    ++ +M D G +PD+V   +++    ++ K  +AI+LFH MK+K   P+   
Sbjct: 312 VRNLIEAARIWNDMIDHGLKPDIVAHNVMLEGLLRSMKKSDAIKLFHVMKSKGPCPNVRS 371

Query: 346 FCTLINGLGSEKRLNEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMK 405
           +  +I     +  +  A+++F     SG  P+A  Y  ++  +    K+   Y+ + +M+
Sbjct: 372 YTIMIRDFCKQSSMETAIEYFDDMVDSGLQPDAAVYTCLITGFGTQKKLDTVYELLKEMQ 431

Query: 406 KLGIGPNSRTYDIILHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERV 465
           + G  P+ +TY+ ++  +      +    ++ +M +    EP++ T+ ++++        
Sbjct: 432 EKGHPPDGKTYNALIKLMANQKMPEHGTRIYNKMIQNE-IEPSIHTFNMIMKSYFVARNY 491

Query: 466 DMAIRIWDQMKARGVLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPP 514
           +M   +WD+M  +G+ P  + ++ LI  L  E K   AC+Y EEMLD G++ P
Sbjct: 492 EMGRAVWDEMIKKGICPDDNSYTVLIRGLISEGKSREACRYLEEMLDKGMKTP 541

BLAST of CmoCh04G021980 vs. NCBI nr
Match: gi|659072880|ref|XP_008467144.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial [Cucumis melo])

HSP 1 Score: 880.2 bits (2273), Expect = 1.9e-252
Identity = 440/550 (80.00%), Postives = 487/550 (88.55%), Query Frame = 1

Query: 1   MGFRQFLSSLRGLSRRTSSFHSLPSSSSYTGSFRSVSRSPAITRNQTPICTSSRCNDHSA 60
           MGFR+ LSSL+ L  RTSSF           S RS S S  ITRNQT IC SSRCN+HS 
Sbjct: 1   MGFRRSLSSLQRLFPRTSSFQY--------ASARSFSCSKTITRNQTSICISSRCNEHST 60

Query: 61  GVIPNRSYASHHFSDHGTEHSKQDSDADEISIMASAEIAQDAEKICKLLTKSPSSCIESL 120
             I NR+Y S H + HG EHSKQD +A + SIM S EI QDAEK CKL++K+P+SCIESL
Sbjct: 61  WPILNRNYTSQHSTHHGNEHSKQDLEAGQKSIMESDEITQDAEKFCKLISKNPNSCIESL 120

Query: 121 LDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQF 180
           LDGA++E+SPAL+ EVLKK+ NAG LALSFFRWAEKQKGFKHTTESYN LIE+LGKIKQF
Sbjct: 121 LDGAALELSPALIEEVLKKLCNAGFLALSFFRWAEKQKGFKHTTESYNCLIEALGKIKQF 180

Query: 181 NVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLI 240
           NVIWNLV+DMKRKGILSRETFALI+RRYARARKVKEAIE+FEKMEKFGFQLG+SDFNRLI
Sbjct: 181 NVIWNLVSDMKRKGILSRETFALITRRYARARKVKEAIESFEKMEKFGFQLGVSDFNRLI 240

Query: 241 DTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFE 300
           DTL KSRNV  AQEVFDKMKH RFKPDIKSYTILLEGWGQ+QNLLRLNEVYREMRD+GFE
Sbjct: 241 DTLCKSRNVKKAQEVFDKMKHGRFKPDIKSYTILLEGWGQDQNLLRLNEVYREMRDNGFE 300

Query: 301 PDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDF 360
           PDVVTFGI+INAHCKA+KYDEAI+LFH MKAKN+KPSPHVFCTLINGLGSEKRL EAL+F
Sbjct: 301 PDVVTFGILINAHCKARKYDEAIRLFHDMKAKNIKPSPHVFCTLINGLGSEKRLKEALEF 360

Query: 361 FKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIK 420
           F+QSK SG+APEAPTYNAVVGAYCWSMKMADAY+ V+DM+K GIGPNSRTYDIILHHLIK
Sbjct: 361 FEQSKLSGFAPEAPTYNAVVGAYCWSMKMADAYRMVDDMRKSGIGPNSRTYDIILHHLIK 420

Query: 421 AGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMH 480
             +SKEAYSVF+RMSREPGCEP LSTYEIM+RM CN+ERVDMAI+IWD+MKA+GVLPGMH
Sbjct: 421 GRKSKEAYSVFQRMSREPGCEPTLSTYEIMIRMFCNEERVDMAIQIWDEMKAKGVLPGMH 480

Query: 481 MFSTLINSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALLLVEKL 540
           +FS LIN LCHENKLE AC YF+EMLD+GIRPPATMFSNLKQALLD+GR+DTALLL EK+
Sbjct: 481 LFSMLINRLCHENKLEDACTYFQEMLDVGIRPPATMFSNLKQALLDDGRKDTALLLAEKI 540

Query: 541 DRLRKAPLHG 551
            +LRK PL G
Sbjct: 541 KKLRKTPLVG 542

BLAST of CmoCh04G021980 vs. NCBI nr
Match: gi|449465535|ref|XP_004150483.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial [Cucumis sativus])

HSP 1 Score: 873.2 bits (2255), Expect = 2.3e-250
Identity = 433/550 (78.73%), Postives = 490/550 (89.09%), Query Frame = 1

Query: 1   MGFRQFLSSLRGLSRRTSSFHSLPSSSSYTGSFRSVSRSPAITRNQTPICTSSRCNDHSA 60
           MGFRQ LSSL+ L  RTSSF           S RSVS S  IT+NQT IC SS CN+HS 
Sbjct: 1   MGFRQSLSSLQRLFPRTSSFQY--------ASARSVSCSKTITKNQTSICISSHCNEHST 60

Query: 61  GVIPNRSYASHHFSDHGTEHSKQDSDADEISIMASAEIAQDAEKICKLLTKSPSSCIESL 120
            +IPNR+Y SH  + H  EHSKQD  A + SI+ S EIAQDAEK CKL++K+P+SCIESL
Sbjct: 61  LLIPNRNYTSHRSTHHRIEHSKQDLKASQKSIVESDEIAQDAEKFCKLISKNPNSCIESL 120

Query: 121 LDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESLGKIKQF 180
           LDGA +E+SPAL+VEVLKK+SNAG LALSFFRWAEKQKGFKHTTESYN LIE+LGKIKQF
Sbjct: 121 LDGAPMELSPALIVEVLKKLSNAGFLALSFFRWAEKQKGFKHTTESYNLLIEALGKIKQF 180

Query: 181 NVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGISDFNRLI 240
           NVIWNLV+DMKRKGILSRETFALI+RRYARARKVKEA+E+FEKMEKFGFQ+G+SDFNRL+
Sbjct: 181 NVIWNLVSDMKRKGILSRETFALITRRYARARKVKEAVESFEKMEKFGFQMGVSDFNRLL 240

Query: 241 DTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREMRDDGFE 300
           DTL KSRNV  AQEVFDKMKH RFKPDIKSYTILLEGWGQ+QNLL+LNEVYREMRD+GFE
Sbjct: 241 DTLCKSRNVKKAQEVFDKMKHGRFKPDIKSYTILLEGWGQDQNLLKLNEVYREMRDEGFE 300

Query: 301 PDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRLNEALDF 360
           PDVVTFGI+INAHCKA+KYDEAI+LFH M+AKN+KPSPHVFCTLINGLGSEKRL EAL+F
Sbjct: 301 PDVVTFGILINAHCKARKYDEAIRLFHEMEAKNIKPSPHVFCTLINGLGSEKRLKEALEF 360

Query: 361 FKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDIILHHLIK 420
           F+Q K SG+APEAPTYNAVVGAYCWSMKMA AY+ V++M+K G+GPNSRTYDIILHHLIK
Sbjct: 361 FEQLKLSGFAPEAPTYNAVVGAYCWSMKMAYAYRMVDEMRKSGVGPNSRTYDIILHHLIK 420

Query: 421 AGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARGVLPGMH 480
             +SKEAYSVF+RMSREPGCEP LSTY+IM+RM CN+ERVDMAI+IWD+MKA+GVLPGMH
Sbjct: 421 GRKSKEAYSVFQRMSREPGCEPTLSTYDIMIRMFCNEERVDMAIQIWDEMKAKGVLPGMH 480

Query: 481 MFSTLINSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTALLLVEKL 540
           +FSTLINSLCHE+KLE AC YF+EMLD+GIRPPATMFSNLKQALLD+GR+DTALL+ EK+
Sbjct: 481 LFSTLINSLCHEHKLEDACTYFQEMLDVGIRPPATMFSNLKQALLDDGRKDTALLMAEKI 540

Query: 541 DRLRKAPLHG 551
            +LRKAPL G
Sbjct: 541 KKLRKAPLVG 542

BLAST of CmoCh04G021980 vs. NCBI nr
Match: gi|694330949|ref|XP_009356158.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial-like [Pyrus x bretschneideri])

HSP 1 Score: 726.9 bits (1875), Expect = 2.7e-206
Identity = 353/496 (71.17%), Postives = 428/496 (86.29%), Query Frame = 1

Query: 57  DHSAGVIPNRSYASHHFSDHGTEHSKQDSDA-DEISIMASAEIAQDAEKICKLL-TKSPS 116
           ++ +G + NR+ + H     G E      DA D+  ++ + E+++DAEKICK+L T+S +
Sbjct: 26  ENPSGFLCNRTASFHRAIHDGLEKPLVIDDAGDKTPLLKNPEVSEDAEKICKILSTRSSN 85

Query: 117 SCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESL 176
           S IES LDGAS+E SP LVVEVLKK+SN+G+LALSFFRWAEKQKGFKH+TESYN+LIE+L
Sbjct: 86  SPIESFLDGASVEASPTLVVEVLKKLSNSGVLALSFFRWAEKQKGFKHSTESYNALIEAL 145

Query: 177 GKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGIS 236
           GKIKQF ++W LVN+MK KG+LS+ETFALISRRYARA+KVKEAI+AFEKM KFG ++  S
Sbjct: 146 GKIKQFKMMWELVNEMKIKGMLSKETFALISRRYARAKKVKEAIDAFEKMAKFGMKVEGS 205

Query: 237 DFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREM 296
           DFNRLIDTLSKSR V  AQEVFDKMK RRF+PDIKSYTILLEGWGQEQN LRLNEVYREM
Sbjct: 206 DFNRLIDTLSKSRQVERAQEVFDKMKKRRFEPDIKSYTILLEGWGQEQNFLRLNEVYREM 265

Query: 297 RDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRL 356
           +D+GF+PDVVT+ I+INAHCKAKKYDEAI LF  M+AKN+K +PH+FC LINGLGSEKRL
Sbjct: 266 KDEGFDPDVVTYAILINAHCKAKKYDEAIDLFREMEAKNIKATPHIFCILINGLGSEKRL 325

Query: 357 NEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDII 416
           +EAL+FF+ +K+SG+ PEAPTYNA+VG+YCWSM+M D ++ V++M+K GIGPN+RTYDII
Sbjct: 326 SEALEFFELNKASGFIPEAPTYNALVGSYCWSMRMQDTFRVVDEMRKCGIGPNARTYDII 385

Query: 417 LHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARG 476
           LHHL+KA R+++AYS+F++MSREPGCEP +STYEIMVRM CN+ERVDMA+++WDQMK RG
Sbjct: 386 LHHLVKARRTEQAYSIFQQMSREPGCEPTVSTYEIMVRMFCNEERVDMAMQVWDQMKTRG 445

Query: 477 VLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTAL 536
           VLPGMHMFSTLINSLCH NKL+ ACKYF+EMLD GIRPPA +FSNLKQALLD GR+D  +
Sbjct: 446 VLPGMHMFSTLINSLCHGNKLDDACKYFQEMLDAGIRPPAQLFSNLKQALLDGGRKDVVI 505

Query: 537 LLVEKLDRLRKAPLHG 551
            L  K+DRLRK PL G
Sbjct: 506 SLGLKIDRLRKTPLVG 521

BLAST of CmoCh04G021980 vs. NCBI nr
Match: gi|645276551|ref|XP_008243338.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial [Prunus mume])

HSP 1 Score: 720.3 bits (1858), Expect = 2.5e-204
Identity = 372/563 (66.07%), Postives = 453/563 (80.46%), Query Frame = 1

Query: 1   MGFRQFLSSLRGLSRRT-SSFHSLPS----SSSYTGSFRSVSRS------PAITRNQTPI 60
           MG  +F + +   S+   +SF   P+     +S TGS RS++ S         TRN+T  
Sbjct: 1   MGLSRFFNLVSKPSKEPINSFLPFPNLRSTPTSQTGSHRSLTTSLQSIKFTNFTRNRT-- 60

Query: 61  CTSSRCNDHSAGVIPNRSYASHHFSDHGTEHSK-QDSDADEISIMASAEIAQDAEKICKL 120
                  ++ +G + NR+   +     G   ++  D  +D+I    + EI +DAE+IC++
Sbjct: 61  ----LHTENPSGFLSNRTGGFYRAIHDGLGKTQVPDEASDKIPGANNREITEDAERICRI 120

Query: 121 LTKSPS-SCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESY 180
           L+ S S S I+S LD AS+EVS ALVVEVLKK+SNAG+LALSFFRWAEKQKGFKHT ESY
Sbjct: 121 LSTSNSKSPIDSFLDSASVEVSTALVVEVLKKLSNAGVLALSFFRWAEKQKGFKHTMESY 180

Query: 181 NSLIESLGKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKF 240
           N+LIE+LGKIKQF +IW LVNDMK KG+LS+ETFALISRRY+RA+KVKEAIE FEKMEKF
Sbjct: 181 NALIEALGKIKQFKMIWELVNDMKSKGLLSKETFALISRRYSRAKKVKEAIETFEKMEKF 240

Query: 241 GFQLGISDFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRL 300
           G ++  SDFNRLIDTLSKSR V  AQEVFDKMKH RFKPDIKSYTILLEGWGQEQN LRL
Sbjct: 241 GMKVEGSDFNRLIDTLSKSRQVEKAQEVFDKMKHTRFKPDIKSYTILLEGWGQEQNFLRL 300

Query: 301 NEVYREMRDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLING 360
           NEVYREM+D+GF+PDVVT GI+INAHCKAKKYDEAI LF  M+AKNVK +PH+FC LING
Sbjct: 301 NEVYREMKDEGFDPDVVTCGILINAHCKAKKYDEAIDLFREMEAKNVKATPHIFCILING 360

Query: 361 LGSEKRLNEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPN 420
           LGSE+RL+EAL+FF+ +K+SG+ PEAPTYNA+VGAYCWSM+M DA++ V +M+K GIGPN
Sbjct: 361 LGSERRLSEALEFFELNKASGFEPEAPTYNALVGAYCWSMRMHDAFRVVEEMRKCGIGPN 420

Query: 421 SRTYDIILHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIW 480
            RTYDIILHHL+KA R+++AYSVF+++SREPGCEP +STYEI+VRM CN+++VDMA+R+W
Sbjct: 421 PRTYDIILHHLVKARRTEQAYSVFQQISREPGCEPTVSTYEILVRMFCNEDQVDMALRVW 480

Query: 481 DQMKARGVLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDE 540
           DQMK +GVLPGMHMFSTLINSLCHENKL+ ACKYF+EMLD+GIRPPA MFSNLKQALL+E
Sbjct: 481 DQMKTKGVLPGMHMFSTLINSLCHENKLDDACKYFQEMLDVGIRPPAQMFSNLKQALLNE 540

Query: 541 GRQDTALLLVEKLDRLRKAPLHG 551
           GR+D  +    K+DRLRK PL G
Sbjct: 541 GRKDDVISFGLKIDRLRKTPLVG 557

BLAST of CmoCh04G021980 vs. NCBI nr
Match: gi|657993866|ref|XP_008389228.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial-like [Malus domestica])

HSP 1 Score: 720.3 bits (1858), Expect = 2.5e-204
Identity = 349/496 (70.36%), Postives = 427/496 (86.09%), Query Frame = 1

Query: 57  DHSAGVIPNRSYASHHFSDHGTEHSKQDSDA-DEISIMASAEIAQDAEKICKLL-TKSPS 116
           ++ +G + NR+ + H     G E  +   DA D+  ++ + ++++DAEKICK+L T S +
Sbjct: 26  ENPSGFLCNRTASFHRAIHDGLEKPQVLHDAGDKTPLLKNPKVSEDAEKICKILSTSSSN 85

Query: 117 SCIESLLDGASIEVSPALVVEVLKKMSNAGLLALSFFRWAEKQKGFKHTTESYNSLIESL 176
           S IES LDGAS+E SP LVVEVLKK+SNAG+LALSFFRWAEKQKGFKHTTESYN+LIE+L
Sbjct: 86  SPIESFLDGASVEASPTLVVEVLKKLSNAGVLALSFFRWAEKQKGFKHTTESYNALIEAL 145

Query: 177 GKIKQFNVIWNLVNDMKRKGILSRETFALISRRYARARKVKEAIEAFEKMEKFGFQLGIS 236
           GKIKQF ++W LVN+MK KG+LS+ETFALISRRYAR +KVKEAI+AFEKM KFG ++  S
Sbjct: 146 GKIKQFKMMWELVNEMKIKGMLSKETFALISRRYARVKKVKEAIDAFEKMAKFGMKVEGS 205

Query: 237 DFNRLIDTLSKSRNVGHAQEVFDKMKHRRFKPDIKSYTILLEGWGQEQNLLRLNEVYREM 296
           DFNRLIDTLSKSR V  AQEVFDKMK RRF+PDIKSYTILLEGWGQEQN LRLNEVYREM
Sbjct: 206 DFNRLIDTLSKSRQVERAQEVFDKMKKRRFEPDIKSYTILLEGWGQEQNFLRLNEVYREM 265

Query: 297 RDDGFEPDVVTFGIVINAHCKAKKYDEAIQLFHTMKAKNVKPSPHVFCTLINGLGSEKRL 356
           +D+GF+ DVVT+ I+INAHCKAKKYDEAI LF  M+AKN+K +PH+FC LINGLGSEKRL
Sbjct: 266 KDEGFDADVVTYAILINAHCKAKKYDEAIDLFREMEAKNIKATPHIFCILINGLGSEKRL 325

Query: 357 NEALDFFKQSKSSGYAPEAPTYNAVVGAYCWSMKMADAYKTVNDMKKLGIGPNSRTYDII 416
           +EAL+FF+ +K+SG+ PEAPTYNA+VG+YCWSM+M DA++ V++M+K GIGPN+RTYDII
Sbjct: 326 SEALEFFELNKASGFKPEAPTYNALVGSYCWSMRMQDAFRVVDEMRKCGIGPNARTYDII 385

Query: 417 LHHLIKAGRSKEAYSVFERMSREPGCEPALSTYEIMVRMLCNKERVDMAIRIWDQMKARG 476
           LHHL+KA R+++AYS+F++MSR+PGCEP +STYEIMVRM CN+ER+DMA+++WDQMK RG
Sbjct: 386 LHHLVKARRTEQAYSIFQQMSRDPGCEPTVSTYEIMVRMFCNEERMDMALQVWDQMKTRG 445

Query: 477 VLPGMHMFSTLINSLCHENKLECACKYFEEMLDLGIRPPATMFSNLKQALLDEGRQDTAL 536
           VLPGMHMF+TLINSLCH NKL+ ACKYF+EMLD GIRPPA +FSNLKQ+LLD GR+D  +
Sbjct: 446 VLPGMHMFATLINSLCHGNKLDDACKYFQEMLDAGIRPPAQLFSNLKQSLLDGGRKDVVI 505

Query: 537 LLVEKLDRLRKAPLHG 551
            L  K+DRLRK PL G
Sbjct: 506 SLGLKIDRLRKTPLVG 521

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP112_ARATH5.3e-19468.94Pentatricopeptide repeat-containing protein At1g71060, mitochondrial OS=Arabidop... [more]
PP129_ARATH1.5e-10842.04Pentatricopeptide repeat-containing protein At1g77360, mitochondrial OS=Arabidop... [more]
PP293_ARATH4.5e-6829.20Pentatricopeptide repeat-containing protein At3g62470, mitochondrial OS=Arabidop... [more]
PP294_ARATH1.0e-6729.20Pentatricopeptide repeat-containing protein At3g62540, mitochondrial OS=Arabidop... [more]
PP382_ARATH1.7e-6729.18Pentatricopeptide repeat-containing protein At5g14820, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KUY5_CUCSA1.6e-25078.73Uncharacterized protein OS=Cucumis sativus GN=Csa_5G571490 PE=4 SV=1[more]
F6HFJ5_VITVI4.5e-20068.69Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g04690 PE=4 SV=... [more]
A0A061E7F9_THECC6.1e-19772.65Pentatricopeptide repeat-containing protein, putative isoform 1 OS=Theobroma cac... [more]
B9SYJ4_RICCO2.8e-19467.84Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
B9HW85_POPTR6.3e-19469.15Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s12490g PE=4 SV=2[more]
Match NameE-valueIdentityDescription
AT1G71060.13.0e-19568.94 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G77360.18.6e-11042.04 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G62470.12.5e-6929.20 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G62540.15.6e-6929.20 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G14820.19.6e-6929.18 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659072880|ref|XP_008467144.1|1.9e-25280.00PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial ... [more]
gi|449465535|ref|XP_004150483.1|2.3e-25078.73PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial ... [more]
gi|694330949|ref|XP_009356158.1|2.7e-20671.17PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial-... [more]
gi|645276551|ref|XP_008243338.1|2.5e-20466.07PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial ... [more]
gi|657993866|ref|XP_008389228.1|2.5e-20470.36PREDICTED: pentatricopeptide repeat-containing protein At1g71060, mitochondrial-... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G021980.1CmoCh04G021980.1mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 208..229
score: 0.0095coord: 166..195
score: 0.011coord: 375..404
score: 8.
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 473..505
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 301..348
score: 1.2E-14coord: 406..456
score: 1.9E-11coord: 236..277
score: 3.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 236..268
score: 1.7E-4coord: 446..477
score: 8.7E-7coord: 270..303
score: 7.1E-5coord: 304..337
score: 6.5E-9coord: 340..371
score: 2.2E-5coord: 375..407
score: 0.002coord: 481..512
score: 2.9E-7coord: 166..195
score: 1.4E-4coord: 410..442
score: 3.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 337..371
score: 10.183coord: 478..512
score: 10.698coord: 443..477
score: 11.268coord: 407..442
score: 9.986coord: 267..301
score: 10.139coord: 197..231
score: 9.514coord: 372..406
score: 9.942coord: 302..336
score: 12.858coord: 513..547
score: 5.799coord: 232..266
score: 9.734coord: 163..193
score:
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 311..538
score: 1.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 75..542
score: 4.0E-293coord: 4..49
score: 4.0E
NoneNo IPR availablePANTHERPTHR24015:SF261SUBFAMILY NOT NAMEDcoord: 75..542
score: 4.0E-293coord: 4..49
score: 4.0E

The following gene(s) are paralogous to this gene:

None