CmoCh04G002870 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G002870
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cmo_Chr04 : 1444475 .. 1447597 (+)
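
As a quick check on the location above, the span of the feature can be computed from its coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this). The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length in base pairs of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = span_length(1444475, 1447597)
print(gene_length)            # 3123
print(gene_length % 3 == 0)   # True: the span is a whole number of codons
```

Under that assumption the feature spans 3123 bp, which is a multiple of three, so the span could hold an uninterrupted reading frame.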
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTAATGCTATGTGCTTGATCCGGCAAATGGCTGCGATCTCGCACCCTCGTAGAAATCTATGTAGTTTTCCTGTCCAAAATACCAATTTTCCCCTTATCGCGAATGATGTTTGTACCCAATTTATTTTCTTCTCTACCGCTCACCCATATGATCACAACGACGACACCGTTCGCGAAATCTCCACGATTCTGAAGCTTAGCGATTGGCAGGTCGTCTTGGACAATCAGAATAGTTTGAAGAAGCTAAACCCAGAAATCGTCCGCTCTGTTTTGCAGAAGAATGAAATCAACGACCCTGTACGGCTTCAAAGTTTCTTCTATTGGTCGAGTTCGAGAATGGGCACCCCACAAAACTTGCATTCTTATTCAATTCTTGCGATTCGTCTTTGTAACTCTGGGCTTTTTCCCCGTGCCGATAACATGTTTGAGAAAATGCTTGAGACCCGTAAGCCGCCATTGGAGATTTTGGATTCCTTGGTTAAGTGCTATAGAGAATGCGGTGGATCTAACTTGATTGTTTTTGATATTTTGGTTGATAACTTTAGGAAGTTTGGTTTTCTGAATGAGGCCTGTAGTGTTTTTCTAGCTTCCATTAGTGGTGGGTTCTTTCCCAGCTTGATATGCTGTAATAGTTTGATGAGGGATTTGTTGAAGGGTAAAATGATGGGATTGTTTTGGAAGGTGTATGGTGGTATGGTGGAGGCCAAGATAGTCCCTGATGTTTATACATACACCAATGTGATCAATGCACATTGTAAAGTTGGTGATGTTATGAAGGGTAGGATGGTTCTCTCTGAGATGGAGGAGAAGGGATGTAAACCTAATTTGGTCACCTACAATGTAGTTATTGGGGGTCTATGTCGGACCGGAGATGTCAATGAAGCTTTAGAGGTAAAGAAGTTGATGATGGAGAAGGGGTTGGTTCCGGATGGCTTTACTTATTCTATACTCATTGATGGGTTTTGCAAACAGAAGAGATCAGAAGAAGCAAAATTGATATTGGAAAGTATGCTTGGTTCAGGTTTAAATCCTAACCATATTACCTACACTGCTTTGATTGATGGGTTCATGAAACAAGGGAATATTGAAGAGGCATTAAGGATCAAAGACGAGATGGTCACTCGGGGACTTAAGTTGAACATTGTAACTTATAACACATTGATCAGGGGCATTGCTAAGGCTGGTGAGATGGAGAAAGCAATGGCTCTTGTTAATGAGATGTTTATAACTGGCATAGAATTGGATACTCAGACCTACGACTTGTTAATTGATGGATATTTGAAATCTCACAATAAGGACAAAGCTTATGAGCTACTAGCTGAGATGAAAGCAAGGAATTTGATGCCATCGTTGTACACTTATAGCGTGCTTATTAATGGTCTATGTCGTTCCCGTGAGCTGCCAAAGGCTAATGAAGTTTTGGAGCACATGATCAGCCACGGAGTGAAACCGAATGCTGTTATATATGCTACCCTGATCAATGCTAATGTCCAAGAAAGTAGATATGAAGGTGCAAAAGAAGTACTAAAAGGGATGGTAAAAAATGGGGTCGTACCGGATTTATTTTGCTATAATTCTCTTATAATTGGTCTTTGCAGGGCAAAAAGGGTGGAAGAAGCTAAAATGATGTTTGTTGAAATGGGTGAGAAAGGAATAAAGCCCAATGCATATACTTACGGAGCATTTATTCATTTATATTGTAAAACAGGTGAAATCCAAGTAGCAGAGAGGTATTTCCAAGACATGTTATCTTCACGTATAGTTCCTAACAATATAATCTATACTGCACTGATTGATGGGCATTGCAATGTCGGAAACACAGTAGAAGCTTTGTCAACTTTCAAGTGCATGCTCGAGAAAGGATTGATTCCTGATGTTCAAACATACGGTGCACTGATTCACGGTCTCTCCAAGAATGGGAAAACCGAAGAAGCAATGGTGGTTTTCTCTGAATACCTCGACAAGGGCTTGGTGCCGGACGTTTTTATATACAACTC
TCTTATATCTGGTTTCTGCAAGAAAGGTGAAATTGAGAAGGCATCCCAACTTTATGAAGAGATGCTTCTCAAGGGACCTAATCCCAACATTGTCATATACAATACCCTGATTAACGGACTGTGCAAGCTTGGTGAGATAAAGGATGCAAGGGAACTTTTTGACAAAATTGAAGGAAAAGGTTTGGTCCCTAATGTTGTGACTTATTCAATAATCATAGATGGATATTGCAAATCTGGAAACTTAACTGAGGCGTTTAACCTGTTCGATGAGATGATATCAAAAGGAGTTCCTCTTGACCGTCACATCTACTGTATCCTCATTGATGGTTGCTGCAAGCAAGGAAATTTGGAGAAGGCACTTTCGTTATTTCACGAAGCACTGCAGAAAAGTGTTGCTTCCCCTTCTGCTTTCAACTCTTTGATCGATGGTTTCTGCAAACTGGGAAAGTTGATTGAAGCTAGGGAGTTGTTCGATGATACGGTTGATAAACATGTGACACCGAATAGTGTGACGTACACAATTCTGGTCGATGCATACAGCAAAGCAGAAATGATGGAGGAGGCAGAGCAGCTTTTTCTAGATATGGGAACTAAAAATATCATGCCAAATACTCTTACGTATACTTCTCTTTTACTCGGTTATAATCGGATAGGACACAGAATTAAGATGATTTCTTTGTTCAAGGATATGGAAGCTAGGGGAATTGCTTGTGATGCAATTACCTACGGTGTGATGGCTGATGTCTACTGCAAGGAAGGAAATTCTCTTGAAGCCTTAAAGCTGCTCGACAAAAGCTTGGTTGAGGGTATAAAGTTGGATGGTGATGTGTTTGATGCATTAATATTTCACTTATGCAATGAAGGAAAAAATTCTACTATGCTGAAGCTACTCGGTGAAATGGCCGAAAAGAAACTCGCTCTTACCTCTACTACATGTACTGCTCTGTTGATTGGTTTTTACAAGGCAGGTAATGAAGACAAAGCTTTAGAGGTTCTTGACATTATGCAAAGGTTGGGGTGGGTTCCAGATTCTTTAAACGTAGTTGATTTAGTAAATGCTAGGAAAAACGATATGAATTCTGAAAGCTTCCCAAGTGATGCAATGCAAGTAGGGTCGGTGTAG

mRNA sequence

ATGGCTAATGCTATGTGCTTGATCCGGCAAATGGCTGCGATCTCGCACCCTCGTAGAAATCTATGTAGTTTTCCTGTCCAAAATACCAATTTTCCCCTTATCGCGAATGATGTTTGTACCCAATTTATTTTCTTCTCTACCGCTCACCCATATGATCACAACGACGACACCGTTCGCGAAATCTCCACGATTCTGAAGCTTAGCGATTGGCAGGTCGTCTTGGACAATCAGAATAGTTTGAAGAAGCTAAACCCAGAAATCGTCCGCTCTGTTTTGCAGAAGAATGAAATCAACGACCCTGTACGGCTTCAAAGTTTCTTCTATTGGTCGAGTTCGAGAATGGGCACCCCACAAAACTTGCATTCTTATTCAATTCTTGCGATTCGTCTTTGTAACTCTGGGCTTTTTCCCCGTGCCGATAACATGTTTGAGAAAATGCTTGAGACCCGTAAGCCGCCATTGGAGATTTTGGATTCCTTGGTTAAGTGCTATAGAGAATGCGGTGGATCTAACTTGATTGTTTTTGATATTTTGGTTGATAACTTTAGGAAGTTTGGTTTTCTGAATGAGGCCTGTAGTGTTTTTCTAGCTTCCATTAGTGGTGGGTTCTTTCCCAGCTTGATATGCTGTAATAGTTTGATGAGGGATTTGTTGAAGGGTAAAATGATGGGATTGTTTTGGAAGGTGTATGGTGGTATGGTGGAGGCCAAGATAGTCCCTGATGTTTATACATACACCAATGTGATCAATGCACATTGTAAAGTTGGTGATGTTATGAAGGGTAGGATGGTTCTCTCTGAGATGGAGGAGAAGGGATGTAAACCTAATTTGGTCACCTACAATGTAGTTATTGGGGGTCTATGTCGGACCGGAGATGTCAATGAAGCTTTAGAGGTAAAGAAGTTGATGATGGAGAAGGGGTTGGTTCCGGATGGCTTTACTTATTCTATACTCATTGATGGGTTTTGCAAACAGAAGAGATCAGAAGAAGCAAAATTGATATTGGAAAGTATGCTTGGTTCAGGTTTAAATCCTAACCATATTACCTACACTGCTTTGATTGATGGGTTCATGAAACAAGGGAATATTGAAGAGGCATTAAGGATCAAAGACGAGATGGTCACTCGGGGACTTAAGTTGAACATTGTAACTTATAACACATTGATCAGGGGCATTGCTAAGGCTGGTGAGATGGAGAAAGCAATGGCTCTTGTTAATGAGATGTTTATAACTGGCATAGAATTGGATACTCAGACCTACGACTTGTTAATTGATGGATATTTGAAATCTCACAATAAGGACAAAGCTTATGAGCTACTAGCTGAGATGAAAGCAAGGAATTTGATGCCATCGTTGTACACTTATAGCGTGCTTATTAATGGTCTATGTCGTTCCCGTGAGCTGCCAAAGGCTAATGAAGTTTTGGAGCACATGATCAGCCACGGAGTGAAACCGAATGCTGTTATATATGCTACCCTGATCAATGCTAATGTCCAAGAAAGTAGATATGAAGGTGCAAAAGAAGTACTAAAAGGGATGGTAAAAAATGGGGTCGTACCGGATTTATTTTGCTATAATTCTCTTATAATTGGTCTTTGCAGGGCAAAAAGGGTGGAAGAAGCTAAAATGATGTTTGTTGAAATGGGTGAGAAAGGAATAAAGCCCAATGCATATACTTACGGAGCATTTATTCATTTATATTGTAAAACAGGTGAAATCCAAGTAGCAGAGAGGTATTTCCAAGACATGTTATCTTCACGTATAGTTCCTAACAATATAATCTATACTGCACTGATTGATGGGCATTGCAATGTCGGAAACACAGTAGAAGCTTTGTCAACTTTCAAGTGCATGCTCGAGAAAGGATTGATTCCTGATGTTCAAACATACGGTGCACTGATTCACGGTCTCTCCAAGAATGGGAAAACCGAAGAAGCAATGGTGGTTTTCTCTGAATACCTCGACAAGGGCTTGGTGCCGGACGTTTTTATATACAACTC
TCTTATATCTGGTTTCTGCAAGAAAGGTGAAATTGAGAAGGCATCCCAACTTTATGAAGAGATGCTTCTCAAGGGACCTAATCCCAACATTGTCATATACAATACCCTGATTAACGGACTGTGCAAGCTTGGTGAGATAAAGGATGCAAGGGAACTTTTTGACAAAATTGAAGGAAAAGGTTTGGTCCCTAATGTTGTGACTTATTCAATAATCATAGATGGATATTGCAAATCTGGAAACTTAACTGAGGCGTTTAACCTGTTCGATGAGATGATATCAAAAGGAGTTCCTCTTGACCGTCACATCTACTGTATCCTCATTGATGGTTGCTGCAAGCAAGGAAATTTGGAGAAGGCACTTTCGTTATTTCACGAAGCACTGCAGAAAAGTGTTGCTTCCCCTTCTGCTTTCAACTCTTTGATCGATGGTTTCTGCAAACTGGGAAAGTTGATTGAAGCTAGGGAGTTGTTCGATGATACGGTTGATAAACATGTGACACCGAATAGTGTGACGTACACAATTCTGGTCGATGCATACAGCAAAGCAGAAATGATGGAGGAGGCAGAGCAGCTTTTTCTAGATATGGGAACTAAAAATATCATGCCAAATACTCTTACGTATACTTCTCTTTTACTCGGTTATAATCGGATAGGACACAGAATTAAGATGATTTCTTTGTTCAAGGATATGGAAGCTAGGGGAATTGCTTGTGATGCAATTACCTACGGTGTGATGGCTGATGTCTACTGCAAGGAAGGAAATTCTCTTGAAGCCTTAAAGCTGCTCGACAAAAGCTTGGTTGAGGGTATAAAGTTGGATGGTGATGTGTTTGATGCATTAATATTTCACTTATGCAATGAAGGAAAAAATTCTACTATGCTGAAGCTACTCGGTGAAATGGCCGAAAAGAAACTCGCTCTTACCTCTACTACATGTACTGCTCTGTTGATTGGTTTTTACAAGGCAGGTAATGAAGACAAAGCTTTAGAGGTTCTTGACATTATGCAAAGGTTGGGGTGGGTTCCAGATTCTTTAAACGTAGTTGATTTAGTAAATGCTAGGAAAAACGATATGAATTCTGAAAGCTTCCCAAGTGATGCAATGCAAGTAGGGTCGGTGTAG

Coding sequence (CDS)

ATGGCTAATGCTATGTGCTTGATCCGGCAAATGGCTGCGATCTCGCACCCTCGTAGAAATCTATGTAGTTTTCCTGTCCAAAATACCAATTTTCCCCTTATCGCGAATGATGTTTGTACCCAATTTATTTTCTTCTCTACCGCTCACCCATATGATCACAACGACGACACCGTTCGCGAAATCTCCACGATTCTGAAGCTTAGCGATTGGCAGGTCGTCTTGGACAATCAGAATAGTTTGAAGAAGCTAAACCCAGAAATCGTCCGCTCTGTTTTGCAGAAGAATGAAATCAACGACCCTGTACGGCTTCAAAGTTTCTTCTATTGGTCGAGTTCGAGAATGGGCACCCCACAAAACTTGCATTCTTATTCAATTCTTGCGATTCGTCTTTGTAACTCTGGGCTTTTTCCCCGTGCCGATAACATGTTTGAGAAAATGCTTGAGACCCGTAAGCCGCCATTGGAGATTTTGGATTCCTTGGTTAAGTGCTATAGAGAATGCGGTGGATCTAACTTGATTGTTTTTGATATTTTGGTTGATAACTTTAGGAAGTTTGGTTTTCTGAATGAGGCCTGTAGTGTTTTTCTAGCTTCCATTAGTGGTGGGTTCTTTCCCAGCTTGATATGCTGTAATAGTTTGATGAGGGATTTGTTGAAGGGTAAAATGATGGGATTGTTTTGGAAGGTGTATGGTGGTATGGTGGAGGCCAAGATAGTCCCTGATGTTTATACATACACCAATGTGATCAATGCACATTGTAAAGTTGGTGATGTTATGAAGGGTAGGATGGTTCTCTCTGAGATGGAGGAGAAGGGATGTAAACCTAATTTGGTCACCTACAATGTAGTTATTGGGGGTCTATGTCGGACCGGAGATGTCAATGAAGCTTTAGAGGTAAAGAAGTTGATGATGGAGAAGGGGTTGGTTCCGGATGGCTTTACTTATTCTATACTCATTGATGGGTTTTGCAAACAGAAGAGATCAGAAGAAGCAAAATTGATATTGGAAAGTATGCTTGGTTCAGGTTTAAATCCTAACCATATTACCTACACTGCTTTGATTGATGGGTTCATGAAACAAGGGAATATTGAAGAGGCATTAAGGATCAAAGACGAGATGGTCACTCGGGGACTTAAGTTGAACATTGTAACTTATAACACATTGATCAGGGGCATTGCTAAGGCTGGTGAGATGGAGAAAGCAATGGCTCTTGTTAATGAGATGTTTATAACTGGCATAGAATTGGATACTCAGACCTACGACTTGTTAATTGATGGATATTTGAAATCTCACAATAAGGACAAAGCTTATGAGCTACTAGCTGAGATGAAAGCAAGGAATTTGATGCCATCGTTGTACACTTATAGCGTGCTTATTAATGGTCTATGTCGTTCCCGTGAGCTGCCAAAGGCTAATGAAGTTTTGGAGCACATGATCAGCCACGGAGTGAAACCGAATGCTGTTATATATGCTACCCTGATCAATGCTAATGTCCAAGAAAGTAGATATGAAGGTGCAAAAGAAGTACTAAAAGGGATGGTAAAAAATGGGGTCGTACCGGATTTATTTTGCTATAATTCTCTTATAATTGGTCTTTGCAGGGCAAAAAGGGTGGAAGAAGCTAAAATGATGTTTGTTGAAATGGGTGAGAAAGGAATAAAGCCCAATGCATATACTTACGGAGCATTTATTCATTTATATTGTAAAACAGGTGAAATCCAAGTAGCAGAGAGGTATTTCCAAGACATGTTATCTTCACGTATAGTTCCTAACAATATAATCTATACTGCACTGATTGATGGGCATTGCAATGTCGGAAACACAGTAGAAGCTTTGTCAACTTTCAAGTGCATGCTCGAGAAAGGATTGATTCCTGATGTTCAAACATACGGTGCACTGATTCACGGTCTCTCCAAGAATGGGAAAACCGAAGAAGCAATGGTGGTTTTCTCTGAATACCTCGACAAGGGCTTGGTGCCGGACGTTTTTATATACAACTC
TCTTATATCTGGTTTCTGCAAGAAAGGTGAAATTGAGAAGGCATCCCAACTTTATGAAGAGATGCTTCTCAAGGGACCTAATCCCAACATTGTCATATACAATACCCTGATTAACGGACTGTGCAAGCTTGGTGAGATAAAGGATGCAAGGGAACTTTTTGACAAAATTGAAGGAAAAGGTTTGGTCCCTAATGTTGTGACTTATTCAATAATCATAGATGGATATTGCAAATCTGGAAACTTAACTGAGGCGTTTAACCTGTTCGATGAGATGATATCAAAAGGAGTTCCTCTTGACCGTCACATCTACTGTATCCTCATTGATGGTTGCTGCAAGCAAGGAAATTTGGAGAAGGCACTTTCGTTATTTCACGAAGCACTGCAGAAAAGTGTTGCTTCCCCTTCTGCTTTCAACTCTTTGATCGATGGTTTCTGCAAACTGGGAAAGTTGATTGAAGCTAGGGAGTTGTTCGATGATACGGTTGATAAACATGTGACACCGAATAGTGTGACGTACACAATTCTGGTCGATGCATACAGCAAAGCAGAAATGATGGAGGAGGCAGAGCAGCTTTTTCTAGATATGGGAACTAAAAATATCATGCCAAATACTCTTACGTATACTTCTCTTTTACTCGGTTATAATCGGATAGGACACAGAATTAAGATGATTTCTTTGTTCAAGGATATGGAAGCTAGGGGAATTGCTTGTGATGCAATTACCTACGGTGTGATGGCTGATGTCTACTGCAAGGAAGGAAATTCTCTTGAAGCCTTAAAGCTGCTCGACAAAAGCTTGGTTGAGGGTATAAAGTTGGATGGTGATGTGTTTGATGCATTAATATTTCACTTATGCAATGAAGGAAAAAATTCTACTATGCTGAAGCTACTCGGTGAAATGGCCGAAAAGAAACTCGCTCTTACCTCTACTACATGTACTGCTCTGTTGATTGGTTTTTACAAGGCAGGTAATGAAGACAAAGCTTTAGAGGTTCTTGACATTATGCAAAGGTTGGGGTGGGTTCCAGATTCTTTAAACGTAGTTGATTTAGTAAATGCTAGGAAAAACGATATGAATTCTGAAAGCTTCCCAAGTGATGCAATGCAAGTAGGGTCGGTGTAG
BLAST of CmoCh04G002870 vs. Swiss-Prot
Match: PP442_ARATH (Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidopsis thaliana GN=At5g61990 PE=2 SV=1)

HSP 1 Score: 738.4 bits (1905), Expect = 1.1e-211
Identity = 394/970 (40.62%), Positives = 580/970 (59.79%), Query Frame = 1
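
The percentages on the line above appear to be the match counts divided by the aligned length, rounded to two decimals. The sketch below reproduces them under that assumption; the function name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the first HSP against PP442_ARATH.
identity = percent(394, 970)    # identical residues / aligned columns
positives = percent(580, 970)   # identical or conservatively substituted residues
print(identity, positives)      # 40.62 59.79
```

The same calculation applies to the other HSPs in this report, for example 255/836 for the PP437_ARATH match.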

Query: 56   DTVREISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMG 115
            D   EI+ ILK  +W+  L + N   ++NPE+V SVL+   ++DP +L SFF W  S+  
Sbjct: 33   DASAEIAGILKQENWRDTLVSSNLSIEINPEVVLSVLRSKRVDDPSKLLSFFNWVDSQKV 92

Query: 116  TPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNL--I 175
            T Q L S+S LA+ LCN G F +A ++ E+M+E   P  E+  S+V+C +E  G +   +
Sbjct: 93   TEQKLDSFSFLALDLCNFGSFEKALSVVERMIERNWPVAEVWSSIVRCSQEFVGKSDDGV 152

Query: 176  VFDILVDNFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGGM 235
            +F IL D +   G++ EA  VF +S+     P L  C  L+  LL+   + LFW VY GM
Sbjct: 153  LFGILFDGYIAKGYIEEAVFVFSSSMGLELVPRLSRCKVLLDALLRWNRLDLFWDVYKGM 212

Query: 236  VEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDV 295
            VE  +V DV TY  +I AHC+ G+V  G+ VL + E++     L              +V
Sbjct: 213  VERNVVFDVKTYHMLIIAHCRAGNVQLGKDVLFKTEKEFRTATL--------------NV 272

Query: 296  NEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTAL 355
            + AL++K+ M+ KGLVP  +TY +LIDG CK KR E+AK +L  M   G++ ++ TY+ L
Sbjct: 273  DGALKLKESMICKGLVPLKYTYDVLIDGLCKIKRLEDAKSLLVEMDSLGVSLDNHTYSLL 332

Query: 356  IDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGI 415
            IDG +K  N + A  +  EMV+ G+ +    Y+  I  ++K G MEKA AL + M  +G+
Sbjct: 333  IDGLLKGRNADAAKGLVHEMVSHGINIKPYMYDCCICVMSKEGVMEKAKALFDGMIASGL 392

Query: 416  ELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANE 475
                Q Y  LI+GY +  N  + YELL EMK RN++ S YTY  ++ G+C S +L  A  
Sbjct: 393  IPQAQAYASLIEGYCREKNVRQGYELLVEMKKRNIVISPYTYGTVVKGMCSSGDLDGAYN 452

Query: 476  VLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLC 535
            +++ MI+ G +PN VIY TLI   +Q SR+  A  VLK M + G+ PD+FCYNSLIIGL 
Sbjct: 453  IVKEMIASGCRPNVVIYTTLIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLS 512

Query: 536  RAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNI 595
            +AKR++EA+   VEM E G+KPNA+TYGAFI  Y +  E   A++Y ++M    ++PN +
Sbjct: 513  KAKRMDEARSFLVEMVENGLKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKV 572

Query: 596  IYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSEY 655
            + T LI+ +C  G  +EA S ++ M+++G++ D +TY  L++GL KN K ++A  +F E 
Sbjct: 573  LCTGLINEYCKKGKVIEACSAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREM 632

Query: 656  LDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEI 715
              KG+ PDVF Y  LI+GF K G ++KAS +++EM+ +G  PN++IYN L+ G C+ GEI
Sbjct: 633  RGKGIAPDVFSYGVLINGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEI 692

Query: 716  KDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCIL 775
            + A+EL D++  KGL PN VTY  IIDGYCKSG+L EAF LFDEM  KG+  D  +Y  L
Sbjct: 693  EKAKELLDEMSVKGLHPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKLKGLVPDSFVYTTL 752

Query: 776  IDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARE----LFDDTVD 835
            +DGCC+  ++E+A+++F    +   +S + FN+LI+   K GK     E    L D + D
Sbjct: 753  VDGCCRLNDVERAITIFGTNKKGCASSTAPFNALINWVFKFGKTELKTEVLNRLMDGSFD 812

Query: 836  KHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIK 895
            +   PN VTY I++D   K   +E A++LF  M   N+MP  +TYTSLL GY+++G R +
Sbjct: 813  RFGKPNDVTYNIMIDYLCKEGNLEAAKELFHQMQNANLMPTVITYTSLLNGYDKMGRRAE 872

Query: 896  MISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGDVFDALIF 955
            M  +F +  A GI  D I Y V+ + + KEG + +AL L+D+   +    DG        
Sbjct: 873  MFPVFDEAIAAGIEPDHIMYSVIINAFLKEGMTTKALVLVDQMFAKNAVDDG-------- 932

Query: 956  HLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIMQRLGWVP 1015
                                    L+ +TC ALL GF K G  + A +V++ M RL ++P
Sbjct: 933  ----------------------CKLSISTCRALLSGFAKVGEMEVAEKVMENMVRLQYIP 958

Query: 1016 DSLNVVDLVN 1020
            DS  V++L+N
Sbjct: 993  DSATVIELIN 958

BLAST of CmoCh04G002870 vs. Swiss-Prot
Match: PP437_ARATH (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 427.2 bits (1097), Expect = 5.2e-118
Identity = 255/836 (30.50%), Positives = 429/836 (51.32%), Query Frame = 1

Query: 70  WQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMGTPQNLHSYSILAIR 129
           W++ L ++   ++L    V  +L    I+DP     FF +     G   +  S+ IL   
Sbjct: 55  WEIALSSELVSRRLKTVHVEEILI-GTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHA 114

Query: 130 LCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNLIVFDILVDNFRKFGFLN 189
           L  + LF  A ++ + +L     P ++ + L  CY +C  S+   FD+L+ ++ +   + 
Sbjct: 115 LVKANLFWPASSLLQTLLLRALKPSDVFNVLFSCYEKCKLSSSSSFDLLIQHYVRSRRVL 174

Query: 190 EACSVFLASISG-GFFPSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNV 249
           +   VF   I+     P +   ++L+  L+K +  GL  +++  MV   I PDVY YT V
Sbjct: 175 DGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGV 234

Query: 250 INAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGL 309
           I + C++ D+ + + +++ ME  GC  N+V YNV+I GLC+   V EA+ +KK +  K L
Sbjct: 235 IRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDL 294

Query: 310 VPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQGNIEEALR 369
            PD  TY  L+ G CK +  E    +++ ML    +P+    ++L++G  K+G IEEAL 
Sbjct: 295 KPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALN 354

Query: 370 IKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTYDLLIDGYL 429
           +   +V  G+  N+  YN LI  + K  +  +A  L + M   G+  +  TY +LID + 
Sbjct: 355 LVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFC 414

Query: 430 KSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMISHGVKPNAV 489
           +    D A   L EM    L  S+Y Y+ LING C+  ++  A   +  MI+  ++P  V
Sbjct: 415 RRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVV 474

Query: 490 IYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEM 549
            Y +L+     + +   A  +   M   G+ P ++ + +L+ GL RA  + +A  +F EM
Sbjct: 475 TYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEM 534

Query: 550 GEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALIDGHCNVGNT 609
            E  +KPN  TY   I  YC+ G++  A  + ++M    IVP+   Y  LI G C  G  
Sbjct: 535 AEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQA 594

Query: 610 VEALSTFKCMLEKGLIP-DVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNS 669
            EA   F   L KG    +   Y  L+HG  + GK EEA+ V  E + +G+  D+  Y  
Sbjct: 595 SEA-KVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGV 654

Query: 670 LISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELFDKIEGKG 729
           LI G  K  + +    L +EM  +G  P+ VIY ++I+   K G+ K+A  ++D +  +G
Sbjct: 655 LIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEG 714

Query: 730 LVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCK-QGNLEKA 789
            VPN VTY+ +I+G CK+G + EA  L  +M       ++  Y   +D   K + +++KA
Sbjct: 715 CVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKA 774

Query: 790 LSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHVTPNSVTYTILVDAY 849
           + L +  L+  +A+ + +N LI GFC+ G++ EA EL    +   V+P+ +TYT +++  
Sbjct: 775 VELHNAILKGLLANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINEL 834

Query: 850 SKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMISLFKDMEARGI 903
            +   +++A +L+  M  K I P+ + Y +L+ G    G   K   L  +M  +G+
Sbjct: 835 CRRNDVKKAIELWNSMTEKGIRPDRVAYNTLIHGCCVAGEMGKATELRNEMLRQGL 888

BLAST of CmoCh04G002870 vs. Swiss-Prot
Match: RF1_ORYSI (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1)

HSP 1 Score: 392.1 bits (1006), Expect = 1.9e-107
Identity = 201/631 (31.85%), Positives = 350/631 (55.47%), Query Frame = 1

Query: 237 KIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEA 296
           ++ PD+ TY  +I   C+ G +  G   L  + +KG + + + +  ++ GLC     ++A
Sbjct: 82  EVTPDLCTYGILIGCCCRAGRLDLGFAALGNVIKKGFRVDAIAFTPLLKGLCADKRTSDA 141

Query: 297 LE-VKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESML---GSGLNPNHITYTA 356
           ++ V + M E G +P+ F+Y+IL+ G C + RS+EA  +L  M    G G  P+ ++YT 
Sbjct: 142 MDIVLRRMTELGCIPNVFSYNILLKGLCDENRSQEALELLHMMADDRGGGSPPDVVSYTT 201

Query: 357 LIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITG 416
           +I+GF K+G+ ++A     EM+ RG+  ++VTYN++I  + KA  M+KAM ++N M   G
Sbjct: 202 VINGFFKEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCKAQAMDKAMEVLNTMVKNG 261

Query: 417 IELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKAN 476
           +  D  TY+ ++ GY  S    +A   L +M++  + P + TYS+L++ LC++    +A 
Sbjct: 262 VMPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEAR 321

Query: 477 EVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGL 536
           ++ + M   G+KP    Y TL+     +        +L  MV+NG+ PD + ++ LI   
Sbjct: 322 KIFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYVFSILICAY 381

Query: 537 CRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNN 596
            +  +V++A ++F +M ++G+ PNA TYGA I + CK+G ++ A  YF+ M+   + P N
Sbjct: 382 AKQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPGN 441

Query: 597 IIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSE 656
           I+Y +LI G C       A      ML++G+  +   + ++I    K G+  E+  +F  
Sbjct: 442 IVYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFEL 501

Query: 657 YLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGE 716
            +  G+ P+V  YN+LI+G+C  G++++A +L   M+  G  PN V Y+TLING CK+  
Sbjct: 502 MVRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISR 561

Query: 717 IKDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCI 776
           ++DA  LF ++E  G+ P+++TY+II+ G  ++     A  L+  +   G  ++   Y I
Sbjct: 562 MEDALVLFKEMESSGVSPDIITYNIILQGLFQTRRTAAAKELYVRITESGTQIELSTYNI 621

Query: 777 LIDGCCKQGNLEKALSLFHE-ALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKH 836
           ++ G CK    + AL +F    L         FN +ID   K+G+  EA++LF       
Sbjct: 622 ILHGLCKNKLTDDALQMFQNLCLMDLKLEARTFNIMIDALLKVGRNDEAKDLFVAFSSNG 681

Query: 837 VTPNSVTYTILVDAYSKAEMMEEAEQLFLDM 863
           + PN  TY ++ +      ++EE +QLFL M
Sbjct: 682 LVPNYWTYRLMAENIIGQGLLEELDQLFLSM 712

BLAST of CmoCh04G002870 vs. Swiss-Prot
Match: PP432_ARATH (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 387.9 bits (995), Expect = 3.5e-106
Identity = 245/866 (28.29%), Positives = 417/866 (48.15%), Query Frame = 1

Query: 156  ILDSLVKCYRECGGSNLIVFDILVDNFRKFGFLNEACSVFLASISGGFFPSLICCNSLMR 215
            +  +L+  YR C  SN  V+DIL+  + + G + ++  +F      GF PS+  CN+++ 
Sbjct: 108  VFGALMTTYRLCN-SNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILG 167

Query: 216  DLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKP 275
             ++K       W     M++ KI PDV T+  +IN  C  G   K   ++ +ME+ G  P
Sbjct: 168  SVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAP 227

Query: 276  NLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLIL 335
             +VTYN V+   C+ G    A+E+   M  KG+  D  TY++LI   C+  R  +  L+L
Sbjct: 228  TIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 287

Query: 336  ESMLGSGLNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKA 395
              M    ++PN +TY  LI+GF  +G +  A ++ +EM++ GL  N VT+N LI G    
Sbjct: 288  RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 347

Query: 396  GEMEKAMALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTY 455
            G  ++A+ +   M   G+     +Y +L+DG  K+   D A      MK   +     TY
Sbjct: 348  GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 407

Query: 456  SVLINGLCRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVK 515
            + +I+GLC++  L +A  +L  M   G+ P+ V Y+ LIN   +  R++ AKE++  + +
Sbjct: 408  TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 467

Query: 516  NGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQV 575
             G+ P+   Y++LI   CR   ++EA  ++  M  +G   + +T+   +   CK G++  
Sbjct: 468  VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 527

Query: 576  AERYFQDMLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIH 635
            AE + + M S  I+PN + +  LI+G+ N G  ++A S F  M + G  P   TYG+L+ 
Sbjct: 528  AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 587

Query: 636  GLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNP 695
            GL K G   EA              D  +YN+L++  CK G + KA  L+ EM+ +   P
Sbjct: 588  GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 647

Query: 696  NIVIYNTLINGLCKLGEIKDARELFDKIEGKG-LVPNVVTYSIIIDGYCKSGNLTEAFNL 755
            +   Y +LI+GLC+ G+   A     + E +G ++PN V Y+  +DG  K+G        
Sbjct: 648  DSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYF 707

Query: 756  FDEMISKGVPLDRHIYCILIDGCCKQGNLEKALSLFHE-ALQKSVASPSAFNSLIDGFCK 815
             ++M + G   D      +IDG  + G +EK   L  E   Q    + + +N L+ G+ K
Sbjct: 708  REQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSK 767

Query: 816  LGKLIEARELFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLT 875
               +  +  L+   +   + P+ +T   LV    ++ M+E   ++      + +  +  T
Sbjct: 768  RKDVSTSFLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGVEVDRYT 827

Query: 876  YTSLLLGYNRIGHRIKMISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSL 935
            +  L+      G       L K M + GI+ D  T   M  V  +     E+  +L +  
Sbjct: 828  FNMLISKCCANGEINWAFDLVKVMTSLGISLDKDTCDAMVSVLNRNHRFQESRMVLHEMS 887

Query: 936  VEGIKLDGDVFDALIFHLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNED 995
             +GI  +   +  LI  LC  G   T   +  EM   K+   +   +A++    K G  D
Sbjct: 888  KQGISPESRKYIGLINGLCRVGDIKTAFVVKEEMIAHKICPPNVAESAMVRALAKCGKAD 947

Query: 996  KALEVLDIMQRLGWVPDSLNVVDLVN 1020
            +A  +L  M ++  VP   +   L++
Sbjct: 948  EATLLLRFMLKMKLVPTIASFTTLMH 972

BLAST of CmoCh04G002870 vs. Swiss-Prot
Match: PP325_ARATH (Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidopsis thaliana GN=At4g19440 PE=2 SV=2)

HSP 1 Score: 364.8 bits (935), Expect = 3.2e-99
Identity = 220/702 (31.34%), Positives = 353/702 (50.28%), Query Frame = 1

Query: 99  DPVRLQSFFYWSSSRMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPL---- 158
           +P     FF  +S       +L SY +L   L ++ L   A  +  +++    P L    
Sbjct: 118 NPKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGL 177

Query: 159 --------EILDSLVKCYRECGGSNL--IVFDILVDNFRKFGFLNEACSVFLASISGGFF 218
                   + + SL  C+ E     +  ++ ++    F++ G    A  VF    + G F
Sbjct: 178 RDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYL-ALDVFPVLANKGMF 237

Query: 219 PSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMV 278
           PS   CN L+  L++        + +  + +  + PDVY +T  INA CK G V +   +
Sbjct: 238 PSKTTCNILLTSLVRANEFQKCCEAFDVVCKG-VSPDVYLFTTAINAFCKGGKVEEAVKL 297

Query: 279 LSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCK 338
            S+MEE G  PN+VT+N VI GL   G  +EA   K+ M+E+G+ P   TYSIL+ G  +
Sbjct: 298 FSKMEEAGVAPNVVTFNTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTR 357

Query: 339 QKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVT 398
            KR  +A  +L+ M   G  PN I Y  LID F++ G++ +A+ IKD MV++GL L   T
Sbjct: 358 AKRIGDAYFVLKEMTKKGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSST 417

Query: 399 YNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMK 458
           YNTLI+G  K G+ + A  L+ EM   G  ++  ++  +I         D A   + EM 
Sbjct: 418 YNTLIKGYCKNGQADNAERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEML 477

Query: 459 ARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYE 518
            RN+ P     + LI+GLC+  +  KA E+    ++ G   +      L++   +  + +
Sbjct: 478 LRNMSPGGGLLTTLISGLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLD 537

Query: 519 GAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFI 578
            A  + K ++  G V D   YN+LI G C  K+++EA M   EM ++G+KP+ YTY   I
Sbjct: 538 EAFRIQKEILGRGCVMDRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILI 597

Query: 579 HLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLI 638
                  +++ A +++ D   + ++P+   Y+ +IDG C    T E    F  M+ K + 
Sbjct: 598 CGLFNMNKVEEAIQFWDDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQ 657

Query: 639 PDVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQL 698
           P+   Y  LI    ++G+   A+ +  +   KG+ P+   Y SLI G      +E+A  L
Sbjct: 658 PNTVVYNHLIRAYCRSGRLSMALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLL 717

Query: 699 YEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELFDKIEGKGLVPNVVTYSIIIDGYCK 758
           +EEM ++G  PN+  Y  LI+G  KLG++     L  ++  K + PN +TY+++I GY +
Sbjct: 718 FEEMRMEGLEPNVFHYTALIDGYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYAR 777

Query: 759 SGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCKQGNLEKA 787
            GN+TEA  L +EM  KG+  D   Y   I G  KQG + +A
Sbjct: 778 DGNVTEASRLLNEMREKGIVPDSITYKEFIYGYLKQGGVLEA 817

BLAST of CmoCh04G002870 vs. TrEMBL
Match: A0A0A0KPZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G175760 PE=4 SV=1)

HSP 1 Score: 1583.9 bits (4100), Expect = 0.0e+00
Identity = 781/1028 (75.97%), Positives = 892/1028 (86.77%), Query Frame = 1

Query: 1    MANAMCLIRQMAAISHPRRNLCSFPVQNTNFPLIANDVCTQFIFFSTAHPYDHNDDTVRE 60
            MANA+CLIRQ+AA S PRR L +FP Q T+FP I N+V   F+FFST +P+DH DDTVRE
Sbjct: 1    MANALCLIRQIAANSSPRRILSTFPFQTTSFPQIWNNVSIHFMFFSTNNPFDHYDDTVRE 60

Query: 61   ISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMGTPQNL 120
             S ILK  DWQ++L+N+++++KLNPEIV SVLQK+EI+D VRLQ+FFYWSSS+M TPQ L
Sbjct: 61   FSMILKRKDWQILLNNEDNVRKLNPEIVCSVLQKSEIDDSVRLQNFFYWSSSKMSTPQYL 120

Query: 121  HSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNLIVFDILVD 180
            HSYSILAIRLCNSGL  +ADNM EK+L+TRKPPLEILDSLV+CYRE GGSNL VFDI +D
Sbjct: 121  HSYSILAIRLCNSGLIHQADNMLEKLLQTRKPPLEILDSLVRCYREFGGSNLTVFDIFID 180

Query: 181  NFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVP 240
             FR  GFLNEA SVF+ASIS GFFP+LICCN+LMRDLLK  MMGLFWKVYG MVEAKIVP
Sbjct: 181  KFRVLGFLNEASSVFIASISEGFFPTLICCNNLMRDLLKANMMGLFWKVYGSMVEAKIVP 240

Query: 241  DVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVK 300
            DVYTYTNVI AHCKVGDV+KG+MVLSEME K CKPNL TYN  IGGLC+TG V+EALEVK
Sbjct: 241  DVYTYTNVIKAHCKVGDVIKGKMVLSEME-KECKPNLFTYNAFIGGLCQTGAVDEALEVK 300

Query: 301  KLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQ 360
            KLMMEKGL PDG TY++L+DGFCKQKRS+EAKLI ESM  SGLNPN  TYTALIDGF+K+
Sbjct: 301  KLMMEKGLGPDGHTYTLLVDGFCKQKRSKEAKLIFESMPSSGLNPNRFTYTALIDGFIKE 360

Query: 361  GNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTY 420
            GNIEEALRIKDEM+TRGLKLN+VTYN +I GIAKAGEM KAM+L NEM + G+E DT TY
Sbjct: 361  GNIEEALRIKDEMITRGLKLNVVTYNAMIGGIAKAGEMAKAMSLFNEMLMAGLEPDTWTY 420

Query: 421  DLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMIS 480
            +LLIDGYLKSH+  KA ELLAEMKAR L PS +TYSVLI+GLC S +L KANEVL+ MI 
Sbjct: 421  NLLIDGYLKSHDMAKACELLAEMKARKLTPSPFTYSVLISGLCHSSDLQKANEVLDQMIR 480

Query: 481  HGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEE 540
            +GVKPN  +Y TLI A VQESRYE A E+LK M+ NGV+PDLFCYN LIIGLCRAK+VEE
Sbjct: 481  NGVKPNVFMYGTLIKAYVQESRYEMAIELLKIMIANGVLPDLFCYNCLIIGLCRAKKVEE 540

Query: 541  AKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALID 600
            AKM+ V+MGEKGIKPNA+TYGAFI+LY K+GEIQVAERYF+DMLSS IVPNN+IYT LI 
Sbjct: 541  AKMLLVDMGEKGIKPNAHTYGAFINLYSKSGEIQVAERYFKDMLSSGIVPNNVIYTILIK 600

Query: 601  GHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVP 660
            GHC+VGNTVEALSTFKCMLEKGLIPD++ Y A+IH LSKNGKT+EAM VF ++L  G+VP
Sbjct: 601  GHCDVGNTVEALSTFKCMLEKGLIPDIRAYSAIIHSLSKNGKTKEAMGVFLKFLKTGVVP 660

Query: 661  DVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELF 720
            DVF+YNSLISGFCK+G+IEKASQLY+EML  G NPNIV+YNTLINGLCKLGE+  ARELF
Sbjct: 661  DVFLYNSLISGFCKEGDIEKASQLYDEMLHNGINPNIVVYNTLINGLCKLGEVTKARELF 720

Query: 721  DKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCKQ 780
            D+IE K LVP+VVTYS IIDGYCKSGNLTEAF LFDEMISKG+  D +IYCILIDGC K+
Sbjct: 721  DEIEEKDLVPDVVTYSTIIDGYCKSGNLTEAFKLFDEMISKGISPDGYIYCILIDGCGKE 780

Query: 781  GNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHVTPNSVTYT 840
            GNLEKALSLFHEA QKSV S SAFNSLID FCK GK+IEARELFDD VDK +TPN VTYT
Sbjct: 781  GNLEKALSLFHEAQQKSVGSLSAFNSLIDSFCKHGKVIEARELFDDMVDKKLTPNIVTYT 840

Query: 841  ILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMISLFKDMEAR 900
            IL+DAY KAEMMEEAEQLFLDM T+NI+PNTLTYTSLLL YN+IG+R KMISLFKDMEAR
Sbjct: 841  ILIDAYGKAEMMEEAEQLFLDMETRNIIPNTLTYTSLLLSYNQIGNRFKMISLFKDMEAR 900

Query: 901  GIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGDVFDALIFHLCNEGKNSTM 960
            GIACDAI YGVMA  YCKEG SLEALKLL+KSLVEGIKL+ DVFDALIFHLC E + ST+
Sbjct: 901  GIACDAIAYGVMASAYCKEGKSLEALKLLNKSLVEGIKLEDDVFDALIFHLCKEKQISTV 960

Query: 961  LKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIMQRLGWVPDSLNVVDLVNA 1020
            L+LL EM +++L+L+S TC  LL+GFYK+GNED+A +VL +MQRLGWVP SL++ D ++ 
Sbjct: 961  LELLSEMGKEELSLSSKTCNTLLLGFYKSGNEDEASKVLGVMQRLGWVPTSLSLTDSIST 1020

Query: 1021 RKNDMNSE 1029
             ++DM S+
Sbjct: 1021 GRDDMKSD 1027

BLAST of CmoCh04G002870 vs. TrEMBL
Match: V4TAC8_9ROSI (Uncharacterized protein (Fragment) OS=Citrus clementina GN=CICLE_v10033858mg PE=4 SV=1)

HSP 1 Score: 1110.9 bits (2872), Expect = 0.0e+00
Identity = 535/987 (54.20%), Positives = 713/987 (72.24%), Query Frame = 1

Query: 45   FSTAHPYDH-NDDTVREISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRL 104
            FST+    H N++  +EI+  L  + W+ ++++     KLNP++V+SVLQ + +NDP RL
Sbjct: 3    FSTSQTSLHSNEEAAKEITNFLNENHWESLIESSKLRNKLNPDVVQSVLQHSHVNDPKRL 62

Query: 105  QSFFYWSSSRMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKC 164
              FF W+S+++G P NLHS+S LA+ LCNS LF  A  + ++M+ TR+   +IL+S + C
Sbjct: 63   LGFFNWTSTQLGIPPNLHSFSYLAMMLCNSRLFGAASGVIDRMIATRRSSYQILESFLMC 122

Query: 165  YRECGGSNLIVFDILVDNFRKFGFLNEACSVFLASIS-GGFFPSLICCNSLMRDLLKGKM 224
            YRE   S  +VF++L+D +RK GFL++A  VF   +  GG  P L+CCNS++ DLL+   
Sbjct: 123  YRERNVSGGVVFEMLIDGYRKIGFLDDAAIVFFGVVKDGGSVPGLLCCNSILNDLLRANK 182

Query: 225  MGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNV 284
            + LFWKVY  M+EAK+ PDVYTYT++INAH + G+V   + VL EMEEKGC P+LVTYNV
Sbjct: 183  LKLFWKVYDVMLEAKVTPDVYTYTSLINAHFRAGNVKAAQRVLFEMEEKGCCPSLVTYNV 242

Query: 285  VIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSG 344
            VIGGLCR G ++EA E+K+ M+ KGLVPD FTYS+++DGFCK KR E+AKL+L+ M    
Sbjct: 243  VIGGLCRVGAIDEAFELKESMIHKGLVPDCFTYSLMVDGFCKNKRLEDAKLLLKKMYDLK 302

Query: 345  LNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAM 404
            LNPN + YT LI+GFMKQGN++EA R+K+EMVT G+KLN+ TYN LI GI KAGE+EKA 
Sbjct: 303  LNPNEVVYTTLINGFMKQGNLQEAFRLKNEMVTFGIKLNLFTYNALIGGICKAGEIEKAK 362

Query: 405  ALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGL 464
             L+ EM   GI  DTQTY+ LI+G  + +N  KAYELL +MK RNL P+ YT +V+INGL
Sbjct: 363  GLMTEMLRLGINPDTQTYNSLIEGCYRENNMAKAYELLVDMKKRNLSPTAYTCNVIINGL 422

Query: 465  CRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDL 524
            CR  +L  A  V E MI+ G+KPN  +Y TL+ A+++++R+E A  +LKGM   GV+PD+
Sbjct: 423  CRCSDLEGACRVFEEMIACGLKPNNFVYTTLVQAHLRQNRFEEAINILKGMTGKGVLPDV 482

Query: 525  FCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQD 584
            FCYNSLI GLC+AK++E+A+   VEM   G+KPN YTYGAFI  Y KTG +Q A+RYFQ+
Sbjct: 483  FCYNSLISGLCKAKKMEDARNCLVEMTVNGLKPNLYTYGAFIREYTKTGNMQAADRYFQE 542

Query: 585  MLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGK 644
            ML+  I PN+IIYT LIDGHC  GN  EA STF+CML +G++PD++TY  LIHGLS+ GK
Sbjct: 543  MLNCGIAPNDIIYTTLIDGHCKEGNVKEAFSTFRCMLGRGILPDLKTYSVLIHGLSRCGK 602

Query: 645  TEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNT 704
              EA+ VFSE  DKGLVPDV  Y+SLISGFCK+G I++A QL+E+M   G  PNIV YN 
Sbjct: 603  IHEALEVFSELQDKGLVPDVITYSSLISGFCKQGFIKEAFQLHEKMCESGITPNIVTYNA 662

Query: 705  LINGLCKLGEIKDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKG 764
            LI+GLCK GE++ ARELFD I  KGL P VVTY+ IIDGYCKSGNLTEAF L +EM S+G
Sbjct: 663  LIDGLCKSGELERARELFDGIFAKGLTPTVVTYTTIIDGYCKSGNLTEAFQLVNEMPSRG 722

Query: 765  VPLDRHIYCILIDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARE 824
            V  D  +YC L+DGCC+ GN+EKALSLF E +QK +AS S+FN+L++G CK  K+ EA +
Sbjct: 723  VTPDNFVYCTLVDGCCRDGNMEKALSLFLEMVQKGLASTSSFNALLNGLCKSQKIFEANK 782

Query: 825  LFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYN 884
            L +D  DKH+TPN VTYTIL+D + KA  M++AE L ++M  + + PN  TYTSLL GY 
Sbjct: 783  LLEDMADKHITPNHVTYTILIDYHCKAGTMKDAEHLLVEMQKRVLKPNFRTYTSLLHGYA 842

Query: 885  RIGHRIKMISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGD 944
             IG R +M +LF +M  RG+  D + Y +M D Y KEGN ++ +KL+D+  + G+ L+ +
Sbjct: 843  GIGKRSEMFALFDEMVERGVEPDGVIYSMMVDAYLKEGNMMKTIKLVDEMFLRGLVLNQN 902

Query: 945  VFDALIFHLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIM 1004
            V+ +L   LC E +   +LKLL EM +K++ L+  TC  L+   Y+AGN DKA   L+ M
Sbjct: 903  VYTSLANSLCKEEEFYKVLKLLDEMGDKEIKLSHATCCILISSVYEAGNIDKATRFLESM 962

Query: 1005 QRLGWVPDSLNVVDLVNARKNDMNSES 1030
             + GWV DS  ++DLV   +ND NSE+
Sbjct: 963  IKFGWVADSTVMMDLVKQDQNDANSEN 989
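The percentages in each HSP statistics line follow directly from the counts beside them: matching (or similar) residues divided by the alignment length. The sketch below re-derives them from one line of this report as a sanity check. The helper name `recompute_percentages` and the regular expression are illustrative assumptions, not part of any BLAST toolkit.

```python
import re

# Hypothetical helper for illustration only; not part of BLAST or any library.
# The label before the second ratio is matched loosely with \w+ because its
# spelling can differ between tools and report renderers.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), \w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def recompute_percentages(stats_line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % stats_line)
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    # Each percentage is count / alignment length, rounded to two decimals.
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


# Statistics line of the V4TAC8_9ROSI hit above.
line = "Identity = 535/987 (54.20%), Positives = 713/987 (72.24%), Query Frame = 1"
identity_pct, positives_pct = recompute_percentages(line)
```

For the line shown, the recomputed values agree with the reported 54.20% and 72.24%.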

BLAST of CmoCh04G002870 vs. TrEMBL
Match: A0A061G0B2_THECC (Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_014744 PE=4 SV=1)

HSP 1 Score: 1088.2 bits (2813), Expect = 0.0e+00
Identity = 523/985 (53.10%), Positives = 707/985 (71.78%), Query Frame = 1

Query: 54   NDDTVREISTILKLSDWQVVLDNQNSLK-KLNPEIVRSVLQKNEINDPVRLQSFFYWSSS 113
            ND    EI+ IL+  DW+ +L+  + LK KLNPE V S+L ++ + DP RL +FF W+  
Sbjct: 31   NDAAAEEIAAILEKKDWKRLLETTSELKNKLNPETVHSILHQSSVRDPKRLFNFFNWAIH 90

Query: 114  RMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNL 173
            ++  PQNL S+S LAI LCNS LF  A+ + +KM++TR+P   +L S+++CY+E  G++ 
Sbjct: 91   QVPNPQNLDSFSFLAIILCNSKLFRDANMVLDKMVQTRRPVQAVLASIIRCYKEYKGNDA 150

Query: 174  IVFDILVDNFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGG 233
             VF+IL+D ++K G  N A  VFL +  GGF P L+CCN+ + DL+K   + LFWKV+ G
Sbjct: 151  GVFEILIDCYKKVGSWNNAVYVFLGAKEGGFLPGLVCCNNFLGDLVKFNKLDLFWKVFDG 210

Query: 234  MVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGD 293
            MV+AK+VPDVYT+TNVINAHC+VGD+ K + V+ EMEEKGC P LVTYNV+IGGLCR G 
Sbjct: 211  MVDAKLVPDVYTFTNVINAHCRVGDIEKAKRVILEMEEKGCTPGLVTYNVMIGGLCRAGV 270

Query: 294  VNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTA 353
            V+EAL++KK M EKG  PD +TY+ LIDGFC++KR  EAKL++  M  +GLNPNH  YTA
Sbjct: 271  VDEALKLKKSMAEKGFAPDAYTYNTLIDGFCREKRFSEAKLMMTEMRRAGLNPNHFAYTA 330

Query: 354  LIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITG 413
            LIDG MKQGN+ E  R+KDEMV RG+KLN+ TYN LI G+ KAG++EKA AL NEM   G
Sbjct: 331  LIDGLMKQGNVVEGFRVKDEMVARGIKLNVFTYNALISGVCKAGDLEKAKALFNEMVWIG 390

Query: 414  IELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKAN 473
             E D QT+ +LI+ Y ++   DKAYELL EMK  NL P+LYTYS +INGLC   +L +AN
Sbjct: 391  AEPDAQTFSILIESYSRAKKIDKAYELLNEMKRSNLTPTLYTYSGIINGLCHCGDLERAN 450

Query: 474  EVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGL 533
             VL+ M+  G+KPN VIY  LI  ++Q+SR+E A+ +L  M++ GV+PD+ C N+LI GL
Sbjct: 451  HVLDAMVEGGLKPNLVIYTNLIKGHIQKSRFEEARRILDRMMEKGVLPDVICCNTLISGL 510

Query: 534  CRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNN 593
            C+A++++EA+   VEM ++G+KPNA+TYGAFIH Y K GEI+  ER F++M +  I PNN
Sbjct: 511  CKAQKMDEARSCLVEMVDRGLKPNAHTYGAFIHGYAKAGEIEAVERCFKEMQNYGIAPNN 570

Query: 594  IIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSE 653
            +IY+ LI+ HC  GN  EALST +CM E+G++PDV+TY  LIHGL+ NG+  +A  VFS+
Sbjct: 571  VIYSELINSHCKAGNVTEALSTLRCMSEQGVVPDVKTYTVLIHGLATNGRINDARDVFSQ 630

Query: 654  YLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGE 713
               KG+VPDVF Y SLISGFCK G+++ A  LY+EM  K   PNIV YNTLI GLCK G 
Sbjct: 631  LHGKGIVPDVFTYTSLISGFCKLGDMKAALNLYKEMCQKSIAPNIVTYNTLIGGLCKAGN 690

Query: 714  IKDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCI 773
            I+ AR++F++I  K L PN  +Y++IIDGYCKSGNLT+AF L DEM S+GVP D   YC 
Sbjct: 691  IEKARKVFNEISQKALAPNTKSYTMIIDGYCKSGNLTQAFQLLDEMPSRGVPPDSFAYCA 750

Query: 774  LIDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHV 833
            L+DGCCK+G LEKALSLF+E ++K  AS +AFN+LIDG CK GK  +A  L +D VDK +
Sbjct: 751  LVDGCCKEGKLEKALSLFYEMVRKGFASTTAFNALIDGLCKSGKPNDANGLLEDMVDKCI 810

Query: 834  TPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMIS 893
            TPN +TYTIL+D + KA  M+EAE LFL+M  +N++PNT+TYT LL GY+R+G R +M +
Sbjct: 811  TPNHITYTILIDHHCKAGEMKEAENLFLEMQRRNLVPNTVTYTLLLHGYDRLGRRAEMFA 870

Query: 894  LFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGDVFDALIFHLC 953
            LF+ M A  +  D I YG+M + + KE N +  LKLLD+ LV+ + LD      L+  +C
Sbjct: 871  LFERMAANAVEPDEIIYGLMTNAHLKENNLIGNLKLLDEILVKDVVLDQKWSSLLLDAVC 930

Query: 954  NEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIMQRLGWVPDSL 1013
               + S ++K L EMAE+ L L+  TC  L+  F+  G+ +KA ++L+ + + GWVP+S 
Sbjct: 931  KREEFSEVVKFLDEMAEQGLRLSPVTCHKLVRSFHDKGSLEKAEQILESLVQFGWVPNST 990

Query: 1014 NVVDLVNARKNDMNSESFPSDAMQV 1038
            +V  +++   +D NSES  + + QV
Sbjct: 991  SVHSIIHKDHDDANSESPGNFSKQV 1015

BLAST of CmoCh04G002870 vs. TrEMBL
Match: B9RA74_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1503920 PE=4 SV=1)

HSP 1 Score: 1076.2 bits (2782), Expect = 0.0e+00
Identity = 531/964 (55.08%), Positives = 697/964 (72.30%), Query Frame = 1

Query: 45   FSTAHPYDHNDDTVREISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQ 104
            FST    + +D+TV+EI+++LK  +WQ ++++     KLNP++V  V+++N++ DP RL 
Sbjct: 32   FSTNADTNQSDNTVKEITSLLKQKNWQFLIESSPLPNKLNPDVVFLVIKQNQVIDPKRLH 91

Query: 105  SFFYWSSSRMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCY 164
             FF W +SR    QNL ++SIL++ LCNSGLF  A N+ E+M++TR P ++ILDS++KCY
Sbjct: 92   GFFNWVNSRTVFSQNLSTFSILSLILCNSGLFGNAANVLERMIDTRNPHVKILDSIIKCY 151

Query: 165  RECGGSN----LIVFDILVDNFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKG 224
            +E  GS+    ++VF+IL+D +RK GFLNEA SVFL + +  F   L CCNSL +DLLKG
Sbjct: 152  KEINGSSSSSSVVVFEILIDIYRKKGFLNEAVSVFLGAKTNEFIVGLACCNSLSKDLLKG 211

Query: 225  KMMGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTY 284
              + LFWKVY GM+ A IVPDVYTYTN+INA+C+VG V +G+ VL +MEEKGC PNLVTY
Sbjct: 212  NRVELFWKVYKGMLGA-IVPDVYTYTNLINAYCRVGKVEEGKHVLFDMEEKGCIPNLVTY 271

Query: 285  NVVIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLG 344
            +VVI GLCR GDV+EALE+K+ M  KGL+PD + Y+ LIDGFC+QKRS E K +L+ M  
Sbjct: 272  SVVIAGLCRAGDVDEALELKRSMANKGLLPDNYIYATLIDGFCRQKRSTEGKSMLDEMYT 331

Query: 345  SGLNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEK 404
             GL P+H+ YTALI+GF+KQ +I  A ++K+EM  R +KLN  TY  LI G+ K G++EK
Sbjct: 332  MGLKPDHVAYTALINGFVKQSDIGGAFQVKEEMFARKIKLNTFTYYALIHGLCKIGDLEK 391

Query: 405  AMALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLIN 464
            A  L +EM + GI+ D QTY+ LI+GY K  N +KAYELL E+K  NL  + Y    ++N
Sbjct: 392  AEDLFSEMTMMGIKPDIQTYNCLIEGYYKVQNMEKAYELLIEIKKENLTANAYMCGAIVN 451

Query: 465  GLCRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVP 524
            GLC   +L +ANE+ + MIS G+KPN VIY T++   V+E R+E A ++L  M   G+ P
Sbjct: 452  GLCHCGDLTRANELFQEMISWGLKPNIVIYTTIVKGLVKEGRFEEAIKILGVMKDQGLSP 511

Query: 525  DLFCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYF 584
            D+FCYN++IIG C+A ++EE K   VEM  KG+KPN YTYGAFIH YC+ GE+Q AER F
Sbjct: 512  DVFCYNTVIIGFCKAGKMEEGKSYLVEMIAKGLKPNVYTYGAFIHGYCRAGEMQAAERSF 571

Query: 585  QDMLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKN 644
             +ML S I PN++I T LIDG+C  GNT +A + F+CML++G++PDVQT+  LIHGLSKN
Sbjct: 572  IEMLDSGIAPNDVICTDLIDGYCKDGNTTKAFAKFRCMLDQGVLPDVQTHSVLIHGLSKN 631

Query: 645  GKTEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIY 704
            GK +EAM VFSE LDKGLVPDVF Y SLIS  CK+G+++ A +L+++M  KG NPNIV Y
Sbjct: 632  GKLQEAMGVFSELLDKGLVPDVFTYTSLISNLCKEGDLKAAFELHDDMCKKGINPNIVTY 691

Query: 705  NTLINGLCKLGEIKDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMIS 764
            N LINGLCKLGEI  ARELFD I  KGL  N VTYS II GYCKS NLTEAF LF  M  
Sbjct: 692  NALINGLCKLGEIAKARELFDGIPEKGLARNSVTYSTIIAGYCKSANLTEAFQLFHGMKL 751

Query: 765  KGVPLDRHIYCILIDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEA 824
             GVP D  +YC LIDGCCK GN EKALSLF   +++ +AS  AFN+LIDGF KLGKLIEA
Sbjct: 752  VGVPPDSFVYCALIDGCCKAGNTEKALSLFLGMVEEGIASTPAFNALIDGFFKLGKLIEA 811

Query: 825  RELFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLG 884
             +L +D VD H+TPN VTYTIL++ +     ++EAEQLF++M  +N+MPN LTYTSLL G
Sbjct: 812  YQLVEDMVDNHITPNHVTYTILIEYHCTVGNIKEAEQLFMEMQKRNVMPNVLTYTSLLHG 871

Query: 885  YNRIGHRIKMISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLD 944
            YNRIG R +M SLF +M ARGI  D + + VM D + KEGN ++ALKL+D  L EG+ + 
Sbjct: 872  YNRIGRRSEMFSLFDEMVARGIKPDDLAWSVMVDAHLKEGNWIKALKLVDDMLSEGVNVC 931

Query: 945  GDVFDALIFHLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLD 1004
             +++  LI  LC     S +LK+L E+ ++   L+  TC  L+  F++AG  D+AL VL+
Sbjct: 932  KNLYTILIDALCKHNNLSEVLKVLDEVEKQGSKLSLATCGTLVCCFHRAGRTDEALRVLE 991

BLAST of CmoCh04G002870 vs. TrEMBL
Match: A0A067E7X7_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g001797mg PE=4 SV=1)

HSP 1 Score: 1066.6 bits (2757), Expect = 1.9e-308
Identity = 520/987 (52.68%), Positives = 696/987 (70.52%), Query Frame = 1

Query: 45   FSTAHPYDH-NDDTVREISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRL 104
            FST+    H N++  +EI+  L  + W+ ++++     KLNP++V+SVLQ + +NDP RL
Sbjct: 27   FSTSQTSLHSNEEAAKEITNFLNENHWESLIESSKLRNKLNPDVVQSVLQHSHVNDPKRL 86

Query: 105  QSFFYWSSSRMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKC 164
              FF W+S+++G P NLHS+S LA+ LCNS LF  A  + ++M+ TR+   +IL+S + C
Sbjct: 87   LGFFNWTSTQLGIPPNLHSFSYLAMMLCNSRLFGAASGVIDRMIATRRSSYQILESFLMC 146

Query: 165  YRECGGSNLIVFDILVDNFRKFGFLNEACSVFLASIS-GGFFPSLICCNSLMRDLLKGKM 224
            YRE   S  +VF++L+D +RK GFL++A  VF   +  GG  P L+CCNS++ DLL+   
Sbjct: 147  YRERNVSGGVVFEMLIDGYRKIGFLDDAAIVFFGVVKDGGSVPGLLCCNSILNDLLRANK 206

Query: 225  MGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNV 284
            + LFWKVY  M+EAK+ PDVYTYT++INAH + G+V   + VL EMEEK           
Sbjct: 207  LKLFWKVYDVMLEAKVTPDVYTYTSLINAHFRAGNVKAAQRVLFEMEEK----------- 266

Query: 285  VIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSG 344
                    G ++EA E+K+ M+ KGLVPD FTYS+++DGFCK KR E+AKL+L+ M    
Sbjct: 267  -------VGAIDEAFELKESMIHKGLVPDCFTYSLMVDGFCKNKRLEDAKLLLKKMYDLK 326

Query: 345  LNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAM 404
            LNPN + YT LI+GFMKQGN++EA R+K+EMVT G+KLN+ TYN LI GI KAGE+EKA 
Sbjct: 327  LNPNEVVYTTLINGFMKQGNLQEAFRLKNEMVTFGIKLNLFTYNALIGGICKAGEIEKAK 386

Query: 405  ALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGL 464
             L+ EM   GI  DTQTY+ LI+G  + +N  KAYELL +MK RNL P+ YT +V+INGL
Sbjct: 387  GLMTEMLRLGINPDTQTYNSLIEGCYRENNMAKAYELLVDMKKRNLSPTAYTCNVIINGL 446

Query: 465  CRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDL 524
            CR  +L  A  V E MI+ G+KPN  +Y TLI A+++++R+E A  +LKGM   GV+PD+
Sbjct: 447  CRCSDLEGACRVFEEMIACGLKPNNFVYTTLIQAHLRQNRFEEAINILKGMTGKGVLPDV 506

Query: 525  FCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQD 584
            FCYNSLI GLC+AK++E+A+   VEM   G+KPN YTYGAFI  Y KTG +Q A+RYFQ+
Sbjct: 507  FCYNSLISGLCKAKKMEDARSCLVEMTANGLKPNLYTYGAFIREYTKTGNMQAADRYFQE 566

Query: 585  MLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGK 644
            ML+  I PN+IIYT LIDGHC  GN  EA STF+CML +G++PD++TY  LIHGLS+ GK
Sbjct: 567  MLNCGIAPNDIIYTTLIDGHCKEGNVKEAFSTFRCMLGRGILPDLKTYSVLIHGLSRCGK 626

Query: 645  TEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNT 704
              EA+ VFSE  DKGLVPDV  Y+SLISGFCK+G I++A QL+E+M   G  PNIV YN 
Sbjct: 627  IHEALEVFSELQDKGLVPDVITYSSLISGFCKQGFIKEAFQLHEKMCESGITPNIVTYNA 686

Query: 705  LINGLCKLGEIKDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKG 764
            LI+GLCK GE++ ARELFD I  KGL P VVTY+ IIDGYCKSGNLTEAF L +EM S+G
Sbjct: 687  LIDGLCKSGELERARELFDGIFAKGLTPTVVTYTTIIDGYCKSGNLTEAFQLVNEMPSRG 746

Query: 765  VPLDRHIYCILIDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARE 824
            V  D  +YC L+DGCC+ GN+EKALSLF E +QK +AS S+FN+L++G CK  K+ EA +
Sbjct: 747  VTPDNFVYCTLVDGCCRDGNMEKALSLFLEMVQKGLASTSSFNALLNGLCKSQKIFEANK 806

Query: 825  LFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYN 884
            L +D  DKH+TPN VTYTIL+D + KA  M++AE L ++M  + + PN  TYTSLL GY 
Sbjct: 807  LLEDMADKHITPNHVTYTILIDYHCKAGTMKDAEHLLVEMQKRVLKPNFRTYTSLLHGYA 866

Query: 885  RIGHRIKMISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGD 944
             IG R +M +LF +M  RG+  D + Y +M D Y KEGN ++ +KL+D+  + G+ L+ +
Sbjct: 867  GIGKRSEMFALFDEMVERGVEPDGVIYSMMVDAYLKEGNMMKTIKLVDEMFLRGLVLNQN 926

Query: 945  VFDALIFHLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIM 1004
            V+ +L   LC E +   +LKLL EM +K++ L+  TC  L+   Y+AGN DKA   L+ M
Sbjct: 927  VYTSLANSLCKEEEFYKVLKLLDEMGDKEIKLSHATCCILISSVYEAGNIDKATRFLESM 986

Query: 1005 QRLGWVPDSLNVVDLVNARKNDMNSES 1030
             + GWV DS  ++DLV   +ND NSE+
Sbjct: 987  IKFGWVADSTVMMDLVKQDQNDANSEN 995

BLAST of CmoCh04G002870 vs. TAIR10
Match: AT5G61990.1 (AT5G61990.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 738.4 bits (1905), Expect = 6.0e-213
Identity = 394/970 (40.62%), Positives = 580/970 (59.79%), Query Frame = 1

Query: 56   DTVREISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMG 115
            D   EI+ ILK  +W+  L + N   ++NPE+V SVL+   ++DP +L SFF W  S+  
Sbjct: 33   DASAEIAGILKQENWRDTLVSSNLSIEINPEVVLSVLRSKRVDDPSKLLSFFNWVDSQKV 92

Query: 116  TPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNL--I 175
            T Q L S+S LA+ LCN G F +A ++ E+M+E   P  E+  S+V+C +E  G +   +
Sbjct: 93   TEQKLDSFSFLALDLCNFGSFEKALSVVERMIERNWPVAEVWSSIVRCSQEFVGKSDDGV 152

Query: 176  VFDILVDNFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGGM 235
            +F IL D +   G++ EA  VF +S+     P L  C  L+  LL+   + LFW VY GM
Sbjct: 153  LFGILFDGYIAKGYIEEAVFVFSSSMGLELVPRLSRCKVLLDALLRWNRLDLFWDVYKGM 212

Query: 236  VEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDV 295
            VE  +V DV TY  +I AHC+ G+V  G+ VL + E++     L              +V
Sbjct: 213  VERNVVFDVKTYHMLIIAHCRAGNVQLGKDVLFKTEKEFRTATL--------------NV 272

Query: 296  NEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTAL 355
            + AL++K+ M+ KGLVP  +TY +LIDG CK KR E+AK +L  M   G++ ++ TY+ L
Sbjct: 273  DGALKLKESMICKGLVPLKYTYDVLIDGLCKIKRLEDAKSLLVEMDSLGVSLDNHTYSLL 332

Query: 356  IDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGI 415
            IDG +K  N + A  +  EMV+ G+ +    Y+  I  ++K G MEKA AL + M  +G+
Sbjct: 333  IDGLLKGRNADAAKGLVHEMVSHGINIKPYMYDCCICVMSKEGVMEKAKALFDGMIASGL 392

Query: 416  ELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANE 475
                Q Y  LI+GY +  N  + YELL EMK RN++ S YTY  ++ G+C S +L  A  
Sbjct: 393  IPQAQAYASLIEGYCREKNVRQGYELLVEMKKRNIVISPYTYGTVVKGMCSSGDLDGAYN 452

Query: 476  VLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLC 535
            +++ MI+ G +PN VIY TLI   +Q SR+  A  VLK M + G+ PD+FCYNSLIIGL 
Sbjct: 453  IVKEMIASGCRPNVVIYTTLIKTFLQNSRFGDAMRVLKEMKEQGIAPDIFCYNSLIIGLS 512

Query: 536  RAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNI 595
            +AKR++EA+   VEM E G+KPNA+TYGAFI  Y +  E   A++Y ++M    ++PN +
Sbjct: 513  KAKRMDEARSFLVEMVENGLKPNAFTYGAFISGYIEASEFASADKYVKEMRECGVLPNKV 572

Query: 596  IYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSEY 655
            + T LI+ +C  G  +EA S ++ M+++G++ D +TY  L++GL KN K ++A  +F E 
Sbjct: 573  LCTGLINEYCKKGKVIEACSAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREM 632

Query: 656  LDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEI 715
              KG+ PDVF Y  LI+GF K G ++KAS +++EM+ +G  PN++IYN L+ G C+ GEI
Sbjct: 633  RGKGIAPDVFSYGVLINGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEI 692

Query: 716  KDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCIL 775
            + A+EL D++  KGL PN VTY  IIDGYCKSG+L EAF LFDEM  KG+  D  +Y  L
Sbjct: 693  EKAKELLDEMSVKGLHPNAVTYCTIIDGYCKSGDLAEAFRLFDEMKLKGLVPDSFVYTTL 752

Query: 776  IDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARE----LFDDTVD 835
            +DGCC+  ++E+A+++F    +   +S + FN+LI+   K GK     E    L D + D
Sbjct: 753  VDGCCRLNDVERAITIFGTNKKGCASSTAPFNALINWVFKFGKTELKTEVLNRLMDGSFD 812

Query: 836  KHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIK 895
            +   PN VTY I++D   K   +E A++LF  M   N+MP  +TYTSLL GY+++G R +
Sbjct: 813  RFGKPNDVTYNIMIDYLCKEGNLEAAKELFHQMQNANLMPTVITYTSLLNGYDKMGRRAE 872

Query: 896  MISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGDVFDALIF 955
            M  +F +  A GI  D I Y V+ + + KEG + +AL L+D+   +    DG        
Sbjct: 873  MFPVFDEAIAAGIEPDHIMYSVIINAFLKEGMTTKALVLVDQMFAKNAVDDG-------- 932

Query: 956  HLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIMQRLGWVP 1015
                                    L+ +TC ALL GF K G  + A +V++ M RL ++P
Sbjct: 933  ----------------------CKLSISTCRALLSGFAKVGEMEVAEKVMENMVRLQYIP 958

Query: 1016 DSLNVVDLVN 1020
            DS  V++L+N
Sbjct: 993  DSATVIELIN 958

BLAST of CmoCh04G002870 vs. TAIR10
Match: AT5G59900.1 (AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 427.2 bits (1097), Expect = 3.0e-119
Identity = 255/836 (30.50%), Positives = 429/836 (51.32%), Query Frame = 1

Query: 70  WQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMGTPQNLHSYSILAIR 129
           W++ L ++   ++L    V  +L    I+DP     FF +     G   +  S+ IL   
Sbjct: 55  WEIALSSELVSRRLKTVHVEEILI-GTIDDPKLGLRFFNFLGLHRGFDHSTASFCILIHA 114

Query: 130 LCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNLIVFDILVDNFRKFGFLN 189
           L  + LF  A ++ + +L     P ++ + L  CY +C  S+   FD+L+ ++ +   + 
Sbjct: 115 LVKANLFWPASSLLQTLLLRALKPSDVFNVLFSCYEKCKLSSSSSFDLLIQHYVRSRRVL 174

Query: 190 EACSVFLASISG-GFFPSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNV 249
           +   VF   I+     P +   ++L+  L+K +  GL  +++  MV   I PDVY YT V
Sbjct: 175 DGVLVFKMMITKVSLLPEVRTLSALLHGLVKFRHFGLAMELFNDMVSVGIRPDVYIYTGV 234

Query: 250 INAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGL 309
           I + C++ D+ + + +++ ME  GC  N+V YNV+I GLC+   V EA+ +KK +  K L
Sbjct: 235 IRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLCKKQKVWEAVGIKKDLAGKDL 294

Query: 310 VPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQGNIEEALR 369
            PD  TY  L+ G CK +  E    +++ ML    +P+    ++L++G  K+G IEEAL 
Sbjct: 295 KPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEAAVSSLVEGLRKRGKIEEALN 354

Query: 370 IKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTYDLLIDGYL 429
           +   +V  G+  N+  YN LI  + K  +  +A  L + M   G+  +  TY +LID + 
Sbjct: 355 LVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFC 414

Query: 430 KSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMISHGVKPNAV 489
           +    D A   L EM    L  S+Y Y+ LING C+  ++  A   +  MI+  ++P  V
Sbjct: 415 RRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVV 474

Query: 490 IYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEM 549
            Y +L+     + +   A  +   M   G+ P ++ + +L+ GL RA  + +A  +F EM
Sbjct: 475 TYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEM 534

Query: 550 GEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALIDGHCNVGNT 609
            E  +KPN  TY   I  YC+ G++  A  + ++M    IVP+   Y  LI G C  G  
Sbjct: 535 AEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQA 594

Query: 610 VEALSTFKCMLEKGLIP-DVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNS 669
            EA   F   L KG    +   Y  L+HG  + GK EEA+ V  E + +G+  D+  Y  
Sbjct: 595 SEA-KVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGV 654

Query: 670 LISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELFDKIEGKG 729
           LI G  K  + +    L +EM  +G  P+ VIY ++I+   K G+ K+A  ++D +  +G
Sbjct: 655 LIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEG 714

Query: 730 LVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCK-QGNLEKA 789
            VPN VTY+ +I+G CK+G + EA  L  +M       ++  Y   +D   K + +++KA
Sbjct: 715 CVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKA 774

Query: 790 LSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHVTPNSVTYTILVDAY 849
           + L +  L+  +A+ + +N LI GFC+ G++ EA EL    +   V+P+ +TYT +++  
Sbjct: 775 VELHNAILKGLLANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINEL 834

Query: 850 SKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMISLFKDMEARGI 903
            +   +++A +L+  M  K I P+ + Y +L+ G    G   K   L  +M  +G+
Sbjct: 835 CRRNDVKKAIELWNSMTEKGIRPDRVAYNTLIHGCCVAGEMGKATELRNEMLRQGL 888

BLAST of CmoCh04G002870 vs. TAIR10
Match: AT5G55840.1 (AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 387.9 bits (995), Expect = 2.0e-107
Identity = 245/866 (28.29%), Positives = 417/866 (48.15%), Query Frame = 1

Query: 156  ILDSLVKCYRECGGSNLIVFDILVDNFRKFGFLNEACSVFLASISGGFFPSLICCNSLMR 215
            +  +L+  YR C  SN  V+DIL+  + + G + ++  +F      GF PS+  CN+++ 
Sbjct: 148  VFGALMTTYRLCN-SNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILG 207

Query: 216  DLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKP 275
             ++K       W     M++ KI PDV T+  +IN  C  G   K   ++ +ME+ G  P
Sbjct: 208  SVVKSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAP 267

Query: 276  NLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLIL 335
             +VTYN V+   C+ G    A+E+   M  KG+  D  TY++LI   C+  R  +  L+L
Sbjct: 268  TIVTYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLL 327

Query: 336  ESMLGSGLNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKA 395
              M    ++PN +TY  LI+GF  +G +  A ++ +EM++ GL  N VT+N LI G    
Sbjct: 328  RDMRKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISE 387

Query: 396  GEMEKAMALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTY 455
            G  ++A+ +   M   G+     +Y +L+DG  K+   D A      MK   +     TY
Sbjct: 388  GNFKEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITY 447

Query: 456  SVLINGLCRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVK 515
            + +I+GLC++  L +A  +L  M   G+ P+ V Y+ LIN   +  R++ AKE++  + +
Sbjct: 448  TGMIDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYR 507

Query: 516  NGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQV 575
             G+ P+   Y++LI   CR   ++EA  ++  M  +G   + +T+   +   CK G++  
Sbjct: 508  VGLSPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAE 567

Query: 576  AERYFQDMLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIH 635
            AE + + M S  I+PN + +  LI+G+ N G  ++A S F  M + G  P   TYG+L+ 
Sbjct: 568  AEEFMRCMTSDGILPNTVSFDCLINGYGNSGEGLKAFSVFDEMTKVGHHPTFFTYGSLLK 627

Query: 636  GLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNP 695
            GL K G   EA              D  +YN+L++  CK G + KA  L+ EM+ +   P
Sbjct: 628  GLCKGGHLREAEKFLKSLHAVPAAVDTVMYNTLLTAMCKSGNLAKAVSLFGEMVQRSILP 687

Query: 696  NIVIYNTLINGLCKLGEIKDARELFDKIEGKG-LVPNVVTYSIIIDGYCKSGNLTEAFNL 755
            +   Y +LI+GLC+ G+   A     + E +G ++PN V Y+  +DG  K+G        
Sbjct: 688  DSYTYTSLISGLCRKGKTVIAILFAKEAEARGNVLPNKVMYTCFVDGMFKAGQWKAGIYF 747

Query: 756  FDEMISKGVPLDRHIYCILIDGCCKQGNLEKALSLFHE-ALQKSVASPSAFNSLIDGFCK 815
             ++M + G   D      +IDG  + G +EK   L  E   Q    + + +N L+ G+ K
Sbjct: 748  REQMDNLGHTPDIVTTNAMIDGYSRMGKIEKTNDLLPEMGNQNGGPNLTTYNILLHGYSK 807

Query: 816  LGKLIEARELFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLT 875
               +  +  L+   +   + P+ +T   LV    ++ M+E   ++      + +  +  T
Sbjct: 808  RKDVSTSFLLYRSIILNGILPDKLTCHSLVLGICESNMLEIGLKILKAFICRGVEVDRYT 867

Query: 876  YTSLLLGYNRIGHRIKMISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSL 935
            +  L+      G       L K M + GI+ D  T   M  V  +     E+  +L +  
Sbjct: 868  FNMLISKCCANGEINWAFDLVKVMTSLGISLDKDTCDAMVSVLNRNHRFQESRMVLHEMS 927

Query: 936  VEGIKLDGDVFDALIFHLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNED 995
             +GI  +   +  LI  LC  G   T   +  EM   K+   +   +A++    K G  D
Sbjct: 928  KQGISPESRKYIGLINGLCRVGDIKTAFVVKEEMIAHKICPPNVAESAMVRALAKCGKAD 987

Query: 996  KALEVLDIMQRLGWVPDSLNVVDLVN 1020
            +A  +L  M ++  VP   +   L++
Sbjct: 988  EATLLLRFMLKMKLVPTIASFTTLMH 1012

BLAST of CmoCh04G002870 vs. TAIR10
Match: AT4G19440.1 (AT4G19440.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 364.8 bits (935), Expect = 1.8e-100
Identity = 220/702 (31.34%), Positives = 353/702 (50.28%), Query Frame = 1

Query: 99  DPVRLQSFFYWSSSRMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPL---- 158
           +P     FF  +S       +L SY +L   L ++ L   A  +  +++    P L    
Sbjct: 105 NPKTALDFFRLASDSFSFSFSLRSYCLLIGLLLDANLLSAARVVLIRLINGNVPVLPCGL 164

Query: 159 --------EILDSLVKCYRECGGSNL--IVFDILVDNFRKFGFLNEACSVFLASISGGFF 218
                   + + SL  C+ E     +  ++ ++    F++ G    A  VF    + G F
Sbjct: 165 RDSRVAIADAMASLSLCFDEEIRRKMSDLLIEVYCTQFKRDGCYL-ALDVFPVLANKGMF 224

Query: 219 PSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMV 278
           PS   CN L+  L++        + +  + +  + PDVY +T  INA CK G V +   +
Sbjct: 225 PSKTTCNILLTSLVRANEFQKCCEAFDVVCKG-VSPDVYLFTTAINAFCKGGKVEEAVKL 284

Query: 279 LSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCK 338
            S+MEE G  PN+VT+N VI GL   G  +EA   K+ M+E+G+ P   TYSIL+ G  +
Sbjct: 285 FSKMEEAGVAPNVVTFNTVIDGLGMCGRYDEAFMFKEKMVERGMEPTLITYSILVKGLTR 344

Query: 339 QKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVT 398
            KR  +A  +L+ M   G  PN I Y  LID F++ G++ +A+ IKD MV++GL L   T
Sbjct: 345 AKRIGDAYFVLKEMTKKGFPPNVIVYNNLIDSFIEAGSLNKAIEIKDLMVSKGLSLTSST 404

Query: 399 YNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMK 458
           YNTLI+G  K G+ + A  L+ EM   G  ++  ++  +I         D A   + EM 
Sbjct: 405 YNTLIKGYCKNGQADNAERLLKEMLSIGFNVNQGSFTSVICLLCSHLMFDSALRFVGEML 464

Query: 459 ARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYE 518
            RN+ P     + LI+GLC+  +  KA E+    ++ G   +      L++   +  + +
Sbjct: 465 LRNMSPGGGLLTTLISGLCKHGKHSKALELWFQFLNKGFVVDTRTSNALLHGLCEAGKLD 524

Query: 519 GAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFI 578
            A  + K ++  G V D   YN+LI G C  K+++EA M   EM ++G+KP+ YTY   I
Sbjct: 525 EAFRIQKEILGRGCVMDRVSYNTLISGCCGKKKLDEAFMFLDEMVKRGLKPDNYTYSILI 584

Query: 579 HLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLI 638
                  +++ A +++ D   + ++P+   Y+ +IDG C    T E    F  M+ K + 
Sbjct: 585 CGLFNMNKVEEAIQFWDDCKRNGMLPDVYTYSVMIDGCCKAERTEEGQEFFDEMMSKNVQ 644

Query: 639 PDVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQL 698
           P+   Y  LI    ++G+   A+ +  +   KG+ P+   Y SLI G      +E+A  L
Sbjct: 645 PNTVVYNHLIRAYCRSGRLSMALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLL 704

Query: 699 YEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELFDKIEGKGLVPNVVTYSIIIDGYCK 758
           +EEM ++G  PN+  Y  LI+G  KLG++     L  ++  K + PN +TY+++I GY +
Sbjct: 705 FEEMRMEGLEPNVFHYTALIDGYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYAR 764

Query: 759 SGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCKQGNLEKA 787
            GN+TEA  L +EM  KG+  D   Y   I G  KQG + +A
Sbjct: 765 DGNVTEASRLLNEMREKGIVPDSITYKEFIYGYLKQGGVLEA 804

BLAST of CmoCh04G002870 vs. TAIR10
Match: AT5G65560.1 (AT5G65560.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 362.8 bits (930), Expect = 6.9e-100
Identity = 252/872 (28.90%), Positives = 404/872 (46.33%), Query Frame = 1

Query: 63  TILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMGTPQNLHS 122
           +IL   +W      ++ +  ++P  V S+   +   DP    +F +W S       +++S
Sbjct: 68  SILSKPNWHKSPSLKSMVSAISPSHVSSLFSLDL--DPKTALNFSHWISQNPRYKHSVYS 127

Query: 123 YSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNLIVFDILVDNF 182
           Y+ L   L N+G           + + R   ++  DS+        G  L V D+     
Sbjct: 128 YASLLTLLINNGYVG-------VVFKIRLLMIKSCDSV--------GDALYVLDLC---- 187

Query: 183 RKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVPDV 242
           RK    +E   +    I G       C N+L+  L +  ++    +VY  M+E K+ P++
Sbjct: 188 RKMN-KDERFELKYKLIIG-------CYNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNI 247

Query: 243 YTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVKKL 302
           YTY  ++N +CK+G+V +    +S++ E G  P+  TY  +I G C+  D++ A +V   
Sbjct: 248 YTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNE 307

Query: 303 MMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQGN 362
           M  KG   +   Y+ LI G C  +R +EA  +   M      P   TYT LI        
Sbjct: 308 MPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTVRTYTVLIKSLCGSER 367

Query: 363 IEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTYDL 422
             EAL +  EM   G+K NI TY  LI  +    + EKA  L+ +M   G+  +  TY+ 
Sbjct: 368 KSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQMLEKGLMPNVITYNA 427

Query: 423 LIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMISHG 482
           LI+GY K    + A +++  M++R L P+  TY+ LI G C+S  + KA  VL  M+   
Sbjct: 428 LINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKSN-VHKAMGVLNKMLERK 487

Query: 483 VKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEEAK 542
           V P+ V Y +LI+   +   ++ A  +L  M   G+VPD + Y S+I  LC++KRVEEA 
Sbjct: 488 VLPDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEAC 547

Query: 543 MMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALIDGH 602
            +F  + +KG+ PN   Y A I  YCK G++  A    + MLS   +PN++ + ALI G 
Sbjct: 548 DLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGL 607

Query: 603 CNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVPDV 662
           C  G   EA    + M++ GL P V T   LIH L K+G  + A   F + L  G  PD 
Sbjct: 608 CADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDA 667

Query: 663 FIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELFDK 722
             Y + I  +C++G +  A  +  +M   G +P++  Y++LI G   LG+   A ++  +
Sbjct: 668 HTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKR 727

Query: 723 IEGKGLVPNVVTYSIII---------------DGYCKSGNLTE---AFNLFDEMISKGVP 782
           +   G  P+  T+  +I                  C   N+ E      L ++M+   V 
Sbjct: 728 MRDTGCEPSQHTFLSLIKHLLEMKYGKQKGSEPELCAMSNMMEFDTVVELLEKMVEHSVT 787

Query: 783 LDRHIYCILIDGCCKQGNLEKALSLFHEALQKSVASPS--AFNSLIDGFCKLGKLIEARE 842
            +   Y  LI G C+ GNL  A  +F    +    SPS   FN+L+   CKL K  EA +
Sbjct: 788 PNAKSYEKLILGICEVGNLRVAEKVFDHMQRNEGISPSELVFNALLSCCCKLKKHNEAAK 847

Query: 843 LFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYN 902
           + DD +     P   +  +L+    K    E    +F ++       + L +  ++ G  
Sbjct: 848 VVDDMICVGHLPQLESCKVLICGLYKKGEKERGTSVFQNLLQCGYYEDELAWKIIIDGVG 907

Query: 903 RIGHRIKMISLFKDMEARGIACDAITYGVMAD 915
           + G       LF  ME  G    + TY ++ +
Sbjct: 908 KQGLVEAFYELFNVMEKNGCKFSSQTYSLLIE 909

BLAST of CmoCh04G002870 vs. NCBI nr
Match: gi|659090168|ref|XP_008445872.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial [Cucumis melo])

HSP 1 Score: 1594.7 bits (4128), Expect = 0.0e+00
Identity = 787/1028 (76.56%), Positives = 896/1028 (87.16%), Query Frame = 1

Query: 1    MANAMCLIRQMAAISHPRRNLCSFPVQNTNFPLIANDVCTQFIFFSTAHPYDHNDDTVRE 60
            MANA+CLIRQMA  S PR  L +FP++ T+FP I N+   + +FFST +P DH +DTVRE
Sbjct: 1    MANALCLIRQMAVNSSPRGILSTFPLRTTSFPQIWNNFSIRLMFFSTNNPSDHYEDTVRE 60

Query: 61   ISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMGTPQNL 120
             S ILK  DW ++L+N++SL+KLNPE+V SVLQK+EI+D VRLQ+FFYWSSS+M TPQNL
Sbjct: 61   FSMILKRKDWVILLNNEDSLRKLNPEVVCSVLQKSEIDDSVRLQNFFYWSSSKMSTPQNL 120

Query: 121  HSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNLIVFDILVD 180
             SYSILAIRLCNSGL  +A NM EK+LETRKPPLEILDSLV+CYRE GGSNL VFDI +D
Sbjct: 121  LSYSILAIRLCNSGLIHQAQNMLEKLLETRKPPLEILDSLVRCYREFGGSNLTVFDIFID 180

Query: 181  NFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVP 240
            NFR FGFLNEA SVF+ASIS GFFPSL+CCN+LMRDLLKG MMGLFWKVYG M+EAKIVP
Sbjct: 181  NFRMFGFLNEASSVFIASISEGFFPSLMCCNNLMRDLLKGNMMGLFWKVYGSMLEAKIVP 240

Query: 241  DVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVK 300
            DVYTYTNVINAHCKVGDV+KG+MVLSEME+K CKPNL+TYNVVIGGLCRTG ++EALEVK
Sbjct: 241  DVYTYTNVINAHCKVGDVIKGKMVLSEMEKKECKPNLITYNVVIGGLCRTGALDEALEVK 300

Query: 301  KLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQ 360
            KLMMEKGL PDG+TY++LIDGFCKQKRS+EAKLI ESML SG NPNH T +ALIDGFMK+
Sbjct: 301  KLMMEKGLGPDGYTYTLLIDGFCKQKRSKEAKLIFESMLSSGSNPNHFTCSALIDGFMKE 360

Query: 361  GNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTY 420
            G IEEAL IKDEM+TRGLKLN+VTYN +I GIAKAGEM KAMAL NEM + GIE DT TY
Sbjct: 361  GTIEEALSIKDEMITRGLKLNVVTYNAMIGGIAKAGEMGKAMALFNEMLMAGIEPDTWTY 420

Query: 421  DLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMIS 480
            + LIDGYLKSH+  KA ELLAEMKARNLM S +T SVLI+GLC   +L KANEVL+ MI 
Sbjct: 421  NTLIDGYLKSHDMAKACELLAEMKARNLMLSPFTCSVLISGLCHCGDLQKANEVLDQMIR 480

Query: 481  HGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEE 540
             GVKP+  +Y TLI A VQESRYE A E+LK M+ NGV+PDLFCYN LIIGLCRAK+VEE
Sbjct: 481  SGVKPSVFMYGTLIKAYVQESRYETAIELLKVMIANGVLPDLFCYNCLIIGLCRAKKVEE 540

Query: 541  AKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALID 600
            AKM+ V+MGEKGIKPNA+TYGAFI+LY K+GEIQVAERYF+DMLSS IVPNN+IYT LI+
Sbjct: 541  AKMLLVDMGEKGIKPNAHTYGAFINLYSKSGEIQVAERYFKDMLSSGIVPNNVIYTILIN 600

Query: 601  GHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVP 660
            G+C+VGNTVEALSTFKCM EKGLIPDV+ Y A+IH LSKNGKT+EAM VF E+L KGL P
Sbjct: 601  GYCDVGNTVEALSTFKCMFEKGLIPDVRAYSAIIHSLSKNGKTKEAMGVFLEFLKKGLAP 660

Query: 661  DVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELF 720
            DVF+YNSLISGFCK+G+IEKASQLYEEML  G NPNIV+YNTLINGLCKLGE+K ARELF
Sbjct: 661  DVFLYNSLISGFCKEGDIEKASQLYEEMLHNGINPNIVVYNTLINGLCKLGEVKKARELF 720

Query: 721  DKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCKQ 780
            DKIEGK LVPNVVTYS I+DGYCKSGNLTEAF LFDEMISKG+  D +IYCILIDGC K+
Sbjct: 721  DKIEGKDLVPNVVTYSTIVDGYCKSGNLTEAFKLFDEMISKGISPDGYIYCILIDGCGKE 780

Query: 781  GNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHVTPNSVTYT 840
            GNLEKALSLFHEALQKSVAS SAFNSLID FCK GK+IEARELFDD VDK VTPNSVTYT
Sbjct: 781  GNLEKALSLFHEALQKSVASLSAFNSLIDSFCKHGKVIEARELFDDMVDKKVTPNSVTYT 840

Query: 841  ILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMISLFKDMEAR 900
            IL+DAY +AEMMEEAEQLFLDM  +NI+PNTLTYTSLLLGYN+IG+R KMISLFKDMEAR
Sbjct: 841  ILIDAYGRAEMMEEAEQLFLDMEMRNIIPNTLTYTSLLLGYNQIGNRFKMISLFKDMEAR 900

Query: 901  GIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGDVFDALIFHLCNEGKNSTM 960
            GIACDAI YGVMA  YCKEG SLEALKLL+KSLVEGIKL+ DVFDALIFHLC E + ST+
Sbjct: 901  GIACDAIAYGVMASAYCKEGKSLEALKLLNKSLVEGIKLEDDVFDALIFHLCKEKQISTV 960

Query: 961  LKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIMQRLGWVPDSLNVVDLVNA 1020
            L+LL EM +++L+L+S TC ALL+GF+ +GNED+A +VL +MQRLGWVP SL++ D ++ 
Sbjct: 961  LELLTEMGKEELSLSSKTCNALLLGFFNSGNEDEASKVLGVMQRLGWVPTSLSLTDSIST 1020

Query: 1021 RKNDMNSE 1029
             +NDM S+
Sbjct: 1021 GRNDMKSD 1028
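
The percentages in an HSP summary line follow from its counts: each is the number of identical (or positive-scoring) columns divided by the alignment length. The sketch below recomputes them in Python, assuming the summary-line layout used in this record. The names `parse_hsp_stats` and `percent` are hypothetical helpers, not part of any BLAST tool, and the regular expression deliberately accepts spelling variants of the "Positives" label.

```python
import re

def parse_hsp_stats(line):
    """Return (identities, positives, alignment_length) from an HSP summary line."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    # "Pos\w*" tolerates spelling variants of the label (assumption about the input).
    pos = re.search(r"Pos\w*\s*=\s*(\d+)/(\d+)", line)
    if ident is None or pos is None:
        raise ValueError("not an HSP summary line")
    return int(ident.group(1)), int(pos.group(1)), int(ident.group(2))

def percent(count, length):
    """Percentage of alignment columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)

line = "Identity = 787/1028 (76.56%), Positives = 896/1028 (87.16%), Query Frame = 1"
identities, positives, length = parse_hsp_stats(line)
print(percent(identities, length), percent(positives, length))  # 76.56 87.16
```

The same two calls can be applied to any other summary line in this record to check that the reported percentages match the counts.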

BLAST of CmoCh04G002870 vs. NCBI nr
Match: gi|449451896|ref|XP_004143696.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial [Cucumis sativus])

HSP 1 Score: 1583.9 bits (4100), Expect = 0.0e+00
Identity = 781/1028 (75.97%), Positives = 892/1028 (86.77%), Query Frame = 1

Query: 1    MANAMCLIRQMAAISHPRRNLCSFPVQNTNFPLIANDVCTQFIFFSTAHPYDHNDDTVRE 60
            MANA+CLIRQ+AA S PRR L +FP Q T+FP I N+V   F+FFST +P+DH DDTVRE
Sbjct: 1    MANALCLIRQIAANSSPRRILSTFPFQTTSFPQIWNNVSIHFMFFSTNNPFDHYDDTVRE 60

Query: 61   ISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMGTPQNL 120
             S ILK  DWQ++L+N+++++KLNPEIV SVLQK+EI+D VRLQ+FFYWSSS+M TPQ L
Sbjct: 61   FSMILKRKDWQILLNNEDNVRKLNPEIVCSVLQKSEIDDSVRLQNFFYWSSSKMSTPQYL 120

Query: 121  HSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNLIVFDILVD 180
            HSYSILAIRLCNSGL  +ADNM EK+L+TRKPPLEILDSLV+CYRE GGSNL VFDI +D
Sbjct: 121  HSYSILAIRLCNSGLIHQADNMLEKLLQTRKPPLEILDSLVRCYREFGGSNLTVFDIFID 180

Query: 181  NFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVP 240
             FR  GFLNEA SVF+ASIS GFFP+LICCN+LMRDLLK  MMGLFWKVYG MVEAKIVP
Sbjct: 181  KFRVLGFLNEASSVFIASISEGFFPTLICCNNLMRDLLKANMMGLFWKVYGSMVEAKIVP 240

Query: 241  DVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVK 300
            DVYTYTNVI AHCKVGDV+KG+MVLSEME K CKPNL TYN  IGGLC+TG V+EALEVK
Sbjct: 241  DVYTYTNVIKAHCKVGDVIKGKMVLSEME-KECKPNLFTYNAFIGGLCQTGAVDEALEVK 300

Query: 301  KLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQ 360
            KLMMEKGL PDG TY++L+DGFCKQKRS+EAKLI ESM  SGLNPN  TYTALIDGF+K+
Sbjct: 301  KLMMEKGLGPDGHTYTLLVDGFCKQKRSKEAKLIFESMPSSGLNPNRFTYTALIDGFIKE 360

Query: 361  GNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTY 420
            GNIEEALRIKDEM+TRGLKLN+VTYN +I GIAKAGEM KAM+L NEM + G+E DT TY
Sbjct: 361  GNIEEALRIKDEMITRGLKLNVVTYNAMIGGIAKAGEMAKAMSLFNEMLMAGLEPDTWTY 420

Query: 421  DLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMIS 480
            +LLIDGYLKSH+  KA ELLAEMKAR L PS +TYSVLI+GLC S +L KANEVL+ MI 
Sbjct: 421  NLLIDGYLKSHDMAKACELLAEMKARKLTPSPFTYSVLISGLCHSSDLQKANEVLDQMIR 480

Query: 481  HGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEE 540
            +GVKPN  +Y TLI A VQESRYE A E+LK M+ NGV+PDLFCYN LIIGLCRAK+VEE
Sbjct: 481  NGVKPNVFMYGTLIKAYVQESRYEMAIELLKIMIANGVLPDLFCYNCLIIGLCRAKKVEE 540

Query: 541  AKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALID 600
            AKM+ V+MGEKGIKPNA+TYGAFI+LY K+GEIQVAERYF+DMLSS IVPNN+IYT LI 
Sbjct: 541  AKMLLVDMGEKGIKPNAHTYGAFINLYSKSGEIQVAERYFKDMLSSGIVPNNVIYTILIK 600

Query: 601  GHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVP 660
            GHC+VGNTVEALSTFKCMLEKGLIPD++ Y A+IH LSKNGKT+EAM VF ++L  G+VP
Sbjct: 601  GHCDVGNTVEALSTFKCMLEKGLIPDIRAYSAIIHSLSKNGKTKEAMGVFLKFLKTGVVP 660

Query: 661  DVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELF 720
            DVF+YNSLISGFCK+G+IEKASQLY+EML  G NPNIV+YNTLINGLCKLGE+  ARELF
Sbjct: 661  DVFLYNSLISGFCKEGDIEKASQLYDEMLHNGINPNIVVYNTLINGLCKLGEVTKARELF 720

Query: 721  DKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCKQ 780
            D+IE K LVP+VVTYS IIDGYCKSGNLTEAF LFDEMISKG+  D +IYCILIDGC K+
Sbjct: 721  DEIEEKDLVPDVVTYSTIIDGYCKSGNLTEAFKLFDEMISKGISPDGYIYCILIDGCGKE 780

Query: 781  GNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHVTPNSVTYT 840
            GNLEKALSLFHEA QKSV S SAFNSLID FCK GK+IEARELFDD VDK +TPN VTYT
Sbjct: 781  GNLEKALSLFHEAQQKSVGSLSAFNSLIDSFCKHGKVIEARELFDDMVDKKLTPNIVTYT 840

Query: 841  ILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMISLFKDMEAR 900
            IL+DAY KAEMMEEAEQLFLDM T+NI+PNTLTYTSLLL YN+IG+R KMISLFKDMEAR
Sbjct: 841  ILIDAYGKAEMMEEAEQLFLDMETRNIIPNTLTYTSLLLSYNQIGNRFKMISLFKDMEAR 900

Query: 901  GIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGDVFDALIFHLCNEGKNSTM 960
            GIACDAI YGVMA  YCKEG SLEALKLL+KSLVEGIKL+ DVFDALIFHLC E + ST+
Sbjct: 901  GIACDAIAYGVMASAYCKEGKSLEALKLLNKSLVEGIKLEDDVFDALIFHLCKEKQISTV 960

Query: 961  LKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIMQRLGWVPDSLNVVDLVNA 1020
            L+LL EM +++L+L+S TC  LL+GFYK+GNED+A +VL +MQRLGWVP SL++ D ++ 
Sbjct: 961  LELLSEMGKEELSLSSKTCNTLLLGFYKSGNEDEASKVLGVMQRLGWVPTSLSLTDSIST 1020

Query: 1021 RKNDMNSE 1029
             ++DM S+
Sbjct: 1021 GRDDMKSD 1027

BLAST of CmoCh04G002870 vs. NCBI nr
Match: gi|645268503|ref|XP_008239557.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial [Prunus mume])

HSP 1 Score: 1169.8 bits (3025), Expect = 0.0e+00
Identity = 559/975 (57.33%), Positives = 732/975 (75.08%), Query Frame = 1

Query: 54   NDDTVREISTILKLSDWQVVLDNQNSLKKLNPEIVRSVLQKN-EINDPVRLQSFFYWSSS 113
            ++DTVREISTILK +DW   L+  +  KKLNP +VR+VLQ+N ++ DP RL SFF W+ +
Sbjct: 42   DEDTVREISTILKHNDWHFALNTSDLPKKLNPHVVRAVLQQNHQVGDPKRLLSFFIWTGT 101

Query: 114  RMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNL 173
             MG PQNLHS+SILA+ LCNS LF +A  + E+M+++RKPPLE+++SLV C+RE  GS+ 
Sbjct: 102  HMGVPQNLHSFSILAVALCNSKLFEQAHAVLERMVKSRKPPLEVVNSLVMCFREFDGSDR 161

Query: 174  IVFDILVDNFRKFGFLNEACSVFLASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGG 233
            +VF+IL++ F+  G LNEA   FLA    G FP L CCNSL++DLLK   + LFWKVY  
Sbjct: 162  VVFEILINAFKMAGHLNEAADAFLAVKKVGIFPRLDCCNSLLKDLLKCNRLELFWKVYDA 221

Query: 234  MVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGD 293
            M+EAK+ PD YTYTNVINAHCK G+  +G+  L EMEEKGC PNL TYNVVIG LCRT  
Sbjct: 222  MLEAKVNPDFYTYTNVINAHCKAGNAGQGKRCLHEMEEKGCNPNLSTYNVVIGALCRTWG 281

Query: 294  VNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTA 353
            V+EALEVKK M+EKGLVPD +TY +L+DG C+ KRSEEAKLIL+ M   GLNP +  Y A
Sbjct: 282  VDEALEVKKAMVEKGLVPDRYTYLVLLDGLCRHKRSEEAKLILKDMYDIGLNPENTCYIA 341

Query: 354  LIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITG 413
            LIDGF+K+GN+EEAL IK EM+ RG+KL   TYNT++ G+ + G MEKA A++NEM + G
Sbjct: 342  LIDGFIKEGNMEEALSIKGEMIARGVKLCDATYNTILAGVCRNGTMEKAEAVLNEMNVMG 401

Query: 414  IELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNLMPSLYTYSVLINGLCRSRELPKAN 473
            I+ + QT+  LIDGY +  +  KAYE+L EMK RNL P++YTY V+INGL R  +L +AN
Sbjct: 402  IKPNAQTFKFLIDGYCREQSMVKAYEILNEMKKRNLAPNVYTYGVIINGLSRCGDLQRAN 461

Query: 474  EVLEHMISHGVKPNAVIYATLINANVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGL 533
            +VL+ MI+ G+KP AVIY T+I  +VQE ++E A ++ KGM + GV+PD+FCYNSLIIGL
Sbjct: 462  KVLKEMITRGLKPGAVIYTTVIRGHVQEGKFEEAIKLFKGMNEKGVMPDVFCYNSLIIGL 521

Query: 534  CRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNN 593
            C+A+++EEA+  F+EM E+G+KPNAYTYGAF+H +CK GE+Q+A RYFQ+ML   I PN+
Sbjct: 522  CKARKMEEARTYFLEMVERGLKPNAYTYGAFVHGHCKEGEMQLANRYFQEMLGCGIAPND 581

Query: 594  IIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSE 653
            +IYTALI+GHC  GN  EA S F+CML +G++PD++TY  +IHGLSKNGK +EAM VFSE
Sbjct: 582  VIYTALIEGHCKEGNLTEAHSAFRCMLGRGVLPDIKTYSVIIHGLSKNGKLQEAMGVFSE 641

Query: 654  YLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGE 713
             LDK LVPDVF Y+SLISGFCK+G ++KA Q+ E M  +G +PNIV YN LINGLCK GE
Sbjct: 642  LLDKDLVPDVFTYSSLISGFCKQGNVDKAFQILELMCQRGIDPNIVTYNALINGLCKSGE 701

Query: 714  IKDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCI 773
            +  A+ELFD I GKGL PN VTY+ ++ GY K+G LTEAF L DEM+  G P D  IYC 
Sbjct: 702  VDKAKELFDGISGKGLTPNAVTYATMMGGYSKAGKLTEAFRLLDEMLLHGFPTDSFIYCT 761

Query: 774  LIDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHV 833
            LIDGCCK G+ EKALSLF + ++K  A+ ++FN+LI+GFCKLGK++EA  LF+D VDKHV
Sbjct: 762  LIDGCCKAGDTEKALSLFEDMVEKGFAATASFNALINGFCKLGKMMEAIRLFEDMVDKHV 821

Query: 834  TPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMIS 893
            TPN V+YTIL+ +  K  +M E+EQLFL+M  +N+ P  +TYTSLL GYN  G R KM +
Sbjct: 822  TPNHVSYTILIVSLCKEGLMNESEQLFLEMQKRNLTPTIVTYTSLLHGYNLTGSRFKMFA 881

Query: 894  LFKDMEARGIACDAITYGVMADVYCKEGNSLEALKLLDKSLVEGIKLDGDVFDALIFHLC 953
            LF++M ARG+  D + YG+M D YCKEG+ ++ LKL+D+ LV G  ++  V DAL  +L 
Sbjct: 882  LFEEMMARGLKPDEVNYGMMVDAYCKEGHWVKCLKLVDEVLVNGTIMNSIVVDALTINLF 941

Query: 954  NEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYKAGNEDKALEVLDIMQRLGWVPDSL 1013
             + + S ++K L EM E+  AL+  TC+ L+ GFY+ GN +KA  +L+ M   GWV  S 
Sbjct: 942  QKEEFSEVMKSLDEMGEQGFALSLATCSTLVCGFYRLGNVEKAARILESMLSFGWVSQST 1001

Query: 1014 NVVDLVNARKNDMNS 1028
            ++ DL+N  +N+ +S
Sbjct: 1002 SLSDLINEDQNEASS 1016

BLAST of CmoCh04G002870 vs. NCBI nr
Match: gi|802770320|ref|XP_012090594.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial [Jatropha curcas])

HSP 1 Score: 1150.2 bits (2974), Expect = 0.0e+00
Identity = 557/1010 (55.15%), Positives = 743/1010 (73.56%), Query Frame = 1

Query: 17   PRRNLCSFPVQNTNFPLIANDVCTQFIFFSTAHPYDHNDDTVREISTILKLSDWQVVLDN 76
            P +NL  F  ++ N        C +FI +ST++    ND TV+EI+ +LK ++WQ ++++
Sbjct: 5    PYKNLQFFKNRHRNVISRLRIRCLKFISYSTSN---QNDSTVKEITGLLKENNWQHLIES 64

Query: 77   QNSLKKLNPEIVRSVLQKNEINDPVRLQSFFYWSSSRMGTPQNLHSYSILAIRLCNSGLF 136
                 +LNP++V SVL++N +NDP RL  FF W  SR+G PQNL+S+SI A+ LCNS  F
Sbjct: 65   STLSSRLNPDVVISVLKQNLVNDPKRLFGFFNWVHSRVGIPQNLYSFSITAVILCNSQQF 124

Query: 137  PRADNMFEKMLETRKPPLEILDSLVKCYRECGGSNLIVFDILVDNFRKFGFLNEACSVFL 196
              A+ + E+++E R P L+ILDS++ C+RE   +N +VF+IL++ ++K GFLNEA  VFL
Sbjct: 125  VPANIVLERIIEARMPHLKILDSIITCFREFNWNNSVVFEILINAYKKKGFLNEAAGVFL 184

Query: 197  ASISGGFFPSLICCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVG 256
             + + GF   L+CCNSL++DLLKG  + LFW VY GM+EAK+VPDVYTYTN+INA+C+ G
Sbjct: 185  GAKNHGFVVGLVCCNSLLKDLLKGNRLELFWDVYNGMLEAKVVPDVYTYTNLINAYCRAG 244

Query: 257  DVMKGRMVLSEMEEKGCKPNLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYS 316
            +V  G+ +L +MEEKGC P+LVTYNV++GG CR GDV+EA ++K+ M++KGL PD +TY 
Sbjct: 245  NVKAGKSILFDMEEKGCNPSLVTYNVLLGGFCRAGDVDEAFKLKRTMVDKGLFPDNYTYG 304

Query: 317  ILIDGFCKQKRSEEAKLILESMLGSGLNPNHITYTALIDGFMKQGNIEEALRIKDEMVTR 376
             LIDGFCKQKRS EA+L+L+ M   GL P+ I YT+LIDGFMKQG+I EA ++K+EM+  
Sbjct: 305  ALIDGFCKQKRSREARLMLKEMYSVGLKPDPIAYTSLIDGFMKQGDIREAFQVKEEMLAH 364

Query: 377  GLKLNIVTYNTLIRGIAKAGEMEKAMALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKA 436
            G+KLN+ TYN LI G+ K  EMEKA AL +EM   GI+ DTQTY+ LI+GY K  N+ KA
Sbjct: 365  GIKLNLFTYNALIHGMCKVVEMEKAEALFSEMIAMGIKPDTQTYNCLIEGYYKEQNEAKA 424

Query: 437  YELLAEMKARNLMPSLYTYSVLINGLCRSRELPKANEVLEHMISHGVKPNAVIYATLINA 496
             ELL EM   NL P++YT  V+IN LC S EL +A  V  +MIS G+KPN V+Y TLI  
Sbjct: 425  NELLNEMMKSNLAPTVYTCGVIINALCCSGELGRATNVFRYMISKGLKPNVVLYTTLIKK 484

Query: 497  NVQESRYEGAKEVLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPN 556
             VQE  +EGA ++L+ M + GVVPD+FCYN++IIGLC+A ++E+A+   VEM +KG+KPN
Sbjct: 485  LVQEGAFEGAIKILEVMEEQGVVPDVFCYNTVIIGLCKAGKMEDARKYLVEMAKKGLKPN 544

Query: 557  AYTYGAFIHLYCKTGEIQVAERYFQDMLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFK 616
             YTYGAFIH YCKTG +Q A+RYF +ML   I PN+++Y+ALIDGHC  GNT  + + F+
Sbjct: 545  VYTYGAFIHGYCKTGAMQEADRYFTEMLGCGIDPNHVVYSALIDGHCKEGNTAASFAKFR 604

Query: 617  CMLEKGLIPDVQTYGALIHGLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKG 676
            CMLE+ ++PDVQ Y  LIHGL +NGK +EA  VFSE LDKGLVPDVF YN+LISGFCK+G
Sbjct: 605  CMLEQQVLPDVQIYCILIHGLLRNGKLQEATGVFSELLDKGLVPDVFTYNALISGFCKQG 664

Query: 677  EIEKASQLYEEMLLKGPNPNIVIYNTLINGLCKLGEIKDARELFDKIEGKGLVPNVVTYS 736
            ++++A +LYEEM  KG NPNIV YN LINGLCK G+I+ ARELFD I  KGLV N VTYS
Sbjct: 665  DLKRAFELYEEMFQKGINPNIVSYNALINGLCKFGDIERARELFDGIPSKGLVRNGVTYS 724

Query: 737  IIIDGYCKSGNLTEAFNLFDEMISKGVPLDRHIYCILIDGCCKQGNLEKALSLFHEALQK 796
             IIDGYCKSGNL EAF LFD M  +GVP D  +YC LIDGCCK+G+LEKA SLF + ++K
Sbjct: 725  TIIDGYCKSGNLNEAFQLFDGMAMEGVPPDSFVYCALIDGCCKEGSLEKAQSLFSQMVEK 784

Query: 797  SVASPSAFNSLIDGFCKLGKLIEARELFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAE 856
             +AS SAFN+LIDGFC+ GKLIEA +LF+D  DKH+TPN VTYTIL++ + +   M+EA+
Sbjct: 785  GLASISAFNALIDGFCRSGKLIEAYQLFEDQFDKHITPNHVTYTILIEYHCRVGRMKEAK 844

Query: 857  QLFLDMGTKNIMPNTLTYTSLLLGYNRIGHRIKMISLFKDMEARGIACDAITYGVMADVY 916
            +LFL+M  +N+MPN LTYT+LL GYNRIG R +M +LF +M AR I  D + + VM D Y
Sbjct: 845  KLFLEMQKRNLMPNILTYTTLLQGYNRIGSRSEMHTLFDEMIARDIEPDDMLWSVMIDAY 904

Query: 917  CKEGNSLEALKLLDKSLVEGIKLDGDVFDALIFHLCNEGKNSTMLKLLGEMAEKKLALTS 976
             +EGN ++ALKL+D  L++ + +  +V++ L   LC       +LKLL E+ E+   L  
Sbjct: 905  LQEGNWIKALKLVDDILLKDVNVGKNVYNVLTDILCTYNNVPKLLKLLNEIEEQGYNLNL 964

Query: 977  TTCTALLIGFYKAGNEDKALEVLDIMQRLGWVPDSLNVVDLVN--ARKND 1025
             TC  L+  F++AG  D+A++VLD M R GWVP S ++ D +N  ++K+D
Sbjct: 965  ATCRVLVCCFHRAGRTDEAVKVLDRMVRFGWVPASTDICDFINEDSKKSD 1011

BLAST of CmoCh04G002870 vs. NCBI nr
Match: gi|694450386|ref|XP_009350612.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial-like [Pyrus x bretschneideri])

HSP 1 Score: 1135.2 bits (2935), Expect = 0.0e+00
Identity = 557/999 (55.76%), Positives = 722/999 (72.27%), Query Frame = 1

Query: 33   LIANDVCTQFIFFSTAHPYDHNDDTVREISTILKL-SDWQVVLDNQNSLKKLNPEIVRSV 92
            LI  + CT     S    +D +D+TVREIST+L+  SDW  VL++ +  +KLNP +VR+V
Sbjct: 26   LIHIEYCTTTSQESETFKHD-DDETVREISTVLRNHSDWHFVLNSSDLPRKLNPHVVRAV 85

Query: 93   LQKN-EINDPVRLQSFFYWSSSRMGTPQNLHSYSILAIRLCNSGLFPRADNMFEKMLETR 152
            LQ+N ++ DP RL SFF W+ + +G PQNLHS+SILA+ LCNS +F +A+ + ++M++TR
Sbjct: 86   LQQNHQVGDPKRLLSFFLWTDTHLGFPQNLHSFSILAVVLCNSKMFEQANAVLDRMVKTR 145

Query: 153  KPPLEILDSLVKCYR--ECGGSNLIVFDILVDNFRKFGFLNEACSVFLASISGGFFPSLI 212
            KP  E+LDS+V C+R  ECGGS+ IVF+ L+  F+    LNEA  VFL     G  P L 
Sbjct: 146  KPVFEVLDSVVSCFRGGECGGSDKIVFEFLIRAFKAAWRLNEAADVFLGLRKVGILPRLD 205

Query: 213  CCNSLMRDLLKGKMMGLFWKVYGGMVEAKIVPDVYTYTNVINAHCKVGDVMKGRMVLSEM 272
            CCNSL+ DLLK   M LFWKVY GM+EAK+ PD YTY NVI+AHC+ G+  +G+  L EM
Sbjct: 206  CCNSLLNDLLKCNRMELFWKVYDGMLEAKMKPDFYTYYNVIHAHCRAGNAGQGKRFLLEM 265

Query: 273  EEKGCKPNLVTYNVVIGGLCRTGDVNEALEVKKLMMEKGLVPDGFTYSILIDGFCKQKRS 332
            EEKG  P+L TYNVVIGGLCR GDV+EAL VKK M+EKGLVPD +TYS L+DG C+ KR 
Sbjct: 266  EEKGGNPDLSTYNVVIGGLCRAGDVDEALAVKKSMVEKGLVPDRYTYSALVDGLCRTKRP 325

Query: 333  EEAKLILESMLGSGLNPNHITYTALIDGFMKQGNIEEALRIKDEMVTRGLKLNIVTYNTL 392
            EE KLIL+ M   GL+P+   YTALIDG MK+G +EEALRIKDE + RG KL   T N +
Sbjct: 326  EETKLILKYMYDKGLSPDSTCYTALIDGLMKEGYLEEALRIKDETIARGFKLCDATCNAI 385

Query: 393  IRGIAKAGEMEKAMALVNEMFITGIELDTQTYDLLIDGYLKSHNKDKAYELLAEMKARNL 452
              G+ K G MEKA  L+NEM + G   + QTY  LIDGY +  N  KA ELL EMK RN 
Sbjct: 386  FAGMCKVGRMEKAEVLLNEMNVMGTRPNAQTYKFLIDGYCREQNMVKACELLNEMKKRNF 445

Query: 453  MPSLYTYSVLINGLCRSRELPKANEVLEHMISHGVKPNAVIYATLINANVQESRYEGAKE 512
             P ++TY  +INGL R  ++  AN++L+ MI+ G+KP AVIY T+I  +VQE ++E A +
Sbjct: 446  APGVFTYGAIINGLSRCGDMEGANQLLKEMITRGLKPGAVIYTTVIRGHVQEGKFEEAIK 505

Query: 513  VLKGMVKNGVVPDLFCYNSLIIGLCRAKRVEEAKMMFVEMGEKGIKPNAYTYGAFIHLYC 572
            VLKGM K GV+PD FCYNSLIIGLC+A++++EA++ FVEM ++G+KPNAYTYGAFIH YC
Sbjct: 506  VLKGMTKKGVMPDAFCYNSLIIGLCKARKMDEARIYFVEMVDRGLKPNAYTYGAFIHGYC 565

Query: 573  KTGEIQVAERYFQDMLSSRIVPNNIIYTALIDGHCNVGNTVEALSTFKCMLEKGLIPDVQ 632
            K G++Q+A  YFQ+ML   I PN++IYTALIDGHC  GN  EA STF+CML +G++PD++
Sbjct: 566  KEGQMQLANTYFQEMLGCGIAPNDVIYTALIDGHCKDGNLTEAYSTFRCMLGRGVLPDIK 625

Query: 633  TYGALIHGLSKNGKTEEAMVVFSEYLDKGLVPDVFIYNSLISGFCKKGEIEKASQLYEEM 692
            TY  +IHGLSKNGK +EAM +FSE L K LVPDVF Y+SLISGFCK+G ++KA QL E+M
Sbjct: 626  TYSVIIHGLSKNGKIQEAMGIFSELLGKDLVPDVFTYSSLISGFCKQGNVDKAFQLLEQM 685

Query: 693  LLKGPNPNIVIYNTLINGLCKLGEIKDARELFDKIEGKGLVPNVVTYSIIIDGYCKSGNL 752
              +G +PNIV YN LINGLCK G+   ARELFD I  KGL PN VTY+ ++DGY KSG L
Sbjct: 686  CRRGVDPNIVTYNALINGLCKSGDTDRARELFDGISRKGLSPNAVTYATMMDGYSKSGKL 745

Query: 753  TEAFNLFDEMISKGVPLDRHIYCILIDGCCKQGNLEKALSLFHEALQKSVASPSAFNSLI 812
            TEAF L DEM+ +G+P D  IYCILIDGCCK G++E+A+SLF + + K +A+ S FN+LI
Sbjct: 746  TEAFQLLDEMLLRGIPTDSFIYCILIDGCCKAGDMERAVSLFQDIVGKGIAATSPFNALI 805

Query: 813  DGFCKLGKLIEARELFDDTVDKHVTPNSVTYTILVDAYSKAEMMEEAEQLFLDMGTKNIM 872
            DGFCKLG+++EA  L +D VDKHVTPN VTYTIL+ +  K  +M E+EQLFL+M  +N+ 
Sbjct: 806  DGFCKLGRMVEANRLLEDMVDKHVTPNHVTYTILIVSLCKEGLMRESEQLFLEMQKRNLT 865

Query: 873  PNTLTYTSLLLGYNRIGHRIKMISLFKDMEARGIACDAITYGVMADVYCKEGNSLEALKL 932
            PN LTYTSLL GYN  G R KM SLF +M  RG+  D +TY +M D YCKEG+ ++ LKL
Sbjct: 866  PNILTYTSLLHGYNSTGSRYKMFSLFDEMVTRGLKPDEVTYRMMVDAYCKEGDLVKCLKL 925

Query: 933  LDKSLVEGIKLDGDVFDALIFHLCNEGKNSTMLKLLGEMAEKKLALTSTTCTALLIGFYK 992
            +D++LV G   +  V DAL   L    + S ++K L EM E    L+  TC+ L+ GF+K
Sbjct: 926  VDETLVNGAISNSAVVDALTSTLFRREEFSEIMKSLEEMVEHGFMLSLATCSTLVRGFHK 985

Query: 993  AGNEDKALEVLDIMQRLGWVPDSLNVVDLVNARKNDMNS 1028
             GN +KA  + + M R GWV  S N+ DL++  +++++S
Sbjct: 986  LGNAEKAARIFESMLRFGWVSHSTNLDDLIHEDQSEVSS 1023

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
PP442_ARATH    1.1e-211    40.62    Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidop...
PP437_ARATH    5.2e-118    30.50    Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
RF1_ORYSI    1.9e-107    31.85    Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica GN=Rf1 PE=2 SV=1
PP432_ARATH    3.5e-106    28.29    Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana GN...
PP325_ARATH    3.2e-99    31.34    Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidop...

Match Name    E-value    Identity    Description
A0A0A0KPZ1_CUCSA    0.0e+00    75.97    Uncharacterized protein OS=Cucumis sativus GN=Csa_5G175760 PE=4 SV=1
V4TAC8_9ROSI    0.0e+00    54.20    Uncharacterized protein (Fragment) OS=Citrus clementina GN=CICLE_v10033858mg PE=...
A0A061G0B2_THECC    0.0e+00    53.10    Pentatricopeptide repeat superfamily protein, putative isoform 1 OS=Theobroma ca...
B9RA74_RICCO    0.0e+00    55.08    Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
A0A067E7X7_CITSI    1.9e-308    52.68    Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g001797mg PE=4 SV=1

Match Name    E-value    Identity    Description
AT5G61990.1    6.0e-213    40.62    Pentatricopeptide repeat (PPR) superfamily protein
AT5G59900.1    3.0e-119    30.50    Pentatricopeptide repeat (PPR) superfamily protein
AT5G55840.1    2.0e-107    28.29    Pentatricopeptide repeat (PPR) superfamily protein
AT4G19440.1    1.8e-100    31.34    Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G65560.1    6.9e-100    28.90    Pentatricopeptide repeat (PPR) superfamily protein

Match Name    E-value    Identity    Description
gi|659090168|ref|XP_008445872.1|    0.0e+00    76.56    PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial ...
gi|449451896|ref|XP_004143696.1|    0.0e+00    75.97    PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial ...
gi|645268503|ref|XP_008239557.1|    0.0e+00    57.33    PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial ...
gi|802770320|ref|XP_012090594.1|    0.0e+00    55.15    PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial ...
gi|694450386|ref|XP_009350612.1|    0.0e+00    55.76    PREDICTED: pentatricopeptide repeat-containing protein At5g61990, mitochondrial-...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
IPR002885    Pentatricopeptide_repeat
IPR011990    TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term    Definition
GO:0005515    protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
CmoCh04G002870.1    CmoCh04G002870.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR002885    Pentatricopeptide repeat    PFAM    PF01535    PPR
    coord: 907..936    score: 4.6E-4
    coord: 803..826    score: 4.2E-6
    coord: 488..518    score: 0.0052
    coord: 122..148    score: 0.83
    coord: 978..1006    score: 0.
IPR002885    Pentatricopeptide repeat    PFAM    PF12854    PPR_1
    coord: 761..792    score: 5.4E-7
    coord: 656..688    score: 2.3
IPR002885    Pentatricopeptide repeat    PFAM    PF13041    PPR_2
    coord: 345..394    score: 6.3E-16
    coord: 695..744    score: 1.7E-18
    coord: 205..254    score: 9.4E-8
    coord: 416..464    score: 4.9E-15
    coord: 834..881    score: 4.3E-13
    coord: 590..639    score: 1.5E-11
    coord: 275..324    score: 1.4E-17
    coord: 520..569    score: 6.3
IPR002885    Pentatricopeptide repeat    TIGRFAMs    TIGR00756    TIGR00756
    coord: 278..311    score: 1.2E-10
    coord: 629..662    score: 1.0E-7
    coord: 383..416    score: 1.1E-7
    coord: 907..940    score: 5.9E-4
    coord: 348..381    score: 7.2E-9
    coord: 313..346    score: 5.1E-8
    coord: 873..906    score: 0.0018
    coord: 769..797    score: 9.3E-8
    coord: 419..451    score: 4.1E-4
    coord: 837..871    score: 7.2E-7
    coord: 523..557    score: 3.7E-10
    coord: 488..521    score: 3.6E-5
    coord: 803..835    score: 1.2E-5
    coord: 594..627    score: 2.7E-6
    coord: 733..766    score: 1.8E-10
    coord: 453..487    score: 6.3E-8
    coord: 664..696    score: 6.7E-9
    coord: 558..591    score: 1.4E-7
    coord: 243..276    score: 6.0E-9
    coord: 978..1011    score: 7.3E-6
    coord: 698..732    score: 8.0
IPR002885    Pentatricopeptide repeat    PROFILE    PS51375    PPR
    coord: 486..520    score: 10.348
    coord: 731..765    score: 13.943
    coord: 835..869    score: 11.904
    coord: 241..275    score: 12.485
    coord: 119..153    score: 8.999
    coord: 870..904    score: 9.175
    coord: 346..380    score: 12.036
    coord: 311..345    score: 12.222
    coord: 661..695    score: 13.526
    coord: 626..660    score: 12.31
    coord: 451..485    score: 12.452
    coord: 171..205    score: 7.476
    coord: 591..625    score: 11.213
    coord: 556..590    score: 11.477
    coord: 905..939    score: 9.931
    coord: 276..310    score: 13.318
    coord: 696..730    score: 13.351
    coord: 975..1009    score: 10.304
    coord: 521..555    score: 13.165
    coord: 766..796    score: 10.227
    coord: 940..974    score: 7.607
    coord: 800..834    score: 11.531
    coord: 206..240    score: 7.18
    coord: 416..450    score: 11.444
    coord: 381..415    score: 11
IPR011990    Tetratricopeptide-like helical domain    GENE3D    G3DSA:1.25.40.10
    coord: 117..170    score: 2.1E-4
    coord: 458..722    score: 3.2E-11
    coord: 741..933    score: 1.3E-11
    coord: 314..407    score: 2.
None    No IPR available    PANTHER    PTHR24015    FAMILY NOT NAMED
    coord: 211..399    score: 5.1E-260
    coord: 505..777    score: 5.1E-260
    coord: 812..911    score: 5.1E
None    No IPR available    PANTHER    PTHR24015:SF847    SUBFAMILY NOT NAMED
    coord: 211..399    score: 5.1E-260
    coord: 505..777    score: 5.1E-260
    coord: 812..911    score: 5.1E
None    No IPR available    unknown    SSF81901    HCP-like
    coord: 604..826    score: 4.45
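
Each `coord: start..end` entry above appears to give a residue range on the protein. Assuming the range is 1-based and inclusive (the usual convention for domain coordinates, though not stated in this record), the length of a hit is end - start + 1. The Python sketch below extracts those lengths from annotation text; `span_lengths` is a hypothetical helper name and the sample string reuses three ranges from the PS51375 row.

```python
import re

def span_lengths(text):
    """Return the inclusive length of every 'coord: start..end' range in text."""
    pairs = re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)
    # Inclusive coordinates: a range 486..520 covers 520 - 486 + 1 residues.
    return [int(end) - int(start) + 1 for start, end in pairs]

sample = "coord: 486..520 coord: 731..765 coord: 766..796"
print(span_lengths(sample))  # [35, 35, 31]
```

Under that assumption, most PS51375 hits in the table span 35 residues, which matches the 35-residue length of a pentatricopeptide repeat motif.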