CmoCh16G011170 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G011170
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing protein
Location: Cmo_Chr16 : 7919269 .. 7921386 (+)
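
The location field above can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this record does not state explicitly); `parse_location` and `feature_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cmo_Chr16 : 7919269 .. 7921386 (+)")
length = feature_length(start, end)
print(seqid, strand, length, length // 3)  # Cmo_Chr16 + 2118 706
```

Under that assumption the feature spans 2118 bp, which is 706 codons including the stop codon.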
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGCTCCTCTTTCTTCCTCTCTCGATATCAAACTCAAACTCAAACCAACGCCGCCCTTGTTCTTCACTTCCCCTCTCCGGCGAAACAATTTCACCAAGCGATTTACAGTTCTCTGTACCTCCTCCTCCAAATCCCCTCGATCCACCGACAAAAAGAACCCATCTCTATCGGAGCAGCTCAAAGACCTCTCCACATCGACTCTTTCCAATGCATCCAACGACGAATCCCATCTCTTATCGAACCCTAAATCCATTTGGGTGAATCCCACCAAGCCCAAGCGCTCGGTTCTGTCACTCCAGAGACAAAAACGTTCTTCTTACTCTTATAACCCCAAGATGCGAGAGCTTAAAACCTTCGCCCATAAGCTCAATGCATCCGATTCCTCTGAAGCTGCTTTCATGGCGGTTCTTAAGGAAATCCCTCATCCACCCACTAAAGAAAATGCCCTTCTCATTCTCAATAGCTTGAAGCCATGGCAGAAAACCCATTTGTTCTTCAATTGGATTAAGACCCAGAATCTGTTTCCTATGGAGACTATCTTCTACAATGTGGCTATGAAGTCTTTGAGGTATGGTAGGCAGTTTCAGCTCATTGAAGAACTAGCAAATGAGATGATTAACACTGGGATTGAGCTTGATAACATTACTTATTCTACTATTATCACTTGTGCTAAGAAGTGCAGTAGGTTTGATAAGGCAATGGAATGGTTTGAGAGAATGTATAGAACTGGTTTGATGCCCGATGAGGTTACTTACTCTGCTATTTTAGATGTTTATGCAAATTTGGGGAAGGTTGAGGAGGCTCTTAGTTTGTATGAGAGAGGGAGGGCTAGTGGTTGGAAGCCTGATCCTTATACGTTTTCTGTGTTGGGGAAGATGTTTGGAGAGGCAGGGGATTATGATGGGATTATGTATGTTTTGCAAGAAATGAAGTCTATTGAGGTGCAGCCTAATCTTGTGGTGTATAACACCTTGTTGGATGCAATGGGGAAGGCTGGGAGGCCTGGTTTTGCAAGAAGCTTGTTCGAGGAAATGATTGAATCGGGGATAACGCCGAATGAGAAGACGTTAACTGCTCTGGTTAAGATTTATGGGAAGGCAAGGTGGGCTCGGGATGCTTTGGAGCTGTGGGAGCGGATGAGGTCGAAAGGGTGGCCGATGGACTTCATTTTGTATAATACATTGTTGAATATGTGTGCAGATCTTGGTTTGGAGGAGGAAGCTGAGAAGCTATTTGAAGAGATGAAGAAATCGGAGCATTCTAGACCGGATAGTTGGAGCTATACGGCGATGTTGAATATACATGGTAGCGGAGGTAATGTAAAAAGATCGATGGAGTTGTTTGAAGAAATGCTCGAGTTGGGTGTTGGGATTAATGTGATGTGTTGCACTTGTTTGATTCAATGCTTGGGGAAGGCTAGGAGGATCGATGATCTAGTTCGAGTTTTCGACGTTTCTGTACGAAAAGGAGTTGAGCCAGATGACAGGCTTTGTGGTTGCTTGTTGTCTGTTGTGTCCTTGTGTGACAACAATGAAGATATTAGCAAGGTATTCACTTGTCTACAACAAGCTAACCCAAAGTTAGTTGCCTTTGTAAATCTACTGCAACAAAATGACATTACCTTCGACGTTATCAAAGACGAATTCAGAACCATTCTCGGCGAGACCGCCACCGAAGCCCGACGACCTTTCTGCAATTGCTTGATTGATATATGTCGAAACCAAAATCTTTCCAAGAGAGCTCACGAGCTGCTCTACTTGGGAAGTTTATATGGACTGTACCCAGGCCTACACAACAAAACCGAAGGTGAATGGTGCCTAGATGTTCGATCTCTATCGGTAGGCGCAGCTCAGACTGCACTCGAAGAATGGATGATAACTCTATCAAAAATCGTACAACGAGAAGAAGCATTACCTGAATTGTTATCAGCTCAAACCGGTGCAGGAACTCACAGGTTTTCTCAAGGACTTGCCAATTCATTCGCTTCTCATGTCGA
GAAGCTAGCTGCTCCGTTTCGATTGCGAGAAGATCGGGCTGGTTGGTTTGTAGCCACGAGGGAGGATGTCGTTGCTTGGGTACATTCAAGAGTACCATCTGTGGCTACCAGAGCTTAA

mRNA sequence

ATGGCGGCTCCTCTTTCTTCCTCTCTCGATATCAAACTCAAACTCAAACCAACGCCGCCCTTGTTCTTCACTTCCCCTCTCCGGCGAAACAATTTCACCAAGCGATTTACAGTTCTCTGTACCTCCTCCTCCAAATCCCCTCGATCCACCGACAAAAAGAACCCATCTCTATCGGAGCAGCTCAAAGACCTCTCCACATCGACTCTTTCCAATGCATCCAACGACGAATCCCATCTCTTATCGAACCCTAAATCCATTTGGGTGAATCCCACCAAGCCCAAGCGCTCGGTTCTGTCACTCCAGAGACAAAAACGTTCTTCTTACTCTTATAACCCCAAGATGCGAGAGCTTAAAACCTTCGCCCATAAGCTCAATGCATCCGATTCCTCTGAAGCTGCTTTCATGGCGGTTCTTAAGGAAATCCCTCATCCACCCACTAAAGAAAATGCCCTTCTCATTCTCAATAGCTTGAAGCCATGGCAGAAAACCCATTTGTTCTTCAATTGGATTAAGACCCAGAATCTGTTTCCTATGGAGACTATCTTCTACAATGTGGCTATGAAGTCTTTGAGGTATGGTAGGCAGTTTCAGCTCATTGAAGAACTAGCAAATGAGATGATTAACACTGGGATTGAGCTTGATAACATTACTTATTCTACTATTATCACTTGTGCTAAGAAGTGCAGTAGGTTTGATAAGGCAATGGAATGGTTTGAGAGAATGTATAGAACTGGTTTGATGCCCGATGAGGTTACTTACTCTGCTATTTTAGATGTTTATGCAAATTTGGGGAAGGTTGAGGAGGCTCTTAGTTTGTATGAGAGAGGGAGGGCTAGTGGTTGGAAGCCTGATCCTTATACGTTTTCTGTGTTGGGGAAGATGTTTGGAGAGGCAGGGGATTATGATGGGATTATGTATGTTTTGCAAGAAATGAAGTCTATTGAGGTGCAGCCTAATCTTGTGGTGTATAACACCTTGTTGGATGCAATGGGGAAGGCTGGGAGGCCTGGTTTTGCAAGAAGCTTGTTCGAGGAAATGATTGAATCGGGGATAACGCCGAATGAGAAGACGTTAACTGCTCTGGTTAAGATTTATGGGAAGGCAAGGTGGGCTCGGGATGCTTTGGAGCTGTGGGAGCGGATGAGGTCGAAAGGGTGGCCGATGGACTTCATTTTGTATAATACATTGTTGAATATGTGTGCAGATCTTGGTTTGGAGGAGGAAGCTGAGAAGCTATTTGAAGAGATGAAGAAATCGGAGCATTCTAGACCGGATAGTTGGAGCTATACGGCGATGTTGAATATACATGGTAGCGGAGGTAATGTAAAAAGATCGATGGAGTTGTTTGAAGAAATGCTCGAGTTGGGTGTTGGGATTAATGTGATGTGTTGCACTTGTTTGATTCAATGCTTGGGGAAGGCTAGGAGGATCGATGATCTAGTTCGAGTTTTCGACGTTTCTGTACGAAAAGGAGTTGAGCCAGATGACAGGCTTTGTGGTTGCTTGTTGTCTGTTGTGTCCTTGTGTGACAACAATGAAGATATTAGCAAGGTATTCACTTGTCTACAACAAGCTAACCCAAAGTTAGTTGCCTTTGTAAATCTACTGCAACAAAATGACATTACCTTCGACGTTATCAAAGACGAATTCAGAACCATTCTCGGCGAGACCGCCACCGAAGCCCGACGACCTTTCTGCAATTGCTTGATTGATATATGTCGAAACCAAAATCTTTCCAAGAGAGCTCACGAGCTGCTCTACTTGGGAAGTTTATATGGACTGTACCCAGGCCTACACAACAAAACCGAAGGTGAATGGTGCCTAGATGTTCGATCTCTATCGGTAGGCGCAGCTCAGACTGCACTCGAAGAATGGATGATAACTCTATCAAAAATCGTACAACGAGAAGAAGCATTACCTGAATTGTTATCAGCTCAAACCGGTGCAGGAACTCACAGGTTTTCTCAAGGACTTGCCAATTCATTCGCTTCTCATGTCGA
GAAGCTAGCTGCTCCGTTTCGATTGCGAGAAGATCGGGCTGGTTGGTTTGTAGCCACGAGGGAGGATGTCGTTGCTTGGGTACATTCAAGAGTACCATCTGTGGCTACCAGAGCTTAA

Coding sequence (CDS)

ATGGCGGCTCCTCTTTCTTCCTCTCTCGATATCAAACTCAAACTCAAACCAACGCCGCCCTTGTTCTTCACTTCCCCTCTCCGGCGAAACAATTTCACCAAGCGATTTACAGTTCTCTGTACCTCCTCCTCCAAATCCCCTCGATCCACCGACAAAAAGAACCCATCTCTATCGGAGCAGCTCAAAGACCTCTCCACATCGACTCTTTCCAATGCATCCAACGACGAATCCCATCTCTTATCGAACCCTAAATCCATTTGGGTGAATCCCACCAAGCCCAAGCGCTCGGTTCTGTCACTCCAGAGACAAAAACGTTCTTCTTACTCTTATAACCCCAAGATGCGAGAGCTTAAAACCTTCGCCCATAAGCTCAATGCATCCGATTCCTCTGAAGCTGCTTTCATGGCGGTTCTTAAGGAAATCCCTCATCCACCCACTAAAGAAAATGCCCTTCTCATTCTCAATAGCTTGAAGCCATGGCAGAAAACCCATTTGTTCTTCAATTGGATTAAGACCCAGAATCTGTTTCCTATGGAGACTATCTTCTACAATGTGGCTATGAAGTCTTTGAGGTATGGTAGGCAGTTTCAGCTCATTGAAGAACTAGCAAATGAGATGATTAACACTGGGATTGAGCTTGATAACATTACTTATTCTACTATTATCACTTGTGCTAAGAAGTGCAGTAGGTTTGATAAGGCAATGGAATGGTTTGAGAGAATGTATAGAACTGGTTTGATGCCCGATGAGGTTACTTACTCTGCTATTTTAGATGTTTATGCAAATTTGGGGAAGGTTGAGGAGGCTCTTAGTTTGTATGAGAGAGGGAGGGCTAGTGGTTGGAAGCCTGATCCTTATACGTTTTCTGTGTTGGGGAAGATGTTTGGAGAGGCAGGGGATTATGATGGGATTATGTATGTTTTGCAAGAAATGAAGTCTATTGAGGTGCAGCCTAATCTTGTGGTGTATAACACCTTGTTGGATGCAATGGGGAAGGCTGGGAGGCCTGGTTTTGCAAGAAGCTTGTTCGAGGAAATGATTGAATCGGGGATAACGCCGAATGAGAAGACGTTAACTGCTCTGGTTAAGATTTATGGGAAGGCAAGGTGGGCTCGGGATGCTTTGGAGCTGTGGGAGCGGATGAGGTCGAAAGGGTGGCCGATGGACTTCATTTTGTATAATACATTGTTGAATATGTGTGCAGATCTTGGTTTGGAGGAGGAAGCTGAGAAGCTATTTGAAGAGATGAAGAAATCGGAGCATTCTAGACCGGATAGTTGGAGCTATACGGCGATGTTGAATATACATGGTAGCGGAGGTAATGTAAAAAGATCGATGGAGTTGTTTGAAGAAATGCTCGAGTTGGGTGTTGGGATTAATGTGATGTGTTGCACTTGTTTGATTCAATGCTTGGGGAAGGCTAGGAGGATCGATGATCTAGTTCGAGTTTTCGACGTTTCTGTACGAAAAGGAGTTGAGCCAGATGACAGGCTTTGTGGTTGCTTGTTGTCTGTTGTGTCCTTGTGTGACAACAATGAAGATATTAGCAAGGTATTCACTTGTCTACAACAAGCTAACCCAAAGTTAGTTGCCTTTGTAAATCTACTGCAACAAAATGACATTACCTTCGACGTTATCAAAGACGAATTCAGAACCATTCTCGGCGAGACCGCCACCGAAGCCCGACGACCTTTCTGCAATTGCTTGATTGATATATGTCGAAACCAAAATCTTTCCAAGAGAGCTCACGAGCTGCTCTACTTGGGAAGTTTATATGGACTGTACCCAGGCCTACACAACAAAACCGAAGGTGAATGGTGCCTAGATGTTCGATCTCTATCGGTAGGCGCAGCTCAGACTGCACTCGAAGAATGGATGATAACTCTATCAAAAATCGTACAACGAGAAGAAGCATTACCTGAATTGTTATCAGCTCAAACCGGTGCAGGAACTCACAGGTTTTCTCAAGGACTTGCCAATTCATTCGCTTCTCATGTCGA
GAAGCTAGCTGCTCCGTTTCGATTGCGAGAAGATCGGGCTGGTTGGTTTGTAGCCACGAGGGAGGATGTCGTTGCTTGGGTACATTCAAGAGTACCATCTGTGGCTACCAGAGCTTAA
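
The coding sequence above can be translated to protein with the standard genetic code. The sketch below builds the codon table by hand so that it needs no third-party library; `translate` is a hypothetical helper, and the two inputs are the first 21 and last 18 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGCGGCTCCTCTTTCTTCC"))  # MAAPLSS
print(translate("GTGGCTACCAGAGCTTAA"))     # VATRA*
```

Both fragments agree with the Query lines in the BLAST reports on this page, which begin with MAAPLSS and end with VATRA.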
BLAST of CmoCh16G011170 vs. Swiss-Prot
Match: PP420_ARATH (Pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Arabidopsis thaliana GN=At5g46580 PE=2 SV=1)

HSP 1 Score: 971.5 bits (2510), Expect = 5.1e-282
Identity = 471/713 (66.06%), Positives = 593/713 (83.17%), Query Frame = 1

Query: 1   MAAPLSSSLDIKLKLKPTPP----LFFTSPLRRNNFTKRFTVLCTSSSKSPRSTDK---- 60
           MA  L++++D+    + +      LF    L R + +++  + C SS K P++ ++    
Sbjct: 1   MATVLTTAIDVCFNPQNSDTKKHSLFLKPSLFRQSRSRKLNISC-SSLKQPKTLEEEPIT 60

Query: 61  -KNPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYN 120
            K PSLSEQLK LS +TL     +++ +LS PKS+WVNPT+PKRSVLSLQRQKRS+YSYN
Sbjct: 61  TKTPSLSEQLKPLSATTLRQ---EQTQILSKPKSVWVNPTRPKRSVLSLQRQKRSAYSYN 120

Query: 121 PKMRELKTFAHKLNASDSSEAA-FMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWI 180
           P++++L+ FA KLN+S  +E + F+++L EIPHPP ++NALL+LNSL+ WQKTH FFNW+
Sbjct: 121 PQIKDLRAFALKLNSSIFTEKSEFLSLLDEIPHPPNRDNALLVLNSLREWQKTHTFFNWV 180

Query: 181 KTQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSR 240
           K+++LFPMETIFYNV MKSLR+GRQFQLIEE+A EM+  G+ELDNITYSTIITCAK+C+ 
Sbjct: 181 KSKSLFPMETIFYNVTMKSLRFGRQFQLIEEMALEMVKDGVELDNITYSTIITCAKRCNL 240

Query: 241 FDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSV 300
           ++KA+EWFERMY+TGLMPDEVTYSAILDVY+  GKVEE LSLYER  A+GWKPD   FSV
Sbjct: 241 YNKAIEWFERMYKTGLMPDEVTYSAILDVYSKSGKVEEVLSLYERAVATGWKPDAIAFSV 300

Query: 301 LGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESG 360
           LGKMFGEAGDYDGI YVLQEMKS++V+PN+VVYNTLL+AMG+AG+PG ARSLF EM+E+G
Sbjct: 301 LGKMFGEAGDYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLFNEMLEAG 360

Query: 361 ITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAE 420
           +TPNEKTLTALVKIYGKARWARDAL+LWE M++K WPMDFILYNTLLNMCAD+GLEEEAE
Sbjct: 361 LTPNEKTLTALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADIGLEEEAE 420

Query: 421 KLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQC 480
           +LF +MK+S   RPD++SYTAMLNI+GSGG  +++MELFEEML+ GV +NVM CTCL+QC
Sbjct: 421 RLFNDMKESVQCRPDNFSYTAMLNIYGSGGKAEKAMELFEEMLKAGVQVNVMGCTCLVQC 480

Query: 481 LGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLV 540
           LGKA+RIDD+V VFD+S+++GV+PDDRLCGCLLSV++LC+++ED  KV  CL++AN KLV
Sbjct: 481 LGKAKRIDDVVYVFDLSIKRGVKPDDRLCGCLLSVMALCESSEDAEKVMACLERANKKLV 540

Query: 541 AFVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGS 600
            FVNL+      ++ +K+EF+ ++  T  EARRPFCNCLIDICR  N  +RAHELLYLG+
Sbjct: 541 TFVNLIVDEKTEYETVKEEFKLVINATQVEARRPFCNCLIDICRGNNRHERAHELLYLGT 600

Query: 601 LYGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAG 660
           L+GLYPGLHNKT  EW LDVRSLSVGAA+TALEEWM TL+ I++R+E LPEL  AQTG G
Sbjct: 601 LFGLYPGLHNKTIKEWSLDVRSLSVGAAETALEEWMRTLANIIKRQEELPELFLAQTGTG 660

Query: 661 THRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVAT 704
           THRFSQGLANSFA H+++L+APFR + DR G FVAT+ED+V+W+ S+ P + T
Sbjct: 661 THRFSQGLANSFALHLQQLSAPFR-QSDRPGIFVATKEDLVSWLESKFPPLVT 708
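
The two summary lines that open each HSP on this page can be parsed and cross-checked programmatically. The following is a rough sketch, assuming the line layout seen here; `parse_hsp` and the two regular expressions are hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that appears in this report.

```python
import re

HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
HSP_COUNTS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp(score_line, counts_line):
    """Extract scores and recompute percentages from one HSP summary."""
    s = HSP_SCORE.search(score_line)
    c = HSP_COUNTS.search(counts_line)
    if s is None or c is None:
        raise ValueError("lines do not look like an HSP summary")
    ident, ident_len, pos, pos_len = map(int, c.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "pct_identity": round(100 * ident / ident_len, 2),
        "pct_positive": round(100 * pos / pos_len, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 971.5 bits (2510), Expect = 5.1e-282",
    "Identity = 471/713 (66.06%), Postives = 593/713 (83.17%), Query Frame = 1",
)
print(hsp["pct_identity"], hsp["pct_positive"])  # 66.06 83.17
```

The recomputed percentages match the values printed in the PP420_ARATH summary (471/713 and 593/713).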

BLAST of CmoCh16G011170 vs. Swiss-Prot
Match: PP314_ARATH (Pentatricopeptide repeat-containing protein At4g16390, chloroplastic OS=Arabidopsis thaliana GN=P67 PE=1 SV=3)

HSP 1 Score: 456.8 bits (1174), Expect = 4.2e-127
Identity = 241/615 (39.19%), Positives = 373/615 (60.65%), Query Frame = 1

Query: 86  IWVNPTKPKRSVLSLQRQKRSSYSYNPKMRELKTFAHKLNASDSSEAAFMAVLKEIPHPP 145
           +WVNP  P+ S L   R+K    SY+ +   L   A  L+A   +EA    V+       
Sbjct: 88  VWVNPKSPRASQL---RRK----SYDSRYSSLIKLAESLDACKPNEADVCDVITGFGGKL 147

Query: 146 TKENALLILNSLKPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEELANE 205
            +++A++ LN++   +   L  N +        E I YNV MK  R  +  +  E+L +E
Sbjct: 148 FEQDAVVTLNNMTNPETAPLVLNNLLETMKPSREVILYNVTMKVFRKSKDLEKSEKLFDE 207

Query: 206 MINTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGK 265
           M+  GI+ DN T++TII+CA++     +A+EWFE+M   G  PD VT +A++D Y   G 
Sbjct: 208 MLERGIKPDNATFTTIISCARQNGVPKRAVEWFEKMSSFGCEPDNVTMAAMIDAYGRAGN 267

Query: 266 VEEALSLYERGRASGWKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNT 325
           V+ ALSLY+R R   W+ D  TFS L +++G +G+YDG + + +EMK++ V+PNLV+YN 
Sbjct: 268 VDMALSLYDRARTEKWRIDAVTFSTLIRIYGVSGNYDGCLNIYEEMKALGVKPNLVIYNR 327

Query: 326 LLDAMGKAGRPGFARSLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKG 385
           L+D+MG+A RP  A+ +++++I +G TPN  T  ALV+ YG+AR+  DAL ++  M+ KG
Sbjct: 328 LIDSMGRAKRPWQAKIIYKDLITNGFTPNWSTYAALVRAYGRARYGDDALAIYREMKEKG 387

Query: 386 WPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRS 445
             +  ILYNTLL+MCAD    +EA ++F++MK  E   PDSW++++++ ++   G V  +
Sbjct: 388 LSLTVILYNTLLSMCADNRYVDEAFEIFQDMKNCETCDPDSWTFSSLITVYACSGRVSEA 447

Query: 446 MELFEEMLELGVGINVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSV 505
                +M E G    +   T +IQC GKA+++DD+VR FD  +  G+ PDDR CGCLL+V
Sbjct: 448 EAALLQMREAGFEPTLFVLTSVIQCYGKAKQVDDVVRTFDQVLELGITPDDRFCGCLLNV 507

Query: 506 VSLCDNNEDISKVFTCLQQANPKLVAFVNLL-QQNDITFDVIKDEFRTILGETATEARRP 565
           ++    +E+I K+  C+++A PKL   V +L ++ +    V K E   ++    ++ ++ 
Sbjct: 508 MTQTP-SEEIGKLIGCVEKAKPKLGQVVKMLVEEQNCEEGVFKKEASELIDSIGSDVKKA 567

Query: 566 FCNCLIDICRNQNLSKRAHELLYLGSLYGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEE 625
           + NCLID+C N N  +RA E+L LG  Y +Y GL +K+  +W L ++SLS+GAA TAL  
Sbjct: 568 YLNCLIDLCVNLNKLERACEILQLGLEYDIYTGLQSKSATQWSLHLKSLSLGAALTALHV 627

Query: 626 WMITLSK-IVQREEALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFRLREDRAGW 685
           WM  LS+  ++  E  P LL   TG G H++S +GLA  F SH+++L APF    D+ GW
Sbjct: 628 WMNDLSEAALESGEEFPPLLGINTGHGKHKYSDKGLAAVFESHLKELNAPFHEAPDKVGW 687

Query: 686 FVATREDVVAWVHSR 698
           F+ T     AW+ SR
Sbjct: 688 FLTTSVAAKAWLESR 694
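
In each alignment row, the middle line repeats the residue where Query and Sbjct are identical and shows `+` where the substitution scores positively. A minimal sketch of how identities and positives could be tallied from one row follows; `midline_counts` is a hypothetical helper, and the example uses the final row of the PP314_ARATH alignment above with its leading coordinates and padding stripped.

```python
def midline_counts(query, midline, sbjct):
    """Return (identities, positives, gaps) for one BLAST alignment row."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("the three rows must have equal length")
    # Identities: same residue in both sequences, ignoring gap columns.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # Positives: every non-blank midline column (identities plus '+').
    positives = sum(1 for m in midline if m != " ")
    gaps = query.count("-") + sbjct.count("-")
    return identities, positives, gaps

row = midline_counts(
    "FVATREDVVAWVHSR",
    "F+ T     AW+ SR",
    "FLTTSVAAKAWLESR",
)
print(row)  # (6, 8, 0)
```

Summing these per-row counts over a whole HSP should reproduce the Identity and Positives numerators in its summary line, provided the rows are copied without losing midline spaces.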

BLAST of CmoCh16G011170 vs. Swiss-Prot
Match: PP123_ARATH (Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN=At1g74750 PE=2 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 7.1e-42
Identity = 117/493 (23.73%), Positives = 223/493 (45.23%), Query Frame = 1

Query: 210 GIELDNITYSTIITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEA 269
           G + D  TY+T++    +  +F +  +  + M R G  P+ VTY+ ++  Y     ++EA
Sbjct: 354 GFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTVTYNRLIHSYGRANYLKEA 413

Query: 270 LSLYERGRASGWKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDA 329
           ++++ + + +G +PD  T+  L  +  +AG  D  M + Q M+   + P+   Y+ +++ 
Sbjct: 414 MNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQEAGLSPDTFTYSVIINC 473

Query: 330 MGKAGRPGFARSLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMD 389
           +GKAG    A  LF EM+  G TPN  T   ++ ++ KAR    AL+L+  M++ G+  D
Sbjct: 474 LGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNYETALKLYRDMQNAGFQPD 533

Query: 390 FILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELF 449
            + Y+ ++ +    G  EEAE +F EM++     PD   Y  ++++ G  GNV ++ + +
Sbjct: 534 KVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWV-PDEPVYGLLVDLWGKAGNVDKAWQWY 593

Query: 450 EEMLELGVGINVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLC 509
           + ML+ G+  NV  C  L+    +  R+ +   +    +  G+ P  +    LLS  +  
Sbjct: 594 QAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGLHPSLQTYTLLLSCCTDA 653

Query: 510 DNNEDISKVFTCLQQANPKLVAFVNLLQQNDITFDVIKD---EFRTILGETATEARRPFC 569
            +N D+      +  +      F+  +         ++D    F   +     E++R   
Sbjct: 654 RSNFDMGFCGQLMAVSGHPAHMFLLKMPPAGPDGQKVRDHVSNFLDFMHSEDRESKRGLM 713

Query: 570 NCLIDICRNQNLSKRAHELLYLGSLYGLYP-GLHNKTEGEWCLDVRSLSVGAAQTALEEW 629
           + ++D      L + A  +  + +   +YP  L  K+   W +++  +S G A  AL   
Sbjct: 714 DAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIALSRT 773

Query: 630 MITLSKIVQREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLA----APFRLREDRAG 689
           +    K +      P  +   TG G      G  +     VE+L      PF      +G
Sbjct: 774 LAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTG-TSMVRQAVEELLNIFNFPFFTENGNSG 833

Query: 690 WFVATREDVVAWV 695
            FV + E +  W+
Sbjct: 834 CFVGSGEPLKNWL 844

BLAST of CmoCh16G011170 vs. Swiss-Prot
Match: PPR49_ARATH (Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana GN=At1g18900 PE=2 SV=1)

HSP 1 Score: 167.9 bits (424), Expect = 3.9e-40
Identity = 127/547 (23.22%), Positives = 235/547 (42.96%), Query Frame = 1

Query: 153 ILNSLKPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIE 212
           +L  +  +     FF W+K Q  F  +   Y   + +L   +QF  I +L +EM+  G +
Sbjct: 337 VLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQ 396

Query: 213 LDNITYSTIITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSL 272
            + +TY+ +I    + +  ++AM  F +M   G  PD VTY  ++D++A  G ++ A+ +
Sbjct: 397 PNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDM 456

Query: 273 YERGRASGWKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGK 332
           Y+R +A G  PD +T+SV+    G+AG       +  EM      PNLV YN ++D   K
Sbjct: 457 YQRMQAGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAK 516

Query: 333 AGRPGFARSLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFIL 392
           A     A  L+ +M  +G  P++ T + ++++ G   +  +A  ++  M+ K W  D  +
Sbjct: 517 ARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPV 576

Query: 393 YNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEM 452
           Y  L+++    G  E+A + ++ M  +   RP+  +  ++L+       +  + EL + M
Sbjct: 577 YGLLVDLWGKAGNVEKAWQWYQAMLHA-GLRPNVPTCNSLLSTFLRVNKIAEAYELLQNM 636

Query: 453 LELGVGINVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNN 512
           L LG+  ++   T L+ C    R                 + D   CG L++        
Sbjct: 637 LALGLRPSLQTYTLLLSCCTDGRS----------------KLDMGFCGQLMA-----STG 696

Query: 513 EDISKVFTCLQQANPKLVAFVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDI 572
                    +  A P      N+    +   D++  E R        E++R   + ++D 
Sbjct: 697 HPAHMFLLKMPAAGPD---GENVRNHANNFLDLMHSEDR--------ESKRGLVDAVVDF 756

Query: 573 CRNQNLSKRAHELLYLGSLYGLYP-GLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSK 632
                  + A  +  + +   ++P  L  K+   W +++  +S G A TAL   +    K
Sbjct: 757 LHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRK 816

Query: 633 IVQREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFRLREDRAGWFVATR 692
            +      P  +   TG G      G  +     VE+L     +PF      +G FV + 
Sbjct: 817 QMLASGTCPSRIDIVTGWGRRSRVTG-TSMVRQAVEELLNIFGSPFFTESGNSGCFVGSG 849

Query: 693 EDVVAWV 695
           E +  W+
Sbjct: 877 EPLNRWL 849

BLAST of CmoCh16G011170 vs. Swiss-Prot
Match: PP178_ARATH (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 1.1e-37
Identity = 137/544 (25.18%), Positives = 242/544 (44.49%), Query Frame = 1

Query: 169 WIKTQNLFP--------METIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYST 228
           W   +NLF          +   YN  + ++  G Q  L  E+  +M    I  + ++YST
Sbjct: 355 WEAARNLFDEMTNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYST 414

Query: 229 IITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASG 288
           +I    K  RFD+A+  F  M   G+  D V+Y+ +L +Y  +G+ EEAL +     + G
Sbjct: 415 VIDGFAKAGRFDEALNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVG 474

Query: 289 WKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFAR 348
            K D  T++ L   +G+ G YD +  V  EMK   V PNL+ Y+TL+D   K G    A 
Sbjct: 475 IKKDVVTYNALLGGYGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAM 534

Query: 349 SLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMC 408
            +F E   +G+  +    +AL+    K      A+ L + M  +G   + + YN++++  
Sbjct: 535 EIFREFKSAGLRADVVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAF 594

Query: 409 ADLGLEEEAEKLFEEMKKSEHSRPDS--WSYTAMLNIHGSGGNVKRSMELFEEMLELGVG 468
                 +         + +++S   S  +S +A+  +  + GN  R ++LF ++      
Sbjct: 595 GRSATMD---------RSADYSNGGSLPFSSSALSALTETEGN--RVIQLFGQLTTESNN 654

Query: 469 INVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKV 528
                C   +Q       +  ++ VF    +  ++P+      +L+  S C++ ED S +
Sbjct: 655 RTTKDCEEGMQ------ELSCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASML 714

Query: 529 FTCLQQANPKLVAFVN--LLQQND---ITFDVIKDEFRTILGETATEARRPFCNCLIDIC 588
              L+  + K+   V+  L+ Q +   +    + D+   + G TA+     F N L D+ 
Sbjct: 715 LEELRLFDNKVYGVVHGLLMGQRENVWLQAQSLFDKVNEMDGSTAS----AFYNALTDML 774

Query: 589 RNQNLSKRAHELLYLGSLYGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIV 648
            +    KR  EL+   +L G    +      + CLD+  +S GAA+  +  W++ +  IV
Sbjct: 775 WHFG-QKRGAELV---ALEGRSRQVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIV 834

Query: 649 QREEALPELLSAQTGAGTHRFSQG---LANSFASHVEKLAAPFRLREDRAGWFVATREDV 695
                LP++LS  TG G H    G   L  +    +  + APF L +   G F ++   V
Sbjct: 835 YEGHELPKVLSILTGWGKHSKVVGDGALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVV 873

BLAST of CmoCh16G011170 vs. TrEMBL
Match: A0A0A0L6K8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G011820 PE=4 SV=1)

HSP 1 Score: 1268.1 bits (3280), Expect = 0.0e+00
Identity = 634/714 (88.80%), Positives = 674/714 (94.40%), Query Frame = 1

Query: 1   MAAPLSSSLDIKLKLKPTPPLFFTSPLRRNNFTKRFTVLCTSSSKSPR--------STDK 60
           MA PLSSSLD  LKLKPTP +FFTSPLRR N TKR T+LC SSSKSPR        S D 
Sbjct: 1   MAVPLSSSLD--LKLKPTP-IFFTSPLRRKNVTKRLTLLC-SSSKSPRKPSSVSSQSVDN 60

Query: 61  KNPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNP 120
           KNPSLSEQLK+LST+TLSNA NDE+ LLS PKS WVNPTKPKRSVLSLQRQKRSSYSYNP
Sbjct: 61  KNPSLSEQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNP 120

Query: 121 KMRELKTFAHKLNASDSSE-AAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIK 180
           KMR+LK+FAHKLNA DSS+ A+F+A L+EIPHPPTKENALLILNSL+PWQKTHLFFNWIK
Sbjct: 121 KMRDLKSFAHKLNACDSSDDASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIK 180

Query: 181 TQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRF 240
           +QNLFPMETIFYNVAMKSLRYGRQFQLIE+LANEMI+ GIELDNITYSTIITCAKKCSRF
Sbjct: 181 SQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMISAGIELDNITYSTIITCAKKCSRF 240

Query: 241 DKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVL 300
           DKAMEWFERMY+TGLMPDEVTYSAILDVYANLGKVEE LSLYERGRASGW PDPYTFSVL
Sbjct: 241 DKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWTPDPYTFSVL 300

Query: 301 GKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGI 360
           GKMFGEAGDYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAG+PGFARSLF+EM+ESGI
Sbjct: 301 GKMFGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI 360

Query: 361 TPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEK 420
           TPNEKTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNMCADLGLEEEAE 
Sbjct: 361 TPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAET 420

Query: 421 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCL 480
           LFEEMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEMLELGV INVMCCTCLIQCL
Sbjct: 421 LFEEMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCL 480

Query: 481 GKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVA 540
           GK+ RIDDLVRVF+VSV+KG++PDDRLCGCLLSV+SLC N+EDI+KVFTCLQQANPKLV+
Sbjct: 481 GKSGRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVLSLCYNSEDINKVFTCLQQANPKLVS 540

Query: 541 FVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSL 600
           F+NLLQQNDITF+V+K+EFR ILGETA EARRPFCNCLIDICRNQNL +RAHELLYLGSL
Sbjct: 541 FINLLQQNDITFEVVKNEFRNILGETAPEARRPFCNCLIDICRNQNLRERAHELLYLGSL 600

Query: 601 YGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660
           YGLYPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT
Sbjct: 601 YGLYPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660

Query: 661 HRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVATRA 706
           HRFSQGLANSFASHV+KLAAPF+LREDRAGWFVATRED+V WVHSRVPSVA  A
Sbjct: 661 HRFSQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVAATA 710

BLAST of CmoCh16G011170 vs. TrEMBL
Match: M5WQE9_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002049mg PE=4 SV=1)

HSP 1 Score: 1067.8 bits (2760), Expect = 5.8e-309
Identity = 530/707 (74.96%), Positives = 595/707 (84.16%), Query Frame = 1

Query: 20  PLFFTSPLRRNNFTKRFTVLCTSSSKSPRS-------------------TDKKNPSLS-- 79
           P+FFTSP R+   TKRF + C S+   P+S                    +KKNPSLS  
Sbjct: 19  PIFFTSPFRQIP-TKRFNLSCRSTKSPPKSPPDLAEPNSKNNNKKNDDNNNKKNPSLSLS 78

Query: 80  EQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNPKMRELK 139
           EQL+ L+++TLSN   D+S LLS PKSIWVNP KPKRSVLSLQRQKRS YSYNP++R+L+
Sbjct: 79  EQLQPLTSTTLSNPPKDQSQLLSKPKSIWVNPAKPKRSVLSLQRQKRSLYSYNPQVRDLR 138

Query: 140 TFAHKLNASDSSEAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIKTQNLFPM 199
            FAHKLN  D+S+ AF+A L+EIPHPPT+ENALLILNSLKPWQKTH+FFNW+K QN FPM
Sbjct: 139 QFAHKLNDCDASQNAFLAALEEIPHPPTRENALLILNSLKPWQKTHMFFNWVKAQNSFPM 198

Query: 200 ETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRFDKAMEWF 259
           +TIFYNV MKSLR+GRQFQLIEELA EM++  IELDNITYSTIITCAK+   FDKA+EWF
Sbjct: 199 DTIFYNVTMKSLRFGRQFQLIEELAEEMVSNEIELDNITYSTIITCAKRSKLFDKAVEWF 258

Query: 260 ERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVLGKMFGEA 319
           ERMY+TGLMPDEVTYSAILDVYA LGKVEE LSLYERGRASGWKPDP  FSVLGKMFGEA
Sbjct: 259 ERMYKTGLMPDEVTYSAILDVYAKLGKVEEVLSLYERGRASGWKPDPIAFSVLGKMFGEA 318

Query: 320 GDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGITPNEKTL 379
           GDYDGI YVLQEM ++ VQPNLVVYNTLL+AMGKAG+PG ARSLFEEM+ SG+ PNEKTL
Sbjct: 319 GDYDGIRYVLQEMAALGVQPNLVVYNTLLEAMGKAGKPGLARSLFEEMVGSGLKPNEKTL 378

Query: 380 TALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKK 439
           TALVKIYGKARWARDALELWERMRS  WPMDFILYNTLLNMCADLGLEEEA+KLFE+MK+
Sbjct: 379 TALVKIYGKARWARDALELWERMRSNEWPMDFILYNTLLNMCADLGLEEEAKKLFEDMKQ 438

Query: 440 SEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCLGKARRID 499
           SEH RPDSWSYTAMLNI GSGGNV  +M LFEEM ELG+ +NVM CTCLIQCLGKARR  
Sbjct: 439 SEHCRPDSWSYTAMLNIFGSGGNVDGAMGLFEEMSELGIELNVMGCTCLIQCLGKARRFS 498

Query: 500 DLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVAFVNLLQQ 559
           D+VRVF V+V +GV+PDDRLCGCLLSVVSLC+  ED  KV +CLQQANPKLV  V +LQ 
Sbjct: 499 DMVRVFGVAVERGVKPDDRLCGCLLSVVSLCEKTEDEDKVLSCLQQANPKLVTLVKVLQD 558

Query: 560 NDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSLYGLYPGL 619
             + F+ IKDEFR ++  T+ E+RRPFCNCLIDICRN+N  +RAHELLYLG+LYGLYPGL
Sbjct: 559 KKLGFETIKDEFRDVISGTSVESRRPFCNCLIDICRNKNNHERAHELLYLGTLYGLYPGL 618

Query: 620 HNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHRFSQGL 679
           HNKT  EWCLDVRSLS+GAA TALEEWM TL KIVQREEALPEL SAQTG GTH+FSQGL
Sbjct: 619 HNKTSREWCLDVRSLSIGAAHTALEEWMGTLYKIVQREEALPELFSAQTGTGTHKFSQGL 678

Query: 680 ANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVATRA 706
           A+SFASHVEKLAAPFR  E++AG FVATRED+V+WV S+ PS A  A
Sbjct: 679 AHSFASHVEKLAAPFRKSEEKAGRFVATREDLVSWVQSQAPSTAITA 724

BLAST of CmoCh16G011170 vs. TrEMBL
Match: F6HTA5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0012g01090 PE=4 SV=1)

HSP 1 Score: 1062.4 bits (2746), Expect = 2.4e-307
Identity = 527/699 (75.39%), Positives = 603/699 (86.27%), Query Frame = 1

Query: 19  PPLFFTSPLRRNNFTKRFTVLCTSSSKSP--------------RSTDKKNPSLSEQLKDL 78
           P LF  S    +N    FT+ C SSS+SP                T+ +NPSLSEQLK L
Sbjct: 26  PNLFSKSTKFSSN---TFTIRCNSSSRSPPKPKPKPKPTSSDSEQTNHQNPSLSEQLKPL 85

Query: 79  STSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNPKMRELKTFAHKL 138
           S + L+   + ++HL+S PKS W+NPTKPK SVLSLQR KR +YSYNP++R+LK FA K+
Sbjct: 86  SKTILTRDHSGQTHLVSKPKSTWINPTKPKPSVLSLQRHKRHNYSYNPQIRDLKLFAKKI 145

Query: 139 NASDSS-EAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIKTQNLFPMETIFY 198
           N S+SS E+ F+AVL++IPHPPT++NALL+LNSLKPW KT+LFFNWIKTQNLFPMETIFY
Sbjct: 146 NESESSDESEFLAVLEQIPHPPTRDNALLLLNSLKPWPKTYLFFNWIKTQNLFPMETIFY 205

Query: 199 NVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYR 258
           NV MKSLR+GRQFQLIEELANEMI+TG+ELDNITYSTIITCAK+C+ FDKA++WFERMY+
Sbjct: 206 NVTMKSLRFGRQFQLIEELANEMISTGVELDNITYSTIITCAKRCNLFDKAVKWFERMYK 265

Query: 259 TGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVLGKMFGEAGDYDG 318
           TGLMPDEVTYSAILDVYA LGKVEE LSLYERGRASGWKPDP  F+VLGKMFGEAGDYDG
Sbjct: 266 TGLMPDEVTYSAILDVYAKLGKVEEVLSLYERGRASGWKPDPIAFAVLGKMFGEAGDYDG 325

Query: 319 IMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGITPNEKTLTALVK 378
           I YVLQEMKS+ VQPNLVVYNTLL+AMGKAG+PG ARSLFEEM+ SG+ P+ KTLTALVK
Sbjct: 326 IRYVLQEMKSLGVQPNLVVYNTLLEAMGKAGKPGLARSLFEEMVGSGVIPDAKTLTALVK 385

Query: 379 IYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSR 438
           IYGKARWARDALELWERMRS GWPMDFILYNTLL+MCADLGLEEEAEKLFE+MKKSEH R
Sbjct: 386 IYGKARWARDALELWERMRSNGWPMDFILYNTLLSMCADLGLEEEAEKLFEDMKKSEHCR 445

Query: 439 PDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCLGKARRIDDLVRV 498
           PDSWSYTAMLNI+GSGGNV R+M+LF+EM ELGV INVM CTCL QCLG+ARRIDDLV+V
Sbjct: 446 PDSWSYTAMLNIYGSGGNVDRAMQLFDEMSELGVQINVMGCTCLSQCLGRARRIDDLVKV 505

Query: 499 FDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVAFVNLLQQNDITF 558
           F+VS+ +GV+PDDRLCGCLLSVVS C+  ED +KV  CLQQANPKLVAFVNLL++  I+F
Sbjct: 506 FEVSLERGVKPDDRLCGCLLSVVSFCEGAEDANKVLACLQQANPKLVAFVNLLEEK-ISF 565

Query: 559 DVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSLYGLYPGLHNKTE 618
           + +K+EFR IL +TA EARRPFCNCLIDICRN++L +RAHELLYLG+LYGLYPGLHN+T 
Sbjct: 566 EALKEEFRGILTDTAVEARRPFCNCLIDICRNRSLHERAHELLYLGTLYGLYPGLHNRTA 625

Query: 619 GEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHRFSQGLANSFA 678
            EWCLDVRSLSVGAA TALEEWM TLSKIVQREEALPE  SA TG GTH+FSQGLA++FA
Sbjct: 626 DEWCLDVRSLSVGAAHTALEEWMGTLSKIVQREEALPEAFSANTGTGTHKFSQGLASAFA 685

Query: 679 SHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVA 703
           SHV+KLAAPF   E++AG FVATRED+V+WV SR+ S A
Sbjct: 686 SHVKKLAAPFTQSEEKAGCFVATREDLVSWVQSRILSPA 720

BLAST of CmoCh16G011170 vs. TrEMBL
Match: W9RN90_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_023300 PE=4 SV=1)

HSP 1 Score: 1056.2 bits (2730), Expect = 1.7e-305
Identity = 522/706 (73.94%), Positives = 595/706 (84.28%), Query Frame = 1

Query: 3   APLSSSLDIKLKLKP--TPPLFFTSPLRRN--NFTKRFTVLCTSSSKSPRSTDKKNPSLS 62
           A +S+ LD+ L         LFFTSPL R     T+  T L  S   SP+  +KK  SLS
Sbjct: 4   AAISTPLDVHLTKHSDQNKSLFFTSPLFRQIPTTTRTRTTLTISCCTSPKPRNKKTSSLS 63

Query: 63  EQLKDLSTSTLSNASNDESH-LLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNPKMREL 122
           EQLK L+T+TLSN    +++ LLS PKS WVNPT+PKRSV+SLQRQKRS +SYNP++R+L
Sbjct: 64  EQLKPLTTTTLSNDQEQQNNTLLSKPKSTWVNPTRPKRSVISLQRQKRSPHSYNPQVRDL 123

Query: 123 KTFAHKLNASDSSEAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIKTQNLFP 182
           + FA KLN S  SE AFMA LKEIPHPP++ENALLILNSLKPWQ T LFFNW+KTQN FP
Sbjct: 124 RRFAQKLNNSGDSEEAFMATLKEIPHPPSRENALLILNSLKPWQNTRLFFNWLKTQNSFP 183

Query: 183 METIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRFDKAMEW 242
           METIFYNV MKSLR+GRQFQLIEELANEMI   IELDNITYSTIITCAK+C  FDKA+EW
Sbjct: 184 METIFYNVTMKSLRFGRQFQLIEELANEMIRNDIELDNITYSTIITCAKRCKDFDKAVEW 243

Query: 243 FERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVLGKMFGE 302
           FERMY+TG+MPDEVTYSAILDVYA L KVEE LSLYERGRASGWKPD  TF+VLGKMFGE
Sbjct: 244 FERMYKTGMMPDEVTYSAILDVYAQLRKVEEVLSLYERGRASGWKPDAITFAVLGKMFGE 303

Query: 303 AGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGITPNEKT 362
           AGD+DGI YVLQEM S+ V+PNL+VYNTLL+AMGKAG+PG ARSLFEEMIESG+TPNEKT
Sbjct: 304 AGDFDGIRYVLQEMGSLGVEPNLIVYNTLLEAMGKAGKPGMARSLFEEMIESGLTPNEKT 363

Query: 363 LTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMK 422
           LTALVK+YGKARW RDALELWERMRS  WP+DFILYNTLLNMCADLGLEEEAE+LFE+MK
Sbjct: 364 LTALVKVYGKARWGRDALELWERMRSNSWPVDFILYNTLLNMCADLGLEEEAERLFEDMK 423

Query: 423 KSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCLGKARRI 482
           +SE SRPDSWSYTAMLNI+GSGG V+++ME+F+EM ELGV +NVM CTCL+QCLGKA+R+
Sbjct: 424 RSESSRPDSWSYTAMLNIYGSGGKVEKAMEMFDEMSELGVELNVMGCTCLVQCLGKAKRV 483

Query: 483 DDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVAFVNLLQ 542
           DD+VRVF   V KGV PDDRLCGCLLSVVS+CD+  D  KV  CLQQANPKLV FV LLQ
Sbjct: 484 DDMVRVFSFVVEKGVRPDDRLCGCLLSVVSMCDDVGDEEKVLACLQQANPKLVVFVRLLQ 543

Query: 543 QNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSLYGLYPG 602
             + +F  +KDEFR+++ +T+ EARRPFCNCLID+CRN+   +RAHELLYLG+LYGLYPG
Sbjct: 544 GEETSFKTVKDEFRSVISDTSIEARRPFCNCLIDMCRNRGHHERAHELLYLGTLYGLYPG 603

Query: 603 LHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHRFSQG 662
           LHNKT  EWCLDVRSLS+GAAQTALEEWM TL +IVQR+E LPEL SAQTG GTH+FSQG
Sbjct: 604 LHNKTAKEWCLDVRSLSIGAAQTALEEWMGTLYRIVQRKEELPELFSAQTGVGTHKFSQG 663

Query: 663 LANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVAT 704
           LANSFASH EKLAAPFR  E++AG FVATRED+V+W  SR P+  T
Sbjct: 664 LANSFASHAEKLAAPFRQSEEKAGCFVATREDLVSWAQSRAPTAVT 709

BLAST of CmoCh16G011170 vs. TrEMBL
Match: A0A067DT79_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005178mg PE=4 SV=1)

HSP 1 Score: 1048.9 bits (2711), Expect = 2.8e-303
Identity = 517/711 (72.71%), Positives = 603/711 (84.81%), Query Frame = 1

Query: 1   MAAPLSSSLDIKLKLKPTP---PLFFTSPLRRNNFTKRFTVLCTSSSKSPRSTDKKNP-- 60
           MAA LS++LD+ L +  +    P+F T P RR++  K   + C S S    + +  NP  
Sbjct: 1   MAASLSTALDVHLSIPKSDTKRPIFLTKPTRRSHPIK-INISCNSKSSENVAAESPNPET 60

Query: 61  ---SLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNP 120
              SLSEQLK LS++TLS   ND + LLS PKS WVNPTKP+RSVLSLQRQKRS+YSYNP
Sbjct: 61  KTLSLSEQLKPLSSTTLSPTKNDRTPLLSKPKSTWVNPTKPRRSVLSLQRQKRSTYSYNP 120

Query: 121 KMRELKTFAHKLNASDSSEAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIKT 180
           ++R+LK FA KLN  D++E AF+  + EIPH PT+ENALLILNSLK WQK++ FFNWIK+
Sbjct: 121 RVRDLKLFARKLNDCDNTEEAFLRAITEIPHQPTRENALLILNSLKFWQKSYFFFNWIKS 180

Query: 181 QNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRFD 240
           QNLFPMETIFYNV MKSLR+GRQFQLIE+LANEM++  IELDNITYSTIITCAK+C+ FD
Sbjct: 181 QNLFPMETIFYNVTMKSLRFGRQFQLIEQLANEMVSNEIELDNITYSTIITCAKRCNLFD 240

Query: 241 KAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVLG 300
           +A+EWFERMY+TGLMPDEVTYSAILDVYA  GKVEE LSLYERG ASGWKPDP  FSVLG
Sbjct: 241 EAIEWFERMYKTGLMPDEVTYSAILDVYAKSGKVEEVLSLYERGVASGWKPDPIAFSVLG 300

Query: 301 KMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGIT 360
           KMFGE+GDYDGI YVLQEMKS+ VQPNLVVYNTLL+AMGKAG+PG ARSLF+EM+ESG+T
Sbjct: 301 KMFGESGDYDGIRYVLQEMKSLGVQPNLVVYNTLLEAMGKAGKPGLARSLFDEMVESGLT 360

Query: 361 PNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEKL 420
           P+EKTLTAL+KIYGKARWA+DALELWERMR   WPMDFILYNTLLNMCAD+GL EEAE+L
Sbjct: 361 PDEKTLTALIKIYGKARWAKDALELWERMRENKWPMDFILYNTLLNMCADIGLVEEAERL 420

Query: 421 FEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCLG 480
           FE+MK S++ +PD++SYTAMLNI+GSGGNV  ++ELFEEM ELGV INVM CTCLIQCLG
Sbjct: 421 FEDMKLSDYCKPDNYSYTAMLNIYGSGGNVDNAIELFEEMSELGVAINVMGCTCLIQCLG 480

Query: 481 KARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVAF 540
           KARRIDDLVRVF VS+ +GV+PDDRLCGCLLSVVSLC+ +ED+ KV TCLQQANPKLVAF
Sbjct: 481 KARRIDDLVRVFGVSIDRGVKPDDRLCGCLLSVVSLCETSEDVGKVITCLQQANPKLVAF 540

Query: 541 VNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSLY 600
           +NL++ N   F+ IK+EFR ++ +T  +ARRPFCNCLIDICRN+NL++RAHELLYLG+LY
Sbjct: 541 LNLIEDNSTGFENIKEEFRNVIKDTEVDARRPFCNCLIDICRNRNLNERAHELLYLGTLY 600

Query: 601 GLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTH 660
           GLYPGLHNKT  EW LDVRSLSVGAAQTALEEWM TL+KIV REE LP+L  A+TG GTH
Sbjct: 601 GLYPGLHNKTLDEWSLDVRSLSVGAAQTALEEWMWTLAKIVLREEVLPQLFLAETGTGTH 660

Query: 661 RFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVAT 704
           +FSQGLA +FASHV KLAAPFR  E +AG FVATRED+V+WV +R  S+ T
Sbjct: 661 KFSQGLATAFASHVNKLAAPFRQSEGKAGCFVATREDLVSWVQARPSSITT 710
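
Each HSP summary line above reports identical and positive (similar) residues as a fraction of the aligned length. The following is a minimal Python sketch, not part of the database output, showing how such a line could be parsed and its percentages re-derived. The names `HSP_STATS`, `parse_hsp_stats`, and `recompute_pct` are hypothetical helpers introduced only for illustration, and the two-decimal rounding is an assumption inferred from the reported values.

```python
import re

# Accepts "Positives" and the "Postives" spelling that appears in some exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return identity/positive counts and percentages from one summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, _pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def recompute_pct(count, length):
    """Percentage of aligned columns, rounded to two decimals (assumed)."""
    return round(100.0 * count / length, 2)


stats = parse_hsp_stats(
    "Identity = 517/711 (72.71%), Positives = 603/711 (84.81%), Query Frame = 1"
)
print(stats["identity_pct"], recompute_pct(stats["identical"], stats["aligned_length"]))
```

Re-deriving the percentage from the counts is a cheap consistency check when alignment summaries are copied between tables.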

BLAST of CmoCh16G011170 vs. TAIR10
Match: AT5G46580.1 (AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 971.5 bits (2510), Expect = 2.9e-283
Identity = 471/713 (66.06%), Positives = 593/713 (83.17%), Query Frame = 1

Query: 1   MAAPLSSSLDIKLKLKPTPP----LFFTSPLRRNNFTKRFTVLCTSSSKSPRSTDK---- 60
           MA  L++++D+    + +      LF    L R + +++  + C SS K P++ ++    
Sbjct: 1   MATVLTTAIDVCFNPQNSDTKKHSLFLKPSLFRQSRSRKLNISC-SSLKQPKTLEEEPIT 60

Query: 61  -KNPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYN 120
            K PSLSEQLK LS +TL     +++ +LS PKS+WVNPT+PKRSVLSLQRQKRS+YSYN
Sbjct: 61  TKTPSLSEQLKPLSATTLRQ---EQTQILSKPKSVWVNPTRPKRSVLSLQRQKRSAYSYN 120

Query: 121 PKMRELKTFAHKLNASDSSEAA-FMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWI 180
           P++++L+ FA KLN+S  +E + F+++L EIPHPP ++NALL+LNSL+ WQKTH FFNW+
Sbjct: 121 PQIKDLRAFALKLNSSIFTEKSEFLSLLDEIPHPPNRDNALLVLNSLREWQKTHTFFNWV 180

Query: 181 KTQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSR 240
           K+++LFPMETIFYNV MKSLR+GRQFQLIEE+A EM+  G+ELDNITYSTIITCAK+C+ 
Sbjct: 181 KSKSLFPMETIFYNVTMKSLRFGRQFQLIEEMALEMVKDGVELDNITYSTIITCAKRCNL 240

Query: 241 FDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSV 300
           ++KA+EWFERMY+TGLMPDEVTYSAILDVY+  GKVEE LSLYER  A+GWKPD   FSV
Sbjct: 241 YNKAIEWFERMYKTGLMPDEVTYSAILDVYSKSGKVEEVLSLYERAVATGWKPDAIAFSV 300

Query: 301 LGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESG 360
           LGKMFGEAGDYDGI YVLQEMKS++V+PN+VVYNTLL+AMG+AG+PG ARSLF EM+E+G
Sbjct: 301 LGKMFGEAGDYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLFNEMLEAG 360

Query: 361 ITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAE 420
           +TPNEKTLTALVKIYGKARWARDAL+LWE M++K WPMDFILYNTLLNMCAD+GLEEEAE
Sbjct: 361 LTPNEKTLTALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADIGLEEEAE 420

Query: 421 KLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQC 480
           +LF +MK+S   RPD++SYTAMLNI+GSGG  +++MELFEEML+ GV +NVM CTCL+QC
Sbjct: 421 RLFNDMKESVQCRPDNFSYTAMLNIYGSGGKAEKAMELFEEMLKAGVQVNVMGCTCLVQC 480

Query: 481 LGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLV 540
           LGKA+RIDD+V VFD+S+++GV+PDDRLCGCLLSV++LC+++ED  KV  CL++AN KLV
Sbjct: 481 LGKAKRIDDVVYVFDLSIKRGVKPDDRLCGCLLSVMALCESSEDAEKVMACLERANKKLV 540

Query: 541 AFVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGS 600
            FVNL+      ++ +K+EF+ ++  T  EARRPFCNCLIDICR  N  +RAHELLYLG+
Sbjct: 541 TFVNLIVDEKTEYETVKEEFKLVINATQVEARRPFCNCLIDICRGNNRHERAHELLYLGT 600

Query: 601 LYGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAG 660
           L+GLYPGLHNKT  EW LDVRSLSVGAA+TALEEWM TL+ I++R+E LPEL  AQTG G
Sbjct: 601 LFGLYPGLHNKTIKEWSLDVRSLSVGAAETALEEWMRTLANIIKRQEELPELFLAQTGTG 660

Query: 661 THRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVAT 704
           THRFSQGLANSFA H+++L+APFR + DR G FVAT+ED+V+W+ S+ P + T
Sbjct: 661 THRFSQGLANSFALHLQQLSAPFR-QSDRPGIFVATKEDLVSWLESKFPPLVT 708

BLAST of CmoCh16G011170 vs. TAIR10
Match: AT4G16390.1 (AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 456.8 bits (1174), Expect = 2.4e-128
Identity = 241/615 (39.19%), Positives = 373/615 (60.65%), Query Frame = 1

Query: 86  IWVNPTKPKRSVLSLQRQKRSSYSYNPKMRELKTFAHKLNASDSSEAAFMAVLKEIPHPP 145
           +WVNP  P+ S L   R+K    SY+ +   L   A  L+A   +EA    V+       
Sbjct: 88  VWVNPKSPRASQL---RRK----SYDSRYSSLIKLAESLDACKPNEADVCDVITGFGGKL 147

Query: 146 TKENALLILNSLKPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEELANE 205
            +++A++ LN++   +   L  N +        E I YNV MK  R  +  +  E+L +E
Sbjct: 148 FEQDAVVTLNNMTNPETAPLVLNNLLETMKPSREVILYNVTMKVFRKSKDLEKSEKLFDE 207

Query: 206 MINTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGK 265
           M+  GI+ DN T++TII+CA++     +A+EWFE+M   G  PD VT +A++D Y   G 
Sbjct: 208 MLERGIKPDNATFTTIISCARQNGVPKRAVEWFEKMSSFGCEPDNVTMAAMIDAYGRAGN 267

Query: 266 VEEALSLYERGRASGWKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNT 325
           V+ ALSLY+R R   W+ D  TFS L +++G +G+YDG + + +EMK++ V+PNLV+YN 
Sbjct: 268 VDMALSLYDRARTEKWRIDAVTFSTLIRIYGVSGNYDGCLNIYEEMKALGVKPNLVIYNR 327

Query: 326 LLDAMGKAGRPGFARSLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKG 385
           L+D+MG+A RP  A+ +++++I +G TPN  T  ALV+ YG+AR+  DAL ++  M+ KG
Sbjct: 328 LIDSMGRAKRPWQAKIIYKDLITNGFTPNWSTYAALVRAYGRARYGDDALAIYREMKEKG 387

Query: 386 WPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRS 445
             +  ILYNTLL+MCAD    +EA ++F++MK  E   PDSW++++++ ++   G V  +
Sbjct: 388 LSLTVILYNTLLSMCADNRYVDEAFEIFQDMKNCETCDPDSWTFSSLITVYACSGRVSEA 447

Query: 446 MELFEEMLELGVGINVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSV 505
                +M E G    +   T +IQC GKA+++DD+VR FD  +  G+ PDDR CGCLL+V
Sbjct: 448 EAALLQMREAGFEPTLFVLTSVIQCYGKAKQVDDVVRTFDQVLELGITPDDRFCGCLLNV 507

Query: 506 VSLCDNNEDISKVFTCLQQANPKLVAFVNLL-QQNDITFDVIKDEFRTILGETATEARRP 565
           ++    +E+I K+  C+++A PKL   V +L ++ +    V K E   ++    ++ ++ 
Sbjct: 508 MTQTP-SEEIGKLIGCVEKAKPKLGQVVKMLVEEQNCEEGVFKKEASELIDSIGSDVKKA 567

Query: 566 FCNCLIDICRNQNLSKRAHELLYLGSLYGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEE 625
           + NCLID+C N N  +RA E+L LG  Y +Y GL +K+  +W L ++SLS+GAA TAL  
Sbjct: 568 YLNCLIDLCVNLNKLERACEILQLGLEYDIYTGLQSKSATQWSLHLKSLSLGAALTALHV 627

Query: 626 WMITLSK-IVQREEALPELLSAQTGAGTHRFS-QGLANSFASHVEKLAAPFRLREDRAGW 685
           WM  LS+  ++  E  P LL   TG G H++S +GLA  F SH+++L APF    D+ GW
Sbjct: 628 WMNDLSEAALESGEEFPPLLGINTGHGKHKYSDKGLAAVFESHLKELNAPFHEAPDKVGW 687

Query: 686 FVATREDVVAWVHSR 698
           F+ T     AW+ SR
Sbjct: 688 FLTTSVAAKAWLESR 694

BLAST of CmoCh16G011170 vs. TAIR10
Match: AT1G74750.1 (AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 173.7 bits (439), Expect = 4.0e-43
Identity = 117/493 (23.73%), Positives = 223/493 (45.23%), Query Frame = 1

Query: 210 GIELDNITYSTIITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEA 269
           G + D  TY+T++    +  +F +  +  + M R G  P+ VTY+ ++  Y     ++EA
Sbjct: 354 GFKHDGHTYTTMVGNLGRAKQFGEINKLLDEMVRDGCKPNTVTYNRLIHSYGRANYLKEA 413

Query: 270 LSLYERGRASGWKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDA 329
           ++++ + + +G +PD  T+  L  +  +AG  D  M + Q M+   + P+   Y+ +++ 
Sbjct: 414 MNVFNQMQEAGCEPDRVTYCTLIDIHAKAGFLDIAMDMYQRMQEAGLSPDTFTYSVIINC 473

Query: 330 MGKAGRPGFARSLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMD 389
           +GKAG    A  LF EM+  G TPN  T   ++ ++ KAR    AL+L+  M++ G+  D
Sbjct: 474 LGKAGHLPAAHRLFCEMVGQGCTPNLVTFNIMIALHAKARNYETALKLYRDMQNAGFQPD 533

Query: 390 FILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELF 449
            + Y+ ++ +    G  EEAE +F EM++     PD   Y  ++++ G  GNV ++ + +
Sbjct: 534 KVTYSIVMEVLGHCGFLEEAEGVFAEMQRKNWV-PDEPVYGLLVDLWGKAGNVDKAWQWY 593

Query: 450 EEMLELGVGINVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLC 509
           + ML+ G+  NV  C  L+    +  R+ +   +    +  G+ P  +    LLS  +  
Sbjct: 594 QAMLQAGLRPNVPTCNSLLSTFLRVHRMSEAYNLLQSMLALGLHPSLQTYTLLLSCCTDA 653

Query: 510 DNNEDISKVFTCLQQANPKLVAFVNLLQQNDITFDVIKD---EFRTILGETATEARRPFC 569
            +N D+      +  +      F+  +         ++D    F   +     E++R   
Sbjct: 654 RSNFDMGFCGQLMAVSGHPAHMFLLKMPPAGPDGQKVRDHVSNFLDFMHSEDRESKRGLM 713

Query: 570 NCLIDICRNQNLSKRAHELLYLGSLYGLYP-GLHNKTEGEWCLDVRSLSVGAAQTALEEW 629
           + ++D      L + A  +  + +   +YP  L  K+   W +++  +S G A  AL   
Sbjct: 714 DAVVDFLHKSGLKEEAGSVWEVAAGKNVYPDALREKSYSYWLINLHVMSEGTAVIALSRT 773

Query: 630 MITLSKIVQREEALPELLSAQTGAGTHRFSQGLANSFASHVEKLA----APFRLREDRAG 689
           +    K +      P  +   TG G      G  +     VE+L      PF      +G
Sbjct: 774 LAWFRKQMLVSGDCPSRIDIVTGWGRRSRVTG-TSMVRQAVEELLNIFNFPFFTENGNSG 833

Query: 690 WFVATREDVVAWV 695
            FV + E +  W+
Sbjct: 834 CFVGSGEPLKNWL 844

BLAST of CmoCh16G011170 vs. TAIR10
Match: AT1G18900.3 (AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 167.9 bits (424), Expect = 2.2e-41
Identity = 127/547 (23.22%), Positives = 235/547 (42.96%), Query Frame = 1

Query: 153 ILNSLKPWQKTHLFFNWIKTQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIE 212
           +L  +  +     FF W+K Q  F  +   Y   + +L   +QF  I +L +EM+  G +
Sbjct: 337 VLKQMNDYGNALGFFYWLKRQPGFKHDGHTYTTMVGNLGRAKQFGAINKLLDEMVRDGCQ 396

Query: 213 LDNITYSTIITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSL 272
            + +TY+ +I    + +  ++AM  F +M   G  PD VTY  ++D++A  G ++ A+ +
Sbjct: 397 PNTVTYNRLIHSYGRANYLNEAMNVFNQMQEAGCKPDRVTYCTLIDIHAKAGFLDIAMDM 456

Query: 273 YERGRASGWKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGK 332
           Y+R +A G  PD +T+SV+    G+AG       +  EM      PNLV YN ++D   K
Sbjct: 457 YQRMQAGGLSPDTFTYSVIINCLGKAGHLPAAHKLFCEMVDQGCTPNLVTYNIMMDLHAK 516

Query: 333 AGRPGFARSLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFIL 392
           A     A  L+ +M  +G  P++ T + ++++ G   +  +A  ++  M+ K W  D  +
Sbjct: 517 ARNYQNALKLYRDMQNAGFEPDKVTYSIVMEVLGHCGYLEEAEAVFTEMQQKNWIPDEPV 576

Query: 393 YNTLLNMCADLGLEEEAEKLFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEM 452
           Y  L+++    G  E+A + ++ M  +   RP+  +  ++L+       +  + EL + M
Sbjct: 577 YGLLVDLWGKAGNVEKAWQWYQAMLHA-GLRPNVPTCNSLLSTFLRVNKIAEAYELLQNM 636

Query: 453 LELGVGINVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNN 512
           L LG+  ++   T L+ C    R                 + D   CG L++        
Sbjct: 637 LALGLRPSLQTYTLLLSCCTDGRS----------------KLDMGFCGQLMA-----STG 696

Query: 513 EDISKVFTCLQQANPKLVAFVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDI 572
                    +  A P      N+    +   D++  E R        E++R   + ++D 
Sbjct: 697 HPAHMFLLKMPAAGPD---GENVRNHANNFLDLMHSEDR--------ESKRGLVDAVVDF 756

Query: 573 CRNQNLSKRAHELLYLGSLYGLYP-GLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSK 632
                  + A  +  + +   ++P  L  K+   W +++  +S G A TAL   +    K
Sbjct: 757 LHKSGQKEEAGSVWEVAAQKNVFPDALREKSCSYWLINLHVMSEGTAVTALSRTLAWFRK 816

Query: 633 IVQREEALPELLSAQTGAGTHRFSQGLANSFASHVEKL----AAPFRLREDRAGWFVATR 692
            +      P  +   TG G      G  +     VE+L     +PF      +G FV + 
Sbjct: 817 QMLASGTCPSRIDIVTGWGRRSRVTG-TSMVRQAVEELLNIFGSPFFTESGNSGCFVGSG 849

Query: 693 EDVVAWV 695
           E +  W+
Sbjct: 877 EPLNRWL 849

BLAST of CmoCh16G011170 vs. TAIR10
Match: AT2G31400.1 (AT2G31400.1 genomes uncoupled 1)

HSP 1 Score: 159.8 bits (403), Expect = 6.0e-39
Identity = 137/544 (25.18%), Positives = 242/544 (44.49%), Query Frame = 1

Query: 169 WIKTQNLFP--------METIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYST 228
           W   +NLF          +   YN  + ++  G Q  L  E+  +M    I  + ++YST
Sbjct: 355 WEAARNLFDEMTNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYST 414

Query: 229 IITCAKKCSRFDKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASG 288
           +I    K  RFD+A+  F  M   G+  D V+Y+ +L +Y  +G+ EEAL +     + G
Sbjct: 415 VIDGFAKAGRFDEALNLFGEMRYLGIALDRVSYNTLLSIYTKVGRSEEALDILREMASVG 474

Query: 289 WKPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFAR 348
            K D  T++ L   +G+ G YD +  V  EMK   V PNL+ Y+TL+D   K G    A 
Sbjct: 475 IKKDVVTYNALLGGYGKQGKYDEVKKVFTEMKREHVLPNLLTYSTLIDGYSKGGLYKEAM 534

Query: 349 SLFEEMIESGITPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMC 408
            +F E   +G+  +    +AL+    K      A+ L + M  +G   + + YN++++  
Sbjct: 535 EIFREFKSAGLRADVVLYSALIDALCKNGLVGSAVSLIDEMTKEGISPNVVTYNSIIDAF 594

Query: 409 ADLGLEEEAEKLFEEMKKSEHSRPDS--WSYTAMLNIHGSGGNVKRSMELFEEMLELGVG 468
                 +         + +++S   S  +S +A+  +  + GN  R ++LF ++      
Sbjct: 595 GRSATMD---------RSADYSNGGSLPFSSSALSALTETEGN--RVIQLFGQLTTESNN 654

Query: 469 INVMCCTCLIQCLGKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKV 528
                C   +Q       +  ++ VF    +  ++P+      +L+  S C++ ED S +
Sbjct: 655 RTTKDCEEGMQ------ELSCILEVFRKMHQLEIKPNVVTFSAILNACSRCNSFEDASML 714

Query: 529 FTCLQQANPKLVAFVN--LLQQND---ITFDVIKDEFRTILGETATEARRPFCNCLIDIC 588
              L+  + K+   V+  L+ Q +   +    + D+   + G TA+     F N L D+ 
Sbjct: 715 LEELRLFDNKVYGVVHGLLMGQRENVWLQAQSLFDKVNEMDGSTAS----AFYNALTDML 774

Query: 589 RNQNLSKRAHELLYLGSLYGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIV 648
            +    KR  EL+   +L G    +      + CLD+  +S GAA+  +  W++ +  IV
Sbjct: 775 WHFG-QKRGAELV---ALEGRSRQVWENVWSDSCLDLHLMSSGAARAMVHAWLLNIRSIV 834

Query: 649 QREEALPELLSAQTGAGTHRFSQG---LANSFASHVEKLAAPFRLREDRAGWFVATREDV 695
                LP++LS  TG G H    G   L  +    +  + APF L +   G F ++   V
Sbjct: 835 YEGHELPKVLSILTGWGKHSKVVGDGALRRAVEVLLRGMDAPFHLSKCNMGRFTSSGSVV 873

BLAST of CmoCh16G011170 vs. NCBI nr
Match: gi|449462001|ref|XP_004148730.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Cucumis sativus])

HSP 1 Score: 1268.1 bits (3280), Expect = 0.0e+00
Identity = 634/714 (88.80%), Positives = 674/714 (94.40%), Query Frame = 1

Query: 1   MAAPLSSSLDIKLKLKPTPPLFFTSPLRRNNFTKRFTVLCTSSSKSPR--------STDK 60
           MA PLSSSLD  LKLKPTP +FFTSPLRR N TKR T+LC SSSKSPR        S D 
Sbjct: 1   MAVPLSSSLD--LKLKPTP-IFFTSPLRRKNVTKRLTLLC-SSSKSPRKPSSVSSQSVDN 60

Query: 61  KNPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNP 120
           KNPSLSEQLK+LST+TLSNA NDE+ LLS PKS WVNPTKPKRSVLSLQRQKRSSYSYNP
Sbjct: 61  KNPSLSEQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNP 120

Query: 121 KMRELKTFAHKLNASDSSE-AAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIK 180
           KMR+LK+FAHKLNA DSS+ A+F+A L+EIPHPPTKENALLILNSL+PWQKTHLFFNWIK
Sbjct: 121 KMRDLKSFAHKLNACDSSDDASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIK 180

Query: 181 TQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRF 240
           +QNLFPMETIFYNVAMKSLRYGRQFQLIE+LANEMI+ GIELDNITYSTIITCAKKCSRF
Sbjct: 181 SQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANEMISAGIELDNITYSTIITCAKKCSRF 240

Query: 241 DKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVL 300
           DKAMEWFERMY+TGLMPDEVTYSAILDVYANLGKVEE LSLYERGRASGW PDPYTFSVL
Sbjct: 241 DKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWTPDPYTFSVL 300

Query: 301 GKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGI 360
           GKMFGEAGDYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAG+PGFARSLF+EM+ESGI
Sbjct: 301 GKMFGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI 360

Query: 361 TPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEK 420
           TPNEKTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNMCADLGLEEEAE 
Sbjct: 361 TPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAET 420

Query: 421 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCL 480
           LFEEMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEMLELGV INVMCCTCLIQCL
Sbjct: 421 LFEEMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLELGVEINVMCCTCLIQCL 480

Query: 481 GKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVA 540
           GK+ RIDDLVRVF+VSV+KG++PDDRLCGCLLSV+SLC N+EDI+KVFTCLQQANPKLV+
Sbjct: 481 GKSGRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVLSLCYNSEDINKVFTCLQQANPKLVS 540

Query: 541 FVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSL 600
           F+NLLQQNDITF+V+K+EFR ILGETA EARRPFCNCLIDICRNQNL +RAHELLYLGSL
Sbjct: 541 FINLLQQNDITFEVVKNEFRNILGETAPEARRPFCNCLIDICRNQNLRERAHELLYLGSL 600

Query: 601 YGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660
           YGLYPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT
Sbjct: 601 YGLYPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660

Query: 661 HRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVATRA 706
           HRFSQGLANSFASHV+KLAAPF+LREDRAGWFVATRED+V WVHSRVPSVA  A
Sbjct: 661 HRFSQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVAATA 710

BLAST of CmoCh16G011170 vs. NCBI nr
Match: gi|659095679|ref|XP_008448710.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Cucumis melo])

HSP 1 Score: 1261.9 bits (3264), Expect = 0.0e+00
Identity = 632/714 (88.52%), Positives = 673/714 (94.26%), Query Frame = 1

Query: 1   MAAPLSSSLDIKLKLKPTPPLFFTSPLRRNNFTKRFTVLCTSSSKSPR--------STDK 60
           MAAPLSSSLD KLK  PTP +FFTS LRR    KR T+LC SSSKSPR        S D 
Sbjct: 1   MAAPLSSSLDFKLK--PTP-IFFTSLLRRKYVNKRLTLLC-SSSKSPRKPSSISSESIDN 60

Query: 61  KNPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNP 120
           KNPSLS+QLK+LST+TLSNA NDE+ LLS PKS WVNPTKPKRSVLSLQRQKRSSYSYNP
Sbjct: 61  KNPSLSDQLKNLSTTTLSNAPNDETRLLSKPKSTWVNPTKPKRSVLSLQRQKRSSYSYNP 120

Query: 121 KMRELKTFAHKLNASDSS-EAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIK 180
           KMR+LK+FAHKLNA DSS EA+F+A L+EIPHPPTKENALLILNSL+PWQKTHLFFNWIK
Sbjct: 121 KMRDLKSFAHKLNACDSSDEASFIAALEEIPHPPTKENALLILNSLRPWQKTHLFFNWIK 180

Query: 181 TQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRF 240
           TQNLFPMETIFYNVAMKSLRYGRQFQLIE+LAN+M++TGIELDNITYSTIITCAKKCSRF
Sbjct: 181 TQNLFPMETIFYNVAMKSLRYGRQFQLIEDLANDMVSTGIELDNITYSTIITCAKKCSRF 240

Query: 241 DKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVL 300
           DKAMEWFERMY+TGLMPDEVTYSAILDVYANLGKVEE LSLYERGRASGWKPDPYTFSVL
Sbjct: 241 DKAMEWFERMYKTGLMPDEVTYSAILDVYANLGKVEEVLSLYERGRASGWKPDPYTFSVL 300

Query: 301 GKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGI 360
           GKMFGEAGDYDGIMYVLQEMKSIE+QPNLVVYNTLLDAMGKAG+PGFARSLF+EM+ESGI
Sbjct: 301 GKMFGEAGDYDGIMYVLQEMKSIEMQPNLVVYNTLLDAMGKAGKPGFARSLFDEMVESGI 360

Query: 361 TPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEK 420
           TPNEKTLTALVKIYGKARWARDAL+LWERMRS GWPMDFILYNTLLNMCADLGLEEEAEK
Sbjct: 361 TPNEKTLTALVKIYGKARWARDALDLWERMRSNGWPMDFILYNTLLNMCADLGLEEEAEK 420

Query: 421 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCL 480
           LFEEMKKS+HSRPDSWSYTAMLNI+GSGGNVKRSMELFEEML+LGV INVMCCTCLIQCL
Sbjct: 421 LFEEMKKSKHSRPDSWSYTAMLNIYGSGGNVKRSMELFEEMLKLGVEINVMCCTCLIQCL 480

Query: 481 GKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVA 540
           GK+ RIDDLVRVF+VSV+KG++PDDRLCGCLLSVVSLCDN+EDI+KVFTCLQQANPKLV+
Sbjct: 481 GKSGRIDDLVRVFNVSVQKGIKPDDRLCGCLLSVVSLCDNSEDINKVFTCLQQANPKLVS 540

Query: 541 FVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSL 600
           FVNLLQQN ITF+VIK+EFR IL ETA+EARRPFCNCLIDICRNQNL +RAHELLYLGSL
Sbjct: 541 FVNLLQQNSITFEVIKNEFRNILSETASEARRPFCNCLIDICRNQNLRERAHELLYLGSL 600

Query: 601 YGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 660
           YGLYPGLHNKTE EWCLDVRSLSVGAAQTALEEWMITLSKIVQR+EALPELLSAQTGAGT
Sbjct: 601 YGLYPGLHNKTETEWCLDVRSLSVGAAQTALEEWMITLSKIVQRKEALPELLSAQTGAGT 660

Query: 661 HRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVATRA 706
           HRFSQGLANSFASHV+KLAAPF+LREDRAGWFVATRED+V WVHSRVPSV   A
Sbjct: 661 HRFSQGLANSFASHVDKLAAPFQLREDRAGWFVATREDLVTWVHSRVPSVPATA 710

BLAST of CmoCh16G011170 vs. NCBI nr
Match: gi|595852519|ref|XP_007210329.1| (hypothetical protein PRUPE_ppa002049mg [Prunus persica])

HSP 1 Score: 1067.8 bits (2760), Expect = 8.3e-309
Identity = 530/707 (74.96%), Positives = 595/707 (84.16%), Query Frame = 1

Query: 20  PLFFTSPLRRNNFTKRFTVLCTSSSKSPRS-------------------TDKKNPSLS-- 79
           P+FFTSP R+   TKRF + C S+   P+S                    +KKNPSLS  
Sbjct: 19  PIFFTSPFRQIP-TKRFNLSCRSTKSPPKSPPDLAEPNSKNNNKKNDDNNNKKNPSLSLS 78

Query: 80  EQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNPKMRELK 139
           EQL+ L+++TLSN   D+S LLS PKSIWVNP KPKRSVLSLQRQKRS YSYNP++R+L+
Sbjct: 79  EQLQPLTSTTLSNPPKDQSQLLSKPKSIWVNPAKPKRSVLSLQRQKRSLYSYNPQVRDLR 138

Query: 140 TFAHKLNASDSSEAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIKTQNLFPM 199
            FAHKLN  D+S+ AF+A L+EIPHPPT+ENALLILNSLKPWQKTH+FFNW+K QN FPM
Sbjct: 139 QFAHKLNDCDASQNAFLAALEEIPHPPTRENALLILNSLKPWQKTHMFFNWVKAQNSFPM 198

Query: 200 ETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRFDKAMEWF 259
           +TIFYNV MKSLR+GRQFQLIEELA EM++  IELDNITYSTIITCAK+   FDKA+EWF
Sbjct: 199 DTIFYNVTMKSLRFGRQFQLIEELAEEMVSNEIELDNITYSTIITCAKRSKLFDKAVEWF 258

Query: 260 ERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVLGKMFGEA 319
           ERMY+TGLMPDEVTYSAILDVYA LGKVEE LSLYERGRASGWKPDP  FSVLGKMFGEA
Sbjct: 259 ERMYKTGLMPDEVTYSAILDVYAKLGKVEEVLSLYERGRASGWKPDPIAFSVLGKMFGEA 318

Query: 320 GDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGITPNEKTL 379
           GDYDGI YVLQEM ++ VQPNLVVYNTLL+AMGKAG+PG ARSLFEEM+ SG+ PNEKTL
Sbjct: 319 GDYDGIRYVLQEMAALGVQPNLVVYNTLLEAMGKAGKPGLARSLFEEMVGSGLKPNEKTL 378

Query: 380 TALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKK 439
           TALVKIYGKARWARDALELWERMRS  WPMDFILYNTLLNMCADLGLEEEA+KLFE+MK+
Sbjct: 379 TALVKIYGKARWARDALELWERMRSNEWPMDFILYNTLLNMCADLGLEEEAKKLFEDMKQ 438

Query: 440 SEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCLGKARRID 499
           SEH RPDSWSYTAMLNI GSGGNV  +M LFEEM ELG+ +NVM CTCLIQCLGKARR  
Sbjct: 439 SEHCRPDSWSYTAMLNIFGSGGNVDGAMGLFEEMSELGIELNVMGCTCLIQCLGKARRFS 498

Query: 500 DLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVAFVNLLQQ 559
           D+VRVF V+V +GV+PDDRLCGCLLSVVSLC+  ED  KV +CLQQANPKLV  V +LQ 
Sbjct: 499 DMVRVFGVAVERGVKPDDRLCGCLLSVVSLCEKTEDEDKVLSCLQQANPKLVTLVKVLQD 558

Query: 560 NDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSLYGLYPGL 619
             + F+ IKDEFR ++  T+ E+RRPFCNCLIDICRN+N  +RAHELLYLG+LYGLYPGL
Sbjct: 559 KKLGFETIKDEFRDVISGTSVESRRPFCNCLIDICRNKNNHERAHELLYLGTLYGLYPGL 618

Query: 620 HNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHRFSQGL 679
           HNKT  EWCLDVRSLS+GAA TALEEWM TL KIVQREEALPEL SAQTG GTH+FSQGL
Sbjct: 619 HNKTSREWCLDVRSLSIGAAHTALEEWMGTLYKIVQREEALPELFSAQTGTGTHKFSQGL 678

Query: 680 ANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVATRA 706
           A+SFASHVEKLAAPFR  E++AG FVATRED+V+WV S+ PS A  A
Sbjct: 679 AHSFASHVEKLAAPFRKSEEKAGRFVATREDLVSWVQSQAPSTAITA 724

BLAST of CmoCh16G011170 vs. NCBI nr
Match: gi|225427240|ref|XP_002278451.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Vitis vinifera])

HSP 1 Score: 1062.4 bits (2746), Expect = 3.5e-307
Identity = 527/699 (75.39%), Positives = 603/699 (86.27%), Query Frame = 1

Query: 19  PPLFFTSPLRRNNFTKRFTVLCTSSSKSP--------------RSTDKKNPSLSEQLKDL 78
           P LF  S    +N    FT+ C SSS+SP                T+ +NPSLSEQLK L
Sbjct: 26  PNLFSKSTKFSSN---TFTIRCNSSSRSPPKPKPKPKPTSSDSEQTNHQNPSLSEQLKPL 85

Query: 79  STSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSSYSYNPKMRELKTFAHKL 138
           S + L+   + ++HL+S PKS W+NPTKPK SVLSLQR KR +YSYNP++R+LK FA K+
Sbjct: 86  SKTILTRDHSGQTHLVSKPKSTWINPTKPKPSVLSLQRHKRHNYSYNPQIRDLKLFAKKI 145

Query: 139 NASDSS-EAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIKTQNLFPMETIFY 198
           N S+SS E+ F+AVL++IPHPPT++NALL+LNSLKPW KT+LFFNWIKTQNLFPMETIFY
Sbjct: 146 NESESSDESEFLAVLEQIPHPPTRDNALLLLNSLKPWPKTYLFFNWIKTQNLFPMETIFY 205

Query: 199 NVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRFDKAMEWFERMYR 258
           NV MKSLR+GRQFQLIEELANEMI+TG+ELDNITYSTIITCAK+C+ FDKA++WFERMY+
Sbjct: 206 NVTMKSLRFGRQFQLIEELANEMISTGVELDNITYSTIITCAKRCNLFDKAVKWFERMYK 265

Query: 259 TGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVLGKMFGEAGDYDG 318
           TGLMPDEVTYSAILDVYA LGKVEE LSLYERGRASGWKPDP  F+VLGKMFGEAGDYDG
Sbjct: 266 TGLMPDEVTYSAILDVYAKLGKVEEVLSLYERGRASGWKPDPIAFAVLGKMFGEAGDYDG 325

Query: 319 IMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGITPNEKTLTALVK 378
           I YVLQEMKS+ VQPNLVVYNTLL+AMGKAG+PG ARSLFEEM+ SG+ P+ KTLTALVK
Sbjct: 326 IRYVLQEMKSLGVQPNLVVYNTLLEAMGKAGKPGLARSLFEEMVGSGVIPDAKTLTALVK 385

Query: 379 IYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEKLFEEMKKSEHSR 438
           IYGKARWARDALELWERMRS GWPMDFILYNTLL+MCADLGLEEEAEKLFE+MKKSEH R
Sbjct: 386 IYGKARWARDALELWERMRSNGWPMDFILYNTLLSMCADLGLEEEAEKLFEDMKKSEHCR 445

Query: 439 PDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCLGKARRIDDLVRV 498
           PDSWSYTAMLNI+GSGGNV R+M+LF+EM ELGV INVM CTCL QCLG+ARRIDDLV+V
Sbjct: 446 PDSWSYTAMLNIYGSGGNVDRAMQLFDEMSELGVQINVMGCTCLSQCLGRARRIDDLVKV 505

Query: 499 FDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVAFVNLLQQNDITF 558
           F+VS+ +GV+PDDRLCGCLLSVVS C+  ED +KV  CLQQANPKLVAFVNLL++  I+F
Sbjct: 506 FEVSLERGVKPDDRLCGCLLSVVSFCEGAEDANKVLACLQQANPKLVAFVNLLEEK-ISF 565

Query: 559 DVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSLYGLYPGLHNKTE 618
           + +K+EFR IL +TA EARRPFCNCLIDICRN++L +RAHELLYLG+LYGLYPGLHN+T 
Sbjct: 566 EALKEEFRGILTDTAVEARRPFCNCLIDICRNRSLHERAHELLYLGTLYGLYPGLHNRTA 625

Query: 619 GEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGTHRFSQGLANSFA 678
            EWCLDVRSLSVGAA TALEEWM TLSKIVQREEALPE  SA TG GTH+FSQGLA++FA
Sbjct: 626 DEWCLDVRSLSVGAAHTALEEWMGTLSKIVQREEALPEAFSANTGTGTHKFSQGLASAFA 685

Query: 679 SHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSVA 703
           SHV+KLAAPF   E++AG FVATRED+V+WV SR+ S A
Sbjct: 686 SHVKKLAAPFTQSEEKAGCFVATREDLVSWVQSRILSPA 720

BLAST of CmoCh16G011170 vs. NCBI nr
Match: gi|1009107574|ref|XP_015880412.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 1062.4 bits (2746), Expect = 3.5e-307
Identity = 521/710 (73.38%), Positives = 609/710 (85.77%), Query Frame = 1

Query: 5   LSSSLDIKLKLKPTP---PLFFTSPLRRNNFTKRFTVLCTSSSKSPRS--------TDKK 64
           LS++LD++     T    P+FF SPL++    KRFT+ C+SS   P+S          KK
Sbjct: 6   LSAALDVRFSGHSTETKRPIFFASPLKQFP-KKRFTIYCSSSKSPPKSPPDLAQPNNKKK 65

Query: 65  NPSLSEQLKDLSTSTLSNASNDESHLLSNPKSIWVNPTKPKRSVLSLQRQKRSS--YSYN 124
           NPSLS+QL+ LS +TLSN + ++++LLS PKS WVNPTKPKRSV+SLQR KRSS  Y YN
Sbjct: 66  NPSLSDQLRPLSKTTLSNTTKEQANLLSKPKSTWVNPTKPKRSVISLQRHKRSSSSYLYN 125

Query: 125 PKMRELKTFAHKLNASDSSEAAFMAVLKEIPHPPTKENALLILNSLKPWQKTHLFFNWIK 184
            ++R+L+ FA+KLN SD SE AF+A L+EIPH  T++NALLILN LKPWQKTH+FFNW+K
Sbjct: 126 SQVRDLRRFAYKLNNSDISETAFLAALEEIPHTLTRDNALLILNLLKPWQKTHMFFNWVK 185

Query: 185 TQNLFPMETIFYNVAMKSLRYGRQFQLIEELANEMINTGIELDNITYSTIITCAKKCSRF 244
           TQNLFPMETIFYNVAMKSLR+GRQFQLIEELA+EMI+  IELDNITYSTIITCAK+C  F
Sbjct: 186 TQNLFPMETIFYNVAMKSLRFGRQFQLIEELAHEMISNEIELDNITYSTIITCAKRCKDF 245

Query: 245 DKAMEWFERMYRTGLMPDEVTYSAILDVYANLGKVEEALSLYERGRASGWKPDPYTFSVL 304
           DKA++WFERMY+TG+MPDEVTYSAILDVYA LGKVEE L+LYERGRASGWKPDP TFSVL
Sbjct: 246 DKAVDWFERMYKTGMMPDEVTYSAILDVYAQLGKVEEVLNLYERGRASGWKPDPITFSVL 305

Query: 305 GKMFGEAGDYDGIMYVLQEMKSIEVQPNLVVYNTLLDAMGKAGRPGFARSLFEEMIESGI 364
           GKMFGE GDYDGI YVLQEM+SI VQPNLVVYNTLL+ MGKAG+PG ARSLFEEM+ESG+
Sbjct: 306 GKMFGETGDYDGIRYVLQEMRSIGVQPNLVVYNTLLEGMGKAGKPGLARSLFEEMLESGL 365

Query: 365 TPNEKTLTALVKIYGKARWARDALELWERMRSKGWPMDFILYNTLLNMCADLGLEEEAEK 424
           TPNEKTLTALVKIYGKARWARDA+ELWERM+S  WPMDFILYNTLLNMCADLGLEEEAEK
Sbjct: 366 TPNEKTLTALVKIYGKARWARDAMELWERMKSNSWPMDFILYNTLLNMCADLGLEEEAEK 425

Query: 425 LFEEMKKSEHSRPDSWSYTAMLNIHGSGGNVKRSMELFEEMLELGVGINVMCCTCLIQCL 484
           LFE+MK+SEH +PDSWSYTAMLNI+GSGGNV ++MELFEEM ++GV +NVM  TCLIQCL
Sbjct: 426 LFEDMKQSEHCQPDSWSYTAMLNIYGSGGNVDKAMELFEEMPKVGVELNVMGSTCLIQCL 485

Query: 485 GKARRIDDLVRVFDVSVRKGVEPDDRLCGCLLSVVSLCDNNEDISKVFTCLQQANPKLVA 544
           GKA+RI D+VRVF +S+ +GV+PDDRLCGCLLSVVSLC+N ED  KV  CL+Q+NP+LVA
Sbjct: 486 GKAKRISDMVRVFSISMERGVKPDDRLCGCLLSVVSLCENEEDEDKVVACLEQSNPRLVA 545

Query: 545 FVNLLQQNDITFDVIKDEFRTILGETATEARRPFCNCLIDICRNQNLSKRAHELLYLGSL 604
            + LLQ+ + +F+ IKDEFR+++G TA EARRPFCNCLIDICR +NL ++AHELLYLG L
Sbjct: 546 LIKLLQEENTSFETIKDEFRSVIGNTAVEARRPFCNCLIDICRIKNLDEKAHELLYLGIL 605

Query: 605 YGLYPGLHNKTEGEWCLDVRSLSVGAAQTALEEWMITLSKIVQREEALPELLSAQTGAGT 664
           YGLYPGLHNKT  EWCLDVRSLSVGAA TALEEWM TL KIVQR+EALPEL  AQTG GT
Sbjct: 606 YGLYPGLHNKTADEWCLDVRSLSVGAAHTALEEWMGTLCKIVQRKEALPELFLAQTGTGT 665

Query: 665 HRFSQGLANSFASHVEKLAAPFRLREDRAGWFVATREDVVAWVHSRVPSV 702
           H+FSQGL  SFA+HV KLAAPF+  E+RAG FVA+RED+V+W +SR+ ++
Sbjct: 666 HKFSQGLGTSFAAHVRKLAAPFKQSEERAGCFVASREDLVSWANSRLSTI 714

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)  Description
PP420_ARATH   5.1e-282   66.06         Pentatricopeptide repeat-containing protein At5g46580, chloroplastic OS=Arabidop...
PP314_ARATH   4.2e-127   39.19         Pentatricopeptide repeat-containing protein At4g16390, chloroplastic OS=Arabidop...
PP123_ARATH   7.1e-42    23.73         Pentatricopeptide repeat-containing protein At1g74750 OS=Arabidopsis thaliana GN...
PPR49_ARATH   3.9e-40    23.22         Pentatricopeptide repeat-containing protein At1g18900 OS=Arabidopsis thaliana GN...
PP178_ARATH   1.1e-37    25.18         Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop...
Match Name         E-value    Identity (%)  Description
A0A0A0L6K8_CUCSA   0.0e+00    88.80         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G011820 PE=4 SV=1
M5WQE9_PRUPE       5.8e-309   74.96         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002049mg PE=4 SV=1
F6HTA5_VITVI       2.4e-307   75.39         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_02s0012g01090 PE=4 SV=...
W9RN90_9ROSA       1.7e-305   73.94         Uncharacterized protein OS=Morus notabilis GN=L484_023300 PE=4 SV=1
A0A067DT79_CITSI   2.8e-303   72.71         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g005178mg PE=4 SV=1
Match Name    E-value    Identity (%)  Description
AT5G46580.1   2.9e-283   66.06         pentatricopeptide (PPR) repeat-containing protein
AT4G16390.1   2.4e-128   39.19         pentatricopeptide (PPR) repeat-containing protein
AT1G74750.1   4.0e-43    23.73         Pentatricopeptide repeat (PPR) superfamily protein
AT1G18900.3   2.2e-41    23.22         Pentatricopeptide repeat (PPR) superfamily protein
AT2G31400.1   6.0e-39    25.18         genomes uncoupled 1
Match Name                          E-value    Identity (%)  Description
gi|449462001|ref|XP_004148730.1|    0.0e+00    88.80         PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic ...
gi|659095679|ref|XP_008448710.1|    0.0e+00    88.52         PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic ...
gi|595852519|ref|XP_007210329.1|    8.3e-309   74.96         hypothetical protein PRUPE_ppa002049mg [Prunus persica]
gi|225427240|ref|XP_002278451.1|    3.5e-307   75.39         PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic ...
gi|1009107574|ref|XP_015880412.1|   3.5e-307   73.38         PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic ...
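
The E-values in these summary tables are written in scientific notation, and several are reported as 0.0e+00 or are extremely small. As a rough sketch only, the Python snippet below ranks the NCBI nr hits by E-value and breaks ties by percent identity. The tuple layout and the `rank_hits` helper are assumptions made for illustration; the accession, E-value, and identity values are copied from the table.

```python
# (accession, E-value string, percent identity) copied from the NCBI nr table.
hits = [
    ("XP_002278451.1", "3.5e-307", 75.39),
    ("XP_004148730.1", "0.0e+00", 88.80),
    ("XP_015880412.1", "3.5e-307", 73.38),
    ("XP_007210329.1", "8.3e-309", 74.96),
    ("XP_008448710.1", "0.0e+00", 88.52),
]


def rank_hits(rows):
    """Sort by ascending E-value, then by descending percent identity."""
    return sorted(rows, key=lambda row: (float(row[1]), -row[2]))


ranked = rank_hits(hits)
for accession, evalue, identity in ranked:
    print(f"{accession}\t{evalue}\t{identity:.2f}")
```

Python's `float` still represents a value such as 8.3e-309 as a nonzero (subnormal) number, so it sorts after the hits reported as exactly 0.0e+00.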
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR002625   Smr_dom
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term         Definition
GO:0005515   protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009073      aromatic amino acid family biosynthetic process
biological_process  GO:0009793      embryo development ending in seed dormancy
biological_process  GO:0016226      iron-sulfur cluster assembly
biological_process  GO:0045036      protein targeting to chloroplast
biological_process  GO:0010103      stomatal complex morphogenesis
biological_process  GO:0008150      biological_process
cellular_component  GO:0009507      chloroplast
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh16G011170.1   CmoCh16G011170.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002625  Smr domain                             PROFILE   PS50828           SMR                  coord: 608..691, score: 10
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR                  coord: 464..492, score: 1.1
                                                                                                   coord: 216..243, score: 0
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2                coord: 318..363, score: 5.2E-11
                                                                                                   coord: 391..434, score: 1.1E-8
                                                                                                   coord: 248..291, score: 2.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 251..284, score: 7.1E-6
                                                                                                   coord: 321..355, score: 1.3E-9
                                                                                                   coord: 463..495, score: 1.3E-5
                                                                                                   coord: 392..425, score: 2.0E-6
                                                                                                   coord: 428..460, score: 8.8E-5
                                                                                                   coord: 357..389, score: 3.9E-5
                                                                                                   coord: 216..250, score: 6.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 425..459, score: 10.128
                                                                                                   coord: 284..318, score: 9.164
                                                                                                   coord: 214..248, score: 11.192
                                                                                                   coord: 354..388, score: 10.117
                                                                                                   coord: 179..213, score: 6.654
                                                                                                   coord: 460..494, score: 9.602
                                                                                                   coord: 319..353, score: 12.891
                                                                                                   coord: 389..423, score: 10.797
                                                                                                   coord: 249..283, score: 11
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10                       coord: 219..455, score: 2.7
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED     coord: 568..650, score: 8.9E-252
                                                                                                   coord: 47..500, score: 8.9E
None       No IPR available                       PANTHER   PTHR24015:SF357   SUBFAMILY NOT NAMED  coord: 47..500, score: 8.9E-252
                                                                                                   coord: 568..650, score: 8.9E
None       No IPR available                       unknown   SSF81901          HCP-like             coord: 225..457, score: 2.4
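
The `coord: start..end` entries in the InterPro table give the position of each domain hit on the protein. As an illustrative sketch, and assuming the coordinates are 1-based and inclusive (an assumption, since the page does not state the convention), the Python snippet below extracts the ranges from a piece of text and computes each hit's length in residues. `COORD`, `parse_coords`, and `span_length` are hypothetical names used only here.

```python
import re

# Matches entries such as "coord: 608..691".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in the text, in order."""
    return [(int(start), int(end)) for start, end in COORD.findall(text)]


def span_length(start, end):
    """Length of a hit, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Smr domain hit followed by two of the PS51375 PPR profile hits from the table.
sample = "coord: 608..691 score: 10 coord: 179..213 score: 6.654 coord: 214..248"
for start, end in parse_coords(sample):
    print(start, end, span_length(start, end))
```

Under that assumption the Smr profile hit spans 84 residues and each PS51375 PPR profile hit spans 35 residues.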

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                    Block
CmoCh16G011170   CmoCh06G008720   Cucurbita moschata (Rifu)   cmocmoB294
The following block(s) are covering this gene:
Gene             Organism                    Block
CmoCh16G011170   Cucurbita pepo (Zucchini)   cmocpeB308