CmaCh17G012570 (gene) Cucurbita maxima (Rimu)

NameCmaCh17G012570
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCma_Chr17 : 8567327 .. 8570762 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGATGCGAGGCGAATGGCCTGAGAATCTCAAAGCTCAGCATCCATTATCCATGGCCGCCCACTCCACTCCCTCCATCCTTCTTCCACCGGCAGTCAGCCATTCAAATCCTAAGAACCGTGTATTTCTTTGTCAACAACGAGCGAGCCATCACAATTTCGAATCTCCCGCCCCATCTTCCTCTTCGTCTTCTACGGATGGTAAGCTCATGCAGAGAAGTCCTCGCGAAGGACGCATGGACATCGCAAAGCTGAAGGCGAAGGAAGCGGGCGAGAGAAAAGAGGAGGTCAACAGGAAGATTGCTTCTCAGAAAGCCATTTCTGTGATTTTGCGCAGGGAAGCCACGAAGGCCGTCATTGAGAGGAAGAGAGGCCCCACTAATTCTAAGAAACTGCTTCCGCGAACTGTTCTTGAGGCTCTCCATGAACGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTCAGTTAATTTTGGTTTTAAGCTGCGTCTGATTTTTCCTTTTCTAGTATTCGTTTGGAAATTTATAATCGCTTTGGTTGGAGACTCTTGGAAGAACAACGTAAAGGTGTTTTGTTCTTCCCTCTTTTTGGTTCTTGTATGATGAGTGAAATGACTCTTGTTCGGAATTCCAGCAGTTCCCTTCATGTTTGAGTTCAATGTAATTCATTTTAAGAACAAATTTAATTGACTGCTTTGAGTGCAAATAAACCTCAATTGAAGCTCAGAAGTTGGATCTTTCTTCATTTAGCAACCAGGAAAGAAGAAACAAGTTTATCTTTGGATTGAGTCGGTAAACACGGGCAGCACCGAACGAGAATGAATATGTATAGTTCTGGTACAACAAAGACGAGGCATAGAAATATAAATCACCATGGAACCTAAAAGTTTGAGCTTGTGGGTAACAACAAATTTAATTTATTTCAACACTTCAACATTCACTAGGTGGATTTGCTCGTGTGTTAGCGATCAACTGGAAGAGGACATTCTGGTTTCCTGTCTTTTTGGGGCATATCAGACTTTGTTGGGACGATCTAAGGCTTTGACTTATCTTACAACCATGGAATAGATTCGGTGAGATCTCACATCGGTTGGGGAGGAGAACAAAACATTATTTATAAGAGTGTGGAAGCCTCTCCCTAGTAGACCCGTTTTAAAAACCCTGAGGGAAAGTCCGAAAGGAAAAGCTCAAAGAGAACAATATCTGTTTGCGGTGGAATAAGCTGCTATTGCTATTTGGAGTGGGAATTTGTATGCATGAGAAAAATTGAAACTGAACTGCGGATGTGCATTCTATACTCTAAATGCATAAGGCCAGCAACTTAAGAAAAACTGGCACCATTGACAACTTATGGATTGTGTCATTAGCATTAATGCTAAATTTTGTTCATTGTTAAGAACAAGAAAAGTTCAAAACTTGGCCAAGAACGACCGTGATTAAGCTAATTCGTCTTCTTAATTGATTATGGATTTCAGGTTTTTGAACTACTACGCGAACAATTGTGGTACAGACCTTACGCCGGGATGTACATCAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCTGAAAAGGCCTATGAGCTGTTCCAAGAAATGATCGAGGAAGGCTGTGAAGTTAGCCATGAGTCCTATACTGCTCTCCTGTCGGCCTATAGTAGGAGTGGTCTTCTTGACAGGGCATTTTCACTCCTCAACGAGATGAAAAACAGTCCTGATTGCCAGCCTGACGTTCACACTTACTCTATCCTCATAAAATCATGCTTGCAGGTGTTTGCATTCAACAAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCGAACACTATTACATATAATACCTTCATTGATGCTTATGGCAAAGCAAAAATGTACTCTTACTTGCCTGCCTTTTGAGAACATAATAGATGATTCATCTGGTTGCTGACTGAACTATGTTTAAATTTGAGTACTATAGATGATAATCAACTTCATTTTTAGATGGAGCTTATATCTTTCTGCAGGCATAACGCCTTCCACGGCCAGTTCTCTTCAATCTTTCCTCCAGAGTCAACTGCTAGAAGTTGAGCTTGGTTATAGCTAGTAGTTGGAGCCAGCATGGGGTGTGTGAGGTCTCCTTTTGTAGGAAAGGGGTGGGTGCCTTGGGGCTGTTAGAAGAAAATGGTTTCGTCTTTTATGCTTATGGACTAGTTCTGCTCCTCTATGTCATGGTTGTTATCAATCTCTCTAGAAACAGTTCAAACCCACCACTAGCCGATATTGTCCTCTCTAGGCTCTCCCTTCTGGGCTTCCCTTCAAGGTTTTAGAATGTCTGCTAGAGAGAGGTTTTTATACCCTTATAAGGAATGTTTTGTTCCCATCTCCAACCGACGTGGGATCTCACAACTTCTGTCTCTTCTTTGCATTGTCCTGTCTTATAATTTGCGACCAGATGAATGTGTCTACTACTTCCATTGTAGAAAAGTAATTCTTTTTGTGTGTATATATTCTGCAGGTTTGCTGAAATGGAGTCCATCCTTGTGGAAATGCTGAGTGATGATGGTTGTAAACCCGATGTTTGGACAATGAACTCGACACTTCGAGGCTTTGGCAGCAGTGGACAATTAGAGACCATGGAGAAGTGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAACATCCTTTTGGATTCATACGGCAAAGCCAAAAGCTATGAGAAAATGAGTGCTGTTATGGAGTATATGCAGAAGTACCATTATTCATGGACAATCGTAACCTACAACATCGTTATCGACGCATTCGGGAGGGCTGGGAATTTGAAACAGATGGAGCATCTCTTTAGACTTATGAGATCAGAGAGGATCCAACCGAGTTGTGTAACACTTTGCTCGCTCGTAAAAGCCTACGGTCAAGCAGGAAAACGCGACAAAATCGAAAGTGTACTGCACATAGTCGAAAATTCCAATATAACGCTGGATACCGTCTTTTACAACTGTCTTGTGGACGCTTACGGGCGAATGGAATGTTTTGCAGAGATGAAGGCGGTACTCGGAATGATGGAGCAGAGAGGATGCAAGCCTGATAAGACTACCTACAGGATCATGGCTCGAGCTTATTCAGATGGAGGAATGGCCAACCATGCCAGGGAGATCCATGATCTTGTAAGAACAGCAGAAGCAAGTAAGAGAACTCGTCCTGACTTATGATATTAATCTCTTAGCATATGTAATGAAATAAATCTCAGTATTGATAATTAAAGGAGAAATTTAAAAATAAATTATCTCAATCAGGGAAAAAAATTGGATTTGGGCTTTGAGGGGTGGGCCAGGCCCATGCAGTAGTGCAAGGCAAGGGCGACGCTCAAAGGCCCACAATAGCAAGCTACTGTATTGGGCCTTTCCCCGAGAAAAATTAATTTAATATCATGT

mRNA sequence

GGATGCGAGGCGAATGGCCTGAGAATCTCAAAGCTCAGCATCCATTATCCATGGCCGCCCACTCCACTCCCTCCATCCTTCTTCCACCGGCAGTCAGCCATTCAAATCCTAAGAACCGTGTATTTCTTTGTCAACAACGAGCGAGCCATCACAATTTCGAATCTCCCGCCCCATCTTCCTCTTCGTCTTCTACGGATGGTAAGCTCATGCAGAGAAGTCCTCGCGAAGGACGCATGGACATCGCAAAGCTGAAGGCGAAGGAAGCGGGCGAGAGAAAAGAGGAGGTCAACAGGAAGATTGCTTCTCAGAAAGCCATTTCTGTGATTTTGCGCAGGGAAGCCACGAAGGCCGTCATTGAGAGGAAGAGAGGCCCCACTAATTCTAAGAAACTGCTTCCGCGAACTGTTCTTGAGGCTCTCCATGAACGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTTTTTGAACTACTACGCGAACAATTGTGGTACAGACCTTACGCCGGGATGTACATCAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCTGAAAAGGCCTATGAGCTGTTCCAAGAAATGATCGAGGAAGGCTGTGAAGTTAGCCATGAGTCCTATACTGCTCTCCTGTCGGCCTATAGTAGGAGTGGTCTTCTTGACAGGGCATTTTCACTCCTCAACGAGATGAAAAACAGTCCTGATTGCCAGCCTGACGTTCACACTTACTCTATCCTCATAAAATCATGCTTGCAGGTGTTTGCATTCAACAAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCGAACACTATTACATATAATACCTTCATTGATGCTTATGGCAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTTGTGGAAATGCTGAGTGATGATGGTTGTAAACCCGATGTTTGGACAATGAACTCGACACTTCGAGGCTTTGGCAGCAGTGGACAATTAGAGACCATGGAGAAGTGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAACATCCTTTTGGATTCATACGGCAAAGCCAAAAGCTATGAGAAAATGAGTGCTGTTATGGAGTATATGCAGAAGTACCATTATTCATGGACAATCGTAACCTACAACATCGTTATCGACGCATTCGGGAGGGCTGGGAATTTGAAACAGATGGAGCATCTCTTTAGACTTATGAGATCAGAGAGGATCCAACCGAGTTGTGTAACACTTTGCTCGCTCGTAAAAGCCTACGGTCAAGCAGGAAAACGCGACAAAATCGAAAGTGTACTGCACATAGTCGAAAATTCCAATATAACGCTGGATACCGTCTTTTACAACTGTCTTGTGGACGCTTACGGGCGAATGGAATGTTTTGCAGAGATGAAGGCGGTACTCGGAATGATGGAGCAGAGAGGATGCAAGCCTGATAAGACTACCTACAGGATCATGGCTCGAGCTTATTCAGATGGAGGAATGGCCAACCATGCCAGGGAGATCCATGATCTTGTAAGAACAGCAGAAGCAAGTAAGAGAACTCGTCCTGACTTATGATATTAATCTCTTAGCATATGTAATGAAATAAATCTCAGTATTGATAATTAAAGGAGAAATTTAAAAATAAATTATCTCAATCAGGGAAAAAAATTGGATTTGGGCTTTGAGGGGTGGGCCAGGCCCATGCAGTAGTGCAAGGCAAGGGCGACGCTCAAAGGCCCACAATAGCAAGCTACTGTATTGGGCCTTTCCCCGAGAAAAATTAATTTAATATCATGT

Coding sequence (CDS)

ATGCGAGGCGAATGGCCTGAGAATCTCAAAGCTCAGCATCCATTATCCATGGCCGCCCACTCCACTCCCTCCATCCTTCTTCCACCGGCAGTCAGCCATTCAAATCCTAAGAACCGTGTATTTCTTTGTCAACAACGAGCGAGCCATCACAATTTCGAATCTCCCGCCCCATCTTCCTCTTCGTCTTCTACGGATGGTAAGCTCATGCAGAGAAGTCCTCGCGAAGGACGCATGGACATCGCAAAGCTGAAGGCGAAGGAAGCGGGCGAGAGAAAAGAGGAGGTCAACAGGAAGATTGCTTCTCAGAAAGCCATTTCTGTGATTTTGCGCAGGGAAGCCACGAAGGCCGTCATTGAGAGGAAGAGAGGCCCCACTAATTCTAAGAAACTGCTTCCGCGAACTGTTCTTGAGGCTCTCCATGAACGAATCACGGCCTTGCGATGGGAGTCTGCGCTCAAGGTTTTTGAACTACTACGCGAACAATTGTGGTACAGACCTTACGCCGGGATGTACATCAAGCTAATTGTCATGCTTGGAAAATGTAAGCAACCTGAAAAGGCCTATGAGCTGTTCCAAGAAATGATCGAGGAAGGCTGTGAAGTTAGCCATGAGTCCTATACTGCTCTCCTGTCGGCCTATAGTAGGAGTGGTCTTCTTGACAGGGCATTTTCACTCCTCAACGAGATGAAAAACAGTCCTGATTGCCAGCCTGACGTTCACACTTACTCTATCCTCATAAAATCATGCTTGCAGGTGTTTGCATTCAACAAAGCACAAACTCTACTCTCTGACATGGTGACTCGAGGAATAAAACCGAACACTATTACATATAATACCTTCATTGATGCTTATGGCAAAGCAAAAATGTTTGCTGAAATGGAGTCCATCCTTGTGGAAATGCTGAGTGATGATGGTTGTAAACCCGATGTTTGGACAATGAACTCGACACTTCGAGGCTTTGGCAGCAGTGGACAATTAGAGACCATGGAGAAGTGTTATGAAAAGTTCCAGGGAGCTGGAATCCAACCAAACATTCAGACCTTCAACATCCTTTTGGATTCATACGGCAAAGCCAAAAGCTATGAGAAAATGAGTGCTGTTATGGAGTATATGCAGAAGTACCATTATTCATGGACAATCGTAACCTACAACATCGTTATCGACGCATTCGGGAGGGCTGGGAATTTGAAACAGATGGAGCATCTCTTTAGACTTATGAGATCAGAGAGGATCCAACCGAGTTGTGTAACACTTTGCTCGCTCGTAAAAGCCTACGGTCAAGCAGGAAAACGCGACAAAATCGAAAGTGTACTGCACATAGTCGAAAATTCCAATATAACGCTGGATACCGTCTTTTACAACTGTCTTGTGGACGCTTACGGGCGAATGGAATGTTTTGCAGAGATGAAGGCGGTACTCGGAATGATGGAGCAGAGAGGATGCAAGCCTGATAAGACTACCTACAGGATCATGGCTCGAGCTTATTCAGATGGAGGAATGGCCAACCATGCCAGGGAGATCCATGATCTTGTAAGAACAGCAGAAGCAAGTAAGAGAACTCGTCCTGACTTATGA

Protein sequence

MRGEWPENLKAQHPLSMAAHSTPSILLPPAVSHSNPKNRVFLCQQRASHHNFESPAPSSSSSSTDGKLMQRSPREGRMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVRTAEASKRTRPDL
BLAST of CmaCh17G012570 vs. Swiss-Prot
Match: PP424_ARATH (Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidopsis thaliana GN=At5g48730 PE=2 SV=2)

HSP 1 Score: 641.0 bits (1652), Expect = 1.2e-182
Identity = 328/498 (65.86%), Postives = 394/498 (79.12%), Query Frame = 1

Query: 17  MAAHSTPSILLPPAVSHSNPKNRVFLCQQRA-SHHNFESPAPSSSSSSTDGKLMQRSPRE 76
           M + ST +   PP  ++     R F  +  + S         S  S++T   L +    +
Sbjct: 1   MVSLSTSTSHAPPLPTNRRTAERTFTVRCISISPREPNYAITSDKSNNTSLSLRETRQSK 60

Query: 77  GRMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTV 136
             ++   +  +++ E KE+ N KIAS+KAIS+ILRREATK++IE+K+G   SKKLLPRTV
Sbjct: 61  WLINAEDVNERDSKEIKEDKNTKIASRKAISIILRREATKSIIEKKKG---SKKLLPRTV 120

Query: 137 LEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI 196
           LE+LHERITALRWESA++VFELLREQLWY+P  G+Y+KLIVMLGKCKQPEKA+ELFQEMI
Sbjct: 121 LESLHERITALRWESAIQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMI 180

Query: 197 EEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAF 256
            EGC V+HE YTAL+SAYSRSG  D AF+LL  MK+S +CQPDVHTYSILIKS LQVFAF
Sbjct: 181 NEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAF 240

Query: 257 NKAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNS 316
           +K Q LLSDM  +GI+PNTITYNT IDAYGKAKMF EMES L++ML +D CKPD WTMNS
Sbjct: 241 DKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMFVEMESTLIQMLGEDDCKPDSWTMNS 300

Query: 317 TLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYH 376
           TLR FG +GQ+E ME CYEKFQ +GI+PNI+TFNILLDSYGK+ +Y+KMSAVMEYMQKYH
Sbjct: 301 TLRAFGGNGQIEMMENCYEKFQSSGIEPNIRTFNILLDSYGKSGNYKKMSAVMEYMQKYH 360

Query: 377 YSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIE 436
           YSWTIVTYN+VIDAFGRAG+LKQME+LFRLM+SERI PSCVTLCSLV+AYG+A K DKI 
Sbjct: 361 YSWTIVTYNVVIDAFGRAGDLKQMEYLFRLMQSERIFPSCVTLCSLVRAYGRASKADKIG 420

Query: 437 SVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAY 496
            VL  +ENS+I LD VF+NCLVDAYGRME FAEMK VL +ME++G KPDK TYR M +AY
Sbjct: 421 GVLRFIENSDIRLDLVFFNCLVDAYGRMEKFAEMKGVLELMEKKGFKPDKITYRTMVKAY 480

Query: 497 SDGGMANHAREIHDLVRT 514
              GM  H +E+H +V +
Sbjct: 481 RISGMTTHVKELHGVVES 495

BLAST of CmaCh17G012570 vs. Swiss-Prot
Match: PP279_ARATH (Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN=At3g53170 PE=3 SV=1)

HSP 1 Score: 373.6 bits (958), Expect = 3.5e-102
Identity = 192/429 (44.76%), Postives = 281/429 (65.50%), Query Frame = 1

Query: 86  KEAGERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 145
           K   ER E++N  + S       +K +S ILR +A    IERK        L P+ VLEA
Sbjct: 5   KVPNERTEKMNSGLISTRHQVDPKKELSRILRTDAAVKGIERKANSEKYLTLWPKAVLEA 64

Query: 146 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 205
           L E I   RW+SALK+F LLR+Q WY P    Y KL  +LG CKQP++A  LF+ M+ EG
Sbjct: 65  LDEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEG 124

Query: 206 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 265
            + + + YT+L+S Y +S LLD+AFS L  MK+  DC+PDV T+++LI  C ++  F+  
Sbjct: 125 LKPTIDVYTSLISVYGKSELLDKAFSTLEYMKSVSDCKPDVFTFTVLISCCCKLGRFDLV 184

Query: 266 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 325
           ++++ +M   G+  +T+TYNT ID YGKA MF EMES+L +M+ D    PDV T+NS + 
Sbjct: 185 KSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNSIIG 244

Query: 326 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 385
            +G+   +  ME  Y +FQ  G+QP+I TFNIL+ S+GKA  Y+KM +VM++M+K  +S 
Sbjct: 245 SYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILSFGKAGMYKKMCSVMDFMEKRFFSL 304

Query: 386 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVL 445
           T VTYNIVI+ FG+AG +++M+ +FR M+ + ++P+ +T CSLV AY +AG   KI+SVL
Sbjct: 305 TTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPNSITYCSLVNAYSKAGLVVKIDSVL 364

Query: 446 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 505
             + NS++ LDT F+NC+++AYG+    A MK +   ME+R CKPDK T+  M + Y+  
Sbjct: 365 RQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYIQMEERKCKPDKITFATMIKTYTAH 424

Query: 506 GMANHAREI 508
           G+ +  +E+
Sbjct: 425 GIFDAVQEL 433

BLAST of CmaCh17G012570 vs. Swiss-Prot
Match: PP216_ARATH (Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidopsis thaliana GN=EMB2750 PE=2 SV=1)

HSP 1 Score: 342.0 bits (876), Expect = 1.1e-92
Identity = 177/396 (44.70%), Postives = 244/396 (61.62%), Query Frame = 1

Query: 114 TKAVIERKRGPTNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWY 173
           T+ V +R+    N KK L R         TV E L + I   +W  AL+VF++LREQ +Y
Sbjct: 61  TEPVNQRRTPIKNVKKKLDRRSKANGWVNTVTETLSDLIAKKQWLQALEVFDMLREQTFY 120

Query: 174 RPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFS 233
           +P  G Y+KL+V+LGK  QP +A +LF EM+EEG E + E YTALL+AY+RS L+D AFS
Sbjct: 121 QPKEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLEPTVELYTALLAAYTRSNLIDDAFS 180

Query: 234 LLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPNTITYNTFIDAY 293
           +L++MK+ P CQPDV TYS L+K+C+    F+   +L  +M  R I PNT+T N  +  Y
Sbjct: 181 ILDKMKSFPQCQPDVFTYSTLLKACVDASQFDLVDSLYKEMDERLITPNTVTQNIVLSGY 240

Query: 294 GKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPN 353
           G+   F +ME +L +ML    CKPDVWTMN  L  FG+ G+++ ME  YEKF+  GI+P 
Sbjct: 241 GRVGRFDQMEKVLSDMLVSTACKPDVWTMNIILSVFGNMGKIDMMESWYEKFRNFGIEPE 300

Query: 354 IQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFR 413
            +TFNIL+ SYGK + Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F 
Sbjct: 301 TRTFNILIGSYGKKRMYDKMSSVMEYMRKLEFPWTTSTYNNIIEAFADVGDAKNMELTFD 360

Query: 414 LMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITLDTVFYNCLVDAYGRME 473
            MRSE ++    T C L+  Y  AG   K+ S + +     I  +T FYN ++ A  + +
Sbjct: 361 QMRSEGMKADTKTFCCLINGYANAGLFHKVISSVQLAAKFEIPENTAFYNAVISACAKAD 420

Query: 474 CFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGM 501
              EM+ V   M++R C  D  T+ IM  AY   GM
Sbjct: 421 DLIEMERVYIRMKERQCVCDSRTFEIMVEAYEKEGM 456

BLAST of CmaCh17G012570 vs. Swiss-Prot
Match: PP358_ARATH (Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidopsis thaliana GN=EMB2453 PE=2 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 9.6e-44
Identity = 102/344 (29.65%), Postives = 182/344 (52.91%), Query Frame = 1

Query: 147 RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESY 206
           +W   L+VF  +++Q WY P  G+Y KLI ++GK  Q   A  LF EM   GC      Y
Sbjct: 112 KWLQCLEVFRWMQKQRWYIPDNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVY 171

Query: 207 TALLSAY----SRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLL 266
            AL++A+     ++  L++    L++MK    CQP+V TY+IL+++  Q    ++   L 
Sbjct: 172 NALITAHLHTRDKAKALEKVRGYLDKMKGIERCQPNVVTYNILLRAFAQSGKVDQVNALF 231

Query: 267 SDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGS 326
            D+    + P+  T+N  +DAYGK  M  EME++L  M S++ CKPD+ T N  +  +G 
Sbjct: 232 KDLDMSPVSPDVYTFNGVMDAYGKNGMIKEMEAVLTRMRSNE-CKPDIITFNVLIDSYGK 291

Query: 327 SGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVT 386
             + E ME+ ++    +  +P + TFN ++ +YGKA+  +K   V + M   +Y  + +T
Sbjct: 292 KQEFEKMEQTFKSLMRSKEKPTLPTFNSMIINYGKARMIDKAEWVFKKMNDMNYIPSFIT 351

Query: 387 YNIVIDAFGRAGNLKQMEHLF-RLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIV 446
           Y  +I  +G  G++ +   +F  +  S+R+  +  TL ++++ Y + G   + + + H  
Sbjct: 352 YECMIMMYGYCGSVSRAREIFEEVGESDRVLKAS-TLNAMLEVYCRNGLYIEADKLFHNA 411

Query: 447 ENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDK 486
               +  D   Y  L  AY + +   +++ ++  ME+ G  P+K
Sbjct: 412 SAFRVHPDASTYKFLYKAYTKADMKEQVQILMKKMEKDGIVPNK 453

BLAST of CmaCh17G012570 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 1.1e-39
Identity = 103/342 (30.12%), Postives = 163/342 (47.66%), Query Frame = 1

Query: 171 YIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMK 230
           Y  L+ + GK  +P++A ++  EM+  G   S  +Y +L+SAY+R G+LD A  L N+M 
Sbjct: 317 YNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMA 376

Query: 231 NSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMF 290
                +PDV TY+ L+    +      A ++  +M   G KPN  T+N FI  YG    F
Sbjct: 377 EK-GTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKF 436

Query: 291 AEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNI 350
            EM  I  E ++  G  PD+ T N+ L  FG +G    +   +++ + AG  P  +TFN 
Sbjct: 437 TEMMKIFDE-INVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNT 496

Query: 351 LLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSER 410
           L+ +Y +  S+E+   V   M     +  + TYN V+ A  R G  +Q E +   M   R
Sbjct: 497 LISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGR 556

Query: 411 IQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMK 470
            +P+ +T CSL+ AY    +   + S+   V +  I    V    LV    + +   E +
Sbjct: 557 CKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAE 616

Query: 471 AVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVR 513
                +++RG  PD TT   M   Y    M   A  + D ++
Sbjct: 617 RAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLDYMK 656

BLAST of CmaCh17G012570 vs. TrEMBL
Match: A0A0A0K8C6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G342750 PE=4 SV=1)

HSP 1 Score: 884.8 bits (2285), Expect = 5.2e-254
Identity = 447/506 (88.34%), Postives = 475/506 (93.87%), Query Frame = 1

Query: 27  LPPAVSHSNP-KNRVFLCQQRASHHNFESPAPSSSSSST-------DGKLMQRSPREGRM 86
           LPP ++H NP  NR++L +Q+ +H  F+SPAPSSSSSS+       DGKLM+ SP EGRM
Sbjct: 6   LPPPINHPNPTNNRLYLRRQQRTHPKFQSPAPSSSSSSSSSTTTTADGKLMKISPHEGRM 65

Query: 87  DIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 146
           D+AKLKAKEA ERKEEVNRKIASQKAISVILRREATKAVIERKRGP NSKKLLPRTVLEA
Sbjct: 66  DVAKLKAKEASERKEEVNRKIASQKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEA 125

Query: 147 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 206
           LH+RIT LRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQ EKAYELFQEMIEEG
Sbjct: 126 LHDRITTLRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQQEKAYELFQEMIEEG 185

Query: 207 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 266
           CEVSHESYTALLSAYSRSGLLD AFS+LNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA
Sbjct: 186 CEVSHESYTALLSAYSRSGLLDEAFSILNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 245

Query: 267 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 326
           QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILV+ML+DDGCKPDVWTMNSTLR
Sbjct: 246 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVDMLNDDGCKPDVWTMNSTLR 305

Query: 327 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 386
            FG SGQLETMEKCYEKFQ AGIQP+IQTFNILLDSYGKA+SYEKMSAVMEYMQKYHYSW
Sbjct: 306 AFGRSGQLETMEKCYEKFQEAGIQPSIQTFNILLDSYGKAESYEKMSAVMEYMQKYHYSW 365

Query: 387 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVL 446
           TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERI+PSCVTLCSLV+AYGQAGKR+KI+SVL
Sbjct: 366 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIKPSCVTLCSLVRAYGQAGKREKIDSVL 425

Query: 447 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 506
           ++VENS+I LDTVFYNCLVDAYGR+ECFAEMK VLGMMEQRGCKPDKTTYR MARAYSDG
Sbjct: 426 NLVENSDIMLDTVFYNCLVDAYGRLECFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDG 485

Query: 507 GMANHAREIHDLVRTAEASKRTRPDL 525
           GMANHA+EI +L+ TAE SKRTRPDL
Sbjct: 486 GMANHAKEIQELITTAEPSKRTRPDL 511

BLAST of CmaCh17G012570 vs. TrEMBL
Match: B9SV96_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1303150 PE=4 SV=1)

HSP 1 Score: 709.1 bits (1829), Expect = 3.9e-201
Identity = 350/434 (80.65%), Postives = 390/434 (89.86%), Query Frame = 1

Query: 77  RMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVL 136
           RMD   +K KE  E KEE++RKIAS+KAISVILRREATKA+IE+KRGPTNSKKLLPRTVL
Sbjct: 64  RMDWEIVKKKEEKEGKEEMDRKIASRKAISVILRREATKAIIEKKRGPTNSKKLLPRTVL 123

Query: 137 EALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIE 196
           EALHERITALRWESALKVFELLREQ+WYRPY+GMYIKLIVMLGKCKQPEKA+ELFQ MI 
Sbjct: 124 EALHERITALRWESALKVFELLREQIWYRPYSGMYIKLIVMLGKCKQPEKAHELFQAMIH 183

Query: 197 EGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFN 256
           EGC+VSHESYTALLSAY RSGLLD+AFSLL EMK +PDCQPDVHTYSILIKSC+QVFAF+
Sbjct: 184 EGCDVSHESYTALLSAYGRSGLLDKAFSLLEEMKRNPDCQPDVHTYSILIKSCVQVFAFD 243

Query: 257 KAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNST 316
           KA+TLLS+M + GI PNTITYNT IDAYGKAKMF EME+ LV+MLS   C+PDVWTMNST
Sbjct: 244 KAKTLLSNMESLGISPNTITYNTLIDAYGKAKMFEEMEATLVKMLSQQNCEPDVWTMNST 303

Query: 317 LRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHY 376
           LR FG SGQ+ETMEKCYEKFQGAGI+P+I TFN+LLDSYGKA  Y+KMSAVMEYMQKYHY
Sbjct: 304 LRAFGISGQIETMEKCYEKFQGAGIEPSIMTFNVLLDSYGKAGDYKKMSAVMEYMQKYHY 363

Query: 377 SWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIES 436
           SWTI+TYNIVIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYGQA K +KIE 
Sbjct: 364 SWTIITYNIVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAEKPEKIEG 423

Query: 437 VLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYS 496
           VL  +ENS+ITLDTVF+NCLVDAYGRM CFAEMK VL +MEQ+G +PDK TYR M +AYS
Sbjct: 424 VLRFIENSDITLDTVFFNCLVDAYGRMGCFAEMKGVLILMEQKGYRPDKITYRTMIKAYS 483

Query: 497 DGGMANHAREIHDL 511
             GM  H +E+ DL
Sbjct: 484 SKGMTKHVKELQDL 497

BLAST of CmaCh17G012570 vs. TrEMBL
Match: V4TYK6_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019744mg PE=4 SV=1)

HSP 1 Score: 700.3 bits (1806), Expect = 1.8e-198
Identity = 357/497 (71.83%), Postives = 415/497 (83.50%), Query Frame = 1

Query: 29  PAVSHSNPKNRVFLCQQRASHHNFESPAPSSSSSSTDGKLMQRSPREGRMDIAKLKAKEA 88
           P +S S P ++  +  ++  HH   S    ++ S   GK  Q   R   +D+ KLK +E 
Sbjct: 27  PVLSTSVPLSKPEIEPRKRPHHIISS----NNHSGNAGKTPQS--RGTLLDLEKLKEREE 86

Query: 89  GERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERITALRW 148
            ERKEEVNRKIAS+KAISVILRREATKAVIE+KRGP NSKKLLPRTVLEAL+ERITALRW
Sbjct: 87  KERKEEVNRKIASKKAISVILRREATKAVIEKKRGPVNSKKLLPRTVLEALNERITALRW 146

Query: 149 ESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTA 208
           ESALKVFELLREQLWY+P A +Y+KLIVMLGKCKQPEKA+ELFQ M++EGC+ + +S+TA
Sbjct: 147 ESALKVFELLREQLWYKPNAAVYVKLIVMLGKCKQPEKAHELFQAMVDEGCDANTQSFTA 206

Query: 209 LLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTR 268
           LLSAY RSGL D+AFSLL  MKN+PDCQPDV+TYSILIKSCL+ FAF+K Q LLSDM T+
Sbjct: 207 LLSAYGRSGLFDKAFSLLEHMKNTPDCQPDVNTYSILIKSCLKAFAFDKVQALLSDMSTQ 266

Query: 269 GIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLET 328
           GI+PNT+TYNT IDAYG+AKMFAEME  LV+MLS+D C+PDVWTMN TLR FG+SGQ++T
Sbjct: 267 GIRPNTVTYNTLIDAYGRAKMFAEMELTLVKMLSED-CEPDVWTMNCTLRAFGNSGQIDT 326

Query: 329 MEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVID 388
           MEKCYEKFQ AGIQP+I TFNILLDSYGKA  +EKMSAVMEYMQKYHYSWTIVTYNIVID
Sbjct: 327 MEKCYEKFQSAGIQPSINTFNILLDSYGKAGHFEKMSAVMEYMQKYHYSWTIVTYNIVID 386

Query: 389 AFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITL 448
           AFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYG AGK +K+ SVL  ++NS+I L
Sbjct: 387 AFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGHAGKPEKLGSVLRFIDNSDIML 446

Query: 449 DTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIH 508
           DTVF+NCLVDAYGR++CFAEMK VL +M+QRGCKPDK TYR M RAYS  GM NHA+E  
Sbjct: 447 DTVFFNCLVDAYGRLKCFAEMKGVLEVMQQRGCKPDKVTYRTMVRAYSTNGMKNHAKEFQ 506

Query: 509 DLVRTAEAS--KRTRPD 524
           DLV   + +     RPD
Sbjct: 507 DLVEKMDETCLAMKRPD 516

BLAST of CmaCh17G012570 vs. TrEMBL
Match: A0A067LME3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17186 PE=4 SV=1)

HSP 1 Score: 698.7 bits (1802), Expect = 5.2e-198
Identity = 347/448 (77.46%), Postives = 389/448 (86.83%), Query Frame = 1

Query: 78  MDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLE 137
           +D  K+  +EA E+KEE NRKIAS+KAISVILRREATKA+IE+K+GPTNSKKLLPRTVLE
Sbjct: 72  IDWKKVNEREAREKKEEANRKIASRKAISVILRREATKAIIEKKKGPTNSKKLLPRTVLE 131

Query: 138 ALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEE 197
           ALH+RITALRWESALKVFELLREQLWYRP  GMYIKLIVMLGKCKQPEKA+ELF  MI E
Sbjct: 132 ALHDRITALRWESALKVFELLREQLWYRPSPGMYIKLIVMLGKCKQPEKAHELFDAMIAE 191

Query: 198 GCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNK 257
           GC VS ESYTALLSAY RSGLLD AFSLL EMKN+PDC+PDVHTYSILIKSCLQVFAF+K
Sbjct: 192 GCVVSRESYTALLSAYGRSGLLDEAFSLLEEMKNNPDCRPDVHTYSILIKSCLQVFAFDK 251

Query: 258 AQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTL 317
           AQ LLSDM   G++ NTITYNT IDAYGKAKMFAEME+ LV+MLS+  C+PDVWTMNSTL
Sbjct: 252 AQELLSDMAPLGVRANTITYNTLIDAYGKAKMFAEMEATLVKMLSEQNCEPDVWTMNSTL 311

Query: 318 RGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYS 377
           R FG SGQ+E ME CYEKFQ AGI+PNI+TFNILLDSYGKA  Y KMSAVMEYMQKYHYS
Sbjct: 312 RAFGGSGQIEMMETCYEKFQSAGIEPNIKTFNILLDSYGKAGDYRKMSAVMEYMQKYHYS 371

Query: 378 WTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESV 437
           WTIVTYN+VIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYG+AG+ +KI  V
Sbjct: 372 WTIVTYNVVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGEAGEPEKIGGV 431

Query: 438 LHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSD 497
           L  +ENS+ITLD VF+NCLVDAYG++ CF EMK VL +MEQ+GCK DK TYR M  AYS 
Sbjct: 432 LRFIENSDITLDIVFFNCLVDAYGKLGCFVEMKGVLELMEQKGCKADKITYRTMINAYSS 491

Query: 498 GGMANHAREIHDLVRTAEASK--RTRPD 524
            GM  HA+E+ DLV +AE  +  R +PD
Sbjct: 492 KGMTKHAKELQDLVVSAERPRLHRNKPD 519

BLAST of CmaCh17G012570 vs. TrEMBL
Match: A0A0B0NSH6_GOSAR (Uncharacterized protein OS=Gossypium arboreum GN=F383_20827 PE=4 SV=1)

HSP 1 Score: 694.9 bits (1792), Expect = 7.6e-197
Identity = 350/477 (73.38%), Postives = 401/477 (84.07%), Query Frame = 1

Query: 47  ASHHNFESPAPSSSSSSTDGKLMQRSPREGRMDIAK-LKAKEAGERKEEVNRKIASQKAI 106
           AS     +P       +    + +R   +GR+ IA+ LK +E   R+EEVNRKIAS+KAI
Sbjct: 33  ASKSETSAPNEDEEEETRSLTVSERGREKGRLVIAEELKQRETRGRREEVNRKIASRKAI 92

Query: 107 SVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYR 166
           SVI+RREATKA IE+KRGP NSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYR
Sbjct: 93  SVIMRREATKAFIEKKRGPNNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWYR 152

Query: 167 PYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSL 226
           P A +YIKLIV+LGKCKQP+KAYELFQ M +EGC ++HE+YTALLSAYSRSGL D+AFSL
Sbjct: 153 PNAAIYIKLIVLLGKCKQPDKAYELFQAMSDEGCVMNHEAYTALLSAYSRSGLFDKAFSL 212

Query: 227 LNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPNTITYNTFIDAYG 286
           L EMK++P C PDV TYSILIKSCLQVFAF+K + LLSDM +RGI+PNT+TYNT IDAYG
Sbjct: 213 LEEMKDTPICHPDVQTYSILIKSCLQVFAFDKVRALLSDMASRGIRPNTVTYNTLIDAYG 272

Query: 287 KAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNI 346
           KAKMF EME  LVEML    C+PDVWTMNST+R FGSSGQ+ETMEKCYEKFQ AGIQPNI
Sbjct: 273 KAKMFQEMEMTLVEMLRGKDCEPDVWTMNSTIRAFGSSGQIETMEKCYEKFQSAGIQPNI 332

Query: 347 QTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRL 406
           +TFNILLDSYGK  +YEKMSAVMEYMQKYHYSWTIVTYN+VIDAFGRAG+LKQ+E+LFRL
Sbjct: 333 KTFNILLDSYGKTGNYEKMSAVMEYMQKYHYSWTIVTYNVVIDAFGRAGDLKQVEYLFRL 392

Query: 407 MRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITLDTVFYNCLVDAYGRMEC 466
           MRSERI+PSCVTLCSLV+AYGQAGK +KI  VL I+ENS++TLD VF+NCLVDAYGRM C
Sbjct: 393 MRSERIKPSCVTLCSLVRAYGQAGKAEKIAGVLRIIENSDVTLDIVFFNCLVDAYGRMGC 452

Query: 467 FAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVRTAEASKRTRP 523
           FAEMK VL MM+Q+G KPDK TYR M +AYS  GM +HA+E+ +LV +A  S    P
Sbjct: 453 FAEMKGVLEMMKQKGYKPDKITYRTMIKAYSISGMTSHAKELRNLVESAAGSSLGMP 509

BLAST of CmaCh17G012570 vs. TAIR10
Match: AT5G48730.1 (AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 641.0 bits (1652), Expect = 6.6e-184
Identity = 328/498 (65.86%), Postives = 394/498 (79.12%), Query Frame = 1

Query: 17  MAAHSTPSILLPPAVSHSNPKNRVFLCQQRA-SHHNFESPAPSSSSSSTDGKLMQRSPRE 76
           M + ST +   PP  ++     R F  +  + S         S  S++T   L +    +
Sbjct: 1   MVSLSTSTSHAPPLPTNRRTAERTFTVRCISISPREPNYAITSDKSNNTSLSLRETRQSK 60

Query: 77  GRMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTV 136
             ++   +  +++ E KE+ N KIAS+KAIS+ILRREATK++IE+K+G   SKKLLPRTV
Sbjct: 61  WLINAEDVNERDSKEIKEDKNTKIASRKAISIILRREATKSIIEKKKG---SKKLLPRTV 120

Query: 137 LEALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMI 196
           LE+LHERITALRWESA++VFELLREQLWY+P  G+Y+KLIVMLGKCKQPEKA+ELFQEMI
Sbjct: 121 LESLHERITALRWESAIQVFELLREQLWYKPNVGIYVKLIVMLGKCKQPEKAHELFQEMI 180

Query: 197 EEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAF 256
            EGC V+HE YTAL+SAYSRSG  D AF+LL  MK+S +CQPDVHTYSILIKS LQVFAF
Sbjct: 181 NEGCVVNHEVYTALVSAYSRSGRFDAAFTLLERMKSSHNCQPDVHTYSILIKSFLQVFAF 240

Query: 257 NKAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNS 316
           +K Q LLSDM  +GI+PNTITYNT IDAYGKAKMF EMES L++ML +D CKPD WTMNS
Sbjct: 241 DKVQDLLSDMRRQGIRPNTITYNTLIDAYGKAKMFVEMESTLIQMLGEDDCKPDSWTMNS 300

Query: 317 TLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYH 376
           TLR FG +GQ+E ME CYEKFQ +GI+PNI+TFNILLDSYGK+ +Y+KMSAVMEYMQKYH
Sbjct: 301 TLRAFGGNGQIEMMENCYEKFQSSGIEPNIRTFNILLDSYGKSGNYKKMSAVMEYMQKYH 360

Query: 377 YSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIE 436
           YSWTIVTYN+VIDAFGRAG+LKQME+LFRLM+SERI PSCVTLCSLV+AYG+A K DKI 
Sbjct: 361 YSWTIVTYNVVIDAFGRAGDLKQMEYLFRLMQSERIFPSCVTLCSLVRAYGRASKADKIG 420

Query: 437 SVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAY 496
            VL  +ENS+I LD VF+NCLVDAYGRME FAEMK VL +ME++G KPDK TYR M +AY
Sbjct: 421 GVLRFIENSDIRLDLVFFNCLVDAYGRMEKFAEMKGVLELMEKKGFKPDKITYRTMVKAY 480

Query: 497 SDGGMANHAREIHDLVRT 514
              GM  H +E+H +V +
Sbjct: 481 RISGMTTHVKELHGVVES 495

BLAST of CmaCh17G012570 vs. TAIR10
Match: AT3G53170.1 (AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 375.2 bits (962), Expect = 6.7e-104
Identity = 193/441 (43.76%), Postives = 287/441 (65.08%), Query Frame = 1

Query: 86  KEAGERKEEVNRKIAS-------QKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 145
           K   ER E++N  + S       +K +S ILR +A    IERK        L P+ VLEA
Sbjct: 55  KVPNERTEKMNSGLISTRHQVDPKKELSRILRTDAAVKGIERKANSEKYLTLWPKAVLEA 114

Query: 146 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 205
           L E I   RW+SALK+F LLR+Q WY P    Y KL  +LG CKQP++A  LF+ M+ EG
Sbjct: 115 LDEAIKENRWQSALKIFNLLRKQHWYEPRCKTYTKLFKVLGNCKQPDQASLLFEVMLSEG 174

Query: 206 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 265
            + + + YT+L+S Y +S LLD+AFS L  MK+  DC+PDV T+++LI  C ++  F+  
Sbjct: 175 LKPTIDVYTSLISVYGKSELLDKAFSTLEYMKSVSDCKPDVFTFTVLISCCCKLGRFDLV 234

Query: 266 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 325
           ++++ +M   G+  +T+TYNT ID YGKA MF EMES+L +M+ D    PDV T+NS + 
Sbjct: 235 KSIVLEMSYLGVGCSTVTYNTIIDGYGKAGMFEEMESVLADMIEDGDSLPDVCTLNSIIG 294

Query: 326 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 385
            +G+   +  ME  Y +FQ  G+QP+I TFNIL+ S+GKA  Y+KM +VM++M+K  +S 
Sbjct: 295 SYGNGRNMRKMESWYSRFQLMGVQPDITTFNILILSFGKAGMYKKMCSVMDFMEKRFFSL 354

Query: 386 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVL 445
           T VTYNIVI+ FG+AG +++M+ +FR M+ + ++P+ +T CSLV AY +AG   KI+SVL
Sbjct: 355 TTVTYNIVIETFGKAGRIEKMDDVFRKMKYQGVKPNSITYCSLVNAYSKAGLVVKIDSVL 414

Query: 446 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 505
             + NS++ LDT F+NC+++AYG+    A MK +   ME+R CKPDK T+  M + Y+  
Sbjct: 415 RQIVNSDVVLDTPFFNCIINAYGQAGDLATMKELYIQMEERKCKPDKITFATMIKTYTAH 474

Query: 506 GMANHAREIHDLVRTAEASKR 520
           G+ +  +E+   + +++  K+
Sbjct: 475 GIFDAVQELEKQMISSDIGKK 495

BLAST of CmaCh17G012570 vs. TAIR10
Match: AT3G06430.1 (AT3G06430.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 342.0 bits (876), Expect = 6.3e-94
Identity = 177/396 (44.70%), Postives = 244/396 (61.62%), Query Frame = 1

Query: 114 TKAVIERKRGPTNSKKLLPR---------TVLEALHERITALRWESALKVFELLREQLWY 173
           T+ V +R+    N KK L R         TV E L + I   +W  AL+VF++LREQ +Y
Sbjct: 61  TEPVNQRRTPIKNVKKKLDRRSKANGWVNTVTETLSDLIAKKQWLQALEVFDMLREQTFY 120

Query: 174 RPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFS 233
           +P  G Y+KL+V+LGK  QP +A +LF EM+EEG E + E YTALL+AY+RS L+D AFS
Sbjct: 121 QPKEGTYMKLLVLLGKSGQPNRAQKLFDEMLEEGLEPTVELYTALLAAYTRSNLIDDAFS 180

Query: 234 LLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPNTITYNTFIDAY 293
           +L++MK+ P CQPDV TYS L+K+C+    F+   +L  +M  R I PNT+T N  +  Y
Sbjct: 181 ILDKMKSFPQCQPDVFTYSTLLKACVDASQFDLVDSLYKEMDERLITPNTVTQNIVLSGY 240

Query: 294 GKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPN 353
           G+   F +ME +L +ML    CKPDVWTMN  L  FG+ G+++ ME  YEKF+  GI+P 
Sbjct: 241 GRVGRFDQMEKVLSDMLVSTACKPDVWTMNIILSVFGNMGKIDMMESWYEKFRNFGIEPE 300

Query: 354 IQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFR 413
            +TFNIL+ SYGK + Y+KMS+VMEYM+K  + WT  TYN +I+AF   G+ K ME  F 
Sbjct: 301 TRTFNILIGSYGKKRMYDKMSSVMEYMRKLEFPWTTSTYNNIIEAFADVGDAKNMELTFD 360

Query: 414 LMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITLDTVFYNCLVDAYGRME 473
            MRSE ++    T C L+  Y  AG   K+ S + +     I  +T FYN ++ A  + +
Sbjct: 361 QMRSEGMKADTKTFCCLINGYANAGLFHKVISSVQLAAKFEIPENTAFYNAVISACAKAD 420

Query: 474 CFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGM 501
              EM+ V   M++R C  D  T+ IM  AY   GM
Sbjct: 421 DLIEMERVYIRMKERQCVCDSRTFEIMVEAYEKEGM 456

BLAST of CmaCh17G012570 vs. TAIR10
Match: AT4G39620.1 (AT4G39620.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 179.5 bits (454), Expect = 5.4e-45
Identity = 102/344 (29.65%), Postives = 182/344 (52.91%), Query Frame = 1

Query: 147 RWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESY 206
           +W   L+VF  +++Q WY P  G+Y KLI ++GK  Q   A  LF EM   GC      Y
Sbjct: 112 KWLQCLEVFRWMQKQRWYIPDNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVY 171

Query: 207 TALLSAY----SRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLL 266
            AL++A+     ++  L++    L++MK    CQP+V TY+IL+++  Q    ++   L 
Sbjct: 172 NALITAHLHTRDKAKALEKVRGYLDKMKGIERCQPNVVTYNILLRAFAQSGKVDQVNALF 231

Query: 267 SDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGS 326
            D+    + P+  T+N  +DAYGK  M  EME++L  M S++ CKPD+ T N  +  +G 
Sbjct: 232 KDLDMSPVSPDVYTFNGVMDAYGKNGMIKEMEAVLTRMRSNE-CKPDIITFNVLIDSYGK 291

Query: 327 SGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVT 386
             + E ME+ ++    +  +P + TFN ++ +YGKA+  +K   V + M   +Y  + +T
Sbjct: 292 KQEFEKMEQTFKSLMRSKEKPTLPTFNSMIINYGKARMIDKAEWVFKKMNDMNYIPSFIT 351

Query: 387 YNIVIDAFGRAGNLKQMEHLF-RLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIV 446
           Y  +I  +G  G++ +   +F  +  S+R+  +  TL ++++ Y + G   + + + H  
Sbjct: 352 YECMIMMYGYCGSVSRAREIFEEVGESDRVLKAS-TLNAMLEVYCRNGLYIEADKLFHNA 411

Query: 447 ENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDK 486
               +  D   Y  L  AY + +   +++ ++  ME+ G  P+K
Sbjct: 412 SAFRVHPDASTYKFLYKAYTKADMKEQVQILMKKMEKDGIVPNK 453

BLAST of CmaCh17G012570 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 166.0 bits (419), Expect = 6.2e-41
Identity = 103/342 (30.12%), Postives = 163/342 (47.66%), Query Frame = 1

Query: 171 YIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMK 230
           Y  L+ + GK  +P++A ++  EM+  G   S  +Y +L+SAY+R G+LD A  L N+M 
Sbjct: 317 YNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMA 376

Query: 231 NSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMF 290
                +PDV TY+ L+    +      A ++  +M   G KPN  T+N FI  YG    F
Sbjct: 377 EK-GTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKF 436

Query: 291 AEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNI 350
            EM  I  E ++  G  PD+ T N+ L  FG +G    +   +++ + AG  P  +TFN 
Sbjct: 437 TEMMKIFDE-INVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNT 496

Query: 351 LLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSER 410
           L+ +Y +  S+E+   V   M     +  + TYN V+ A  R G  +Q E +   M   R
Sbjct: 497 LISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGR 556

Query: 411 IQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITLDTVFYNCLVDAYGRMECFAEMK 470
            +P+ +T CSL+ AY    +   + S+   V +  I    V    LV    + +   E +
Sbjct: 557 CKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAE 616

Query: 471 AVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVR 513
                +++RG  PD TT   M   Y    M   A  + D ++
Sbjct: 617 RAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLDYMK 656

BLAST of CmaCh17G012570 vs. NCBI nr
Match: gi|659116302|ref|XP_008458009.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucumis melo])

HSP 1 Score: 891.7 bits (2303), Expect = 6.1e-256
Identity = 449/501 (89.62%), Postives = 476/501 (95.01%), Query Frame = 1

Query: 27  LPPAVSHSNPKNRVFLCQQRASHHNFESPAPSSSSSS---TDGKLMQRSPREGRMDIAKL 86
           LPP ++H +P  R+FL +Q+ ++  F+SPAPSSSSSS   TDGKL++ SP EGRMD+AKL
Sbjct: 6   LPPPINHRDPNKRLFLRRQQPTYPKFQSPAPSSSSSSTTTTDGKLVKISPHEGRMDVAKL 65

Query: 87  KAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERI 146
           KAKEA ERKEEVNRKIASQKAISVILRREATKAVIERKRGP NSKKLLPRTVLEALH+RI
Sbjct: 66  KAKEAAERKEEVNRKIASQKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEALHDRI 125

Query: 147 TALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSH 206
           TALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAY+LFQEMIEEGCEVSH
Sbjct: 126 TALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYKLFQEMIEEGCEVSH 185

Query: 207 ESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLS 266
           ESYTALLSAYSRSGLLD+AFS+LNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLS
Sbjct: 186 ESYTALLSAYSRSGLLDKAFSILNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLS 245

Query: 267 DMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSS 326
           DMVT+GIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR FG S
Sbjct: 246 DMVTQGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRAFGHS 305

Query: 327 GQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTY 386
           GQ+ETMEKCYEKFQ AGIQPNIQTFNILLDSYGKA+SYEKMSAVMEYMQKYHYSWTIVTY
Sbjct: 306 GQIETMEKCYEKFQEAGIQPNIQTFNILLDSYGKAESYEKMSAVMEYMQKYHYSWTIVTY 365

Query: 387 NIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIVEN 446
           NIVIDAFGRAGNLKQMEHLFRLMRSERI+PSCVTLCSLVKAYGQAGK +KI SVL++VEN
Sbjct: 366 NIVIDAFGRAGNLKQMEHLFRLMRSERIKPSCVTLCSLVKAYGQAGKCEKINSVLNLVEN 425

Query: 447 SNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANH 506
           S+I LDTVFYNCLVDAYGRMECFAEMK VLGMMEQRGCKPDKTTYR MARAYSDGGMANH
Sbjct: 426 SDIMLDTVFYNCLVDAYGRMECFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDGGMANH 485

Query: 507 AREIHDLVRTAEASKRTRPDL 525
           A+EI +L+ TAEASKRTRPDL
Sbjct: 486 AKEIQELITTAEASKRTRPDL 506

BLAST of CmaCh17G012570 vs. NCBI nr
Match: gi|778726970|ref|XP_004139628.2| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Cucumis sativus])

HSP 1 Score: 884.8 bits (2285), Expect = 7.4e-254
Identity = 447/506 (88.34%), Postives = 475/506 (93.87%), Query Frame = 1

Query: 27  LPPAVSHSNP-KNRVFLCQQRASHHNFESPAPSSSSSST-------DGKLMQRSPREGRM 86
           LPP ++H NP  NR++L +Q+ +H  F+SPAPSSSSSS+       DGKLM+ SP EGRM
Sbjct: 6   LPPPINHPNPTNNRLYLRRQQRTHPKFQSPAPSSSSSSSSSTTTTADGKLMKISPHEGRM 65

Query: 87  DIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVLEA 146
           D+AKLKAKEA ERKEEVNRKIASQKAISVILRREATKAVIERKRGP NSKKLLPRTVLEA
Sbjct: 66  DVAKLKAKEASERKEEVNRKIASQKAISVILRREATKAVIERKRGPNNSKKLLPRTVLEA 125

Query: 147 LHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEG 206
           LH+RIT LRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQ EKAYELFQEMIEEG
Sbjct: 126 LHDRITTLRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQQEKAYELFQEMIEEG 185

Query: 207 CEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 266
           CEVSHESYTALLSAYSRSGLLD AFS+LNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA
Sbjct: 186 CEVSHESYTALLSAYSRSGLLDEAFSILNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKA 245

Query: 267 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLR 326
           QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILV+ML+DDGCKPDVWTMNSTLR
Sbjct: 246 QTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVDMLNDDGCKPDVWTMNSTLR 305

Query: 327 GFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSW 386
            FG SGQLETMEKCYEKFQ AGIQP+IQTFNILLDSYGKA+SYEKMSAVMEYMQKYHYSW
Sbjct: 306 AFGRSGQLETMEKCYEKFQEAGIQPSIQTFNILLDSYGKAESYEKMSAVMEYMQKYHYSW 365

Query: 387 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVL 446
           TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERI+PSCVTLCSLV+AYGQAGKR+KI+SVL
Sbjct: 366 TIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIKPSCVTLCSLVRAYGQAGKREKIDSVL 425

Query: 447 HIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDG 506
           ++VENS+I LDTVFYNCLVDAYGR+ECFAEMK VLGMMEQRGCKPDKTTYR MARAYSDG
Sbjct: 426 NLVENSDIMLDTVFYNCLVDAYGRLECFAEMKKVLGMMEQRGCKPDKTTYRTMARAYSDG 485

Query: 507 GMANHAREIHDLVRTAEASKRTRPDL 525
           GMANHA+EI +L+ TAE SKRTRPDL
Sbjct: 486 GMANHAKEIQELITTAEPSKRTRPDL 511

BLAST of CmaCh17G012570 vs. NCBI nr
Match: gi|1000945412|ref|XP_015581300.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Ricinus communis])

HSP 1 Score: 714.5 bits (1843), Expect = 1.3e-202
Identity = 355/449 (79.06%), Postives = 398/449 (88.64%), Query Frame = 1

Query: 77  RMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVL 136
           RMD   +K KE  E KEE++RKIAS+KAISVILRREATKA+IE+KRGPTNSKKLLPRTVL
Sbjct: 64  RMDWEIVKKKEEKEGKEEMDRKIASRKAISVILRREATKAIIEKKRGPTNSKKLLPRTVL 123

Query: 137 EALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIE 196
           EALHERITALRWESALKVFELLREQ+WYRPY+GMYIKLIVMLGKCKQPEKA+ELFQ MI 
Sbjct: 124 EALHERITALRWESALKVFELLREQIWYRPYSGMYIKLIVMLGKCKQPEKAHELFQAMIH 183

Query: 197 EGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFN 256
           EGC+VSHESYTALLSAY RSGLLD+AFSLL EMK +PDCQPDVHTYSILIKSC+QVFAF+
Sbjct: 184 EGCDVSHESYTALLSAYGRSGLLDKAFSLLEEMKRNPDCQPDVHTYSILIKSCVQVFAFD 243

Query: 257 KAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNST 316
           KA+TLLS+M + GI PNTITYNT IDAYGKAKMF EME+ LV+MLS   C+PDVWTMNST
Sbjct: 244 KAKTLLSNMESLGISPNTITYNTLIDAYGKAKMFEEMEATLVKMLSQQNCEPDVWTMNST 303

Query: 317 LRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHY 376
           LR FG SGQ+ETMEKCYEKFQGAGI+P+I TFN+LLDSYGKA  Y+KMSAVMEYMQKYHY
Sbjct: 304 LRAFGISGQIETMEKCYEKFQGAGIEPSIMTFNVLLDSYGKAGDYKKMSAVMEYMQKYHY 363

Query: 377 SWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIES 436
           SWTI+TYNIVIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYGQA K +KIE 
Sbjct: 364 SWTIITYNIVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAEKPEKIEG 423

Query: 437 VLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYS 496
           VL  +ENS+ITLDTVF+NCLVDAYGRM CFAEMK VL +MEQ+G +PDK TYR M +AYS
Sbjct: 424 VLRFIENSDITLDTVFFNCLVDAYGRMGCFAEMKGVLILMEQKGYRPDKITYRTMIKAYS 483

Query: 497 DGGMANHAREIHDLVRTAEASK--RTRPD 524
             GM  H +E+ DLV + E  +  R +PD
Sbjct: 484 SKGMTKHVKELQDLVASVEGPQLHRKKPD 512

BLAST of CmaCh17G012570 vs. NCBI nr
Match: gi|743833914|ref|XP_011024658.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic [Populus euphratica])

HSP 1 Score: 711.4 bits (1835), Expect = 1.1e-201
Identity = 354/481 (73.60%), Postives = 416/481 (86.49%), Query Frame = 1

Query: 47  ASHHNFESPAPSSSSSSTD--GKLMQRSPREGRMDIAKLKAKEAGERKEEVNRKIASQKA 106
           AS    +S  PSSS+++    G+  +       ++  KLK KE  ERKEEVNRKIASQKA
Sbjct: 30  ASRPETKSNFPSSSTNNNVAIGRSERGEMTRREIEWEKLKKKEEKERKEEVNRKIASQKA 89

Query: 107 ISVILRREATKAVIERKRGPTNSKKLLPRTVLEALHERITALRWESALKVFELLREQLWY 166
           ISVILRREATKAVIE+KRGPTNSKKLLP+TVLEALHERITALRW SAL+VFELLREQLWY
Sbjct: 90  ISVILRREATKAVIEKKRGPTNSKKLLPQTVLEALHERITALRWASALEVFELLREQLWY 149

Query: 167 RPYAGMYIKLIVMLGKCKQPEKAYELFQEMIEEGCEVSHESYTALLSAYSRSGLLDRAFS 226
           RPYAGMY+KLIVMLGKCKQP+KA++LFQ MI+EGC V+HESYTALLSAY RSGL D+AFS
Sbjct: 150 RPYAGMYVKLIVMLGKCKQPDKAHQLFQAMIDEGCAVTHESYTALLSAYGRSGLFDKAFS 209

Query: 227 LLNEMKNSPDCQPDVHTYSILIKSCLQVFAFNKAQTLLSDMVTRGIKPNTITYNTFIDAY 286
           ++ EMKN+PDC+PDVHTYSILIKSCLQVFAF+K Q LLSDM + GI+PNT+TYNT IDAY
Sbjct: 210 IMEEMKNTPDCRPDVHTYSILIKSCLQVFAFDKVQVLLSDMESLGIRPNTVTYNTLIDAY 269

Query: 287 GKAKMFAEMESILVEMLSDDGCKPDVWTMNSTLRGFGSSGQLETMEKCYEKFQGAGIQPN 346
           GKAKMFAEME+ L+EMLS   C+PDVWTMNST+R FG SGQ+E ME CYEKFQ AGI+PN
Sbjct: 270 GKAKMFAEMEATLMEMLSQQDCEPDVWTMNSTIRAFGGSGQMEMMENCYEKFQSAGIEPN 329

Query: 347 IQTFNILLDSYGKAKSYEKMSAVMEYMQKYHYSWTIVTYNIVIDAFGRAGNLKQMEHLFR 406
           I+TFNILLDSYGKA +Y+KMSAVMEYMQ+YHYSWTIVTYN+VIDAFGRAG+LKQME+LFR
Sbjct: 330 IKTFNILLDSYGKAGNYQKMSAVMEYMQRYHYSWTIVTYNVVIDAFGRAGDLKQMEYLFR 389

Query: 407 LMRSERIQPSCVTLCSLVKAYGQAGKRDKIESVLHIVENSNITLDTVFYNCLVDAYGRME 466
           LMRSERI+PSCVTLCSLV+AY +AGK +KI SVL  ++NS++TLDTVF+NCLVDAYGR+E
Sbjct: 390 LMRSERIKPSCVTLCSLVRAYREAGKPEKIRSVLRFIDNSDVTLDTVFFNCLVDAYGRLE 449

Query: 467 CFAEMKAVLGMMEQRGCKPDKTTYRIMARAYSDGGMANHAREIHDLVRTAEA--SKRTRP 524
           CFAEMK VL +ME++GCKPDK TYR M +AYS  GM +HA+E+ +L+ + E   S++ +P
Sbjct: 450 CFAEMKEVLELMEEKGCKPDKVTYRTMIKAYSIKGMTSHAKELRNLLGSVEVTRSQKKKP 509

BLAST of CmaCh17G012570 vs. NCBI nr
Match: gi|223530592|gb|EEF32469.1| (pentatricopeptide repeat-containing protein, putative [Ricinus communis])

HSP 1 Score: 709.1 bits (1829), Expect = 5.6e-201
Identity = 350/434 (80.65%), Postives = 390/434 (89.86%), Query Frame = 1

Query: 77  RMDIAKLKAKEAGERKEEVNRKIASQKAISVILRREATKAVIERKRGPTNSKKLLPRTVL 136
           RMD   +K KE  E KEE++RKIAS+KAISVILRREATKA+IE+KRGPTNSKKLLPRTVL
Sbjct: 64  RMDWEIVKKKEEKEGKEEMDRKIASRKAISVILRREATKAIIEKKRGPTNSKKLLPRTVL 123

Query: 137 EALHERITALRWESALKVFELLREQLWYRPYAGMYIKLIVMLGKCKQPEKAYELFQEMIE 196
           EALHERITALRWESALKVFELLREQ+WYRPY+GMYIKLIVMLGKCKQPEKA+ELFQ MI 
Sbjct: 124 EALHERITALRWESALKVFELLREQIWYRPYSGMYIKLIVMLGKCKQPEKAHELFQAMIH 183

Query: 197 EGCEVSHESYTALLSAYSRSGLLDRAFSLLNEMKNSPDCQPDVHTYSILIKSCLQVFAFN 256
           EGC+VSHESYTALLSAY RSGLLD+AFSLL EMK +PDCQPDVHTYSILIKSC+QVFAF+
Sbjct: 184 EGCDVSHESYTALLSAYGRSGLLDKAFSLLEEMKRNPDCQPDVHTYSILIKSCVQVFAFD 243

Query: 257 KAQTLLSDMVTRGIKPNTITYNTFIDAYGKAKMFAEMESILVEMLSDDGCKPDVWTMNST 316
           KA+TLLS+M + GI PNTITYNT IDAYGKAKMF EME+ LV+MLS   C+PDVWTMNST
Sbjct: 244 KAKTLLSNMESLGISPNTITYNTLIDAYGKAKMFEEMEATLVKMLSQQNCEPDVWTMNST 303

Query: 317 LRGFGSSGQLETMEKCYEKFQGAGIQPNIQTFNILLDSYGKAKSYEKMSAVMEYMQKYHY 376
           LR FG SGQ+ETMEKCYEKFQGAGI+P+I TFN+LLDSYGKA  Y+KMSAVMEYMQKYHY
Sbjct: 304 LRAFGISGQIETMEKCYEKFQGAGIEPSIMTFNVLLDSYGKAGDYKKMSAVMEYMQKYHY 363

Query: 377 SWTIVTYNIVIDAFGRAGNLKQMEHLFRLMRSERIQPSCVTLCSLVKAYGQAGKRDKIES 436
           SWTI+TYNIVIDAFGRAG+LKQME+LFRLMRSERI+PSCVTLCSLV+AYGQA K +KIE 
Sbjct: 364 SWTIITYNIVIDAFGRAGDLKQMEYLFRLMRSERIKPSCVTLCSLVRAYGQAEKPEKIEG 423

Query: 437 VLHIVENSNITLDTVFYNCLVDAYGRMECFAEMKAVLGMMEQRGCKPDKTTYRIMARAYS 496
           VL  +ENS+ITLDTVF+NCLVDAYGRM CFAEMK VL +MEQ+G +PDK TYR M +AYS
Sbjct: 424 VLRFIENSDITLDTVFFNCLVDAYGRMGCFAEMKGVLILMEQKGYRPDKITYRTMIKAYS 483

Query: 497 DGGMANHAREIHDL 511
             GM  H +E+ DL
Sbjct: 484 SKGMTKHVKELQDL 497

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP424_ARATH1.2e-18265.86Pentatricopeptide repeat-containing protein At5g48730, chloroplastic OS=Arabidop... [more]
PP279_ARATH3.5e-10244.76Pentatricopeptide repeat-containing protein At3g53170 OS=Arabidopsis thaliana GN... [more]
PP216_ARATH1.1e-9244.70Pentatricopeptide repeat-containing protein At3g06430, chloroplastic OS=Arabidop... [more]
PP358_ARATH9.6e-4429.65Pentatricopeptide repeat-containing protein At4g39620, chloroplastic OS=Arabidop... [more]
PP362_ARATH1.1e-3930.12Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0K8C6_CUCSA5.2e-25488.34Uncharacterized protein OS=Cucumis sativus GN=Csa_7G342750 PE=4 SV=1[more]
B9SV96_RICCO3.9e-20180.65Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
V4TYK6_9ROSI1.8e-19871.83Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019744mg PE=4 SV=1[more]
A0A067LME3_JATCU5.2e-19877.46Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17186 PE=4 SV=1[more]
A0A0B0NSH6_GOSAR7.6e-19773.38Uncharacterized protein OS=Gossypium arboreum GN=F383_20827 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G48730.16.6e-18465.86 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53170.16.7e-10443.76 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G06430.16.3e-9444.70 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G39620.15.4e-4529.65 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G02860.16.2e-4130.12 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659116302|ref|XP_008458009.1|6.1e-25689.62PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|778726970|ref|XP_004139628.2|7.4e-25488.34PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|1000945412|ref|XP_015581300.1|1.3e-20279.06PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|743833914|ref|XP_011024658.1|1.1e-20173.60PREDICTED: pentatricopeptide repeat-containing protein At5g48730, chloroplastic ... [more]
gi|223530592|gb|EEF32469.1|5.6e-20180.65pentatricopeptide repeat-containing protein, putative [Ricinus communis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh17G012570.1CmaCh17G012570.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 174..199
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 272..320
score: 3.2E-9coord: 205..249
score: 2.1E-13coord: 449..495
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 334..391
score: 4.9
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 452..484
score: 4.5E-7coord: 347..377
score: 3.5E-5coord: 275..310
score: 2.4E-9coord: 381..415
score: 2.7E-9coord: 171..202
score: 3.7E-6coord: 240..274
score: 0.0031coord: 205..239
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 379..413
score: 11.772coord: 273..308
score: 10.797coord: 238..272
score: 10.249coord: 449..483
score: 10.83coord: 344..378
score: 8.364coord: 167..201
score: 9.317coord: 484..518
score: 7.147coord: 309..343
score: 9.372coord: 202..232
score: 9.986coord: 414..448
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 147..235
score: 8.
NoneNo IPR availableunknownCoilCoilcoord: 78..98
scor
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 47..515
score: 1.4E
NoneNo IPR availablePANTHERPTHR24015:SF664SUBFAMILY NOT NAMEDcoord: 47..515
score: 1.4E

The following gene(s) are paralogous to this gene:

None