CmoCh07G001980 (gene) Cucurbita moschata (Rifu)

Name: CmoCh07G001980
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: (Pentatricopeptide repeat-containing protein) (3.4.24.-) (3.6.4.3)
Location: Cmo_Chr07 : 1011356 .. 1013260 (+)
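
As a quick consistency check on the coordinates in the Location field, the span length can be derived from the location string. The following is a minimal Python sketch, not part of the record itself: the function name `parse_location` is hypothetical, and the 1-based, inclusive coordinate convention is an assumption that the record does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cmo_Chr07 : 1011356 .. 1013260 (+)")
# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
length = end - start + 1
print(seqid, strand, length, length % 3)
```

Under that assumption the span is 1905 bp, a multiple of three, which would be consistent with a single uninterrupted reading frame (634 codons plus a stop codon).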
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTACTCTGTAAGCCTAAATTCATCTTCTGGACTTCGAAACAGAGATTGAATGTCCATGTCTTCTCCACCGTTTCCCATTTACCTCCTCCTCTTTCTTCTCTTCCCCCAATATCTGGGATTACCCAAATTAAGCAAGCCCATGCTCGTAGTGTCGTCTTCGGCCTTGCTAATGATGGCCGCATCATGGGTCACCTCCTCGCTTTTCTTGCCGTTTCTTCCTCTTCATTGCCGTATGAGTACGCCTTCTCAATTTATCAGTCTATTTCTCATCCAAGTGTTTTTGCCACCAATAACATGATACGGTGCTGCGCAAAAGAGGAGTTATCTTGCGAGTCGATATCTCTTTACTCACACATGCGCCGAAGTCTTGTGGCGCCTAATAAACATACTTTGACCTTTGTATTGCAAGCTTGCAGTAACGCTTTGGCTATCTGCGAAGGGATTCAAGTTCAAACCCATGTCATAAAATTTGGTTTTGCTAAAGACGTTTTCATTCGAAATGCGTTGATTCACTTGTATTGTACTCATTGCAGAGTTGAATGTGCGAAGCAGGTATTTGATGAAGTTCCTAGTAGTCGAGATATAGTTTCCTGGAATTCAATGATTGCTGGTTTTGTTAGAGCTGGGCAGATCAATGTTGCAGATAAACTGTTTGTTGAAATGCCTGAGAAAGATGTGATCTCATGGAGCGCGATGATATCTGGGTGTGTTCAAAATGGGCTATTGGAGAAGGCGTTAGACTGCTTTAATGAGATGAGGGAGCAAAAAATGAGGCCGAATGAGGCAATATTGGTGTCCATGCTCGCAGCAGCATCCCAATTGGGTATGCTTGAGTATGGAAAAATGATCCATTCCATTGCGGACTCCTTGAAATTCCCAATGACTGCTTCTCTTGGCACAGCACTAGTTGACATGTATGCTAAGTGCGGTTGTATTGATGAGTCCAAATTCTTGTTCGACCGAATGCCCCAGAAAGATAAATGGACTTGGAATGTTATGATTTGTGGTTTAGCATCGCATGGCCTTGGGCAAGAAGCGCTTGCATTATTTGAAAAGTTTCTAACACAGGGTTTCTACCCAGTCAACGTGACATTCATTGGAGTCTTGAATGCGTGTAGCAGAGCTGGTTTAGTCAGCGAGGGAAGACGTTTTTTTAAGCTAATGACGGACACATATAAGATTATACCAGAGATGGAACACTATGGTTGCATGGTTGATCTCTTCAGCCGTGCTGGGTTCGTTTATGATGCTGTTGAAATGATTAACAGGATGCCTGCTCCTCCGGACCCTGTGTTGTGGGCAACGGTGCTTGGTTCATGCAAGGTTCACGGATTTATAGAACTGGGTGAAGAGATTGGGAACAAGTTGATTCAAATGGATCCCACTCACAATGGGCATTATGTCCAGTTAGCGGGTATCTATGCCAGACTAAGAAAATGGGAAGATGTAAGCAAGATTAGGAGACTAATGGCTGACAGAAACTCCAACAAAATTGCAGGGTGGAGCTTGATTGAAGCAGGAGGAAGAGTTCACCGATTTGTGGCAGGAGATAAGGAGCATGAACAATGTACAGAGATCTACAAGATGTTGGAGACAATTGGAGTACGAATAGCAGCAGCGGGATACTCAGCAAACGTTTCATCAGTACTGCATGACATAGAGGAAGAAGAAAAAGAAACTGCCATTAAAGAGCATAGTGAAAGGTTGGCAATTGCTTTTGGGTTGCTGGTGACTCAAGTTGGTGACTGTATTCGTATTATCAAGAATTTAAGAGTTTGTGGCGATTGCCATGAGGTAAGTAAGATCATTTCTCGAGTATTTGAAAGAGAAATAATTGTTAGAGATGGCAGTAGATTTCACCATTTTAAGAATGGTAGTTGTTCTTGTCTAGATTATTGGTGA

mRNA sequence

ATGCTACTCTGTAAGCCTAAATTCATCTTCTGGACTTCGAAACAGAGATTGAATGTCCATGTCTTCTCCACCGTTTCCCATTTACCTCCTCCTCTTTCTTCTCTTCCCCCAATATCTGGGATTACCCAAATTAAGCAAGCCCATGCTCGTAGTGTCGTCTTCGGCCTTGCTAATGATGGCCGCATCATGGGTCACCTCCTCGCTTTTCTTGCCGTTTCTTCCTCTTCATTGCCGTATGAGTACGCCTTCTCAATTTATCAGTCTATTTCTCATCCAAGTGTTTTTGCCACCAATAACATGATACGGTGCTGCGCAAAAGAGGAGTTATCTTGCGAGTCGATATCTCTTTACTCACACATGCGCCGAAGTCTTGTGGCGCCTAATAAACATACTTTGACCTTTGTATTGCAAGCTTGCAGTAACGCTTTGGCTATCTGCGAAGGGATTCAAGTTCAAACCCATGTCATAAAATTTGGTTTTGCTAAAGACGTTTTCATTCGAAATGCGTTGATTCACTTGTATTGTACTCATTGCAGAGTTGAATGTGCGAAGCAGGTATTTGATGAAGTTCCTAGTAGTCGAGATATAGTTTCCTGGAATTCAATGATTGCTGGTTTTGTTAGAGCTGGGCAGATCAATGTTGCAGATAAACTGTTTGTTGAAATGCCTGAGAAAGATGTGATCTCATGGAGCGCGATGATATCTGGGTGTGTTCAAAATGGGCTATTGGAGAAGGCGTTAGACTGCTTTAATGAGATGAGGGAGCAAAAAATGAGGCCGAATGAGGCAATATTGGTGTCCATGCTCGCAGCAGCATCCCAATTGGGTATGCTTGAGTATGGAAAAATGATCCATTCCATTGCGGACTCCTTGAAATTCCCAATGACTGCTTCTCTTGGCACAGCACTAGTTGACATGTATGCTAAGTGCGGTTGTATTGATGAGTCCAAATTCTTGTTCGACCGAATGCCCCAGAAAGATAAATGGACTTGGAATGTTATGATTTGTGGTTTAGCATCGCATGGCCTTGGGCAAGAAGCGCTTGCATTATTTGAAAAGTTTCTAACACAGGGTTTCTACCCAGTCAACGTGACATTCATTGGAGTCTTGAATGCGTGTAGCAGAGCTGGTTTAGTCAGCGAGGGAAGACGTTTTTTTAAGCTAATGACGGACACATATAAGATTATACCAGAGATGGAACACTATGGTTGCATGGTTGATCTCTTCAGCCGTGCTGGGTTCGTTTATGATGCTGTTGAAATGATTAACAGGATGCCTGCTCCTCCGGACCCTGTGTTGTGGGCAACGGTGCTTGGTTCATGCAAGGTTCACGGATTTATAGAACTGGGTGAAGAGATTGGGAACAAGTTGATTCAAATGGATCCCACTCACAATGGGCATTATGTCCAGTTAGCGGGTATCTATGCCAGACTAAGAAAATGGGAAGATGTAAGCAAGATTAGGAGACTAATGGCTGACAGAAACTCCAACAAAATTGCAGGGTGGAGCTTGATTGAAGCAGGAGGAAGAGTTCACCGATTTGTGGCAGGAGATAAGGAGCATGAACAATGTACAGAGATCTACAAGATGTTGGAGACAATTGGAGTACGAATAGCAGCAGCGGGATACTCAGCAAACGTTTCATCAGTACTGCATGACATAGAGGAAGAAGAAAAAGAAACTGCCATTAAAGAGCATAGTGAAAGGTTGGCAATTGCTTTTGGGTTGCTGGTGACTCAAGTTGGTGACTGTATTCGTATTATCAAGAATTTAAGAGTTTGTGGCGATTGCCATGAGGTAAGTAAGATCATTTCTCGAGTATTTGAAAGAGAAATAATTGTTAGAGATGGCAGTAGATTTCACCATTTTAAGAATGGTAGTTGTTCTTGTCTAGATTATTGGTGA

Coding sequence (CDS)

ATGCTACTCTGTAAGCCTAAATTCATCTTCTGGACTTCGAAACAGAGATTGAATGTCCATGTCTTCTCCACCGTTTCCCATTTACCTCCTCCTCTTTCTTCTCTTCCCCCAATATCTGGGATTACCCAAATTAAGCAAGCCCATGCTCGTAGTGTCGTCTTCGGCCTTGCTAATGATGGCCGCATCATGGGTCACCTCCTCGCTTTTCTTGCCGTTTCTTCCTCTTCATTGCCGTATGAGTACGCCTTCTCAATTTATCAGTCTATTTCTCATCCAAGTGTTTTTGCCACCAATAACATGATACGGTGCTGCGCAAAAGAGGAGTTATCTTGCGAGTCGATATCTCTTTACTCACACATGCGCCGAAGTCTTGTGGCGCCTAATAAACATACTTTGACCTTTGTATTGCAAGCTTGCAGTAACGCTTTGGCTATCTGCGAAGGGATTCAAGTTCAAACCCATGTCATAAAATTTGGTTTTGCTAAAGACGTTTTCATTCGAAATGCGTTGATTCACTTGTATTGTACTCATTGCAGAGTTGAATGTGCGAAGCAGGTATTTGATGAAGTTCCTAGTAGTCGAGATATAGTTTCCTGGAATTCAATGATTGCTGGTTTTGTTAGAGCTGGGCAGATCAATGTTGCAGATAAACTGTTTGTTGAAATGCCTGAGAAAGATGTGATCTCATGGAGCGCGATGATATCTGGGTGTGTTCAAAATGGGCTATTGGAGAAGGCGTTAGACTGCTTTAATGAGATGAGGGAGCAAAAAATGAGGCCGAATGAGGCAATATTGGTGTCCATGCTCGCAGCAGCATCCCAATTGGGTATGCTTGAGTATGGAAAAATGATCCATTCCATTGCGGACTCCTTGAAATTCCCAATGACTGCTTCTCTTGGCACAGCACTAGTTGACATGTATGCTAAGTGCGGTTGTATTGATGAGTCCAAATTCTTGTTCGACCGAATGCCCCAGAAAGATAAATGGACTTGGAATGTTATGATTTGTGGTTTAGCATCGCATGGCCTTGGGCAAGAAGCGCTTGCATTATTTGAAAAGTTTCTAACACAGGGTTTCTACCCAGTCAACGTGACATTCATTGGAGTCTTGAATGCGTGTAGCAGAGCTGGTTTAGTCAGCGAGGGAAGACGTTTTTTTAAGCTAATGACGGACACATATAAGATTATACCAGAGATGGAACACTATGGTTGCATGGTTGATCTCTTCAGCCGTGCTGGGTTCGTTTATGATGCTGTTGAAATGATTAACAGGATGCCTGCTCCTCCGGACCCTGTGTTGTGGGCAACGGTGCTTGGTTCATGCAAGGTTCACGGATTTATAGAACTGGGTGAAGAGATTGGGAACAAGTTGATTCAAATGGATCCCACTCACAATGGGCATTATGTCCAGTTAGCGGGTATCTATGCCAGACTAAGAAAATGGGAAGATGTAAGCAAGATTAGGAGACTAATGGCTGACAGAAACTCCAACAAAATTGCAGGGTGGAGCTTGATTGAAGCAGGAGGAAGAGTTCACCGATTTGTGGCAGGAGATAAGGAGCATGAACAATGTACAGAGATCTACAAGATGTTGGAGACAATTGGAGTACGAATAGCAGCAGCGGGATACTCAGCAAACGTTTCATCAGTACTGCATGACATAGAGGAAGAAGAAAAAGAAACTGCCATTAAAGAGCATAGTGAAAGGTTGGCAATTGCTTTTGGGTTGCTGGTGACTCAAGTTGGTGACTGTATTCGTATTATCAAGAATTTAAGAGTTTGTGGCGATTGCCATGAGGTAAGTAAGATCATTTCTCGAGTATTTGAAAGAGAAATAATTGTTAGAGATGGCAGTAGATTTCACCATTTTAAGAATGGTAGTTGTTCTTGTCTAGATTATTGGTGA
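
To relate the coding sequence to the protein residues shown in the alignments, the CDS can be translated codon by codon. The sketch below is a minimal, standard-library-only illustration: the `translate` helper and the `CODON_TABLE` name are hypothetical, it assumes the standard nuclear genetic code, and it is only run here on the first 30 and last 9 bases of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3      # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")   # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCTACTCTGTAAGCCTAAATTCATCTTC"))  # first 10 codons of the CDS
print(translate("TATTGGTGA"))                       # last 3 codons of the CDS
```

The first call yields `MLLCKPKFIF` and the second `YW`, matching the start and end of the query protein used in the alignments of this record.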
BLAST of CmoCh07G001980 vs. Swiss-Prot
Match: PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 485.3 bits (1248), Expect = 9.9e-136
Identity = 247/614 (40.23%), Positives = 363/614 (59.12%), Query Frame = 1

Query: 22  FSTVSHLPPPLSSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYEY 81
           FS   +L   +S L   S   ++KQ HAR +  GL  D   +   L+F   S+SS    Y
Sbjct: 8   FSLEHNLYETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPY 67

Query: 82  AFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYSHMRRSLVAPNKHTLTFVLQACSN 141
           A  ++     P  F  N MIR  +  +    S+ LY  M  S    N +T   +L+ACSN
Sbjct: 68  AQIVFDGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSN 127

Query: 142 ALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNS 201
             A  E  Q+   + K G+  DV+  N+LI+ Y      + A  +FD +P   D VSWNS
Sbjct: 128 LSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDD-VSWNS 187

Query: 202 MIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPN 261
           +I G+V+AG++++A  LF +M EK+ ISW+ MISG VQ  + ++AL  F+EM+   + P+
Sbjct: 188 VIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPD 247

Query: 262 EAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFD 321
              L + L+A +QLG LE GK IHS  +  +  M + LG  L+DMYAKCG ++E+  +F 
Sbjct: 248 NVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFK 307

Query: 322 RMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSE 381
            + +K    W  +I G A HG G+EA++ F +    G  P  +TF  VL ACS  GLV E
Sbjct: 308 NIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEE 367

Query: 382 GRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSC 441
           G+  F  M   Y + P +EHYGC+VDL  RAG + +A   I  MP  P+ V+W  +L +C
Sbjct: 368 GKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKAC 427

Query: 442 KVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAG 501
           ++H  IELGEEIG  LI +DP H G YV  A I+A  +KW+  ++ RRLM ++   K+ G
Sbjct: 428 RIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPG 487

Query: 502 WSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHD-IEEEEKE 561
            S I   G  H F+AGD+ H +  +I      +  ++   GY   +  +L D ++++E+E
Sbjct: 488 CSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDERE 547

Query: 562 TAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRF 621
             + +HSE+LAI +GL+ T+ G  IRI+KNLRVC DCH+V+K+IS++++R+I++RD +RF
Sbjct: 548 AIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRF 607

Query: 622 HHFKNGSCSCLDYW 635
           HHF++G CSC DYW
Sbjct: 608 HHFRDGKCSCGDYW 620
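
The percentages on each HSP summary line follow directly from the residue counts and can be recomputed when parsing this kind of report. The following is a minimal sketch under stated assumptions: the function name `parse_hsp_stats` is hypothetical, and the pattern deliberately accepts both "Positives" and the "Postives" spelling that appears in some exports of this report.

```python
import re

# Matches e.g. "Identity = 247/614 (40.23%), Positives = 363/614 (59.12%)".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return (identity %, positives %, alignment length) from a summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, pos, length2 = map(int, m.groups())
    if length != length2:
        raise ValueError("inconsistent alignment lengths")
    return round(100 * ident / length, 2), round(100 * pos / length, 2), length

line = "Identity = 247/614 (40.23%), Postives = 363/614 (59.12%), Query Frame = 1"
print(parse_hsp_stats(line))
```

For the At5g66520 hit this reproduces the reported 40.23% identity and 59.12% positives over 614 aligned columns.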

BLAST of CmoCh07G001980 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 2.1e-133
Identity = 232/620 (37.42%), Positives = 375/620 (60.48%), Query Frame = 1

Query: 21  VFSTVSHLPPPLSSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYE 80
           V +T+    P L+ L   S  + +K  H   +   L +D  +   LLA L V  S+    
Sbjct: 5   VLNTLRFKHPKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLA-LCVDDSTFNKP 64

Query: 81  -----YAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYSHMRRSLVAPNKHTLTFV 140
                YA+ I+  I +P++F  N +IRC +      ++   Y+ M +S + P+  T  F+
Sbjct: 65  TNLLGYAYGIFSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFL 124

Query: 141 LQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRD 200
           ++A S    +  G Q  + +++FGF  DV++ N+L+H+Y     +  A ++F ++   RD
Sbjct: 125 IKASSEMECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQM-GFRD 184

Query: 201 IVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMRE 260
           +VSW SM+AG+ + G +  A ++F EMP +++ +WS MI+G  +N   EKA+D F  M+ 
Sbjct: 185 VVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKR 244

Query: 261 QKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDE 320
           + +  NE ++VS++++ + LG LE+G+  +         +   LGTALVDM+ +CG I++
Sbjct: 245 EGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEK 304

Query: 321 SKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSR 380
           +  +F+ +P+ D  +W+ +I GLA HG   +A+  F + ++ GF P +VTF  VL+ACS 
Sbjct: 305 AIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSH 364

Query: 381 AGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWA 440
            GLV +G   ++ M   + I P +EHYGC+VD+  RAG + +A   I +M   P+  +  
Sbjct: 365 GGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILG 424

Query: 441 TVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRN 500
            +LG+CK++   E+ E +GN LI++ P H+G+YV L+ IYA   +W+ +  +R +M ++ 
Sbjct: 425 ALLGACKIYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKL 484

Query: 501 SNKIAGWSLIEAGGRVHRFVAG-DKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDI 560
             K  GWSLIE  G++++F  G D++H +  +I +  E I  +I   GY  N      D+
Sbjct: 485 VKKPPGWSLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDV 544

Query: 561 EEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIV 620
           +EEEKE++I  HSE+LAIA+G++ T+ G  IRI+KNLRVC DCH V+K+IS V+ RE+IV
Sbjct: 545 DEEEKESSIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIV 604

Query: 621 RDGSRFHHFKNGSCSCLDYW 635
           RD +RFHHF+NG CSC DYW
Sbjct: 605 RDRNRFHHFRNGVCSCRDYW 622
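
In each alignment block the middle line encodes the per-column result: a letter marks an identical residue, `+` marks a conservative (positive-scoring) substitution, and a space marks a mismatch or gap. The sketch below tallies those columns for a single block; the helper name `count_columns` is hypothetical, and the three strings are copied from the final block of the At5g06540 alignment.

```python
def count_columns(query, midline, sbjct):
    """Count identities, positives, and columns in one alignment block."""
    # Pad the midline in case trailing spaces were stripped on export.
    midline = midline.ljust(len(query))
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows differ in length")
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    conservative = midline.count("+")
    # BLAST counts identical columns as positives too.
    return identities, identities + conservative, len(query)

query   = "RDGSRFHHFKNGSCSCLDYW"
midline = "RD +RFHHF+NG CSC DYW"
sbjct   = "RDRNRFHHFRNGVCSCRDYW"
print(count_columns(query, midline, sbjct))
```

Summing these per-block tallies over a whole HSP should give the numerators and denominator reported on its Identity/Positives line, assuming no columns were lost in extraction.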

BLAST of CmoCh07G001980 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 474.9 bits (1221), Expect = 1.3e-132
Identity = 241/601 (40.10%), Positives = 359/601 (59.73%), Query Frame = 1

Query: 38  ISGITQIKQAHARSVVFGLA-NDGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHP-SVF 97
           +S IT+++Q HA S+  G++ +D  +  HL+ +L    S  P  YA  ++  I  P +VF
Sbjct: 27  VSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVF 86

Query: 98  ATNNMIRCCAKEELSCESISLYSHMRRS-LVAPNKHTLTFVLQACSNALAICEGIQVQTH 157
             N +IR  A+   S  + SLY  MR S LV P+ HT  F+++A +    +  G  + + 
Sbjct: 87  IWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSV 146

Query: 158 VIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINV 217
           VI+ GF   ++++N+L+HLY                                   G +  
Sbjct: 147 VIRSGFGSLIYVQNSLLHLYAN--------------------------------CGDVAS 206

Query: 218 ADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQ 277
           A K+F +MPEKD+++W+++I+G  +NG  E+AL  + EM  + ++P+   +VS+L+A ++
Sbjct: 207 AYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAK 266

Query: 278 LGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVM 337
           +G L  GK +H     +           L+D+YA+CG ++E+K LFD M  K+  +W  +
Sbjct: 267 IGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSL 326

Query: 338 ICGLASHGLGQEALALFEKF-LTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTY 397
           I GLA +G G+EA+ LF+    T+G  P  +TF+G+L ACS  G+V EG  +F+ M + Y
Sbjct: 327 IVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEY 386

Query: 398 KIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEI 457
           KI P +EH+GCMVDL +RAG V  A E I  MP  P+ V+W T+LG+C VHG  +L E  
Sbjct: 387 KIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFA 446

Query: 458 GNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHR 517
             +++Q++P H+G YV L+ +YA  ++W DV KIR+ M      K+ G SL+E G RVH 
Sbjct: 447 RIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHE 506

Query: 518 FVAGDKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIA 577
           F+ GDK H Q   IY  L+ +  R+ + GY   +S+V  D+EEEEKE A+  HSE++AIA
Sbjct: 507 FLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIA 566

Query: 578 FGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDY 635
           F L+ T     I ++KNLRVC DCH   K++S+V+ REI+VRD SRFHHFKNGSCSC DY
Sbjct: 567 FMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDY 595

BLAST of CmoCh07G001980 vs. Swiss-Prot
Match: PP122_ARATH (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 2.4e-129
Identity = 240/637 (37.68%), Positives = 364/637 (57.14%), Query Frame = 1

Query: 32  LSSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVS-SSSLPYEYAFSIYQSIS 91
           LS L     +  + Q H   + +G+  D    G L+   A+S S +LPY  A  +     
Sbjct: 9   LSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPY--ARRLLLCFP 68

Query: 92  HPSVFATNNMIRCCAKEELSCESISLYSHM-RRSLVAPNKHTLTFVLQACSNALAICEGI 151
            P  F  N ++R  ++ +    S++++  M R+  V P+  +  FV++A  N  ++  G 
Sbjct: 69  EPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGF 128

Query: 152 QVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDE-------------------- 211
           Q+    +K G    +F+   LI +Y     VE A++VFDE                    
Sbjct: 129 QMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGN 188

Query: 212 -VPSSRDI---------VSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQ 271
            V  +R+I          SWN M+AG+++AG++  A ++F EMP +D +SWS MI G   
Sbjct: 189 DVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAH 248

Query: 272 NGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASL 331
           NG   ++   F E++   M PNE  L  +L+A SQ G  E+GK++H   +   +    S+
Sbjct: 249 NGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSV 308

Query: 332 GTALVDMYAKCGCIDESKFLFDRMPQKDKW-TWNVMICGLASHGLGQEALALFEKFLTQG 391
             AL+DMY++CG +  ++ +F+ M +K    +W  MI GLA HG G+EA+ LF +    G
Sbjct: 309 NNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYG 368

Query: 392 FYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDA 451
             P  ++FI +L+ACS AGL+ EG  +F  M   Y I PE+EHYGCMVDL+ R+G +  A
Sbjct: 369 VTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKA 428

Query: 452 VEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARL 511
            + I +MP PP  ++W T+LG+C  HG IEL E++  +L ++DP ++G  V L+  YA  
Sbjct: 429 YDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATA 488

Query: 512 RKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRI 571
            KW+DV+ IR+ M  +   K   WSL+E G  +++F AG+K+     E ++ L+ I +R+
Sbjct: 489 GKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEIILRL 548

Query: 572 A-AAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDC 631
              AGY+  V+S L+D+EEEEKE  + +HSE+LA+AF L     G  IRI+KNLR+C DC
Sbjct: 549 KDEAGYTPEVASALYDVEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKNLRICRDC 608

Query: 632 HEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           H V K+ S+V+  EI+VRD +RFH FK+GSCSC DYW
Sbjct: 609 HAVMKLTSKVYGVEILVRDRNRFHSFKDGSCSCRDYW 643

BLAST of CmoCh07G001980 vs. Swiss-Prot
Match: PP295_ARATH (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 463.4 bits (1191), Expect = 4.0e-129
Identity = 226/528 (42.80%), Positives = 340/528 (64.39%), Query Frame = 1

Query: 114 ISLYSHMRRSLVAPNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHL 173
           IS+Y  MR   V+P+ HT  F+L +  N L +  G +    ++ FG  KD F+R +L+++
Sbjct: 47  ISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNM 106

Query: 174 YCTHCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAM 233
           Y +   +  A++VFD+   S+D+ +WNS++  + +AG I+ A KLF EMPE++VISWS +
Sbjct: 107 YSSCGDLRSAQRVFDD-SGSKDLPAWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCL 166

Query: 234 ISGCVQNGLLEKALDCFNEMREQK-----MRPNEAILVSMLAAASQLGMLEYGKMIHSIA 293
           I+G V  G  ++ALD F EM+  K     +RPNE  + ++L+A  +LG LE GK +H+  
Sbjct: 167 INGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYI 226

Query: 294 DSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRM-PQKDKWTWNVMICGLASHGLGQEA 353
           D     +   LGTAL+DMYAKCG ++ +K +F+ +  +KD   ++ MIC LA +GL  E 
Sbjct: 227 DKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALGSKKDVKAYSAMICCLAMYGLTDEC 286

Query: 354 LALFEKFLTQ-GFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMV 413
             LF +  T     P +VTF+G+L AC   GL++EG+ +FK+M + + I P ++HYGCMV
Sbjct: 287 FQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMV 346

Query: 414 DLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNG 473
           DL+ R+G + +A   I  MP  PD ++W ++L   ++ G I+  E    +LI++DP ++G
Sbjct: 347 DLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSG 406

Query: 474 HYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTE 533
            YV L+ +YA+  +W +V  IR  M  +  NK+ G S +E  G VH FV GD+  ++   
Sbjct: 407 AYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGCSYVEVEGVVHEFVVGDESQQESER 466

Query: 534 IYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIR 593
           IY ML+ I  R+  AGY  +   VL D+ E++KE A+  HSE+LAIAF L+ T+ G  +R
Sbjct: 467 IYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIALSYHSEKLAIAFCLMKTRPGTPVR 526

Query: 594 IIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           IIKNLR+CGDCH V K+IS++F REI+VRD +RFHHF++GSCSC D+W
Sbjct: 527 IIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHHFRDGSCSCRDFW 573

BLAST of CmoCh07G001980 vs. TrEMBL
Match: A0A0A0KNI8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G519440 PE=4 SV=1)

HSP 1 Score: 1105.9 bits (2859), Expect = 0.0e+00
Identity = 545/638 (85.42%), Positives = 578/638 (90.60%), Query Frame = 1

Query: 1   MLLCKPKFIF-WTSKQRLNVHVFSTVSH--LPPPLSSLPPISGITQIKQAHARSVVFGLA 60
           MLLC+P F+F W SKQRLN H FST+ +  LPPP SSLPP  GITQIKQAHAR +V GLA
Sbjct: 1   MLLCRPNFLFFWISKQRLNFHFFSTLPNPRLPPPFSSLPPTPGITQIKQAHARILVLGLA 60

Query: 61  NDGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLY 120
           NDGRI  HLLAFLA+SSSSLP +YA SIY SISHP+VFATNNMIRC  K +L   SISLY
Sbjct: 61  NDGRITSHLLAFLAISSSSLPSDYALSIYNSISHPTVFATNNMIRCFVKGDLPRHSISLY 120

Query: 121 SHMRRSLVA-PNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCT 180
           SHM RS VA PNKHTLTFVLQACSNA AI EG QVQTHVIK GF KDVF+RNALIHLYCT
Sbjct: 121 SHMCRSFVAAPNKHTLTFVLQACSNAFAIREGAQVQTHVIKLGFVKDVFVRNALIHLYCT 180

Query: 181 HCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISG 240
            CRVE AKQVFDEVPSSRD+VSWNSMI GFVR GQI+VA KLFVEMPEKDVISW  +ISG
Sbjct: 181 CCRVESAKQVFDEVPSSRDVVSWNSMIVGFVRLGQISVAQKLFVEMPEKDVISWGTIISG 240

Query: 241 CVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMT 300
           CVQNG LEKALD F E+ EQK+RPNEAILVS+LAAA+QLG LEYGK IHSIA+SL+FPMT
Sbjct: 241 CVQNGELEKALDYFKELGEQKLRPNEAILVSLLAAAAQLGTLEYGKRIHSIANSLRFPMT 300

Query: 301 ASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLT 360
           ASLGTALVDMYAKCGCIDES+FLFDRMP+KDKW+WNVMICGLA+HGLGQEALALFEKFLT
Sbjct: 301 ASLGTALVDMYAKCGCIDESRFLFDRMPEKDKWSWNVMICGLATHGLGQEALALFEKFLT 360

Query: 361 QGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVY 420
           QGF+PVNVTFIGVL ACSRAGLVSEG+ FFKLMTDTY I PEMEHYGCMVDL SRAGFVY
Sbjct: 361 QGFHPVNVTFIGVLTACSRAGLVSEGKHFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY 420

Query: 421 DAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYA 480
           DAVEMINRMPAPPDPVLWA+VLGSC+VHGFIELGEEIGNKLIQMDPTHNGHYVQLA I+A
Sbjct: 421 DAVEMINRMPAPPDPVLWASVLGSCQVHGFIELGEEIGNKLIQMDPTHNGHYVQLARIFA 480

Query: 481 RLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGV 540
           RLRKWEDVSK+RRLMA+RNSNKIAGWSLIEA GRVHRFVAGDKEHE+ TEIYKMLE +GV
Sbjct: 481 RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKMLEIMGV 540

Query: 541 RIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGD 600
           RIAAAGYSANVSSVLHDIEEEEKE AIKEHSERLAIAFGLLVT+ GDCIRIIKNLRVCGD
Sbjct: 541 RIAAAGYSANVSSVLHDIEEEEKENAIKEHSERLAIAFGLLVTKDGDCIRIIKNLRVCGD 600

Query: 601 CHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           CHEVSKIIS VFEREIIVRDGSRFHHFK G CSC DYW
Sbjct: 601 CHEVSKIISLVFEREIIVRDGSRFHHFKKGICSCQDYW 638

BLAST of CmoCh07G001980 vs. TrEMBL
Match: B9HDP6_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s24730g PE=4 SV=2)

HSP 1 Score: 832.0 bits (2148), Expect = 4.8e-238
Identity = 388/594 (65.32%), Positives = 483/594 (81.31%), Query Frame = 1

Query: 41  ITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFATNNM 100
           ++Q KQAHAR +V GLA    +MGH+L+FLA   SS P++Y+ SIY++I +P+VFA+NNM
Sbjct: 10  LSQTKQAHARIIVSGLAGKASLMGHILSFLATFPSS-PFDYSLSIYRTIKNPNVFASNNM 69

Query: 101 IRCCAKEELSCESISLYSHMRRSLVAPNKHTLTFVLQACSNALAICEGIQVQTHVIKFGF 160
           IRC AK +L  +S+ LYS + R+ V PN ++ TF+LQACS  L + EG+QV  HV+K GF
Sbjct: 70  IRCFAKSDLPLQSLVLYSSVLRNCVRPNNYSFTFLLQACSKGLGLVEGVQVHGHVLKLGF 129

Query: 161 AKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFV 220
            +DV++RNALIHLY + CR E +KQVFDE P   D+V+WN+M+AGF R GQ++V  KLF 
Sbjct: 130 GEDVYVRNALIHLYSSCCRTESSKQVFDESPHHCDVVTWNAMLAGFARDGQVSVVQKLFD 189

Query: 221 EMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEY 280
           EMPE+DVISW+ M+   V NG L +AL+CF  MRE  + P+EA LV+ML+A++QL +LE+
Sbjct: 190 EMPERDVISWNTMLMAYVHNGKLGEALECFKRMRESGLVPDEATLVTMLSASAQLCLLEH 249

Query: 281 GKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLAS 340
           G+ IHSI DSL  PMT S+GTAL+DMYAKCGCI++S+ LF+ MP++D  TWNVMICGLAS
Sbjct: 250 GQSIHSIIDSLSLPMTISIGTALLDMYAKCGCIEQSRLLFENMPRRDVSTWNVMICGLAS 309

Query: 341 HGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEME 400
           HGLG++AL LFE+FL +G +P+NVTF+GVLNACSRAGLV EGR +F++MTD+Y I PEME
Sbjct: 310 HGLGKDALTLFERFLNEGLHPMNVTFVGVLNACSRAGLVKEGRHYFQMMTDSYGIEPEME 369

Query: 401 HYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQM 460
           HYGCMVDL  RAG V++A+++I  M   PDPVLWA VL +C++HG  ELGE+IGN+LI++
Sbjct: 370 HYGCMVDLLGRAGLVFEAIKVIESMAISPDPVLWAMVLCACRIHGLAELGEKIGNRLIEL 429

Query: 461 DPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKE 520
           DPT++GHYVQLA IYA  RKWEDV ++RRLMA+RN++K+AGWSLIEA G+VHRFVAG +E
Sbjct: 430 DPTYDGHYVQLASIYANSRKWEDVVRVRRLMAERNTSKVAGWSLIEARGKVHRFVAGHRE 489

Query: 521 HEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQ 580
           HEQ  EI KMLE I  R+AAAGY  NVS VLHDI EEEKE AIK HSERLAIAFGLLVT 
Sbjct: 490 HEQSLEIQKMLEIIETRLAAAGYVPNVSPVLHDIGEEEKENAIKVHSERLAIAFGLLVTG 549

Query: 581 VGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
            G CIRI+KNLRVC DCHEV+K+ISRVFEREIIVRDGSRFHHFK G CSCLDYW
Sbjct: 550 PGSCIRIVKNLRVCWDCHEVTKMISRVFEREIIVRDGSRFHHFKEGKCSCLDYW 602

BLAST of CmoCh07G001980 vs. TrEMBL
Match: M5XDQ3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004024mg PE=4 SV=1)

HSP 1 Score: 817.0 bits (2109), Expect = 1.6e-233
Identity = 377/535 (70.47%), Positives = 455/535 (85.05%), Query Frame = 1

Query: 100 MIRCCAKEELSCESISLYSHMRRSLVAPNKHTLTFVLQACSNALAICEGIQVQTHVIKFG 159
           MIRC AK +   +S+ L+S M R+ V PN HT TF+LQACS ALA+ EG QV T  +K G
Sbjct: 1   MIRCFAKSDSPPQSLLLFSSMLRTCVKPNNHTFTFLLQACSRALALNEGAQVHTVAVKLG 60

Query: 160 FAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLF 219
           F   VF+RNALIHLYC+  R+EC+K++F+E  SSRD+V+WNSM+  FVR  QI  A+KLF
Sbjct: 61  FGGYVFVRNALIHLYCSCSRIECSKRLFEENASSRDVVTWNSMLTAFVRDEQIGAAEKLF 120

Query: 220 VEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLE 279
            EMPE+DVISWS MISG VQNG L + L+CF +MRE+ MR NEA LVS+L+A++QLG+LE
Sbjct: 121 EEMPERDVISWSTMISGYVQNGRLGEGLECFKQMREKGMRLNEATLVSVLSASAQLGLLE 180

Query: 280 YGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLA 339
           +G+++HS+ +SL FP+T SLGTAL+DMYAKCGCI++SK LF  MP+KD WTWNVMICGLA
Sbjct: 181 HGRLVHSLVESLNFPLTVSLGTALIDMYAKCGCIEQSKLLFKNMPKKDIWTWNVMICGLA 240

Query: 340 SHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEM 399
           SHG+G+EALALF++F+ +GF+PVNVTFIGVL ACSRAGLVSEGRR FKLMT+ Y I+PEM
Sbjct: 241 SHGIGKEALALFQRFIDEGFHPVNVTFIGVLGACSRAGLVSEGRRHFKLMTEKYSILPEM 300

Query: 400 EHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQ 459
           EHYGCMVD+  RAGF+ +AV++I +M  PPDPVLWAT+LG+CK+HG IELGE+IG KL++
Sbjct: 301 EHYGCMVDMLGRAGFLDEAVQLIEKMTVPPDPVLWATLLGACKIHGSIELGEKIGKKLLK 360

Query: 460 MDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDK 519
           +DPTH+GHYVQLA IYA+ RKWEDV ++RRL+ ++N+NK AGWSLIEA G VH+FVAGD+
Sbjct: 361 LDPTHDGHYVQLASIYAKARKWEDVIRVRRLLVEQNTNKAAGWSLIEAQGTVHKFVAGDR 420

Query: 520 EHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVT 579
           EHE+  EIYKMLE IG+RIA +GYS NVSSVLHDI EEEKE AIKEHSERLA+AFGLLVT
Sbjct: 421 EHERSLEIYKMLEKIGIRIAESGYSPNVSSVLHDIGEEEKENAIKEHSERLAMAFGLLVT 480

Query: 580 QVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
             GDCIRI+KNLRVC DCHEVSKIISRVFEREIIVRDGSRFHHFK+G CSCLDYW
Sbjct: 481 GAGDCIRIVKNLRVCEDCHEVSKIISRVFEREIIVRDGSRFHHFKDGKCSCLDYW 535

BLAST of CmoCh07G001980 vs. TrEMBL
Match: A0A067L218_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_00279 PE=4 SV=1)

HSP 1 Score: 807.7 bits (2085), Expect = 9.7e-231
Identity = 372/572 (65.03%), Positives = 460/572 (80.42%), Query Frame = 1

Query: 63  MGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYSHMRR 122
           MGHLL FLA++ S+ P++Y+ SIYQ++S P+VFA+NNMIRC  K E    ++  YS M R
Sbjct: 1   MGHLLTFLALTPST-PFDYSLSIYQTLSSPTVFASNNMIRCFTKTESPLNAVVFYSSMLR 60

Query: 123 SLVAPNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCRVEC 182
           S V PN ++ TF+LQAC+    + EG QV  HV+KFGF +DV+IRNALIHLY   C +E 
Sbjct: 61  SCVIPNNYSFTFLLQACAKGFGLNEGAQVHDHVVKFGFCEDVYIRNALIHLYSACCHIES 120

Query: 183 AKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQNGL 242
           +K+VFDE P+  D+V+WN ++AGF R GQ+ V +KLF EMPE+DV+SW+ MI   + NG 
Sbjct: 121 SKRVFDESPNKCDVVTWNVILAGFARDGQVGVVEKLFDEMPERDVVSWNTMIMAYLHNGE 180

Query: 243 LEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASLGTA 302
           LE+ LDCF  MRE    PNEA LV ML+A++QLG++E+G+++HSI D L  PMT +LGTA
Sbjct: 181 LEECLDCFRRMRESGFIPNEATLVMMLSASAQLGLVEHGRLVHSIIDYLDIPMTVALGTA 240

Query: 303 LVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGFYPV 362
           L+DMYAKCGCI++ +  FD+MPQ+D  TWNVMICG+ASHGLG+EALALFE+FL +G  PV
Sbjct: 241 LLDMYAKCGCIEKCRLFFDKMPQRDISTWNVMICGMASHGLGKEALALFERFLNEGLRPV 300

Query: 363 NVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAVEMI 422
           N+TFIGVLNACSRAGLV EG+ +FK+MT+ Y I PEMEHYGCMVDL  RAG V +A+EMI
Sbjct: 301 NITFIGVLNACSRAGLVREGKHYFKMMTENYGIEPEMEHYGCMVDLLGRAGLVSEAIEMI 360

Query: 423 NRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLRKWE 482
            +    PDPVLWAT+L +C++HG +ELGE IGNKLI++DPT++GHYVQL+ IYA+ RKW+
Sbjct: 361 EKNVVSPDPVLWATLLCACRIHGLVELGENIGNKLIELDPTYDGHYVQLSSIYAKSRKWD 420

Query: 483 DVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRIAAAG 542
            V ++RRLMA+RN++K+ GWSLIEA GRVHRF+AGDKEHE+  EIYKML  I  R+A AG
Sbjct: 421 QVVRVRRLMAERNTSKVPGWSLIEAQGRVHRFIAGDKEHERSIEIYKMLNIIETRVAEAG 480

Query: 543 YSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHEVSK 602
           Y  N+SSVLHDI EEEKE AIK HSERLAIAFG LVT  GDCIRI+KNLRVC DCHEV+K
Sbjct: 481 YVRNLSSVLHDIGEEEKENAIKVHSERLAIAFGFLVTGAGDCIRIVKNLRVCRDCHEVTK 540

Query: 603 IISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           +ISRVFEREIIVRDGSRFHHFK G CSCLDYW
Sbjct: 541 MISRVFEREIIVRDGSRFHHFKEGKCSCLDYW 571

BLAST of CmoCh07G001980 vs. TrEMBL
Match: A0A103XBF5_CYNCS (Pentatricopeptide repeat-containing protein OS=Cynara cardunculus var. scolymus GN=Ccrd_025086 PE=4 SV=1)

HSP 1 Score: 755.4 bits (1949), Expect = 5.7e-215
Identity = 355/599 (59.27%), Positives = 451/599 (75.29%), Query Frame = 1

Query: 37  PISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFA 96
           P   + QIKQAHAR +  G   D  + G LLA LA  S S+P++Y+ SI  S  +PS+FA
Sbjct: 21  PNPSLHQIKQAHARLIAAGNGGDSHLTGQLLAALA-QSRSIPFQYSLSILHSTQNPSLFA 80

Query: 97  TNNMIRCCAKEELSCESISLYSHM-RRSLVAPNKHTLTFVLQACSNALAICEGIQVQTHV 156
            NN+IRC AK +   E++SLYS M + +   PN +T  F+LQAC N   I EG QVQ HV
Sbjct: 81  INNLIRCFAKSKSPHEAMSLYSFMFKNTYFRPNNYTFPFLLQACGNFKGIVEGTQVQAHV 140

Query: 157 IKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVA 216
           +K GF  +V+ RNALIHLY   C  +CAK+VF+E P  RD+V+WN M+AG+ R GQI+  
Sbjct: 141 VKLGFYNNVYSRNALIHLYFASCESKCAKEVFNESPGCRDLVTWNVMMAGYARMGQIDDL 200

Query: 217 DKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQL 276
           +K+F EMPEKD+ISWS++I+G VQNG LE+  DCF  MR+  + PNEAILV +L+A +QL
Sbjct: 201 EKMFDEMPEKDIISWSSLITGYVQNGYLEQGFDCFKRMRDLGLLPNEAILVMVLSACAQL 260

Query: 277 GMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMI 336
           G++E G +IHSI DS   P T  +  ALVDMYAKCG ID+++ LFD+MPQKD  +WNVMI
Sbjct: 261 GLIEKGILIHSIIDSFDCPKTVHIWNALVDMYAKCGNIDKARQLFDKMPQKDISSWNVMI 320

Query: 337 CGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKI 396
           CG A++GL  EA+  FEKFL +G  P NVTFIGVLNACSRAGLV +GR +FKLM+  Y I
Sbjct: 321 CGFATYGLAMEAIEHFEKFLIEGRTPENVTFIGVLNACSRAGLVDQGRHYFKLMSQKYNI 380

Query: 397 IPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGN 456
            PEMEHYGCMVDL  RAG + DA+E++ +MP PPDPVLW T++ +C+ HG +E GE  G 
Sbjct: 381 DPEMEHYGCMVDLLGRAGLIADAIELVEKMPIPPDPVLWVTIVAACRTHGLVEFGEGTGK 440

Query: 457 KLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFV 516
           KLIQ+DP H+G+YVQL+ I+A+  KWE+V   R L    NS K+ GWSLIEA G++H+FV
Sbjct: 441 KLIQLDPNHHGNYVQLSSIFAKSCKWEEVLTTRGL----NSRKVPGWSLIEAQGKIHQFV 500

Query: 517 AGDKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFG 576
           AGD+EHE+ +EIYKM++ +  +I  AGY  N+SSVLHD+EEEEK  +IKEHSERLAIAFG
Sbjct: 501 AGDREHERTSEIYKMVDRMNTKIVEAGYLPNISSVLHDLEEEEKINSIKEHSERLAIAFG 560

Query: 577 LLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           LL+T+ G CIRI+KNLRVCGDCHE++KI S+VF+REIIVRDGSRFHHF+ G+CSC DYW
Sbjct: 561 LLITEAGTCIRIVKNLRVCGDCHEMTKITSKVFQREIIVRDGSRFHHFQGGNCSCQDYW 614

BLAST of CmoCh07G001980 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 485.3 bits (1248), Expect = 5.6e-137
Identity = 247/614 (40.23%), Positives = 363/614 (59.12%), Query Frame = 1

Query: 22  FSTVSHLPPPLSSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYEY 81
           FS   +L   +S L   S   ++KQ HAR +  GL  D   +   L+F   S+SS    Y
Sbjct: 8   FSLEHNLYETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPY 67

Query: 82  AFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYSHMRRSLVAPNKHTLTFVLQACSN 141
           A  ++     P  F  N MIR  +  +    S+ LY  M  S    N +T   +L+ACSN
Sbjct: 68  AQIVFDGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSN 127

Query: 142 ALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNS 201
             A  E  Q+   + K G+  DV+  N+LI+ Y      + A  +FD +P   D VSWNS
Sbjct: 128 LSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDD-VSWNS 187

Query: 202 MIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPN 261
           +I G+V+AG++++A  LF +M EK+ ISW+ MISG VQ  + ++AL  F+EM+   + P+
Sbjct: 188 VIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPD 247

Query: 262 EAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFD 321
              L + L+A +QLG LE GK IHS  +  +  M + LG  L+DMYAKCG ++E+  +F 
Sbjct: 248 NVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFK 307

Query: 322 RMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSE 381
            + +K    W  +I G A HG G+EA++ F +    G  P  +TF  VL ACS  GLV E
Sbjct: 308 NIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEE 367

Query: 382 GRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSC 441
           G+  F  M   Y + P +EHYGC+VDL  RAG + +A   I  MP  P+ V+W  +L +C
Sbjct: 368 GKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKAC 427

Query: 442 KVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAG 501
           ++H  IELGEEIG  LI +DP H G YV  A I+A  +KW+  ++ RRLM ++   K+ G
Sbjct: 428 RIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPG 487

Query: 502 WSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHD-IEEEEKE 561
            S I   G  H F+AGD+ H +  +I      +  ++   GY   +  +L D ++++E+E
Sbjct: 488 CSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDERE 547

Query: 562 TAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRF 621
             + +HSE+LAI +GL+ T+ G  IRI+KNLRVC DCH+V+K+IS++++R+I++RD +RF
Sbjct: 548 AIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRF 607

Query: 622 HHFKNGSCSCLDYW 635
           HHF++G CSC DYW
Sbjct: 608 HHFRDGKCSCGDYW 620
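
The Identity and Positives figures in each HSP header can be recomputed from the alignment rows. The following is a minimal Python sketch, assuming the usual BLASTP midline convention (a letter marks an identical residue pair, `+` marks a conservative substitution, a space marks neither). The helper names `midline_stats` and `percent` are hypothetical and do not belong to any BLAST toolkit. The example block is the final 14-column segment of the AT5G66520.1 alignment above.

```python
def midline_stats(query: str, midline: str, subject: str) -> dict:
    """Count identities and positives in one aligned block (hypothetical helper)."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("aligned rows must have equal length")
    # A letter on the midline means query and subject share that residue.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return {"length": len(query), "identities": identities, "positives": positives}


def percent(part: int, whole: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * part / whole, 2)


# Last block of the AT5G66520.1 alignment (query residues 622-635).
block = midline_stats("HHFKNGSCSCLDYW", "HHF++G CSC DYW", "HHFRDGKCSCGDYW")
print(block)

# Counts taken from the HSP header: 247 identities and 363 positives over 614 columns.
print(percent(247, 614), percent(363, 614))
```

Summing the per-block counts over every block of an HSP should give the numerators reported in its header; the denominator is the alignment length including gap columns.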

BLAST of CmoCh07G001980 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 477.6 bits (1228), Expect = 1.2e-134
Identity = 232/620 (37.42%), Positives = 375/620 (60.48%), Query Frame = 1

Query: 21  VFSTVSHLPPPLSSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYE 80
           V +T+    P L+ L   S  + +K  H   +   L +D  +   LLA L V  S+    
Sbjct: 5   VLNTLRFKHPKLALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLA-LCVDDSTFNKP 64

Query: 81  -----YAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYSHMRRSLVAPNKHTLTFV 140
                YA+ I+  I +P++F  N +IRC +      ++   Y+ M +S + P+  T  F+
Sbjct: 65  TNLLGYAYGIFSQIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFL 124

Query: 141 LQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRD 200
           ++A S    +  G Q  + +++FGF  DV++ N+L+H+Y     +  A ++F ++   RD
Sbjct: 125 IKASSEMECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQM-GFRD 184

Query: 201 IVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMRE 260
           +VSW SM+AG+ + G +  A ++F EMP +++ +WS MI+G  +N   EKA+D F  M+ 
Sbjct: 185 VVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKR 244

Query: 261 QKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDE 320
           + +  NE ++VS++++ + LG LE+G+  +         +   LGTALVDM+ +CG I++
Sbjct: 245 EGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEK 304

Query: 321 SKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSR 380
           +  +F+ +P+ D  +W+ +I GLA HG   +A+  F + ++ GF P +VTF  VL+ACS 
Sbjct: 305 AIHVFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSH 364

Query: 381 AGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWA 440
            GLV +G   ++ M   + I P +EHYGC+VD+  RAG + +A   I +M   P+  +  
Sbjct: 365 GGLVEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILG 424

Query: 441 TVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRN 500
            +LG+CK++   E+ E +GN LI++ P H+G+YV L+ IYA   +W+ +  +R +M ++ 
Sbjct: 425 ALLGACKIYKNTEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKL 484

Query: 501 SNKIAGWSLIEAGGRVHRFVAG-DKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDI 560
             K  GWSLIE  G++++F  G D++H +  +I +  E I  +I   GY  N      D+
Sbjct: 485 VKKPPGWSLIEIDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDV 544

Query: 561 EEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIV 620
           +EEEKE++I  HSE+LAIA+G++ T+ G  IRI+KNLRVC DCH V+K+IS V+ RE+IV
Sbjct: 545 DEEEKESSIHMHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIV 604

Query: 621 RDGSRFHHFKNGSCSCLDYW 635
           RD +RFHHF+NG CSC DYW
Sbjct: 605 RDRNRFHHFRNGVCSCRDYW 622

BLAST of CmoCh07G001980 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 474.9 bits (1221), Expect = 7.5e-134
Identity = 241/601 (40.10%), Positives = 359/601 (59.73%), Query Frame = 1

Query: 38  ISGITQIKQAHARSVVFGLA-NDGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHP-SVF 97
           +S IT+++Q HA S+  G++ +D  +  HL+ +L    S  P  YA  ++  I  P +VF
Sbjct: 27  VSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVF 86

Query: 98  ATNNMIRCCAKEELSCESISLYSHMRRS-LVAPNKHTLTFVLQACSNALAICEGIQVQTH 157
             N +IR  A+   S  + SLY  MR S LV P+ HT  F+++A +    +  G  + + 
Sbjct: 87  IWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSV 146

Query: 158 VIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINV 217
           VI+ GF   ++++N+L+HLY                                   G +  
Sbjct: 147 VIRSGFGSLIYVQNSLLHLYAN--------------------------------CGDVAS 206

Query: 218 ADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQ 277
           A K+F +MPEKD+++W+++I+G  +NG  E+AL  + EM  + ++P+   +VS+L+A ++
Sbjct: 207 AYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAK 266

Query: 278 LGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVM 337
           +G L  GK +H     +           L+D+YA+CG ++E+K LFD M  K+  +W  +
Sbjct: 267 IGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSL 326

Query: 338 ICGLASHGLGQEALALFEKF-LTQGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTY 397
           I GLA +G G+EA+ LF+    T+G  P  +TF+G+L ACS  G+V EG  +F+ M + Y
Sbjct: 327 IVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEY 386

Query: 398 KIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEI 457
           KI P +EH+GCMVDL +RAG V  A E I  MP  P+ V+W T+LG+C VHG  +L E  
Sbjct: 387 KIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFA 446

Query: 458 GNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHR 517
             +++Q++P H+G YV L+ +YA  ++W DV KIR+ M      K+ G SL+E G RVH 
Sbjct: 447 RIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHE 506

Query: 518 FVAGDKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIA 577
           F+ GDK H Q   IY  L+ +  R+ + GY   +S+V  D+EEEEKE A+  HSE++AIA
Sbjct: 507 FLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIA 566

Query: 578 FGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDY 635
           F L+ T     I ++KNLRVC DCH   K++S+V+ REI+VRD SRFHHFKNGSCSC DY
Sbjct: 567 FMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDY 595

BLAST of CmoCh07G001980 vs. TAIR10
Match: AT1G74630.1 (AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 464.2 bits (1193), Expect = 1.3e-130
Identity = 240/637 (37.68%), Positives = 364/637 (57.14%), Query Frame = 1

Query: 32  LSSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVS-SSSLPYEYAFSIYQSIS 91
           LS L     +  + Q H   + +G+  D    G L+   A+S S +LPY  A  +     
Sbjct: 9   LSLLNSCKNLRALTQIHGLFIKYGVDTDSYFTGKLILHCAISISDALPY--ARRLLLCFP 68

Query: 92  HPSVFATNNMIRCCAKEELSCESISLYSHM-RRSLVAPNKHTLTFVLQACSNALAICEGI 151
            P  F  N ++R  ++ +    S++++  M R+  V P+  +  FV++A  N  ++  G 
Sbjct: 69  EPDAFMFNTLVRGYSESDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGF 128

Query: 152 QVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDE-------------------- 211
           Q+    +K G    +F+   LI +Y     VE A++VFDE                    
Sbjct: 129 QMHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGN 188

Query: 212 -VPSSRDI---------VSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQ 271
            V  +R+I          SWN M+AG+++AG++  A ++F EMP +D +SWS MI G   
Sbjct: 189 DVAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAH 248

Query: 272 NGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASL 331
           NG   ++   F E++   M PNE  L  +L+A SQ G  E+GK++H   +   +    S+
Sbjct: 249 NGSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSV 308

Query: 332 GTALVDMYAKCGCIDESKFLFDRMPQKDKW-TWNVMICGLASHGLGQEALALFEKFLTQG 391
             AL+DMY++CG +  ++ +F+ M +K    +W  MI GLA HG G+EA+ LF +    G
Sbjct: 309 NNALIDMYSRCGNVPMARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYG 368

Query: 392 FYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDA 451
             P  ++FI +L+ACS AGL+ EG  +F  M   Y I PE+EHYGCMVDL+ R+G +  A
Sbjct: 369 VTPDGISFISLLHACSHAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKA 428

Query: 452 VEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARL 511
            + I +MP PP  ++W T+LG+C  HG IEL E++  +L ++DP ++G  V L+  YA  
Sbjct: 429 YDFICQMPIPPTAIVWRTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATA 488

Query: 512 RKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRI 571
            KW+DV+ IR+ M  +   K   WSL+E G  +++F AG+K+     E ++ L+ I +R+
Sbjct: 489 GKWKDVASIRKSMIVQRIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEIILRL 548

Query: 572 A-AAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDC 631
              AGY+  V+S L+D+EEEEKE  + +HSE+LA+AF L     G  IRI+KNLR+C DC
Sbjct: 549 KDEAGYTPEVASALYDVEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKNLRICRDC 608

Query: 632 HEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           H V K+ S+V+  EI+VRD +RFH FK+GSCSC DYW
Sbjct: 609 HAVMKLTSKVYGVEILVRDRNRFHSFKDGSCSCRDYW 643

BLAST of CmoCh07G001980 vs. TAIR10
Match: AT3G62890.1 (AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 463.4 bits (1191), Expect = 2.3e-130
Identity = 226/528 (42.80%), Positives = 340/528 (64.39%), Query Frame = 1

Query: 114 ISLYSHMRRSLVAPNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHL 173
           IS+Y  MR   V+P+ HT  F+L +  N L +  G +    ++ FG  KD F+R +L+++
Sbjct: 47  ISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNM 106

Query: 174 YCTHCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAM 233
           Y +   +  A++VFD+   S+D+ +WNS++  + +AG I+ A KLF EMPE++VISWS +
Sbjct: 107 YSSCGDLRSAQRVFDD-SGSKDLPAWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCL 166

Query: 234 ISGCVQNGLLEKALDCFNEMREQK-----MRPNEAILVSMLAAASQLGMLEYGKMIHSIA 293
           I+G V  G  ++ALD F EM+  K     +RPNE  + ++L+A  +LG LE GK +H+  
Sbjct: 167 INGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYI 226

Query: 294 DSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRM-PQKDKWTWNVMICGLASHGLGQEA 353
           D     +   LGTAL+DMYAKCG ++ +K +F+ +  +KD   ++ MIC LA +GL  E 
Sbjct: 227 DKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALGSKKDVKAYSAMICCLAMYGLTDEC 286

Query: 354 LALFEKFLTQ-GFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMV 413
             LF +  T     P +VTF+G+L AC   GL++EG+ +FK+M + + I P ++HYGCMV
Sbjct: 287 FQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMV 346

Query: 414 DLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNG 473
           DL+ R+G + +A   I  MP  PD ++W ++L   ++ G I+  E    +LI++DP ++G
Sbjct: 347 DLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSG 406

Query: 474 HYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTE 533
            YV L+ +YA+  +W +V  IR  M  +  NK+ G S +E  G VH FV GD+  ++   
Sbjct: 407 AYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGCSYVEVEGVVHEFVVGDESQQESER 466

Query: 534 IYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIR 593
           IY ML+ I  R+  AGY  +   VL D+ E++KE A+  HSE+LAIAF L+ T+ G  +R
Sbjct: 467 IYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIALSYHSEKLAIAFCLMKTRPGTPVR 526

Query: 594 IIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           IIKNLR+CGDCH V K+IS++F REI+VRD +RFHHF++GSCSC D+W
Sbjct: 527 IIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHHFRDGSCSCRDFW 573
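
The bit scores and E-values of the five TAIR10 hits above can be cross-checked against each other. A rough sketch, assuming the standard BLAST relation E = m·n·2^(−S′), where S′ is the bit score and m·n is the effective search space: solving for m·n should give about the same value for every hit, because all five searches used the same query and database. The dictionary and function names are hypothetical, and the inputs are the rounded figures printed in the HSP headers, so the estimates are only approximate.

```python
import math

# (bit score, E-value) pairs copied from the TAIR10 HSP headers above.
TAIR10_HSPS = {
    "AT5G66520.1": (485.3, 5.6e-137),
    "AT5G06540.1": (477.6, 1.2e-134),
    "AT4G21065.1": (474.9, 7.5e-134),
    "AT1G74630.1": (464.2, 1.3e-130),
    "AT3G62890.1": (463.4, 2.3e-130),
}


def log10_search_space(bit_score: float, e_value: float) -> float:
    """Solve E = mn * 2**(-S') for log10(mn); log space avoids float overflow."""
    return math.log10(e_value) + bit_score * math.log10(2)


estimates = {
    name: log10_search_space(bits, evalue)
    for name, (bits, evalue) in TAIR10_HSPS.items()
}
for name, value in estimates.items():
    print(f"{name}: log10(effective search space) ~ {value:.2f}")
```

If the five estimates agree to within rounding, the headers are internally consistent. A hit whose estimate stood far from the others would suggest a transcription error in its score or E-value.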

BLAST of CmoCh07G001980 vs. NCBI nr
Match: gi|659078510|ref|XP_008439760.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo])

HSP 1 Score: 1124.8 bits (2908), Expect = 0.0e+00
Identity = 551/637 (86.50%), Positives = 585/637 (91.84%), Query Frame = 1

Query: 1   MLLCKPKFIFWTSKQRLNVHVFSTVSH--LPPPLSSLPPISGITQIKQAHARSVVFGLAN 60
           MLLC+P F FWTSKQRLN H+FSTV +  LP PLSSLPP  GITQIKQAHAR++VFGLAN
Sbjct: 83  MLLCRPNFFFWTSKQRLNFHLFSTVPNPRLPSPLSSLPPTPGITQIKQAHARTIVFGLAN 142

Query: 61  DGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYS 120
           DGRI  HLLAFLA+SSSSLP +YA SIY SI HPSVFATNNMIRC  K +L   SISLYS
Sbjct: 143 DGRITPHLLAFLAISSSSLPSDYALSIYNSIPHPSVFATNNMIRCFVKGDLPRHSISLYS 202

Query: 121 HMRRSL-VAPNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTH 180
           HM RS  VAPNKHTLTFVLQACSNALAI EG QVQTHVIK GF KDVF+RNALIHLYCT 
Sbjct: 203 HMCRSFEVAPNKHTLTFVLQACSNALAIREGAQVQTHVIKLGFVKDVFVRNALIHLYCTC 262

Query: 181 CRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGC 240
           CRVE AKQVFDEVPSSRD+VSWNSMIAGFVR GQI+ A KLFVEMPEKDVISW  +ISGC
Sbjct: 263 CRVESAKQVFDEVPSSRDVVSWNSMIAGFVRHGQISDAQKLFVEMPEKDVISWGTIISGC 322

Query: 241 VQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTA 300
           VQNG LEKALD F E+ EQK+RPNEAILVS+LAAA+QLG LEYGKMIHSIADSL+FPMTA
Sbjct: 323 VQNGELEKALDYFKELGEQKLRPNEAILVSLLAAAAQLGTLEYGKMIHSIADSLRFPMTA 382

Query: 301 SLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQ 360
           SLGTALVDMYAKCGCIDES+FLFDRMP+KDKW+WNVMICGLA+HGLGQEALALFEKFLTQ
Sbjct: 383 SLGTALVDMYAKCGCIDESRFLFDRMPEKDKWSWNVMICGLATHGLGQEALALFEKFLTQ 442

Query: 361 GFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYD 420
           GF+P+NVTFIGVL ACSRAGLVSEGR FFKLMTDTY I PEMEHYGCMVDL SRAGFVYD
Sbjct: 443 GFHPINVTFIGVLTACSRAGLVSEGRHFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVYD 502

Query: 421 AVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYAR 480
           AVEMINRMPAPPDPVLWA+VLGSC+VHGF ELGEEIGNKLIQMDPTHNGHYVQLA I+AR
Sbjct: 503 AVEMINRMPAPPDPVLWASVLGSCQVHGFAELGEEIGNKLIQMDPTHNGHYVQLARIFAR 562

Query: 481 LRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVR 540
           LRKWEDVSK+RRLMA+RNSNK+AGWSLIEA GRVH+FVAGDKEHE+ TEIYKMLE IGVR
Sbjct: 563 LRKWEDVSKVRRLMAERNSNKVAGWSLIEAEGRVHQFVAGDKEHERTTEIYKMLEIIGVR 622

Query: 541 IAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDC 600
           IAAAGYSANV+SVLHDIEEEEKE AIKEHSERLAIAFGLLVT+VGDCIRIIKNLRVCGDC
Sbjct: 623 IAAAGYSANVTSVLHDIEEEEKENAIKEHSERLAIAFGLLVTKVGDCIRIIKNLRVCGDC 682

Query: 601 HEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           HEVSKIIS+VFEREIIVRDGSRFHHFKNGSCSC DYW
Sbjct: 683 HEVSKIISQVFEREIIVRDGSRFHHFKNGSCSCQDYW 719

BLAST of CmoCh07G001980 vs. NCBI nr
Match: gi|449434296|ref|XP_004134932.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis sativus])

HSP 1 Score: 1105.9 bits (2859), Expect = 0.0e+00
Identity = 545/638 (85.42%), Positives = 578/638 (90.60%), Query Frame = 1

Query: 1   MLLCKPKFIF-WTSKQRLNVHVFSTVSH--LPPPLSSLPPISGITQIKQAHARSVVFGLA 60
           MLLC+P F+F W SKQRLN H FST+ +  LPPP SSLPP  GITQIKQAHAR +V GLA
Sbjct: 1   MLLCRPNFLFFWISKQRLNFHFFSTLPNPRLPPPFSSLPPTPGITQIKQAHARILVLGLA 60

Query: 61  NDGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLY 120
           NDGRI  HLLAFLA+SSSSLP +YA SIY SISHP+VFATNNMIRC  K +L   SISLY
Sbjct: 61  NDGRITSHLLAFLAISSSSLPSDYALSIYNSISHPTVFATNNMIRCFVKGDLPRHSISLY 120

Query: 121 SHMRRSLVA-PNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCT 180
           SHM RS VA PNKHTLTFVLQACSNA AI EG QVQTHVIK GF KDVF+RNALIHLYCT
Sbjct: 121 SHMCRSFVAAPNKHTLTFVLQACSNAFAIREGAQVQTHVIKLGFVKDVFVRNALIHLYCT 180

Query: 181 HCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISG 240
            CRVE AKQVFDEVPSSRD+VSWNSMI GFVR GQI+VA KLFVEMPEKDVISW  +ISG
Sbjct: 181 CCRVESAKQVFDEVPSSRDVVSWNSMIVGFVRLGQISVAQKLFVEMPEKDVISWGTIISG 240

Query: 241 CVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMT 300
           CVQNG LEKALD F E+ EQK+RPNEAILVS+LAAA+QLG LEYGK IHSIA+SL+FPMT
Sbjct: 241 CVQNGELEKALDYFKELGEQKLRPNEAILVSLLAAAAQLGTLEYGKRIHSIANSLRFPMT 300

Query: 301 ASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLT 360
           ASLGTALVDMYAKCGCIDES+FLFDRMP+KDKW+WNVMICGLA+HGLGQEALALFEKFLT
Sbjct: 301 ASLGTALVDMYAKCGCIDESRFLFDRMPEKDKWSWNVMICGLATHGLGQEALALFEKFLT 360

Query: 361 QGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVY 420
           QGF+PVNVTFIGVL ACSRAGLVSEG+ FFKLMTDTY I PEMEHYGCMVDL SRAGFVY
Sbjct: 361 QGFHPVNVTFIGVLTACSRAGLVSEGKHFFKLMTDTYGIEPEMEHYGCMVDLLSRAGFVY 420

Query: 421 DAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYA 480
           DAVEMINRMPAPPDPVLWA+VLGSC+VHGFIELGEEIGNKLIQMDPTHNGHYVQLA I+A
Sbjct: 421 DAVEMINRMPAPPDPVLWASVLGSCQVHGFIELGEEIGNKLIQMDPTHNGHYVQLARIFA 480

Query: 481 RLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGV 540
           RLRKWEDVSK+RRLMA+RNSNKIAGWSLIEA GRVHRFVAGDKEHE+ TEIYKMLE +GV
Sbjct: 481 RLRKWEDVSKVRRLMAERNSNKIAGWSLIEAEGRVHRFVAGDKEHERTTEIYKMLEIMGV 540

Query: 541 RIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGD 600
           RIAAAGYSANVSSVLHDIEEEEKE AIKEHSERLAIAFGLLVT+ GDCIRIIKNLRVCGD
Sbjct: 541 RIAAAGYSANVSSVLHDIEEEEKENAIKEHSERLAIAFGLLVTKDGDCIRIIKNLRVCGD 600

Query: 601 CHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           CHEVSKIIS VFEREIIVRDGSRFHHFK G CSC DYW
Sbjct: 601 CHEVSKIISLVFEREIIVRDGSRFHHFKKGICSCQDYW 638

BLAST of CmoCh07G001980 vs. NCBI nr
Match: gi|645223550|ref|XP_008218684.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Prunus mume])

HSP 1 Score: 895.2 bits (2312), Expect = 6.6e-257
Identity = 419/609 (68.80%), Positives = 510/609 (83.74%), Query Frame = 1

Query: 26  SHLPPPLSSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYEYAFSI 85
           S L PPLSSLPP   I Q KQAHA+ +V GLA D  ++ HLL FLA+S S+ P+ Y+ S+
Sbjct: 20  SQLSPPLSSLPPRPSIPQTKQAHAQIIVSGLAADSPLISHLLCFLALSPST-PFHYSLSL 79

Query: 86  YQSISHPSVFATNNMIRCCAKEELSCESISLYSHMRRSLVAPNKHTLTFVLQACSNALAI 145
           YQSI +PSVFATNNMIRC AK +   +S+ L+S M R+ + PN HT TF+LQACS ALA+
Sbjct: 80  YQSIKYPSVFATNNMIRCFAKSDSPPQSLLLFSSMLRTCMKPNNHTFTFLLQACSRALAL 139

Query: 146 CEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWNSMIAG 205
            EG QV T  +K GF   VF+RNALIHLYC+  R+EC+K++F+E  SSRD+V+WNSM+  
Sbjct: 140 NEGAQVHTVAVKLGFGGYVFVRNALIHLYCSCSRIECSKRLFEENASSRDVVTWNSMLMA 199

Query: 206 FVRAGQINVADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRPNEAIL 265
           F++  QI  A+KLF EMPE+DVISWS MISG VQNG LE+ L+CF +MRE+ MR NEA L
Sbjct: 200 FLKDEQIGAAEKLFEEMPERDVISWSTMISGNVQNGRLEEGLECFKQMREKGMRLNEATL 259

Query: 266 VSMLAAASQLGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLFDRMPQ 325
           VS+L+A++QLG+LE+G+++HS+ +SL FP+T SLGTA++DMYAKCGCI++SK LF  MP+
Sbjct: 260 VSVLSASAQLGLLEHGRLVHSLVESLNFPLTVSLGTAIIDMYAKCGCIEQSKLLFKNMPK 319

Query: 326 KDKWTWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVSEGRRF 385
           KD WTWNVMICGLASHGLG+EALALF++F+ +GF+PVNVTFIGVL ACSRAGLVSEGRR 
Sbjct: 320 KDIWTWNVMICGLASHGLGKEALALFQRFIDEGFHPVNVTFIGVLGACSRAGLVSEGRRH 379

Query: 386 FKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGSCKVHG 445
           FKLMT+ Y I+PEMEHYGCMVD+  RAGF+ +AV++I +M  PPDPVLWAT+LG+CK+HG
Sbjct: 380 FKLMTEKYSILPEMEHYGCMVDMLGRAGFLDEAVQLIEKMTVPPDPVLWATLLGACKIHG 439

Query: 446 FIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIAGWSLI 505
            IELGE+IG KL+++DPTH+GHYVQLA IYA+ RKWEDV ++RRL+ ++N+NK AGWSLI
Sbjct: 440 SIELGEKIGKKLLKLDPTHDGHYVQLASIYAKARKWEDVIRVRRLLVEQNTNKAAGWSLI 499

Query: 506 EAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKETAIKE 565
           EA G VH+FVAGD+EHE+  EIYKMLE IG RIA +GYS NVSSVLHDI EEEKE AIKE
Sbjct: 500 EAQGTVHKFVAGDREHERSLEIYKMLEKIGTRIAESGYSPNVSSVLHDIGEEEKENAIKE 559

Query: 566 HSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRFHHFKN 625
           HSERLA+AFGLLVT  GDCIRI+KNLRVC DCHEVSKIIS+VFEREIIVRDGSRFHHFK+
Sbjct: 560 HSERLAMAFGLLVTGAGDCIRIVKNLRVCEDCHEVSKIISKVFEREIIVRDGSRFHHFKD 619

Query: 626 GSCSCLDYW 635
           G CSCLDYW
Sbjct: 620 GKCSCLDYW 627

BLAST of CmoCh07G001980 vs. NCBI nr
Match: gi|694427521|ref|XP_009341388.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Pyrus x bretschneideri])

HSP 1 Score: 872.8 bits (2254), Expect = 3.5e-250
Identity = 417/614 (67.92%), Positives = 505/614 (82.25%), Query Frame = 1

Query: 23  STVSHLPPPL-SSLPPISGITQIKQAHARSVVFGLANDGRIMGHLLAFLAVSSSSLPYEY 82
           S +SH    L S+LPP   + Q KQAHA  +V G A    ++ HLL  L++S ++  + Y
Sbjct: 260 SLLSHSSSSLVSALPPRPSLRQTKQAHAHIIVSGHAAYSPLLSHLLCLLSLSPTTT-FHY 319

Query: 83  AFSIYQSISHPSVFATNNMIRCCAKEELSCESISLYSH-MRRSLVAPNKHTLTFVLQACS 142
           + S+Y+SI +PSVFATNNMIRC AK +    S+ L+S  M RS V PN HT TF+LQACS
Sbjct: 320 SLSLYRSIKYPSVFATNNMIRCFAKSDSPLLSLLLFSSSMLRSYVKPNNHTFTFLLQACS 379

Query: 143 NALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCTHCRVECAKQVFDEVPSSRDIVSWN 202
            A+A+ EG QV   V+K GF   VF+RNALIHLYC+  R+EC+K+VF+E  SSRD+V+WN
Sbjct: 380 KAMALTEGAQVHAIVVKLGFGGYVFVRNALIHLYCSCSRIECSKRVFEENVSSRDVVTWN 439

Query: 203 SMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISGCVQNGLLEKALDCFNEMREQKMRP 262
           SM+  FVR  QI VA+KLF EMPE+DVISWS MISG VQNG LE+ LDCF  M+E+ +R 
Sbjct: 440 SMLTAFVRDEQIGVAEKLFEEMPERDVISWSTMISGYVQNGRLEQGLDCFKRMKEEGIRM 499

Query: 263 NEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMTASLGTALVDMYAKCGCIDESKFLF 322
           NEA LVS+L+A++QLG+LE+G+++HS+A+SL FP+TA LGTAL+DMYAKCGCI++SK LF
Sbjct: 500 NEATLVSVLSASAQLGLLEHGRLVHSLAESLNFPLTACLGTALIDMYAKCGCIEQSKLLF 559

Query: 323 DRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLTQGFYPVNVTFIGVLNACSRAGLVS 382
             MPQ+D WTWNVMICGLASHGLG+EALALF++F+ +GF P NVTFIGVL ACSRAGLVS
Sbjct: 560 KNMPQRDIWTWNVMICGLASHGLGKEALALFDRFVDEGFRPANVTFIGVLGACSRAGLVS 619

Query: 383 EGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVYDAVEMINRMPAPPDPVLWATVLGS 442
           EGRR FKLMT+ Y I+PEMEHYGCMVD+  RAGFV +AVE+I +M   PDPVLWAT+LG+
Sbjct: 620 EGRRHFKLMTEKYGILPEMEHYGCMVDILGRAGFVDEAVELIEKMTVSPDPVLWATLLGA 679

Query: 443 CKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYARLRKWEDVSKIRRLMADRNSNKIA 502
           CK+HG IELGE+IGNKL+++DPTH+GHYVQLA IYA+ RKWEDV ++RRLM ++N+NK A
Sbjct: 680 CKIHGSIELGEKIGNKLLELDPTHDGHYVQLANIYAKARKWEDVVRVRRLMVEQNTNKAA 739

Query: 503 GWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGVRIAAAGYSANVSSVLHDIEEEEKE 562
           GWSLIEA G+VH+FVAGD+EHE+  EI+KMLETIG RIA AGYS NV+SVLHDI EEEKE
Sbjct: 740 GWSLIEAHGKVHKFVAGDREHERSLEIHKMLETIGTRIAEAGYSPNVTSVLHDIGEEEKE 799

Query: 563 TAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGDCHEVSKIISRVFEREIIVRDGSRF 622
             IKEHSERLA+AFGLLVTQ GDCIRI+KNLRVC DCHEVSKIIS+VFEREIIVRDGSRF
Sbjct: 800 NVIKEHSERLAMAFGLLVTQAGDCIRIMKNLRVCEDCHEVSKIISKVFEREIIVRDGSRF 859

Query: 623 HHFKNGSCSCLDYW 635
           HHFK+G CSCLDYW
Sbjct: 860 HHFKDGKCSCLDYW 872

BLAST of CmoCh07G001980 vs. NCBI nr
Match: gi|658008452|ref|XP_008339421.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Malus domestica])

HSP 1 Score: 857.8 bits (2215), Expect = 1.2e-245
Identity = 412/638 (64.58%), Positives = 508/638 (79.62%), Query Frame = 1

Query: 1   MLLCKPKFIFWTSKQRLNVHVFS---TVSHLPPPLSSLPPISGITQIKQAHARSVVFGLA 60
           M   KPK +F +  + ++  + S   + S     +S+LPP   + Q KQAHA  +V G A
Sbjct: 1   MFAYKPKSLFPSLPKHISNSLLSHSSSSSSSSSLVSALPPRPSLRQTKQAHAHIIVSGHA 60

Query: 61  NDGRIMGHLLAFLAVSSSSLPYEYAFSIYQSISHPSVFATNNMIRCCAKEELSCESISLY 120
               ++ HLL  L++S ++ P+ Y+ S+Y SI +PSVFATNNMIRC AK +    S+ L+
Sbjct: 61  AYSPLLSHLLCLLSLSPTT-PFHYSLSLYHSIKYPSVFATNNMIRCFAKSDSPLLSLLLF 120

Query: 121 S-HMRRSLVAPNKHTLTFVLQACSNALAICEGIQVQTHVIKFGFAKDVFIRNALIHLYCT 180
           S  + RS V PN HT TF+LQACS A+A+ EG QV   V+K GF   VF+RNALIHLYC+
Sbjct: 121 SCSILRSCVKPNNHTFTFLLQACSRAMALTEGDQVHAVVVKLGFGGYVFVRNALIHLYCS 180

Query: 181 HCRVECAKQVFDEVPSSRDIVSWNSMIAGFVRAGQINVADKLFVEMPEKDVISWSAMISG 240
             R+EC+K+VF+E   SRD+V+WNSM+  FVR  QI VA+KLF EMP +DVISWS MISG
Sbjct: 181 CSRIECSKRVFEENVXSRDVVTWNSMLTAFVRDEQIGVAEKLFEEMPARDVISWSTMISG 240

Query: 241 CVQNGLLEKALDCFNEMREQKMRPNEAILVSMLAAASQLGMLEYGKMIHSIADSLKFPMT 300
            VQNG LE+ L+CF  MRE+ +R NEA  VS+L+A++QLG+LE+G+++HS+A+SL FP+T
Sbjct: 241 YVQNGXLEQGLECFKRMREEGIRMNEATXVSVLSASAQLGLLEHGRLVHSLAESLNFPLT 300

Query: 301 ASLGTALVDMYAKCGCIDESKFLFDRMPQKDKWTWNVMICGLASHGLGQEALALFEKFLT 360
           A LGTAL+DMYAKCGCI++SK LF  M Q+D WTWNVMICGLASHGLG+EALALF++F+ 
Sbjct: 301 ACLGTALIDMYAKCGCIEQSKLLFKNMTQRDIWTWNVMICGLASHGLGKEALALFDRFVD 360

Query: 361 QGFYPVNVTFIGVLNACSRAGLVSEGRRFFKLMTDTYKIIPEMEHYGCMVDLFSRAGFVY 420
           +GF P NVTFIGVL ACSRAGLV EGR  FKLMT+ Y I+PEMEHYGCMVD+  RAGFV 
Sbjct: 361 EGFRPANVTFIGVLGACSRAGLVREGRHHFKLMTEKYGILPEMEHYGCMVDILGRAGFVD 420

Query: 421 DAVEMINRMPAPPDPVLWATVLGSCKVHGFIELGEEIGNKLIQMDPTHNGHYVQLAGIYA 480
           +AVE+I +M   PDPVLWAT+LG+CK+HG IELGE+IGNKL+++DPTH+GHYVQLA IYA
Sbjct: 421 EAVELIEKMTVSPDPVLWATLLGACKIHGSIELGEKIGNKLLELDPTHDGHYVQLANIYA 480

Query: 481 RLRKWEDVSKIRRLMADRNSNKIAGWSLIEAGGRVHRFVAGDKEHEQCTEIYKMLETIGV 540
           + RKWEDV ++RRLM ++N+NK AGWSLIEA GRVH+FVAGD+EHE+  EI+KMLETIG 
Sbjct: 481 KARKWEDVVRVRRLMVEQNTNKAAGWSLIEAQGRVHKFVAGDREHERSLEIHKMLETIGT 540

Query: 541 RIAAAGYSANVSSVLHDIEEEEKETAIKEHSERLAIAFGLLVTQVGDCIRIIKNLRVCGD 600
           RIA AGYS NV+SVLHDI EEEKE  I EHSERLA+AFGLLVT+ G+CIRI+KNLRVC D
Sbjct: 541 RIAEAGYSPNVTSVLHDIGEEEKENVIMEHSERLAMAFGLLVTEAGECIRIMKNLRVCED 600

Query: 601 CHEVSKIISRVFEREIIVRDGSRFHHFKNGSCSCLDYW 635
           CHEVSKIIS+VFEREIIVRDGSRFHHFK+G CSCLDYW
Sbjct: 601 CHEVSKIISKVFEREIIVRDGSRFHHFKDGKCSCLDYW 637

The following BLAST results are available for this feature:
Match Name     E-value   Identity  Description
PP449_ARATH    9.9e-136  40.23     Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN...
PP367_ARATH    2.1e-133  37.42     Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN...
PP330_ARATH    1.3e-132  40.10     Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP122_ARATH    2.4e-129  37.68     Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana GN...
PP295_ARATH    4.0e-129  42.80     Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity  Description
A0A0A0KNI8_CUCSA  0.0e+00   85.42     Uncharacterized protein OS=Cucumis sativus GN=Csa_6G519440 PE=4 SV=1
B9HDP6_POPTR      4.8e-238  65.32     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0006s24730g PE=4 SV=2
M5XDQ3_PRUPE      1.6e-233  70.47     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004024mg PE=4 SV=1
A0A067L218_JATCU  9.7e-231  65.03     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_00279 PE=4 SV=1
A0A103XBF5_CYNCS  5.7e-215  59.27     Pentatricopeptide repeat-containing protein OS=Cynara cardunculus var. scolymus ...

Match Name   E-value   Identity  Description
AT5G66520.1  5.6e-137  40.23     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G06540.1  1.2e-134  37.42     Pentatricopeptide repeat (PPR) superfamily protein
AT4G21065.1  7.5e-134  40.10     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74630.1  1.3e-130  37.68     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G62890.1  2.3e-130  42.80     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                        E-value   Identity  Description
gi|659078510|ref|XP_008439760.1|  0.0e+00   86.50     PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis m...
gi|449434296|ref|XP_004134932.1|  0.0e+00   85.42     PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis s...
gi|645223550|ref|XP_008218684.1|  6.6e-257  68.80     PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Prunus mu...
gi|694427521|ref|XP_009341388.1|  3.5e-250  67.92     PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Pyrus x b...
gi|658008452|ref|XP_008339421.1|  1.2e-245  64.58     PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Malus dom...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0009536      plastid
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh07G001980.1  CmoCh07G001980.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                        Source    Source Term       Source Description  Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535           PPR
    coord: 364..390  score: 0.11
    coord: 168..191  score: 0.26
    coord: 401..425  score: 0.043
    coord: 301..326  score: 0.0068
    coord: 330..358  score: 1.
IPR002885  Pentatricopeptide repeat               PFAM      PF13041           PPR_2
    coord: 92..140   score: 3.9E-7
    coord: 225..262  score: 2.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756
    coord: 228..262  score: 1.6E-10
    coord: 330..361  score: 1.6E-5
    coord: 197..227  score: 2.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR
    coord: 398..428  score: 6.39
    coord: 430..460  score: 5.185
    coord: 163..193  score: 7.541
    coord: 93..127   score: 7.881
    coord: 195..225  score: 10.358
    coord: 296..326  score: 7.224
    coord: 362..392  score: 6.785
    coord: 327..361  score: 10.852
    coord: 226..260  score: 12.748
    coord: 464..498  score: 5
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 428..483  score: 8.1E-5
    coord: 224..358  score: 8.
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED
    coord: 27..505   score: 1.3E
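
The PS51375 (PPR profile) hits above are listed out of order, which hides how the repeats are laid out along the protein. As a small illustration, the Python sketch below sorts the coordinates and reports each hit's length and the distance to the next hit. The coordinates are copied from the table; the names `PS51375_HITS` and `repeat_layout` are hypothetical, and the code assumes the coordinates are 1-based and inclusive.

```python
# PROSITE profile PS51375 (PPR) hit coordinates, copied from the table above.
# Assumed to be 1-based, inclusive protein positions.
PS51375_HITS = [
    (398, 428), (430, 460), (163, 193), (93, 127), (195, 225),
    (296, 326), (362, 392), (327, 361), (226, 260), (464, 498),
]


def repeat_layout(hits):
    """Return hits sorted by start, their lengths, and gaps between neighbours."""
    ordered = sorted(hits)
    lengths = [end - start + 1 for start, end in ordered]
    # Gap = number of residues between the end of one hit and the start of the next.
    gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]
    return ordered, lengths, gaps


ordered, lengths, gaps = repeat_layout(PS51375_HITS)
for (start, end), length in zip(ordered, lengths):
    print(f"{start:>3}..{end:<3}  length {length}")
print("gaps between consecutive hits:", gaps)
```

Hits of 31 to 35 residues separated by gaps of only a few residues are what a tandem PPR array looks like, since the PPR motif is about 35 amino acids long. The two 35-residue gaps in the sorted list (after positions 127 and 260) are each one motif wide and may hold repeats that scored below the profile threshold, though this table alone cannot confirm that.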

The following gene(s) are paralogous to this gene:

None