CmoCh04G009740 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G009740
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cmo_Chr04 : 4949678 .. 4953724 (+)
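
As a quick sanity check on the coordinates above, the length of the genomic span can be computed from the start and end positions. The sketch below assumes the coordinates are 1-based and inclusive, a common convention for GFF-style annotations that this page does not state. The function name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location line of this record.
gene_length = span_length(4949678, 4953724)
print(gene_length)
```

If the assumption holds, the printed value should match the number of bases in the gene sequence (with intron) listed below.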
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGGTAGCCATGATTTATGGGAATGGTGGGTTTGTTTGGGTTTTGCATGCGTTGTTTTTCATGGTGGTCGGTGGGTCCTTCGTCCATGGGGACGGTGGTGCTTGGCTTGATGCTCATGCAACTTTCTATGGAGCTGATCAAAACCCTACTAGCCTCGGTGAGTCTTTGAATGAAATAATTATTAAATGAGATTAAAAAAAAAAAATTTGAAAACTAGTTATTTTTCTTAAAATGTGTCCTTTTTTTAATTCTTAAATATAAATTAACTTTTAAAAATATTTCTGAAAGGTACTTTAAAATTAAGGAAATTGTTAGTACACATGTTTCCAATAATTTAAGAAAATGCACAATTTATTAATCATACATATCTGGCAATCAACTTTACTATTTGAACATTATGCCATTTAATCATTTTCTTGATATATATATATATATATATAAAAGAAAATAATATATTAATATTGTAATAAAGTTATGTGATTATTTATTAATCAGGAGGAGCATGTGGTTACGACAATACATTTCATGCCGGATTTGGAATAAACACGGCGGCGGTGAGCGGCGCACTTTTCAGAGGAGGAGAGGCTTGCGGCGCTTGCTTCCTAGTAATTTGCAACTACAACGTGGACCCCAAGTGGTGCCTCCGCCGCCGCGCCGTCGCAATCACCGCCACGAACTTCTGCCCCTCCAATAACAACGGGGGCTGGTGCGACCCCCCTCGCGCGCACTTCGACATGTCGTCACCTGCCTTTCTTACCATTGCTCGGCAAGGCAACGAAGGGATCGTCCCTGTCCTTTACAAGAGGTACAACGTGATACAATATTTAGTAAGTTGGTAGAAATTTAAATAAAACGTTCAATTTTTTTTTAATATATGCTTTTAAATGTTGTATTTTTAAAGGGTAAGTTGTAGAAGGAAGGGAGGAGTTCGATTCACATTGAGAGGACAATCAAACTTCAATATGGTAATGATATCGAACGTCGGTGGCAGCGGCGACATAAAGGCTGCATGGGTTAAGGGGTCGAGGACGAGGACGTGGATGCTCATGCATCGTAATTGGGGCGCAAACTGGCAAGCCAACGTCGACCTTCGAAACCAAATAATGTCGTTTAAGGTTACTCTAATAGATGGGAGGACATTGGATTTTGTCAATGTGGTTCCTTCCTCTTGGAGGTTTGGACAAACGTTTTCTTCCATGGTTCAGTTTTCTTAGGGCTCGAACCTTCATGTTTGATCATTCATTCTTTTTTACTTGTTTTTATTAAAATTGATTTCAAGTTTATTTTCTTCTTGAAATTATAGTCGGTGAATTTTATATCATTATAATATAATGATATAAAACAATAATTATATTGAATTTATAATTCTTCCCCCTTAGTTTCCTATCAATTTATGTATTATTTGGACCTATCATTTAATATTTCTTTGATGTAATGTTCAAATTAGACATGTTATGACACATCTTATAAATAATTTTGAAATTTAGTCGCAAGCATCAATTAACCAAAGAATATAATATATTAAGGAAACAATATGTTAAATTATTCAAATCTTTTATACTTCAAATAATTTTAATACCGAAAGTGATATATATTAAATAAATAATTATTATTTATCTTAAATTTTGCTCATCTAATTCAATTTTTAATTATGTTTTTTTAAATTACACAATATTGATGTAAATAGATTTGAAAGAATTTTGGTGAAAAGCAAAATCTACTGGTTATAAATTTAAATAATATTGTTGGATGACGAAAGTCTCACGTCCACTAATTTAGGAGAAAATGATTATGAATTTATAATCCATAGGATTTTTTTTAGGAGGAAAGCACTAGAACTTAAGAAAAATATATATTTTTTAAATTAGTTGGAGATGAATTATTTAATAATAATTTTAAAAAATTTAACTGGAAATGCTTGAGACATGAAGTGAGAAATAAATGAAAGATGTTTATAGAGACGATACGATTACAACCCTAGAAACTATTTATTCGAT
TTCAAACGCTGGAGGTGGCGCTCAAAACTGTCGAAACTCGATCTTTTTTTCTTTTTTCCTGAAATTGGAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAGAACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGCTTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAAGCCCTTCAGGTGAGCGCAATTGAATAACACACTCTAATTTTTACTTGATCCTTTGCCATTTCCCGTTAAAGGAATGTTTGTATTGTACATGATGTCTGTTGGATGGAGTAAAATTCGTGTGTTTTTATTGAATGAATGATGGTGAGAGAGGATTTTCAAATCGATTTATTGATGAAAATATTGTGGACGAAGGCAACAAACTGGAAGTTTGCGTTTTGAATTCTGAAGTAATCACATGTAGTCAAGTTAATTGTGGTTTTTCAACATGATTCTAGCAGTTGCAATGCATTAGTTGCAGTTTCACTAGAGGATGAAAAGTTTCATGTTATTTAGCACAAGGTTGATTTCCTAAATTGTTCTCTTCCCATTTGGTTTTTCATCAATCCTAGATCTTATTTTTGAAATATCTATAAGATTTCTGGATATTCGATCATTTGTTCTTCTTTTGGATTATTCAAATTGAAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTCTCCTTTACAACTAGAGACTTCGCTGTACAGCTTGATCTGATCGGCCGAGTTCGGGGGATCGATTCTGCAGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGGCTTGTAGATAAGGCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCTGAAATGAAGGATAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGCTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAAAGAAATGGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCGAGGACAAGGTCAACCAAGATGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGCAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTTGAGGAATGGGAGTCATCTTGCGAGTGTTATGATTTTCGAGTTCCGAATGTTCTTCTCATTGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATTCCACCACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGGTGGAGGCCCAAACCTAGCGTCTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGGCTGTTCCTTCCATGGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTGAAAAACAATGATGAAACAACGG
CCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAGATTAA

mRNA sequence

ATGATGGTAGCCATGATTTATGGGAATGGTGGGTTTGTTTGGGTTTTGCATGCGTTGTTTTTCATGGTGGTCGGTGGGTCCTTCGTCCATGGGGACGGTGGTGCTTGGCTTGATGCTCATGCAACTTTCTATGGAGCTGATCAAAACCCTACTAGCCTCGGAGGAGCATGTGGTTACGACAATACATTTCATGCCGGATTTGGAATAAACACGGCGGCGGTGAGCGGCGCACTTTTCAGAGGAGGAGAGGCTTGCGGCGCTTGCTTCCTAGTAATTTGCAACTACAACGTGGACCCCAAGTGGTGCCTCCGCCGCCGCGCCGTCGCAATCACCGCCACGAACTTCTGCCCCTCCAATAACAACGGGGGCTGGTGCGACCCCCCTCGCGCGCACTTCGACATGTCGTCACCTGCCTTTCTTACCATTGCTCGGCAAGGCAACGAAGGGATCGTCCCTGTCCTTTACAAGAGGGTAAGTTGTAGAAGGAAGGGAGGAGTTCGATTCACATTGAGAGGACAATCAAACTTCAATATGGTAATGATATCGAACGTCGGTGGCAGCGGCGACATAAAGGCTGCATGGGTTAAGGGGTCGAGGACGAGGACGTGGATGCTCATGCATCGTAATTGGGGCGCAAACTGGCAAGCCAACGTCGACCTTCGAAACCAAATAATGTCGTTTAAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAGAACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGCTTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAAGCCCTTCAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTCTCCTTTACAACTAGAGACTTCGCTGTACAGCTTGATCTGATCGGCCGAGTTCGGGGGATCGATTCTGCAGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGGCTTGTAGATAAGGCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCTGAAATGAAGGATAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGCTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAAAGAAATGGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCGAGGACAAGGTCAACCAAGATGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGCAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTTGAGGAATGGGAGTCATCTTGCGAGTGTTATGATTTTCGAGTTCCGAATGTTCTTCTCATTGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATTCCACCACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGGTGGAGGCCCAAACCTAGCGT
CTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGGCTGTTCCTTCCATGGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTGAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAGATTAA

Coding sequence (CDS)

ATGATGGTAGCCATGATTTATGGGAATGGTGGGTTTGTTTGGGTTTTGCATGCGTTGTTTTTCATGGTGGTCGGTGGGTCCTTCGTCCATGGGGACGGTGGTGCTTGGCTTGATGCTCATGCAACTTTCTATGGAGCTGATCAAAACCCTACTAGCCTCGGAGGAGCATGTGGTTACGACAATACATTTCATGCCGGATTTGGAATAAACACGGCGGCGGTGAGCGGCGCACTTTTCAGAGGAGGAGAGGCTTGCGGCGCTTGCTTCCTAGTAATTTGCAACTACAACGTGGACCCCAAGTGGTGCCTCCGCCGCCGCGCCGTCGCAATCACCGCCACGAACTTCTGCCCCTCCAATAACAACGGGGGCTGGTGCGACCCCCCTCGCGCGCACTTCGACATGTCGTCACCTGCCTTTCTTACCATTGCTCGGCAAGGCAACGAAGGGATCGTCCCTGTCCTTTACAAGAGGGTAAGTTGTAGAAGGAAGGGAGGAGTTCGATTCACATTGAGAGGACAATCAAACTTCAATATGGTAATGATATCGAACGTCGGTGGCAGCGGCGACATAAAGGCTGCATGGGTTAAGGGGTCGAGGACGAGGACGTGGATGCTCATGCATCGTAATTGGGGCGCAAACTGGCAAGCCAACGTCGACCTTCGAAACCAAATAATGTCGTTTAAGGTTTCCGGAGCTCCGGCCTTGGCGGCGGCATTGGCAATGTTCAAAATCTTGAGGAGCTTTTCTTCAGGTTTCACGAGAACGGCAAGAACGGAGACAGATGCATTCTGTTTTGTAGCGTTGAGATTATACAGCGCGAGACGAACCTGCAACCGAAGAAACCTCTTCGCCAGGATCAGTCCTCTCGGTTCTCCTGAGCTTAGTGTAGTTCCGATTCTTGATCAGTGGATTCAGGAAGGCAGGATGATCAAGGACTTTGAGATGCGGAGAATCGTTCGCGACCTTCGTAATTGCCGGCGGTATGGCCAAGCCCTTCAGGTGTCTGAATGGATGCGTAGCAAGGGACTTTTCTCCTTTACAACTAGAGACTTCGCTGTACAGCTTGATCTGATCGGCCGAGTTCGGGGGATCGATTCTGCAGAGAAGTATTTCAGCAGTGTTTCTAACCAAGAGGAAATTGGTAAACTCTATGGTGCTCTTCTAAATTGTTATGTCAGGGAAGGGCTTGTAGATAAGGCCCTTTCCCATATGCAGAAGATGAAAGAGATGGGTTTTGCTTCCTCTCCCCTCTGCTACAATGATATAATGTGTCTATATTTGAACACTGGCCAGGTCGATAAAGTTCCGAATGTACTTTCTGAAATGAAGGATAATGGTGTTCTTCCTGACAATTATAGCTATAGAATTTGCATCAGCTCTTATGGAGCTAGGTCTGATCTAATCGGTATGCTGAAGGTTTTGAAAGAAATGGAGAGTCAAACTCACATATCTATGGACTGGACTACTTATTCAATGGTTGCCAATTTTTTCATAAAGGCTGGTATGCACGAGCAAGCAATGAGTTACCTTCGGAAATGCGAGGACAAGGTCAACCAAGATGCTCTCGGCTTCAATCACCTCATTTCACTTTACACCAGTCTGGGACGTAAAGACGAAGTAATGAGACTGTGGGCTCTCCAAAAGAAGTGCAAGAAGCAAGTCAATAGGGATTATATAACCATGTTGGGTTGTTTGGTTAAGCTTGAGTTTCTTGAGGAAGCTGAGAAATTGGTTGAGGAATGGGAGTCATCTTGCGAGTGTTATGATTTTCGAGTTCCGAATGTTCTTCTCATTGGATACTCGCAAAGGGGGCTAATTGAAAGAGCAGAAAAGATGCTTCAAAACATCATCAGTGATGGGAGGATTCCACCACCCAATAGTTGGGGCATTATTGCAGCAGGGTACTTGGAGAAGCAGAACCCGGAGAGAGCTTTCAAGTGCATGAAGGAAGCTGTAGCTGTACAAGAGCAAAACAAAGGGTGGAGGCCCAAACCTAGCGT
CTTATCAAGCATACTGCGATGGCTATCTGAAAATGGAAGATATGAGGAGCTGAAAGAGTTTCTGAGCTCATTGAAGGCTGTTCCTTCCATGGACGGAAAACTAAGTAATGCCTTCGATGAGCTTCTGGAAACCTTGAAAAACAATGATGAAACAACGGCCGATGCTCTTAAGAAATCACAACCTTGTTTAGCTCAGGTAGATTAA
BLAST of CmoCh04G009740 vs. Swiss-Prot
Match: PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 1.2e-116
Identity = 212/482 (43.98%), Positives = 315/482 (65.35%), Query Frame = 1

Query: 266 VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNC 325
           +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   E+ RIV DLR  
Sbjct: 12  IASRYYYTNRV-KKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRR 71

Query: 326 RRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYG 385
           +R+  AL+VS+WM   G+  F+  + AV LDLIGRV G  +AE+YF ++  Q +  K YG
Sbjct: 72  KRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYG 131

Query: 386 ALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDN 445
           ALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMK+ 
Sbjct: 132 ALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEE 191

Query: 446 GVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQ 505
            V PDNYSYRICI+++GA  DL  +   L++ME +  I+MDW TY++ A F+I  G  ++
Sbjct: 192 NVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDR 251

Query: 506 AMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG 565
           A+  L+  E+++  +D  G+NHLI+LY  LG+K EV+RLW L+K  CK+++N+DY+T+L 
Sbjct: 252 AVELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 566 CLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIP 625
            LVK++ L EAE+++ EW+SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+  
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 626 PPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEE 685
            P SW ++A  Y EK   E AFKCMK A+ V+  ++ WRP  ++++S+L W+ + G  +E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 686 LKEFLSSLKAVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQ 730
           ++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILSTRS 491

BLAST of CmoCh04G009740 vs. Swiss-Prot
Match: EXP12_ARATH (Expansin-A12 OS=Arabidopsis thaliana GN=EXPA12 PE=2 SV=1)

HSP 1 Score: 305.1 bits (780), Expect = 2.1e-81
Identity = 131/195 (67.18%), Positives = 161/195 (82.56%), Query Frame = 1

Query: 36  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNY 95
           W+ AHAT+YG + +P SLGGACGYDN +HAGFG +TAA+SG LFR GE+CG C+ V C++
Sbjct: 27  WIRAHATYYGVNDSPASLGGACGYDNPYHAGFGAHTAALSGELFRSGESCGGCYQVRCDF 86

Query: 96  NVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMSSPAFLTIARQGNEGIVPVLY 155
             DPKWCLR  AV +TATNFCP+NNN GWC+ PR HFDMSSPAF  IAR+GNEGIVPV Y
Sbjct: 87  PADPKWCLRGAAVTVTATNFCPTNNNNGWCNLPRHHFDMSSPAFFRIARRGNEGIVPVFY 146

Query: 156 KRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQ 215
           +RV C+R+GGVRFT+RGQ NFNMVMISNVGG G +++  V+GS+ +TW+ M RNWGANWQ
Sbjct: 147 RRVGCKRRGGVRFTMRGQGNFNMVMISNVGGGGSVRSVAVRGSKGKTWLQMTRNWGANWQ 206

Query: 216 ANVDLRNQIMSFKVS 231
           ++ DLR Q +SFKV+
Sbjct: 207 SSGDLRGQRLSFKVT 221

BLAST of CmoCh04G009740 vs. Swiss-Prot
Match: PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 295.8 bits (756), Expect = 1.3e-78
Identity = 153/427 (35.83%), Positives = 249/427 (58.31%), Query Frame = 1

Query: 285 RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLF 344
           R++  G P  S++ +LD W+ +G ++K  E+  I++ LR   R+  ALQ+S+WM    + 
Sbjct: 43  RVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEHRVH 102

Query: 345 SFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHM 404
             +  D A++LDLI +V G+  AEK+F ++  +     LYGALLNCY  + ++ KA    
Sbjct: 103 EISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAEQVF 162

Query: 405 QKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGAR 464
           Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM+D  V PD ++    + +Y   
Sbjct: 163 QEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAYSVV 222

Query: 465 SDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL-- 524
           SD+ GM K L   E+   + +DW TY+  AN +IKAG+ E+A+  LRK E  VN      
Sbjct: 223 SDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQKRKH 282

Query: 525 GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWE 584
            +  L+S Y + G+K+EV RLW+L K+     N  YI+++  L+K++ +EE EK++EEWE
Sbjct: 283 AYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEKIMEEWE 342

Query: 585 SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPE 644
           +    +D R+P++L+ GY ++G++E+AE+++  ++   R+   ++W  +A GY      E
Sbjct: 343 AGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAGKME 402

Query: 645 RAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLS 704
           +A +  K A+ V +   GWRP   VL S + +L      E L++ L  L    S  G +S
Sbjct: 403 KAVEKWKRAIEVSK--PGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL----SERGHIS 461

Query: 705 NAFDELL 710
             +D+LL
Sbjct: 463 --YDQLL 461

BLAST of CmoCh04G009740 vs. Swiss-Prot
Match: PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 271.9 bits (694), Expect = 2.0e-71
Identity = 154/454 (33.92%), Positives = 251/454 (55.29%), Query Frame = 1

Query: 271 YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQ 330
           Y  R       ++ +IS +  PEL    +L+QW + GR +  +E+ R+V++LR  +R  Q
Sbjct: 58  YERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQ 117

Query: 331 ALQVSEWMRSKG-LFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLN 390
           AL+V +WM ++G  F  +  D A+QLDLIG+VRGI  AE++F  +    +  ++YG+LLN
Sbjct: 118 ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLN 177

Query: 391 CYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLP 450
            YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK   +  
Sbjct: 178 AYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRL 237

Query: 451 DNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSY 510
           D YSY I +SS G+   +  M  V ++M+S   I  +WTT+S +A  +IK G  E+A   
Sbjct: 238 DIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDA 297

Query: 511 LRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK 570
           LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Sbjct: 298 LRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVR 357

Query: 571 LEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNS 630
           +  +E AEK+ EEW      YD R+PN+L+  Y +   +E AE +  +++  G  P  ++
Sbjct: 358 MGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSST 417

Query: 631 WGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEF 690
           W I+A G+  K+    A  C++ A +  E +  WRPK  +LS   +   E       +  
Sbjct: 418 WEILAVGHTRKRCISEALTCLRNAFSA-EGSSNWRPKVLMLSGFFKLCEEESDVTSKEAV 477

Query: 691 LSSLKAVPSMDGKLSNAFDELLET-LKNNDETTA 721
           L  L+    ++ K   A  ++ E    NN E  A
Sbjct: 478 LELLRQSGDLEDKSYLALIDVDENRTVNNSEIDA 510

BLAST of CmoCh04G009740 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 260.0 bits (663), Expect = 7.8e-68
Identity = 147/457 (32.17%), Positives = 250/457 (54.70%), Query Frame = 1

Query: 266 VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR 325
           +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 326 IVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQ 385
            ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL+ + R I + E YF  +   
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPET 120

Query: 386 EEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPN 445
            +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP 
Sbjct: 121 SKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPA 180

Query: 446 VLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFF 505
           ++ E+K   V+PD+Y+Y + + +  A +D+ G+ +V++EM     ++ DWTTYS +A+ +
Sbjct: 181 MIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIY 240

Query: 506 IKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN 565
           + AG+ ++A   L++ E K  Q D   +  LI+LY  LG+  EV R+W +L+    K  N
Sbjct: 241 VDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 300

Query: 566 RDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQN 625
             Y+ M+  LVKL  L  AE L +EW+++C  YD R+ NVL+  Y+Q GLI++A ++ + 
Sbjct: 301 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 360

Query: 626 IISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKG-WRPKPSVLSSILRW 685
               G      +W I    Y++  +  RA +CM +AV++ + + G W P P  + +++ +
Sbjct: 361 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 420

Query: 686 LSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET 712
             +       +  L  LK     D   +  F+ L+ T
Sbjct: 421 FEQKKDVNGAENLLEILK--NGTDNIGAEIFEPLIRT 454

BLAST of CmoCh04G009740 vs. TrEMBL
Match: A0A0A0L7Y2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G104890 PE=4 SV=1)

HSP 1 Score: 813.9 bits (2101), Expect = 1.6e-232
Identity = 396/490 (80.82%), Positives = 443/490 (90.41%), Query Frame = 1

Query: 235 LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 294
           +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+C+RRNL+ARISPLG PE 
Sbjct: 1   MAAASAMFKILSRSSSGCTRTLRPETDAFCFVALRLYSTRRSCDRRNLYARISPLGDPEC 60

Query: 295 SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQ 354
           +VVP+L+QWI+EGR IKDFE+RRIVRDLR CRRY QAL+VSEWM SKGLFS TTRDFA+Q
Sbjct: 61  TVVPVLNQWIEEGRNIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGLFSLTTRDFAIQ 120

Query: 355 LDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 414
           LDLIG+VRG+DSAEKYF SVSNQ+EIGKLYGALLNCYVREGL+DK+L+HMQKMKEMG AS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSNQKEIGKLYGALLNCYVREGLIDKSLAHMQKMKEMGLAS 180

Query: 415 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVL 474
           SPLCYNDIMCLYLNTGQ DKVPNVLSEMK+NGVLPDN+SYRICISSYGARSD+I M  VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 475 KEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 534
           KEME QTHISMDWTTYSMVA FFIKAGMH++AM+YLRKCEDKV++DALGFNHLIS YT+L
Sbjct: 241 KEMEGQTHISMDWTTYSMVAGFFIKAGMHDKAMNYLRKCEDKVDEDALGFNHLISHYTNL 300

Query: 535 GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPN 594
           G K+EVMRLWAL KK KKQ+NRDYITMLG LVKLE LEEAE LV EWESSC+CYDFRVPN
Sbjct: 301 GHKNEVMRLWALLKKGKKQLNRDYITMLGSLVKLELLEEAENLVMEWESSCQCYDFRVPN 360

Query: 595 VLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAV 654
           V+LIGYSQ+GLIE+AEKML+NII +G IP PNSWGIIA+GYLEKQN E+AF+CMKEA+AV
Sbjct: 361 VVLIGYSQKGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALAV 420

Query: 655 QEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKN 714
           + QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLK VPSMD KL+NA DELLE + N
Sbjct: 421 KGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNNALDELLEIMAN 480

Query: 715 NDETTADALK 725
           +D  + D L+
Sbjct: 481 DDGISKDELE 490

BLAST of CmoCh04G009740 vs. TrEMBL
Match: B9SNN7_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1010390 PE=4 SV=1)

HSP 1 Score: 594.3 bits (1531), Expect = 2.0e-166
Identity = 288/452 (63.72%), Positives = 361/452 (79.87%), Query Frame = 1

Query: 252 FTRTARTET-DAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMI 311
           FT   RT++  A   +  R Y+  RT +   LFARISPLG P++S+VP+LD W+QEG+ I
Sbjct: 7   FTILKRTQSLTANAILTRRYYNKARTASN-TLFARISPLGEPDISLVPVLDNWVQEGKKI 66

Query: 312 KDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKY 371
           + FE+++I+RDLR  RRY QALQVSEWM  KG   F+  D AVQLDLIGRVRG++SAE Y
Sbjct: 67  RGFELQKIIRDLRCHRRYTQALQVSEWMNGKGQSGFSPADHAVQLDLIGRVRGLESAESY 126

Query: 372 FSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTG 431
           F ++ NQ+   K YGALLNCYVREGLVDK+L HMQKMKE+GFASSPL YND+MCLY  TG
Sbjct: 127 FQNLVNQDRNDKTYGALLNCYVREGLVDKSLYHMQKMKELGFASSPLNYNDLMCLYTRTG 186

Query: 432 QVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTY 491
           Q++KV +VLSEMK+NG+ PD +SYRIC+SS  ARSDL G+ ++L+EME+Q+HIS+DW TY
Sbjct: 187 QLEKVTDVLSEMKENGITPDLFSYRICMSSCAARSDLKGVEEILEEMENQSHISIDWVTY 246

Query: 492 SMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEVMRLWALQK-K 551
           S VA+ ++KA + E+A+ YL+KCE KVN+DALG+NHLISL  SLG KDEVMRLW L K K
Sbjct: 247 STVASIYVKASLKEKALIYLKKCEQKVNRDALGYNHLISLNASLGIKDEVMRLWGLVKTK 306

Query: 552 CKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERA 611
           CKKQVNRDYITMLG LVKLE LEEA+KL++EWESSC+CYDFRVPNVLLIGY Q+GLIE+A
Sbjct: 307 CKKQVNRDYITMLGALVKLEELEEADKLLQEWESSCQCYDFRVPNVLLIGYCQQGLIEKA 366

Query: 612 EKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLS 671
           E ML++I+   + P PNSW IIAAGY+ KQN E+AF CMKEA+ VQ +NKGWRPK +++S
Sbjct: 367 EAMLKDIVKKQKNPTPNSWAIIAAGYVNKQNMEKAFNCMKEALTVQAENKGWRPKANLIS 426

Query: 672 SILRWLSENGRYEELKEFLSSLKAVPSMDGKL 702
           SIL WL ENG  E+++ F++ L+     D ++
Sbjct: 427 SILSWLGENGDVEDVEAFVNLLETKVPKDREI 457

BLAST of CmoCh04G009740 vs. TrEMBL
Match: F6H257_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0014g02920 PE=4 SV=1)

HSP 1 Score: 583.2 bits (1502), Expect = 4.5e-163
Identity = 269/425 (63.29%), Positives = 357/425 (84.00%), Query Frame = 1

Query: 281 NLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRS 340
           NL++RISPLG+P LS+VP+LDQW++EG+ ++D E+ RI+RDLR+ +RY QAL+VSEWM S
Sbjct: 38  NLYSRISPLGTPNLSLVPVLDQWVEEGKKVRDVELHRIIRDLRSRKRYAQALEVSEWMSS 97

Query: 341 KGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKA 400
           K L  F+    AVQLDLIG+VRG++SAE YF+++S +E+I K+YGALLNCYVRE ++DK+
Sbjct: 98  KELCPFSPSARAVQLDLIGQVRGLESAENYFNNMSAEEKIDKMYGALLNCYVRERVIDKS 157

Query: 401 LSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISS 460
           LSH+QKMKE+GFAS+PL YN +MCLY+NT Q++K+P+VLSEM++NG+ PDN+SYR+CI+S
Sbjct: 158 LSHLQKMKELGFASTPLPYNGLMCLYINTDQLEKIPDVLSEMQENGISPDNFSYRLCINS 217

Query: 461 YGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQD 520
           YGARSDL  M K+L+EMES++HI +DW TYSMVANF+IKAG++E+A+ +L+K E K+++D
Sbjct: 218 YGARSDLNSMEKILEEMESKSHIHIDWMTYSMVANFYIKAGLNEKALFFLKKAETKLHKD 277

Query: 521 ALGFNHLISLYTSLGRKDEVMRLWALQKKC-KKQVNRDYITMLGCLVKLEFLEEAEKLVE 580
            LG+NHLISLY SLG K E+MRLW  +K   KK +NRDYITMLG LVKL  LE+ E L++
Sbjct: 278 PLGYNHLISLYASLGSKAEMMRLWERRKTASKKLINRDYITMLGSLVKLGELEDTEALLK 337

Query: 581 EWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQ 640
           EWESS  CYDFRVPN LLIG+ Q+GLIE+AE ML++I+ +G+ P PNSW I+AAGY+EKQ
Sbjct: 338 EWESSGNCYDFRVPNTLLIGFCQKGLIEKAESMLRDIVEEGKTPTPNSWSIVAAGYIEKQ 397

Query: 641 NPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDG 700
           N E+AF+CMKEA+AV  +NKGWRPKP V+SSIL WL +N   EE++ F+S+LKAV  MD 
Sbjct: 398 NMEKAFECMKEAIAVLAENKGWRPKPKVISSILSWLGDNRDVEEVETFVSALKAVIPMDR 457

Query: 701 KLSNA 705
           ++ +A
Sbjct: 458 EMYHA 462

BLAST of CmoCh04G009740 vs. TrEMBL
Match: M5WGV8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004720mg PE=4 SV=1)

HSP 1 Score: 575.9 bits (1483), Expect = 7.2e-161
Identity = 281/468 (60.04%), Positives = 366/468 (78.21%), Query Frame = 1

Query: 238 ALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVV 297
           A A+FK+L+   +     +  + D           AR T N RNLF+RISPLG P LSVV
Sbjct: 2   AFAVFKLLKRHQNLAADVSPIKFDC---------RARHTANTRNLFSRISPLGDPSLSVV 61

Query: 298 PILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDL 357
           P+LDQW+QEG  +  FE++RIVRDLR  +RY  AL VSEWM SKGL  F   D AVQLDL
Sbjct: 62  PVLDQWVQEGGKVNYFELQRIVRDLRARKRYRHALDVSEWMSSKGLCQFLPGDHAVQLDL 121

Query: 358 IGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPL 417
           IGRVRG+D+AE  FSS+S+ E+  K YGALLNCYVREGL+DK+LS+MQKMKE+GFA+S L
Sbjct: 122 IGRVRGLDAAESCFSSLSD-EDTSKSYGALLNCYVREGLIDKSLSYMQKMKELGFATS-L 181

Query: 418 CYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEM 477
            YNDIM LY++TGQ +K+P+VLSEMK+ GV PDN+SYRIC+SSYG RSD+  M KVL+EM
Sbjct: 182 NYNDIMRLYIHTGQPEKIPDVLSEMKEEGVSPDNFSYRICMSSYGMRSDISSMEKVLEEM 241

Query: 478 ESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRK 537
           E + HISMDW TY++VAN +IKAG+H++A+ YL K E+KVN+DALG+NHLISLY SLG K
Sbjct: 242 EREPHISMDWLTYALVANLYIKAGLHDKALIYLEKSEEKVNKDALGYNHLISLYASLGCK 301

Query: 538 DEVMRLWALQK-KCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVL 597
           D++MRLW+L+K KCKKQ+NRDYITMLG LVKL  LEE +KL++EWE SC  YDFRVPN+L
Sbjct: 302 DDMMRLWSLEKTKCKKQINRDYITMLGSLVKLGELEETKKLLDEWELSCLSYDFRVPNIL 361

Query: 598 LIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQE 657
           LIGY Q+GL+E+AE  L++I+  G+ P PNSW I+AAGY++KQ  ++AF+CM EA+ ++ 
Sbjct: 362 LIGYCQKGLVEQAEDTLRDIVKKGKTPTPNSWAILAAGYVDKQKMQKAFECMTEALNLRA 421

Query: 658 QNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNA 705
           +N GWRPKP V+SS+L W+ +NG  E+++ F+S +K V +++ ++ +A
Sbjct: 422 RNTGWRPKPGVVSSVLSWIGDNGDIEQVEAFVSLMKTVITVNREMYHA 458

BLAST of CmoCh04G009740 vs. TrEMBL
Match: A0A067G664_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011236mg PE=4 SV=1)

HSP 1 Score: 571.6 bits (1472), Expect = 1.4e-159
Identity = 277/461 (60.09%), Positives = 362/461 (78.52%), Query Frame = 1

Query: 269 RLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRY 328
           R Y A +   R NL++RISPLG P++S+ P+LDQW+ EG+ I + E++R++R LR+ +R+
Sbjct: 26  RAYRAVKPVARNNLYSRISPLGDPDVSLTPVLDQWVLEGQKISELELQRVIRQLRSRKRF 85

Query: 329 GQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALL 388
             ALQVSEWM  +GL +F+  D AVQLDLIG+VRG++SAE YF+S+++++++ KLYGALL
Sbjct: 86  KHALQVSEWMSGQGL-AFSVHDHAVQLDLIGKVRGLESAETYFNSLNDEDKVDKLYGALL 145

Query: 389 NCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVL 448
           NCYVREGLVD++LS MQKMKEMG   S L YN IMCLY NTGQ +K+P+VL +MK+NGV 
Sbjct: 146 NCYVREGLVDESLSLMQKMKEMGSFGSALNYNGIMCLYTNTGQHEKIPDVLLDMKENGVP 205

Query: 449 PDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMS 508
           PDN+SYRICI+SYGARS+L  M  VL+EMESQ+HISMDW TYS VAN++I AG+ E+A+ 
Sbjct: 206 PDNFSYRICINSYGARSELSSMENVLQEMESQSHISMDWGTYSTVANYYIIAGLKEKAII 265

Query: 509 YLRKCEDKV--NQDALGFNHLISLYTSLGRKDEVMRLWALQK-KCKKQVNRDYITMLGCL 568
           YL+KCED V  ++DALG+NHLIS Y SLG KDE+M+ W LQK KCKKQ+NRDYITMLG L
Sbjct: 266 YLKKCEDIVSKSKDALGYNHLISHYASLGNKDEMMKFWGLQKIKCKKQLNRDYITMLGSL 325

Query: 569 VKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPP 628
           VK+  LEEAEK++EEWE SC CYDFRVPN++L+GYSQ+G+IE+A+ +L+ I+  G+ P P
Sbjct: 326 VKIGELEEAEKMLEEWELSCYCYDFRVPNIILLGYSQKGMIEKADAVLKEIVKKGKTPTP 385

Query: 629 NSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELK 688
           NSW IIAAGY +K N E+AF+CMKEA+AV E+NK WRPKPS++SSIL WL +N   EE++
Sbjct: 386 NSWSIIAAGYADKNNMEKAFECMKEALAVHEENKFWRPKPSLVSSILDWLGDNRDVEEVE 445

Query: 689 EFLSSLK----------AVPSMDGKLSNAFDELLETLKNND 717
            F+SSLK          A+     +     D LLE++K +D
Sbjct: 446 AFVSSLKIKVQKRNMYHALTEAHIRSGQEVDGLLESMKADD 485

BLAST of CmoCh04G009740 vs. TAIR10
Match: AT4G21705.1 (AT4G21705.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 422.2 bits (1084), Expect = 6.7e-118
Identity = 212/482 (43.98%), Positives = 315/482 (65.35%), Query Frame = 1

Query: 266 VALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNC 325
           +A R Y   R   +  L+++ISPLG P+ SV P L  W+Q G+ +   E+ RIV DLR  
Sbjct: 12  IASRYYYTNRV-KKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAELIRIVHDLRRR 71

Query: 326 RRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYG 385
           +R+  AL+VS+WM   G+  F+  + AV LDLIGRV G  +AE+YF ++  Q +  K YG
Sbjct: 72  KRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLKEQYKNDKTYG 131

Query: 386 ALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDN 445
           ALLNCYVR+  V+K+L H +KMKEMGF +S L YN+IMCLY N GQ +KVP VL EMK+ 
Sbjct: 132 ALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKVPKVLEEMKEE 191

Query: 446 GVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQ 505
            V PDNYSYRICI+++GA  DL  +   L++ME +  I+MDW TY++ A F+I  G  ++
Sbjct: 192 NVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAKFYIDGGDCDR 251

Query: 506 AMSYLRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKK-CKKQVNRDYITMLG 565
           A+  L+  E+++  +D  G+NHLI+LY  LG+K EV+RLW L+K  CK+++N+DY+T+L 
Sbjct: 252 AVELLKMSENRLEKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCKRRINQDYLTVLQ 311

Query: 566 CLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIP 625
            LVK++ L EAE+++ EW+SS  CYDFRVPN ++ GY  + + E+AE ML+++   G+  
Sbjct: 312 SLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEAMLEDLARRGKAT 371

Query: 626 PPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEE 685
            P SW ++A  Y EK   E AFKCMK A+ V+  ++ WRP  ++++S+L W+ + G  +E
Sbjct: 372 TPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSVLSWVGDEGSLKE 431

Query: 686 LKEFLSSLKAVPSMDGKLSNA------------FDELLETLKNN----DETTADALKKSQ 730
           ++ F++SL+    ++ ++ +A             D LL+ +K++    DE T   L    
Sbjct: 432 VESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEIDEETTVILSTRS 491

BLAST of CmoCh04G009740 vs. TAIR10
Match: AT3G15370.1 (AT3G15370.1 expansin 12)

HSP 1 Score: 305.1 bits (780), Expect = 1.2e-82
Identity = 131/195 (67.18%), Positives = 161/195 (82.56%), Query Frame = 1

Query: 36  WLDAHATFYGADQNPTSLGGACGYDNTFHAGFGINTAAVSGALFRGGEACGACFLVICNY 95
           W+ AHAT+YG + +P SLGGACGYDN +HAGFG +TAA+SG LFR GE+CG C+ V C++
Sbjct: 27  WIRAHATYYGVNDSPASLGGACGYDNPYHAGFGAHTAALSGELFRSGESCGGCYQVRCDF 86

Query: 96  NVDPKWCLRRRAVAITATNFCPSNNNGGWCDPPRAHFDMSSPAFLTIARQGNEGIVPVLY 155
             DPKWCLR  AV +TATNFCP+NNN GWC+ PR HFDMSSPAF  IAR+GNEGIVPV Y
Sbjct: 87  PADPKWCLRGAAVTVTATNFCPTNNNNGWCNLPRHHFDMSSPAFFRIARRGNEGIVPVFY 146

Query: 156 KRVSCRRKGGVRFTLRGQSNFNMVMISNVGGSGDIKAAWVKGSRTRTWMLMHRNWGANWQ 215
           +RV C+R+GGVRFT+RGQ NFNMVMISNVGG G +++  V+GS+ +TW+ M RNWGANWQ
Sbjct: 147 RRVGCKRRGGVRFTMRGQGNFNMVMISNVGGGGSVRSVAVRGSKGKTWLQMTRNWGANWQ 206

Query: 216 ANVDLRNQIMSFKVS 231
           ++ DLR Q +SFKV+
Sbjct: 207 SSGDLRGQRLSFKVT 221

BLAST of CmoCh04G009740 vs. TAIR10
Match: AT2G20710.1 (AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 295.8 bits (756), Expect = 7.3e-80
Identity = 153/427 (35.83%), Positives = 249/427 (58.31%), Query Frame = 1

Query: 285 RISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLF 344
           R++  G P  S++ +LD W+ +G ++K  E+  I++ LR   R+  ALQ+S+WM    + 
Sbjct: 43  RVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMSEHRVH 102

Query: 345 SFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHM 404
             +  D A++LDLI +V G+  AEK+F ++  +     LYGALLNCY  + ++ KA    
Sbjct: 103 EISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHKAEQVF 162

Query: 405 QKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGAR 464
           Q+MKE+GF    L YN ++ LY+ TG+   V  +L EM+D  V PD ++    + +Y   
Sbjct: 163 QEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLHAYSVV 222

Query: 465 SDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDAL-- 524
           SD+ GM K L   E+   + +DW TY+  AN +IKAG+ E+A+  LRK E  VN      
Sbjct: 223 SDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMVNAQKRKH 282

Query: 525 GFNHLISLYTSLGRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWE 584
            +  L+S Y + G+K+EV RLW+L K+     N  YI+++  L+K++ +EE EK++EEWE
Sbjct: 283 AYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEKIMEEWE 342

Query: 585 SSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPE 644
           +    +D R+P++L+ GY ++G++E+AE+++  ++   R+   ++W  +A GY      E
Sbjct: 343 AGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYKMAGKME 402

Query: 645 RAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLS 704
           +A +  K A+ V +   GWRP   VL S + +L      E L++ L  L    S  G +S
Sbjct: 403 KAVEKWKRAIEVSK--PGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLL----SERGHIS 461

Query: 705 NAFDELL 710
             +D+LL
Sbjct: 463 --YDQLL 461

BLAST of CmoCh04G009740 vs. TAIR10
Match: AT1G02150.1 (AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 271.9 bits (694), Expect = 1.1e-72
Identity = 154/454 (33.92%), Positives = 251/454 (55.29%), Query Frame = 1

Query: 271 YSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQ 330
           Y  R       ++ +IS +  PEL    +L+QW + GR +  +E+ R+V++LR  +R  Q
Sbjct: 58  YERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQ 117

Query: 331 ALQVSEWMRSKG-LFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLN 390
           AL+V +WM ++G  F  +  D A+QLDLIG+VRGI  AE++F  +    +  ++YG+LLN
Sbjct: 118 ALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLN 177

Query: 391 CYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLP 450
            YVR    +KA + +  M++ G+A  PL +N +M LY+N  + DKV  ++ EMK   +  
Sbjct: 178 AYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLREYDKVDAMVFEMKQKDIRL 237

Query: 451 DNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSY 510
           D YSY I +SS G+   +  M  V ++M+S   I  +WTT+S +A  +IK G  E+A   
Sbjct: 238 DIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTFSTMATMYIKMGETEKAEDA 297

Query: 511 LRKCEDKV-NQDALGFNHLISLYTSLGRKDEVMRLWALQKKCKKQV-NRDYITMLGCLVK 570
           LRK E ++  ++ + +++L+SLY SLG K E+ R+W + K     + N  Y  ++  LV+
Sbjct: 298 LRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVR 357

Query: 571 LEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNS 630
           +  +E AEK+ EEW      YD R+PN+L+  Y +   +E AE +  +++  G  P  ++
Sbjct: 358 MGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSST 417

Query: 631 WGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEF 690
           W I+A G+  K+    A  C++ A +  E +  WRPK  +LS   +   E       +  
Sbjct: 418 WEILAVGHTRKRCISEALTCLRNAFSA-EGSSNWRPKVLMLSGFFKLCEEESDVTSKEAV 477

Query: 691 LSSLKAVPSMDGKLSNAFDELLET-LKNNDETTA 721
           L  L+    ++ K   A  ++ E    NN E  A
Sbjct: 478 LELLRQSGDLEDKSYLALIDVDENRTVNNSEIDA 510

BLAST of CmoCh04G009740 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 260.0 bits (663), Expect = 4.4e-69
Identity = 147/457 (32.17%), Positives = 250/457 (54.70%), Query Frame = 1

Query: 266 VALRLYSARRTCNRRN--------LFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRR 325
           +A+R  S  R   +R+        L+ R+   G  E+ V   L+Q+++  + +  +E+  
Sbjct: 1   MAMRHLSRSRDVTKRSTKKYIEEPLYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGD 60

Query: 326 IVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQ 385
            ++ LRN   Y  AL++SE M  +G+ + T  D A+ LDL+ + R I + E YF  +   
Sbjct: 61  TIKKLRNRGLYYPALKLSEVMEERGM-NKTVSDQAIHLDLVAKAREITAGENYFVDLPET 120

Query: 386 EEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPN 445
            +    YG+LLNCY +E L +KA   + KMKE+    S + YN +M LY  TG+ +KVP 
Sbjct: 121 SKTELTYGSLLNCYCKELLTEKAEGLLNKMKELNITPSSMSYNSLMTLYTKTGETEKVPA 180

Query: 446 VLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFF 505
           ++ E+K   V+PD+Y+Y + + +  A +D+ G+ +V++EM     ++ DWTTYS +A+ +
Sbjct: 181 MIQELKAENVMPDSYTYNVWMRALAATNDISGVERVIEEMNRDGRVAPDWTTYSNMASIY 240

Query: 506 IKAGMHEQAMSYLRKCEDKVNQ-DALGFNHLISLYTSLGRKDEVMRLW-ALQKKCKKQVN 565
           + AG+ ++A   L++ E K  Q D   +  LI+LY  LG+  EV R+W +L+    K  N
Sbjct: 241 VDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSN 300

Query: 566 RDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQN 625
             Y+ M+  LVKL  L  AE L +EW+++C  YD R+ NVL+  Y+Q GLI++A ++ + 
Sbjct: 301 VAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEK 360

Query: 626 IISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKG-WRPKPSVLSSILRW 685
               G      +W I    Y++  +  RA +CM +AV++ + + G W P P  + +++ +
Sbjct: 361 APRRGGKLNAKTWEIFMDYYVKSGDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSY 420

Query: 686 LSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLET 712
             +       +  L  LK     D   +  F+ L+ T
Sbjct: 421 FEQKKDVNGAENLLEILK--NGTDNIGAEIFEPLIRT 454

BLAST of CmoCh04G009740 vs. NCBI nr
Match: gi|449431834|ref|XP_004133705.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 813.9 bits (2101), Expect = 2.2e-232
Identity = 396/490 (80.82%), Positives = 443/490 (90.41%), Query Frame = 1

Query: 235 LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 294
           +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+C+RRNL+ARISPLG PE 
Sbjct: 1   MAAASAMFKILSRSSSGCTRTLRPETDAFCFVALRLYSTRRSCDRRNLYARISPLGDPEC 60

Query: 295 SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQ 354
           +VVP+L+QWI+EGR IKDFE+RRIVRDLR CRRY QAL+VSEWM SKGLFS TTRDFA+Q
Sbjct: 61  TVVPVLNQWIEEGRNIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGLFSLTTRDFAIQ 120

Query: 355 LDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 414
           LDLIG+VRG+DSAEKYF SVSNQ+EIGKLYGALLNCYVREGL+DK+L+HMQKMKEMG AS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSNQKEIGKLYGALLNCYVREGLIDKSLAHMQKMKEMGLAS 180

Query: 415 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVL 474
           SPLCYNDIMCLYLNTGQ DKVPNVLSEMK+NGVLPDN+SYRICISSYGARSD+I M  VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 475 KEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 534
           KEME QTHISMDWTTYSMVA FFIKAGMH++AM+YLRKCEDKV++DALGFNHLIS YT+L
Sbjct: 241 KEMEGQTHISMDWTTYSMVAGFFIKAGMHDKAMNYLRKCEDKVDEDALGFNHLISHYTNL 300

Query: 535 GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPN 594
           G K+EVMRLWAL KK KKQ+NRDYITMLG LVKLE LEEAE LV EWESSC+CYDFRVPN
Sbjct: 301 GHKNEVMRLWALLKKGKKQLNRDYITMLGSLVKLELLEEAENLVMEWESSCQCYDFRVPN 360

Query: 595 VLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAV 654
           V+LIGYSQ+GLIE+AEKML+NII +G IP PNSWGIIA+GYLEKQN E+AF+CMKEA+AV
Sbjct: 361 VVLIGYSQKGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALAV 420

Query: 655 QEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKN 714
           + QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLK VPSMD KL+NA DELLE + N
Sbjct: 421 KGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNNALDELLEIMAN 480

Query: 715 NDETTADALK 725
           +D  + D L+
Sbjct: 481 DDGISKDELE 490

BLAST of CmoCh04G009740 vs. NCBI nr
Match: gi|659102689|ref|XP_008452263.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-like [Cucumis melo])

HSP 1 Score: 794.3 bits (2050), Expect = 1.8e-226
Identity = 386/482 (80.08%), Positives = 433/482 (89.83%), Query Frame = 1

Query: 235 LAAALAMFKILRSFSSGFTRTARTETDAFCFVALRLYSARRTCNRRNLFARISPLGSPEL 294
           +AAA AMFKIL   SSG TRT R ETDAFCFVALRLYS RR+CNRR L+A ISPLG P+ 
Sbjct: 1   MAAASAMFKILSRSSSGCTRTPRPETDAFCFVALRLYSTRRSCNRRKLYAMISPLGDPDS 60

Query: 295 SVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQ 354
           SVVP+L+QWI+EGR IKDFE+RRIVRDLR CRRY QAL+VSEWM SKG FS TTRDFA+Q
Sbjct: 61  SVVPVLNQWIKEGRKIKDFELRRIVRDLRTCRRYRQALEVSEWMCSKGRFSLTTRDFAIQ 120

Query: 355 LDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFAS 414
           LDLIG+VRG+DSAEKYF SVS Q+EIGKLYG+LLNCYVREGL+DK+L+HMQKMKEMGFAS
Sbjct: 121 LDLIGQVRGLDSAEKYFGSVSKQKEIGKLYGSLLNCYVREGLIDKSLAHMQKMKEMGFAS 180

Query: 415 SPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVL 474
           SPLCYNDIMCLYLNTGQ DKVPNVLSEMK+NGVLPDN+SYRICISSYGARSD+I M  VL
Sbjct: 181 SPLCYNDIMCLYLNTGQADKVPNVLSEMKENGVLPDNFSYRICISSYGARSDVISMENVL 240

Query: 475 KEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSL 534
           KEMESQTHISMDW TYSMVA FFIK  MH++A +YLRKCED+V+QDALGFNHLIS YT+L
Sbjct: 241 KEMESQTHISMDWITYSMVAGFFIKVVMHDKARNYLRKCEDRVDQDALGFNHLISHYTNL 300

Query: 535 GRKDEVMRLWALQKKCKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPN 594
           G K+EVMRLWALQKK KKQ+NRDYITMLG LVKL+ LEEAE LV EWESSC+C DFRVPN
Sbjct: 301 GHKNEVMRLWALQKKAKKQLNRDYITMLGSLVKLDLLEEAENLVMEWESSCQCNDFRVPN 360

Query: 595 VLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAV 654
           V+LIGYSQ GLIE+AEKML+NII +G IP PNSWGIIA+GYLEKQN E+AF+CMKEA+AV
Sbjct: 361 VVLIGYSQNGLIEKAEKMLRNIIVNGMIPSPNSWGIIASGYLEKQNLEKAFECMKEALAV 420

Query: 655 QEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDGKLSNAFDELLETLKN 714
           + QNK WRPKP+VLSSILRWLSEN RYEE+KEF+SSLK VPSMD KL++A DELLE ++N
Sbjct: 421 KGQNKVWRPKPNVLSSILRWLSENRRYEEMKEFMSSLKTVPSMDEKLNSALDELLEIMEN 480

Query: 715 ND 717
           +D
Sbjct: 481 DD 482

BLAST of CmoCh04G009740 vs. NCBI nr
Match: gi|255573349|ref|XP_002527601.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial isoform X1 [Ricinus communis])

HSP 1 Score: 594.3 bits (1531), Expect = 2.8e-166
Identity = 288/452 (63.72%), Positives = 361/452 (79.87%), Query Frame = 1

Query: 252 FTRTARTET-DAFCFVALRLYSARRTCNRRNLFARISPLGSPELSVVPILDQWIQEGRMI 311
           FT   RT++  A   +  R Y+  RT +   LFARISPLG P++S+VP+LD W+QEG+ I
Sbjct: 7   FTILKRTQSLTANAILTRRYYNKARTASN-TLFARISPLGEPDISLVPVLDNWVQEGKKI 66

Query: 312 KDFEMRRIVRDLRNCRRYGQALQVSEWMRSKGLFSFTTRDFAVQLDLIGRVRGIDSAEKY 371
           + FE+++I+RDLR  RRY QALQVSEWM  KG   F+  D AVQLDLIGRVRG++SAE Y
Sbjct: 67  RGFELQKIIRDLRCHRRYTQALQVSEWMNGKGQSGFSPADHAVQLDLIGRVRGLESAESY 126

Query: 372 FSSVSNQEEIGKLYGALLNCYVREGLVDKALSHMQKMKEMGFASSPLCYNDIMCLYLNTG 431
           F ++ NQ+   K YGALLNCYVREGLVDK+L HMQKMKE+GFASSPL YND+MCLY  TG
Sbjct: 127 FQNLVNQDRNDKTYGALLNCYVREGLVDKSLYHMQKMKELGFASSPLNYNDLMCLYTRTG 186

Query: 432 QVDKVPNVLSEMKDNGVLPDNYSYRICISSYGARSDLIGMLKVLKEMESQTHISMDWTTY 491
           Q++KV +VLSEMK+NG+ PD +SYRIC+SS  ARSDL G+ ++L+EME+Q+HIS+DW TY
Sbjct: 187 QLEKVTDVLSEMKENGITPDLFSYRICMSSCAARSDLKGVEEILEEMENQSHISIDWVTY 246

Query: 492 SMVANFFIKAGMHEQAMSYLRKCEDKVNQDALGFNHLISLYTSLGRKDEVMRLWALQK-K 551
           S VA+ ++KA + E+A+ YL+KCE KVN+DALG+NHLISL  SLG KDEVMRLW L K K
Sbjct: 247 STVASIYVKASLKEKALIYLKKCEQKVNRDALGYNHLISLNASLGIKDEVMRLWGLVKTK 306

Query: 552 CKKQVNRDYITMLGCLVKLEFLEEAEKLVEEWESSCECYDFRVPNVLLIGYSQRGLIERA 611
           CKKQVNRDYITMLG LVKLE LEEA+KL++EWESSC+CYDFRVPNVLLIGY Q+GLIE+A
Sbjct: 307 CKKQVNRDYITMLGALVKLEELEEADKLLQEWESSCQCYDFRVPNVLLIGYCQQGLIEKA 366

Query: 612 EKMLQNIISDGRIPPPNSWGIIAAGYLEKQNPERAFKCMKEAVAVQEQNKGWRPKPSVLS 671
           E ML++I+   + P PNSW IIAAGY+ KQN E+AF CMKEA+ VQ +NKGWRPK +++S
Sbjct: 367 EAMLKDIVKKQKNPTPNSWAIIAAGYVNKQNMEKAFNCMKEALTVQAENKGWRPKANLIS 426

Query: 672 SILRWLSENGRYEELKEFLSSLKAVPSMDGKL 702
           SIL WL ENG  E+++ F++ L+     D ++
Sbjct: 427 SILSWLGENGDVEDVEAFVNLLETKVPKDREI 457

BLAST of CmoCh04G009740 vs. NCBI nr
Match: gi|225461407|ref|XP_002282230.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial [Vitis vinifera])

HSP 1 Score: 583.2 bits (1502), Expect = 6.5e-163
Identity = 269/425 (63.29%), Positives = 357/425 (84.00%), Query Frame = 1

Query: 281 NLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRS 340
           NL++RISPLG+P LS+VP+LDQW++EG+ ++D E+ RI+RDLR+ +RY QAL+VSEWM S
Sbjct: 38  NLYSRISPLGTPNLSLVPVLDQWVEEGKKVRDVELHRIIRDLRSRKRYAQALEVSEWMSS 97

Query: 341 KGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKA 400
           K L  F+    AVQLDLIG+VRG++SAE YF+++S +E+I K+YGALLNCYVRE ++DK+
Sbjct: 98  KELCPFSPSARAVQLDLIGQVRGLESAENYFNNMSAEEKIDKMYGALLNCYVRERVIDKS 157

Query: 401 LSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISS 460
           LSH+QKMKE+GFAS+PL YN +MCLY+NT Q++K+P+VLSEM++NG+ PDN+SYR+CI+S
Sbjct: 158 LSHLQKMKELGFASTPLPYNGLMCLYINTDQLEKIPDVLSEMQENGISPDNFSYRLCINS 217

Query: 461 YGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQD 520
           YGARSDL  M K+L+EMES++HI +DW TYSMVANF+IKAG++E+A+ +L+K E K+++D
Sbjct: 218 YGARSDLNSMEKILEEMESKSHIHIDWMTYSMVANFYIKAGLNEKALFFLKKAETKLHKD 277

Query: 521 ALGFNHLISLYTSLGRKDEVMRLWALQKKC-KKQVNRDYITMLGCLVKLEFLEEAEKLVE 580
            LG+NHLISLY SLG K E+MRLW  +K   KK +NRDYITMLG LVKL  LE+ E L++
Sbjct: 278 PLGYNHLISLYASLGSKAEMMRLWERRKTASKKLINRDYITMLGSLVKLGELEDTEALLK 337

Query: 581 EWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQ 640
           EWESS  CYDFRVPN LLIG+ Q+GLIE+AE ML++I+ +G+ P PNSW I+AAGY+EKQ
Sbjct: 338 EWESSGNCYDFRVPNTLLIGFCQKGLIEKAESMLRDIVEEGKTPTPNSWSIVAAGYIEKQ 397

Query: 641 NPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDG 700
           N E+AF+CMKEA+AV  +NKGWRPKP V+SSIL WL +N   EE++ F+S+LKAV  MD 
Sbjct: 398 NMEKAFECMKEAIAVLAENKGWRPKPKVISSILSWLGDNRDVEEVETFVSALKAVIPMDR 457

Query: 701 KLSNA 705
           ++ +A
Sbjct: 458 EMYHA 462

BLAST of CmoCh04G009740 vs. NCBI nr
Match: gi|302143027|emb|CBI20322.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 583.2 bits (1502), Expect = 6.5e-163
Identity = 269/425 (63.29%), Positives = 357/425 (84.00%), Query Frame = 1

Query: 281 NLFARISPLGSPELSVVPILDQWIQEGRMIKDFEMRRIVRDLRNCRRYGQALQVSEWMRS 340
           NL++RISPLG+P LS+VP+LDQW++EG+ ++D E+ RI+RDLR+ +RY QAL+VSEWM S
Sbjct: 38  NLYSRISPLGTPNLSLVPVLDQWVEEGKKVRDVELHRIIRDLRSRKRYAQALEVSEWMSS 97

Query: 341 KGLFSFTTRDFAVQLDLIGRVRGIDSAEKYFSSVSNQEEIGKLYGALLNCYVREGLVDKA 400
           K L  F+    AVQLDLIG+VRG++SAE YF+++S +E+I K+YGALLNCYVRE ++DK+
Sbjct: 98  KELCPFSPSARAVQLDLIGQVRGLESAENYFNNMSAEEKIDKMYGALLNCYVRERVIDKS 157

Query: 401 LSHMQKMKEMGFASSPLCYNDIMCLYLNTGQVDKVPNVLSEMKDNGVLPDNYSYRICISS 460
           LSH+QKMKE+GFAS+PL YN +MCLY+NT Q++K+P+VLSEM++NG+ PDN+SYR+CI+S
Sbjct: 158 LSHLQKMKELGFASTPLPYNGLMCLYINTDQLEKIPDVLSEMQENGISPDNFSYRLCINS 217

Query: 461 YGARSDLIGMLKVLKEMESQTHISMDWTTYSMVANFFIKAGMHEQAMSYLRKCEDKVNQD 520
           YGARSDL  M K+L+EMES++HI +DW TYSMVANF+IKAG++E+A+ +L+K E K+++D
Sbjct: 218 YGARSDLNSMEKILEEMESKSHIHIDWMTYSMVANFYIKAGLNEKALFFLKKAETKLHKD 277

Query: 521 ALGFNHLISLYTSLGRKDEVMRLWALQKKC-KKQVNRDYITMLGCLVKLEFLEEAEKLVE 580
            LG+NHLISLY SLG K E+MRLW  +K   KK +NRDYITMLG LVKL  LE+ E L++
Sbjct: 278 PLGYNHLISLYASLGSKAEMMRLWERRKTASKKLINRDYITMLGSLVKLGELEDTEALLK 337

Query: 581 EWESSCECYDFRVPNVLLIGYSQRGLIERAEKMLQNIISDGRIPPPNSWGIIAAGYLEKQ 640
           EWESS  CYDFRVPN LLIG+ Q+GLIE+AE ML++I+ +G+ P PNSW I+AAGY+EKQ
Sbjct: 338 EWESSGNCYDFRVPNTLLIGFCQKGLIEKAESMLRDIVEEGKTPTPNSWSIVAAGYIEKQ 397

Query: 641 NPERAFKCMKEAVAVQEQNKGWRPKPSVLSSILRWLSENGRYEELKEFLSSLKAVPSMDG 700
           N E+AF+CMKEA+AV  +NKGWRPKP V+SSIL WL +N   EE++ F+S+LKAV  MD 
Sbjct: 398 NMEKAFECMKEAIAVLAENKGWRPKPKVISSILSWLGDNRDVEEVETFVSALKAVIPMDR 457

Query: 701 KLSNA 705
           ++ +A
Sbjct: 458 EMYHA 462

The following BLAST results are available for this feature:

Match Name   E-value   Identity (%)  Description
PP334_ARATH  1.2e-116  43.98         Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
EXP12_ARATH  2.1e-81   67.18         Expansin-A12 OS=Arabidopsis thaliana GN=EXPA12 PE=2 SV=1
PP166_ARATH  1.3e-78   35.83         Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
PPR3_ARATH   2.0e-71   33.92         Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana GN...
PPR86_ARATH  7.8e-68   32.17         Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN...

Match Name        E-value   Identity (%)  Description
A0A0A0L7Y2_CUCSA  1.6e-232  80.82         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G104890 PE=4 SV=1
B9SNN7_RICCO      2.0e-166  63.72         Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO...
F6H257_VITVI      4.5e-163  63.29         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0014g02920 PE=4 SV=...
M5WGV8_PRUPE      7.2e-161  60.04         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004720mg PE=4 SV=1
A0A067G664_CITSI  1.4e-159  60.09         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g011236mg PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT4G21705.1  6.7e-118  43.98         Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G15370.1  1.2e-82   67.18         expansin 12
AT2G20710.1  7.3e-80   35.83         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1  1.1e-72   33.92         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1  4.4e-69   32.17         Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name                        E-value   Identity (%)  Description
gi|449431834|ref|XP_004133705.1|  2.2e-232  80.82         PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-...
gi|659102689|ref|XP_008452263.1|  1.8e-226  80.08         PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial-...
gi|255573349|ref|XP_002527601.1|  2.8e-166  63.72         PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ...
gi|225461407|ref|XP_002282230.1|  6.5e-163  63.29         PREDICTED: pentatricopeptide repeat-containing protein At4g21705, mitochondrial ...
gi|302143027|emb|CBI20322.3|      6.5e-163  63.29         unnamed protein product [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR002963  Expansin
IPR007112  Expansin/allergen_DPBB_dom
IPR007117  Expansin_CBD
IPR007118  Expan_Lol_pI
IPR009009  RlpA-like_DPBB
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Biological Process
Term        Definition
GO:0009664  plant-type cell wall organization
Vocabulary: Cellular Component
Term        Definition
GO:0005576  extracellular region
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009664 plant-type cell wall organization
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0005576 extracellular region
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G009740.1  CmoCh04G009740.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 384..412  score: 2.2E-5
    coord: 558..583  score: 0.3
    coord: 594..620  score: 0
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 418..461  score: 2.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 383..412  score: 8.7E-5
    coord: 418..450  score: 7.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 589..623  score: 8.331
    coord: 664..694  score: 5.514
    coord: 520..554  score: 6.665
    coord: 380..414  score: 9.098
    coord: 486..516  score: 6.675
    coord: 311..345  score: 5.338
    coord: 415..449  score: 9.701
    coord: 450..484  score: 6
IPR002963  Expansin  PRINTS  PR01226  EXPANSIN
    coord: 122..139  score: 1.4E-41
    coord: 95..106  score: 1.4E-41
    coord: 108..118  score: 1.4E-41
    coord: 66..80  score: 1.4E-41
    coord: 164..176  score: 1.4E-41
    coord: 176..197  score: 1.4E-41
    coord: 212..233  score: 1.4
IPR007112  Expansin/pollen allergen, DPBB domain  SMART  SM00837  dpbb_1
    coord: 71..155  score: 3.6
IPR007112  Expansin/pollen allergen, DPBB domain  PROFILE  PS50842  EXPANSIN_EG45
    coord: 54..165  score: 2
IPR007117  Expansin, cellulose-binding-like domain  GENE3D  G3DSA:2.60.40.760
    coord: 160..229  score: 1.3
IPR007117  Expansin, cellulose-binding-like domain  PFAM  PF01357  Pollen_allerg_1
    coord: 166..229  score: 5.2
IPR007117  Expansin, cellulose-binding-like domain  PROFILE  PS50843  EXPANSIN_CBD
    coord: 175..230  score: 12
IPR007117  Expansin, cellulose-binding-like domain  unknown  SSF49590  PHL pollen allergen
    coord: 162..229  score: 9.68
IPR007118  Expansin/Lol pI  PRINTS  PR01225  EXPANSNFAMLY
    coord: 203..217  score: 1.8E-28
    coord: 75..93  score: 1.8E-28
    coord: 148..164  score: 1.8E-28
    coord: 35..50  score: 1.8E-28
    coord: 53..71  score: 1.8
IPR009009  RlpA-like protein, double-psi beta-barrel domain  GENE3D  G3DSA:2.40.40.10
    coord: 21..159  score: 9.8
IPR009009  RlpA-like protein, double-psi beta-barrel domain  PFAM  PF03330  DPBB_1
    coord: 71..155  score: 9.8
IPR009009  RlpA-like protein, double-psi beta-barrel domain  unknown  SSF50685  Barwin-like endoglucanases
    coord: 16..183  score: 1.4
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 606..684  score: 3.6E-14
    coord: 489..540  score: 3.6E-14
    coord: 368..432  score: 3.6
IPR011990  Tetratricopeptide-like helical domain  unknown  SSF48452  TPR-like
    coord: 501..537  score: 5.69E-6
    coord: 390..439  score: 5.69E-6
    coord: 631..659  score: 5.6
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 251..698  score: 9.8E-211
    coord: 214..233  score: 9.8E
None  No IPR available  PANTHER  PTHR24015:SF671  SUBFAMILY NOT NAMED
    coord: 251..698  score: 9.8E-211
    coord: 214..233  score: 9.8E
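
The coordinate ranges above can be collapsed into contiguous annotated regions by merging overlapping intervals. The following Python sketch does this for a subset of the hits listed in the table. The function name `merge_regions` and the choice of hits are illustrative assumptions, and the result is only a rough summary of where the annotations fall along the protein, not a statement about the gene model.

```python
# (source term, start, end) copied from a subset of the coordinate ranges
# in the InterPro table above; the selection is illustrative only.
hits = [
    ("SSF50685 Barwin-like endoglucanases", 16, 183),
    ("PF03330 DPBB_1", 71, 155),
    ("PF01357 Pollen_allerg_1", 166, 229),
    ("PR01226 EXPANSIN", 212, 233),
    ("PTHR24015 FAMILY NOT NAMED", 251, 698),
    ("PS51375 PPR", 311, 345),
    ("PF01535 PPR", 384, 412),
    ("G3DSA:1.25.40.10", 606, 684),
]


def merge_regions(hits):
    """Merge overlapping (start, end) ranges into contiguous regions."""
    regions = []
    # Sort by start so each range only needs comparing with the last region.
    for _, start, end in sorted(hits, key=lambda h: h[1]):
        if regions and start <= regions[-1][1]:
            # Overlaps the current region: extend its end if needed.
            regions[-1][1] = max(regions[-1][1], end)
        else:
            regions.append([start, end])
    return [tuple(r) for r in regions]


regions = merge_regions(hits)
print(regions)
```

For the hits chosen here, the merge yields two separate stretches, 16..233 (expansin-related signatures) and 251..698 (PPR/TPR-like signatures).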