CmoCh06G000200.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh06G000200.1
Type: mRNA
Organism: Cucurbita moschata (Rifu)
Description: PENTATRICOPEPTIDE REPEAT 596
Location: Cmo_Chr06 : 136432 .. 139947 (+)
Sequence length: 2432
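
The location field above can be turned into structured coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the names `Location`, `parse_location` and `span_length` are illustrative and are not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome or scaffold name
    start: int   # first base of the feature (assumed 1-based)
    end: int     # last base of the feature (assumed inclusive)
    strand: str  # "+" or "-"


# Matches strings shaped like "Cmo_Chr06 : 136432 .. 139947 (+)".
LOC_RE = re.compile(r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a 'seqid : start .. end (strand)' string into its parts."""
    match = LOC_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Genomic span in bases, under the inclusive-coordinate assumption."""
    return loc.end - loc.start + 1


loc = parse_location("Cmo_Chr06 : 136432 .. 139947 (+)")
print(loc, span_length(loc))
```

Under that assumption the genomic span is 3516 bases; compared with the reported mRNA length of 2432, this would leave about 1084 bases of intron.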
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, five_prime_UTR, CDS, three_prime_UTR
AGAAGAAGAAGAAGAAGTGTTTTTCTTTTTATTCTTTTCTTTCCTTATTTATTTTTTCTACCAATTATTTTGACATTTTCATGAAAATCAGTATAATGGCTTACTAATTTATTTTTAAAATTATAACAACATATTTATTTATTAATTTATAGTTTTCTAAAAGTATGTATGCAATATTTTTATAAAATTATTTAAGCAAGCAGCTATGCTTTAAACCTTGGTCCGAGTCCACCGCCTGATATAAAGAACGGGAGTCCAACACACGAAAGGGTTTTTAGGGTTTAACACCAACAACGTCTCTGCCGCGCCTATTTCGATCCCCCCTCCCTCCCTCTCCTTTCCACTCTTACACCTCTGCTACTAACCTCTTCACTCCTATTATCATCTCATCATGTGGGCTCTTCGTAGAGCTTCGACTCCTCTCAGGTATGCTTCTCGTCTTCTAACTGCACCTCTCCGCTTATTTACCTTACCTTCTTTATTCCTTAATGCCAATACATTTCTTTTCAATCTTCTTGTCACTACACACTTTCATTCACTATTTCTGCAACTAAGATTCTTGCTTGCTTCTCATCCACTTTCCGGTGCTCATCACAGAAGGAAATAATGTAAAAACTTGTGTGTCATTTTTTTCTGTTTTTATTCGCATTCAAGCTCAAATCTGAAGCTTCGTTATTTCTACAATTTAAATGCTTCTGGGTTTCACAATCTATCTTCACAGTTGTCGGCCTATTATCAAGGGGGGATTTCCTTTTCTCTCTGTAAGGAAGAACAAGTTTGGGGGATGTCAGTTTCCCGAAATCATCCATTATTAATCATTCAAACGACTGTTTTTGTATGTATGTATTTCTGTATGTGTAATATATGTACCTGTATATTGCGGTAACCGGTGCGGAGGGTACGATGCCTCCAATTTTTCATATCCCTCTCTCAATTTTTTGTGTTCTCTTTCTAGGAATCAAGGATATAGAATAAGAACTTCGTATGTCTTTGGCAAACTAGAGGCGCCATATTCTTGCGATGGAAATATAGTTGGTTCGGCTATCATCCCTGCTATATCTGATAGATGCATTTCTTTTGAGAGAAATAACCTTGCAACATGGCGGTCCTCTGGGCTTTCTATTAGCAGCCATGGTCTATCTTCACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGATGACTTGGAAGATGGGTTTTCTGAACTTGAAACACTTCCAAGCACTAATGCACTTGAAGATAATAAAGCAGCTGATGAAAATGAAGGGGAATTAACCTCTGAATCAGAACTTGATGATGATGGGACTCAAAATGAACTGGATTTACCTGAGGTTGAAACTGAGCTTGGTGAAAAGATATCTGCGAAAAGGGCTCCTTCAGAGCTGTTCAAGGCTATTTGGAGTGCTCCGGGTTTATCTGTTCCCAGTGCACTGGACAAGTGGGTCAGTGAAGGAAAAGAATTAAGCCGTGCTGACATCTCTCTGGCTATGCTCAATCTTCGAAGACGTCGAATGTTTGGGAAGGCTTTGCAGGTAAATTGACTTGATTTTACTTCTTCAAGCAATGGAATTCATTCCTATGGTTTTAGGTTTATAGTTAATCTCGTCACTCTTGGGATTTGCAGAACTCTATAAAGTATATGTGACTAGCGGACACAATGGTTTCAGGATGGTAATATTGATAAATCTCTAAGAAATACCATAATCAGTAGTAATGTAATGCGGTCTAACAAACACAAAATGATTTCAAGAAGTACCACTATCAAATGCTTGTATAGAAATTTGTTAAGCTGGCTTATGGCATTCAATAAATGGTGCCAGAACCGCGTATAGAATTTTCAGATGGCTTGATATTAGGTTGAGCTGCATGTAACTGCTTGATTTATTAGTATATATGAATTTTGCATTTATTATTGTTAGTCTTATAAAAGCCATTATTGGTAAGAAAATGGTTATGCGAGTTTTAGCCAGTTTATCGACCAAGAATTTTGTAGCTACCGTATT
TTAAAAATATTTTGCCTATTTGTTCAACATTCTTATTCATCATTTCTTCAACTCATTACTTGCATTGCTATATGGGTTTCACAGTTTTCAGAGTGGTTGGAAGCAAGTGGACAACTTGAGTTCGTTGACAGAGACTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACATGGTCTCCATAGGGCAGAAGGTTACATCGCCAAAATCCCGAAATCATTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAACTGTGTGGTCGCCAACAATGTAAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTTGAATTTCCAATCACAGCCTTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGACTTGACAAGAGGAAAATAGCCGACGTTCTGTTGTTGATGGAGAAAGAAAATGTCAAGCCTTCTCTGTTTACTTACAAGATCCTAATAGATGCTAAAGGCCTTTCAAATGACATGGTGGGTATGGAACAAGTTGTTGATACAATGAAGGCTGAAGGAATCGAACTTGATGTAAATACACTTTCCATATTAGCGAAGCACTATGCTTCAGGTGGGCTTAAAGACAAAGCCAAGGCCATCTTGAAGGAGATGGAAGATGTGAGCTCCAAAGAATCTCGATGGCCTTGCAGACTTTTACTTCCCCTTTATGGAGAACTACAAATGGAAGATGAAGTGAGGAGGGTCTGGAAGCTCTGCGAGGCTAATCCTCGCATTGAAGAATGCATGGCTGCCATTGTTGCATGGGGAAAGCTGAAGAACGTCCAAGAAGCAGAGGAAATTTTTGATAGAGTTTTAAAAACATGGAAGAAGCTTTCCTCGAAACAATACTCTACCATGTTGAAGGTTTATGCAGACAATAAGATGCTGACGAAGGGCAAGGATCTAGTCAAGCAGATGGCAGACAGCGGTTGCCGAATCGGTCCATTAACATGGGATGCAGTTGTGAAACTCTATGTGGAAGCTGGGGAGGTAGAAAAGGCAGATTCTTTCTTGCAGAAGGCGGTCCAGAAAAACCAGATGAAGCCATTGTTTACATCTTACATGGTTATCCTGGATCAGTATGCAAGGAGGGGTGACGTCCACAATGCGGAGAAAATGTTTCATAGGATGAGACTATCAGGTTACGTGGCTAGATTCAGCCCGTTTCAAGCTCTAATACAGGCATACATTAATGCCAAGGCTCCAGCCTACGGCATGAAAGAGAGAATGAAGGCAGATAATGTGTTTCCAAACAAAGCTTTGGCAGGAAAATTAGCCCAAATCGATGCATTCAGGAAGACAGCAGTGTCGGATTTGCTTGACTGAGGAATTTAATTTGATACTTTTGGATGTATAAACTCCTCGGATTTCAACTCTATTTTACAAATCAAGTATTTGTTAGTTAATCTGGTGGCTGCGCTTTGAAATTTGACAGCAGAACTACACAATGGATTGTGCTGCAGGTGTTATAATATATTTTGTTTGTTCGAATACCCATTATTACCAAGAA

mRNA sequence

AGAAGAAGAAGAAGAAGTGTTTTTCTTTTTATTCTTTTCTTTCCTTATTTATTTTTTCTACCAATTATTTTGACATTTTCATGAAAATCAGTATAATGGCTTACTAATTTATTTTTAAAATTATAACAACATATTTATTTATTAATTTATAGTTTTCTAAAAGTATGTATGCAATATTTTTATAAAATTATTTAAGCAAGCAGCTATGCTTTAAACCTTGGTCCGAGTCCACCGCCTGATATAAAGAACGGGAGTCCAACACACGAAAGGGTTTTTAGGGTTTAACACCAACAACGTCTCTGCCGCGCCTATTTCGATCCCCCCTCCCTCCCTCTCCTTTCCACTCTTACACCTCTGCTACTAACCTCTTCACTCCTATTATCATCTCATCATGTGGGCTCTTCGTAGAGCTTCGACTCCTCTCAGGAATCAAGGATATAGAATAAGAACTTCGTATGTCTTTGGCAAACTAGAGGCGCCATATTCTTGCGATGGAAATATAGTTGGTTCGGCTATCATCCCTGCTATATCTGATAGATGCATTTCTTTTGAGAGAAATAACCTTGCAACATGGCGGTCCTCTGGGCTTTCTATTAGCAGCCATGGTCTATCTTCACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGATGACTTGGAAGATGGGTTTTCTGAACTTGAAACACTTCCAAGCACTAATGCACTTGAAGATAATAAAGCAGCTGATGAAAATGAAGGGGAATTAACCTCTGAATCAGAACTTGATGATGATGGGACTCAAAATGAACTGGATTTACCTGAGGTTGAAACTGAGCTTGGTGAAAAGATATCTGCGAAAAGGGCTCCTTCAGAGCTGTTCAAGGCTATTTGGAGTGCTCCGGGTTTATCTGTTCCCAGTGCACTGGACAAGTGGGTCAGTGAAGGAAAAGAATTAAGCCGTGCTGACATCTCTCTGGCTATGCTCAATCTTCGAAGACGTCGAATGTTTGGGAAGGCTTTGCAGTTTTCAGAGTGGTTGGAAGCAAGTGGACAACTTGAGTTCGTTGACAGAGACTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACATGGTCTCCATAGGGCAGAAGGTTACATCGCCAAAATCCCGAAATCATTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAACTGTGTGGTCGCCAACAATGTAAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTTGAATTTCCAATCACAGCCTTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGACTTGACAAGAGGAAAATAGCCGACGTTCTGTTGTTGATGGAGAAAGAAAATGTCAAGCCTTCTCTGTTTACTTACAAGATCCTAATAGATGCTAAAGGCCTTTCAAATGACATGGTGGGTATGGAACAAGTTGTTGATACAATGAAGGCTGAAGGAATCGAACTTGATGTAAATACACTTTCCATATTAGCGAAGCACTATGCTTCAGGTGGGCTTAAAGACAAAGCCAAGGCCATCTTGAAGGAGATGGAAGATGTGAGCTCCAAAGAATCTCGATGGCCTTGCAGACTTTTACTTCCCCTTTATGGAGAACTACAAATGGAAGATGAAGTGAGGAGGGTCTGGAAGCTCTGCGAGGCTAATCCTCGCATTGAAGAATGCATGGCTGCCATTGTTGCATGGGGAAAGCTGAAGAACGTCCAAGAAGCAGAGGAAATTTTTGATAGAGTTTTAAAAACATGGAAGAAGCTTTCCTCGAAACAATACTCTACCATGTTGAAGGTTTATGCAGACAATAAGATGCTGACGAAGGGCAAGGATCTAGTCAAGCAGATGGCAGACAGCGGTTGCCGAATCGGTCCATTAACATGGGATGCAGTTGTGAAACTCTATGTGGAAGCTGGGGAGGTAGAAAAGGCAGATTCTTTCTTGCAGAAGGCGGTCCAGAAAAACCAGATGAAGCCATTGTTTACATCTTACATGGTTATCCTGGATC
AGTATGCAAGGAGGGGTGACGTCCACAATGCGGAGAAAATGTTTCATAGGATGAGACTATCAGGTTACGTGGCTAGATTCAGCCCGTTTCAAGCTCTAATACAGGCATACATTAATGCCAAGGCTCCAGCCTACGGCATGAAAGAGAGAATGAAGGCAGATAATGTGTTTCCAAACAAAGCTTTGGCAGGAAAATTAGCCCAAATCGATGCATTCAGGAAGACAGCAGTGTCGGATTTGCTTGACTGAGGAATTTAATTTGATACTTTTGGATGTATAAACTCCTCGGATTTCAACTCTATTTTACAAATCAAGTATTTGTTAGTTAATCTGGTGGCTGCGCTTTGAAATTTGACAGCAGAACTACACAATGGATTGTGCTGCAGGTGTTATAATATATTTTGTTTGTTCGAATACCCATTATTACCAAGAA

Coding sequence (CDS)

ATGTGGGCTCTTCGTAGAGCTTCGACTCCTCTCAGGAATCAAGGATATAGAATAAGAACTTCGTATGTCTTTGGCAAACTAGAGGCGCCATATTCTTGCGATGGAAATATAGTTGGTTCGGCTATCATCCCTGCTATATCTGATAGATGCATTTCTTTTGAGAGAAATAACCTTGCAACATGGCGGTCCTCTGGGCTTTCTATTAGCAGCCATGGTCTATCTTCACAAGCTGGTGCTGAGAACAGTGGAGAGGAAGATGACTTGGAAGATGGGTTTTCTGAACTTGAAACACTTCCAAGCACTAATGCACTTGAAGATAATAAAGCAGCTGATGAAAATGAAGGGGAATTAACCTCTGAATCAGAACTTGATGATGATGGGACTCAAAATGAACTGGATTTACCTGAGGTTGAAACTGAGCTTGGTGAAAAGATATCTGCGAAAAGGGCTCCTTCAGAGCTGTTCAAGGCTATTTGGAGTGCTCCGGGTTTATCTGTTCCCAGTGCACTGGACAAGTGGGTCAGTGAAGGAAAAGAATTAAGCCGTGCTGACATCTCTCTGGCTATGCTCAATCTTCGAAGACGTCGAATGTTTGGGAAGGCTTTGCAGTTTTCAGAGTGGTTGGAAGCAAGTGGACAACTTGAGTTCGTTGACAGAGACTATGCTTCTCGCCTTGACTTGATTGCAAAGGTACATGGTCTCCATAGGGCAGAAGGTTACATCGCCAAAATCCCGAAATCATTCCAGGGGGAGGTGATATACCGAACTCTTTTGGCTAACTGTGTGGTCGCCAACAATGTAAAAAAAGCAGAGGAAGTATTTAACAAAATGAAGGACCTTGAATTTCCAATCACAGCCTTTGCTTGCAACCAGTTGCTTCTTCTTTACAAGAGACTTGACAAGAGGAAAATAGCCGACGTTCTGTTGTTGATGGAGAAAGAAAATGTCAAGCCTTCTCTGTTTACTTACAAGATCCTAATAGATGCTAAAGGCCTTTCAAATGACATGGTGGGTATGGAACAAGTTGTTGATACAATGAAGGCTGAAGGAATCGAACTTGATGTAAATACACTTTCCATATTAGCGAAGCACTATGCTTCAGGTGGGCTTAAAGACAAAGCCAAGGCCATCTTGAAGGAGATGGAAGATGTGAGCTCCAAAGAATCTCGATGGCCTTGCAGACTTTTACTTCCCCTTTATGGAGAACTACAAATGGAAGATGAAGTGAGGAGGGTCTGGAAGCTCTGCGAGGCTAATCCTCGCATTGAAGAATGCATGGCTGCCATTGTTGCATGGGGAAAGCTGAAGAACGTCCAAGAAGCAGAGGAAATTTTTGATAGAGTTTTAAAAACATGGAAGAAGCTTTCCTCGAAACAATACTCTACCATGTTGAAGGTTTATGCAGACAATAAGATGCTGACGAAGGGCAAGGATCTAGTCAAGCAGATGGCAGACAGCGGTTGCCGAATCGGTCCATTAACATGGGATGCAGTTGTGAAACTCTATGTGGAAGCTGGGGAGGTAGAAAAGGCAGATTCTTTCTTGCAGAAGGCGGTCCAGAAAAACCAGATGAAGCCATTGTTTACATCTTACATGGTTATCCTGGATCAGTATGCAAGGAGGGGTGACGTCCACAATGCGGAGAAAATGTTTCATAGGATGAGACTATCAGGTTACGTGGCTAGATTCAGCCCGTTTCAAGCTCTAATACAGGCATACATTAATGCCAAGGCTCCAGCCTACGGCATGAAAGAGAGAATGAAGGCAGATAATGTGTTTCCAAACAAAGCTTTGGCAGGAAAATTAGCCCAAATCGATGCATTCAGGAAGACAGCAGTGTCGGATTTGCTTGACTGA
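
The coding sequence above can be checked against the protein used as the BLAST query by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names, and only the first 60 bases of the CDS are used here to keep the example short.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order
# for the first, second and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the CDS listed above.
cds_start = "ATGTGGGCTCTTCGTAGAGCTTCGACTCCTCTCAGGAATCAAGGATATAGAATAAGAACT"
print(translate(cds_start))  # MWALRRASTPLRNQGYRIRT
```

The result agrees with the first 20 residues of the query shown in the alignments that start at query position 1.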
BLAST of CmoCh06G000200.1 vs. Swiss-Prot
Match: PP135_ARATH (Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidopsis thaliana GN=At1g80270 PE=2 SV=1)

HSP 1 Score: 644.4 bits (1661), Expect = 1.2e-183
Identity = 330/574 (57.49%), Positives = 444/574 (77.35%), Query Frame = 1

Query: 52  SFERNNLATWRSSGL-------SISSHGLSSQAGAENSGEEDDLEDGFSELETLPSTNAL 111
           SF+ N++A+ +   +       S+S+  LSS AG ++  EEDDLEDGFSELE    + + 
Sbjct: 34  SFDSNSIASTKREAVPRFYEISSLSNRALSSSAGTKSDQEEDDLEDGFSELE---GSKSG 93

Query: 112 EDNKAADENEGELTSESELDDDGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGL 171
           + + ++DE+EG+L+++ E ++     ELDL  +ET++  K   K+  SELFK I SAPGL
Sbjct: 94  QGSTSSDEDEGKLSADEEEEE-----ELDL--IETDVSRKTVEKKQ-SELFKTIVSAPGL 153

Query: 172 SVPSALDKWVSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASR 231
           S+ SALDKWV EG E++R +I+ AML LRRRRM+G+ALQ SEWLEA+ ++E  +RDYASR
Sbjct: 154 SIGSALDKWVEEGNEITRVEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASR 213

Query: 232 LDLIAKVHGLHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPI 291
           LDL  K+ GL + E  + KIPKSF+GEV+YRTLLANCV A NVKK+E VFNKMKDL FP+
Sbjct: 214 LDLTVKIRGLEKGEACMQKIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPL 273

Query: 292 TAFACNQLLLLYKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVD 351
           + F C+Q+LLL+KR+D++KIADVLLLMEKEN+KPSL TYKILID KG +ND+ GMEQ+++
Sbjct: 274 SGFTCDQMLLLHKRIDRKKIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILE 333

Query: 352 TMKAEGIELDVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQ 411
           TMK EG+ELD  T ++ A+HY+  GLKDKA+ +LKEME  S + +R   + LL +Y  L 
Sbjct: 334 TMKDEGVELDFQTQALTARHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLG 393

Query: 412 MEDEVRRVWKLCEANPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTML 471
            EDEV+R+WK+CE+ P  EE +AAI A+GKL  VQEAE IF++++K  ++ SS  YS +L
Sbjct: 394 REDEVKRIWKICESKPYFEESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLL 453

Query: 472 KVYADNKMLTKGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQM 531
           +VY D+KML+KGKDLVK+MA+SGCRI   TWDA++KLYVEAGEVEKADS L KA +++  
Sbjct: 454 RVYVDHKMLSKGKDLVKRMAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHT 513

Query: 532 KPLFTSYMVILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMK 591
           K +  S+M I+D+Y++RGDVHN EK+F +MR +GY +R   FQAL+QAYINAK+PAYGM+
Sbjct: 514 KLMMNSFMYIMDEYSKRGDVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMR 573

Query: 592 ERMKADNVFPNKALAGKLAQIDAFRKTAVSDLLD 619
           +R+KADN+FPNK++A +LAQ D F+KTA+SD+LD
Sbjct: 574 DRLKADNIFPNKSMAAQLAQGDPFKKTAISDILD 596
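
Each HSP summary line reports identical and positive-scoring positions as counts over the alignment length, so the percentages can be recomputed from the counts. The following is a minimal sketch; `hsp_percentages` is an illustrative helper, not part of any BLAST toolkit, and the pattern also tolerates a misspelled "Postives" label.

```python
import re

# Captures the two "count/length" pairs from an HSP summary line.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length_a, positive, length_b = map(int, match.groups())
    return (
        round(100 * identical / length_a, 2),
        round(100 * positive / length_b, 2),
    )


summary = "Identity = 330/574 (57.49%), Positives = 444/574 (77.35%), Query Frame = 1"
print(hsp_percentages(summary))  # (57.49, 77.35)
```

For the PP135_ARATH hit, 330/574 and 444/574 give 57.49% and 77.35%, matching the reported values.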

BLAST of CmoCh06G000200.1 vs. Swiss-Prot
Match: PPR44_ARATH (Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidopsis thaliana GN=At1g15480 PE=2 SV=2)

HSP 1 Score: 640.2 bits (1650), Expect = 2.3e-182
Identity = 337/608 (55.43%), Positives = 444/608 (73.03%), Query Frame = 1

Query: 12  RNQGYRIRT-SYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLATWRSSGLSISS 71
           R+Q  R+   + V+ KL+ P       + S  +  I D+  +  R    +W SS      
Sbjct: 10  RSQSLRLGACNAVYSKLDIPLGERNIAIESNAL--IHDKHEALPRFYELSWSSS---TGR 69

Query: 72  HGLSSQAGAENSGEEDDLEDGFSELETLPSTNALEDNKAADENEGELTSESELDDDGTQN 131
             LSS AGA+ +G++DDLED   +L T        D  ++D  +GE  S  E D +G + 
Sbjct: 70  RSLSSDAGAKTTGDDDDLEDKNVDLAT-------PDETSSDSEDGEEFSGDEGDIEGAEL 129

Query: 132 ELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISLAML 191
           EL +PE            + PSE+FKAI S  GLSV SALDKWV +GK+ +R +   AML
Sbjct: 130 ELHVPE-----------SKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEFESAML 189

Query: 192 NLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKSFQG 251
            LR+RRMFG+ALQ +EWL+ + Q E  +RDYA RLDLI+KV G ++ E YI  IP+SF+G
Sbjct: 190 QLRKRRMFGRALQMTEWLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIPESFRG 249

Query: 252 EVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADVLLL 311
           E++YRTLLAN V  +NV+ AE VFNKMKDL FP++ F CNQ+L+LYKR+DK+KIADVLLL
Sbjct: 250 ELVYRTLLANHVATSNVRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIADVLLL 309

Query: 312 MEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYASGGL 371
           +EKEN+KP+L TYKILID KG SND+ GMEQ+V+TMK+EG+ELD+   +++A+HYAS GL
Sbjct: 310 LEKENLKPNLNTYKILIDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHYASAGL 369

Query: 372 KDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEANPRIEECMAAIV 431
           K+KA+ +LKEME  S +E+R  C+ LL +YG LQ EDEVRRVWK+CE NPR  E +AAI+
Sbjct: 370 KEKAEKVLKEMEGESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRYNEVLAAIL 429

Query: 432 AWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSGCRI 491
           A+GK+  V++AE +F++VLK   ++SS  YS +L+VY D+KM+++GKDLVKQM+DSGC I
Sbjct: 430 AFGKIDKVKDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLVKQMSDSGCNI 489

Query: 492 GPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNAEKM 551
           G LTWDAV+KLYVEAGEVEKA+S L KA+Q  Q+KPL +S+M ++ +Y RRGDVHN EK+
Sbjct: 490 GALTWDAVIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDVHNTEKI 549

Query: 552 FHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDAFRK 611
           F RM+ +GY +RF  +Q LIQAY+NAKAPAYGMKERMKADN+FPNK LA +LA+ D F+K
Sbjct: 550 FQRMKQAGYQSRFWAYQTLIQAYVNAKAPAYGMKERMKADNIFPNKRLAAQLAKADPFKK 594

Query: 612 TAVSDLLD 619
           T +SDLLD
Sbjct: 610 TPLSDLLD 594

BLAST of CmoCh06G000200.1 vs. Swiss-Prot
Match: PP234_ARATH (Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidopsis thaliana GN=At3g15590 PE=2 SV=1)

HSP 1 Score: 535.0 bits (1377), Expect = 1.1e-150
Identity = 275/549 (50.09%), Positives = 398/549 (72.50%), Query Frame = 1

Query: 71  HGLSSQAGAENSGEEDDLEDGFSELE-TLPSTNALEDNKAADENEGELTSESELDDDGTQ 130
           H LSS A A++ G+E   E+  SE E  +P +  + +    D++      E EL  D   
Sbjct: 69  HKLSSIADAKDKGDEVVREEELSESEEAVPVSGDVPEGVVDDDS----LFEPELGSDN-- 128

Query: 131 NELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISLAM 190
           ++L++ E  ++ G K + KR  SEL+++I +    SV   L+KWV EGK+LS+A+++LA+
Sbjct: 129 DDLEIEEKHSKDGGKPTKKRGQSELYESIVAYK--SVKHVLEKWVKEGKDLSQAEVTLAI 188

Query: 191 LNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKSFQ 250
            NLR+R+ +   LQ  EWL A+ Q EF + +YAS+LDL+AKVH L +AE ++  IP+S +
Sbjct: 189 HNLRKRKSYAMCLQLWEWLGANTQFEFTEANYASQLDLVAKVHSLQKAEIFLKDIPESSR 248

Query: 251 GEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADVLL 310
           GEV+YRTLLANCV+ ++V KAE++FNKMK+L+FP + FACNQLLLLY   D++KI+DVLL
Sbjct: 249 GEVVYRTLLANCVLKHHVNKAEDIFNKMKELKFPTSVFACNQLLLLYSMHDRKKISDVLL 308

Query: 311 LMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYASGG 370
           LME+EN+KPS  TY  LI++KGL+ D+ GME++V+T+K EGIELD    SILAK+Y   G
Sbjct: 309 LMERENIKPSRATYHFLINSKGLAGDITGMEKIVETIKEEGIELDPELQSILAKYYIRAG 368

Query: 371 LKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEANPRIEECMAAI 430
           LK++A+ ++KE+E    +++ W CR LLPLY ++   D VRR+ +  + NPR + C++AI
Sbjct: 369 LKERAQDLMKEIEGKGLQQTPWVCRSLLPLYADIGDSDNVRRLSRFVDQNPRYDNCISAI 428

Query: 431 VAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSGCR 490
            AWGKLK V+EAE +F+R+++ +K      Y  ++++Y +NKML KG+DLVK+M ++G  
Sbjct: 429 KAWGKLKEVEEAEAVFERLVEKYKIFPMMPYFALMEIYTENKMLAKGRDLVKRMGNAGIA 488

Query: 491 IGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNAEK 550
           IGP TW A+VKLY++AGEV KA+  L +A + N+M+P+FT+YM IL++YA+RGDVHN EK
Sbjct: 489 IGPSTWHALVKLYIKAGEVGKAELILNRATKDNKMRPMFTTYMAILEEYAKRGDVHNTEK 548

Query: 551 MFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDAFR 610
           +F +M+ + Y A+   ++ ++ AYINAK PAYGM ERMKADNVFPNK+LA KLAQ++ F+
Sbjct: 549 VFMKMKRASYAAQLMQYETVLLAYINAKTPAYGMIERMKADNVFPNKSLAAKLAQVNPFK 608

Query: 611 KTAVSDLLD 619
           K  VS LLD
Sbjct: 609 KCPVSVLLD 609

BLAST of CmoCh06G000200.1 vs. Swiss-Prot
Match: PPR19_ARATH (Pentatricopeptide repeat-containing protein At1g07590, mitochondrial OS=Arabidopsis thaliana GN=At1g07590 PE=2 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 2.9e-39
Identity = 113/456 (24.78%), Positives = 223/456 (48.90%), Query Frame = 1

Query: 163 GLSVPSALDKWVSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYA 222
           G++V SAL  W+ +G  +   D+  A+  LR+     +AL+  EW+         + +Y+
Sbjct: 78  GVTVGSALQSWMGDGFPVHGGDVYHAINRLRKLGRNKRALELMEWIIRERPYRLGELEYS 137

Query: 223 SRLDLIAKVHGLHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEF 282
             L+   K+HG+ + E    ++P+ FQ E++Y  L+  C+    ++ A E   KM++L +
Sbjct: 138 YLLEFTVKLHGVSQGEKLFTRVPQEFQNELLYNNLVIACLDQGVIRLALEYMKKMRELGY 197

Query: 283 PITAFACNQLLLLYKRLDKRK-IADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQ 342
             +    N+L++      +RK IA  L LM+ +   P + TY IL+  +   +++ G+ +
Sbjct: 198 RTSHLVYNRLIIRNSAPGRRKLIAKDLALMKADKATPHVSTYHILMKLEANEHNIDGVLK 257

Query: 343 VVDTMKAEGIELDVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYG 402
             D MK  G+E +  +  ILA  +A   L   A+A  +E+E   + ++     +L+ LYG
Sbjct: 258 AFDGMKKAGVEPNEVSYCILAMAHAVARLYTVAEAYTEEIEKSITGDNWSTLDILMILYG 317

Query: 403 ELQMEDEVRRVWKLCEA--NPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQ 462
            L  E E+ R W +     + R +  + A  A+ ++ N+  AEE++  +        ++Q
Sbjct: 318 RLGKEKELARTWNVIRGFHHVRSKSYLLATEAFARVGNLDRAEELWLEMKNVKGLKETEQ 377

Query: 463 YSTMLKVYADNKMLTKGKDLVKQMADSGCRIGPLTWD------AVVKLYVEAGEVEKADS 522
           ++++L VY  + ++ K   + ++M  +G +   +T+       A  KL  EA +  +   
Sbjct: 378 FNSLLSVYCKDGLIEKAIGVFREMTGNGFKPNSITYRHLALGCAKAKLMKEALKNIEMGL 437

Query: 523 FLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAY 582
            L+ +       P   + + I++ +A +GDV N+EK+F  ++ + Y      + AL +AY
Sbjct: 438 NLKTSKSIGSSTPWLETTLSIIECFAEKGDVENSEKLFEEVKNAKYNRYAFVYNALFKAY 497

Query: 583 INAKAPAYGMKERMKADNVFPNKALAGKLAQIDAFR 610
           + AK     + +RM      P+      L  ++ ++
Sbjct: 498 VKAKVYDPNLFKRMVLGGARPDAESYSLLKLVEQYK 533

BLAST of CmoCh06G000200.1 vs. Swiss-Prot
Match: PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 2.1e-37
Identity = 118/453 (26.05%), Positives = 221/453 (48.79%), Query Frame = 1

Query: 164 LSVPSALDKWVSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYAS 223
           + V   L++++   K + + ++   +  LR R ++  AL+ SE +E  G  + V  D A 
Sbjct: 37  VKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEVMEERGMNKTVS-DQAI 96

Query: 224 RLDLIAKVHGLHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNV-KKAEEVFNKMKDLEF 283
            LDL+AK   +   E Y   +P++ + E+ Y +LL NC     + +KAE + NKMK+L  
Sbjct: 97  HLDLVAKAREITAGENYFVDLPETSKTELTYGSLL-NCYCKELLTEKAEGLLNKMKELNI 156

Query: 284 PITAFACNQLLLLYKRL-DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQ 343
             ++ + N L+ LY +  +  K+  ++  ++ ENV P  +TY + + A   +ND+ G+E+
Sbjct: 157 TPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYTYNVWMRALAATNDISGVER 216

Query: 344 VVDTMKAEG-IELDVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLY 403
           V++ M  +G +  D  T S +A  Y   GL  KA+  L+E+E  +++      + L+ LY
Sbjct: 217 VIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLY 276

Query: 404 GELQMEDEVRRVWK-LCEANPRIEEC--MAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSS 463
           G L    EV R+W+ L  A P+      +  I    KL ++  AE +F            
Sbjct: 277 GRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDI 336

Query: 464 KQYSTMLKVYADNKMLTKGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQK 523
           +  + ++  YA   ++ K  +L ++    G ++   TW+  +  YV++G++ +A   + K
Sbjct: 337 RIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMDYYVKSGDMARALECMSK 396

Query: 524 AV-----QKNQMKPLFTSYMVILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQA 583
           AV        +  P   +   ++  + ++ DV+ AE +   ++          F+ LI+ 
Sbjct: 397 AVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRT 456

Query: 584 YINAKAPAYGMKERMKADNVFPNKALAGKLAQI 606
           Y  A      M+ R+K +NV  N+A    L ++
Sbjct: 457 YAAAGKSHPAMRRRLKMENVEVNEATKKLLDEV 487

BLAST of CmoCh06G000200.1 vs. TrEMBL
Match: A0A0A0LEU5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819910 PE=4 SV=1)

HSP 1 Score: 916.0 bits (2366), Expect = 2.5e-263
Identity = 472/624 (75.64%), Positives = 539/624 (86.38%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRNQGYR+RTSYVFGKLE PY  +GN+ G     A+SDR I F+RNNL T
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SS + ISSHGLS+QAGAENSGEE ++EDG SEL ETLPST+ LED+K AD+NE ELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 ESELDDDGTQ----NELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVS 180
            SE+DDD        ELDLPE ET L EKIS KRAPSEL   IW APGL+V SALDKWVS
Sbjct: 121 GSEIDDDNDVVDDGTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180

Query: 181 EGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLH 240
           EGKELSR DIS AMLNLR+ RM+GKALQFSEWLEA+G+L+FV++DYASRLDLI K+ GL 
Sbjct: 181 EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLEANGKLDFVEKDYASRLDLIGKLRGLR 240

Query: 241 RAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLL 300
            AE YIAKIPKSFQGEV+YRTLLANCV+A NV+KAEEVFNKMKDLEFPITAFACNQLLLL
Sbjct: 241 MAENYIAKIPKSFQGEVVYRTLLANCVIACNVQKAEEVFNKMKDLEFPITAFACNQLLLL 300

Query: 301 YKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDV 360
           YKR DKRK+AD+LLLMEKENVKPS FTY+ILID KGLSND+ GMEQVVDTMKAEGIELDV
Sbjct: 301 YKRTDKRKVADILLLMEKENVKPSRFTYRILIDTKGLSNDITGMEQVVDTMKAEGIELDV 360

Query: 361 NTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKL 420
           +TLS+LAKHY SGGLKDKAKAILKEME+++S+ SRWPCR+LLPLYGELQMEDEVRR+W++
Sbjct: 361 STLSVLAKHYISGGLKDKAKAILKEMEEINSEGSRWPCRILLPLYGELQMEDEVRRLWEI 420

Query: 421 CEANPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVY-ADNKMLT 480
           C +NP IEECMAAIVAWGKLKN+QEAE+IFDRV+KT +KLS++ YSTML VY  D+KMLT
Sbjct: 421 CGSNPHIEECMAAIVAWGKLKNIQEAEKIFDRVVKTGEKLSARHYSTMLNVYREDSKMLT 480

Query: 481 KGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVI 540
           KGK++VKQMA+SG R+ P+T DAVVKLYVEAGE EKADSFL K V + + KP+FT+Y+ +
Sbjct: 481 KGKEVVKQMAESGSRMDPVTLDAVVKLYVEAGEGEKADSFLVKTVLQYKKKPMFTTYITL 540

Query: 541 LDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFP 600
           +D+YA RGDV NAEK+F  MR  GYV R S FQ LIQAY+NAKAPAYGM+ERMKAD+VFP
Sbjct: 541 MDRYASRGDVPNAEKIFGMMRKYGYVGRLSHFQTLIQAYVNAKAPAYGMRERMKADSVFP 600

Query: 601 NKALAGKLAQIDAFRKTAVSDLLD 619
           NKALAGKLAQ+D+ +   VSDLLD
Sbjct: 601 NKALAGKLAQVDSLKMREVSDLLD 624

BLAST of CmoCh06G000200.1 vs. TrEMBL
Match: A0A0A0LHD8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819930 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 4.8e-259
Identity = 467/626 (74.60%), Positives = 532/626 (84.98%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRNQGYR+RTSYVFGKLE PY  +GN+ G     A+SDR I F+RNNL T
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SS + ISSHGLS+QAGAENSGEE ++EDG SEL ETLPST+ LED+K AD+NE ELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 ESELDDD------GTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKW 180
            SE+DDD      GTQNELDLPE ET L EKIS KRAPSEL   IW APGL+V SALDKW
Sbjct: 121 GSEIDDDDDVVDDGTQNELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW 180

Query: 181 VSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHG 240
           VSEGKELSR DIS AMLNLR+ RM+GKALQFSEWLE SG+L+F++ DYASRL LI K+ G
Sbjct: 181 VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG 240

Query: 241 LHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLL 300
           L  AE YIAKIPKSFQGEV+Y+TLL NCV+A+NV KAE+VFNKMK+LEFPITAFACNQLL
Sbjct: 241 LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL 300

Query: 301 LLYKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIEL 360
           LLYKR DKRKIADVLLLM+KENVK S  TY+ILID  GLSND+ GME+VVD+MKAEGI+L
Sbjct: 301 LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL 360

Query: 361 DVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVW 420
           DV TLS L KHY SGGLKDKAKA+LKEME+++S+ SR PCR+LLPLYGELQMEDEVRR+W
Sbjct: 361 DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW 420

Query: 421 KLCEANPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVY-ADNKM 480
           ++CE+NP IEECMAAIVAWGKLKNVQEAE+IFDRV+KT +KLS++ YSTML VY  D+KM
Sbjct: 421 EICESNPHIEECMAAIVAWGKLKNVQEAEKIFDRVVKTGEKLSARHYSTMLNVYREDSKM 480

Query: 481 LTKGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYM 540
           LTKGK++VKQMA+SGCR+ P T DAVVKLYVEAGEVEKADSFL KAV +N+ KP+FT+Y+
Sbjct: 481 LTKGKEVVKQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI 540

Query: 541 VILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNV 600
            ++D+YA RGDV N EK F  MR  GYV R S FQ LIQAY+NAKAPAYGM+ERMKADNV
Sbjct: 541 TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV 600

Query: 601 FPNKALAGKLAQIDAFRKTAVSDLLD 619
           FPNK LAGKLAQ+D  +   VSDLLD
Sbjct: 601 FPNKDLAGKLAQVDCLKMRKVSDLLD 626

BLAST of CmoCh06G000200.1 vs. TrEMBL
Match: E5GBN7_CUCME (DNA-binding protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 895.6 bits (2313), Expect = 3.4e-257
Identity = 457/619 (73.83%), Positives = 523/619 (84.49%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRN GYR+RTSYVFGKLE PY  +GNI G      +S R ISFERNNLAT
Sbjct: 1   MWALRRASTPLRNHGYRVRTSYVFGKLEVPYFGEGNIAGFGTTATLSGRFISFERNNLAT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SSG+ ISSHGLS+QAGAENSGEED++EDGFSEL ET P+T +    + +D++E  +  
Sbjct: 61  WPSSGIYISSHGLSTQAGAENSGEEDNVEDGFSELDETHPTTRS----EISDDDENVV-- 120

Query: 121 ESELDDDGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKE 180
                DDGTQNELDLPE ETEL EKIS K  PSEL +AIW+AP LSV SALDKWVSEG E
Sbjct: 121 -----DDGTQNELDLPEGETELAEKISRKWVPSELNEAIWNAPALSVTSALDKWVSEGHE 180

Query: 181 LSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEG 240
           LSR DIS  M  LR+RRMFGKALQFSEWLEASGQLEF + DYAS LDLIAKV GLH+AE 
Sbjct: 181 LSRDDISSTMFGLRKRRMFGKALQFSEWLEASGQLEFNEADYASHLDLIAKVQGLHKAET 240

Query: 241 YIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRL 300
           YIAKIP SF+GE +YRTLLAN V+AN+VKKAEEVFN+MKDLEFP+T FA +Q+L+LYKR+
Sbjct: 241 YIAKIPNSFRGEAVYRTLLANYVLANDVKKAEEVFNRMKDLEFPMTTFAYDQMLILYKRI 300

Query: 301 DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLS 360
           D+R+IAD+L LMEKENVKP  FTYKILIDAKGLSND+ GMEQVVDTMKAEGI+LDV+TL 
Sbjct: 301 DRRRIADILSLMEKENVKPRPFTYKILIDAKGLSNDISGMEQVVDTMKAEGIKLDVDTLL 360

Query: 361 ILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEAN 420
           +LAKHY  GGLKDKA  ILK  E+V+SK SRWPCR LLPLYGELQMEDEVRR+W++CE N
Sbjct: 361 LLAKHYVLGGLKDKAMPILKATEEVNSKGSRWPCRYLLPLYGELQMEDEVRRLWEICEPN 420

Query: 421 PRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDL 480
           P +EECMAAIVAWGKLKN+QEAE+IFDRV+KTWK+LS+K YSTM+KVY D+KMLTKGK+L
Sbjct: 421 PNVEECMAAIVAWGKLKNIQEAEKIFDRVVKTWKRLSTKHYSTMIKVYGDSKMLTKGKEL 480

Query: 481 VKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYA 540
           V QMA SGCRI P+ WDAVVKLYVEAGEVEKADSFL KAV++  MKPLF SY  ++  YA
Sbjct: 481 VNQMAKSGCRIDPMIWDAVVKLYVEAGEVEKADSFLFKAVKQYGMKPLFDSYRTLMVHYA 540

Query: 541 RRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALA 600
           R+GDVHN+EK+FH++R SGY   F  F  L+QAY+NAK PAYGM+ERM ADNVFPNKALA
Sbjct: 541 RKGDVHNSEKIFHKIRQSGYPTHFGQFVTLVQAYLNAKTPAYGMRERMMADNVFPNKALA 600

Query: 601 GKLAQIDAFRKTAVSDLLD 619
           GKLAQ+D+FR+T VSDLLD
Sbjct: 601 GKLAQLDSFRQTTVSDLLD 608

BLAST of CmoCh06G000200.1 vs. TrEMBL
Match: A0A0A0LFG6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819890 PE=4 SV=1)

HSP 1 Score: 885.6 bits (2287), Expect = 3.6e-254
Identity = 462/621 (74.40%), Positives = 522/621 (84.06%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRNQGYR+RTSYVFGKLE PY  +GN+ G   I  +SDR IS ERNNLAT
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTIATLSDRYISSERNNLAT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SSG+ ISSHGLSSQAGAENSGEED    GFSEL ETLP+T +    +  D+++  +  
Sbjct: 61  WPSSGIYISSHGLSSQAGAENSGEED----GFSELDETLPTTRS----EIVDDDDNVV-- 120

Query: 121 ESELDDDGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKE 180
                DDGTQNELDL E ETEL EK   K  PSEL KAIW+A GLSV SALDKWVSEG E
Sbjct: 121 -----DDGTQNELDLLEGETELAEKKFTKWVPSELTKAIWNASGLSVSSALDKWVSEGNE 180

Query: 181 LSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEG 240
           LS  DIS  M++LRRRRMFGKALQFSEWLEASGQLEF + DYASRLDLIAKV GLH+AE 
Sbjct: 181 LSWDDISSTMMSLRRRRMFGKALQFSEWLEASGQLEFNENDYASRLDLIAKVQGLHKAES 240

Query: 241 YIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRL 300
           YIAKIPKSFQGEV+YRTLLAN V ANNV KAEEVFNKMKDLEFP+T FA NQ+L+LYKR 
Sbjct: 241 YIAKIPKSFQGEVMYRTLLANYVAANNVNKAEEVFNKMKDLEFPMTTFAYNQVLVLYKRN 300

Query: 301 DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLS 360
           D+RKIADVLLLMEKENVKPS FTYKILIDAKGLS D+ GMEQVVDTMKAEGIELDV  L 
Sbjct: 301 DRRKIADVLLLMEKENVKPSPFTYKILIDAKGLSKDISGMEQVVDTMKAEGIELDVFALC 360

Query: 361 ILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEAN 420
           +LAKHY S GLKDKAKA LKEME+++SK SRWPCRLLLPLYGEL+MEDEVRR+W++CEAN
Sbjct: 361 LLAKHYVSCGLKDKAKATLKEMEEINSKGSRWPCRLLLPLYGELEMEDEVRRLWEICEAN 420

Query: 421 PRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTW--KKLSSKQYSTMLKVYADNKMLTKGK 480
           P IEECMAAIVAWGKLKN+ EAE+IFD+V+KTW  KK+S+K Y TM+KVY D KMLTKGK
Sbjct: 421 PHIEECMAAIVAWGKLKNIHEAEKIFDKVVKTWPKKKISTKHYCTMIKVYGDCKMLTKGK 480

Query: 481 DLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQ 540
           +LV QMA+SG  I PL WDAVVKLYVEAGEVEKAD+FL KAV+K +M+PL+ SY  +++ 
Sbjct: 481 ELVNQMAESGYSIDPLAWDAVVKLYVEAGEVEKADTFLVKAVKKYEMRPLYCSYRTLMNH 540

Query: 541 YARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKA 600
           YARRGDVHNAEK+F++MR SGY   F+ F+ LIQAY+N+K PAYGM+ERM AD +FPNKA
Sbjct: 541 YARRGDVHNAEKIFYKMRQSGYGPWFNQFETLIQAYVNSKTPAYGMRERMMADKLFPNKA 600

Query: 601 LAGKLAQIDAFRKTAVSDLLD 619
           LAGKLAQ+D+FRKTA+ DLLD
Sbjct: 601 LAGKLAQVDSFRKTALPDLLD 606

BLAST of CmoCh06G000200.1 vs. TrEMBL
Match: A0A0A0LFH0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819940 PE=4 SV=1)

HSP 1 Score: 884.4 bits (2284), Expect = 7.9e-254
Identity = 457/619 (73.83%), Positives = 523/619 (84.49%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRNQGYR+RTS+VFGKLE PYS + N+ G      +SDR ISFERNNLAT
Sbjct: 1   MWALRRASTPLRNQGYRVRTSHVFGKLEVPYSWEANVAGFVTTATLSDRFISFERNNLAT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SSG+ ISSHGLS+QAGAENSGEED++EDGFSEL E LP+T +    + AD+++     
Sbjct: 61  WPSSGIYISSHGLSTQAGAENSGEEDNVEDGFSELDEALPTTRS----EIADDDD----- 120

Query: 121 ESELDDDGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKE 180
             ++ DDG+QNELDL E ET L EK S+K  PSEL K IW+APGLSV SALDKWVSEG E
Sbjct: 121 --DIVDDGSQNELDLLEGETVLAEKKSSKWVPSELTKVIWNAPGLSVASALDKWVSEGNE 180

Query: 181 LSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEG 240
           LSR+DISL M++LRRRRMFGKALQFSEWLEASGQLEF ++DYASRLDLIAKV GLH+AE 
Sbjct: 181 LSRSDISLTMMSLRRRRMFGKALQFSEWLEASGQLEFNEKDYASRLDLIAKVQGLHKAES 240

Query: 241 YIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRL 300
           YIAKIPKSF GEV++RTLLAN VVANNVKKAEEVFNKMKDL+FP+T FA +Q+L+LYKR+
Sbjct: 241 YIAKIPKSFHGEVVHRTLLANYVVANNVKKAEEVFNKMKDLKFPMTTFAYDQMLILYKRI 300

Query: 301 DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLS 360
           DK+KIADVLLLMEKENVKPS FTY +LID KGLSND+ GMEQVVDTMKA G+E D  TLS
Sbjct: 301 DKKKIADVLLLMEKENVKPSRFTYIVLIDVKGLSNDIRGMEQVVDTMKAGGMEPDSYTLS 360

Query: 361 ILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEAN 420
           ILAKHY SGG KDKAKAILKE+E+ +S+  +W  R+LLPLY  LQMEDEVRR+WK CE N
Sbjct: 361 ILAKHYVSGGYKDKAKAILKEIEESNSRIPQWSRRILLPLYVSLQMEDEVRRLWKSCEEN 420

Query: 421 PRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDL 480
           PRIEECMAAIVAWG+LKNV EAE IF+ V+KT+ KLS++ Y TMLKVY D+KMLTKGK+L
Sbjct: 421 PRIEECMAAIVAWGRLKNVPEAERIFNIVVKTFTKLSTRHYYTMLKVYGDSKMLTKGKEL 480

Query: 481 VKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYA 540
           V QMA SGCRI P TWDAVVKLYVEAGEVEKADSFL KA Q+  MKPLFTSYM ++D YA
Sbjct: 481 VNQMAKSGCRIDPFTWDAVVKLYVEAGEVEKADSFLVKAAQQYGMKPLFTSYMTLMDHYA 540

Query: 541 RRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALA 600
           R+GDVHNAEK+FH+MR SGY+ R   F  LI+AY+NAK PAYGM+ERM  D +FPNKALA
Sbjct: 541 RKGDVHNAEKIFHKMRQSGYMPRLGQFGTLIRAYVNAKTPAYGMRERMMGDKLFPNKALA 600

Query: 601 GKLAQIDAFRKTAVSDLLD 619
           G+LAQ+D FRKTAVSDLLD
Sbjct: 601 GQLAQVDPFRKTAVSDLLD 608

BLAST of CmoCh06G000200.1 vs. TAIR10
Match: AT1G80270.1 (AT1G80270.1 PENTATRICOPEPTIDE REPEAT 596)

HSP 1 Score: 644.4 bits (1661), Expect = 7.0e-185
Identity = 330/574 (57.49%), Positives = 444/574 (77.35%), Query Frame = 1

Query: 52  SFERNNLATWRSSGL-------SISSHGLSSQAGAENSGEEDDLEDGFSELETLPSTNAL 111
           SF+ N++A+ +   +       S+S+  LSS AG ++  EEDDLEDGFSELE    + + 
Sbjct: 34  SFDSNSIASTKREAVPRFYEISSLSNRALSSSAGTKSDQEEDDLEDGFSELE---GSKSG 93

Query: 112 EDNKAADENEGELTSESELDDDGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGL 171
           + + ++DE+EG+L+++ E ++     ELDL  +ET++  K   K+  SELFK I SAPGL
Sbjct: 94  QGSTSSDEDEGKLSADEEEEE-----ELDL--IETDVSRKTVEKKQ-SELFKTIVSAPGL 153

Query: 172 SVPSALDKWVSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASR 231
           S+ SALDKWV EG E++R +I+ AML LRRRRM+G+ALQ SEWLEA+ ++E  +RDYASR
Sbjct: 154 SIGSALDKWVEEGNEITRVEIAKAMLQLRRRRMYGRALQMSEWLEANKKIEMTERDYASR 213

Query: 232 LDLIAKVHGLHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPI 291
           LDL  K+ GL + E  + KIPKSF+GEV+YRTLLANCV A NVKK+E VFNKMKDL FP+
Sbjct: 214 LDLTVKIRGLEKGEACMQKIPKSFKGEVLYRTLLANCVAAGNVKKSELVFNKMKDLGFPL 273

Query: 292 TAFACNQLLLLYKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVD 351
           + F C+Q+LLL+KR+D++KIADVLLLMEKEN+KPSL TYKILID KG +ND+ GMEQ+++
Sbjct: 274 SGFTCDQMLLLHKRIDRKKIADVLLLMEKENIKPSLLTYKILIDVKGATNDISGMEQILE 333

Query: 352 TMKAEGIELDVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQ 411
           TMK EG+ELD  T ++ A+HY+  GLKDKA+ +LKEME  S + +R   + LL +Y  L 
Sbjct: 334 TMKDEGVELDFQTQALTARHYSGAGLKDKAEKVLKEMEGESLEANRRAFKDLLSIYASLG 393

Query: 412 MEDEVRRVWKLCEANPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTML 471
            EDEV+R+WK+CE+ P  EE +AAI A+GKL  VQEAE IF++++K  ++ SS  YS +L
Sbjct: 394 REDEVKRIWKICESKPYFEESLAAIQAFGKLNKVQEAEAIFEKIVKMDRRASSSTYSVLL 453

Query: 472 KVYADNKMLTKGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQM 531
           +VY D+KML+KGKDLVK+MA+SGCRI   TWDA++KLYVEAGEVEKADS L KA +++  
Sbjct: 454 RVYVDHKMLSKGKDLVKRMAESGCRIEATTWDALIKLYVEAGEVEKADSLLDKASKQSHT 513

Query: 532 KPLFTSYMVILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMK 591
           K +  S+M I+D+Y++RGDVHN EK+F +MR +GY +R   FQAL+QAYINAK+PAYGM+
Sbjct: 514 KLMMNSFMYIMDEYSKRGDVHNTEKIFLKMREAGYTSRLRQFQALMQAYINAKSPAYGMR 573

Query: 592 ERMKADNVFPNKALAGKLAQIDAFRKTAVSDLLD 619
           +R+KADN+FPNK++A +LAQ D F+KTA+SD+LD
Sbjct: 574 DRLKADNIFPNKSMAAQLAQGDPFKKTAISDILD 596

BLAST of CmoCh06G000200.1 vs. TAIR10
Match: AT1G15480.1 (AT1G15480.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 640.2 bits (1650), Expect = 1.3e-183
Identity = 337/608 (55.43%), Positives = 444/608 (73.03%), Query Frame = 1

Query: 12  RNQGYRIRT-SYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLATWRSSGLSISS 71
           R+Q  R+   + V+ KL+ P       + S  +  I D+  +  R    +W SS      
Sbjct: 10  RSQSLRLGACNAVYSKLDIPLGERNIAIESNAL--IHDKHEALPRFYELSWSSS---TGR 69

Query: 72  HGLSSQAGAENSGEEDDLEDGFSELETLPSTNALEDNKAADENEGELTSESELDDDGTQN 131
             LSS AGA+ +G++DDLED   +L T        D  ++D  +GE  S  E D +G + 
Sbjct: 70  RSLSSDAGAKTTGDDDDLEDKNVDLAT-------PDETSSDSEDGEEFSGDEGDIEGAEL 129

Query: 132 ELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISLAML 191
           EL +PE            + PSE+FKAI S  GLSV SALDKWV +GK+ +R +   AML
Sbjct: 130 ELHVPE-----------SKRPSEMFKAIVSVSGLSVGSALDKWVEQGKDTNRKEFESAML 189

Query: 192 NLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKSFQG 251
            LR+RRMFG+ALQ +EWL+ + Q E  +RDYA RLDLI+KV G ++ E YI  IP+SF+G
Sbjct: 190 QLRKRRMFGRALQMTEWLDENKQFEMEERDYACRLDLISKVRGWYKGEAYIKTIPESFRG 249

Query: 252 EVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADVLLL 311
           E++YRTLLAN V  +NV+ AE VFNKMKDL FP++ F CNQ+L+LYKR+DK+KIADVLLL
Sbjct: 250 ELVYRTLLANHVATSNVRTAEAVFNKMKDLGFPLSTFTCNQMLILYKRVDKKKIADVLLL 309

Query: 312 MEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYASGGL 371
           +EKEN+KP+L TYKILID KG SND+ GMEQ+V+TMK+EG+ELD+   +++A+HYAS GL
Sbjct: 310 LEKENLKPNLNTYKILIDTKGSSNDITGMEQIVETMKSEGVELDLRARALIARHYASAGL 369

Query: 372 KDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEANPRIEECMAAIV 431
           K+KA+ +LKEME  S +E+R  C+ LL +YG LQ EDEVRRVWK+CE NPR  E +AAI+
Sbjct: 370 KEKAEKVLKEMEGESLEENRHMCKDLLSVYGYLQREDEVRRVWKICEENPRYNEVLAAIL 429

Query: 432 AWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSGCRI 491
           A+GK+  V++AE +F++VLK   ++SS  YS +L+VY D+KM+++GKDLVKQM+DSGC I
Sbjct: 430 AFGKIDKVKDAEAVFEKVLKMSHRVSSNVYSVLLRVYVDHKMVSEGKDLVKQMSDSGCNI 489

Query: 492 GPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNAEKM 551
           G LTWDAV+KLYVEAGEVEKA+S L KA+Q  Q+KPL +S+M ++ +Y RRGDVHN EK+
Sbjct: 490 GALTWDAVIKLYVEAGEVEKAESSLSKAIQSKQIKPLMSSFMYLMHEYVRRGDVHNTEKI 549

Query: 552 FHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDAFRK 611
           F RM+ +GY +RF  +Q LIQAY+NAKAPAYGMKERMKADN+FPNK LA +LA+ D F+K
Sbjct: 550 FQRMKQAGYQSRFWAYQTLIQAYVNAKAPAYGMKERMKADNIFPNKRLAAQLAKADPFKK 594

Query: 612 TAVSDLLD 619
           T +SDLLD
Sbjct: 610 TPLSDLLD 594

BLAST of CmoCh06G000200.1 vs. TAIR10
Match: AT3G15590.1 (AT3G15590.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 535.0 bits (1377), Expect = 6.0e-152
Identity = 275/549 (50.09%), Positives = 398/549 (72.50%), Query Frame = 1

Query: 71  HGLSSQAGAENSGEEDDLEDGFSELE-TLPSTNALEDNKAADENEGELTSESELDDDGTQ 130
           H LSS A A++ G+E   E+  SE E  +P +  + +    D++      E EL  D   
Sbjct: 69  HKLSSIADAKDKGDEVVREEELSESEEAVPVSGDVPEGVVDDDS----LFEPELGSDN-- 128

Query: 131 NELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKELSRADISLAM 190
           ++L++ E  ++ G K + KR  SEL+++I +    SV   L+KWV EGK+LS+A+++LA+
Sbjct: 129 DDLEIEEKHSKDGGKPTKKRGQSELYESIVAYK--SVKHVLEKWVKEGKDLSQAEVTLAI 188

Query: 191 LNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEGYIAKIPKSFQ 250
            NLR+R+ +   LQ  EWL A+ Q EF + +YAS+LDL+AKVH L +AE ++  IP+S +
Sbjct: 189 HNLRKRKSYAMCLQLWEWLGANTQFEFTEANYASQLDLVAKVHSLQKAEIFLKDIPESSR 248

Query: 251 GEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRLDKRKIADVLL 310
           GEV+YRTLLANCV+ ++V KAE++FNKMK+L+FP + FACNQLLLLY   D++KI+DVLL
Sbjct: 249 GEVVYRTLLANCVLKHHVNKAEDIFNKMKELKFPTSVFACNQLLLLYSMHDRKKISDVLL 308

Query: 311 LMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLSILAKHYASGG 370
           LME+EN+KPS  TY  LI++KGL+ D+ GME++V+T+K EGIELD    SILAK+Y   G
Sbjct: 309 LMERENIKPSRATYHFLINSKGLAGDITGMEKIVETIKEEGIELDPELQSILAKYYIRAG 368

Query: 371 LKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEANPRIEECMAAI 430
           LK++A+ ++KE+E    +++ W CR LLPLY ++   D VRR+ +  + NPR + C++AI
Sbjct: 369 LKERAQDLMKEIEGKGLQQTPWVCRSLLPLYADIGDSDNVRRLSRFVDQNPRYDNCISAI 428

Query: 431 VAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDLVKQMADSGCR 490
            AWGKLK V+EAE +F+R+++ +K      Y  ++++Y +NKML KG+DLVK+M ++G  
Sbjct: 429 KAWGKLKEVEEAEAVFERLVEKYKIFPMMPYFALMEIYTENKMLAKGRDLVKRMGNAGIA 488

Query: 491 IGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNAEK 550
           IGP TW A+VKLY++AGEV KA+  L +A + N+M+P+FT+YM IL++YA+RGDVHN EK
Sbjct: 489 IGPSTWHALVKLYIKAGEVGKAELILNRATKDNKMRPMFTTYMAILEEYAKRGDVHNTEK 548

Query: 551 MFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALAGKLAQIDAFR 610
           +F +M+ + Y A+   ++ ++ AYINAK PAYGM ERMKADNVFPNK+LA KLAQ++ F+
Sbjct: 549 VFMKMKRASYAAQLMQYETVLLAYINAKTPAYGMIERMKADNVFPNKSLAAKLAQVNPFK 608

Query: 611 KTAVSDLLD 619
           K  VS LLD
Sbjct: 609 KCPVSVLLD 609

BLAST of CmoCh06G000200.1 vs. TAIR10
Match: AT1G07590.1 (AT1G07590.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 164.9 bits (416), Expect = 1.6e-40
Identity = 113/456 (24.78%), Positives = 223/456 (48.90%), Query Frame = 1

Query: 163 GLSVPSALDKWVSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYA 222
           G++V SAL  W+ +G  +   D+  A+  LR+     +AL+  EW+         + +Y+
Sbjct: 78  GVTVGSALQSWMGDGFPVHGGDVYHAINRLRKLGRNKRALELMEWIIRERPYRLGELEYS 137

Query: 223 SRLDLIAKVHGLHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEF 282
             L+   K+HG+ + E    ++P+ FQ E++Y  L+  C+    ++ A E   KM++L +
Sbjct: 138 YLLEFTVKLHGVSQGEKLFTRVPQEFQNELLYNNLVIACLDQGVIRLALEYMKKMRELGY 197

Query: 283 PITAFACNQLLLLYKRLDKRK-IADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQ 342
             +    N+L++      +RK IA  L LM+ +   P + TY IL+  +   +++ G+ +
Sbjct: 198 RTSHLVYNRLIIRNSAPGRRKLIAKDLALMKADKATPHVSTYHILMKLEANEHNIDGVLK 257

Query: 343 VVDTMKAEGIELDVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYG 402
             D MK  G+E +  +  ILA  +A   L   A+A  +E+E   + ++     +L+ LYG
Sbjct: 258 AFDGMKKAGVEPNEVSYCILAMAHAVARLYTVAEAYTEEIEKSITGDNWSTLDILMILYG 317

Query: 403 ELQMEDEVRRVWKLCEA--NPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQ 462
            L  E E+ R W +     + R +  + A  A+ ++ N+  AEE++  +        ++Q
Sbjct: 318 RLGKEKELARTWNVIRGFHHVRSKSYLLATEAFARVGNLDRAEELWLEMKNVKGLKETEQ 377

Query: 463 YSTMLKVYADNKMLTKGKDLVKQMADSGCRIGPLTWD------AVVKLYVEAGEVEKADS 522
           ++++L VY  + ++ K   + ++M  +G +   +T+       A  KL  EA +  +   
Sbjct: 378 FNSLLSVYCKDGLIEKAIGVFREMTGNGFKPNSITYRHLALGCAKAKLMKEALKNIEMGL 437

Query: 523 FLQKAVQKNQMKPLFTSYMVILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAY 582
            L+ +       P   + + I++ +A +GDV N+EK+F  ++ + Y      + AL +AY
Sbjct: 438 NLKTSKSIGSSTPWLETTLSIIECFAEKGDVENSEKLFEEVKNAKYNRYAFVYNALFKAY 497

Query: 583 INAKAPAYGMKERMKADNVFPNKALAGKLAQIDAFR 610
           + AK     + +RM      P+      L  ++ ++
Sbjct: 498 VKAKVYDPNLFKRMVLGGARPDAESYSLLKLVEQYK 533

BLAST of CmoCh06G000200.1 vs. TAIR10
Match: AT1G60770.1 (AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 158.7 bits (400), Expect = 1.2e-38
Identity = 118/453 (26.05%), Positives = 221/453 (48.79%), Query Frame = 1

Query: 164 LSVPSALDKWVSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYAS 223
           + V   L++++   K + + ++   +  LR R ++  AL+ SE +E  G  + V  D A 
Sbjct: 37  VKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEVMEERGMNKTVS-DQAI 96

Query: 224 RLDLIAKVHGLHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNV-KKAEEVFNKMKDLEF 283
            LDL+AK   +   E Y   +P++ + E+ Y +LL NC     + +KAE + NKMK+L  
Sbjct: 97  HLDLVAKAREITAGENYFVDLPETSKTELTYGSLL-NCYCKELLTEKAEGLLNKMKELNI 156

Query: 284 PITAFACNQLLLLYKRL-DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQ 343
             ++ + N L+ LY +  +  K+  ++  ++ ENV P  +TY + + A   +ND+ G+E+
Sbjct: 157 TPSSMSYNSLMTLYTKTGETEKVPAMIQELKAENVMPDSYTYNVWMRALAATNDISGVER 216

Query: 344 VVDTMKAEG-IELDVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLY 403
           V++ M  +G +  D  T S +A  Y   GL  KA+  L+E+E  +++      + L+ LY
Sbjct: 217 VIEEMNRDGRVAPDWTTYSNMASIYVDAGLSQKAEKALQELEMKNTQRDFTAYQFLITLY 276

Query: 404 GELQMEDEVRRVWK-LCEANPRIEEC--MAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSS 463
           G L    EV R+W+ L  A P+      +  I    KL ++  AE +F            
Sbjct: 277 GRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPGAETLFKEWQANCSTYDI 336

Query: 464 KQYSTMLKVYADNKMLTKGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQK 523
           +  + ++  YA   ++ K  +L ++    G ++   TW+  +  YV++G++ +A   + K
Sbjct: 337 RIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMDYYVKSGDMARALECMSK 396

Query: 524 AV-----QKNQMKPLFTSYMVILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQA 583
           AV        +  P   +   ++  + ++ DV+ AE +   ++          F+ LI+ 
Sbjct: 397 AVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEILKNGTDNIGAEIFEPLIRT 456

Query: 584 YINAKAPAYGMKERMKADNVFPNKALAGKLAQI 606
           Y  A      M+ R+K +NV  N+A    L ++
Sbjct: 457 YAAAGKSHPAMRRRLKMENVEVNEATKKLLDEV 487

BLAST of CmoCh06G000200.1 vs. NCBI nr
Match: gi|659085107|ref|XP_008443247.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucumis melo])

HSP 1 Score: 941.8 bits (2433), Expect = 6.0e-271
Identity = 476/626 (76.04%), Positives = 545/626 (87.06%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRNQGY++RTSYVFGKLE P+  +GN+ G     A+SDR ISFERNNLAT
Sbjct: 1   MWALRRASTPLRNQGYKVRTSYVFGKLEVPFFWEGNVAGFGTTTALSDRFISFERNNLAT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W S+G+ ISSHGLS+QAGAENSGEED+++DGFSEL ETL ST+ LED+KAAD+NE ELTS
Sbjct: 61  WPSAGVYISSHGLSTQAGAENSGEEDNVKDGFSELDETLASTSPLEDSKAADDNEEELTS 120

Query: 121 ESELD-------DDGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDK 180
            SE+D       DDGTQNELDL E ET L EK S KR PSELF  IW APGLSV +ALDK
Sbjct: 121 GSEIDDDDDNAVDDGTQNELDLLEGETGLAEKKSTKRGPSELFNVIWKAPGLSVANALDK 180

Query: 181 WVSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVH 240
           WVSEGKELSRADISLAML LR+R+MFGKALQFSEWLEASG+L F D+DYASRLDLI K+ 
Sbjct: 181 WVSEGKELSRADISLAMLYLRKRQMFGKALQFSEWLEASGKLNFTDKDYASRLDLIGKLR 240

Query: 241 GLHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQL 300
           GL  AE Y+AKIPKSFQGEV+YRTLLANCV+A+NV+KAEEVFNKMKDLEFPITAFAC+QL
Sbjct: 241 GLRMAENYLAKIPKSFQGEVVYRTLLANCVIASNVQKAEEVFNKMKDLEFPITAFACSQL 300

Query: 301 LLLYKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIE 360
           LLLY+R DKRKIAD+LLLMEKENVKPS FTYKILIDAKGLSND+ GMEQVVDTMKA+GIE
Sbjct: 301 LLLYRRTDKRKIADILLLMEKENVKPSRFTYKILIDAKGLSNDISGMEQVVDTMKADGIE 360

Query: 361 LDVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRV 420
           LD +TL++LAKHY SGGLKDKAKA LK+ME+++S+ SRWPCR+LLP YGEL+MEDEVRR+
Sbjct: 361 LDFDTLALLAKHYVSGGLKDKAKATLKQMEEINSQGSRWPCRILLPRYGELEMEDEVRRL 420

Query: 421 WKLCEANPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKM 480
           W++CE++P IEECMAAIVAWGKLKNVQEAE+IFDRV+K+ KKLS++ YSTM+ VY   KM
Sbjct: 421 WEICESDPHIEECMAAIVAWGKLKNVQEAEKIFDRVVKSGKKLSARHYSTMMNVYRQTKM 480

Query: 481 LTKGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYM 540
           LTKGK+LV QMA+SGCR+ PLTWDAVVK YVEAGEVEKADSFL KAVQ+N+ KPLF +YM
Sbjct: 481 LTKGKELVNQMAESGCRMDPLTWDAVVKFYVEAGEVEKADSFLVKAVQQNKKKPLFATYM 540

Query: 541 VILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNV 600
            ++  YA RGDV NAE +F RMR  GY+ RF+ FQ L+QAY+NAKAPAYGM+ERM  DN+
Sbjct: 541 TLMHHYASRGDVPNAENIFDRMRRLGYMGRFTQFQTLVQAYVNAKAPAYGMRERMMVDNI 600

Query: 601 FPNKALAGKLAQIDAFRKTAVSDLLD 619
           FPNKALAGKLAQ+D FR T VSDLLD
Sbjct: 601 FPNKALAGKLAQVDPFRMTEVSDLLD 626

BLAST of CmoCh06G000200.1 vs. NCBI nr
Match: gi|778685180|ref|XP_011652180.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X1 [Cucumis sativus])

HSP 1 Score: 916.0 bits (2366), Expect = 3.5e-263
Identity = 472/624 (75.64%), Positives = 539/624 (86.38%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRNQGYR+RTSYVFGKLE PY  +GN+ G     A+SDR I F+RNNL T
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SS + ISSHGLS+QAGAENSGEE ++EDG SEL ETLPST+ LED+K AD+NE ELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 ESELDDDGTQ----NELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVS 180
            SE+DDD        ELDLPE ET L EKIS KRAPSEL   IW APGL+V SALDKWVS
Sbjct: 121 GSEIDDDNDVVDDGTELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKWVS 180

Query: 181 EGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLH 240
           EGKELSR DIS AMLNLR+ RM+GKALQFSEWLEA+G+L+FV++DYASRLDLI K+ GL 
Sbjct: 181 EGKELSRDDISSAMLNLRKCRMYGKALQFSEWLEANGKLDFVEKDYASRLDLIGKLRGLR 240

Query: 241 RAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLL 300
            AE YIAKIPKSFQGEV+YRTLLANCV+A NV+KAEEVFNKMKDLEFPITAFACNQLLLL
Sbjct: 241 MAENYIAKIPKSFQGEVVYRTLLANCVIACNVQKAEEVFNKMKDLEFPITAFACNQLLLL 300

Query: 301 YKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDV 360
           YKR DKRK+AD+LLLMEKENVKPS FTY+ILID KGLSND+ GMEQVVDTMKAEGIELDV
Sbjct: 301 YKRTDKRKVADILLLMEKENVKPSRFTYRILIDTKGLSNDITGMEQVVDTMKAEGIELDV 360

Query: 361 NTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKL 420
           +TLS+LAKHY SGGLKDKAKAILKEME+++S+ SRWPCR+LLPLYGELQMEDEVRR+W++
Sbjct: 361 STLSVLAKHYISGGLKDKAKAILKEMEEINSEGSRWPCRILLPLYGELQMEDEVRRLWEI 420

Query: 421 CEANPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVY-ADNKMLT 480
           C +NP IEECMAAIVAWGKLKN+QEAE+IFDRV+KT +KLS++ YSTML VY  D+KMLT
Sbjct: 421 CGSNPHIEECMAAIVAWGKLKNIQEAEKIFDRVVKTGEKLSARHYSTMLNVYREDSKMLT 480

Query: 481 KGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVI 540
           KGK++VKQMA+SG R+ P+T DAVVKLYVEAGE EKADSFL K V + + KP+FT+Y+ +
Sbjct: 481 KGKEVVKQMAESGSRMDPVTLDAVVKLYVEAGEGEKADSFLVKTVLQYKKKPMFTTYITL 540

Query: 541 LDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFP 600
           +D+YA RGDV NAEK+F  MR  GYV R S FQ LIQAY+NAKAPAYGM+ERMKAD+VFP
Sbjct: 541 MDRYASRGDVPNAEKIFGMMRKYGYVGRLSHFQTLIQAYVNAKAPAYGMRERMKADSVFP 600

Query: 601 NKALAGKLAQIDAFRKTAVSDLLD 619
           NKALAGKLAQ+D+ +   VSDLLD
Sbjct: 601 NKALAGKLAQVDSLKMREVSDLLD 624

BLAST of CmoCh06G000200.1 vs. NCBI nr
Match: gi|778685174|ref|XP_011652178.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 901.7 bits (2329), Expect = 6.9e-259
Identity = 467/626 (74.60%), Positives = 532/626 (84.98%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRNQGYR+RTSYVFGKLE PY  +GN+ G     A+SDR I F+RNNL T
Sbjct: 1   MWALRRASTPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTAAALSDRFIYFDRNNLTT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SS + ISSHGLS+QAGAENSGEE ++EDG SEL ETLPST+ LED+K AD+NE ELTS
Sbjct: 61  WPSSEVYISSHGLSTQAGAENSGEEGNVEDGCSELDETLPSTSPLEDSKTADDNEEELTS 120

Query: 121 ESELDDD------GTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKW 180
            SE+DDD      GTQNELDLPE ET L EKIS KRAPSEL   IW APGL+V SALDKW
Sbjct: 121 GSEIDDDDDVVDDGTQNELDLPEGETGLVEKISIKRAPSELLNVIWKAPGLTVSSALDKW 180

Query: 181 VSEGKELSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHG 240
           VSEGKELSR DIS AMLNLR+ RM+GKALQFSEWLE SG+L+F++ DYASRL LI K+ G
Sbjct: 181 VSEGKELSRDDISSAMLNLRKCRMYGKALQFSEWLETSGKLDFIENDYASRLYLIGKLRG 240

Query: 241 LHRAEGYIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLL 300
           L  AE YIAKIPKSFQGEV+Y+TLL NCV+A+NV KAE+VFNKMK+LEFPITAFACNQLL
Sbjct: 241 LRMAENYIAKIPKSFQGEVVYQTLLVNCVIASNVHKAEKVFNKMKNLEFPITAFACNQLL 300

Query: 301 LLYKRLDKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIEL 360
           LLYKR DKRKIADVLLLM+KENVK S  TY+ILID  GLSND+ GME+VVD+MKAEGI+L
Sbjct: 301 LLYKRTDKRKIADVLLLMKKENVKYSTSTYRILIDVNGLSNDITGMEEVVDSMKAEGIKL 360

Query: 361 DVNTLSILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVW 420
           DV TLS L KHY SGGLKDKAKA+LKEME+++S+ SR PCR+LLPLYGELQMEDEVRR+W
Sbjct: 361 DVETLSRLVKHYVSGGLKDKAKAVLKEMEEINSEGSRRPCRILLPLYGELQMEDEVRRLW 420

Query: 421 KLCEANPRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVY-ADNKM 480
           ++CE+NP IEECMAAIVAWGKLKNVQEAE+IFDRV+KT +KLS++ YSTML VY  D+KM
Sbjct: 421 EICESNPHIEECMAAIVAWGKLKNVQEAEKIFDRVVKTGEKLSARHYSTMLNVYREDSKM 480

Query: 481 LTKGKDLVKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYM 540
           LTKGK++VKQMA+SGCR+ P T DAVVKLYVEAGEVEKADSFL KAV +N+ KP+FT+Y+
Sbjct: 481 LTKGKEVVKQMAESGCRMDPFTLDAVVKLYVEAGEVEKADSFLVKAVLQNKKKPMFTTYI 540

Query: 541 VILDQYARRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNV 600
            ++D+YA RGDV N EK F  MR  GYV R S FQ LIQAY+NAKAPAYGM+ERMKADNV
Sbjct: 541 TLMDRYASRGDVPNVEKNFAMMRRLGYVGRLSQFQTLIQAYVNAKAPAYGMRERMKADNV 600

Query: 601 FPNKALAGKLAQIDAFRKTAVSDLLD 619
           FPNK LAGKLAQ+D  +   VSDLLD
Sbjct: 601 FPNKDLAGKLAQVDCLKMRKVSDLLD 626

BLAST of CmoCh06G000200.1 vs. NCBI nr
Match: gi|659085109|ref|XP_008443248.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X1 [Cucumis melo])

HSP 1 Score: 897.9 bits (2319), Expect = 1.0e-257
Identity = 461/619 (74.47%), Positives = 522/619 (84.33%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRAS PLRNQGYR+RTSYVFGKLE PY  +GN+ G     A+SDR ISFERNNL T
Sbjct: 1   MWALRRASAPLRNQGYRVRTSYVFGKLEVPYFWEGNVAGFGTTAALSDRFISFERNNLET 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSELETLPSTNALEDNKAADENEGELTSE 120
           W SSG+ ISSHGLS+QAGAENSGEE ++ED                      NE ELTS 
Sbjct: 61  WPSSGVYISSHGLSTQAGAENSGEEGNVED----------------------NEEELTSG 120

Query: 121 SELDDDG-TQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKE 180
           SE+DDD  TQNELDLPE ET L EKIS K APSELF  IW APGLSVPSALDKWVSEGKE
Sbjct: 121 SEIDDDDETQNELDLPEGETGLAEKISTKGAPSELFNIIWKAPGLSVPSALDKWVSEGKE 180

Query: 181 LSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEG 240
           LSRADISL ML LRRRRMFGKAL+FSEWLEA+G+L   DRDYAS+LDLI K+ GL  AE 
Sbjct: 181 LSRADISLTMLYLRRRRMFGKALKFSEWLEANGKL-VTDRDYASQLDLIGKLRGLRMAEN 240

Query: 241 YIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRL 300
           YI+KIPKSFQGEV+YRTLLANCV++ NV+KAEEVFNKMKDLEFPITAFACNQLLLLYKR 
Sbjct: 241 YISKIPKSFQGEVVYRTLLANCVMSTNVRKAEEVFNKMKDLEFPITAFACNQLLLLYKRT 300

Query: 301 DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLS 360
           DK+KIADVLLLMEKENVKPS FTYKILIDAKGLSND+ GMEQVVDTMKAEGI+L V TL 
Sbjct: 301 DKKKIADVLLLMEKENVKPSPFTYKILIDAKGLSNDISGMEQVVDTMKAEGIKLGVGTLL 360

Query: 361 ILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEAN 420
           +LAKHY S GLKDKAKA LKE E+++SK SR PCR LLPLYGELQMEDEVRR+W++CE+N
Sbjct: 361 LLAKHYVSAGLKDKAKATLKETEEINSKGSRRPCRFLLPLYGELQMEDEVRRLWEICESN 420

Query: 421 PRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDL 480
           P +EECMAAIVAWGKLKNVQEAE+IFDRV+KT KKLS++ YSTM+ VY D+KMLTKGK+L
Sbjct: 421 PHVEECMAAIVAWGKLKNVQEAEKIFDRVVKTGKKLSTRHYSTMMNVYRDSKMLTKGKEL 480

Query: 481 VKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYA 540
           V QMA+SGC + P TWDAVVKLYVEAGEVEKADSFL KAVQ+++ KPLF +Y+ ++D YA
Sbjct: 481 VNQMAESGCSMDPFTWDAVVKLYVEAGEVEKADSFLVKAVQQSKKKPLFATYIALMDHYA 540

Query: 541 RRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALA 600
            RGDV NAE++F ++R+ GYV RF+ +Q LIQAY+NAK PAYGM+ERMKADN+FPNKALA
Sbjct: 541 SRGDVPNAERIFDKLRILGYVGRFTQYQTLIQAYVNAKTPAYGMRERMKADNIFPNKALA 596

Query: 601 GKLAQIDAFRKTAVSDLLD 619
           G+LAQ+D+F+ T VSDLLD
Sbjct: 601 GQLAQVDSFKMTDVSDLLD 596

BLAST of CmoCh06G000200.1 vs. NCBI nr
Match: gi|659085115|ref|XP_008443253.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-like isoform X2 [Cucumis melo])

HSP 1 Score: 895.6 bits (2313), Expect = 4.9e-257
Identity = 457/619 (73.83%), Positives = 523/619 (84.49%), Query Frame = 1

Query: 1   MWALRRASTPLRNQGYRIRTSYVFGKLEAPYSCDGNIVGSAIIPAISDRCISFERNNLAT 60
           MWALRRASTPLRN GYR+RTSYVFGKLE PY  +GNI G      +S R ISFERNNLAT
Sbjct: 1   MWALRRASTPLRNHGYRVRTSYVFGKLEVPYFGEGNIAGFGTTATLSGRFISFERNNLAT 60

Query: 61  WRSSGLSISSHGLSSQAGAENSGEEDDLEDGFSEL-ETLPSTNALEDNKAADENEGELTS 120
           W SSG+ ISSHGLS+QAGAENSGEED++EDGFSEL ET P+T +    + +D++E  +  
Sbjct: 61  WPSSGIYISSHGLSTQAGAENSGEEDNVEDGFSELDETHPTTRS----EISDDDENVV-- 120

Query: 121 ESELDDDGTQNELDLPEVETELGEKISAKRAPSELFKAIWSAPGLSVPSALDKWVSEGKE 180
                DDGTQNELDLPE ETEL EKIS K  PSEL +AIW+AP LSV SALDKWVSEG E
Sbjct: 121 -----DDGTQNELDLPEGETELAEKISRKWVPSELNEAIWNAPALSVTSALDKWVSEGHE 180

Query: 181 LSRADISLAMLNLRRRRMFGKALQFSEWLEASGQLEFVDRDYASRLDLIAKVHGLHRAEG 240
           LSR DIS  M  LR+RRMFGKALQFSEWLEASGQLEF + DYAS LDLIAKV GLH+AE 
Sbjct: 181 LSRDDISSTMFGLRKRRMFGKALQFSEWLEASGQLEFNEADYASHLDLIAKVQGLHKAET 240

Query: 241 YIAKIPKSFQGEVIYRTLLANCVVANNVKKAEEVFNKMKDLEFPITAFACNQLLLLYKRL 300
           YIAKIP SF+GE +YRTLLAN V+AN+VKKAEEVFN+MKDLEFP+T FA +Q+L+LYKR+
Sbjct: 241 YIAKIPNSFRGEAVYRTLLANYVLANDVKKAEEVFNRMKDLEFPMTTFAYDQMLILYKRI 300

Query: 301 DKRKIADVLLLMEKENVKPSLFTYKILIDAKGLSNDMVGMEQVVDTMKAEGIELDVNTLS 360
           D+R+IAD+L LMEKENVKP  FTYKILIDAKGLSND+ GMEQVVDTMKAEGI+LDV+TL 
Sbjct: 301 DRRRIADILSLMEKENVKPRPFTYKILIDAKGLSNDISGMEQVVDTMKAEGIKLDVDTLL 360

Query: 361 ILAKHYASGGLKDKAKAILKEMEDVSSKESRWPCRLLLPLYGELQMEDEVRRVWKLCEAN 420
           +LAKHY  GGLKDKA  ILK  E+V+SK SRWPCR LLPLYGELQMEDEVRR+W++CE N
Sbjct: 361 LLAKHYVLGGLKDKAMPILKATEEVNSKGSRWPCRYLLPLYGELQMEDEVRRLWEICEPN 420

Query: 421 PRIEECMAAIVAWGKLKNVQEAEEIFDRVLKTWKKLSSKQYSTMLKVYADNKMLTKGKDL 480
           P +EECMAAIVAWGKLKN+QEAE+IFDRV+KTWK+LS+K YSTM+KVY D+KMLTKGK+L
Sbjct: 421 PNVEECMAAIVAWGKLKNIQEAEKIFDRVVKTWKRLSTKHYSTMIKVYGDSKMLTKGKEL 480

Query: 481 VKQMADSGCRIGPLTWDAVVKLYVEAGEVEKADSFLQKAVQKNQMKPLFTSYMVILDQYA 540
           V QMA SGCRI P+ WDAVVKLYVEAGEVEKADSFL KAV++  MKPLF SY  ++  YA
Sbjct: 481 VNQMAKSGCRIDPMIWDAVVKLYVEAGEVEKADSFLFKAVKQYGMKPLFDSYRTLMVHYA 540

Query: 541 RRGDVHNAEKMFHRMRLSGYVARFSPFQALIQAYINAKAPAYGMKERMKADNVFPNKALA 600
           R+GDVHN+EK+FH++R SGY   F  F  L+QAY+NAK PAYGM+ERM ADNVFPNKALA
Sbjct: 541 RKGDVHNSEKIFHKIRQSGYPTHFGQFVTLVQAYLNAKTPAYGMRERMMADNVFPNKALA 600

Query: 601 GKLAQIDAFRKTAVSDLLD 619
           GKLAQ+D+FR+T VSDLLD
Sbjct: 601 GKLAQLDSFRQTTVSDLLD 608
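
The percentages in each HSP header follow directly from the reported counts: identities (or positives) divided by the alignment length. The sketch below is a minimal illustration, assuming the header line format used in the alignments above; the function name `parse_hsp_stats` and the returned field names are hypothetical and not part of any BLAST tool. The pattern accepts both "Positives" and the misspelled "Postives".

```python
import re

# Matches the statistics line of an HSP header, e.g.
# "Identity = 457/619 (73.83%), Positives = 523/619 (84.49%), Query Frame = 1"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract the counts from one HSP statistics line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identities, length, reported_ident, positives, _, reported_pos = match.groups()
    identities, length, positives = int(identities), int(length), int(positives)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
        # Values as printed in the header, kept for comparison.
        "reported_identity_pct": float(reported_ident),
        "reported_positive_pct": float(reported_pos),
    }


if __name__ == "__main__":
    header = "Identity = 457/619 (73.83%), Postives = 523/619 (84.49%), Query Frame = 1"
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported percentages is a quick consistency check when these headers are copied out of the page by hand.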

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
PP135_ARATH   1.2e-183  57.49     Pentatricopeptide repeat-containing protein At1g80270, mitochondrial OS=Arabidop...
PPR44_ARATH   2.3e-182  55.43     Pentatricopeptide repeat-containing protein At1g15480, mitochondrial OS=Arabidop...
PP234_ARATH   1.1e-150  50.09     Pentatricopeptide repeat-containing protein At3g15590, mitochondrial OS=Arabidop...
PPR19_ARATH   2.9e-39   24.78     Pentatricopeptide repeat-containing protein At1g07590, mitochondrial OS=Arabidop...
PPR86_ARATH   2.1e-37   26.05     Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana GN...
Match Name        E-value   Identity  Description
A0A0A0LEU5_CUCSA  2.5e-263  75.64     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819910 PE=4 SV=1
A0A0A0LHD8_CUCSA  4.8e-259  74.60     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819930 PE=4 SV=1
E5GBN7_CUCME      3.4e-257  73.83     DNA-binding protein OS=Cucumis melo subsp. melo PE=4 SV=1
A0A0A0LFG6_CUCSA  3.6e-254  74.40     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819890 PE=4 SV=1
A0A0A0LFH0_CUCSA  7.9e-254  73.83     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G819940 PE=4 SV=1
Match Name   E-value   Identity  Description
AT1G80270.1  7.0e-185  57.49     PENTATRICOPEPTIDE REPEAT 596
AT1G15480.1  1.3e-183  55.43     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G15590.1  6.0e-152  50.09     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G07590.1  1.6e-40   24.78     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G60770.1  1.2e-38   26.05     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                        E-value   Identity  Description
gi|659085107|ref|XP_008443247.1|  6.0e-271  76.04     PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-...
gi|778685180|ref|XP_011652180.1|  3.5e-263  75.64     PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-...
gi|778685174|ref|XP_011652178.1|  6.9e-259  74.60     PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-...
gi|659085109|ref|XP_008443248.1|  1.0e-257  74.47     PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-...
gi|659085115|ref|XP_008443253.1|  4.9e-257  73.83     PREDICTED: pentatricopeptide repeat-containing protein At1g80270, mitochondrial-...
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR002885   Pentatricopeptide_repeat
IPR011990   TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003677 DNA binding

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CmoCh06G000200  CmoCh06G000200  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CmoCh06G000200.1  CmoCh06G000200.1-protein  polypeptide


The following exon feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmoCh06G000200.1.exon.3  CmoCh06G000200.1.exon.3  exon
CmoCh06G000200.1.exon.2  CmoCh06G000200.1.exon.2  exon
CmoCh06G000200.1.exon.1  CmoCh06G000200.1.exon.1  exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                       Unique Name                        Type
CmoCh06G000200.1.five_prime_UTR.1  CmoCh06G000200.1.five_prime_UTR.1  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CmoCh06G000200.1.CDS.1  CmoCh06G000200.1.CDS.1  CDS
CmoCh06G000200.1.CDS.2  CmoCh06G000200.1.CDS.2  CDS
CmoCh06G000200.1.CDS.3  CmoCh06G000200.1.CDS.3  CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                        Unique Name                         Type
CmoCh06G000200.1.three_prime_UTR.1  CmoCh06G000200.1.three_prime_UTR.1  three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                        Source    Source Term        Source Description  Alignment
IPR002885  Pentatricopeptide repeat               PFAM      PF01535            PPR
    coord: 530..558  score: 8.2E-4
    coord: 460..488  score: 0.021
    coord: 252..280  score: 0.0041
    coord: 494..520  score:
IPR002885  Pentatricopeptide repeat               PFAM      PF13812            PPR_3
    coord: 286..329  score: 3.5E-4
    coord: 342..384  score: 0.
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756          TIGR00756
    coord: 530..560  score: 2.4E-4
    coord: 460..489  score: 5.0E-4
    coord: 252..283  score: 1.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375            PPR
    coord: 421..451  score: 5.218
    coord: 354..388  score: 7.311
    coord: 456..490  score: 8.857
    coord: 250..284  score: 8.276
    coord: 527..561  score: 8.835
    coord: 319..353  score: 7.443
    coord: 491..526  score: 8
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10
    coord: 256..282  score: 1.3E-6
    coord: 349..468  score: 1.3E-6
    coord: 487..552  score: 6.
None       No IPR available                       PANTHER   PTHR24015          FAMILY NOT NAMED
    coord: 43..601  score: 1.6E
None       No IPR available                       PANTHER   PTHR24015:SF26     SUBFAMILY NOT NAMED
    coord: 43..601  score: 1.6E