CmoCh08G001590.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh08G001590.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCmo_Chr08 : 920999 .. 922513 (+)
Sequence length1515
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACTGTTAGGTGGCCAAGGCTTTTAACGCCCACACACCTGTCTCAGATTATTAGGAAGCAGAACAATCCTTTCACAGCTTACCAACTGTTCAATGAAGCCAAATGTAGGTATCCAAATTATCAGCACAATGGTCCGGTGTACGCCGCAATGATCAATATACTTGGAAACTCGGGTAGAATTTCCGAGATGAGGGAAGTGATAGATCAGATGAAAGTTGACTCTTGTCAGTGCAAGGATTCTATATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGAAGGTATATCTCTGTTTAAAAGCCTTGGGGGATTTAACTGTACCGATAGAACACAAACTTTCAATACCCTTTTGGAAATACTCTTGAATGAATCTCAGCTCGATGCTGCTTGTCAGCTTTTTCAGCAGAGTTCATTTGGTTGGGAAGTGAAATCCAGGACTCAGTCATTGAATTTGCTAATGCAATCTCTTTGTCAAAGAGGCCAATCTGAACTTGCTTTACATGTCTTTAAAGAAATGGATTACCAAAGTTGCTATCCTAATAGGCTGAGTTATTTGATTCTAATGAAAGGACTGTGTCAAGATGGTAAGCTTCATGAGGCCATCCATTTATTGTATTCCATGTTCTGGAGGATTTCTCGAAGGGGTAGCGGAGGGGACATAGTAATTTACAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATTGAGCAAGCTGTGGAAATACTAGGCAAGATCTTGAAGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACTTGATTGACCTCAACTACTGCAGGATTAGCAAGCTCACCGTCACGGAAATCAAGTGTTTAATCAATGAAGCTTTAATCAAAGGTGGAATTCCCAGTTCAGATAGTTATTGTGCCATGGCTATCGATCTATATAACGAAAACGAGACTGATCAGGGAGATAAAGTTGTTAGCCACATGCTAGCTAAAGGCTTTTGGCCACCATCCTCAGTCTATGAGGCGAAAGCAGCTGCATTATGCAAAGAAGGGAAAGTTGATGATGCAGTGAAAGTAATTGAAGAGGAAACGGTGAAGGGAAGTTGCGTTCCAACCGTTGCGTTGTATAACATCGTTCTGAATGGTCTTTGTAGGGTGGGCAAGTCAACAGTGGCTATGGAGTTCTTGAAGAAAATGGTAAAGCAGGTTGGTCTTGTTGCAGACAAGGAGACTTATAGCACTTTAGTACATGGTCTTTGTCGTGAGAATAGATACACTGAAGCATGTAAGTTGTTGGAGGAGATGGTTATCAAATCACATTGGCCTTGTTCTAACACATTCAATACACTTACCAGAGGTCTTTGCTCGGTGGGAAAACCATATAAAGCAGTGATGTGCTTGGAAGAAATGATTAGCCAAGGCCAATTGCCGGAACTTTCTGTTTGGAATGCTTTGGTTTCATCTTTGTGTTTCAATGTGGCTGACACTGATATGTGGTCTAAGGTCTTACGACAGATACGAAGTTGTTGA

mRNA sequence

ATGACTGTTAGGTGGCCAAGGCTTTTAACGCCCACACACCTGTCTCAGATTATTAGGAAGCAGAACAATCCTTTCACAGCTTACCAACTGTTCAATGAAGCCAAATGTAGGTATCCAAATTATCAGCACAATGGTCCGGTGTACGCCGCAATGATCAATATACTTGGAAACTCGGGTAGAATTTCCGAGATGAGGGAAGTGATAGATCAGATGAAAGTTGACTCTTGTCAGTGCAAGGATTCTATATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGAAGGTATATCTCTGTTTAAAAGCCTTGGGGGATTTAACTGTACCGATAGAACACAAACTTTCAATACCCTTTTGGAAATACTCTTGAATGAATCTCAGCTCGATGCTGCTTGTCAGCTTTTTCAGCAGAGTTCATTTGGTTGGGAAGTGAAATCCAGGACTCAGTCATTGAATTTGCTAATGCAATCTCTTTGTCAAAGAGGCCAATCTGAACTTGCTTTACATGTCTTTAAAGAAATGGATTACCAAAGTTGCTATCCTAATAGGCTGAGTTATTTGATTCTAATGAAAGGACTGTGTCAAGATGGTAAGCTTCATGAGGCCATCCATTTATTGTATTCCATGTTCTGGAGGATTTCTCGAAGGGGTAGCGGAGGGGACATAGTAATTTACAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATTGAGCAAGCTGTGGAAATACTAGGCAAGATCTTGAAGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACTTGATTGACCTCAACTACTGCAGGATTAGCAAGCTCACCGTCACGGAAATCAAGTGTTTAATCAATGAAGCTTTAATCAAAGGTGGAATTCCCAGTTCAGATAGTTATTGTGCCATGGCTATCGATCTATATAACGAAAACGAGACTGATCAGGGAGATAAAGTTGTTAGCCACATGCTAGCTAAAGGCTTTTGGCCACCATCCTCAGTCTATGAGGCGAAAGCAGCTGCATTATGCAAAGAAGGGAAAGTTGATGATGCAGTGAAAGTAATTGAAGAGGAAACGGTGAAGGGAAGTTGCGTTCCAACCGTTGCGTTGTATAACATCGTTCTGAATGGTCTTTGTAGGGTGGGCAAGTCAACAGTGGCTATGGAGTTCTTGAAGAAAATGGTAAAGCAGGTTGGTCTTGTTGCAGACAAGGAGACTTATAGCACTTTAGTACATGGTCTTTGTCGTGAGAATAGATACACTGAAGCATGTAAGTTGTTGGAGGAGATGGTTATCAAATCACATTGGCCTTGTTCTAACACATTCAATACACTTACCAGAGGTCTTTGCTCGGTGGGAAAACCATATAAAGCAGTGATGTGCTTGGAAGAAATGATTAGCCAAGGCCAATTGCCGGAACTTTCTGTTTGGAATGCTTTGGTTTCATCTTTGTGTTTCAATGTGGCTGACACTGATATGTGGTCTAAGGTCTTACGACAGATACGAAGTTGTTGA

Coding sequence (CDS)

ATGACTGTTAGGTGGCCAAGGCTTTTAACGCCCACACACCTGTCTCAGATTATTAGGAAGCAGAACAATCCTTTCACAGCTTACCAACTGTTCAATGAAGCCAAATGTAGGTATCCAAATTATCAGCACAATGGTCCGGTGTACGCCGCAATGATCAATATACTTGGAAACTCGGGTAGAATTTCCGAGATGAGGGAAGTGATAGATCAGATGAAAGTTGACTCTTGTCAGTGCAAGGATTCTATATTTTCATTTGCAATTAAAACGTATGCTAGTCATGGATTATTGGAAGAAGGTATATCTCTGTTTAAAAGCCTTGGGGGATTTAACTGTACCGATAGAACACAAACTTTCAATACCCTTTTGGAAATACTCTTGAATGAATCTCAGCTCGATGCTGCTTGTCAGCTTTTTCAGCAGAGTTCATTTGGTTGGGAAGTGAAATCCAGGACTCAGTCATTGAATTTGCTAATGCAATCTCTTTGTCAAAGAGGCCAATCTGAACTTGCTTTACATGTCTTTAAAGAAATGGATTACCAAAGTTGCTATCCTAATAGGCTGAGTTATTTGATTCTAATGAAAGGACTGTGTCAAGATGGTAAGCTTCATGAGGCCATCCATTTATTGTATTCCATGTTCTGGAGGATTTCTCGAAGGGGTAGCGGAGGGGACATAGTAATTTACAGAACCCTTCTGTTTGCTTTGTGTGATAATGGAGAGATTGAGCAAGCTGTGGAAATACTAGGCAAGATCTTGAAGAAAGGACTGAAAGCCCCTAAGCGAGCTCATTACTTGATTGACCTCAACTACTGCAGGATTAGCAAGCTCACCGTCACGGAAATCAAGTGTTTAATCAATGAAGCTTTAATCAAAGGTGGAATTCCCAGTTCAGATAGTTATTGTGCCATGGCTATCGATCTATATAACGAAAACGAGACTGATCAGGGAGATAAAGTTGTTAGCCACATGCTAGCTAAAGGCTTTTGGCCACCATCCTCAGTCTATGAGGCGAAAGCAGCTGCATTATGCAAAGAAGGGAAAGTTGATGATGCAGTGAAAGTAATTGAAGAGGAAACGGTGAAGGGAAGTTGCGTTCCAACCGTTGCGTTGTATAACATCGTTCTGAATGGTCTTTGTAGGGTGGGCAAGTCAACAGTGGCTATGGAGTTCTTGAAGAAAATGGTAAAGCAGGTTGGTCTTGTTGCAGACAAGGAGACTTATAGCACTTTAGTACATGGTCTTTGTCGTGAGAATAGATACACTGAAGCATGTAAGTTGTTGGAGGAGATGGTTATCAAATCACATTGGCCTTGTTCTAACACATTCAATACACTTACCAGAGGTCTTTGCTCGGTGGGAAAACCATATAAAGCAGTGATGTGCTTGGAAGAAATGATTAGCCAAGGCCAATTGCCGGAACTTTCTGTTTGGAATGCTTTGGTTTCATCTTTGTGTTTCAATGTGGCTGACACTGATATGTGGTCTAAGGTCTTACGACAGATACGAAGTTGTTGA
BLAST of CmoCh08G001590.1 vs. Swiss-Prot
Match: PPR11_ARATH (Pentatricopeptide repeat-containing protein At1g05600 OS=Arabidopsis thaliana GN=At1g05600 PE=2 SV=1)

HSP 1 Score: 558.9 bits (1439), Expect = 5.6e-158
Identity = 269/488 (55.12%), Postives = 365/488 (74.80%), Query Frame = 1

Query: 3   VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRIS 62
           VRWPR+LTP+ LSQI++KQ NP TA +LF EAK R+P+Y HNG VYA MI+ILG S R+ 
Sbjct: 4   VRWPRVLTPSLLSQILKKQKNPVTALKLFEEAKERFPSYGHNGSVYATMIDILGKSNRVL 63

Query: 63  EMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLL 122
           EM+ VI++MK DSC+CKDS+F+  I+T++  G LE+ ISLFKSL  FNC + + +F+TLL
Sbjct: 64  EMKYVIERMKEDSCECKDSVFASVIRTFSRAGRLEDAISLFKSLHEFNCVNWSLSFDTLL 123

Query: 123 EILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC 182
           + ++ ES+L+AAC +F++  +GWEV SR  +LNLLM+ LCQ  +S+LA  VF+EM+YQ C
Sbjct: 124 QEMVKESELEAACHIFRKYCYGWEVNSRITALNLLMKVLCQVNRSDLASQVFQEMNYQGC 183

Query: 183 YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIE 242
           YP+R SY ILMKG C +GKL EA HLLYSMFWRIS++GSG DIV+YR LL ALCD GE++
Sbjct: 184 YPDRDSYRILMKGFCLEGKLEEATHLLYSMFWRISQKGSGEDIVVYRILLDALCDAGEVD 243

Query: 243 QAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA 302
            A+EILGKIL+KGLKAPKR ++ I+  +   S   +  +K L+ E LI+G IP  DSY A
Sbjct: 244 DAIEILGKILRKGLKAPKRCYHHIEAGHWESSSEGIERVKRLLTETLIRGAIPCLDSYSA 303

Query: 303 MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKG 362
           MA DL+ E +  +G++V+  M +KGF P   +Y AK  ALC+ GK+ +AV VI +E ++G
Sbjct: 304 MATDLFEEGKLVEGEEVLLAMRSKGFEPTPFIYGAKVKALCRAGKLKEAVSVINKEMMQG 363

Query: 363 SCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTE 422
            C+PTV +YN+++ GLC  GKS  A+ +LKKM KQV  VA++ETY TLV GLCR+ ++ E
Sbjct: 364 HCLPTVGVYNVLIKGLCDDGKSMEAVGYLKKMSKQVSCVANEETYQTLVDGLCRDGQFLE 423

Query: 423 ACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVS 482
           A +++EEM+IKSH+P   T++ + +GLC + + Y+AVM LEEM+SQ  +PE SVW AL  
Sbjct: 424 ASQVMEEMLIKSHFPGVETYHMMIKGLCDMDRRYEAVMWLEEMVSQDMVPESSVWKALAE 483

Query: 483 SLCFNVAD 491
           S+CF   D
Sbjct: 484 SVCFCAID 491

BLAST of CmoCh08G001590.1 vs. Swiss-Prot
Match: PP120_ARATH (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 175.6 bits (444), Expect = 1.3e-42
Identity = 115/476 (24.16%), Postives = 224/476 (47.06%), Query Frame = 1

Query: 9   LTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVI 68
           L P H++ +I+ Q +P  A ++FN  + +   ++H    Y ++I  LG  G+   M EV+
Sbjct: 5   LLPKHVTAVIKCQKDPMKALEMFNSMR-KEVGFKHTLSTYRSVIEKLGYYGKFEAMEEVL 64

Query: 69  DQMKVD-SCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLLEILLN 128
             M+ +      + ++  A+K Y   G ++E +++F+ +  ++C     ++N ++ +L++
Sbjct: 65  VDMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSVLVD 124

Query: 129 ESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRL 188
               D A +++ +      +     S  + M+S C+  +   AL +   M  Q C  N +
Sbjct: 125 SGYFDQAHKVYMRMR-DRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVV 184

Query: 189 SYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEI 248
           +Y  ++ G  ++    E     Y +F ++   G    +  +  LL  LC  G++++  ++
Sbjct: 185 AYCTVVGGFYEENFKAEG----YELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKL 244

Query: 249 LGKILKKGLKAPKRAHYLIDLNYCRISKL--TVTEIKCLINEALIKGGIPSSDSYCAMAI 308
           L K++K+G+      + L     C+  +L   V  + CLI +    G  P   +Y  +  
Sbjct: 245 LDKVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQ----GPKPDVITYNNLIY 304

Query: 309 DLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCV 368
            L   ++  + +  +  M+ +G  P S  Y    A  CK G V  A +++ +    G  V
Sbjct: 305 GLCKNSKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNG-FV 364

Query: 369 PTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACK 428
           P    Y  +++GLC  G++  A+    + + + G+  +   Y+TL+ GL  +    EA +
Sbjct: 365 PDQFTYRSLIDGLCHEGETNRALALFNEALGK-GIKPNVILYNTLIKGLSNQGMILEAAQ 424

Query: 429 LLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALV 482
           L  EM  K   P   TFN L  GLC +G    A   ++ MIS+G  P++  +N L+
Sbjct: 425 LANEMSEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILI 468

BLAST of CmoCh08G001590.1 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 174.5 bits (441), Expect = 3.0e-42
Identity = 119/459 (25.93%), Postives = 226/459 (49.24%), Query Frame = 1

Query: 48  YAAMINIL--GNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKS 107
           Y  M+N+L  GNS ++ E+     +M V   +   S F+  IK       L   I + + 
Sbjct: 157 YNRMLNLLVDGNSLKLVEISHA--KMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLED 216

Query: 108 LGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQS-SFGWEVKSRTQSLNLLMQSLCQR 167
           +  +      +TF T+++  + E  LD A ++ +Q   FG    +   S+N+++   C+ 
Sbjct: 217 MPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSWSN--VSVNVIVHGFCKE 276

Query: 168 GQSELALHVFKEMDYQS-CYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGG 227
           G+ E AL+  +EM  Q   +P++ ++  L+ GLC+ G +  AI ++  M     + G   
Sbjct: 277 GRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVML----QEGYDP 336

Query: 228 DIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKC 287
           D+  Y +++  LC  GE+++AVE+L +++ +        +  +    C+ ++  V E   
Sbjct: 337 DVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQ--VEEATE 396

Query: 288 LINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALC 347
           L      KG +P   ++ ++   L          ++   M +KG  P    Y     +LC
Sbjct: 397 LARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLC 456

Query: 348 KEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVAD 407
            +GK+D+A+ ++++  + G C  +V  YN +++G C+  K+  A E   +M    G+  +
Sbjct: 457 SKGKLDEALNMLKQMELSG-CARSVITYNTLIDGFCKANKTREAEEIFDEMEVH-GVSRN 516

Query: 408 KETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLE 467
             TY+TL+ GLC+  R  +A +L+++M+++   P   T+N+L    C  G   KA   ++
Sbjct: 517 SVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQ 576

Query: 468 EMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQIR 503
            M S G  P++  +  L+S LC      ++ SK+LR I+
Sbjct: 577 AMTSNGCEPDIVTYGTLISGLC-KAGRVEVASKLLRSIQ 602

BLAST of CmoCh08G001590.1 vs. Swiss-Prot
Match: PP327_ARATH (Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana GN=EMB1025 PE=3 SV=1)

HSP 1 Score: 172.2 bits (435), Expect = 1.5e-41
Identity = 108/444 (24.32%), Postives = 215/444 (48.42%), Query Frame = 1

Query: 49  AAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLF-KSLG 108
           ++MI    NSG    + +++ ++++++    +  F    + Y    L ++ + LF + + 
Sbjct: 81  SSMIESYANSGDFDSVEKLLSRIRLENRVIIERSFIVVFRAYGKAHLPDKAVDLFHRMVD 140

Query: 109 GFNCTDRTQTFNTLLEILLNESQLDAACQLFQ---QSSFGWEVKSRTQSLNLLMQSLCQR 168
            F C    ++FN++L +++NE       + +     S+    +     S NL++++LC+ 
Sbjct: 141 EFRCKRSVKSFNSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKL 200

Query: 169 GQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGD 228
              + A+ VF+ M  + C P+  +Y  LM GLC++ ++ EA+ LL  M       G    
Sbjct: 201 RFVDRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEM----QSEGCSPS 260

Query: 229 IVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCL 288
            VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C   KL   +   L
Sbjct: 261 PVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKL--DKAVSL 320

Query: 289 INEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCK 348
           +   +    IP+  +Y  +   L  +       +++S M  +G+     +Y    + L K
Sbjct: 321 LERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFK 380

Query: 349 EGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADK 408
           EGK ++A+ +  +   KG C P + +Y+++++GLCR GK   A E L +M+   G + + 
Sbjct: 381 EGKAEEAMSLWRKMAEKG-CKPNIVVYSVLVDGLCREGKPNEAKEILNRMIAS-GCLPNA 440

Query: 409 ETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNT---FNTLTRGLCSVGKPYKAVMC 468
            TYS+L+ G  +     EA ++ +EM       CS     ++ L  GLC VG+  +A+M 
Sbjct: 441 YTYSSLMKGFFKTGLCEEAVQVWKEM---DKTGCSRNKFCYSVLIDGLCGVGRVKEAMMV 500

Query: 469 LEEMISQGQLPELSVWNALVSSLC 486
             +M++ G  P+   +++++  LC
Sbjct: 501 WSKMLTIGIKPDTVAYSSIIKGLC 513

BLAST of CmoCh08G001590.1 vs. Swiss-Prot
Match: PP388_ARATH (Pentatricopeptide repeat-containing protein At5g16420, mitochondrial OS=Arabidopsis thaliana GN=At5g16420 PE=2 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 7.6e-38
Identity = 113/473 (23.89%), Postives = 213/473 (45.03%), Query Frame = 1

Query: 5   WPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEM 64
           WP+ L P  L  +I +Q N   A Q+F  A   +P + HN   Y +++  L  +     +
Sbjct: 43  WPQRLFPKRLVSMITQQQNIDLALQIFLYAGKSHPGFTHNYDTYHSILFKLSRARAFDPV 102

Query: 65  REVIDQMK--VDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLL 124
             ++  ++      +C +++F   ++ Y   G  E  + +F  +  F      ++ NTLL
Sbjct: 103 ESLMADLRNSYPPIKCGENLFIDLLRNYGLAGRYESSMRIFLRIPDFGVKRSVRSLNTLL 162

Query: 125 EILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC 184
            +L+   + D    +F+ S   + +     + NLL+++LC++   E A  V  E+     
Sbjct: 163 NVLIQNQRFDLVHAMFKNSKESFGITPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGL 222

Query: 185 YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIE 244
            PN ++Y  ++ G    G +  A  +L  M      RG   D   Y  L+   C  G   
Sbjct: 223 VPNLVTYTTILGGYVARGDMESAKRVLEEML----DRGWYPDATTYTVLMDGYCKLGRFS 282

Query: 245 QAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA 304
           +A  ++  + K  ++  +  + ++    C+  K    E + + +E L +  +P S   C 
Sbjct: 283 EAATVMDDMEKNEIEPNEVTYGVMIRALCKEKKSG--EARNMFDEMLERSFMPDSSLCCK 342

Query: 305 MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKG 364
           +   L  +++ D+   +   ML     P +++       LCKEG+V +A K+  +E  KG
Sbjct: 343 VIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLF-DEFEKG 402

Query: 365 SCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTE 424
           S +P++  YN ++ G+C  G+ T A      M ++     +  TY+ L+ GL +     E
Sbjct: 403 S-IPSLLTYNTLIAGMCEKGELTEAGRLWDDMYER-KCKPNAFTYNVLIEGLSKNGNVKE 462

Query: 425 ACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELS 476
             ++LEEM+    +P   TF  L  GL  +GK   A+  +   +  G++ + S
Sbjct: 463 GVRVLEEMLEIGCFPNKTTFLILFEGLQKLGKEEDAMKIVSMAVMNGKVDKES 506

BLAST of CmoCh08G001590.1 vs. TrEMBL
Match: A0A0A0K9Q4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G095860 PE=4 SV=1)

HSP 1 Score: 854.7 bits (2207), Expect = 5.5e-245
Identity = 420/498 (84.34%), Postives = 458/498 (91.97%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           M++RWPR+LTPT LSQIIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYA MINILGNSGR
Sbjct: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           +SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GISLFKS G FNCT+RTQTFNT
Sbjct: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           LLEILL ESQL AACQLFQ+ S+GW VKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQ
Sbjct: 121 LLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSYLI+MKGLCQDG+L+EAIHLLYSMFWRISR+G GGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           IEQAVEILGKIL+KGLKAPKRAHY IDL+ CR S LT+ EIK LINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
           CAMA+DLYNEN+TDQGDKVVSHM+AKGF PPS +YEAKAA+LCKEGKVDDAVKVIEE+ V
Sbjct: 301 CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
            G CVPT+ALYNIVL GLC  GKSTVAME+LKKM KQVGLVA+KETYSTLVHGLC ENRY
Sbjct: 361 -GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRY 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
            EACK+LEEMVIKS  PCSNTFNTL +GLCSVGK Y+AVM LEEMISQGQLP + VWN+L
Sbjct: 421 IEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSL 480

Query: 481 VSSLCFNVADTDMWSKVL 499
           VSSLC +VA  DM S+VL
Sbjct: 481 VSSLCCDVAGIDMCSRVL 497

BLAST of CmoCh08G001590.1 vs. TrEMBL
Match: V4VX73_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019806mg PE=4 SV=1)

HSP 1 Score: 652.1 bits (1681), Expect = 5.4e-184
Identity = 322/504 (63.89%), Postives = 399/504 (79.17%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           M+VRWPRLLTPT+LSQII+KQ +P TA ++F EAK +YPNY+HNGPVYA+MI IL  S R
Sbjct: 1   MSVRWPRLLTPTYLSQIIKKQKSPLTALKIFKEAKEKYPNYRHNGPVYASMIGILSESNR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           I+EM+EVIDQMK DSC+CKDS+F+ AI+TYA  G L E +SLFK+L  FNC + TQ+FNT
Sbjct: 61  ITEMKEVIDQMKGDSCECKDSVFATAIRTYARAGQLNEAVSLFKNLSQFNCVNWTQSFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           LL+ ++ ES+L+AA  LF +S +GWEVKSR QSLNLLM  LCQR +S+LALHVF+EMD+Q
Sbjct: 121 LLKEMVKESKLEAAHILFLRSCYGWEVKSRIQSLNLLMDVLCQRRRSDLALHVFQEMDFQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
            CYP+R SY ILMKGLC D +L+EA HLLYSMFWRIS++GSG DIVIYRTLLFALCD G+
Sbjct: 181 GCYPDRESYHILMKGLCNDRRLNEATHLLYSMFWRISQKGSGEDIVIYRTLLFALCDQGK 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           I+ A++IL KIL+KGLKAPK   + IDL  C   +  +   K LINEALI+GGIPS  SY
Sbjct: 241 IQDAMQILEKILRKGLKAPKSRRHRIDLCPCNDGE-DIEGAKSLINEALIRGGIPSLASY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
            AMAIDLYNE    +GDKV+  M  KGFWP   +YEAK AAL K+G VD+A++VIEEE V
Sbjct: 301 SAMAIDLYNEGRIVEGDKVLDEMRTKGFWPSLVMYEAKLAALFKDGMVDEALEVIEEEMV 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
           KG+ VPTV +YNI+L GLC  G S VA+ +LKKM KQVG VA+ ETY  LV GLCR+ R+
Sbjct: 361 KGTFVPTVRVYNILLKGLCDAGNSAVAVMYLKKMSKQVGCVANGETYGILVDGLCRDGRF 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
            EA ++LEEM+I+S+WPC  T+N L RGLCS+GK Y+AVM LEEMISQ +LP++SVW++L
Sbjct: 421 LEASRVLEEMLIRSYWPCVETYNVLIRGLCSIGKQYEAVMWLEEMISQAKLPDISVWSSL 480

Query: 481 VSSLCFNVADTDMWSKVLRQIRSC 505
           V+S+C N AD ++  K L Q+ SC
Sbjct: 481 VASVCCNTADLNVCRKTLEQLSSC 503

BLAST of CmoCh08G001590.1 vs. TrEMBL
Match: A0A067HFD7_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g043204mg PE=4 SV=1)

HSP 1 Score: 649.0 bits (1673), Expect = 4.6e-183
Identity = 321/504 (63.69%), Postives = 398/504 (78.97%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           M+VRWPRLLTPT+LSQII+KQ +P TA ++F EAK +YPNY+HNGPVYA+MI IL  S R
Sbjct: 1   MSVRWPRLLTPTYLSQIIKKQKSPLTALKIFKEAKEKYPNYRHNGPVYASMIGILSESNR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           I+EM+EVIDQMK DSC+CKDS+F+ AI+TYA  G L E +SLFK+L  FNC + TQ+FNT
Sbjct: 61  ITEMKEVIDQMKGDSCECKDSVFATAIRTYARAGQLNEAVSLFKNLSQFNCVNWTQSFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           LL+ ++ ES+L+AA  LF +S +GWEVKSR QSLNLLM  LCQ  +S+LALHVF+EMD+Q
Sbjct: 121 LLKEMVKESKLEAAHILFLRSCYGWEVKSRIQSLNLLMDVLCQCRRSDLALHVFQEMDFQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
            CYP+R SY ILMKGLC D +L+EA HLLYSMFWRIS++GSG DIVIYRTLLFALCD G+
Sbjct: 181 GCYPDRESYHILMKGLCNDRRLNEATHLLYSMFWRISQKGSGEDIVIYRTLLFALCDQGK 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           I+ A++IL KIL+KGLKAPK   + IDL  C   +  +   K LINEALI+GGIPS  SY
Sbjct: 241 IQDAMQILEKILRKGLKAPKSRRHRIDLCPCNDGE-DIEGAKSLINEALIRGGIPSLASY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
            AMAIDLYNE    +GDKV+  M  KGFWP   +YEAK AAL K+G VD+A++VIEEE V
Sbjct: 301 SAMAIDLYNEGRIVEGDKVLDEMRTKGFWPSLVMYEAKLAALFKDGMVDEALEVIEEEMV 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
           KG+ VPTV +YNI+L GLC  G S VA+ +LKKM KQVG VA+ ETY  LV GLCR+ R+
Sbjct: 361 KGTFVPTVRVYNILLKGLCDAGNSAVAVMYLKKMSKQVGCVANGETYGILVDGLCRDGRF 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
            EA ++LEEM+I+S+WPC  T+N L RGLCS+GK Y+AVM LEEMISQ +LP++SVW++L
Sbjct: 421 LEASRVLEEMLIRSYWPCVETYNVLIRGLCSIGKQYEAVMWLEEMISQAKLPDISVWSSL 480

Query: 481 VSSLCFNVADTDMWSKVLRQIRSC 505
           V+S+C N AD ++  K L Q+ SC
Sbjct: 481 VASVCCNTADLNVCRKTLEQLSSC 503

BLAST of CmoCh08G001590.1 vs. TrEMBL
Match: M5XFE6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004557mg PE=4 SV=1)

HSP 1 Score: 648.7 bits (1672), Expect = 6.0e-183
Identity = 313/503 (62.23%), Postives = 409/503 (81.31%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           M++RWPRLLTPTHLSQIIRKQ NP TA Q+F+EAKC+YPNY+HNGPVYA MI+ILGNSGR
Sbjct: 1   MSIRWPRLLTPTHLSQIIRKQKNPLTALQIFSEAKCKYPNYRHNGPVYANMISILGNSGR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           I+EM+EVI++MK DSC+CKDS+F   IKTYA  GLL+E +SLFK++  FNC + TQ+FNT
Sbjct: 61  INEMKEVINEMKNDSCECKDSVFVSVIKTYARAGLLDEAVSLFKNISQFNCVNWTQSFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           LLEI++ ES+L+AA ++F +   GWEV SR  SLNLLM +LCQ+G+S++AL VF+EMDYQ
Sbjct: 121 LLEIMVKESKLEAAHRIFMEHCCGWEVSSRVPSLNLLMLALCQKGRSDIALQVFQEMDYQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
           SC P+R SY ILM+GLC+D +L+EA HLLYSMFWRIS++G G D+VIYRTLL ALCDNG+
Sbjct: 181 SCNPDRESYRILMRGLCEDKRLNEATHLLYSMFWRISQKGCGEDVVIYRTLLDALCDNGQ 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           +E AVEILGKIL+KGLKAPKR  + +DL++    + T   IK LINEAL++GGIPS  SY
Sbjct: 241 VEDAVEILGKILRKGLKAPKRFRHNLDLSHYGNGEDT-EGIKRLINEALVRGGIPSLASY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
            AMAIDLY+EN+  + D+V+  M  +GF P + V+EAKAAALC+E KV +AV+VIE+E V
Sbjct: 301 SAMAIDLYDENKVGEADRVLKEMQDRGFRPTALVFEAKAAALCRERKVVEAVEVIEKEMV 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
           + +CVPTV +Y++V+ GLC  G+S +A+ +LKKM KQVG VADK+TY  LV GLC E+R+
Sbjct: 361 EANCVPTVRVYSVVVRGLCSEGQSVLAILYLKKMEKQVGCVADKKTYGILVDGLCGESRF 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
            EA ++L+EM+IKSHWPC+ T+N +  GLCSVGK Y+AVM LEEM S+  LPE SVW++L
Sbjct: 421 LEASRVLQEMLIKSHWPCAETYNRVITGLCSVGKQYEAVMWLEEMTSRAMLPEYSVWSSL 480

Query: 481 VSSLCFNVADTDMWSKVLRQIRS 504
           V+S+C N+A+ ++ S   ++++S
Sbjct: 481 VASVCCNMANIEVCSDAYKRLKS 502

BLAST of CmoCh08G001590.1 vs. TrEMBL
Match: B9RNK1_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1339630 PE=4 SV=1)

HSP 1 Score: 635.6 bits (1638), Expect = 5.2e-179
Identity = 310/504 (61.51%), Postives = 389/504 (77.18%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           MTVRWPRLLTPTHLSQIIR Q NP  A ++F EAK +YPNY+HNGPVYA MI ILG+SGR
Sbjct: 1   MTVRWPRLLTPTHLSQIIRNQKNPLIALRIFKEAKDKYPNYRHNGPVYATMIGILGSSGR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           I+EM+EV+DQM+ DSC+CKDSIF+ AIKTYA  GLL E ISLFK++  FNC + T++FNT
Sbjct: 61  ITEMKEVLDQMREDSCECKDSIFANAIKTYARVGLLNEAISLFKNIPQFNCVNWTESFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           LL+I++ ES+L+AA +LF +SS+GWEVKSR +SLNLLM  LCQ  +S++AL VF+EM+YQ
Sbjct: 121 LLQIMVKESKLEAAHRLFLESSYGWEVKSRVRSLNLLMDVLCQHNRSDVALQVFQEMNYQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
            CYP+R SY I+M GLC+DG+L+EA HLLYSMFWRIS++GSG DIVIYR  L ALCD G 
Sbjct: 181 GCYPDRDSYRIVMMGLCKDGRLNEATHLLYSMFWRISQKGSGEDIVIYRIFLDALCDIGM 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           +EQA+E+LGKIL+KGLKAPKR H  +DL+ C  S   +   K LINEALI+G IPS  SY
Sbjct: 241 VEQALEVLGKILRKGLKAPKRCHPRLDLSNCN-SDGNIETTKHLINEALIRGAIPSLSSY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
            AMA+D Y E +  Q DKV+     +GF P    YEAK AALCKEGKV +A+ V+E E V
Sbjct: 301 TAMAVDFYAEGKLSQADKVLDETQDRGFRPSLLTYEAKVAALCKEGKVHEAINVLEVEMV 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
           +G+CVP V LYNI+L GLC    S  A+++LK+M KQ G VA+KETY  LVHGLC++  +
Sbjct: 361 EGNCVPNVRLYNILLKGLCDARNSATAVKYLKRMAKQTGCVANKETYCILVHGLCQDGGF 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
            EA ++LEEM+IKS+WP  +TFN L RGLCS+G+ Y+A M LEEMIS  + PELSVWN+L
Sbjct: 421 IEASRILEEMLIKSYWPPVDTFNMLIRGLCSIGRQYEATMWLEEMISLDEAPELSVWNSL 480

Query: 481 VSSLCFNVADTDMWSKVLRQIRSC 505
           V+ LC N AD D   +  +++ +C
Sbjct: 481 VTCLCSNTADIDACCETFKRLSNC 503

BLAST of CmoCh08G001590.1 vs. TAIR10
Match: AT1G05600.1 (AT1G05600.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 558.9 bits (1439), Expect = 3.2e-159
Identity = 269/488 (55.12%), Postives = 365/488 (74.80%), Query Frame = 1

Query: 3   VRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRIS 62
           VRWPR+LTP+ LSQI++KQ NP TA +LF EAK R+P+Y HNG VYA MI+ILG S R+ 
Sbjct: 4   VRWPRVLTPSLLSQILKKQKNPVTALKLFEEAKERFPSYGHNGSVYATMIDILGKSNRVL 63

Query: 63  EMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLL 122
           EM+ VI++MK DSC+CKDS+F+  I+T++  G LE+ ISLFKSL  FNC + + +F+TLL
Sbjct: 64  EMKYVIERMKEDSCECKDSVFASVIRTFSRAGRLEDAISLFKSLHEFNCVNWSLSFDTLL 123

Query: 123 EILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC 182
           + ++ ES+L+AAC +F++  +GWEV SR  +LNLLM+ LCQ  +S+LA  VF+EM+YQ C
Sbjct: 124 QEMVKESELEAACHIFRKYCYGWEVNSRITALNLLMKVLCQVNRSDLASQVFQEMNYQGC 183

Query: 183 YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIE 242
           YP+R SY ILMKG C +GKL EA HLLYSMFWRIS++GSG DIV+YR LL ALCD GE++
Sbjct: 184 YPDRDSYRILMKGFCLEGKLEEATHLLYSMFWRISQKGSGEDIVVYRILLDALCDAGEVD 243

Query: 243 QAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA 302
            A+EILGKIL+KGLKAPKR ++ I+  +   S   +  +K L+ E LI+G IP  DSY A
Sbjct: 244 DAIEILGKILRKGLKAPKRCYHHIEAGHWESSSEGIERVKRLLTETLIRGAIPCLDSYSA 303

Query: 303 MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKG 362
           MA DL+ E +  +G++V+  M +KGF P   +Y AK  ALC+ GK+ +AV VI +E ++G
Sbjct: 304 MATDLFEEGKLVEGEEVLLAMRSKGFEPTPFIYGAKVKALCRAGKLKEAVSVINKEMMQG 363

Query: 363 SCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTE 422
            C+PTV +YN+++ GLC  GKS  A+ +LKKM KQV  VA++ETY TLV GLCR+ ++ E
Sbjct: 364 HCLPTVGVYNVLIKGLCDDGKSMEAVGYLKKMSKQVSCVANEETYQTLVDGLCRDGQFLE 423

Query: 423 ACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALVS 482
           A +++EEM+IKSH+P   T++ + +GLC + + Y+AVM LEEM+SQ  +PE SVW AL  
Sbjct: 424 ASQVMEEMLIKSHFPGVETYHMMIKGLCDMDRRYEAVMWLEEMVSQDMVPESSVWKALAE 483

Query: 483 SLCFNVAD 491
           S+CF   D
Sbjct: 484 SVCFCAID 491

BLAST of CmoCh08G001590.1 vs. TAIR10
Match: AT1G74580.1 (AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 175.6 bits (444), Expect = 7.5e-44
Identity = 115/476 (24.16%), Postives = 224/476 (47.06%), Query Frame = 1

Query: 9   LTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEMREVI 68
           L P H++ +I+ Q +P  A ++FN  + +   ++H    Y ++I  LG  G+   M EV+
Sbjct: 5   LLPKHVTAVIKCQKDPMKALEMFNSMR-KEVGFKHTLSTYRSVIEKLGYYGKFEAMEEVL 64

Query: 69  DQMKVD-SCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLLEILLN 128
             M+ +      + ++  A+K Y   G ++E +++F+ +  ++C     ++N ++ +L++
Sbjct: 65  VDMRENVGNHMLEGVYVGAMKNYGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSVLVD 124

Query: 129 ESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSCYPNRL 188
               D A +++ +      +     S  + M+S C+  +   AL +   M  Q C  N +
Sbjct: 125 SGYFDQAHKVYMRMR-DRGITPDVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVV 184

Query: 189 SYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIEQAVEI 248
           +Y  ++ G  ++    E     Y +F ++   G    +  +  LL  LC  G++++  ++
Sbjct: 185 AYCTVVGGFYEENFKAEG----YELFGKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKL 244

Query: 249 LGKILKKGLKAPKRAHYLIDLNYCRISKL--TVTEIKCLINEALIKGGIPSSDSYCAMAI 308
           L K++K+G+      + L     C+  +L   V  + CLI +    G  P   +Y  +  
Sbjct: 245 LDKVIKRGVLPNLFTYNLFIQGLCQRGELDGAVRMVGCLIEQ----GPKPDVITYNNLIY 304

Query: 309 DLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKGSCV 368
            L   ++  + +  +  M+ +G  P S  Y    A  CK G V  A +++ +    G  V
Sbjct: 305 GLCKNSKFQEAEVYLGKMVNEGLEPDSYTYNTLIAGYCKGGMVQLAERIVGDAVFNG-FV 364

Query: 369 PTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTEACK 428
           P    Y  +++GLC  G++  A+    + + + G+  +   Y+TL+ GL  +    EA +
Sbjct: 365 PDQFTYRSLIDGLCHEGETNRALALFNEALGK-GIKPNVILYNTLIKGLSNQGMILEAAQ 424

Query: 429 LLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNALV 482
           L  EM  K   P   TFN L  GLC +G    A   ++ MIS+G  P++  +N L+
Sbjct: 425 LANEMSEKGLIPEVQTFNILVNGLCKMGCVSDADGLVKVMISKGYFPDIFTFNILI 468

BLAST of CmoCh08G001590.1 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 174.5 bits (441), Expect = 1.7e-43
Identity = 119/459 (25.93%), Postives = 226/459 (49.24%), Query Frame = 1

Query: 48  YAAMINIL--GNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKS 107
           Y  M+N+L  GNS ++ E+     +M V   +   S F+  IK       L   I + + 
Sbjct: 157 YNRMLNLLVDGNSLKLVEISHA--KMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLED 216

Query: 108 LGGFNCTDRTQTFNTLLEILLNESQLDAACQLFQQS-SFGWEVKSRTQSLNLLMQSLCQR 167
           +  +      +TF T+++  + E  LD A ++ +Q   FG    +   S+N+++   C+ 
Sbjct: 217 MPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSWSN--VSVNVIVHGFCKE 276

Query: 168 GQSELALHVFKEMDYQS-CYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGG 227
           G+ E AL+  +EM  Q   +P++ ++  L+ GLC+ G +  AI ++  M     + G   
Sbjct: 277 GRVEDALNFIQEMSNQDGFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVML----QEGYDP 336

Query: 228 DIVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKC 287
           D+  Y +++  LC  GE+++AVE+L +++ +        +  +    C+ ++  V E   
Sbjct: 337 DVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQ--VEEATE 396

Query: 288 LINEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALC 347
           L      KG +P   ++ ++   L          ++   M +KG  P    Y     +LC
Sbjct: 397 LARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLC 456

Query: 348 KEGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVAD 407
            +GK+D+A+ ++++  + G C  +V  YN +++G C+  K+  A E   +M    G+  +
Sbjct: 457 SKGKLDEALNMLKQMELSG-CARSVITYNTLIDGFCKANKTREAEEIFDEMEVH-GVSRN 516

Query: 408 KETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLE 467
             TY+TL+ GLC+  R  +A +L+++M+++   P   T+N+L    C  G   KA   ++
Sbjct: 517 SVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQ 576

Query: 468 EMISQGQLPELSVWNALVSSLCFNVADTDMWSKVLRQIR 503
            M S G  P++  +  L+S LC      ++ SK+LR I+
Sbjct: 577 AMTSNGCEPDIVTYGTLISGLC-KAGRVEVASKLLRSIQ 602

BLAST of CmoCh08G001590.1 vs. TAIR10
Match: AT4G20090.1 (AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 172.2 bits (435), Expect = 8.3e-43
Identity = 108/444 (24.32%), Postives = 215/444 (48.42%), Query Frame = 1

Query: 49  AAMINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLF-KSLG 108
           ++MI    NSG    + +++ ++++++    +  F    + Y    L ++ + LF + + 
Sbjct: 81  SSMIESYANSGDFDSVEKLLSRIRLENRVIIERSFIVVFRAYGKAHLPDKAVDLFHRMVD 140

Query: 109 GFNCTDRTQTFNTLLEILLNESQLDAACQLFQ---QSSFGWEVKSRTQSLNLLMQSLCQR 168
            F C    ++FN++L +++NE       + +     S+    +     S NL++++LC+ 
Sbjct: 141 EFRCKRSVKSFNSVLNVIINEGLYHRGLEFYDYVVNSNMNMNISPNGLSFNLVIKALCKL 200

Query: 169 GQSELALHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGD 228
              + A+ VF+ M  + C P+  +Y  LM GLC++ ++ EA+ LL  M       G    
Sbjct: 201 RFVDRAIEVFRGMPERKCLPDGYTYCTLMDGLCKEERIDEAVLLLDEM----QSEGCSPS 260

Query: 229 IVIYRTLLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCL 288
            VIY  L+  LC  G++ +  +++  +  KG    +  +  +    C   KL   +   L
Sbjct: 261 PVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVPNEVTYNTLIHGLCLKGKL--DKAVSL 320

Query: 289 INEALIKGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCK 348
           +   +    IP+  +Y  +   L  +       +++S M  +G+     +Y    + L K
Sbjct: 321 LERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERGYHLNQHIYSVLISGLFK 380

Query: 349 EGKVDDAVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADK 408
           EGK ++A+ +  +   KG C P + +Y+++++GLCR GK   A E L +M+   G + + 
Sbjct: 381 EGKAEEAMSLWRKMAEKG-CKPNIVVYSVLVDGLCREGKPNEAKEILNRMIAS-GCLPNA 440

Query: 409 ETYSTLVHGLCRENRYTEACKLLEEMVIKSHWPCSNT---FNTLTRGLCSVGKPYKAVMC 468
            TYS+L+ G  +     EA ++ +EM       CS     ++ L  GLC VG+  +A+M 
Sbjct: 441 YTYSSLMKGFFKTGLCEEAVQVWKEM---DKTGCSRNKFCYSVLIDGLCGVGRVKEAMMV 500

Query: 469 LEEMISQGQLPELSVWNALVSSLC 486
             +M++ G  P+   +++++  LC
Sbjct: 501 WSKMLTIGIKPDTVAYSSIIKGLC 513

BLAST of CmoCh08G001590.1 vs. TAIR10
Match: AT5G16420.1 (AT5G16420.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 159.8 bits (403), Expect = 4.3e-39
Identity = 113/473 (23.89%), Postives = 213/473 (45.03%), Query Frame = 1

Query: 5   WPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGRISEM 64
           WP+ L P  L  +I +Q N   A Q+F  A   +P + HN   Y +++  L  +     +
Sbjct: 43  WPQRLFPKRLVSMITQQQNIDLALQIFLYAGKSHPGFTHNYDTYHSILFKLSRARAFDPV 102

Query: 65  REVIDQMK--VDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNTLL 124
             ++  ++      +C +++F   ++ Y   G  E  + +F  +  F      ++ NTLL
Sbjct: 103 ESLMADLRNSYPPIKCGENLFIDLLRNYGLAGRYESSMRIFLRIPDFGVKRSVRSLNTLL 162

Query: 125 EILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQSC 184
            +L+   + D    +F+ S   + +     + NLL+++LC++   E A  V  E+     
Sbjct: 163 NVLIQNQRFDLVHAMFKNSKESFGITPNIFTCNLLVKALCKKNDIESAYKVLDEIPSMGL 222

Query: 185 YPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGEIE 244
            PN ++Y  ++ G    G +  A  +L  M      RG   D   Y  L+   C  G   
Sbjct: 223 VPNLVTYTTILGGYVARGDMESAKRVLEEML----DRGWYPDATTYTVLMDGYCKLGRFS 282

Query: 245 QAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSYCA 304
           +A  ++  + K  ++  +  + ++    C+  K    E + + +E L +  +P S   C 
Sbjct: 283 EAATVMDDMEKNEIEPNEVTYGVMIRALCKEKKSG--EARNMFDEMLERSFMPDSSLCCK 342

Query: 305 MAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETVKG 364
           +   L  +++ D+   +   ML     P +++       LCKEG+V +A K+  +E  KG
Sbjct: 343 VIDALCEDHKVDEACGLWRKMLKNNCMPDNALLSTLIHWLCKEGRVTEARKLF-DEFEKG 402

Query: 365 SCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRYTE 424
           S +P++  YN ++ G+C  G+ T A      M ++     +  TY+ L+ GL +     E
Sbjct: 403 S-IPSLLTYNTLIAGMCEKGELTEAGRLWDDMYER-KCKPNAFTYNVLIEGLSKNGNVKE 462

Query: 425 ACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELS 476
             ++LEEM+    +P   TF  L  GL  +GK   A+  +   +  G++ + S
Sbjct: 463 GVRVLEEMLEIGCFPNKTTFLILFEGLQKLGKEEDAMKIVSMAVMNGKVDKES 506

BLAST of CmoCh08G001590.1 vs. NCBI nr
Match: gi|659119763|ref|XP_008459831.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis melo])

HSP 1 Score: 882.5 bits (2279), Expect = 3.5e-253
Identity = 431/498 (86.55%), Postives = 469/498 (94.18%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           MTVRWPR+LTPT+LSQIIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYAAMINILGNSGR
Sbjct: 1   MTVRWPRILTPTYLSQIIRKQNNPLTAYQLFKEAKCRYPDYRHNGPVYAAMINILGNSGR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           +SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GISLFKSLG FNCT+RTQTFNT
Sbjct: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFNCTNRTQTFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           LLEILLNESQL AACQLFQ+ S+GWEVKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQ
Sbjct: 121 LLEILLNESQLHAACQLFQECSYGWEVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSYLI+MKGLCQDG+L+EAIHLLYSMFWRISR+GSGGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGSGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           IEQAVEILGKIL+KGLKAPKRAHY IDL+ CR +KLT+ EIK LINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALIKGGIPSSDSY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
           CAMA+DLYNEN+TDQGDKVVSHM+AKGF PPSS+YEAK A+LCKEGKVDDAVKVIEE+ V
Sbjct: 301 CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSSIYEAKVASLCKEGKVDDAVKVIEEQIV 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
            GSCVPT+ALYNIVL GLC  GKSTVAME+LKKM K+VGLVA+KETYSTLVHGLCRENRY
Sbjct: 361 -GSCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKKVGLVANKETYSTLVHGLCRENRY 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
           TEACK+LEEMVIKS WPCSNTFNTL +GLCSVGK Y+AVM LEEMISQGQLP + VWN+L
Sbjct: 421 TEACKVLEEMVIKSFWPCSNTFNTLIKGLCSVGKQYEAVMWLEEMISQGQLPHVCVWNSL 480

Query: 481 VSSLCFNVADTDMWSKVL 499
           VSSLC +VA  DM SKVL
Sbjct: 481 VSSLCCDVAGIDMCSKVL 497

BLAST of CmoCh08G001590.1 vs. NCBI nr
Match: gi|449445756|ref|XP_004140638.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cucumis sativus])

HSP 1 Score: 854.7 bits (2207), Expect = 7.9e-245
Identity = 420/498 (84.34%), Postives = 458/498 (91.97%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           M++RWPR+LTPT LSQIIRKQNNP TAYQLF EAKCRYP+Y+HNGPVYA MINILGNSGR
Sbjct: 1   MSIRWPRILTPTCLSQIIRKQNNPQTAYQLFKEAKCRYPDYRHNGPVYATMINILGNSGR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           +SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GISLFKS G FNCT+RTQTFNT
Sbjct: 61  VSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFNCTNRTQTFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           LLEILL ESQL AACQLFQ+ S+GW VKSRTQSLNLLMQSLCQRGQSELALHVF+EMDYQ
Sbjct: 121 LLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELALHVFQEMDYQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
           SCYPNRLSYLI+MKGLCQDG+L+EAIHLLYSMFWRISR+G GGDIVIYRTLLFALCDNGE
Sbjct: 181 SCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRTLLFALCDNGE 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           IEQAVEILGKIL+KGLKAPKRAHY IDL+ CR S LT+ EIK LINEALIKGGIPSSDSY
Sbjct: 241 IEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALIKGGIPSSDSY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
           CAMA+DLYNEN+TDQGDKVVSHM+AKGF PPS +YEAKAA+LCKEGKVDDAVKVIEE+ V
Sbjct: 301 CAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDDAVKVIEEQIV 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
            G CVPT+ALYNIVL GLC  GKSTVAME+LKKM KQVGLVA+KETYSTLVHGLC ENRY
Sbjct: 361 -GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTLVHGLCLENRY 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
            EACK+LEEMVIKS  PCSNTFNTL +GLCSVGK Y+AVM LEEMISQGQLP + VWN+L
Sbjct: 421 IEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQLPHVCVWNSL 480

Query: 481 VSSLCFNVADTDMWSKVL 499
           VSSLC +VA  DM S+VL
Sbjct: 481 VSSLCCDVAGIDMCSRVL 497

BLAST of CmoCh08G001590.1 vs. NCBI nr
Match: gi|659119766|ref|XP_008459832.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis melo])

HSP 1 Score: 786.2 bits (2029), Expect = 3.4e-224
Identity = 387/448 (86.38%), Postives = 421/448 (93.97%), Query Frame = 1

Query: 51  MINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFN 110
           MINILGNSGR+SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GISLFKSLG FN
Sbjct: 1   MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSLGRFN 60

Query: 111 CTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELA 170
           CT+RTQTFNTLLEILLNESQL AACQLFQ+ S+GWEVKSRTQSLNLLMQSLCQRGQSELA
Sbjct: 61  CTNRTQTFNTLLEILLNESQLHAACQLFQECSYGWEVKSRTQSLNLLMQSLCQRGQSELA 120

Query: 171 LHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRT 230
           LHVF+EMDYQSCYPNRLSYLI+MKGLCQDG+L+EAIHLLYSMFWRISR+GSGGDIVIYRT
Sbjct: 121 LHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGSGGDIVIYRT 180

Query: 231 LLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALI 290
           LLFALCDNGEIEQAVEILGKIL+KGLKAPKRAHY IDL+ CR +KLT+ EIK LINEALI
Sbjct: 181 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNNKLTIEEIKSLINEALI 240

Query: 291 KGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDD 350
           KGGIPSSDSYCAMA+DLYNEN+TDQGDKVVSHM+AKGF PPSS+YEAK A+LCKEGKVDD
Sbjct: 241 KGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSSIYEAKVASLCKEGKVDD 300

Query: 351 AVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTL 410
           AVKVIEE+ V GSCVPT+ALYNIVL GLC  GKSTVAME+LKKM K+VGLVA+KETYSTL
Sbjct: 301 AVKVIEEQIV-GSCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKKVGLVANKETYSTL 360

Query: 411 VHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQ 470
           VHGLCRENRYTEACK+LEEMVIKS WPCSNTFNTL +GLCSVGK Y+AVM LEEMISQGQ
Sbjct: 361 VHGLCRENRYTEACKVLEEMVIKSFWPCSNTFNTLIKGLCSVGKQYEAVMWLEEMISQGQ 420

Query: 471 LPELSVWNALVSSLCFNVADTDMWSKVL 499
           LP + VWN+LVSSLC +VA  DM SKVL
Sbjct: 421 LPHVCVWNSLVSSLCCDVAGIDMCSKVL 447

BLAST of CmoCh08G001590.1 vs. NCBI nr
Match: gi|778711932|ref|XP_011656819.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cucumis sativus])

HSP 1 Score: 765.0 bits (1974), Expect = 8.2e-218
Identity = 379/448 (84.60%), Postives = 412/448 (91.96%), Query Frame = 1

Query: 51  MINILGNSGRISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFN 110
           MINILGNSGR+SEMREV+DQM+ DSC+CKDS+FSFAIKTYASHGLLE+GISLFKS G FN
Sbjct: 1   MINILGNSGRVSEMREVMDQMRDDSCECKDSVFSFAIKTYASHGLLEDGISLFKSFGRFN 60

Query: 111 CTDRTQTFNTLLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELA 170
           CT+RTQTFNTLLEILL ESQL AACQLFQ+ S+GW VKSRTQSLNLLMQSLCQRGQSELA
Sbjct: 61  CTNRTQTFNTLLEILLKESQLHAACQLFQECSYGWGVKSRTQSLNLLMQSLCQRGQSELA 120

Query: 171 LHVFKEMDYQSCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRT 230
           LHVF+EMDYQSCYPNRLSYLI+MKGLCQDG+L+EAIHLLYSMFWRISR+G GGDIVIYRT
Sbjct: 121 LHVFQEMDYQSCYPNRLSYLIVMKGLCQDGRLNEAIHLLYSMFWRISRKGGGGDIVIYRT 180

Query: 231 LLFALCDNGEIEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALI 290
           LLFALCDNGEIEQAVEILGKIL+KGLKAPKRAHY IDL+ CR S LT+ EIK LINEALI
Sbjct: 181 LLFALCDNGEIEQAVEILGKILRKGLKAPKRAHYRIDLDQCRNSNLTIEEIKSLINEALI 240

Query: 291 KGGIPSSDSYCAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDD 350
           KGGIPSSDSYCAMA+DLYNEN+TDQGDKVVSHM+AKGF PPS +YEAKAA+LCKEGKVDD
Sbjct: 241 KGGIPSSDSYCAMAVDLYNENKTDQGDKVVSHMIAKGFRPPSLIYEAKAASLCKEGKVDD 300

Query: 351 AVKVIEEETVKGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTL 410
           AVKVIEE+ V G CVPT+ALYNIVL GLC  GKSTVAME+LKKM KQVGLVA+KETYSTL
Sbjct: 301 AVKVIEEQIV-GGCVPTIALYNIVLKGLCDDGKSTVAMEYLKKMAKQVGLVANKETYSTL 360

Query: 411 VHGLCRENRYTEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQ 470
           VHGLC ENRY EACK+LEEMVIKS  PCSNTFNTL +GLCSVGK Y+AVM LEEMISQGQ
Sbjct: 361 VHGLCLENRYIEACKVLEEMVIKSFCPCSNTFNTLIKGLCSVGKHYEAVMWLEEMISQGQ 420

Query: 471 LPELSVWNALVSSLCFNVADTDMWSKVL 499
           LP + VWN+LVSSLC +VA  DM S+VL
Sbjct: 421 LPHVCVWNSLVSSLCCDVAGIDMCSRVL 447

BLAST of CmoCh08G001590.1 vs. NCBI nr
Match: gi|1009142747|ref|XP_015888890.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Ziziphus jujuba])

HSP 1 Score: 666.8 bits (1719), Expect = 3.0e-188
Identity = 320/488 (65.57%), Postives = 401/488 (82.17%), Query Frame = 1

Query: 1   MTVRWPRLLTPTHLSQIIRKQNNPFTAYQLFNEAKCRYPNYQHNGPVYAAMINILGNSGR 60
           M +RWPRLLTPTHLSQIIR Q NP TA ++FNEAK +YPNY+HNGPVYAAMI ILGNSGR
Sbjct: 1   MNIRWPRLLTPTHLSQIIRTQKNPLTALKIFNEAKSKYPNYRHNGPVYAAMIGILGNSGR 60

Query: 61  ISEMREVIDQMKVDSCQCKDSIFSFAIKTYASHGLLEEGISLFKSLGGFNCTDRTQTFNT 120
           I+EM+EVIDQMK+DSC+CKDS+F  AIKTY   GLL+E ++LFK++  FNC + T++FNT
Sbjct: 61  ITEMKEVIDQMKIDSCECKDSVFVHAIKTYERAGLLDEVLTLFKNIPKFNCVNWTESFNT 120

Query: 121 LLEILLNESQLDAACQLFQQSSFGWEVKSRTQSLNLLMQSLCQRGQSELALHVFKEMDYQ 180
           +L+I++NES+ + A +LF ++S GWEV+SR +SLNLLM++LC+RG+S++AL VF+EMDYQ
Sbjct: 121 VLQIMVNESRFETAHRLFLENSCGWEVRSRIRSLNLLMRALCERGRSDIALQVFQEMDYQ 180

Query: 181 SCYPNRLSYLILMKGLCQDGKLHEAIHLLYSMFWRISRRGSGGDIVIYRTLLFALCDNGE 240
            CYP R +Y  LM+GLC+DG+L+EA HLLYSMFWRIS++GSG DIVIYRTLL A CD+G+
Sbjct: 181 GCYPERETYRTLMRGLCEDGRLNEAKHLLYSMFWRISQKGSGADIVIYRTLLDAFCDHGQ 240

Query: 241 IEQAVEILGKILKKGLKAPKRAHYLIDLNYCRISKLTVTEIKCLINEALIKGGIPSSDSY 300
           +E+A+EILGKIL+KGLKAPKR+ + +DL+YCR  +  V  IK LINEALIKGGIPS  SY
Sbjct: 241 VEEAMEILGKILRKGLKAPKRSSHRMDLSYCRDGQ-DVERIKQLINEALIKGGIPSLASY 300

Query: 301 CAMAIDLYNENETDQGDKVVSHMLAKGFWPPSSVYEAKAAALCKEGKVDDAVKVIEEETV 360
            AMAIDLY EN+  + DKV+  M   GF P   +YEAK  ALC+E KVD+AVKVIE E +
Sbjct: 301 TAMAIDLYKENKIVEADKVLEVMQDGGFRPTPLIYEAKMEALCREAKVDEAVKVIETEMM 360

Query: 361 KGSCVPTVALYNIVLNGLCRVGKSTVAMEFLKKMVKQVGLVADKETYSTLVHGLCRENRY 420
           +G+CVPTV LYN+VL GLC  GKS  A+ +LKKM KQVG VADKETY+ LV GLCRE R+
Sbjct: 361 EGTCVPTVRLYNVVLRGLCNGGKSAFAVGYLKKMAKQVGCVADKETYTILVDGLCREGRF 420

Query: 421 TEACKLLEEMVIKSHWPCSNTFNTLTRGLCSVGKPYKAVMCLEEMISQGQLPELSVWNAL 480
            EA ++LEEM+IKSHWPC  T+N + +GLCSVG+ Y+AV+ LEEMISQG LP+ S WN+L
Sbjct: 421 AEASRVLEEMLIKSHWPCVETYNVVIKGLCSVGRQYEAVLWLEEMISQGMLPQNSSWNSL 480

Query: 481 VSSLCFNV 489
           VSS+C N+
Sbjct: 481 VSSVCCNL 487

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR11_ARATH5.6e-15855.12Pentatricopeptide repeat-containing protein At1g05600 OS=Arabidopsis thaliana GN... [more]
PP120_ARATH1.3e-4224.16Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th... [more]
PP281_ARATH3.0e-4225.93Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
PP327_ARATH1.5e-4124.32Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana GN... [more]
PP388_ARATH7.6e-3823.89Pentatricopeptide repeat-containing protein At5g16420, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0K9Q4_CUCSA5.5e-24584.34Uncharacterized protein OS=Cucumis sativus GN=Csa_6G095860 PE=4 SV=1[more]
V4VX73_9ROSI5.4e-18463.89Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019806mg PE=4 SV=1[more]
A0A067HFD7_CITSI4.6e-18363.69Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g043204mg PE=4 SV=1[more]
M5XFE6_PRUPE6.0e-18362.23Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004557mg PE=4 SV=1[more]
B9RNK1_RICCO5.2e-17961.51Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
Match NameE-valueIdentityDescription
AT1G05600.13.2e-15955.12 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G74580.17.5e-4424.16 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53700.11.7e-4325.93 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G20090.18.3e-4324.32 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G16420.14.3e-3923.89 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659119763|ref|XP_008459831.1|3.5e-25386.55PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cuc... [more]
gi|449445756|ref|XP_004140638.1|7.9e-24584.34PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Cuc... [more]
gi|659119766|ref|XP_008459832.1|3.4e-22486.38PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cuc... [more]
gi|778711932|ref|XP_011656819.1|8.2e-21884.60PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X2 [Cuc... [more]
gi|1009142747|ref|XP_015888890.1|3.0e-18865.57PREDICTED: pentatricopeptide repeat-containing protein At1g05600 isoform X1 [Ziz... [more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh08G001590CmoCh08G001590gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh08G001590.1CmoCh08G001590.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh08G001590.1.exon.1CmoCh08G001590.1.exon.1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh08G001590.1.CDS.1CmoCh08G001590.1.CDS.1CDS


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 226..256
score: 0.0032coord: 441..469
score: 1.8E-4coord: 48..76
score: 0.014coord: 86..107
score: 0.64coord: 341..357
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 366..416
score: 4.0E-11coord: 153..198
score: 1.5
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 406..431
score: 5.2E-7coord: 154..185
score: 4.8E-5coord: 441..470
score: 6.0E-5coord: 370..403
score: 1.3E-5coord: 226..257
score: 0.0024coord: 47..77
score: 6.5E-5coord: 188..212
score: 0.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 367..397
score: 9.076coord: 114..149
score: 5.788coord: 438..472
score: 9.679coord: 296..330
score: 6.818coord: 403..437
score: 11.455coord: 331..366
score: 8.035coord: 150..184
score: 9.58coord: 79..113
score: 6.193coord: 224..258
score: 10.424coord: 473..504
score: 5.919coord: 44..78
score: 8.572coord: 185..215
score: 7
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 329..356
score: 2.8E-6coord: 117..269
score: 2.8E-6coord: 20..61
score: 2.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 289..485
score: 1.9E-203coord: 1..251
score: 1.9E
NoneNo IPR availablePANTHERPTHR24015:SF542SUBFAMILY NOT NAMEDcoord: 289..485
score: 1.9E-203coord: 1..251
score: 1.9E