CmoCh20G000880.1 (mRNA) Cucurbita moschata (Rifu)

NameCmoCh20G000880.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu))
DescriptionPentatricopeptide repeat-containing protein
LocationCmo_Chr20 : 451557 .. 454094 (+)
Sequence length2538
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGATTTCAGCCACCCTAAGCATTCACGGACGCTCTCCGACTCCTAAACAAGCCATCAATGTCTCAAAGGACTGGAACTTGATTATAAAGCACCAAACCAAGCTTAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAGTCTCTTGGTATTGCACCCGATTCTGCTACAATGCCTCTTGTTCTAAAGGCTTGCGGGAGGCTCAACGCCATTGAAAAAGGGGTACGAATTCATTCTTGTATTAGGGATTCGGATTTGATCAGAGATGTTCGGGTTGGGACTGCCTTGGTCGATTTCTATAGTAAATGTGGGCTTGTTGGAGAGGCCAGTAAAGTGTTCGATGAAATGCCTGAAAGAGATTTGGTGTCGTGGAATGCATTGATTTCGGGATATGTGGGCTGTTCCTGCTATAAAGAAGCAGTGTTGTTGTTTATAGAGATGCAAAAGGCAGGCCTCACACCCAATTCTCGTACTGTAGTGCCTCTGCTTTTGGCTTGTGCTGAGATGTTGGAACTGCGATTAGGACATGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCACATGTTGGTACTGCTTTGATAGGATTTTATATGAGATTTGATGCAACAGTTTCTCACCGAGTTTTTAGCTCGATGGAGGTGAGAAATGTAGTGAGTTGGAATGCAATGATAACCGGATATCTCAATATTGGAGATTACACAAAAGCTTTGAAGCTTTTTAGTAGTATGCTGACTGAGGGTATAAAGTTTGATGCTGTTACAATGCTGCTGGTTATTCAAGCCTGTGCAGAATCTGAGTCTCTCCAATTAGGCATGCAACTGCATCAGTTGGCTATCAAGTTCAATTTCATTGGTGACTTGTTCGTATTAAATGCACTGTTGAATATGTATAGTGATAATGGACGTCTGGAGTCATCATGTGCGTTGTTTAATGCCGTTCCCACCTCTGATGCCGCCTTATGGAATTCTATGATATCTGCATACATTGCCTTCGGATTTCATGCTGAAGCTATAGCTTTGTATATTAAAATGCGTTTGGAAGGCTTAAAAGAAGACAAAAGAACCGTTGAGATTATGCTGTCTTTATGCGAAGATCTAAATGATGGTTCTATTTGGGGTAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATGGAACTAGATGTATTTCTGGGCAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAACTTTTTGATAAGATGAGAGGTTTGGACGTCATCTCCTGTAACACAATGATATTAGCACTTGCTCGGAGTAAGTTTCGAGCCAAAGCATTTGAACTCTTTATGACGATGTGTGAATCAGAAATCAAGTTCAATTCATACACAATGATATCTCTCCTTGCATTATGTAAAGATGGAAGTGATTTGGTGTTTGGGCGATCGATCCATGGTTTTGCAATAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATAAACTGTAGAGATGAAGGATCGGCTACAAATCTGTTTATTAGATGTCCTCAAAGAGATTTAGTTTCATGGAATTCCCTAATTTCGAGCTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCTAACTCCGTGACAATCATAAGTATTCTCACATCTTGTACCCAGCTTGCCCATCTACCACTAGGACAGTGCTTGCATGCTTACACAACTAGAAGGGGAGAATCTTTTGAATTGGATGCTTCTCTAGCAAATGCTTTTATAACTATGTATGCACGATGTGGTAAAATGCAATATGCAGAAAAGATTTTTAGCACCCTGAAGGCAAGAAATATTGTCTCATGGAATGCCATGATAACAGGGTATGGCATGCACGGTCGTGGACACGATGCTACTCTAGCCTTTGCACAGATGTTGGATGATGGTTTCAAGCCAAACAATATATCTTTTGTATCTGTTTTATCTGCCTGCAGCCATTCTGGTCTGACCAAGACCGGTTTGCAGCTTTTTAGTTCCATGGTGCGGGACTTTGGTATTGCTCCTCAACTTGCTCACTATGGTTGTATAGTTGATCTGCTTGGTCGTGGGGGCCATTTTGCTGAAGCTATAGCTCTCATCAGCTCAATGCCCGTTGAGCCTGATGCATCAATTTGGAGAGCTTTGCTCAGTTCATGTCAGGTTAAAAGCAATAAAAAACTAGTCGAAACCATCTTTAGAAAGCTTGTTGAATTAGAACCAAGCAATCCAGGGAATTTTGTTTTGCTTTCAAATATCTACGCAGCAGCAGGTCTTTGGTCAGAGGTTTCACAGATAAGAAAGTGGCTTAGAGATAAAGGTCTAGTGAAGCCTCCAGGAACTAGCTGGATTGTAATCGGAAGTCAGGTCCACTATTTCACTGCAACTGACGTATCACACCCTCAATCAGAAGAAATTTACGAAAATTTGAATTCTTTGACATCATTGATCCAAGATATGGGCTGA

mRNA sequence

ATGGAGATTTCAGCCACCCTAAGCATTCACGGACGCTCTCCGACTCCTAAACAAGCCATCAATGTCTCAAAGGACTGGAACTTGATTATAAAGCACCAAACCAAGCTTAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAGTCTCTTGGTATTGCACCCGATTCTGCTACAATGCCTCTTGTTCTAAAGGCTTGCGGGAGGCTCAACGCCATTGAAAAAGGGGTACGAATTCATTCTTGTATTAGGGATTCGGATTTGATCAGAGATGTTCGGGTTGGGACTGCCTTGGTCGATTTCTATAGTAAATGTGGGCTTGTTGGAGAGGCCAGTAAAGTGTTCGATGAAATGCCTGAAAGAGATTTGGTGTCGTGGAATGCATTGATTTCGGGATATGTGGGCTGTTCCTGCTATAAAGAAGCAGTGTTGTTGTTTATAGAGATGCAAAAGGCAGGCCTCACACCCAATTCTCGTACTGTAGTGCCTCTGCTTTTGGCTTGTGCTGAGATGTTGGAACTGCGATTAGGACATGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCACATGTTGGTACTGCTTTGATAGGATTTTATATGAGATTTGATGCAACAGTTTCTCACCGAGTTTTTAGCTCGATGGAGGTGAGAAATGTAGTGAGTTGGAATGCAATGATAACCGGATATCTCAATATTGGAGATTACACAAAAGCTTTGAAGCTTTTTAGTAGTATGCTGACTGAGGGTATAAAGTTTGATGCTGTTACAATGCTGCTGGTTATTCAAGCCTGTGCAGAATCTGAGTCTCTCCAATTAGGCATGCAACTGCATCAGTTGGCTATCAAGTTCAATTTCATTGGTGACTTGTTCGTATTAAATGCACTGTTGAATATGTATAGTGATAATGGACGTCTGGAGTCATCATGTGCGTTGTTTAATGCCGTTCCCACCTCTGATGCCGCCTTATGGAATTCTATGATATCTGCATACATTGCCTTCGGATTTCATGCTGAAGCTATAGCTTTGTATATTAAAATGCGTTTGGAAGGCTTAAAAGAAGACAAAAGAACCGTTGAGATTATGCTGTCTTTATGCGAAGATCTAAATGATGGTTCTATTTGGGGTAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATGGAACTAGATGTATTTCTGGGCAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAACTTTTTGATAAGATGAGAGGTTTGGACGTCATCTCCTGTAACACAATGATATTAGCACTTGCTCGGAGTAAGTTTCGAGCCAAAGCATTTGAACTCTTTATGACGATGTGTGAATCAGAAATCAAGTTCAATTCATACACAATGATATCTCTCCTTGCATTATGTAAAGATGGAAGTGATTTGGTGTTTGGGCGATCGATCCATGGTTTTGCAATAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATAAACTGTAGAGATGAAGGATCGGCTACAAATCTGTTTATTAGATGTCCTCAAAGAGATTTAGTTTCATGGAATTCCCTAATTTCGAGCTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCTAACTCCGTGACAATCATAAGTATTCTCACATCTTGTACCCAGCTTGCCCATCTACCACTAGGACAGTGCTTGCATGCTTACACAACTAGAAGGGGAGAATCTTTTGAATTGGATGCTTCTCTAGCAAATGCTTTTATAACTATGTATGCACGATGTGGTAAAATGCAATATGCAGAAAAGATTTTTAGCACCCTGAAGGCAAGAAATATTGTCTCATGGAATGCCATGATAACAGGGTATGGCATGCACGGTCGTGGACACGATGCTACTCTAGCCTTTGCACAGATGTTGGATGATGGTTTCAAGCCAAACAATATATCTTTTGTATCTGTTTTATCTGCCTGCAGCCATTCTGGTCTGACCAAGACCGGTTTGCAGCTTTTTAGTTCCATGGTGCGGGACTTTGGTATTGCTCCTCAACTTGCTCACTATGGTTGTATAGTTGATCTGCTTGGTCGTGGGGGCCATTTTGCTGAAGCTATAGCTCTCATCAGCTCAATGCCCGTTGAGCCTGATGCATCAATTTGGAGAGCTTTGCTCAGTTCATGTCAGGTTAAAAGCAATAAAAAACTAGTCGAAACCATCTTTAGAAAGCTTGTTGAATTAGAACCAAGCAATCCAGGGAATTTTGTTTTGCTTTCAAATATCTACGCAGCAGCAGGTCTTTGGTCAGAGGTTTCACAGATAAGAAAGTGGCTTAGAGATAAAGGTCTAGTGAAGCCTCCAGGAACTAGCTGGATTGTAATCGGAAGTCAGGTCCACTATTTCACTGCAACTGACGTATCACACCCTCAATCAGAAGAAATTTACGAAAATTTGAATTCTTTGACATCATTGATCCAAGATATGGGCTGA

Coding sequence (CDS)

ATGGAGATTTCAGCCACCCTAAGCATTCACGGACGCTCTCCGACTCCTAAACAAGCCATCAATGTCTCAAAGGACTGGAACTTGATTATAAAGCACCAAACCAAGCTTAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAGTCTCTTGGTATTGCACCCGATTCTGCTACAATGCCTCTTGTTCTAAAGGCTTGCGGGAGGCTCAACGCCATTGAAAAAGGGGTACGAATTCATTCTTGTATTAGGGATTCGGATTTGATCAGAGATGTTCGGGTTGGGACTGCCTTGGTCGATTTCTATAGTAAATGTGGGCTTGTTGGAGAGGCCAGTAAAGTGTTCGATGAAATGCCTGAAAGAGATTTGGTGTCGTGGAATGCATTGATTTCGGGATATGTGGGCTGTTCCTGCTATAAAGAAGCAGTGTTGTTGTTTATAGAGATGCAAAAGGCAGGCCTCACACCCAATTCTCGTACTGTAGTGCCTCTGCTTTTGGCTTGTGCTGAGATGTTGGAACTGCGATTAGGACATGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCACATGTTGGTACTGCTTTGATAGGATTTTATATGAGATTTGATGCAACAGTTTCTCACCGAGTTTTTAGCTCGATGGAGGTGAGAAATGTAGTGAGTTGGAATGCAATGATAACCGGATATCTCAATATTGGAGATTACACAAAAGCTTTGAAGCTTTTTAGTAGTATGCTGACTGAGGGTATAAAGTTTGATGCTGTTACAATGCTGCTGGTTATTCAAGCCTGTGCAGAATCTGAGTCTCTCCAATTAGGCATGCAACTGCATCAGTTGGCTATCAAGTTCAATTTCATTGGTGACTTGTTCGTATTAAATGCACTGTTGAATATGTATAGTGATAATGGACGTCTGGAGTCATCATGTGCGTTGTTTAATGCCGTTCCCACCTCTGATGCCGCCTTATGGAATTCTATGATATCTGCATACATTGCCTTCGGATTTCATGCTGAAGCTATAGCTTTGTATATTAAAATGCGTTTGGAAGGCTTAAAAGAAGACAAAAGAACCGTTGAGATTATGCTGTCTTTATGCGAAGATCTAAATGATGGTTCTATTTGGGGTAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATGGAACTAGATGTATTTCTGGGCAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAACTTTTTGATAAGATGAGAGGTTTGGACGTCATCTCCTGTAACACAATGATATTAGCACTTGCTCGGAGTAAGTTTCGAGCCAAAGCATTTGAACTCTTTATGACGATGTGTGAATCAGAAATCAAGTTCAATTCATACACAATGATATCTCTCCTTGCATTATGTAAAGATGGAAGTGATTTGGTGTTTGGGCGATCGATCCATGGTTTTGCAATAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATAAACTGTAGAGATGAAGGATCGGCTACAAATCTGTTTATTAGATGTCCTCAAAGAGATTTAGTTTCATGGAATTCCCTAATTTCGAGCTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCTAACTCCGTGACAATCATAAGTATTCTCACATCTTGTACCCAGCTTGCCCATCTACCACTAGGACAGTGCTTGCATGCTTACACAACTAGAAGGGGAGAATCTTTTGAATTGGATGCTTCTCTAGCAAATGCTTTTATAACTATGTATGCACGATGTGGTAAAATGCAATATGCAGAAAAGATTTTTAGCACCCTGAAGGCAAGAAATATTGTCTCATGGAATGCCATGATAACAGGGTATGGCATGCACGGTCGTGGACACGATGCTACTCTAGCCTTTGCACAGATGTTGGATGATGGTTTCAAGCCAAACAATATATCTTTTGTATCTGTTTTATCTGCCTGCAGCCATTCTGGTCTGACCAAGACCGGTTTGCAGCTTTTTAGTTCCATGGTGCGGGACTTTGGTATTGCTCCTCAACTTGCTCACTATGGTTGTATAGTTGATCTGCTTGGTCGTGGGGGCCATTTTGCTGAAGCTATAGCTCTCATCAGCTCAATGCCCGTTGAGCCTGATGCATCAATTTGGAGAGCTTTGCTCAGTTCATGTCAGGTTAAAAGCAATAAAAAACTAGTCGAAACCATCTTTAGAAAGCTTGTTGAATTAGAACCAAGCAATCCAGGGAATTTTGTTTTGCTTTCAAATATCTACGCAGCAGCAGGTCTTTGGTCAGAGGTTTCACAGATAAGAAAGTGGCTTAGAGATAAAGGTCTAGTGAAGCCTCCAGGAACTAGCTGGATTGTAATCGGAAGTCAGGTCCACTATTTCACTGCAACTGACGTATCACACCCTCAATCAGAAGAAATTTACGAAAATTTGAATTCTTTGACATCATTGATCCAAGATATGGGCTGA
BLAST of CmoCh20G000880.1 vs. Swiss-Prot
Match: PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 456.1 bits (1172), Expect = 8.6e-127
Identity = 261/823 (31.71%), Postives = 430/823 (52.25%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKAC-GRLNAIEKGVRIHSCI 85
           WN +IK          +   + +M S  + P+  T   VL+AC G   A +   +IH+ I
Sbjct: 154 WNKMIKELASRNLIGEVFGLFVRMVSENVTPNEGTFSGVLEACRGGSVAFDVVEQIHARI 213

Query: 86  RDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSCYKEAV 145
               L     V   L+D YS+ G V  A +VFD +  +D  SW A+ISG     C  EA+
Sbjct: 214 LYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGLRLKDHSSWVAMISGLSKNECEAEAI 273

Query: 146 LLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGF 205
            LF +M   G+ P       +L AC ++  L +G ++HG  L+ G F  D +V  AL+  
Sbjct: 274 RLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLG-FSSDTYVCNALVSL 333

Query: 206 YMRFDATVS-HRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTM 265
           Y      +S   +FS+M  R+ V++N +I G    G   KA++LF  M  +G++ D+ T+
Sbjct: 334 YFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTL 393

Query: 266 LLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLESSCALFNAVPT 325
             ++ AC+   +L  G QLH    K  F  +  +  ALLN+Y+    +E++   F     
Sbjct: 394 ASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEV 453

Query: 326 SDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGRG 385
            +  LWN M+ AY        +  ++ +M++E +  ++ T   +L  C  L D  + G  
Sbjct: 454 ENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLEL-GEQ 513

Query: 386 LHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNTMILALARSKF 445
           +H+  +K+  +L+ ++ + L+ MY +  ++D A  +  +  G DV+S  TMI    +  F
Sbjct: 514 IHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNF 573

Query: 446 RAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTS 505
             KA   F  M +  I+ +   + + ++ C     L  G+ IH  A  +G   +     +
Sbjct: 574 DDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNA 633

Query: 506 LTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISE-LEPN 565
           L  +Y  C     +   F +    D ++WN+L+S + ++ N  +AL +F  M  E ++ N
Sbjct: 634 LVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNN 693

Query: 566 SVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYARCGKMQYAEKI 625
           + T  S + + ++ A++  G+ +HA  T+ G  ++ +  + NA I+MYA+CG +  AEK 
Sbjct: 694 NFTFGSAVKAASETANMKQGKQVHAVITKTG--YDSETEVCNALISMYAKCGSISDAEKQ 753

Query: 626 FSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSVLSACSHSGLT 685
           F  +  +N VSWNA+I  Y  HG G +A  +F QM+    +PN+++ V VLSACSH GL 
Sbjct: 754 FLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLV 813

Query: 686 KTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEPDASIWRALLS 745
             G+  F SM  ++G++P+  HY C+VD+L R G  + A   I  MP++PDA +WR LLS
Sbjct: 814 DKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLS 873

Query: 746 SCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRKWLRDKGLVKP 805
           +C V  N ++ E     L+ELEP +   +VLLSN+YA +  W      R+ +++KG+ K 
Sbjct: 874 ACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKE 933

Query: 806 PGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           PG SWI + + +H F   D +HP ++EI+E    LT    ++G
Sbjct: 934 PGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIG 972

BLAST of CmoCh20G000880.1 vs. Swiss-Prot
Match: PP296_ARATH (Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H83 PE=2 SV=2)

HSP 1 Score: 448.0 bits (1151), Expect = 2.3e-124
Identity = 255/785 (32.48%), Postives = 432/785 (55.03%), Query Frame = 1

Query: 64  VLKACGRLNAIEKGVRIHSCIRDS--DLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPE 123
           VL+ CG+  A+ +G ++HS I  +      D   G  LV  Y KCG + +A KVFDEMP+
Sbjct: 86  VLELCGKRRAVSQGRQLHSRIFKTFPSFELDFLAGK-LVFMYGKCGSLDDAEKVFDEMPD 145

Query: 124 RDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEI 183
           R   +WN +I  YV       A+ L+  M+  G+     +   LL ACA++ ++R G E+
Sbjct: 146 RTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIRSGSEL 205

Query: 184 HGYCLRNGLFDMDAHVGTALIGFYMRFD-ATVSHRVFSSMEVR-NVVSWNAMITGYLNIG 243
           H   ++ G +     +  AL+  Y + D  + + R+F   + + + V WN++++ Y   G
Sbjct: 206 HSLLVKLG-YHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYSTSG 265

Query: 244 DYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFN-FIGDLFVL 303
              + L+LF  M   G   ++ T++  + AC      +LG ++H   +K +    +L+V 
Sbjct: 266 KSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELYVC 325

Query: 304 NALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLK 363
           NAL+ MY+  G++  +  +   +  +D   WNS+I  Y+    + EA+  +  M   G K
Sbjct: 326 NALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGHK 385

Query: 364 EDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQK 423
            D+ ++  +++    L++  + G  LHA+ +K G + ++ +GN L+ MY + N      +
Sbjct: 386 SDEVSMTSIIAASGRLSN-LLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGR 445

Query: 424 LFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSD 483
            F +M   D+IS  T+I   A++    +A ELF  + +  ++ +   + S+L        
Sbjct: 446 AFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLKS 505

Query: 484 LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISS 543
           ++  + IH   ++ GL ++T +   L ++Y  CR+ G AT +F     +D+VSW S+ISS
Sbjct: 506 MLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMISS 565

Query: 544 YIKNDNAGKALLLFNHMISE-LEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFE 603
              N N  +A+ LF  M+   L  +SV ++ IL++   L+ L  G+ +H Y  R+G  F 
Sbjct: 566 SALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKG--FC 625

Query: 604 LDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQM 663
           L+ S+A A + MYA CG +Q A+ +F  ++ + ++ + +MI  YGMHG G  A   F +M
Sbjct: 626 LEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKM 685

Query: 664 LDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGH 723
             +   P++ISF+++L ACSH+GL   G      M  ++ + P   HY C+VD+LGR   
Sbjct: 686 RHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANC 745

Query: 724 FAEAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNI 783
             EA   +  M  EP A +W ALL++C+  S K++ E   ++L+ELEP NPGN VL+SN+
Sbjct: 746 VVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNV 805

Query: 784 YAAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSL 843
           +A  G W++V ++R  ++  G+ K PG SWI +  +VH FTA D SHP+S+EIYE L+ +
Sbjct: 806 FAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARDKSHPESKEIYEKLSEV 864

BLAST of CmoCh20G000880.1 vs. Swiss-Prot
Match: PP333_ARATH (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 8.9e-124
Identity = 247/785 (31.46%), Postives = 416/785 (52.99%), Query Frame = 1

Query: 61  MPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMP 120
           + L+L+AC   N + +G ++H+ +  + +  D      ++  Y+ CG   +  K+F  + 
Sbjct: 38  LSLLLQACSNPNLLRQGKQVHAFLIVNSISGDSYTDERILGMYAMCGSFSDCGKMFYRLD 97

Query: 121 ER--DLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLG 180
            R   +  WN++IS +V      +A+  + +M   G++P+  T   L+ AC  +   + G
Sbjct: 98  LRRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPDVSTFPCLVKACVALKNFK-G 157

Query: 181 HEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAT-VSHRVFSSMEVRNVVSWNAMITGYLN 240
            +     + +   D +  V ++LI  Y+ +    V  ++F  +  ++ V WN M+ GY  
Sbjct: 158 IDFLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAK 217

Query: 241 IGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFV 300
            G     +K FS M  + I  +AVT   V+  CA    + LG+QLH L +      +  +
Sbjct: 218 CGALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSI 277

Query: 301 LNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGL 360
            N+LL+MYS  GR + +  LF  +  +D   WN MIS Y+  G   E++  + +M   G+
Sbjct: 278 KNSLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGV 337

Query: 361 KEDKRTVEIML---SLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQID 420
             D  T   +L   S  E+L     + + +H + M+  + LD+FL +AL+  Y +   + 
Sbjct: 338 LPDAITFSSLLPSVSKFENLE----YCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVS 397

Query: 421 AAQKLFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCK 480
            AQ +F +   +DV+    MI     +     + E+F  + + +I  N  T++S+L +  
Sbjct: 398 MAQNIFSQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIG 457

Query: 481 DGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNS 540
               L  GR +HGF IK G +   ++  ++ +MY  C     A  +F R  +RD+VSWNS
Sbjct: 458 ILLALKLGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNS 517

Query: 541 LISSYIKNDNAGKALLLFNHM-ISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRG 600
           +I+   ++DN   A+ +F  M +S +  + V+I + L++C  L     G+ +H +  +  
Sbjct: 518 MITRCAQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKH- 577

Query: 601 ESFELDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLA 660
            S   D    +  I MYA+CG ++ A  +F T+K +NIVSWN++I   G HG+  D+   
Sbjct: 578 -SLASDVYSESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCL 637

Query: 661 FAQMLD-DGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLL 720
           F +M++  G +P+ I+F+ ++S+C H G    G++ F SM  D+GI PQ  HY C+VDL 
Sbjct: 638 FHEMVEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLF 697

Query: 721 GRGGHFAEAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFV 780
           GR G   EA   + SMP  PDA +W  LL +C++  N +L E    KL++L+PSN G +V
Sbjct: 698 GRAGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYV 757

Query: 781 LLSNIYAAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYE 838
           L+SN +A A  W  V+++R  ++++ + K PG SWI I  + H F + DV+HP+S  IY 
Sbjct: 758 LISNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYS 815

BLAST of CmoCh20G000880.1 vs. Swiss-Prot
Match: PP357_ARATH (Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana GN=PCMP-E52 PE=3 SV=1)

HSP 1 Score: 441.4 bits (1134), Expect = 2.2e-122
Identity = 256/776 (32.99%), Postives = 429/776 (55.28%), Query Frame = 1

Query: 80  IHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSC 139
           +H  I    L  D  +   L++ YS+ G +  A KVF++MPER+LVSW+ ++S       
Sbjct: 66  VHGQIIVWGLELDTYLSNILINLYSRAGGMVYARKVFEKMPERNLVSWSTMVSACNHHGI 125

Query: 140 YKEAVLLFIEMQKAGL-TPNSRTVVPLLLACAEMLELR---LGHEIHGYCLRNGLFDMDA 199
           Y+E++++F+E  +    +PN   +   + AC+  L+ R   +  ++  + +++G FD D 
Sbjct: 126 YEESLVVFLEFWRTRKDSPNEYILSSFIQACSG-LDGRGRWMVFQLQSFLVKSG-FDRDV 185

Query: 200 HVGTALIGFYMRFDATVSHR--VFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLT 259
           +VGT LI FY++ D  + +   VF ++  ++ V+W  MI+G + +G    +L+LF  ++ 
Sbjct: 186 YVGTLLIDFYLK-DGNIDYARLVFDALPEKSTVTWTTMISGCVKMGRSYVSLQLFYQLME 245

Query: 260 EGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLES 319
           + +  D   +  V+ AC+    L+ G Q+H   +++    D  ++N L++ Y   GR+ +
Sbjct: 246 DNVVPDGYILSTVLSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIA 305

Query: 320 SCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCED 379
           +  LFN +P  +   W +++S Y     H EA+ L+  M   GLK D      +L+ C  
Sbjct: 306 AHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCAS 365

Query: 380 LNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNT 439
           L+    +G  +HA+ +K+ +  D ++ N+L+ MY + + +  A+K+FD     DV+  N 
Sbjct: 366 LHALG-FGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNA 425

Query: 440 MILALARSKFR---AKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAI 499
           MI   +R   +    +A  +F  M    I+ +  T +SLL      + L   + IHG   
Sbjct: 426 MIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMF 485

Query: 500 KNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALL 559
           K GL ++    ++L ++Y NC     +  +F     +DLV WNS+ + Y++     +AL 
Sbjct: 486 KYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALN 545

Query: 560 LFNHM-ISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITM 619
           LF  + +S   P+  T  +++T+   LA + LGQ  H    +RG   E +  + NA + M
Sbjct: 546 LFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRG--LECNPYITNALLDM 605

Query: 620 YARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISF 679
           YA+CG  + A K F +  +R++V WN++I+ Y  HG G  A     +M+ +G +PN I+F
Sbjct: 606 YAKCGSPEDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYITF 665

Query: 680 VSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMP 739
           V VLSACSH+GL + GL+ F  M+R FGI P+  HY C+V LLGR G   +A  LI  MP
Sbjct: 666 VGVLSACSHAGLVEDGLKQFELMLR-FGIEPETEHYVCMVSLLGRAGRLNKARELIEKMP 725

Query: 740 VEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQ 799
            +P A +WR+LLS C    N +L E      +  +P + G+F +LSNIYA+ G+W+E  +
Sbjct: 726 TKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTEAKK 785

Query: 800 IRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           +R+ ++ +G+VK PG SWI I  +VH F + D SH ++ +IYE L+ L  L+Q  G
Sbjct: 786 VRERMKVEGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDL--LVQIRG 832

BLAST of CmoCh20G000880.1 vs. Swiss-Prot
Match: PP207_ARATH (Pentatricopeptide repeat-containing protein At3g02330 OS=Arabidopsis thaliana GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 433.0 bits (1112), Expect = 7.8e-120
Identity = 258/781 (33.03%), Postives = 417/781 (53.39%), Query Frame = 1

Query: 82  SCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSCYK 141
           S + D   +RDV     +++ YSK   + +A+  F+ MP RD+VSWN+++SGY+      
Sbjct: 103 SMVFDKMPLRDVVSWNKMINGYSKSNDMFKANSFFNMMPVRDVVSWNSMLSGYLQNGESL 162

Query: 142 EAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTAL 201
           +++ +F++M + G+  + RT   +L  C+ + +  LG +IHG  +R G  D D    +AL
Sbjct: 163 KSIEVFVDMGREGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGC-DTDVVAASAL 222

Query: 202 IGFYMRFDATV-SHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDA 261
           +  Y +    V S RVF  +  +N VSW+A+I G +     + ALK F  M         
Sbjct: 223 LDMYAKGKRFVESLRVFQGIPEKNSVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQ 282

Query: 262 VTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLESSCALFNA 321
                V+++CA    L+LG QLH  A+K +F  D  V  A L+MY+    ++ +  LF+ 
Sbjct: 283 SIYASVLRSCAALSELRLGGQLHAHALKSDFAADGIVRTATLDMYAKCDNMQDAQILFDN 342

Query: 322 VPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIW 381
               +   +N+MI+ Y       +A+ L+ ++   GL  D+ ++  +   C  L  G   
Sbjct: 343 SENLNRQSYNAMITGYSQEEHGFKALLLFHRLMSSGLGFDEISLSGVFRACA-LVKGLSE 402

Query: 382 GRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNTMILALAR 441
           G  ++  A+KS + LDV + NA + MY +   +  A ++FD+MR  D +S N +I A  +
Sbjct: 403 GLQIYGLAIKSSLSLDVCVANAAIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQ 462

Query: 442 SKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSL 501
           +    +   LF++M  S I+ + +T  S+L  C  GS L +G  IH   +K+G+  N+S+
Sbjct: 463 NGKGYETLFLFVSMLRSRIEPDEFTFGSILKACTGGS-LGYGMEIHSSIVKSGMASNSSV 522

Query: 502 NTSLTEMYINCRDEGSATNLFIRCPQRD--------------------LVSWNSLISSYI 561
             SL +MY  C     A  +  R  QR                      VSWNS+IS Y+
Sbjct: 523 GCSLIDMYSKCGMIEEAEKIHSRFFQRANVSGTMEELEKMHNKRLQEMCVSWNSIISGYV 582

Query: 562 KNDNAGKALLLFNHMISE-LEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELD 621
             + +  A +LF  M+   + P+  T  ++L +C  LA   LG+ +HA   ++    + D
Sbjct: 583 MKEQSEDAQMLFTRMMEMGITPDKFTYATVLDTCANLASAGLGKQIHAQVIKK--ELQSD 642

Query: 622 ASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLD 681
             + +  + MY++CG +  +  +F     R+ V+WNAMI GY  HG+G +A   F +M+ 
Sbjct: 643 VYICSTLVDMYSKCGDLHDSRLMFEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMIL 702

Query: 682 DGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFA 741
           +  KPN+++F+S+L AC+H GL   GL+ F  M RD+G+ PQL HY  +VD+LG+ G   
Sbjct: 703 ENIKPNHVTFISILRACAHMGLIDKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVK 762

Query: 742 EAIALISSMPVEPDASIWRALLSSCQV-KSNKKLVETIFRKLVELEPSNPGNFVLLSNIY 801
            A+ LI  MP E D  IWR LL  C + ++N ++ E     L+ L+P +   + LLSN+Y
Sbjct: 763 RALELIREMPFEADDVIWRTLLGVCTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVY 822

Query: 802 AAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLT 840
           A AG+W +VS +R+ +R   L K PG SW+ +  ++H F   D +HP+ EEIYE L  + 
Sbjct: 823 ADAGMWEKVSDLRRNMRGFKLKKEPGCSWVELKDELHVFLVGDKAHPRWEEIYEELGLIY 878

BLAST of CmoCh20G000880.1 vs. TrEMBL
Match: A0A061GS93_THECC (Pentatricopeptide repeat-containing protein OS=Theobroma cacao GN=TCM_040700 PE=4 SV=1)

HSP 1 Score: 1055.8 bits (2729), Expect = 2.7e-305
Identity = 516/845 (61.07%), Postives = 647/845 (76.57%), Query Frame = 1

Query: 1   MEISATLSIHGRSPTPKQAINVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSAT 60
           M I  +L +H          +  KDWN +IK+QT LKNDHAILSTY++MESLG+ P+ A 
Sbjct: 1   MLIPFSLGLHFAPKNTHIKDDPQKDWNSLIKNQTNLKNDHAILSTYSRMESLGLTPNRAA 60

Query: 61  MPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMP 120
           +PLVLKAC +LNA+E G RIH  IR+++LI DVRVGTA++DFY KCG + EA KVFDEM 
Sbjct: 61  LPLVLKACVKLNAVETGKRIHLSIRNTNLIEDVRVGTAIIDFYCKCGFIEEARKVFDEMV 120

Query: 121 ERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHE 180
           ERDLVSWNA+ISGY GC  ++E V L + MQ+ G  PNSRT+V +LLAC E+ E+RLG E
Sbjct: 121 ERDLVSWNAMISGYAGCGEFEEVVFLVMRMQREGFRPNSRTLVAMLLACQEVAEVRLGKE 180

Query: 181 IHGYCLRNGLFDMDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNIGD 240
           IHGYCLRNGLFD+D HVGTALIGFY+ F+   SH VF  M VRN V WNAMI GY +IG+
Sbjct: 181 IHGYCLRNGLFDLDPHVGTALIGFYLSFNVRASHTVFDLMAVRNTVCWNAMIKGYFDIGE 240

Query: 241 YTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNA 300
             KALKLF  ML +G++FD+VTML +IQACAE  SL+LG Q+HQ+AIK ++  DLF++NA
Sbjct: 241 SLKALKLFEKMLMDGVEFDSVTMLALIQACAEFGSLELGSQIHQMAIKCSYSNDLFIVNA 300

Query: 301 LLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKED 360
           LLNMY+D G L+S+C LF+  P  D ALWNSMISAY  +  + EA +L++ MR EG KED
Sbjct: 301 LLNMYADIGSLKSACKLFDVTPRRDVALWNSMISAYFEYSCNEEATSLFVHMRTEGNKED 360

Query: 361 KRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLF 420
            RT+ IM SLC +  DG   G+ LHA+A KSGM +DV LGNA+L+MY + N ID+ QK+F
Sbjct: 361 DRTIVIMFSLCAESADGLRKGKSLHAYASKSGMRMDVNLGNAMLNMYAQQNCIDSVQKVF 420

Query: 421 DKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLV 480
            +M  +DVIS NT+ILALAR+K  ++A+E+F  M E +++ NSYT+IS+LA CKD + L 
Sbjct: 421 SEMSNVDVISFNTVILALARNKLGSEAWEVFGLMWELDVEPNSYTIISILAACKDETCLN 480

Query: 481 FGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYI 540
            GRS+HGF IK G+E+N SLNT+LT+MYINC DE +A NLF  CP RDL+SWN+LI++Y+
Sbjct: 481 IGRSLHGFVIKQGIEVNVSLNTALTDMYINCGDEATARNLFESCPGRDLISWNALIATYV 540

Query: 541 KNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDA 600
           KN+ A +A L+F+ MISE+EPNSVTII+IL+SCT LAHLP GQC HAY  R+  S   + 
Sbjct: 541 KNNLAHEAFLVFSRMISEVEPNSVTIINILSSCTHLAHLPQGQCFHAYMLRQESSLGHNL 600

Query: 601 SLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDD 660
           SL NAFITMYARCG MQ AE+IF TL  RNI+SWNA+ITGYGMHGRG DA LAF+QML+D
Sbjct: 601 SLGNAFITMYARCGSMQSAERIFKTLPRRNIISWNAIITGYGMHGRGSDAILAFSQMLED 660

Query: 661 GFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAE 720
           G+ PN ++F+SVLSACSHSG+ + GL+LF SMV DF I PQLAHYGC+VDLLGR G   E
Sbjct: 661 GYYPNEVTFISVLSACSHSGMIEEGLRLFDSMVHDFHITPQLAHYGCVVDLLGRAGCLDE 720

Query: 721 AIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAA 780
           A   I SMP++PDAS+WRALLS+ +     K  + IF K+VEL+P NPGN+VL+ N YAA
Sbjct: 721 ARGFIESMPIKPDASVWRALLSAYRDHCYTKEAKAIFEKIVELDPMNPGNYVLVCNAYAA 780

Query: 781 AGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSL 840
           AGLWS+V QIR  L+ KGL KPPG SWIV+ SQ+H F A D SHP +++IY NLNSL   
Sbjct: 781 AGLWSDVRQIRTCLKAKGLRKPPGMSWIVVRSQIHSFAAGDRSHPMADKIYANLNSLLHS 840

Query: 841 IQDMG 846
           ++++G
Sbjct: 841 MKEIG 845

BLAST of CmoCh20G000880.1 vs. TrEMBL
Match: A0A0D2N9C6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G138400 PE=4 SV=1)

HSP 1 Score: 1047.7 bits (2708), Expect = 7.4e-303
Identity = 510/832 (61.30%), Postives = 646/832 (77.64%), Query Frame = 1

Query: 15  TPKQAINVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAI 74
           TPK   N  K WN IIKHQTKLKNDHAILST+T M+SLG+ PD A++PLVLKAC +LNAI
Sbjct: 11  TPKNETN--KAWNSIIKHQTKLKNDHAILSTFTHMQSLGLTPDKASLPLVLKACRKLNAI 70

Query: 75  EKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGY 134
           E G RIHS IRD++LI DVRVGTAL+DFYSKCG + +A KVFDEM ERDLVSWNA+ISGY
Sbjct: 71  ETGKRIHSSIRDTNLIEDVRVGTALIDFYSKCGFLEDARKVFDEMSERDLVSWNAMISGY 130

Query: 135 VGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMD 194
            GC  ++E V L + MQ+ G  PNSRT+V ++L C ++ E+RLG  IHGYCLRNGLFD+D
Sbjct: 131 AGCEEFEEVVFLVMTMQREGFRPNSRTLVAMILVCDKVAEVRLGKAIHGYCLRNGLFDLD 190

Query: 195 AHVGTALIGFYMRF-DATVSHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLT 254
           AHVGTALI FY+ F D   SH VF  M +RN V WNAMI GY ++G+ +KAL+LF  ML 
Sbjct: 191 AHVGTALISFYLSFFDVRASHLVFDLMAIRNTVCWNAMIMGYFDVGESSKALRLFEQMLM 250

Query: 255 EGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLES 314
           +G++FD+VT+L +IQA AE  SL+LG Q+HQ+AIK ++  DLF++NAL+NMY++ G L+S
Sbjct: 251 DGVEFDSVTVLALIQASAEFGSLELGDQIHQMAIKCSYSNDLFIVNALINMYAEIGCLKS 310

Query: 315 SCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCED 374
           +C LF+ +PT D ALWNSMISAYI + +H EAI+L+IKMR EG KED+RT  +MLSLC +
Sbjct: 311 ACKLFDGIPTRDVALWNSMISAYIDYSYHGEAISLFIKMRTEGNKEDERTTVLMLSLCAE 370

Query: 375 LNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNT 434
             D    GR LHAHA K+GM +D+ +GNA+L+MY E N +D+ +K+F +M  +DVIS NT
Sbjct: 371 SADALRKGRSLHAHACKTGMGMDINIGNAILNMYAEQNCMDSVRKVFGQMSNVDVISYNT 430

Query: 435 MILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNG 494
           +IL LAR+    +A+E F  M ES++K NSYT+IS+LA CKD + L  GRS+HGF IK G
Sbjct: 431 LILVLARNNLGIEAWETFGIMRESDVKPNSYTIISILAACKDETCLNIGRSLHGFVIKQG 490

Query: 495 LEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLFN 554
           +E+N  L T+LT+MYINC DE +A  LF     RDL+SWN+LIS+Y+KN+ A +A L+F+
Sbjct: 491 IEVNAPLKTALTDMYINCGDETTAMKLFESSHGRDLISWNALISTYVKNNQAHEAFLVFS 550

Query: 555 HMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYARC 614
            M+SE+EPNSVTII+IL+SCT LAHLP G+CLH+Y  +R  S   + SL NAFITMYARC
Sbjct: 551 RMVSEVEPNSVTIINILSSCTHLAHLPQGRCLHSYMIQRESSLGRNLSLQNAFITMYARC 610

Query: 615 GKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSVL 674
           G M+ AEKIF TL  RNI+SWNA+ITGYGMHGRG+DA LA++QML+DGF+PN ++F+S+L
Sbjct: 611 GSMRNAEKIFETLTRRNIISWNAIITGYGMHGRGYDAILAYSQMLEDGFQPNEVTFISIL 670

Query: 675 SACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEPD 734
           SACSHSG+ + GLQLF SMV DF I PQLAHYGC+VDLLGR G   +A   I SMP++PD
Sbjct: 671 SACSHSGMIEEGLQLFDSMVHDFNITPQLAHYGCVVDLLGRAGRLDKAREFIESMPIKPD 730

Query: 735 ASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRKW 794
           ASIWR+LLS+ +     K  + IF K+VEL+P NPGN VLL N+YAAAGLW EVS++R+ 
Sbjct: 731 ASIWRSLLSAYRDHCYTKDAKAIFEKVVELDPMNPGNHVLLCNVYAAAGLWPEVSEMRRH 790

Query: 795 LRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           LR KGL KPPG SWIV+ SQ+H F A D SHP +++IY NLNSL   I+++G
Sbjct: 791 LRAKGLRKPPGISWIVVRSQIHSFAAGDRSHPMADKIYANLNSLLQSIKEIG 840

BLAST of CmoCh20G000880.1 vs. TrEMBL
Match: A0A067JE40_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25923 PE=4 SV=1)

HSP 1 Score: 1030.0 bits (2662), Expect = 1.6e-297
Identity = 507/823 (61.60%), Postives = 630/823 (76.55%), Query Frame = 1

Query: 23  SKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGVRIHS 82
           SKDWN IIK+  KL+NDHAILSTYTQMESLG+ PD+ T+PL+ KAC RLNA E+G +IHS
Sbjct: 26  SKDWNAIIKYHAKLRNDHAILSTYTQMESLGLQPDNMTLPLIFKACTRLNAFERGKKIHS 85

Query: 83  CIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSCYKE 142
            I  +DLI++VRVGT++VDFY KCG + EA KVFD+M ERDLV WNA+ISGYVGC+ Y E
Sbjct: 86  SIESTDLIKNVRVGTSVVDFYCKCGHILEAHKVFDKMSERDLVLWNAIISGYVGCAYYVE 145

Query: 143 AVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALI 202
           A+  F  MQ+ GL PNSRT+V LLLAC  +LELRLG E+HGYCLR+G FD+  H+GTALI
Sbjct: 146 AIGQFRRMQREGLEPNSRTLVALLLACEGILELRLGQELHGYCLRSGYFDLYPHLGTALI 205

Query: 203 GFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVT 262
           GFY+ FD  +S  VF  M V++ VSWNAMITGY   GD+ KAL+LF  ML +G+KFD VT
Sbjct: 206 GFYLNFDVKISSLVFDLMIVKSAVSWNAMITGYFGSGDFVKALELFVQMLKDGVKFDMVT 265

Query: 263 MLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLESSCALFNAVP 322
           +L+ IQA AE  S +LGMQ+HQLAIK ++  +LF++NALLNMY++ G LE +C LF+ V 
Sbjct: 266 ILVSIQASAEIGSSELGMQIHQLAIKLSYGNELFIVNALLNMYAEIGNLELACRLFDTVT 325

Query: 323 TSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGR 382
             D  LWNSMI+AYI  G + EA +L+  MR E  +ED+RT+ ++LSL  +L DG   GR
Sbjct: 326 VHDVPLWNSMIAAYIDHGCYEEATSLFTTMRTE-TREDERTIAVILSLSAELTDGLKIGR 385

Query: 383 GLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNTMILALARSK 442
            LHA A K  M+++V LGNALLSMY + N ++ A K+F++M  +DV+  NT+ILA + S 
Sbjct: 386 SLHALAYKREMKMNVSLGNALLSMYADLNCVEDALKVFNEMSNIDVVPYNTLILAFSVSN 445

Query: 443 FRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNT 502
              KA+ELF  M ESE+  NS+TMISLLA C D   L  GRS+HGF IKN +EIN SLNT
Sbjct: 446 LSGKAWELFGMMRESEVSPNSHTMISLLASCGDEKCLNIGRSVHGFIIKNSIEINLSLNT 505

Query: 503 SLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPN 562
           SLTEMYINC D  +A  LF  CP RDL+SWN++I++ +KND  G+A+L FN MISE+EPN
Sbjct: 506 SLTEMYINCGDGAAARYLFDTCPSRDLISWNAIIAALLKNDKTGEAILFFNRMISEVEPN 565

Query: 563 SVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYARCGKMQYAEKI 622
           SVTII+IL++CT LA+LP GQCLHAY TRR  +F L+ SL NAFITMYARCG ++ AEKI
Sbjct: 566 SVTIINILSTCTNLANLPQGQCLHAYATRRFSAFGLNLSLGNAFITMYARCGSIRNAEKI 625

Query: 623 FSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSVLSACSHSGLT 682
           F TL  RN++SWN MITGYG HG  +DA L F  ML DGF+PN ++F+S LSAC H+G+ 
Sbjct: 626 FETLAKRNVISWNGMITGYGTHGCAYDAILTFKNMLKDGFQPNGVTFLSALSACRHAGMI 685

Query: 683 KTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEPDASIWRALLS 742
           K GLQLF+SMV+DF + P L+HYGC+VDLLGRGG   EA   I+SMP+EPDAS+WRALLS
Sbjct: 686 KEGLQLFNSMVQDFKMTPTLSHYGCVVDLLGRGGSLNEAREFINSMPIEPDASVWRALLS 745

Query: 743 SCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRKWLRDKGLVKP 802
           +C+V SN ++   IF  LVELEP+N GN++LLSNIYAAAG WSEV QIRKWL+DKGL KP
Sbjct: 746 ACRVHSNTEIAAEIFENLVELEPTNAGNYILLSNIYAAAGFWSEVRQIRKWLKDKGLKKP 805

Query: 803 PGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           PGTSW+++  QVH FTA D SH QS+ IY NLNSL  L ++ G
Sbjct: 806 PGTSWLIVRGQVHSFTAGDTSHLQSDRIYGNLNSLLLLTREYG 847

BLAST of CmoCh20G000880.1 vs. TrEMBL
Match: A5BC97_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043633 PE=4 SV=1)

HSP 1 Score: 1025.4 bits (2650), Expect = 3.9e-296
Identity = 513/833 (61.58%), Postives = 629/833 (75.51%), Query Frame = 1

Query: 13  SPTPKQAINVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLN 72
           SPT  Q I   K WN +IKHQ  LKND AILS YTQMESLG+ P++ T+PLVLKAC   N
Sbjct: 16  SPTKIQ-IKDPKHWNSVIKHQANLKNDQAILSAYTQMESLGVLPNNTTLPLVLKACAAQN 75

Query: 73  AIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALIS 132
           A+E+G  IH  I+ +DL+ DVRVGTA+VDFY KCG V +A  VFD M +RD+V WNA++ 
Sbjct: 76  AVERGKSIHRSIQGTDLMDDVRVGTAVVDFYCKCGFVEDARCVFDAMSDRDVVLWNAMVY 135

Query: 133 GYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFD 192
           GYVG  CY+EA+LL  EM +  L PNSRT+V LLLAC    ELRLG  +HGYCLRNG+FD
Sbjct: 136 GYVGWGCYEEAMLLVREMGRENLRPNSRTMVALLLACEGASELRLGRGVHGYCLRNGMFD 195

Query: 193 MDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSML 252
            + HV TALIGFY+RFD  V   +F  M VRN+VSWNAMI+GY ++GDY KAL+LF  ML
Sbjct: 196 SNPHVATALIGFYLRFDMRVLPLLFDLMVVRNIVSWNAMISGYYDVGDYFKALELFVQML 255

Query: 253 TEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLE 312
            + +KFD VTML+ +QACAE  SL+LG Q+HQLAIKF F+ DL++LNALLNMYS+NG LE
Sbjct: 256 VDEVKFDCVTMLVAVQACAELGSLKLGKQIHQLAIKFEFVEDLYILNALLNMYSNNGSLE 315

Query: 313 SSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCE 372
           SS  LF +VP  DA LWNSMISAY AFG H EA+ L+I+M+ EG+K+D+RTV IMLS+CE
Sbjct: 316 SSHQLFESVPNRDAPLWNSMISAYAAFGCHEEAMDLFIRMQSEGVKKDERTVVIMLSMCE 375

Query: 373 DLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCN 432
           +L  G + G+ LHAH +KSGM +D  LGNALLSMY E N +++ QK+FD+M+G+D+IS N
Sbjct: 376 ELASGLLKGKSLHAHVIKSGMRIDASLGNALLSMYTELNCVESVQKIFDRMKGVDIISWN 435

Query: 433 TMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKN 492
           TMILALAR+  RA+A ELF  M ESEIK NSYT+IS+LA C+D + L FGRSIHG+ +K+
Sbjct: 436 TMILALARNTLRAQACELFERMRESEIKPNSYTIISILAACEDVTCLDFGRSIHGYVMKH 495

Query: 493 GLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLF 552
            +EIN  L T+L +MY+NC DE +A +LF  CP RDL+SWN+                  
Sbjct: 496 SIEINQPLRTALADMYMNCGDEATARDLFEGCPDRDLISWNA------------------ 555

Query: 553 NHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYAR 612
             MI + EPNSVTII++L+S T LA LP GQ LHAY TRRG S  LD SLANAFITMYAR
Sbjct: 556 --MIXKAEPNSVTIINVLSSFTHLATLPQGQSLHAYVTRRGFSLGLDLSLANAFITMYAR 615

Query: 613 CGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSV 672
           CG +Q AE IF TL  RNI+SWNAMI GYGM+GRG DA LAF+QML+DGF+PN ++FVSV
Sbjct: 616 CGSLQSAENIFKTLPKRNIISWNAMIAGYGMNGRGSDAMLAFSQMLEDGFRPNGVTFVSV 675

Query: 673 LSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEP 732
           LSACSHSG  + GLQLF SMV+DF + P+L HY CIVDLL RGG   EA   I SMP+EP
Sbjct: 676 LSACSHSGFIEMGLQLFHSMVQDFNVTPELVHYSCIVDLLARGGCIDEAREFIDSMPIEP 735

Query: 733 DASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRK 792
           DAS+WRALLSSC+  S+ K  +TIF KL +LEP N GN+VLLSN+YA AGLW EV +IR 
Sbjct: 736 DASVWRALLSSCRAYSDAKQAKTIFEKLDKLEPMNAGNYVLLSNVYATAGLWLEVRRIRT 795

Query: 793 WLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           WL++KGL KPPG SWI++ +QVH F+A D SHPQS++IY  L+ L S +++ G
Sbjct: 796 WLKEKGLRKPPGISWIIVKNQVHCFSAGDRSHPQSDKIYAKLSILLSSMRETG 827

BLAST of CmoCh20G000880.1 vs. TrEMBL
Match: W9RTH5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015292 PE=4 SV=1)

HSP 1 Score: 1015.4 bits (2624), Expect = 4.1e-293
Identity = 503/824 (61.04%), Postives = 628/824 (76.21%), Query Frame = 1

Query: 24  KDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGVRIHSC 83
           KDWN +IKH TK  NDHAILSTYT MESLGIAPD+AT+PLVLKAC RLN +E+G +IH  
Sbjct: 11  KDWNSVIKHHTKFNNDHAILSTYTHMESLGIAPDNATLPLVLKACTRLNDVERGKKIHWS 70

Query: 84  IRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSCYKEA 143
           IR + LI DVRVGT++VDFY KCGLV +A +VFD+M ERD+V WNA+I GYVGC  +++A
Sbjct: 71  IRGTGLIEDVRVGTSVVDFYGKCGLVDDAREVFDKMRERDVVLWNAMIYGYVGCCYFEKA 130

Query: 144 VLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIG 203
           V LF+ MQ  GL PNSRTVV LL  C E+ ELRLG EIHGYCLRNGLFD+D HVGTALIG
Sbjct: 131 VSLFMRMQSEGLKPNSRTVVGLLSTCRELDELRLGQEIHGYCLRNGLFDLDLHVGTALIG 190

Query: 204 FYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGI-KFDAVT 263
           FY RFDA +S  VF  M+V+N VSWNA+ITGY+++G+  +A  LF  +L +G+ KFD++T
Sbjct: 191 FYSRFDARISRLVFDLMDVKNTVSWNAIITGYVDMGENLEACNLFVHLLVDGVNKFDSIT 250

Query: 264 MLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLESSCALFNAVP 323
           +L+V QACAE     LGMQ+HQLAIK+ +  +LF++NALLNMY D   L+ +C LF  VP
Sbjct: 251 VLVVAQACAELGFRNLGMQIHQLAIKYGYRNNLFIVNALLNMYCDCRSLDLACRLFENVP 310

Query: 324 TSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGR 383
             D ALWNSMI AYI +G   EA++L++ MR EG++ED+RT+ IM S C +L DG   G+
Sbjct: 311 NRDVALWNSMIYAYIEYGICDEALSLFVSMRTEGVREDERTIAIMASSCPNLADGVRNGK 370

Query: 384 GLHAHAMKSGMELD-VFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNTMILALARS 443
            LHAHA+KSGME+D V LGNA L MY E N  +AAQK+FD M G DVIS NT+I+ALA +
Sbjct: 371 SLHAHAIKSGMEIDDVSLGNAFLGMYAELNCTEAAQKVFDDMTGPDVISWNTLIMALACN 430

Query: 444 KFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLN 503
           K R +A+ LF  M  +++  NS+T+IS+LA C D + L  GR++HGF IK G+EI+ S N
Sbjct: 431 KLRNEAWNLFEAMRATKMTPNSHTVISILAACDDETCLNIGRAVHGFVIKLGIEIDLSFN 490

Query: 504 TSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEP 563
           T+LT+MY+NC DE +A NLF   P RD++SWN+LI+SY++N+   KA LLF+ MISE+EP
Sbjct: 491 TALTDMYMNCGDEATARNLFENFPDRDVISWNALIASYVRNNQGEKAQLLFSRMISEVEP 550

Query: 564 NSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYARCGKMQYAEK 623
           N VTII++L+SCT LA  P GQCLHA+ TRR  SF  + SLANAF+TMYARCG +Q AEK
Sbjct: 551 NGVTIINMLSSCTHLAARPQGQCLHAFVTRRQASFANNLSLANAFVTMYARCGSVQNAEK 610

Query: 624 IFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSVLSACSHSGL 683
           +F  L  RNI+SWNA+ITGY MHG G D+ LAF QML+DG +PN  +F+++LSAC H G 
Sbjct: 611 VFKLLPRRNIISWNALITGYSMHGCGVDSILAFLQMLEDGMQPNAATFIAILSACRHCGF 670

Query: 684 TKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEPDASIWRALL 743
            + GLQ F  MV +F I P+L HYGC+VDLL RGG   EA   I SMP+E DAS+WRALL
Sbjct: 671 IEKGLQFFQMMVHEFKIKPELVHYGCVVDLLCRGGRLNEAREFIESMPIELDASLWRALL 730

Query: 744 SSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRKWLRDKGLVK 803
           S+C+V S+ KL  TIF KLVELEP N GN++LLSNIYA+AGLW EV +IR WL++KGL K
Sbjct: 731 SACRVNSDTKLAATIFEKLVELEPMNAGNYILLSNIYASAGLWLEVRKIRTWLQEKGLRK 790

Query: 804 PPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
            PG SWIV+ S+VH F A D SHPQS  IYENL SLT+LI++ G
Sbjct: 791 SPGISWIVVRSEVHCFAAGDASHPQSNIIYENLCSLTALIKESG 834

BLAST of CmoCh20G000880.1 vs. TAIR10
Match: AT4G13650.1 (AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 456.1 bits (1172), Expect = 4.8e-128
Identity = 261/823 (31.71%), Postives = 430/823 (52.25%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKAC-GRLNAIEKGVRIHSCI 85
           WN +IK          +   + +M S  + P+  T   VL+AC G   A +   +IH+ I
Sbjct: 154 WNKMIKELASRNLIGEVFGLFVRMVSENVTPNEGTFSGVLEACRGGSVAFDVVEQIHARI 213

Query: 86  RDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSCYKEAV 145
               L     V   L+D YS+ G V  A +VFD +  +D  SW A+ISG     C  EA+
Sbjct: 214 LYQGLRDSTVVCNPLIDLYSRNGFVDLARRVFDGLRLKDHSSWVAMISGLSKNECEAEAI 273

Query: 146 LLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGF 205
            LF +M   G+ P       +L AC ++  L +G ++HG  L+ G F  D +V  AL+  
Sbjct: 274 RLFCDMYVLGIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLG-FSSDTYVCNALVSL 333

Query: 206 YMRFDATVS-HRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTM 265
           Y      +S   +FS+M  R+ V++N +I G    G   KA++LF  M  +G++ D+ T+
Sbjct: 334 YFHLGNLISAEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTL 393

Query: 266 LLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLESSCALFNAVPT 325
             ++ AC+   +L  G QLH    K  F  +  +  ALLN+Y+    +E++   F     
Sbjct: 394 ASLVVACSADGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEV 453

Query: 326 SDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGRG 385
            +  LWN M+ AY        +  ++ +M++E +  ++ T   +L  C  L D  + G  
Sbjct: 454 ENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLEL-GEQ 513

Query: 386 LHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNTMILALARSKF 445
           +H+  +K+  +L+ ++ + L+ MY +  ++D A  +  +  G DV+S  TMI    +  F
Sbjct: 514 IHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNF 573

Query: 446 RAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTS 505
             KA   F  M +  I+ +   + + ++ C     L  G+ IH  A  +G   +     +
Sbjct: 574 DDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNA 633

Query: 506 LTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISE-LEPN 565
           L  +Y  C     +   F +    D ++WN+L+S + ++ N  +AL +F  M  E ++ N
Sbjct: 634 LVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNN 693

Query: 566 SVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYARCGKMQYAEKI 625
           + T  S + + ++ A++  G+ +HA  T+ G  ++ +  + NA I+MYA+CG +  AEK 
Sbjct: 694 NFTFGSAVKAASETANMKQGKQVHAVITKTG--YDSETEVCNALISMYAKCGSISDAEKQ 753

Query: 626 FSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSVLSACSHSGLT 685
           F  +  +N VSWNA+I  Y  HG G +A  +F QM+    +PN+++ V VLSACSH GL 
Sbjct: 754 FLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLV 813

Query: 686 KTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEPDASIWRALLS 745
             G+  F SM  ++G++P+  HY C+VD+L R G  + A   I  MP++PDA +WR LLS
Sbjct: 814 DKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLS 873

Query: 746 SCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRKWLRDKGLVKP 805
           +C V  N ++ E     L+ELEP +   +VLLSN+YA +  W      R+ +++KG+ K 
Sbjct: 874 ACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKE 933

Query: 806 PGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           PG SWI + + +H F   D +HP ++EI+E    LT    ++G
Sbjct: 934 PGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIG 972

BLAST of CmoCh20G000880.1 vs. TAIR10
Match: AT3G63370.1 (AT3G63370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 448.0 bits (1151), Expect = 1.3e-125
Identity = 255/785 (32.48%), Postives = 432/785 (55.03%), Query Frame = 1

Query: 64  VLKACGRLNAIEKGVRIHSCIRDS--DLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPE 123
           VL+ CG+  A+ +G ++HS I  +      D   G  LV  Y KCG + +A KVFDEMP+
Sbjct: 86  VLELCGKRRAVSQGRQLHSRIFKTFPSFELDFLAGK-LVFMYGKCGSLDDAEKVFDEMPD 145

Query: 124 RDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEI 183
           R   +WN +I  YV       A+ L+  M+  G+     +   LL ACA++ ++R G E+
Sbjct: 146 RTAFAWNTMIGAYVSNGEPASALALYWNMRVEGVPLGLSSFPALLKACAKLRDIRSGSEL 205

Query: 184 HGYCLRNGLFDMDAHVGTALIGFYMRFD-ATVSHRVFSSMEVR-NVVSWNAMITGYLNIG 243
           H   ++ G +     +  AL+  Y + D  + + R+F   + + + V WN++++ Y   G
Sbjct: 206 HSLLVKLG-YHSTGFIVNALVSMYAKNDDLSAARRLFDGFQEKGDAVLWNSILSSYSTSG 265

Query: 244 DYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFN-FIGDLFVL 303
              + L+LF  M   G   ++ T++  + AC      +LG ++H   +K +    +L+V 
Sbjct: 266 KSLETLELFREMHMTGPAPNSYTIVSALTACDGFSYAKLGKEIHASVLKSSTHSSELYVC 325

Query: 304 NALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLK 363
           NAL+ MY+  G++  +  +   +  +D   WNS+I  Y+    + EA+  +  M   G K
Sbjct: 326 NALIAMYTRCGKMPQAERILRQMNNADVVTWNSLIKGYVQNLMYKEALEFFSDMIAAGHK 385

Query: 364 EDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQK 423
            D+ ++  +++    L++  + G  LHA+ +K G + ++ +GN L+ MY + N      +
Sbjct: 386 SDEVSMTSIIAASGRLSN-LLAGMELHAYVIKHGWDSNLQVGNTLIDMYSKCNLTCYMGR 445

Query: 424 LFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSD 483
            F +M   D+IS  T+I   A++    +A ELF  + +  ++ +   + S+L        
Sbjct: 446 AFLRMHDKDLISWTTVIAGYAQNDCHVEALELFRDVAKKRMEIDEMILGSILRASSVLKS 505

Query: 484 LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISS 543
           ++  + IH   ++ GL ++T +   L ++Y  CR+ G AT +F     +D+VSW S+ISS
Sbjct: 506 MLIVKEIHCHILRKGL-LDTVIQNELVDVYGKCRNMGYATRVFESIKGKDVVSWTSMISS 565

Query: 544 YIKNDNAGKALLLFNHMISE-LEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFE 603
              N N  +A+ LF  M+   L  +SV ++ IL++   L+ L  G+ +H Y  R+G  F 
Sbjct: 566 SALNGNESEAVELFRRMVETGLSADSVALLCILSAAASLSALNKGREIHCYLLRKG--FC 625

Query: 604 LDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQM 663
           L+ S+A A + MYA CG +Q A+ +F  ++ + ++ + +MI  YGMHG G  A   F +M
Sbjct: 626 LEGSIAVAVVDMYACCGDLQSAKAVFDRIERKGLLQYTSMINAYGMHGCGKAAVELFDKM 685

Query: 664 LDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGH 723
             +   P++ISF+++L ACSH+GL   G      M  ++ + P   HY C+VD+LGR   
Sbjct: 686 RHENVSPDHISFLALLYACSHAGLLDEGRGFLKIMEHEYELEPWPEHYVCLVDMLGRANC 745

Query: 724 FAEAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNI 783
             EA   +  M  EP A +W ALL++C+  S K++ E   ++L+ELEP NPGN VL+SN+
Sbjct: 746 VVEAFEFVKMMKTEPTAEVWCALLAACRSHSEKEIGEIAAQRLLELEPKNPGNLVLVSNV 805

Query: 784 YAAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSL 843
           +A  G W++V ++R  ++  G+ K PG SWI +  +VH FTA D SHP+S+EIYE L+ +
Sbjct: 806 FAEQGRWNDVEKVRAKMKASGMEKHPGCSWIEMDGKVHKFTARDKSHPESKEIYEKLSEV 864

BLAST of CmoCh20G000880.1 vs. TAIR10
Match: AT4G21300.1 (AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 446.0 bits (1146), Expect = 5.0e-125
Identity = 247/785 (31.46%), Postives = 416/785 (52.99%), Query Frame = 1

Query: 61  MPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMP 120
           + L+L+AC   N + +G ++H+ +  + +  D      ++  Y+ CG   +  K+F  + 
Sbjct: 38  LSLLLQACSNPNLLRQGKQVHAFLIVNSISGDSYTDERILGMYAMCGSFSDCGKMFYRLD 97

Query: 121 ER--DLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLG 180
            R   +  WN++IS +V      +A+  + +M   G++P+  T   L+ AC  +   + G
Sbjct: 98  LRRSSIRPWNSIISSFVRNGLLNQALAFYFKMLCFGVSPDVSTFPCLVKACVALKNFK-G 157

Query: 181 HEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAT-VSHRVFSSMEVRNVVSWNAMITGYLN 240
            +     + +   D +  V ++LI  Y+ +    V  ++F  +  ++ V WN M+ GY  
Sbjct: 158 IDFLSDTVSSLGMDCNEFVASSLIKAYLEYGKIDVPSKLFDRVLQKDCVIWNVMLNGYAK 217

Query: 241 IGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFV 300
            G     +K FS M  + I  +AVT   V+  CA    + LG+QLH L +      +  +
Sbjct: 218 CGALDSVIKGFSVMRMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSI 277

Query: 301 LNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGL 360
            N+LL+MYS  GR + +  LF  +  +D   WN MIS Y+  G   E++  + +M   G+
Sbjct: 278 KNSLLSMYSKCGRFDDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEMISSGV 337

Query: 361 KEDKRTVEIML---SLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQID 420
             D  T   +L   S  E+L     + + +H + M+  + LD+FL +AL+  Y +   + 
Sbjct: 338 LPDAITFSSLLPSVSKFENLE----YCKQIHCYIMRHSISLDIFLTSALIDAYFKCRGVS 397

Query: 421 AAQKLFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCK 480
            AQ +F +   +DV+    MI     +     + E+F  + + +I  N  T++S+L +  
Sbjct: 398 MAQNIFSQCNSVDVVVFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIG 457

Query: 481 DGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNS 540
               L  GR +HGF IK G +   ++  ++ +MY  C     A  +F R  +RD+VSWNS
Sbjct: 458 ILLALKLGRELHGFIIKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNS 517

Query: 541 LISSYIKNDNAGKALLLFNHM-ISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRG 600
           +I+   ++DN   A+ +F  M +S +  + V+I + L++C  L     G+ +H +  +  
Sbjct: 518 MITRCAQSDNPSAAIDIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKH- 577

Query: 601 ESFELDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLA 660
            S   D    +  I MYA+CG ++ A  +F T+K +NIVSWN++I   G HG+  D+   
Sbjct: 578 -SLASDVYSESTLIDMYAKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCL 637

Query: 661 FAQMLD-DGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLL 720
           F +M++  G +P+ I+F+ ++S+C H G    G++ F SM  D+GI PQ  HY C+VDL 
Sbjct: 638 FHEMVEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVVDLF 697

Query: 721 GRGGHFAEAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFV 780
           GR G   EA   + SMP  PDA +W  LL +C++  N +L E    KL++L+PSN G +V
Sbjct: 698 GRAGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSGYYV 757

Query: 781 LLSNIYAAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYE 838
           L+SN +A A  W  V+++R  ++++ + K PG SWI I  + H F + DV+HP+S  IY 
Sbjct: 758 LISNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGDVNHPESSHIYS 815

BLAST of CmoCh20G000880.1 vs. TAIR10
Match: AT4G39530.1 (AT4G39530.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 441.4 bits (1134), Expect = 1.2e-123
Identity = 256/776 (32.99%), Postives = 429/776 (55.28%), Query Frame = 1

Query: 80  IHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSC 139
           +H  I    L  D  +   L++ YS+ G +  A KVF++MPER+LVSW+ ++S       
Sbjct: 66  VHGQIIVWGLELDTYLSNILINLYSRAGGMVYARKVFEKMPERNLVSWSTMVSACNHHGI 125

Query: 140 YKEAVLLFIEMQKAGL-TPNSRTVVPLLLACAEMLELR---LGHEIHGYCLRNGLFDMDA 199
           Y+E++++F+E  +    +PN   +   + AC+  L+ R   +  ++  + +++G FD D 
Sbjct: 126 YEESLVVFLEFWRTRKDSPNEYILSSFIQACSG-LDGRGRWMVFQLQSFLVKSG-FDRDV 185

Query: 200 HVGTALIGFYMRFDATVSHR--VFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLT 259
           +VGT LI FY++ D  + +   VF ++  ++ V+W  MI+G + +G    +L+LF  ++ 
Sbjct: 186 YVGTLLIDFYLK-DGNIDYARLVFDALPEKSTVTWTTMISGCVKMGRSYVSLQLFYQLME 245

Query: 260 EGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLES 319
           + +  D   +  V+ AC+    L+ G Q+H   +++    D  ++N L++ Y   GR+ +
Sbjct: 246 DNVVPDGYILSTVLSACSILPFLEGGKQIHAHILRYGLEMDASLMNVLIDSYVKCGRVIA 305

Query: 320 SCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCED 379
           +  LFN +P  +   W +++S Y     H EA+ L+  M   GLK D      +L+ C  
Sbjct: 306 AHKLFNGMPNKNIISWTTLLSGYKQNALHKEAMELFTSMSKFGLKPDMYACSSILTSCAS 365

Query: 380 LNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNT 439
           L+    +G  +HA+ +K+ +  D ++ N+L+ MY + + +  A+K+FD     DV+  N 
Sbjct: 366 LHALG-FGTQVHAYTIKANLGNDSYVTNSLIDMYAKCDCLTDARKVFDIFAAADVVLFNA 425

Query: 440 MILALARSKFR---AKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAI 499
           MI   +R   +    +A  +F  M    I+ +  T +SLL      + L   + IHG   
Sbjct: 426 MIEGYSRLGTQWELHEALNIFRDMRFRLIRPSLLTFVSLLRASASLTSLGLSKQIHGLMF 485

Query: 500 KNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALL 559
           K GL ++    ++L ++Y NC     +  +F     +DLV WNS+ + Y++     +AL 
Sbjct: 486 KYGLNLDIFAGSALIDVYSNCYCLKDSRLVFDEMKVKDLVIWNSMFAGYVQQSENEEALN 545

Query: 560 LFNHM-ISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITM 619
           LF  + +S   P+  T  +++T+   LA + LGQ  H    +RG   E +  + NA + M
Sbjct: 546 LFLELQLSRERPDEFTFANMVTAAGNLASVQLGQEFHCQLLKRG--LECNPYITNALLDM 605

Query: 620 YARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISF 679
           YA+CG  + A K F +  +R++V WN++I+ Y  HG G  A     +M+ +G +PN I+F
Sbjct: 606 YAKCGSPEDAHKAFDSAASRDVVCWNSVISSYANHGEGKKALQMLEKMMSEGIEPNYITF 665

Query: 680 VSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMP 739
           V VLSACSH+GL + GL+ F  M+R FGI P+  HY C+V LLGR G   +A  LI  MP
Sbjct: 666 VGVLSACSHAGLVEDGLKQFELMLR-FGIEPETEHYVCMVSLLGRAGRLNKARELIEKMP 725

Query: 740 VEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQ 799
            +P A +WR+LLS C    N +L E      +  +P + G+F +LSNIYA+ G+W+E  +
Sbjct: 726 TKPAAIVWRSLLSGCAKAGNVELAEHAAEMAILSDPKDSGSFTMLSNIYASKGMWTEAKK 785

Query: 800 IRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           +R+ ++ +G+VK PG SWI I  +VH F + D SH ++ +IYE L+ L  L+Q  G
Sbjct: 786 VRERMKVEGVVKEPGRSWIGINKEVHIFLSKDKSHCKANQIYEVLDDL--LVQIRG 832

BLAST of CmoCh20G000880.1 vs. TAIR10
Match: AT3G02330.1 (AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 433.0 bits (1112), Expect = 4.4e-121
Identity = 258/781 (33.03%), Postives = 417/781 (53.39%), Query Frame = 1

Query: 82  SCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALISGYVGCSCYK 141
           S + D   +RDV     +++ YSK   + +A+  F+ MP RD+VSWN+++SGY+      
Sbjct: 103 SMVFDKMPLRDVVSWNKMINGYSKSNDMFKANSFFNMMPVRDVVSWNSMLSGYLQNGESL 162

Query: 142 EAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTAL 201
           +++ +F++M + G+  + RT   +L  C+ + +  LG +IHG  +R G  D D    +AL
Sbjct: 163 KSIEVFVDMGREGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGC-DTDVVAASAL 222

Query: 202 IGFYMRFDATV-SHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDA 261
           +  Y +    V S RVF  +  +N VSW+A+I G +     + ALK F  M         
Sbjct: 223 LDMYAKGKRFVESLRVFQGIPEKNSVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQ 282

Query: 262 VTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLESSCALFNA 321
                V+++CA    L+LG QLH  A+K +F  D  V  A L+MY+    ++ +  LF+ 
Sbjct: 283 SIYASVLRSCAALSELRLGGQLHAHALKSDFAADGIVRTATLDMYAKCDNMQDAQILFDN 342

Query: 322 VPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIW 381
               +   +N+MI+ Y       +A+ L+ ++   GL  D+ ++  +   C  L  G   
Sbjct: 343 SENLNRQSYNAMITGYSQEEHGFKALLLFHRLMSSGLGFDEISLSGVFRACA-LVKGLSE 402

Query: 382 GRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNTMILALAR 441
           G  ++  A+KS + LDV + NA + MY +   +  A ++FD+MR  D +S N +I A  +
Sbjct: 403 GLQIYGLAIKSSLSLDVCVANAAIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQ 462

Query: 442 SKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSL 501
           +    +   LF++M  S I+ + +T  S+L  C  GS L +G  IH   +K+G+  N+S+
Sbjct: 463 NGKGYETLFLFVSMLRSRIEPDEFTFGSILKACTGGS-LGYGMEIHSSIVKSGMASNSSV 522

Query: 502 NTSLTEMYINCRDEGSATNLFIRCPQRD--------------------LVSWNSLISSYI 561
             SL +MY  C     A  +  R  QR                      VSWNS+IS Y+
Sbjct: 523 GCSLIDMYSKCGMIEEAEKIHSRFFQRANVSGTMEELEKMHNKRLQEMCVSWNSIISGYV 582

Query: 562 KNDNAGKALLLFNHMISE-LEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELD 621
             + +  A +LF  M+   + P+  T  ++L +C  LA   LG+ +HA   ++    + D
Sbjct: 583 MKEQSEDAQMLFTRMMEMGITPDKFTYATVLDTCANLASAGLGKQIHAQVIKK--ELQSD 642

Query: 622 ASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLD 681
             + +  + MY++CG +  +  +F     R+ V+WNAMI GY  HG+G +A   F +M+ 
Sbjct: 643 VYICSTLVDMYSKCGDLHDSRLMFEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMIL 702

Query: 682 DGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFA 741
           +  KPN+++F+S+L AC+H GL   GL+ F  M RD+G+ PQL HY  +VD+LG+ G   
Sbjct: 703 ENIKPNHVTFISILRACAHMGLIDKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVK 762

Query: 742 EAIALISSMPVEPDASIWRALLSSCQV-KSNKKLVETIFRKLVELEPSNPGNFVLLSNIY 801
            A+ LI  MP E D  IWR LL  C + ++N ++ E     L+ L+P +   + LLSN+Y
Sbjct: 763 RALELIREMPFEADDVIWRTLLGVCTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVY 822

Query: 802 AAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLT 840
           A AG+W +VS +R+ +R   L K PG SW+ +  ++H F   D +HP+ EEIYE L  + 
Sbjct: 823 ADAGMWEKVSDLRRNMRGFKLKKEPGCSWVELKDELHVFLVGDKAHPRWEEIYEELGLIY 878

BLAST of CmoCh20G000880.1 vs. NCBI nr
Match: gi|659097605|ref|XP_008449715.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo])

HSP 1 Score: 1474.1 bits (3815), Expect = 0.0e+00
Identity = 731/847 (86.30%), Postives = 781/847 (92.21%), Query Frame = 1

Query: 1   MEISATLSIHGRSPTPKQA--INVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDS 60
           MEI+  LS HG S TPKQ   INVSKDWN IIKH TKLKNDHAILSTYTQMESLGI PDS
Sbjct: 1   MEIAVNLSYHGLSSTPKQTHLINVSKDWNSIIKHHTKLKNDHAILSTYTQMESLGITPDS 60

Query: 61  ATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDE 120
           ATMPLVLKACGRLNAI+KGVRIHSCIR SDLI DVRVGTALVDFY KCGLV EASKVF E
Sbjct: 61  ATMPLVLKACGRLNAIDKGVRIHSCIRGSDLINDVRVGTALVDFYCKCGLVAEASKVFVE 120

Query: 121 MPERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLG 180
           MPERDLVSWNALISGYVGC CYKEAVLLF+EM+KAGLTPNSRTVV LLLAC EMLELRLG
Sbjct: 121 MPERDLVSWNALISGYVGCLCYKEAVLLFVEMKKAGLTPNSRTVVALLLACGEMLELRLG 180

Query: 181 HEIHGYCLRNGLFDMDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNI 240
            EIHGYCLRNGLFDMDA+VGTAL+GFY+RFDA +SHRVFS M VRN+VSWNA+ITG+LN+
Sbjct: 181 QEIHGYCLRNGLFDMDAYVGTALVGFYLRFDAVLSHRVFSLMVVRNIVSWNAIITGFLNV 240

Query: 241 GDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVL 300
           GDYTKALKLFSSML EGIKFDAVTML+VIQACAE   L+LGMQLHQLAIKFN I D+FVL
Sbjct: 241 GDYTKALKLFSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLHQLAIKFNLINDVFVL 300

Query: 301 NALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLK 360
           NALLNMYSDNG LESSC LFNAVPTSDAALWNSMIS YI FGFHAEAIAL+IKMRLE +K
Sbjct: 301 NALLNMYSDNGSLESSCVLFNAVPTSDAALWNSMISCYIGFGFHAEAIALFIKMRLERIK 360

Query: 361 EDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQK 420
           ED RT+ IMLSLC DLNDGS+WGRGLHAHAMKSG+ELD FLGNALLSMYV+HNQI+AAQ 
Sbjct: 361 EDVRTIVIMLSLCNDLNDGSLWGRGLHAHAMKSGIELDAFLGNALLSMYVKHNQINAAQN 420

Query: 421 LFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSD 480
           +F+K RGLDVIS NTMI ALA+S FRAKAFELF  MCESEIKFNSYT+ISLLALCKDG+D
Sbjct: 421 VFEKTRGLDVISWNTMISALAQSMFRAKAFELFFMMCESEIKFNSYTIISLLALCKDGND 480

Query: 481 LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISS 540
           LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINC DE +A ++F RCPQRDL+SWNSLI S
Sbjct: 481 LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAAIDMFTRCPQRDLISWNSLILS 540

Query: 541 YIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFEL 600
           YIKNDNAGKALLLFNHMISELEPNSVTII+ILTSCTQLAHLPLGQCLHAY TRR ES E+
Sbjct: 541 YIKNDNAGKALLLFNHMISELEPNSVTIINILTSCTQLAHLPLGQCLHAYATRREESLEM 600

Query: 601 DASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQML 660
           DASLANAFITMYARCGKMQYAE+IF TL+ RNIVSWNAMITGYGMHGRG DATLAFAQML
Sbjct: 601 DASLANAFITMYARCGKMQYAEQIFRTLQTRNIVSWNAMITGYGMHGRGRDATLAFAQML 660

Query: 661 DDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHF 720
           DDGFKPNN+SF SVLSACSHSGLT+TGL LF SMVRDFG+APQL HYGC+VDLLGRGGHF
Sbjct: 661 DDGFKPNNVSFASVLSACSHSGLTETGLLLFHSMVRDFGLAPQLTHYGCMVDLLGRGGHF 720

Query: 721 AEAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIY 780
           +EAIA I++MP+EPDASIWRALLSS Q+KSNKKL+ETIF KLVELEPSNPGNF+LLSNIY
Sbjct: 721 SEAIAFINTMPIEPDASIWRALLSSWQIKSNKKLLETIFGKLVELEPSNPGNFILLSNIY 780

Query: 781 AAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLT 840
           AAAGLWSEV QIRKWLR++GL KPPGTSWIVIG+QVHYFTATDV HPQSE+IYENLNSLT
Sbjct: 781 AAAGLWSEVVQIRKWLRERGLGKPPGTSWIVIGNQVHYFTATDVLHPQSEKIYENLNSLT 840

Query: 841 SLIQDMG 846
           SLIQDMG
Sbjct: 841 SLIQDMG 847

BLAST of CmoCh20G000880.1 vs. NCBI nr
Match: gi|449448940|ref|XP_004142223.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Cucumis sativus])

HSP 1 Score: 1459.9 bits (3778), Expect = 0.0e+00
Identity = 726/847 (85.71%), Postives = 776/847 (91.62%), Query Frame = 1

Query: 1   MEISATLSIHGRSPTPKQA--INVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDS 60
           MEI+  LS HG S TP+Q   +NVSKDWN IIKH TKLKNDHAILSTYTQMESLGI PDS
Sbjct: 1   MEIAVNLSFHGLSSTPEQTHLVNVSKDWNSIIKHHTKLKNDHAILSTYTQMESLGITPDS 60

Query: 61  ATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDE 120
           ATMPLVLKACGRLNAI  GVRIHS IR  DLI DVRVGTALVDFY KCGLV EASKVF E
Sbjct: 61  ATMPLVLKACGRLNAIGNGVRIHSFIRGLDLINDVRVGTALVDFYCKCGLVAEASKVFVE 120

Query: 121 MPERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLG 180
           MPERDLVSWNALISGYVGC CYKEAVLLF+EM+KAGLTPNSRTVV LLLAC EMLELRLG
Sbjct: 121 MPERDLVSWNALISGYVGCLCYKEAVLLFVEMKKAGLTPNSRTVVALLLACGEMLELRLG 180

Query: 181 HEIHGYCLRNGLFDMDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNI 240
            EIHGYCLRNGLFDMDA+VGTAL+GFYMRFDA +SHRVFS M VRN+VSWNA+ITG+LN+
Sbjct: 181 QEIHGYCLRNGLFDMDAYVGTALVGFYMRFDAVLSHRVFSLMLVRNIVSWNAIITGFLNV 240

Query: 241 GDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVL 300
           GD  KALKL+SSML EGIKFDAVTML+VIQACAE   L+LGMQLHQLAIKFN I DLF+L
Sbjct: 241 GDCAKALKLYSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLHQLAIKFNLINDLFIL 300

Query: 301 NALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLK 360
           NALLNMYSDNG LESS ALFNAVPTSDAALWNSMIS+YI FGFHAEAIAL+IKMRLE +K
Sbjct: 301 NALLNMYSDNGSLESSWALFNAVPTSDAALWNSMISSYIGFGFHAEAIALFIKMRLERIK 360

Query: 361 EDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQK 420
           ED RT+ IMLSLC DLNDGSIWGRGLHAHAMKSG+ELD +LGNALLSMYV+HNQI AAQ 
Sbjct: 361 EDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMKSGIELDAYLGNALLSMYVKHNQITAAQY 420

Query: 421 LFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSD 480
           +F+KMRGLDVIS NTMI A A+S FRAKAFELF+ MCESEIKFNSYT++SLLA CKDGSD
Sbjct: 421 VFEKMRGLDVISWNTMISAFAQSMFRAKAFELFLMMCESEIKFNSYTIVSLLAFCKDGSD 480

Query: 481 LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISS 540
           LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINC DE +ATN+F RCPQRDLVSWNSLISS
Sbjct: 481 LVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAATNMFTRCPQRDLVSWNSLISS 540

Query: 541 YIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFEL 600
           YIKNDNAGKALLLFNHMISELEPNSVTII+ILTSCTQLAHLPLGQCLHAYTTRR  S E+
Sbjct: 541 YIKNDNAGKALLLFNHMISELEPNSVTIINILTSCTQLAHLPLGQCLHAYTTRREVSLEM 600

Query: 601 DASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQML 660
           DASLANAFITMYARCGK+QYAEKIF TL+ R+IVSWNAMITGYGMHGRG DATLAFAQML
Sbjct: 601 DASLANAFITMYARCGKLQYAEKIFCTLQTRSIVSWNAMITGYGMHGRGRDATLAFAQML 660

Query: 661 DDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHF 720
           DDGFKPNN+SF SVLSACSHSGLT TGLQLF SMVRDFGIAPQL HYGC+VDLLGRGGHF
Sbjct: 661 DDGFKPNNVSFASVLSACSHSGLTVTGLQLFHSMVRDFGIAPQLTHYGCMVDLLGRGGHF 720

Query: 721 AEAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIY 780
           +EAIA I+SMP+EPDASIWRALLSSCQ+KSN KL+ETIF KLVELEPSNPGNF+LLSNIY
Sbjct: 721 SEAIAFINSMPIEPDASIWRALLSSCQIKSNNKLLETIFGKLVELEPSNPGNFILLSNIY 780

Query: 781 AAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLT 840
           AAAGLWSEV QIRKWLR++GL KPPGTSWIVIG+QVH+FTATDV HPQSE IYENLNSLT
Sbjct: 781 AAAGLWSEVVQIRKWLRERGLGKPPGTSWIVIGNQVHHFTATDVLHPQSERIYENLNSLT 840

Query: 841 SLIQDMG 846
           SLI+D+G
Sbjct: 841 SLIRDLG 847

BLAST of CmoCh20G000880.1 vs. NCBI nr
Match: gi|1009118471|ref|XP_015875878.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Ziziphus jujuba])

HSP 1 Score: 1068.9 bits (2763), Expect = 4.5e-309
Identity = 528/846 (62.41%), Postives = 659/846 (77.90%), Query Frame = 1

Query: 1   MEISATLSIHGRSPTPKQ-AINVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSA 60
           M+I +TLS+      P Q  I   KDWNLIIKH TKL NDHAIL+TYT MESLGI  D++
Sbjct: 1   MDIPSTLSLKNLPLLPNQFRITDPKDWNLIIKHHTKLNNDHAILTTYTHMESLGIPADTS 60

Query: 61  TMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEM 120
           T+PLVLKAC RLN +++G RIHS I ++ L  DVRVGTALVDFY +CGL+ +A KVF ++
Sbjct: 61  TLPLVLKACARLNDVDRGRRIHSSIWNTGLSCDVRVGTALVDFYCRCGLIDDARKVFAQI 120

Query: 121 PERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGH 180
             RD+V WNALI GYVGC  ++EA+ L IEM++ GL PNSRTVV LLLAC E+LELRLG 
Sbjct: 121 GVRDVVLWNALIYGYVGCCYFEEAIWLLIEMEREGLKPNSRTVVALLLACREILELRLGQ 180

Query: 181 EIHGYCLRNGLFDMDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNIG 240
           EIHGYC+RNGLFD+D HVGTALIGFY+RFD  +SH VF  M  RN VSWNA+ITGY+  G
Sbjct: 181 EIHGYCVRNGLFDLDPHVGTALIGFYLRFDVRISHIVFDLMVARNTVSWNAIITGYVENG 240

Query: 241 DYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLN 300
           ++  A KLF  ML + +KFD+VT++ +IQACAE   L+LGMQ+HQ+AIK  +  +LFV N
Sbjct: 241 EHLTAWKLFMRMLVDRVKFDSVTVIAIIQACAELGFLELGMQMHQMAIKSGYSNNLFVAN 300

Query: 301 ALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKE 360
           ALLNMYS++G  E SC LF+ +P  D ALWNSMI AYI +GF+ EA+ L++ M++ G+++
Sbjct: 301 ALLNMYSESGSFELSCQLFDTIPKYDVALWNSMIYAYIGYGFYEEAMFLFLNMQVFGIRD 360

Query: 361 DKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKL 420
           D+RT+ IMLSLC +L DG   G+ LHAHA+K GMELDV LGNA L MY E N I+AA+K+
Sbjct: 361 DERTIAIMLSLCANLADGMGMGKSLHAHAIKRGMELDVSLGNAFLGMYAEQNCIEAARKV 420

Query: 421 FDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDL 480
           F +++G DVIS NT+I+ALA +K R +A+  F  +  S+IK NS+T+ISLLA C D + L
Sbjct: 421 FTEIKGPDVISWNTLIMALACNKLRNEAWNHFEEIQASKIKPNSHTIISLLAACDDETCL 480

Query: 481 VFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSY 540
             GR+IHGFA+K+ ++I+ SLNT+LT+MY+NC DE +A +LF  CP RD++SWN+LISSY
Sbjct: 481 NSGRAIHGFAVKHDIQIDLSLNTALTDMYMNCGDEATARSLFEACPNRDVISWNALISSY 540

Query: 541 IKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELD 600
           IK +   KA  LFN MISE+EPNSVTII+IL+S T LA LP GQCLHAY TRR  SF +D
Sbjct: 541 IKKNEGKKAQELFNRMISEVEPNSVTIINILSSYTNLAALPQGQCLHAYITRRHSSFGVD 600

Query: 601 ASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLD 660
            SLANAF+TMYARCG MQYAEK+F  L  +NI+SWNA+ITGYGMHGR +DA  AF QML+
Sbjct: 601 VSLANAFVTMYARCGSMQYAEKMFKNLPRKNIISWNALITGYGMHGRAYDAIFAFLQMLE 660

Query: 661 DGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFA 720
           DG KPN  +FV+VLSAC H GL + GL LF +M+++F I P+L HYGC+VDLL RGG   
Sbjct: 661 DGLKPNGATFVAVLSACRHFGLIEEGLLLFHTMIQEFKITPELVHYGCVVDLLCRGGRIN 720

Query: 721 EAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYA 780
           EA   I SMP++PDA++WRALLS+C+V S+ +L  TIF KLVE+EP N GN+VLLSNIYA
Sbjct: 721 EAKEFIESMPIKPDATLWRALLSACRVNSDIELAGTIFEKLVEIEPMNAGNYVLLSNIYA 780

Query: 781 AAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTS 840
           AAGLWSEV ++RKWL++KGL KPPG SWIV+ SQVHYFTA DVSHPQS  IYENL SL +
Sbjct: 781 AAGLWSEVRKVRKWLQEKGLRKPPGMSWIVVRSQVHYFTAGDVSHPQSHIIYENLYSLLA 840

Query: 841 LIQDMG 846
           LI++ G
Sbjct: 841 LIKENG 846

BLAST of CmoCh20G000880.1 vs. NCBI nr
Match: gi|731392814|ref|XP_010651228.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g33680-like [Vitis vinifera])

HSP 1 Score: 1064.7 bits (2752), Expect = 8.4e-308
Identity = 527/833 (63.27%), Postives = 647/833 (77.67%), Query Frame = 1

Query: 13  SPTPKQAINVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLN 72
           SPT  Q I   K WN +IKHQ  LKND AILS YTQMESLG+ P++ T+PLVLKAC   N
Sbjct: 16  SPTKIQ-IKDPKHWNSVIKHQANLKNDQAILSAYTQMESLGVLPNNTTLPLVLKACAAQN 75

Query: 73  AIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFDEMPERDLVSWNALIS 132
           A+E+G  IH  I+ +DL+ DVRVGTA+VDFY KCG V +A  VFD M +RD+V WNA++ 
Sbjct: 76  AVERGKSIHRSIQGTDLMDDVRVGTAVVDFYCKCGFVEDARCVFDAMSDRDVVLWNAMVY 135

Query: 133 GYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFD 192
           GYVG  CY+EA+LL  EM +  L PNSRT+V LLLAC    ELRLG  +HGYCLRNG+FD
Sbjct: 136 GYVGWGCYEEAMLLVREMGRENLRPNSRTMVALLLACEGASELRLGRGVHGYCLRNGMFD 195

Query: 193 MDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSML 252
            + HV TALIGFY+RFD  V   +F  M VRN+VSWNAMI+GY ++GDY KAL+LF  ML
Sbjct: 196 SNPHVATALIGFYLRFDMRVLPLLFDLMVVRNIVSWNAMISGYYDVGDYFKALELFVQML 255

Query: 253 TEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLE 312
            + +KFD VTML+ +QACAE  SL+LG Q+HQLAIKF F+ DL++LNALLNMYS+NG LE
Sbjct: 256 VDEVKFDCVTMLVAVQACAELGSLKLGKQIHQLAIKFEFVEDLYILNALLNMYSNNGSLE 315

Query: 313 SSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCE 372
           SS  LF +VP  DA LWNSMISAY AFG H EA+ L+I+M+ EG+K+D+RTV IMLS+CE
Sbjct: 316 SSHQLFESVPNRDAPLWNSMISAYAAFGCHEEAMDLFIRMQSEGVKKDERTVVIMLSMCE 375

Query: 373 DLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCN 432
           +L  G + G+ LHAH +KSGM +D  LGNALLSMY E N +++ QK+FD+M+G+D+IS N
Sbjct: 376 ELASGLLKGKSLHAHVIKSGMRIDASLGNALLSMYTELNCVESVQKIFDRMKGVDIISWN 435

Query: 433 TMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKN 492
           TMILALAR+  RA+A ELF  M ESEIK NSYT+IS+LA C+D + L FGRSIHG+ +K+
Sbjct: 436 TMILALARNTLRAQACELFERMRESEIKPNSYTIISILAACEDVTCLDFGRSIHGYVMKH 495

Query: 493 GLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLF 552
            +EIN  L T+L +MY+NC DE +A +LF  CP RDL+SWN++I+SY+KN+ A KALLLF
Sbjct: 496 SIEINQPLRTALADMYMNCGDEATARDLFEGCPDRDLISWNAMIASYVKNNQAHKALLLF 555

Query: 553 NHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYAR 612
           + MISE EPNSVTII++L+S T LA LP GQ LHAY TRRG S  LD SLANAFITMYAR
Sbjct: 556 HRMISEAEPNSVTIINVLSSFTHLATLPQGQSLHAYVTRRGFSLGLDLSLANAFITMYAR 615

Query: 613 CGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSV 672
           CG +Q AE IF TL  RNI+SWNAMI GYGM+GRG DA LAF+QML+DGF+PN ++FVSV
Sbjct: 616 CGSLQSAENIFKTLPKRNIISWNAMIAGYGMNGRGSDAMLAFSQMLEDGFRPNGVTFVSV 675

Query: 673 LSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEP 732
           LSACSHSG  + GLQLF SMV+DF + P+L HY CIVDLL RGG   EA   I SMP+EP
Sbjct: 676 LSACSHSGFIEMGLQLFHSMVQDFNVTPELVHYSCIVDLLARGGCIDEAREFIDSMPIEP 735

Query: 733 DASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRK 792
           DAS+WRALLSSC+  S+ K  +TIF KL +LEP N GN+VLLSN+YA AGLW EV +IR 
Sbjct: 736 DASVWRALLSSCRAYSDAKQAKTIFEKLDKLEPMNAGNYVLLSNVYATAGLWLEVRRIRT 795

Query: 793 WLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 846
           WL++KGL KPPG SWI++ +QVH F+A D SHPQS++IY  L+ L S +++ G
Sbjct: 796 WLKEKGLRKPPGISWIIVKNQVHCFSAGDRSHPQSDKIYAKLSILLSSMRETG 847

BLAST of CmoCh20G000880.1 vs. NCBI nr
Match: gi|645272538|ref|XP_008241445.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Prunus mume])

HSP 1 Score: 1058.5 bits (2736), Expect = 6.0e-306
Identity = 531/848 (62.62%), Postives = 656/848 (77.36%), Query Frame = 1

Query: 1   MEISATLSIHG---RSPTPKQAINVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPD 60
           M+I  +LS+     + P PK       DWNLIIKH  +LKNDHAILSTYTQMESLG+APD
Sbjct: 1   MDIPVSLSLPNLPIKEPKPK-------DWNLIIKHHAELKNDHAILSTYTQMESLGVAPD 60

Query: 61  SATMPLVLKACGRLNAIEKGVRIHSCIRDSDLIRDVRVGTALVDFYSKCGLVGEASKVFD 120
           + ++PLVLKAC RL+A+E+G  IHS IR++ L++DVR+GTALVDFYSK GL+ +A +VFD
Sbjct: 61  NISLPLVLKACARLSAVERGKGIHSSIRNTGLMKDVRIGTALVDFYSKGGLIDDAVEVFD 120

Query: 121 EMPERDLVSWNALISGYVGCSCYKEAVLLFIEMQKAGLTPNSRTVVPLLLACAEMLELRL 180
           EM ERDLV WNALI GYV C CYKEA+ LF++MQ  GL PNSRTVV LL AC E+ ELR 
Sbjct: 121 EMRERDLVLWNALIHGYVRCCCYKEAISLFMQMQNEGLKPNSRTVVALLSACREVSELRS 180

Query: 181 GHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDATVSHRVFSSMEVRNVVSWNAMITGYLN 240
           G EIHGY LRNGLFD+DAHVGTALIGFY+RFD   +  +F SM VRN+VSWNA+ITGY+ 
Sbjct: 181 GQEIHGYALRNGLFDLDAHVGTALIGFYLRFDIKTTRLMFDSMVVRNIVSWNAIITGYVE 240

Query: 241 IGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFIGDLFV 300
           IG+Y  ALKLF  ML +G+K D V+ML+VIQACA   S++LG Q+HQ+AIK ++  DLF+
Sbjct: 241 IGEYLMALKLFVQMLVDGVKSDYVSMLVVIQACAGIGSIELGRQIHQMAIKNSYSDDLFI 300

Query: 301 LNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGL 360
           +NALLNMYS+ G  E S  LF  V + D ALWNSMISA I +GF+ EA++L+ KMR+EG+
Sbjct: 301 VNALLNMYSECGCFELSRKLFEFVSSRDVALWNSMISACIEYGFYEEALSLFSKMRMEGI 360

Query: 361 KEDKRTVEIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQ 420
           +ED+RT+ IMLS+CEDL DG   G+ LHA A KSGM++D  LGN LLSMY E N +++ Q
Sbjct: 361 REDERTIVIMLSVCEDLADGLRNGKSLHALARKSGMKMDASLGNTLLSMYAEFNCVESVQ 420

Query: 421 KLFDKMRGLDVISCNTMILALARSKFRAKAFELFMTMCESEIKFNSYTMISLLALCKDGS 480
           K+F +M+  DVIS NT+I ALA +  + +A+++F  M ES+ K NS+T+IS+LA C+D +
Sbjct: 421 KVFAEMKCSDVISWNTLIRALACNGLQDEAWKIFGVMRESDTKPNSHTIISILATCEDET 480

Query: 481 DLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCRDEGSATNLFIRCPQRDLVSWNSLIS 540
            +   R+IHGF IK+G+E + SLNT+LT+MY+NC DE +A  LF  CP RD++SWN+LI+
Sbjct: 481 CINIVRAIHGFVIKHGIEADLSLNTALTDMYMNCGDEAAARTLFEGCPSRDVISWNALIA 540

Query: 541 SYIKNDNAGKALLLFNHMISELEPNSVTIISILTSCTQLAHLPLGQCLHAYTTRRGESFE 600
           SYIKN+  GKA LLFN M+SE+ PNSVTII+IL+SCTQLA LPLGQCLHAY  RR  SF 
Sbjct: 541 SYIKNNEIGKAQLLFNRMVSEVNPNSVTIINILSSCTQLASLPLGQCLHAYANRRQFSFG 600

Query: 601 LDASLANAFITMYARCGKMQYAEKIFSTLKARNIVSWNAMITGYGMHGRGHDATLAFAQM 660
            D SLANAFI+MYAR G MQ AEKIF  L  RN++SWNA+ITGY MHG GHDA  AF QM
Sbjct: 601 FDLSLANAFISMYARSGSMQNAEKIFKILPKRNVISWNALITGYSMHGHGHDAIHAFLQM 660

Query: 661 LDDGFKPNNISFVSVLSACSHSGLTKTGLQLFSSMVRDFGIAPQLAHYGCIVDLLGRGGH 720
           L+DGF+PN  +FV+VLSAC HSGL + GLQLF +MVRDF I+P+L HYGC+VDLLGR G 
Sbjct: 661 LEDGFRPNGATFVAVLSACRHSGLIEMGLQLFHTMVRDFKISPELVHYGCVVDLLGRAGR 720

Query: 721 FAEAIALISSMPVEPDASIWRALLSSCQVKSNKKLVETIFRKLVELEPSNPGNFVLLSNI 780
             E    I SMPVE DAS+WRALL++C++ S  KL   IF KLVELEP N GN+VL+SNI
Sbjct: 721 LDEGREFIESMPVEADASVWRALLNACRLHSATKLAGAIFEKLVELEPMNAGNYVLISNI 780

Query: 781 YAAAGLWSEVSQIRKWLRDKGLVKPPGTSWIVIGSQVHYFTATDVSHPQSEEIYENLNSL 840
           YAAAGLW EV QIR  LR+KGL KPPG SWIV+ SQVH F A D SH QS+ IY +LNSL
Sbjct: 781 YAAAGLWMEVRQIRTRLREKGLEKPPGVSWIVVQSQVHCFVAGDTSHLQSDMIYASLNSL 840

Query: 841 TSLIQDMG 846
           ++LI++ G
Sbjct: 841 STLIKESG 841

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP307_ARATH8.6e-12731.71Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana GN... [more]
PP296_ARATH2.3e-12432.48Pentatricopeptide repeat-containing protein At3g63370, chloroplastic OS=Arabidop... [more]
PP333_ARATH8.9e-12431.46Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana GN... [more]
PP357_ARATH2.2e-12232.99Pentatricopeptide repeat-containing protein At4g39530 OS=Arabidopsis thaliana GN... [more]
PP207_ARATH7.8e-12033.03Pentatricopeptide repeat-containing protein At3g02330 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A061GS93_THECC2.7e-30561.07Pentatricopeptide repeat-containing protein OS=Theobroma cacao GN=TCM_040700 PE=... [more]
A0A0D2N9C6_GOSRA7.4e-30361.30Uncharacterized protein OS=Gossypium raimondii GN=B456_001G138400 PE=4 SV=1[more]
A0A067JE40_JATCU1.6e-29761.60Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25923 PE=4 SV=1[more]
A5BC97_VITVI3.9e-29661.58Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043633 PE=4 SV=1[more]
W9RTH5_9ROSA4.1e-29361.04Uncharacterized protein OS=Morus notabilis GN=L484_015292 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G13650.14.8e-12831.71 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G63370.11.3e-12532.48 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G21300.15.0e-12531.46 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G39530.11.2e-12332.99 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G02330.14.4e-12133.03 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659097605|ref|XP_008449715.1|0.0e+0086.30PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis m... [more]
gi|449448940|ref|XP_004142223.1|0.0e+0085.71PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Cucum... [more]
gi|1009118471|ref|XP_015875878.1|4.5e-30962.41PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Ziziphus ... [more]
gi|731392814|ref|XP_010651228.1|8.4e-30863.27PREDICTED: pentatricopeptide repeat-containing protein At2g33680-like [Vitis vin... [more]
gi|645272538|ref|XP_008241445.1|6.0e-30662.62PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Prunus mu... [more]
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh20G000880CmoCh20G000880gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh20G000880.1CmoCh20G000880.1-proteinpolypeptide


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh20G000880.1.exon.1CmoCh20G000880.1.exon.1exon


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh20G000880.1.CDS.1CmoCh20G000880.1.CDS.1CDS


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 329..357
score: 1.5E-4coord: 530..557
score: 1.5E-5coord: 401..426
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 630..677
score: 1.1E-8coord: 224..271
score: 1.5E-11coord: 427..474
score: 1.6E-7coord: 123..169
score: 3.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 97..125
score: 0.002coord: 530..557
score: 7.3E-5coord: 125..158
score: 2.4E-6coord: 226..259
score: 1.7E-7coord: 632..665
score: 8.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 599..629
score: 7.541coord: 431..461
score: 5.24coord: 630..664
score: 11.301coord: 701..731
score: 5.985coord: 224..258
score: 11.575coord: 294..324
score: 7.125coord: 259..293
score: 5.667coord: 733..763
score: 5.229coord: 123..157
score: 11.597coord: 665..700
score: 8.342coord: 325..359
score: 9.767coord: 57..91
score: 6.138coord: 92..122
score: 8.374coord: 767..801
score: 6.533coord: 22..56
score: 6.182coord: 396..430
score: 8.747coord: 528..558
score: 9
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 230..356
score: 3.5E-8coord: 731..768
score: 3.5E-8coord: 526..571
score: 3.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 26..259
score: 9.0E-275coord: 397..416
score: 9.0E-275coord: 527..808
score: 9.0E