CmoCh20G003610 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh20G003610
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein, chloroplastic
LocationCmo_Chr20: 1772111 .. 1778127 (-)
RNA-Seq ExpressionCmoCh20G003610
SyntenyCmoCh20G003610
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACGAGGGATCCATCGGATGTTCTTGAAGAAATGAACGACCGCCTATCGGCCAGGGAATTTCAGCTAGTGCTGGTGTACTTCTCTCAAGAAGGGAGGGATTCGTGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACAACATAACGTCAGAGATGTGGTTGACCTTCTCGTGGATATGGATTGTGTAGGTTTGAAGCCCCATTTTAGCATGATAGAAAAGGTCATCTCTTTGTATTGGGATATGGGTGAGAAGGAAAAAGCAATTTCGTTCGTGAAGGAGGTCTTGGGACGCAAACTTGATTTTATGAAGGACAATTGGGAAGGGCATAAAGGAGGACCGAGTGGATATCTCGCATGGAAGATGATGGTAAGCCTTTAGCTAATCAGGTTTTCATATCTTTAGCCTTTAACCTTGCTCAAGAGTCAAGTCAACCATAGTATTCTTTTGCTGTGTCTCTTTATTACATTGAAAATTCATATCTGAAATGTAACTATCTTCAATAGCTGTTTACTAACTTTCGATGTATAATTTTGAATCCTGAAGTTCCTAATTGTAGAGATGCCTCACTCTGCCTCCTTTATCTTTTTGGGTTTTGATTATGTGCTATAGAGCGTCAGATTTTCGTTGAGGTGACTCTGGTTAAGGTAGTAAAGGATATTTTCTGGCGTCGTGTTTTTCTAATCACAATTGAAGGCGTTCAGGAAGAACTGTGCAGGGTTGGCTGTACTCTTGGGATATATGGTTACGCTGCATTTTAATGTGGTTAAGCATGGTTGAATCTATTGTCATGATAGTGTTGGACATACTGTACCACATTGACTTCCTTTCAATGTTATTATTTGTGGCTTATTAGACTTCCATATCGTAGCGTAACTTCTGGTATGTTCTTGCTTATGTTAGTAGCACACCGACATATTTCATTTATATTAGAACCACATTGAATGCTGTATGCGACCAAGTTAATCCAGTCTCGTAACTATAGCACTGTGCTGCTACTTTTGTTCCCTTCACCAGTAATGGTAGTGGATTAGTTGCCCGAGGAGTATATATAAGTATTTTCGAATTGTTATGTCTGTTCCGCACGACCCCGAGTGAAAATTCAATTGCTGATAATCTTGAAGATCAATTCTTGTAGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGAATCTCAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATTGCCATGACTGCCGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGCAAACTGAAAAGTTACGCAAGAGACGGGATGGTGGCTGAACTCGATAAAGACAATGTCGAACTTGTCAAGAGGTATCAGTCAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGACGAGGGAGGCTCTTCGAGTCACGGGGTGGTTCATGAGAGACTCCTTGCAATGTACATTTGTGCTGGGCAAGGACTAGAGGCAGAGCGGCAGCTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATGCCGATCTCTACGATATCGTGCTAGCCATATGTGCTTCACAGAAGGAGACAAGAGCAATGAACCGGTTGCTTACCAGGATTGAGATTACGAGTCCCCGGCTTAAGAAGAAGAGTTTAACATGGCTACTAAGGGGTTACATAAAAGGAGGTCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCAATTTGGGTTTTCTCCCTGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGGCTTAGAAAACGGATTCGGGAACCTGAAAACGTCGAGACTTACCTCGATCTCTGCAAGTGTCTCTCTGATGCTAATCTTATTGGACCCAGTCTTGTATATTTGCACTTACAGAAATACAAGCTTTGGGTCATTAAAATGCTTTGAAGAAGCCTCGATACCTCTCTGCACAGGCAGCTAATAAAGTAGAGCAGAAATCATTTATACAGCACCAGCACTTTTTTGGGTGCTTTTATATGTTGATTTTGTATAGTTTCAGGCAGGTGACTCTAGAAGCTCTTTAAGCCGACCCTGAAGACGAATACTTGTGTATATCTGTATATATATAATCAGCTACTGTGCACAGAGGACCAATGTTACAGTGTATAGATTGTATAAATTACATACTCTGATATTCTCACTACAGGCTTCACACCGTTGGTAGATATTATCTGCTTTGTTCTGTTACGTATTGCCGTCAGACTCACGGTTTTAAAATGTGTTTGGTAGGGAGAGATTTCCACACTGTTGGATGTTTTGTTCTCCTCTCTTACCGACGTGGGATCTCACAATCTACCCCTCTTGGACGCTCAGCGTCCTTGTTGGCACACCGCTCGGTGTTTGGCTCTGATACCATTTGTAACAGATCAAGCACATATTGTCTGCTTTGTTCCGCTATATATCGTCGTCAGATGCATGGTTTTAAAACGTGTCTGGTAAGAAGAGATTTCCACACTCTTATAAGGAATGCTTCATTCCCCTTTTCAACTGATGTGAGATCTAAGAGTTTGTGAAGTGTTGTTCGTGAACTCCGACTTTGGATGTGGCCATGCTGCTTCTTCTCGCCCAGAAAACATTTGGGCTGCCCCTTGATTGGATATCCAGGTTTTACTTACACCATTGCATTATTACATTATTCAATTATTTCTTGTGTGATTGATGCAAGACGTCCTTTCTTATGATATCATTAAAAGGAAAAATTCTATTTGGAACCAACCCATGTTCTCATATTAGGAAGGTGTAGTTGTGTCGATATATATATAGATGGGGCCAAAACTAACACCAAACCAATACAATGATATCAACCTTTGATCAAAATCGAACCTAATCGACTGAATCCGACTCTGAAAAGCCATTTCAATTGAATTGAGTTGATTGATTTTGGTTATAAACTTATCATGAATCTCGAATTTAATAAACAGTATCCAAATCGACACAAATTCAGGTTGGGTGAGGCAGCCATTTATTGCTCAATTCTTGTTTGTTGTTCAATTCTTGTTTGTTGTGTATAATATTATATGAAAGTTTCCAGGAAAGACGCAGGGAAGTTGCTGTAATAGGATAGGATGTCGTGTGTGTTTATCTGTTCTGTTTCTGTCGCCGATAGGATGTCGTGTGTGTTTATCTGTCCTGTTTCTGTCGCCACTTAGCCTCTATTATTCACATCAATCTTCTTCTTGAAACCAAGCTCTCACATGTGCAATCTTCTTCTTGAAACCAAGCTCTCACATGTGCAATGTCGAAGAGCTCTATGATTGTTCCTTGGGTCAAACATTTGAAGCCTCCAACCTTATGGTTTATATTATGTGTAGATATTCTCGAGTTACCGATCATCTCGATACTCCTAAAAGAGGTGGAGCTAGAAATTTTCAGGGGAAGGGACGAAATTTTATAGTTTTTTTTTTTTTTTTTTAATGATCTTCGTGTTCATCCTAATAAAATGGAATTTGACATGTTTTTTTTTCAAACAAAGCTAACCTTTTTTTTGGATTATTTTTTTTGCTTTTCTCGCTCTCGGATGTGATTTTTTCCTCACTCATGCTCACTTCCTTGCTACTTTGTATGTCTCCCTCTCGAGTTTTCCCTCACACTCTCACTTACTTTCTTGGTCCCTCTGCATTGGGTTCATTCGGTGTCTATAAGAATGTTGCATATTTCAGCCGGTCGTCATGCATGGCTAGAGGATAAGAACAACTCTATTCATTTAAAGATATTGATACATGTAATAACTTTAACCTTGACCCATTATGTATCACCGTCAGCCTTGCGTTTTAAAATGTGTCTACTAGGGACAAGTTTCCACACCCTTATAAGGAATGATTTTGTTCCCCTCTTCAACCAACGTGGGATCTCACAATCCACTCCTAATATCTCACACTACTCAGTGTCTAACTCTGATACCAATAACCCAAACCACCACTAGTAGATATTATCCGCTTTGGTCCGTTACATATCACCTCAGCCTCATGGTTTTGAAACATGTCTGCTAAGAAGTTTCCACACCCATATTAGAAATGCTTTGTTCCCGATGGAAAACACACACAAAAGCATAATCATTATCTTTGAAGAAAGAGGCATATACATATTCATGTTGTGGAGCTCTTGCAACATAGCCTTAGCAGAGCTTGTTTTATGTACAAATATCATTTCTTACCACTTTGCATTGCATAAGTGGAGAACAGCACTCTCCCTGCTACTCAACCCCTGCTGCTATTTTACTGCCTTATCAACAAAAGAATTTTGAAACTCCCCAAGAAACAATACCATAACTTAACTTTCTGTAGATATGATTTCAAGTTCTGCCCACCACAGAGGAGATACCTTTGGTGTTGCTCCAGCAGGCTCCAATGCTGTGATTTCTTTCAACCTTGCAGCAATCTCCGACATCCTCCGTCCTTGCTTCGGCTCGGGCTGGATGCACCGTTTTATCACTCGAGACAAACTCTCGAGCTGTTCTTCTTTGAAGGAGCTTAGAATTGGATCAACCATCTCTCTTAATGACTGTTCCCCCTTCACAAAATCCACTGCCCAATCTGCAAGAGAGCCATCATCAACCGAGAATGGAATTCTACCTGTAATCATTTCCAACAAAATTACTCCAAAGCTATAAACATTGCTTTCTGAATCTGCAGGTGACGTTTCCAGGAGCTCGACTGTCGCGGATCCCAACTTTGCTGCTGTTTCTTCACTCCAGTAGCTGAAATCAGACAACTTGGCAGCATAGTCTTCGGTCAGGTACACAGACGACGAACAAAGATGTCTGTGTATGACGGGCGGGTCGAGCTGGTGCATATGTTCTAAGCAGTATGCTACACCCATTGCTATTCGTAGTCTCGTCTCCCAGTCCAGGTGCTCAGCTTCTTTTACTGTAACAAGGATGATTCCATGAACTTTGGAAGAACAATGCCAATTCTTCAACATGTTTATATGAAAATAAACACTTACTGTGAAGATGCTCAAACAGAGTTCCATTTGGAGCATATTCAAAAACCATCATCCTTGTAAACGGCTGTGCTTCTTCGCAAAACCCGACGAGACTAACGAAGTTCTTATGGTTGACCCTTGACAAAGTTTCGACCTACAAACATGGAAACGAACGGAAACTTTGAGCCTAAACTGATGAACTTCGAAGAAACTTTGTTTGTTTTCTTATACCTTTTTCCTGAACTGTTCTTCTTTAGTTTTTGACCAATCTGCATTTGAGGTCACTGCAGTTGATGTCACAGCTATTTCAACTCCGCTGGAAAGAGTTCCCTTATAGACAGTGATGTCCGAAAAGGATCCGATTATGTTGCTGAAGTCTTCACAAGCTGCTTCGAGCTCGGATCGCCTGAGCTTCGGGACACCTGACATGACAACGGTAAAGTTTGAGTACTGAGCTTAGACTTGAGGTATCTTCCAAATTCTATCACTTTGAGGAAAGAAACCAAACCTGTTACAAATGCCTTCTGCAGCTGTCCACTCAAACCAGTGGCCCAAGGCTTCACAGTAACAACTTTACTGCTTCTGAATATCAAAATTCCAACACTGATAACAATAAAGAACAATGACCCTGATATAATTCCTGCCAATATTGGAATTCTGTGATCTTTTTTCTTATTACTCCCTCCAATCAGACTCGGAGACGGAGCTAAGGAACGAGGCGGAGCCCTGTGGGAGTGAGGTGGTGTGAGAACATGAACTGGTGAGTGTAGACTCGGGGCAGATGCCGGTGCAGGACTCAAATGAAGGCTAGGAGCTGGTGCTGACCACCATGGAGCTAGAGGCTTTGTGGGAGATACTTCAGGAGTTAATGGTGGATCTGAAGGCAATAATGGAGACGAAGAAGCAAACGAAGATGGCGGTAATGAAACGAACGAATGTCGCATAG

mRNA sequence

ATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACGAGGGATCCATCGGATGTTCTTGAAGAAATGAACGACCGCCTATCGGCCAGGGAATTTCAGCTAGTGCTGGTGTACTTCTCTCAAGAAGGGAGGGATTCGTGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACAACATAACGTCAGAGATGTGGTTGACCTTCTCGTGGATATGGATTGTGTAGAGCGTCAGATTTTCGTTGAGGTGACTCTGGTTAAGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGAATCTCAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATTGCCATGACTGCCGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGCAAACTGAAAAGTTACGCAAGAGACGGGATGGTGGCTGAACTCGATAAAGACAATGTCGAACTTGTCAAGAGGTATCAGTCAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGACGAGGGAGGCTCTTCGAGTCACGGGGTGGTTCATGAGAGACTCCTTGCAATGTACATTTGTGCTGGGCAAGGACTAGAGGCAGAGCGGCAGCTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATGCCGATCTCTACGATATCGTGCTAGCCATATGTGCTTCACAGAAGGAGACAAGAGCAATGAACCGGTTGCTTACCAGGATTGAGATTACGAGTCCCCGGCTTAAGAAGAAGAGTTTAACATGGCTACTAAGGGGTTACATAAAAGGAGGTCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCAATTTGGGTTTTCTCCCTGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGGCTTAGAAAACGGATTCGGGAACCTGAAAACGTCGAGACTTACCTCGATCTCTGCAAGTGTCTCTCTGATGCTAATCTTATTGGACCCAGTCTTGAGCTCGACTGTCGCGGATCCCAACTTTGCTGCTGTTTCTTCACTCCAGTAGCTGAAATCAGACAACTTGGCAGCATAGTCTTCGGTCAGACAGTGATGTCCGAAAAGGATCCGATTATGTTGCTGAAGTCTTCACAAGCTGCTTCGAGCTCGGATCGCCTGAGCTTCGGGACACCTGACATGACAACGGAGTTAATGGTGGATCTGAAGGCAATAATGGAGACGAAGAAGCAAACGAAGATGGCGGTAATGAAACGAACGAATGTCGCATAG

Coding sequence (CDS)

ATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACGAGGGATCCATCGGATGTTCTTGAAGAAATGAACGACCGCCTATCGGCCAGGGAATTTCAGCTAGTGCTGGTGTACTTCTCTCAAGAAGGGAGGGATTCGTGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGGTGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACAACATAACGTCAGAGATGTGGTTGACCTTCTCGTGGATATGGATTGTGTAGAGCGTCAGATTTTCGTTGAGGTGACTCTGGTTAAGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGAATCTCAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATTGCCATGACTGCCGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGCAAACTGAAAAGTTACGCAAGAGACGGGATGGTGGCTGAACTCGATAAAGACAATGTCGAACTTGTCAAGAGGTATCAGTCAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGACGAGGGAGGCTCTTCGAGTCACGGGGTGGTTCATGAGAGACTCCTTGCAATGTACATTTGTGCTGGGCAAGGACTAGAGGCAGAGCGGCAGCTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATGCCGATCTCTACGATATCGTGCTAGCCATATGTGCTTCACAGAAGGAGACAAGAGCAATGAACCGGTTGCTTACCAGGATTGAGATTACGAGTCCCCGGCTTAAGAAGAAGAGTTTAACATGGCTACTAAGGGGTTACATAAAAGGAGGTCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCAATTTGGGTTTTCTCCCTGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGGCTTAGAAAACGGATTCGGGAACCTGAAAACGTCGAGACTTACCTCGATCTCTGCAAGTGTCTCTCTGATGCTAATCTTATTGGACCCAGTCTTGAGCTCGACTGTCGCGGATCCCAACTTTGCTGCTGTTTCTTCACTCCAGTAGCTGAAATCAGACAACTTGGCAGCATAGTCTTCGGTCAGACAGTGATGTCCGAAAAGGATCCGATTATGTTGCTGAAGTCTTCACAAGCTGCTTCGAGCTCGGATCGCCTGAGCTTCGGGACACCTGACATGACAACGGAGTTAATGGTGGATCTGAAGGCAATAATGGAGACGAAGAAGCAAACGAAGATGGCGGTAATGAAACGAACGAATGTCGCATAG

Protein sequence

MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVERQIFVEVTLVKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANLIGPSLELDCRGSQLCCCFFTPVAEIRQLGSIVFGQTVMSEKDPIMLLKSSQAASSSDRLSFGTPDMTTELMVDLKAIMETKKQTKMAVMKRTNVA
Homology
BLAST of CmoCh20G003610 vs. ExPASy Swiss-Prot
Match: Q0WNN7 (Pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g30100 PE=2 SV=2)

HSP 1 Score: 453.8 bits (1166), Expect = 2.3e-126
Identity = 233/412 (56.55%), Postives = 291/412 (70.63%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           +G+GFFEAIEELERMTR+PSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+
Sbjct: 76  IGEGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYFAQEGRDSWCTLEVFEWLK 135

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVD+E MELMVSIMC W+KKL+E + N   V DLL++MDCV                
Sbjct: 136 KENRVDEEIMELMVSIMCGWVKKLIEDECNAHQVFDLLIEMDCVGLKPGFSMMDKVIALY 195

Query: 121 -------ERQIFVEVTLVK------------------------------VDGDYRGAVKM 180
                     +FV+  L +                              VDGDYR AV M
Sbjct: 196 CEMGKKESAVLFVKEVLRRRDGFGYSVVGGGGSEGRKGGPVGYLAWKFMVDGDYRKAVDM 255

Query: 181 VLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRY 240
           V+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D  +  L+++Y
Sbjct: 256 VMELRLSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFARAGFVAEIDDHDRVLIEKY 315

Query: 241 QSELLADGVRLSNWVLDEG--GSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEA 300
           QSE L+ G++L+ W ++EG    S  GVVHERLLAMYICAG+G EAE+QLW+MKL G+E 
Sbjct: 316 QSETLSRGLQLATWAVEEGQENDSIIGVVHERLLAMYICAGRGPEAEKQLWKMKLAGREP 375

Query: 301 DADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETL 358
           +ADL+DIV+AICASQKE  A++RLLTR+E    + KKK+L+WLLRGY+KGGHF +AAETL
Sbjct: 376 EADLHDIVMAICASQKEVNAVSRLLTRVEFMGSQRKKKTLSWLLRGYVKGGHFEEAAETL 435

BLAST of CmoCh20G003610 vs. ExPASy Swiss-Prot
Match: Q0WVV0 (Pentatricopeptide repeat-containing protein At1g10910, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g10910 PE=2 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 8.0e-10
Identity = 77/329 (23.40%), Postives = 143/329 (43.47%), Query Frame = 0

Query: 8   AIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCAL-EVFEWLQKENRVD 67
           AI E++R + D    L+ +   L  ++  ++L  F   GR  W  L ++FEW+Q+  ++ 
Sbjct: 75  AISEVQR-SSDFLSSLQRLATVLKVQDLNVILRDFGISGR--WQDLIQLFEWMQQHGKIS 134

Query: 68  KETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVERQIFVEV------TLVKVDGD 127
                  VS   S IK +  G  NV   +++   +     +I V +       LVK +G 
Sbjct: 135 -------VSTYSSCIKFV--GAKNVSKALEIYQSIPDESTKINVYICNSILSCLVK-NGK 194

Query: 128 YRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDN 187
               +K+   ++  GLKP+V  Y   +   +K  N + KA                    
Sbjct: 195 LDSCIKLFDQMKRDGLKPDVVTYNTLLAGCIKVKNGYPKA-------------------- 254

Query: 188 VELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKL 247
           +EL+     EL  +G+++ +            V++  +LA+    G+  EAE  + +MK+
Sbjct: 255 IELI----GELPHNGIQMDS------------VMYGTVLAICASNGRSEEAENFIQQMKV 314

Query: 248 VGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRD 307
            G   +   Y  +L   + + + +  + L+T ++       K  +T LL+ YIKGG F  
Sbjct: 315 EGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIKGGLFDR 354

Query: 308 AAETLVKMVNLGFLPEYLDRVAVLQGLRK 330
           + E L ++ + G+    +    ++ GL K
Sbjct: 375 SRELLSELESAGYAENEMPYCMLMDGLSK 354

BLAST of CmoCh20G003610 vs. ExPASy Swiss-Prot
Match: P0C8A0 (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX=3702 GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 46.6 bits (109), Expect = 8.6e-04
Identity = 44/233 (18.88%), Postives = 95/233 (40.77%), Query Frame = 0

Query: 119 GDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDK 178
           G    A  ++ ++R+ G +P V CY + + A+ +      +A+R      R G  A++  
Sbjct: 285 GKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVT 344

Query: 179 DNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEM 238
               +    +  ++  G  + + +  +G   S  V + +++  +    Q  E    + +M
Sbjct: 345 YTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQ-VTYMQIMVAHEKKEQFEECLELIEKM 404

Query: 239 KLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHF 298
           K  G   D  +Y++V+ +     E +   RL   +E         +   ++ G+   G  
Sbjct: 405 KRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGFL 464

Query: 299 RDAAETLVKMVNLGFL--PEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSD 350
            +A     +MV+ G    P+Y      L+ L   +   + +E   D+  C+S+
Sbjct: 465 IEACNHFKEMVSRGIFSAPQY----GTLKSLLNNLVRDDKLEMAKDVWSCISN 512

BLAST of CmoCh20G003610 vs. ExPASy TrEMBL
Match: A0A6J1FYE9 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111448564 PE=4 SV=1)

HSP 1 Score: 664.1 bits (1712), Expect = 4.2e-187
Identity = 350/405 (86.42%), Postives = 353/405 (87.16%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 90  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 149

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV                
Sbjct: 150 KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVGLKPHFSMIEKVISLY 209

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVLNLR
Sbjct: 210 WDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLNLR 269

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL
Sbjct: 270 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 329

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Sbjct: 330 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 389

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG
Sbjct: 390 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 449

BLAST of CmoCh20G003610 vs. ExPASy TrEMBL
Match: A0A6J1JH85 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111484464 PE=4 SV=1)

HSP 1 Score: 656.0 bits (1691), Expect = 1.1e-184
Identity = 346/405 (85.43%), Postives = 351/405 (86.67%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 90  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 149

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVDMDCV                
Sbjct: 150 KENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 209

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVLNLR
Sbjct: 210 WDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLNLR 269

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGLKPEVYC+LIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL
Sbjct: 270 ESGLKPEVYCFLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 329

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVLDEG SSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Sbjct: 330 ADGVRLSNWVLDEGSSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 389

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKETRAMNRLL+RIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG
Sbjct: 390 VLAICASQKETRAMNRLLSRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 449

BLAST of CmoCh20G003610 vs. ExPASy TrEMBL
Match: A0A0A0KC35 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G182120 PE=4 SV=1)

HSP 1 Score: 611.3 bits (1575), Expect = 3.2e-171
Identity = 320/405 (79.01%), Postives = 337/405 (83.21%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTR+PSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 81  MGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLVYFSQEGRDSWCALEVFEWLQ 140

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVDMDCV                
Sbjct: 141 KENRVDKETMELMVSIMCSWIKKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 200

Query: 121 -------ERQIFVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                  +   FV+  L                         + VDGDYRGAVKMVL+LR
Sbjct: 201 WEMGEKEKAVFFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLR 260

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGL+PEVY YLIAMTAVVKELNEFAKALRKLK YARDG VAELDK+NVELV +YQ+ELL
Sbjct: 261 ESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKGYARDGFVAELDKNNVELVAKYQTELL 320

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGV+LSNWVL+EG SS  GVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDI
Sbjct: 321 ADGVQLSNWVLEEGSSSIRGVVHERLLAMYICAGQGVEAERQLWEMKLVGKEADADLYDI 380

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKET+AM RLLTRIEITSP +KKKSLTWLLRGYIKGGHFRDAA TLVKM+NLG
Sbjct: 381 VLAICASQKETKAMKRLLTRIEITSPMIKKKSLTWLLRGYIKGGHFRDAAGTLVKMINLG 440

BLAST of CmoCh20G003610 vs. ExPASy TrEMBL
Match: A0A5A7VNN8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold615G00160 PE=4 SV=1)

HSP 1 Score: 608.6 bits (1568), Expect = 2.1e-170
Identity = 321/405 (79.26%), Postives = 338/405 (83.46%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTR+PSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 45  MGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLVYFSQEGRDSWCALEVFEWLQ 104

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWI KLVEG+HNV DVVDLLVDMDCV                
Sbjct: 105 KENRVDKETMELMVSIMCSWINKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 164

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVL+LR
Sbjct: 165 WEMGEKEKAIFFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLR 224

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGL+PEVY YLIAMTAVVKELNEFAKALRKLKSYARDG VAELDK+NVELV +YQ+ELL
Sbjct: 225 ESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKSYARDGYVAELDKNNVELVAKYQTELL 284

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKL+GKEADADLYDI
Sbjct: 285 ADGVRLSNWVLEEGSSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLLGKEADADLYDI 344

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKE +AM RLLTRIEITSP +KKKSLTWLLRGYIKGGHFRDAA T+VKM+NLG
Sbjct: 345 VLAICASQKEIKAMKRLLTRIEITSPMIKKKSLTWLLRGYIKGGHFRDAAGTVVKMINLG 404

BLAST of CmoCh20G003610 vs. ExPASy TrEMBL
Match: A0A1S3CNE0 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502924 PE=4 SV=1)

HSP 1 Score: 608.6 bits (1568), Expect = 2.1e-170
Identity = 321/405 (79.26%), Postives = 338/405 (83.46%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTR+PSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 81  MGDGFFEAIEELERMTREPSDVLEEMNDRLSAREIQLVLVYFSQEGRDSWCALEVFEWLQ 140

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWI KLVEG+HNV DVVDLLVDMDCV                
Sbjct: 141 KENRVDKETMELMVSIMCSWINKLVEGRHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 200

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVL+LR
Sbjct: 201 WEMGEKEKAIFFVKEVLGRNLAFMKDDWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLHLR 260

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGL+PEVY YLIAMTAVVKELNEFAKALRKLKSYARDG VAELDK+NVELV +YQ+ELL
Sbjct: 261 ESGLRPEVYSYLIAMTAVVKELNEFAKALRKLKSYARDGYVAELDKNNVELVAKYQTELL 320

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKL+GKEADADLYDI
Sbjct: 321 ADGVRLSNWVLEEGSSSIHGVVHERLLAMYICAGQGVEAERQLWEMKLLGKEADADLYDI 380

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKE +AM RLLTRIEITSP +KKKSLTWLLRGYIKGGHFRDAA T+VKM+NLG
Sbjct: 381 VLAICASQKEIKAMKRLLTRIEITSPMIKKKSLTWLLRGYIKGGHFRDAAGTVVKMINLG 440

BLAST of CmoCh20G003610 vs. NCBI nr
Match: XP_022944005.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata])

HSP 1 Score: 664.1 bits (1712), Expect = 8.6e-187
Identity = 350/405 (86.42%), Postives = 353/405 (87.16%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 90  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 149

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV                
Sbjct: 150 KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVGLKPHFSMIEKVISLY 209

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVLNLR
Sbjct: 210 WDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLNLR 269

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL
Sbjct: 270 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 329

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Sbjct: 330 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 389

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG
Sbjct: 390 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 449

BLAST of CmoCh20G003610 vs. NCBI nr
Match: KAG7010495.1 (Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 661.4 bits (1705), Expect = 5.6e-186
Identity = 349/405 (86.17%), Postives = 352/405 (86.91%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 90  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 149

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVDMDCV                
Sbjct: 150 KENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 209

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVLNLR
Sbjct: 210 WDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLNLR 269

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL
Sbjct: 270 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 329

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Sbjct: 330 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 389

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG
Sbjct: 390 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 449

BLAST of CmoCh20G003610 vs. NCBI nr
Match: KAG6570645.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 659.4 bits (1700), Expect = 2.1e-185
Identity = 348/405 (85.93%), Postives = 351/405 (86.67%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERM RDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 90  MGDGFFEAIEELERMARDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 149

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVDMDCV                
Sbjct: 150 KENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 209

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVLNLR
Sbjct: 210 WDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLNLR 269

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL
Sbjct: 270 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 329

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Sbjct: 330 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 389

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG
Sbjct: 390 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 449

BLAST of CmoCh20G003610 vs. NCBI nr
Match: XP_023512972.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 658.3 bits (1697), Expect = 4.7e-185
Identity = 348/405 (85.93%), Postives = 351/405 (86.67%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 90  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 149

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVDMDCV                
Sbjct: 150 KENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 209

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVLNLR
Sbjct: 210 WDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLNLR 269

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL
Sbjct: 270 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 329

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVLDEGGSSSH VVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Sbjct: 330 ADGVRLSNWVLDEGGSSSHRVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 389

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG
Sbjct: 390 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 449

BLAST of CmoCh20G003610 vs. NCBI nr
Match: XP_022986849.1 (pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita maxima])

HSP 1 Score: 656.0 bits (1691), Expect = 2.4e-184
Identity = 346/405 (85.43%), Postives = 351/405 (86.67%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ
Sbjct: 90  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 149

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVDMDCV                
Sbjct: 150 KENRVDKETMELMVSIMCSWIKKLVEGQHNVGDVVDLLVDMDCVGLKPHFSMIEKVISLY 209

Query: 121 ------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLR 180
                 E+ I FV+  L                         + VDGDYRGAVKMVLNLR
Sbjct: 210 WDMGEKEKAISFVKEVLGRKLDFMKDNWEGHKGGPSGYLAWKMMVDGDYRGAVKMVLNLR 269

Query: 181 ESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 240
           ESGLKPEVYC+LIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL
Sbjct: 270 ESGLKPEVYCFLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELL 329

Query: 241 ADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 300
           ADGVRLSNWVLDEG SSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Sbjct: 330 ADGVRLSNWVLDEGSSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI 389

Query: 301 VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 358
           VLAICASQKETRAMNRLL+RIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG
Sbjct: 390 VLAICASQKETRAMNRLLSRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLG 449

BLAST of CmoCh20G003610 vs. TAIR 10
Match: AT2G30100.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 453.8 bits (1166), Expect = 1.7e-127
Identity = 233/412 (56.55%), Postives = 291/412 (70.63%), Query Frame = 0

Query: 1   MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQ 60
           +G+GFFEAIEELERMTR+PSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+
Sbjct: 76  IGEGFFEAIEELERMTREPSDILEEMNHRLSSRELQLMLVYFAQEGRDSWCTLEVFEWLK 135

Query: 61  KENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCV---------------- 120
           KENRVD+E MELMVSIMC W+KKL+E + N   V DLL++MDCV                
Sbjct: 136 KENRVDEEIMELMVSIMCGWVKKLIEDECNAHQVFDLLIEMDCVGLKPGFSMMDKVIALY 195

Query: 121 -------ERQIFVEVTLVK------------------------------VDGDYRGAVKM 180
                     +FV+  L +                              VDGDYR AV M
Sbjct: 196 CEMGKKESAVLFVKEVLRRRDGFGYSVVGGGGSEGRKGGPVGYLAWKFMVDGDYRKAVDM 255

Query: 181 VLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRY 240
           V+ LR SGLKPE Y YLIAMTA+VKELN   K LR+LK +AR G VAE+D  +  L+++Y
Sbjct: 256 VMELRLSGLKPEAYSYLIAMTAIVKELNSLGKTLRELKRFARAGFVAEIDDHDRVLIEKY 315

Query: 241 QSELLADGVRLSNWVLDEG--GSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEA 300
           QSE L+ G++L+ W ++EG    S  GVVHERLLAMYICAG+G EAE+QLW+MKL G+E 
Sbjct: 316 QSETLSRGLQLATWAVEEGQENDSIIGVVHERLLAMYICAGRGPEAEKQLWKMKLAGREP 375

Query: 301 DADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETL 358
           +ADL+DIV+AICASQKE  A++RLLTR+E    + KKK+L+WLLRGY+KGGHF +AAETL
Sbjct: 376 EADLHDIVMAICASQKEVNAVSRLLTRVEFMGSQRKKKTLSWLLRGYVKGGHFEEAAETL 435

BLAST of CmoCh20G003610 vs. TAIR 10
Match: AT1G10910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 66.6 bits (161), Expect = 5.7e-11
Identity = 77/329 (23.40%), Postives = 143/329 (43.47%), Query Frame = 0

Query: 8   AIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCAL-EVFEWLQKENRVD 67
           AI E++R + D    L+ +   L  ++  ++L  F   GR  W  L ++FEW+Q+  ++ 
Sbjct: 75  AISEVQR-SSDFLSSLQRLATVLKVQDLNVILRDFGISGR--WQDLIQLFEWMQQHGKIS 134

Query: 68  KETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVERQIFVEV------TLVKVDGD 127
                  VS   S IK +  G  NV   +++   +     +I V +       LVK +G 
Sbjct: 135 -------VSTYSSCIKFV--GAKNVSKALEIYQSIPDESTKINVYICNSILSCLVK-NGK 194

Query: 128 YRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDN 187
               +K+   ++  GLKP+V  Y   +   +K  N + KA                    
Sbjct: 195 LDSCIKLFDQMKRDGLKPDVVTYNTLLAGCIKVKNGYPKA-------------------- 254

Query: 188 VELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKL 247
           +EL+     EL  +G+++ +            V++  +LA+    G+  EAE  + +MK+
Sbjct: 255 IELI----GELPHNGIQMDS------------VMYGTVLAICASNGRSEEAENFIQQMKV 314

Query: 248 VGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRD 307
            G   +   Y  +L   + + + +  + L+T ++       K  +T LL+ YIKGG F  
Sbjct: 315 EGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIKGGLFDR 354

Query: 308 AAETLVKMVNLGFLPEYLDRVAVLQGLRK 330
           + E L ++ + G+    +    ++ GL K
Sbjct: 375 SRELLSELESAGYAENEMPYCMLMDGLSK 354

BLAST of CmoCh20G003610 vs. TAIR 10
Match: AT3G49730.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 46.6 bits (109), Expect = 6.1e-05
Identity = 44/233 (18.88%), Postives = 95/233 (40.77%), Query Frame = 0

Query: 119 GDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDK 178
           G    A  ++ ++R+ G +P V CY + + A+ +      +A+R      R G  A++  
Sbjct: 285 GKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVT 344

Query: 179 DNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEM 238
               +    +  ++  G  + + +  +G   S  V + +++  +    Q  E    + +M
Sbjct: 345 YTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQ-VTYMQIMVAHEKKEQFEECLELIEKM 404

Query: 239 KLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHF 298
           K  G   D  +Y++V+ +     E +   RL   +E         +   ++ G+   G  
Sbjct: 405 KRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGFL 464

Query: 299 RDAAETLVKMVNLGFL--PEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSD 350
            +A     +MV+ G    P+Y      L+ L   +   + +E   D+  C+S+
Sbjct: 465 IEACNHFKEMVSRGIFSAPQY----GTLKSLLNNLVRDDKLEMAKDVWSCISN 512

BLAST of CmoCh20G003610 vs. TAIR 10
Match: AT1G74580.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 43.1 bits (100), Expect = 6.7e-04
Identity = 54/231 (23.38%), Postives = 97/231 (41.99%), Query Frame = 0

Query: 91  VRDVVDLLVDMDC--VERQIFVEVTLVKV---DGDYRGAVKMVLNLRESGLKPEVYCYLI 150
           V++ V++   MD    E  +F    ++ V    G +  A K+ + +R+ G+ P+VY + I
Sbjct: 92  VQEAVNVFERMDFYDCEPTVFSYNAIMSVLVDSGYFDQAHKVYMRMRDRGITPDVYSFTI 151

Query: 151 AMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDE 210
            M +  K     A ALR L + +  G    +      +   Y+    A+G  L   +L  
Sbjct: 152 RMKSFCKTSRPHA-ALRLLNNMSSQGCEMNVVAYCTVVGGFYEENFKAEGYELFGKMLAS 211

Query: 211 GGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRA 270
           G S      + +LL +    G   E E+ L ++   G   +   Y++ +     + E   
Sbjct: 212 GVSLCLSTFN-KLLRVLCKKGDVKECEKLLDKVIKRGVLPNLFTYNLFIQGLCQRGELDG 271

Query: 271 MNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPE 317
             R++  +    P+    +   L+ G  K   F++A   L KMVN G  P+
Sbjct: 272 AVRMVGCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEVYLGKMVNEGLEPD 320

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q0WNN72.3e-12656.55Pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Arabidop... [more]
Q0WVV08.0e-1023.40Pentatricopeptide repeat-containing protein At1g10910, chloroplastic OS=Arabidop... [more]
P0C8A08.6e-0418.88Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1FYE94.2e-18786.42pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbit... [more]
A0A6J1JH851.1e-18485.43pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucurbit... [more]
A0A0A0KC353.2e-17179.01Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G182120 PE=4 SV=1[more]
A0A5A7VNN82.1e-17079.26Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3CNE02.1e-17079.26pentatricopeptide repeat-containing protein At2g30100, chloroplastic OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
XP_022944005.18.6e-18786.42pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
KAG7010495.15.6e-18686.17Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosper... [more]
KAG6570645.12.1e-18585.93Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
XP_023512972.14.7e-18585.93pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
XP_022986849.12.4e-18485.43pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
AT2G30100.11.7e-12756.55pentatricopeptide (PPR) repeat-containing protein [more]
AT1G10910.15.7e-1123.40Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G49730.16.1e-0518.88Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G74580.16.7e-0423.38Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 5..174
e-value: 2.1E-5
score: 26.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 182..346
e-value: 1.7E-7
score: 33.0
NoneNo IPR availablePANTHERPTHR47880OS05G0353300 PROTEINcoord: 1..104
coord: 117..357
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 281..315
score: 8.692369

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh20G003610.1CmoCh20G003610.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding