ClCG06G010030 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG06G010030
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCG_Chr06: 20426069 .. 20428559 (+)
RNA-Seq ExpressionClCG06G010030
SyntenyClCG06G010030
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTATTACCCTTAAACTATTTTGAATCGAAACGGAGGGCTAAGAGAAAACCTATCGTCCCGCCCTCTCGCGTTTCAGTCTTTACGCCGGCTCCTTCCTCCTCCGCCTCCGGGCTTATCCCGCTTAGTGCAGGCCTTGCCGTTGGGCGCAACCTAACTTGCCGTCTGCCGGCCGTACGCCTCAACCGCCCTCTTTTGCTTCCTGTTTCATACTCTGCTTTCGTCGGCAACTATCAATGAAATCGAACTAAATCGAACCGGATTAAAATCCTTGGTAATTATCGATTTTTTGCACTCTGTTTTTGTTAATTATTTATTTCTTTTTCTATTCTTAAGAACAATTACTATGTTCTCAATATTTTCTCTAAATCGTTTGCAGCAGCTTCGACTCGTAAAAGAAAGAAACCATTTTGTTGCTCTGAAATTACCCTATCAATGGCTCTCTTTCGAATCTCTTACCTCCGATCATCATCATTTCGTCTCAAGATCCCCGCGTTATCTACCTTGCAGCTAAGTACAGTCTCTTCTGCCGATTTATTCTATGGCCATCTGCAGAAAAACAACGGTAATGTGGAGAAAACCCTTGCTACTGTAAAGACCAAGTTGGATTCTGGATGTGTCAACCAAGTATTACATAAATGTTCCTTCGAACTATCTCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAAATGATTGGAATTCATAGAAGGCCAGGTTTGCTTTTTAATGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATTTTTAAGATTATGTTAAACTTGTGTAAAGAAGCTAAGCTTGCCAAAGAGGCTTTGTCCATTTTAGGGAAAATGCCCGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATATGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGATATACATCCTAACATGATTACTTATATTTCTTTGCTTAAGGGATTCTGTGATGTGGGTCGTTGGGCGGATGCTTATGGGGTATTTGAGGCTATGAAGGAAAATGGATGTGAACCCAATACAGTGGTTTACTCCGTGCTACTTAATGCTGCCAGTCGGCATGGGACTATGGAAAAGCTAATGGAAGTGTTGGAGGAGATGGAAAAACAAGGGGGAACATGTGGTCCAAATACTGTCACATACACTTCCATAATCCAGCGTCTATGTGAACTAGGCCAGCCTCTGGAAGCATTGAAGATATTGGACAGAATGGAAGAGTATGGTTGTGCTCCAAATCGTGTTACAGTTAGCTCTTTAATCAAGGAATTTTGTAAAAATGGTCACGTGGAGGAGGCATATAAGTTGATTGATAGAGCTGTTGCAAGAGGTGGTGCTTCGTATGGTGATTGCTATAGCTCACTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAAGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGGTGAAGCCAGATGGTGTGGCTTGTACTCTCGTGATCAAGGAATTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCAACGAAGTCGATAGGAATGGATATTTAACTTCCATTGACTCTGATGTTTATTCTATTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAAGGGATTCGTTTAAAGCCTTACTTTGCTGAAAGTATCATCAAACATCTAAAGAAATTTGGAGATCAAGAGTTAGTTATGAATTTGGGTGGAATAAGAAATGACAAGCAAAACTAACCAAGAACTTGATATGGGGTTGGAGGTGTTAATTTTCAGGATTATCTTAAACAAAGGCTTTTCGCGAGCTTTTTAGTTCATTAAAAACAACAAACGTGTAGATTCATTTGTGGTATATGGATGTCAAATCCGTAGTAATTGCAGGTTCTAAGTTTGGTGGCATACAACTTTAGAGATTGAATCTGTGACCTTTCGTATGAGAGAAGAAGATTTTCATGACTTTGTGACCCTGAAATGTTTCACTTTTTGGATGAGTGCAAAGGTGTGTTAAAAGTCCTTATGAACAACTATTCACATCGTACGAGTTGGAGGTTGAGTTCTGTATCGATAGCCAACCTGAAGTACATCTTACTAGTTTCTCAAATTTCAATATTGTTCAATCTAGTGAAGATGCTTTTAAAATTTTAAATGTAGTCAACCTGAAGTACCTCTTATTAGCACCAAGCAACTCTGACCCCCTATCTAGGAGCTGGCTGCTATCCAGGGTCCTACTGCAGATTGCAGGAGTTCTGGCTGTGTTAATCTGTAAGCTTGTATACGTTTTAATTGTTAATATTATGATTTTAAAATATGACCTTACTTGAACAGGATGTGTTTTATAGGTTGTACAATCAAT

mRNA sequence

ATGCTATTACCCTTAAACTATTTTGAATCGAAACGGAGGGCTAAGAGAAAACCTATCGTCCCGCCCTCTCGCGTTTCAGTCTTTACGCCGGCTCCTTCCTCCTCCGCCTCCGGGCTTATCCCGCTTAGTGCAGGCCTTGCCGTTGGGCGCAACCTAACTTGCCCAGCTTCGACTCGTAAAAGAAAGAAACCATTTTGTTGCTCTGAAATTACCCTATCAATGGCTCTCTTTCGAATCTCTTACCTCCGATCATCATCATTTCGTCTCAAGATCCCCGCGTTATCTACCTTGCAGCTAAGTACAGTCTCTTCTGCCGATTTATTCTATGGCCATCTGCAGAAAAACAACGGTAATGTGGAGAAAACCCTTGCTACTGTAAAGACCAAGTTGGATTCTGGATGTGTCAACCAAGTATTACATAAATGTTCCTTCGAACTATCTCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAAATGATTGGAATTCATAGAAGGCCAGGTTTGCTTTTTAATGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATTTTTAAGATTATGTTAAACTTGTGTAAAGAAGCTAAGCTTGCCAAAGAGGCTTTGTCCATTTTAGGGAAAATGCCCGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATATGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGATATACATCCTAACATGATTACTTATATTTCTTTGCTTAAGGGATTCTGTGATGTGGGTCGTTGGGCGGATGCTTATGGGGTATTTGAGGCTATGAAGGAAAATGGATGTGAACCCAATACAGTGGTTTACTCCGTGCTACTTAATGCTGCCAGTCGGCATGGGACTATGGAAAAGCTAATGGAAGTGTTGGAGGAGATGGAAAAACAAGGGGGAACATGTGGTCCAAATACTGTCACATACACTTCCATAATCCAGCGTCTATGTGAACTAGGCCAGCCTCTGGAAGCATTGAAGATATTGGACAGAATGGAAGAGTATGGTTGTGCTCCAAATCGTGTTACAGTTAGCTCTTTAATCAAGGAATTTTGTAAAAATGGTCACGTGGAGGAGGCATATAAGTTGATTGATAGAGCTGTTGCAAGAGGTGGTGCTTCGTATGGTGATTGCTATAGCTCACTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAAGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGGTGAAGCCAGATGGTGTGGCTTGTACTCTCGTGATCAAGGAATTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCAACGAAGTCGATAGGAATGGATATTTAACTTCCATTGACTCTGATGTTTATTCTATTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAAGGGATTCGTTTAAAGCCTTACTTTGCTGAAAGTATCATCAAACATCTAAAGAAATTTGGAGATCAAGAGTTAGTTATGAATTTGGGTGGAATAAGAAATGACAAGCAAAACTAACCAAGAACTTGATATGGGGTTGGAGGTGTTAATTTTCAGGATTATCTTAAACAAAGGCTTTTCGCGAGCTTTTTAGTTCATTAAAAACAACAAACGTGTAGATTCATTTGTGGTATATGGATGTCAAATCCGTAGTAATTGCAGGTTCTAAGTTTGGTGGCATACAACTTTAGAGATTGAATCTGTGACCTTTCGTATGAGAGAAGAAGATTTTCATGACTTTGTGACCCTGAAATGTTTCACTTTTTGGATGAGTGCAAAGGTGTGTTAAAAGTCCTTATGAACAACTATTCACATCGTACGAGTTGGAGGTTGAGTTCTGTATCGATAGCCAACCTGAAGTACATCTTACTAGTTTCTCAAATTTCAATATTGTTCAATCTAGTGAAGATGCTTTTAAAATTTTAAATGTAGTCAACCTGAAGTACCTCTTATTAGCACCAAGCAACTCTGACCCCCTATCTAGGAGCTGGCTGCTATCCAGGGTCCTACTGCAGATTGCAGGAGTTCTGGCTGTGTTAATCTGTAAGCTTGTATACGTTTTAATTGTTAATATTATGATTTTAAAATATGACCTTACTTGAACAGGATGTGTTTTATAGGTTGTACAATCAAT

Coding sequence (CDS)

ATGCTATTACCCTTAAACTATTTTGAATCGAAACGGAGGGCTAAGAGAAAACCTATCGTCCCGCCCTCTCGCGTTTCAGTCTTTACGCCGGCTCCTTCCTCCTCCGCCTCCGGGCTTATCCCGCTTAGTGCAGGCCTTGCCGTTGGGCGCAACCTAACTTGCCCAGCTTCGACTCGTAAAAGAAAGAAACCATTTTGTTGCTCTGAAATTACCCTATCAATGGCTCTCTTTCGAATCTCTTACCTCCGATCATCATCATTTCGTCTCAAGATCCCCGCGTTATCTACCTTGCAGCTAAGTACAGTCTCTTCTGCCGATTTATTCTATGGCCATCTGCAGAAAAACAACGGTAATGTGGAGAAAACCCTTGCTACTGTAAAGACCAAGTTGGATTCTGGATGTGTCAACCAAGTATTACATAAATGTTCCTTCGAACTATCTCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAAATGATTGGAATTCATAGAAGGCCAGGTTTGCTTTTTAATGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATTTTTAAGATTATGTTAAACTTGTGTAAAGAAGCTAAGCTTGCCAAAGAGGCTTTGTCCATTTTAGGGAAAATGCCCGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATATGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGATATACATCCTAACATGATTACTTATATTTCTTTGCTTAAGGGATTCTGTGATGTGGGTCGTTGGGCGGATGCTTATGGGGTATTTGAGGCTATGAAGGAAAATGGATGTGAACCCAATACAGTGGTTTACTCCGTGCTACTTAATGCTGCCAGTCGGCATGGGACTATGGAAAAGCTAATGGAAGTGTTGGAGGAGATGGAAAAACAAGGGGGAACATGTGGTCCAAATACTGTCACATACACTTCCATAATCCAGCGTCTATGTGAACTAGGCCAGCCTCTGGAAGCATTGAAGATATTGGACAGAATGGAAGAGTATGGTTGTGCTCCAAATCGTGTTACAGTTAGCTCTTTAATCAAGGAATTTTGTAAAAATGGTCACGTGGAGGAGGCATATAAGTTGATTGATAGAGCTGTTGCAAGAGGTGGTGCTTCGTATGGTGATTGCTATAGCTCACTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAAGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGGTGAAGCCAGATGGTGTGGCTTGTACTCTCGTGATCAAGGAATTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCAACGAAGTCGATAGGAATGGATATTTAACTTCCATTGACTCTGATGTTTATTCTATTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAAGGGATTCGTTTAAAGCCTTACTTTGCTGAAAGTATCATCAAACATCTAAAGAAATTTGGAGATCAAGAGTTAGTTATGAATTTGGGTGGAATAAGAAATGACAAGCAAAACTAA

Protein sequence

MLLPLNYFESKRRAKRKPIVPPSRVSVFTPAPSSSASGLIPLSAGLAVGRNLTCPASTRKRKKPFCCSEITLSMALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILLVGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIRNDKQN
Homology
BLAST of ClCG06G010030 vs. NCBI nr
Match: XP_038874415.1 (pentatricopeptide repeat-containing protein At5g47360 [Benincasa hispida] >XP_038874417.1 pentatricopeptide repeat-containing protein At5g47360 [Benincasa hispida] >XP_038874418.1 pentatricopeptide repeat-containing protein At5g47360 [Benincasa hispida])

HSP 1 Score: 850.1 bits (2195), Expect = 1.1e-242
Identity = 421/474 (88.82%), Postives = 447/474 (94.30%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           M LFRISYLRSSSFRLKI  LSTLQLST+SSADLFY HLQKNNGNVEKTLA VKTKLDS 
Sbjct: 1   MVLFRISYLRSSSFRLKISTLSTLQLSTISSADLFYDHLQKNNGNVEKTLANVKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVNQVLHKCSFE+SQMGLRFFIWAGRQPNYRHSSFMYSRACE+IGI   P LLFNVIEDY
Sbjct: 61  CVNQVLHKCSFEISQMGLRFFIWAGRQPNYRHSSFMYSRACELIGIKGSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           +REG LVD+R+FKI+LNLCKEAKLAKEALSILGKMPEFHLRADT+MYNLVI LFTEKGEM
Sbjct: 121 KREGYLVDVRMFKIILNLCKEAKLAKEALSILGKMPEFHLRADTSMYNLVISLFTEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           DMAMELMKEMDSVDIHPNMITYISLLKGFCDV RW DAYG+F+AMKE+GC PNTVVYSVL
Sbjct: 181 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVCRWDDAYGLFKAMKESGCPPNTVVYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LNAASR+G ME+LM VLEEMEKQGGTC PNTVTYTSIIQ  CELG+PLEALKILDRMEE+
Sbjct: 241 LNAASRNGIMERLMAVLEEMEKQGGTCTPNTVTYTSIIQSQCELGRPLEALKILDRMEEF 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GCAPNRVT+S LIKEFCK+GHVEEAYKLIDR VARGGASYGDCYSSLVVSLVKMKKIAEA
Sbjct: 301 GCAPNRVTISCLIKEFCKDGHVEEAYKLIDRLVARGGASYGDCYSSLVVSLVKMKKIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLANGVKPDG+AC+L+I+ELCLEERVLDGFNLC E+DRNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLANGVKPDGLACSLMIRELCLEERVLDGFNLCYEIDRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
            GLCEHDHSVDAAKLARLMLKKGIRLKPY AESIIKHLKK  DQELVM+LGGIR
Sbjct: 421 AGLCEHDHSVDAAKLARLMLKKGIRLKPYCAESIIKHLKKSEDQELVMHLGGIR 474

BLAST of ClCG06G010030 vs. NCBI nr
Match: KAG7017159.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 847.8 bits (2189), Expect = 5.2e-242
Identity = 412/474 (86.92%), Postives = 447/474 (94.30%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQKNNGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P LLFNVIEDY
Sbjct: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCLVDI +FK++LNLCKE KLAKEALSILGKM EFHLRADTTMYNLVIRLFTEKG+M
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGDM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMEL+KEMDSVDI PNMITYI++LKGFCDVGR  DAYG+F+ MK+NGC PNTV YSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LN ASRHG +EKLME+LEEMEKQGGTCGPNTVTYTSIIQ LCE+GQPLEALKILDRME++
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDR  ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLANGVKPDGVACTL+IKELCLEERV+DGFNLCNEVDRNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFGDQ LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIR 474

BLAST of ClCG06G010030 vs. NCBI nr
Match: KAG6579716.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 846.7 bits (2186), Expect = 1.2e-241
Identity = 411/474 (86.71%), Postives = 447/474 (94.30%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQKNNGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLATVKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P LLFNVIEDY
Sbjct: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCLVDI +FK++LNLCKE KLAKEALSILGKM EFHLRADTTMYNLVIRLFTEKG+M
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGDM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMEL+KEMDSVDI PNMITYI++LKGFCDVGR  DAYG+F+ MK+NGC PNTV YSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LN ASRHG +EKLME+LEEMEKQGGTCGPNTVTYTSIIQ LCE+GQPLEALKILDRME++
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDR  ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLANGVKPDGVACTL+IKELCLE+RV+DGFNLCNEVDRNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEDRVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFGDQ LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIR 474

BLAST of ClCG06G010030 vs. NCBI nr
Match: XP_022928928.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata])

HSP 1 Score: 844.3 bits (2180), Expect = 5.8e-241
Identity = 411/474 (86.71%), Postives = 445/474 (93.88%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQK NGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKKNGNVEKTLATVKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVNQVLHKCS ELSQMGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P LLFNVIEDY
Sbjct: 61  CVNQVLHKCSLELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCLVDI +FK++LNLCKE KLAKEALSILGKM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMEL+KEMDSVDI PNMITYI++LKGFCDVGR  DAYG+F+ MK+NGC PNTV YSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LN ASRHG +EKLME+LEEMEKQGGTCGPNTVTYTSIIQ LCE+GQPLEALKILDRME++
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDR  ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLANGVKPDGVACTL+IKELCLEERV+DGFNLCNEVDRNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFGDQ LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIR 474

BLAST of ClCG06G010030 vs. NCBI nr
Match: XP_023551479.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 843.6 bits (2178), Expect = 9.8e-241
Identity = 409/474 (86.29%), Postives = 446/474 (94.09%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQKNNGNVEKTL TVKTKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLTTVKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVN+VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P L+FNVIEDY
Sbjct: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLVFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCLVDI +FK++LNLCKE KLAKEALSILG+M EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGEMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMEL+KEMDSVDI PNMITYI++LKGFCDVGR  DAYG+F+ MK+NGC PNTV YSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LN ASRHG +EKLME+LEEMEKQGGTCGPNTVTYTSIIQ LCE+GQPLEALKILDRME++
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDR  ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLANGVKPDGVACTL+IKELCLEERV+DGFNLCNEVDRNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFGDQ LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIR 474

BLAST of ClCG06G010030 vs. ExPASy Swiss-Prot
Match: Q9LVS3 (Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX=3702 GN=At5g47360 PE=2 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 3.4e-119
Identity = 221/466 (47.42%), Postives = 318/466 (68.24%), Query Frame = 0

Query: 79  ISYLRSSSFRLKIPALSTLQ-LSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSGCVNQ 138
           IS L S S R +   +S L+ L+TVS+A+  YG LQ    N+EK LA+   +LDS C+N+
Sbjct: 6   ISRLVSPSLRSQPSKISALRFLTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINE 65

Query: 139 VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREG 198
           VL +C     Q GLRFFIWAG   ++RHS++MY++AC+++ I  +P L+  VIE YR+E 
Sbjct: 66  VLRRCDPNQFQSGLRFFIWAGTLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEE 125

Query: 199 CLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAM 258
           C V+++  +I+L LC +A LA EAL +L K PEF++ ADT  YNLVIRLF +KG++++A 
Sbjct: 126 CFVNVKTMRIVLTLCNQANLADEALWVLRKFPEFNVCADTVAYNLVIRLFADKGDLNIAD 185

Query: 259 ELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVLLNAA 318
            L+KEMD V ++P++ITY S++ G+C+ G+  DA+ + + M ++ C  N+V YS +L   
Sbjct: 186 MLIKEMDCVGLYPDVITYTSMINGYCNAGKIDDAWRLAKEMSKHDCVLNSVTYSRILEGV 245

Query: 319 SRHGTMEKLMEVLEEMEKQ--GGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEYGC 378
            + G ME+ +E+L EMEK+  GG   PN VTYT +IQ  CE  +  EAL +LDRM   GC
Sbjct: 246 CKSGDMERALELLAEMEKEDGGGLISPNAVTYTLVIQAFCEKRRVEEALLVLDRMGNRGC 305

Query: 379 APNRVTVSSLIKEFCKNGH-VEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEAE 438
            PNRVT   LI+   +N   V+   KLID+ V  GG S  +C+SS  VSL++MK+  EAE
Sbjct: 306 MPNRVTACVLIQGVLENDEDVKALSKLIDKLVKLGGVSLSECFSSATVSLIRMKRWEEAE 365

Query: 439 ELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILLV 498
           ++FR ML  GV+PDG+AC+ V +ELCL ER LD F L  E+++    ++IDSD++++LL+
Sbjct: 366 KIFRLMLVRGVRPDGLACSHVFRELCLLERYLDCFLLYQEIEKKDVKSTIDSDIHAVLLL 425

Query: 499 GLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELV 541
           GLC+  +S +AAKLA+ ML K +RLK    E II+ LKK GD++L+
Sbjct: 426 GLCQQGNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLM 471

BLAST of ClCG06G010030 vs. ExPASy Swiss-Prot
Match: P0C8A0 (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX=3702 GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 1.7e-46
Identity = 122/449 (27.17%), Postives = 229/449 (51.00%), Query Frame = 0

Query: 109 YGHLQKNNGNVEK-TLATVKTKLD--SGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRH 168
           Y  L+ ++  V K  LA  ++ +D   G + +VL +C  +   +G RFF+WA +QP Y H
Sbjct: 71  YRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCG-DAGNLGYRFFLWATKQPGYFH 130

Query: 169 SSFMYSRACEMIGIHRRPGLLFNVIEDYRREGC-LVDIRIFKIMLNLCKEAKLAKEALSI 228
           S  +      ++   R+ G ++ +IE+ R+    L++  +F +++     A + K+A+ +
Sbjct: 131 SYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEV 190

Query: 229 LGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCD 288
           L +MP++ L  D  ++  ++    + G +  A ++ ++M      PN+  + SLL G+C 
Sbjct: 191 LDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMRE-KFPPNLRYFTSLLYGWCR 250

Query: 289 VGRWADAYGVFEAMKENGCEPNTVVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNT 348
            G+  +A  V   MKE G EP+ VV++ LL+  +  G M    +++ +M K+G    PN 
Sbjct: 251 EGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRG--FEPNV 310

Query: 349 VTYTSIIQRLCELGQPL-EALKILDRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLID 408
             YT +IQ LC   + + EA+++   ME YGC  + VT ++LI  FCK G +++ Y ++D
Sbjct: 311 NCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLD 370

Query: 409 RAVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGVKPDGVACTLVIKELCLEE 468
               +G       Y  ++V+  K ++  E  EL   M   G  PD +   +VI+  C   
Sbjct: 371 DMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLG 430

Query: 469 RVLDGFNLCNEVDRNGYLTSIDSDVYSILLVGLCEHDHSVDAAKLARLMLKKGIRLKPYF 528
            V +   L NE++ NG    +D+  + I++ G       ++A    + M+ +GI   P +
Sbjct: 431 EVKEAVRLWNEMEANGLSPGVDT--FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQY 490

Query: 529 A--ESIIKHLKKFGDQELVMNLGGIRNDK 551
              +S++ +L +    E+  ++    ++K
Sbjct: 491 GTLKSLLNNLVRDDKLEMAKDVWSCISNK 513

BLAST of ClCG06G010030 vs. ExPASy Swiss-Prot
Match: Q9LPX2 (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 1.5e-37
Identity = 88/309 (28.48%), Postives = 166/309 (53.72%), Query Frame = 0

Query: 211 LCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAMELMKEMDSVDIHP 270
           +CK  + A  A+ +L KM E +++ D   Y+++I    + G +D A  L  EM+      
Sbjct: 238 MCKSGQTAL-AMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKA 297

Query: 271 NMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVLLNAASRHGTMEKLMEVL 330
           ++ITY +L+ GFC+ GRW D   +   M +    PN V +SVL+++  + G + +  ++L
Sbjct: 298 DIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREADQLL 357

Query: 331 EEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEYGCAPNRVTVSSLIKEFC 390
           +EM ++G    PNT+TY S+I   C+  +  EA++++D M   GC P+ +T + LI  +C
Sbjct: 358 KEMMQRG--IAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGYC 417

Query: 391 KNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGVKPDGV 450
           K   +++  +L      RG  +    Y++LV    +  K+  A++LF+ M++  V+PD V
Sbjct: 418 KANRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIV 477

Query: 451 ACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILLVGLCEHDHSVDAAKLAR 510
           +  +++  LC    +     +  +++++     +D  +Y I++ G+C      DA  L  
Sbjct: 478 SYKILLDGLCDNGELEKALEIFGKIEKS--KMELDIGIYMIIIHGMCNASKVDDAWDLFC 537

Query: 511 LMLKKGIRL 520
            +  KG++L
Sbjct: 538 SLPLKGVKL 541

BLAST of ClCG06G010030 vs. ExPASy Swiss-Prot
Match: O49436 (Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX=3702 GN=EMB1025 PE=3 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 1.9e-37
Identity = 109/339 (32.15%), Postives = 173/339 (51.03%), Query Frame = 0

Query: 189 VIEDYRREGCLVDIRIFKIMLN-LCKEAKLAK-----EALSILGKMPEFHLRADTTMYNL 248
           ++++ + EGC     I+ ++++ LCK+  L +     + + + G +P      +   YN 
Sbjct: 244 LLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVP------NEVTYNT 303

Query: 249 VIRLFTEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENG 308
           +I     KG++D A+ L++ M S    PN +TY +L+ G     R  DA  +  +M+E G
Sbjct: 304 LIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERG 363

Query: 309 CEPNTVVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLE 368
              N  +YSVL++   + G  E+ M +  +M ++G  C PN V Y+ ++  LC  G+P E
Sbjct: 364 YHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKG--CKPNIVVYSVLVDGLCREGKPNE 423

Query: 369 ALKILDRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVV 428
           A +IL+RM   GC PN  T SSL+K F K G  EEA ++       G +    CYS L+ 
Sbjct: 424 AKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRNKFCYSVLID 483

Query: 429 SLVKMKKIAEAEELFRNMLANGVKPDGVACTLVIKELC---LEERVLDGFN--LCNEVDR 488
            L  + ++ EA  ++  ML  G+KPD VA + +IK LC     +  L  ++  LC E  +
Sbjct: 484 GLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLCGIGSMDAALKLYHEMLCQEEPK 543

Query: 489 NGYLTSIDSDVYSILLVGLCEHDHSVDAAKLARLMLKKG 517
               +  D   Y+ILL GLC       A  L   ML +G
Sbjct: 544 ----SQPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRG 570

BLAST of ClCG06G010030 vs. ExPASy Swiss-Prot
Match: Q9ZVX5 (Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana OX=3702 GN=At2g16880 PE=2 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 4.3e-37
Identity = 103/374 (27.54%), Postives = 186/374 (49.73%), Query Frame = 0

Query: 176 MIGIHRRPGLLF-----NVIEDYRREGCLVDIRIFKIMLN-LCKEAKLAKEALSILGKM- 235
           +IG+ R P          V +D  + G  ++++ F +++N  C E KL ++AL +L +M 
Sbjct: 173 LIGLVRYPSSFSISSAREVFDDMVKIGVSLNVQTFNVLVNGYCLEGKL-EDALGMLERMV 232

Query: 236 PEFHLRADTTMYNLVIRLFTEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRW 295
            EF +  D   YN +++  ++KG +    EL+ +M    + PN +TY +L+ G+C +G  
Sbjct: 233 SEFKVNPDNVTYNTILKAMSKKGRLSDLKELLLDMKKNGLVPNRVTYNNLVYGYCKLGSL 292

Query: 296 ADAYGVFEAMKENGCEPNTVVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYT 355
            +A+ + E MK+    P+   Y++L+N     G+M + +E+++ M+       P+ VTY 
Sbjct: 293 KEAFQIVELMKQTNVLPDLCTYNILINGLCNAGSMREGLELMDAMKSL--KLQPDVVTYN 352

Query: 356 SIIQRLCELGQPLEALKILDRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVAR 415
           ++I    ELG  LEA K++++ME  G   N+VT +  +K  CK    E   + +   V  
Sbjct: 353 TLIDGCFELGLSLEARKLMEQMENDGVKANQVTHNISLKWLCKEEKREAVTRKVKELVDM 412

Query: 416 GGASYG-DCYSSLVVSLVKMKKIAEAEELFRNMLANGVKPDGVACTLVIKELCLEERVLD 475
            G S     Y +L+ + +K+  ++ A E+ R M   G+K + +    ++  LC E ++ +
Sbjct: 413 HGFSPDIVTYHTLIKAYLKVGDLSGALEMMREMGQKGIKMNTITLNTILDALCKERKLDE 472

Query: 476 GFNLCNEVDRNGYLTSIDSDVYSILLVGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESI 535
             NL N   + G++  +D   Y  L++G    +    A ++   M K  I        S+
Sbjct: 473 AHNLLNSAHKRGFI--VDEVTYGTLIMGFFREEKVEKALEMWDEMKKVKITPTVSTFNSL 532

Query: 536 IKHLKKFGDQELVM 542
           I  L   G  EL M
Sbjct: 533 IGGLCHHGKTELAM 541

BLAST of ClCG06G010030 vs. ExPASy TrEMBL
Match: A0A6J1EQI2 (pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita moschata OX=3662 GN=LOC111435690 PE=3 SV=1)

HSP 1 Score: 844.3 bits (2180), Expect = 2.8e-241
Identity = 411/474 (86.71%), Postives = 445/474 (93.88%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQK NGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKKNGNVEKTLATVKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVNQVLHKCS ELSQMGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P LLFNVIEDY
Sbjct: 61  CVNQVLHKCSLELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCLVDI +FK++LNLCKE KLAKEALSILGKM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMEL+KEMDSVDI PNMITYI++LKGFCDVGR  DAYG+F+ MK+NGC PNTV YSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKDNGCAPNTVAYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LN ASRHG +EKLME+LEEMEKQGGTCGPNTVTYTSIIQ LCE+GQPLEALKILDRME++
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGTCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDF 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GCAPNRVTVS L+KEFCK+GH+EEAYKLIDR  ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCAPNRVTVSVLVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLANGVKPDGVACTL+IKELCLEERV+DGFNLCNEVDRNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFGDQ LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIR 474

BLAST of ClCG06G010030 vs. ExPASy TrEMBL
Match: A0A6J1I125 (pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita maxima OX=3661 GN=LOC111468919 PE=3 SV=1)

HSP 1 Score: 837.8 bits (2163), Expect = 2.6e-239
Identity = 407/474 (85.86%), Postives = 445/474 (93.88%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQKNNGNVEKTLATV+TKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLATVRTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVNQVLHKCSFELS MGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P LLFNVIEDY
Sbjct: 61  CVNQVLHKCSFELSPMGLRFFIWAGRQPNYRHSSFMYARACELIGLNRSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCL+DI +FK++LNLCKE KLAKEALSILGKM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLLDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMEL+KEMDSVDI PNMITYI++LKGFCDVGR  DAYG+F+ MKENGC PNTV YSVL
Sbjct: 181 DKAMELLKEMDSVDIDPNMITYIAMLKGFCDVGRLEDAYGLFKVMKENGCAPNTVAYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LN ASRHG +EKLME+LEEMEKQGG CGPNTVTYTSIIQ LCE+GQPLEALKILDRME+ 
Sbjct: 241 LNGASRHGDLEKLMELLEEMEKQGGNCGPNTVTYTSIIQSLCEVGQPLEALKILDRMEDS 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GC+PNRVTVS+L+KEFCK+GH+EEAYKLIDR  ARGGASYGDCYSSLV+SL+KMK+IAEA
Sbjct: 301 GCSPNRVTVSALVKEFCKDGHMEEAYKLIDRVAARGGASYGDCYSSLVISLIKMKRIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLANGVKPDGVACTL+IKELCLEERV+DGFNLCNEV+RNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLANGVKPDGVACTLMIKELCLEERVVDGFNLCNEVNRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFG Q+LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGGQDLVMHLGGIR 474

BLAST of ClCG06G010030 vs. ExPASy TrEMBL
Match: A0A0A0LI44 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1)

HSP 1 Score: 813.5 bits (2100), Expect = 5.3e-232
Identity = 401/474 (84.60%), Postives = 438/474 (92.41%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRIS  RSSSF L I  LST  L+T+SS+DLFY HL+K+NGN++KTLAT+KTKLDS 
Sbjct: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVN+VL+KCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACE+IGI+  P LLFNVIEDY
Sbjct: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCLVDIR+FKI+LNLCKEAKLAKEALSIL KM EFHLRADTTMYNLVIRLFTEKGEM
Sbjct: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRADTTMYNLVIRLFTEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMELMKEMDSVDIHPNMITYIS+LKGFCDVGRW DAYG+F+ MKENGC PNTVVYSVL
Sbjct: 181 DKAMELMKEMDSVDIHPNMITYISMLKGFCDVGRWEDAYGLFKDMKENGCAPNTVVYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           +N A R   M++LME+L+EMEKQGGTC PNTVTYTSIIQ LCE G PLEALK+LDRMEEY
Sbjct: 241 VNGAIRLRIMDRLMEMLKEMEKQGGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           G APNRV VS L+KEFCK+GHVEEAYKLIDR VARGG SYGDCYSSLVV+LVKMKKIAEA
Sbjct: 301 GYAPNRVAVSFLVKEFCKDGHVEEAYKLIDRVVARGGVSYGDCYSSLVVTLVKMKKIAEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           E+LFRNMLANGVKPDGVAC+L+I+ELCLEERVLDGFNLC EVDRNGYL SID+D+YS+LL
Sbjct: 361 EKLFRNMLANGVKPDGVACSLMIRELCLEERVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDHSVDAAKLARLMLKKGIRLKP++AESIIKHLKKF D+ELVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIR 474

BLAST of ClCG06G010030 vs. ExPASy TrEMBL
Match: A0A6J1DNK5 (pentatricopeptide repeat-containing protein At5g47360 OS=Momordica charantia OX=3673 GN=LOC111022875 PE=3 SV=1)

HSP 1 Score: 800.8 bits (2067), Expect = 3.5e-228
Identity = 391/474 (82.49%), Postives = 426/474 (89.87%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALF I   RS SF LKI  LS L LSTVSSADLFY HLQKNNGNVEK LATVKT LDS 
Sbjct: 1   MALFGIFSFRSFSFGLKISKLSALHLSTVSSADLFYDHLQKNNGNVEKILATVKTTLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVNQVLHKCSFELS MGLRFFIWAGRQPNYRHSSFMYSRACE+IGI R P LL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSLMGLRFFIWAGRQPNYRHSSFMYSRACELIGIDRSPCLLLNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGC+VDIR+FK+MLNLCKEAKLA EAL ILGKMPEFHLRADTT+YNLV+RLF EKGEM
Sbjct: 121 RREGCVVDIRMFKVMLNLCKEAKLANEALLILGKMPEFHLRADTTIYNLVVRLFIEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AM+LM+EMDS+DIHPNMITYI++LKGFCDVGR  DAYG+F+AMKENGC PNT+ YS+L
Sbjct: 181 DKAMKLMEEMDSIDIHPNMITYIAMLKGFCDVGRLEDAYGLFKAMKENGCSPNTLAYSIL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           LN ASR G  EK+ME+LEEMEK+GG C PNTVTYTSIIQ LCELGQPLEALKILDRME  
Sbjct: 241 LNGASRQGITEKIMELLEEMEKEGGNCSPNTVTYTSIIQSLCELGQPLEALKILDRMENS 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           GCAPNRVTV +LIKEFCK+GH+EE Y+LI R VARGG SYGDCYSSLVVSL KMKKIA A
Sbjct: 301 GCAPNRVTVRTLIKEFCKDGHMEEVYELIHRVVARGGTSYGDCYSSLVVSLAKMKKIAAA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           EELFRNMLA+GVKPDGVAC+++IKELCLEERVLDG+NLCNEVDRNGYL+SIDSD+YS+LL
Sbjct: 361 EELFRNMLASGVKPDGVACSVMIKELCLEERVLDGYNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGLCEHDH +DA KLARLMLKKGIRLKP++A+ +IKHL KFGDQELVM LGGIR
Sbjct: 421 VGLCEHDHPMDAEKLARLMLKKGIRLKPHYADHVIKHLNKFGDQELVMQLGGIR 474

BLAST of ClCG06G010030 vs. ExPASy TrEMBL
Match: A0A1S3B4L9 (pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN=LOC103485755 PE=4 SV=1)

HSP 1 Score: 788.9 bits (2036), Expect = 1.4e-224
Identity = 395/474 (83.33%), Postives = 429/474 (90.51%), Query Frame = 0

Query: 74  MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 133
           MALFRISY RSSS  L I  LST  LST+SS+DLFY HL+KNNGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRISYPRSSSILLNISTLSTFHLSTLSSSDLFYDHLEKNNGNVEKTLATVKTKLDSR 60

Query: 134 CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 193
           CVN+VL+KCS ELSQMGLRFFIWAGRQPNYRH+SFMYSRACE+IGI+  P LLFNVIEDY
Sbjct: 61  CVNEVLYKCSSELSQMGLRFFIWAGRQPNYRHTSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 194 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEM 253
           RREGCLVDIRIF+I+LNLCKEAKL KEALSIL KM EFHLRADTT+YNLVIRL TEKGEM
Sbjct: 121 RREGCLVDIRIFQIILNLCKEAKLTKEALSILRKMSEFHLRADTTIYNLVIRLCTEKGEM 180

Query: 254 DMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVL 313
           D AMELMKEMDSVDIHPNMITYIS++KGFCDVGRW DAYG+F+AMKENG  PNTVVYSVL
Sbjct: 181 DKAMELMKEMDSVDIHPNMITYISMIKGFCDVGRWEDAYGLFKAMKENGYAPNTVVYSVL 240

Query: 314 LNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEY 373
           +N A R   M+KLME+LEEMEKQGGTC PNTVTYTSIIQ LCE G  LEALK+LDRMEEY
Sbjct: 241 VNGAVRLRIMDKLMEMLEEMEKQGGTCRPNTVTYTSIIQSLCEQGFLLEALKVLDRMEEY 300

Query: 374 GCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEA 433
           G APNRV V  L+KEFCK+GHVEEAYKLIDR VARGGASYGDC SSLV+SLVKMKKI EA
Sbjct: 301 GHAPNRVAVGYLVKEFCKDGHVEEAYKLIDRVVARGGASYGDCCSSLVISLVKMKKIPEA 360

Query: 434 EELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILL 493
           E+LFRNMLANGVKPDGVAC+L+I+ELCLEERVLDGF+LC EVDRNGYL  ID+DVYS+LL
Sbjct: 361 EKLFRNMLANGVKPDGVACSLMIRELCLEERVLDGFSLCYEVDRNGYLCYIDADVYSLLL 420

Query: 494 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 548
           VGL +HDHSVDAA LARLMLKKGIRLKP++AESIIKHLKKF DQEL+M+LGGIR
Sbjct: 421 VGLYQHDHSVDAAILARLMLKKGIRLKPHYAESIIKHLKKFEDQELIMHLGGIR 474

BLAST of ClCG06G010030 vs. TAIR 10
Match: AT5G47360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 430.3 bits (1105), Expect = 2.4e-120
Identity = 221/466 (47.42%), Postives = 318/466 (68.24%), Query Frame = 0

Query: 79  ISYLRSSSFRLKIPALSTLQ-LSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSGCVNQ 138
           IS L S S R +   +S L+ L+TVS+A+  YG LQ    N+EK LA+   +LDS C+N+
Sbjct: 6   ISRLVSPSLRSQPSKISALRFLTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINE 65

Query: 139 VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREG 198
           VL +C     Q GLRFFIWAG   ++RHS++MY++AC+++ I  +P L+  VIE YR+E 
Sbjct: 66  VLRRCDPNQFQSGLRFFIWAGTLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEE 125

Query: 199 CLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAM 258
           C V+++  +I+L LC +A LA EAL +L K PEF++ ADT  YNLVIRLF +KG++++A 
Sbjct: 126 CFVNVKTMRIVLTLCNQANLADEALWVLRKFPEFNVCADTVAYNLVIRLFADKGDLNIAD 185

Query: 259 ELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVLLNAA 318
            L+KEMD V ++P++ITY S++ G+C+ G+  DA+ + + M ++ C  N+V YS +L   
Sbjct: 186 MLIKEMDCVGLYPDVITYTSMINGYCNAGKIDDAWRLAKEMSKHDCVLNSVTYSRILEGV 245

Query: 319 SRHGTMEKLMEVLEEMEKQ--GGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEYGC 378
            + G ME+ +E+L EMEK+  GG   PN VTYT +IQ  CE  +  EAL +LDRM   GC
Sbjct: 246 CKSGDMERALELLAEMEKEDGGGLISPNAVTYTLVIQAFCEKRRVEEALLVLDRMGNRGC 305

Query: 379 APNRVTVSSLIKEFCKNGH-VEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEAE 438
            PNRVT   LI+   +N   V+   KLID+ V  GG S  +C+SS  VSL++MK+  EAE
Sbjct: 306 MPNRVTACVLIQGVLENDEDVKALSKLIDKLVKLGGVSLSECFSSATVSLIRMKRWEEAE 365

Query: 439 ELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILLV 498
           ++FR ML  GV+PDG+AC+ V +ELCL ER LD F L  E+++    ++IDSD++++LL+
Sbjct: 366 KIFRLMLVRGVRPDGLACSHVFRELCLLERYLDCFLLYQEIEKKDVKSTIDSDIHAVLLL 425

Query: 499 GLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELV 541
           GLC+  +S +AAKLA+ ML K +RLK    E II+ LKK GD++L+
Sbjct: 426 GLCQQGNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLM 471

BLAST of ClCG06G010030 vs. TAIR 10
Match: AT3G49730.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 188.7 bits (478), Expect = 1.2e-47
Identity = 122/449 (27.17%), Postives = 229/449 (51.00%), Query Frame = 0

Query: 109 YGHLQKNNGNVEK-TLATVKTKLD--SGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRH 168
           Y  L+ ++  V K  LA  ++ +D   G + +VL +C  +   +G RFF+WA +QP Y H
Sbjct: 71  YRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCG-DAGNLGYRFFLWATKQPGYFH 130

Query: 169 SSFMYSRACEMIGIHRRPGLLFNVIEDYRREGC-LVDIRIFKIMLNLCKEAKLAKEALSI 228
           S  +      ++   R+ G ++ +IE+ R+    L++  +F +++     A + K+A+ +
Sbjct: 131 SYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEV 190

Query: 229 LGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCD 288
           L +MP++ L  D  ++  ++    + G +  A ++ ++M      PN+  + SLL G+C 
Sbjct: 191 LDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMRE-KFPPNLRYFTSLLYGWCR 250

Query: 289 VGRWADAYGVFEAMKENGCEPNTVVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNT 348
            G+  +A  V   MKE G EP+ VV++ LL+  +  G M    +++ +M K+G    PN 
Sbjct: 251 EGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRG--FEPNV 310

Query: 349 VTYTSIIQRLCELGQPL-EALKILDRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLID 408
             YT +IQ LC   + + EA+++   ME YGC  + VT ++LI  FCK G +++ Y ++D
Sbjct: 311 NCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLD 370

Query: 409 RAVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGVKPDGVACTLVIKELCLEE 468
               +G       Y  ++V+  K ++  E  EL   M   G  PD +   +VI+  C   
Sbjct: 371 DMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLG 430

Query: 469 RVLDGFNLCNEVDRNGYLTSIDSDVYSILLVGLCEHDHSVDAAKLARLMLKKGIRLKPYF 528
            V +   L NE++ NG    +D+  + I++ G       ++A    + M+ +GI   P +
Sbjct: 431 EVKEAVRLWNEMEANGLSPGVDT--FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQY 490

Query: 529 A--ESIIKHLKKFGDQELVMNLGGIRNDK 551
              +S++ +L +    E+  ++    ++K
Sbjct: 491 GTLKSLLNNLVRDDKLEMAKDVWSCISNK 513

BLAST of ClCG06G010030 vs. TAIR 10
Match: AT1G12775.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 159.1 bits (401), Expect = 1.0e-38
Identity = 88/309 (28.48%), Postives = 166/309 (53.72%), Query Frame = 0

Query: 211 LCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAMELMKEMDSVDIHP 270
           +CK  + A  A+ +L KM E +++ D   Y+++I    + G +D A  L  EM+      
Sbjct: 238 MCKSGQTAL-AMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKA 297

Query: 271 NMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVLLNAASRHGTMEKLMEVL 330
           ++ITY +L+ GFC+ GRW D   +   M +    PN V +SVL+++  + G + +  ++L
Sbjct: 298 DIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREADQLL 357

Query: 331 EEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEYGCAPNRVTVSSLIKEFC 390
           +EM ++G    PNT+TY S+I   C+  +  EA++++D M   GC P+ +T + LI  +C
Sbjct: 358 KEMMQRG--IAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGYC 417

Query: 391 KNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGVKPDGV 450
           K   +++  +L      RG  +    Y++LV    +  K+  A++LF+ M++  V+PD V
Sbjct: 418 KANRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDIV 477

Query: 451 ACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILLVGLCEHDHSVDAAKLAR 510
           +  +++  LC    +     +  +++++     +D  +Y I++ G+C      DA  L  
Sbjct: 478 SYKILLDGLCDNGELEKALEIFGKIEKS--KMELDIGIYMIIIHGMCNASKVDDAWDLFC 537

Query: 511 LMLKKGIRL 520
            +  KG++L
Sbjct: 538 SLPLKGVKL 541

BLAST of ClCG06G010030 vs. TAIR 10
Match: AT4G20090.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 158.7 bits (400), Expect = 1.4e-38
Identity = 109/339 (32.15%), Postives = 173/339 (51.03%), Query Frame = 0

Query: 189 VIEDYRREGCLVDIRIFKIMLN-LCKEAKLAK-----EALSILGKMPEFHLRADTTMYNL 248
           ++++ + EGC     I+ ++++ LCK+  L +     + + + G +P      +   YN 
Sbjct: 244 LLDEMQSEGCSPSPVIYNVLIDGLCKKGDLTRVTKLVDNMFLKGCVP------NEVTYNT 303

Query: 249 VIRLFTEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENG 308
           +I     KG++D A+ L++ M S    PN +TY +L+ G     R  DA  +  +M+E G
Sbjct: 304 LIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERG 363

Query: 309 CEPNTVVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLE 368
              N  +YSVL++   + G  E+ M +  +M ++G  C PN V Y+ ++  LC  G+P E
Sbjct: 364 YHLNQHIYSVLISGLFKEGKAEEAMSLWRKMAEKG--CKPNIVVYSVLVDGLCREGKPNE 423

Query: 369 ALKILDRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVV 428
           A +IL+RM   GC PN  T SSL+K F K G  EEA ++       G +    CYS L+ 
Sbjct: 424 AKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTGCSRNKFCYSVLID 483

Query: 429 SLVKMKKIAEAEELFRNMLANGVKPDGVACTLVIKELC---LEERVLDGFN--LCNEVDR 488
            L  + ++ EA  ++  ML  G+KPD VA + +IK LC     +  L  ++  LC E  +
Sbjct: 484 GLCGVGRVKEAMMVWSKMLTIGIKPDTVAYSSIIKGLCGIGSMDAALKLYHEMLCQEEPK 543

Query: 489 NGYLTSIDSDVYSILLVGLCEHDHSVDAAKLARLMLKKG 517
               +  D   Y+ILL GLC       A  L   ML +G
Sbjct: 544 ----SQPDVVTYNILLDGLCMQKDISRAVDLLNSMLDRG 570

BLAST of ClCG06G010030 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 157.5 bits (397), Expect = 3.0e-38
Identity = 83/331 (25.08%), Postives = 177/331 (53.47%), Query Frame = 0

Query: 189 VIEDYRREGCLVDIRIFKIMLN-LCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLF 248
           V++   +  C  D+  + I++   C+++ +   A+ +L +M +     D   YN+++   
Sbjct: 226 VLDRMLQRDCYPDVITYTILIEATCRDSGVG-HAMKLLDEMRDRGCTPDVVTYNVLVNGI 285

Query: 249 TEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNT 308
            ++G +D A++ + +M S    PN+IT+  +L+  C  GRW DA  +   M   G  P+ 
Sbjct: 286 CKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSV 345

Query: 309 VVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKIL 368
           V +++L+N   R G + + +++LE+M + G  C PN+++Y  ++   C+  +   A++ L
Sbjct: 346 VTFNILINFLCRKGLLGRAIDILEKMPQHG--CQPNSLSYNPLLHGFCKEKKMDRAIEYL 405

Query: 369 DRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKM 428
           +RM   GC P+ VT ++++   CK+G VE+A +++++  ++G +     Y++++  L K 
Sbjct: 406 ERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKA 465

Query: 429 KKIAEAEELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSD 488
            K  +A +L   M A  +KPD +  + ++  L  E +V +     +E +R G     ++ 
Sbjct: 466 GKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRP--NAV 525

Query: 489 VYSILLVGLCEHDHSVDAAKLARLMLKKGIR 519
            ++ +++GLC+   +  A      M+ +G +
Sbjct: 526 TFNSIMLGLCKSRQTDRAIDFLVFMINRGCK 551

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038874415.11.1e-24288.82pentatricopeptide repeat-containing protein At5g47360 [Benincasa hispida] >XP_03... [more]
KAG7017159.15.2e-24286.92Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
KAG6579716.11.2e-24186.71Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022928928.15.8e-24186.71pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata][more]
XP_023551479.19.8e-24186.29pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
Q9LVS33.4e-11947.42Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX... [more]
P0C8A01.7e-4627.17Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX... [more]
Q9LPX21.5e-3728.48Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop... [more]
O494361.9e-3732.15Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana OX... [more]
Q9ZVX54.3e-3727.54Pentatricopeptide repeat-containing protein At2g16880 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1EQI22.8e-24186.71pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita moschata OX=3... [more]
A0A6J1I1252.6e-23985.86pentatricopeptide repeat-containing protein At5g47360 OS=Cucurbita maxima OX=366... [more]
A0A0A0LI445.3e-23284.60Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1[more]
A0A6J1DNK53.5e-22882.49pentatricopeptide repeat-containing protein At5g47360 OS=Momordica charantia OX=... [more]
A0A1S3B4L91.4e-22483.33pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT5G47360.12.4e-12047.42Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G49730.11.2e-4727.17Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G12775.11.0e-3828.48Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G20090.11.4e-3832.15Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G09900.13.0e-3825.08Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 416..445
e-value: 0.0035
score: 17.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 380..409
e-value: 0.0013
score: 16.7
coord: 308..337
e-value: 0.0031
score: 15.6
coord: 273..307
e-value: 1.0E-8
score: 32.8
coord: 239..271
e-value: 6.4E-5
score: 20.9
coord: 416..448
e-value: 1.8E-6
score: 25.8
coord: 345..378
e-value: 2.3E-8
score: 31.7
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 342..391
e-value: 3.7E-14
score: 52.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 413..447
score: 9.898111
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 485..519
score: 8.933517
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 378..412
score: 9.656963
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 236..270
score: 10.665402
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 306..340
score: 9.700809
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..305
score: 13.054966
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 343..377
score: 12.616514
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 415..550
e-value: 2.4E-16
score: 62.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 266..335
e-value: 8.1E-21
score: 76.4
coord: 110..265
e-value: 3.9E-17
score: 64.4
coord: 336..406
e-value: 1.8E-19
score: 72.0
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 344..442
IPR033443Pentacotripeptide-repeat region of PRORPPFAMPF17177PPR_longcoord: 191..334
e-value: 9.7E-17
score: 61.0
NoneNo IPR availablePANTHERPTHR45613PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 92..543
NoneNo IPR availablePANTHERPTHR45613:SF82PPR CONTAINING PLANT-LIKE PROTEINcoord: 92..543

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG06G010030.1ClCG06G010030.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding