ClCG01G019660 (gene) Watermelon (Charleston Gray) v2.5

Overview
NameClCG01G019660
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCG_Chr01: 33879611 .. 33881991 (-)
RNA-Seq ExpressionClCG01G019660
SyntenyClCG01G019660
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTCTCTCTAAAAGCAGAGTCTCCATTACAATCAACATGGGCCCAGACCATGTCTCTGCTTGTAAACTGCTCAAACATGAAGCAATTGAAACAAATTCACGCTCAAATGATCAAAACAGAGATCGTCACAGAACCCAAATTAGCTACAAAGTTTCTAACCCTCTGCACTTCACCCCATTTCGGCGATTTGCTTTACGCGCAAAAGGTCTTCAATGGAATCACCAGCCCCAACACTTTCATGTGGAACGCCATTATAAGAGCTTACTGTAACAGTAACGAACCAGAATTAGCATTTCTCTTGTATCAGCAAATGCTTTCTTCTTCGGTACCGCACAACTCCTACACCTTCCCTTTCGTGCTCAAAGCTTGTCGTAATTTGTCGGCCATGGGTGAGGCCCTCCAAGTTCATGGACTGGTTTTCAAACTGGGATTTGGGTCGGATGTTTTTGCATTGAATGCTCTGCTTCATGTCTACACTTTGTGTGGTGACATTAATTATGCACGCCAACTGTTTGATAATATTCCTGAAAGAGATGTTGTTTCTTGGAACATAATGATTGATGGGTATATCAAATCTGGGGATGTAAAAACGGCTTATGGGATTTTCTTGGACATGCCATTGAAAAATGTGGTCTCGTGGACGTCGCTGATTTCGGGGCTTGTTGAGGCAGGACTGAGCGTAGAAGCTTTGAATCTTTGTTATGAGATGCAGAGTGCAGGATTTGAACTTGATGGTGTTGCTATTGCGAGTTTGCTTACTGCTTGTGCAAATCTTGGAGCGTTGGATCAAGGAAGATGGCTCCATTTCTATTTGCTCAACAATGGAGTCCACATCGATCGAGTAACTGGCTGTGCTCTGGTGAATATGTACTTAAAATGTGGGGAAATGGAAGAAGCCTTTAGATTGTTTGGGAAACTGAAGAGCGATCAGAAAGATGTGTATGTTTGGACGGCCATGATCGATGGCTTTGCCATTCATGGGCGTGGAGTGGAAGCTCTGGAATGGTTTAACCGAATGCAGAGAGAAGGAATAAGACCAAATTCCATCACTTTCACTGCAGTTTTAAGGGCCTGTAGCTATGCAGGACTGGTTGAAGAAGGAAAAGTGTTATTCGAGAGCATGAGATGTCTTTACAACTTGAGCCCATCTATTGAGCATTTTGGGTGTATGGTTGATCTTTTGGGTCGAGCTGGGCTGCTGGATAAAGCGAAGGAGTTGATCAAGAAGATGCCCATGAAACCTAATGCTGTAATATGGGGAGCTTTGCTAAAGGTAGGGGGCAAATTTTATATGCCTACTTTTTTAAATCATGGGTTAAGTATATGTTTCGGGATGATATTGAGTGAGTTATTAGGAATGTGCACATAGGAGAGCAAGAATCTTCACCTAGAAGAGAAAAGGTTGGATTTCTGTCAACTTCATGATAGATTATACTTGGAAATTTAGAAGAGAAATTAATAACTTCCATCTATTCAGTGGCCTTATTCTCTTAAATATGAGTAAACTCTCAGAGTAATAAGTATCATGCTAGCTACATAAGCTTACCACAAGCCAAATTTAACAACCACAAAATGAAGTGAGTATACTTAAAATATTAAAATGTAAGTAACTAAATTAAAACTTCTTAATTATGATTTTAATTGATTAGAAAAGTTGAGAATCTACATAAGCTTACTACTCCTTCCTCCTCGTTCAGGCCTGTTGGATTCATAGAGATTTTCTGGTGGGTAGCCAAATCGGAGCCCACCTGGTGGAAGTCGATTCAGATCATAGCGGGCGGTACATTCAGTTGGCTACCATTTTAGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAAGTGAGGTTGAAGATGAAGAATCTGAGAGTCCCAATTCCCCCCGGAAAGAGTTCAATAACTTTGAATGGCGTTGTTCATGAATTTCTTGCTGGGCATCAAGATCATCCACAGATGGAGCAGATTCATTTGAAACTGAAACAGGTTGCCGAGAGGCTACGACAAGATGAAAGGTACTTCTAAATCACCATTAAATCCAAAAACTTAAGCTGATAACTTATCGTATGTATATCTAACATTTTGTTAATACTCATCTTTTTACAGTTATGAACCTGTAACTAAAGATTTATTACTTGATCTTGAGAATGAGGAGAAAGAGACTACGATGGCTCAACATAGCGAGAAGTTGGCTATTGCTTTTGGATTGATCAATACGAAACCAGGAGCGACGATTCGAGTTATTAAGAATCTTAGAGTCTGTAGAGATTGTCACACTGTTGCAAAGCTCATATCTCAAATCTATTGTAGAGAGATTATAATGCGAGATAGAGTTCGATTCCACCATTTTAGAGATGGGAATTGTTCTTGCAAAGATTATTGGTAG

mRNA sequence

ATGTTCTCTCTAAAAGCAGAGTCTCCATTACAATCAACATGGGCCCAGACCATGTCTCTGCTTGTAAACTGCTCAAACATGAAGCAATTGAAACAAATTCACGCTCAAATGATCAAAACAGAGATCGTCACAGAACCCAAATTAGCTACAAAGTTTCTAACCCTCTGCACTTCACCCCATTTCGGCGATTTGCTTTACGCGCAAAAGGTCTTCAATGGAATCACCAGCCCCAACACTTTCATGTGGAACGCCATTATAAGAGCTTACTGTAACAGTAACGAACCAGAATTAGCATTTCTCTTGTATCAGCAAATGCTTTCTTCTTCGGTACCGCACAACTCCTACACCTTCCCTTTCGTGCTCAAAGCTTGTCGTAATTTGTCGGCCATGGGTGAGGCCCTCCAAGTTCATGGACTGGTTTTCAAACTGGGATTTGGGTCGGATGTTTTTGCATTGAATGCTCTGCTTCATGTCTACACTTTGTGTGGTGACATTAATTATGCACGCCAACTGTTTGATAATATTCCTGAAAGAGATGTTGTTTCTTGGAACATAATGATTGATGGGTATATCAAATCTGGGGATGTAAAAACGGCTTATGGGATTTTCTTGGACATGCCATTGAAAAATGTGGTCTCGTGGACGTCGCTGATTTCGGGGCTTGTTGAGGCAGGACTGAGCGTAGAAGCTTTGAATCTTTGTTATGAGATGCAGAGTGCAGGATTTGAACTTGATGGTGTTGCTATTGCGAGTTTGCTTACTGCTTGTGCAAATCTTGGAGCGTTGGATCAAGGAAGATGGCTCCATTTCTATTTGCTCAACAATGGAGTCCACATCGATCGAGTAACTGGCTGTGCTCTGGTGAATATGTACTTAAAATGTGGGGAAATGGAAGAAGCCTTTAGATTGTTTGGGAAACTGAAGAGCGATCAGAAAGATGTGTATGTTTGGACGGCCATGATCGATGGCTTTGCCATTCATGGGCGTGGAGTGGAAGCTCTGGAATGGTTTAACCGAATGCAGAGAGAAGGAATAAGACCAAATTCCATCACTTTCACTGCAGTTTTAAGGGCCTGTAGCTATGCAGGACTGGTTGAAGAAGGAAAAGTGTTATTCGAGAGCATGAGATGTCTTTACAACTTGAGCCCATCTATTGAGCATTTTGGGTGTATGGTTGATCTTTTGGGTCGAGCTGGGCTGCTGGATAAAGCGAAGGAGTTGATCAAGAAGATGCCCATGAAACCTAATGCTGTAATATGGGGAGCTTTGCTAAAGGCCTGTTGGATTCATAGAGATTTTCTGGTGGGTAGCCAAATCGGAGCCCACCTGGTGGAAGTCGATTCAGATCATAGCGGGCGGTACATTCAGTTGGCTACCATTTTAGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAAGTGAGGTTGAAGATGAAGAATCTGAGAGTCCCAATTCCCCCCGGAAAGAGTTCAATAACTTTGAATGGCGTTGTTCATGAATTTCTTGCTGGGCATCAAGATCATCCACAGATGGAGCAGATTCATTTGAAACTGAAACAGGTTGCCGAGAGGCTACGACAAGATGAAAGTTATGAACCTGTAACTAAAGATTTATTACTTGATCTTGAGAATGAGGAGAAAGAGACTACGATGGCTCAACATAGCGAGAAGTTGGCTATTGCTTTTGGATTGATCAATACGAAACCAGGAGCGACGATTCGAGTTATTAAGAATCTTAGAGTCTGTAGAGATTGTCACACTGTTGCAAAGCTCATATCTCAAATCTATTGTAGAGAGATTATAATGCGAGATAGAGTTCGATTCCACCATTTTAGAGATGGGAATTGTTCTTGCAAAGATTATTGGTAG

Coding sequence (CDS)

ATGTTCTCTCTAAAAGCAGAGTCTCCATTACAATCAACATGGGCCCAGACCATGTCTCTGCTTGTAAACTGCTCAAACATGAAGCAATTGAAACAAATTCACGCTCAAATGATCAAAACAGAGATCGTCACAGAACCCAAATTAGCTACAAAGTTTCTAACCCTCTGCACTTCACCCCATTTCGGCGATTTGCTTTACGCGCAAAAGGTCTTCAATGGAATCACCAGCCCCAACACTTTCATGTGGAACGCCATTATAAGAGCTTACTGTAACAGTAACGAACCAGAATTAGCATTTCTCTTGTATCAGCAAATGCTTTCTTCTTCGGTACCGCACAACTCCTACACCTTCCCTTTCGTGCTCAAAGCTTGTCGTAATTTGTCGGCCATGGGTGAGGCCCTCCAAGTTCATGGACTGGTTTTCAAACTGGGATTTGGGTCGGATGTTTTTGCATTGAATGCTCTGCTTCATGTCTACACTTTGTGTGGTGACATTAATTATGCACGCCAACTGTTTGATAATATTCCTGAAAGAGATGTTGTTTCTTGGAACATAATGATTGATGGGTATATCAAATCTGGGGATGTAAAAACGGCTTATGGGATTTTCTTGGACATGCCATTGAAAAATGTGGTCTCGTGGACGTCGCTGATTTCGGGGCTTGTTGAGGCAGGACTGAGCGTAGAAGCTTTGAATCTTTGTTATGAGATGCAGAGTGCAGGATTTGAACTTGATGGTGTTGCTATTGCGAGTTTGCTTACTGCTTGTGCAAATCTTGGAGCGTTGGATCAAGGAAGATGGCTCCATTTCTATTTGCTCAACAATGGAGTCCACATCGATCGAGTAACTGGCTGTGCTCTGGTGAATATGTACTTAAAATGTGGGGAAATGGAAGAAGCCTTTAGATTGTTTGGGAAACTGAAGAGCGATCAGAAAGATGTGTATGTTTGGACGGCCATGATCGATGGCTTTGCCATTCATGGGCGTGGAGTGGAAGCTCTGGAATGGTTTAACCGAATGCAGAGAGAAGGAATAAGACCAAATTCCATCACTTTCACTGCAGTTTTAAGGGCCTGTAGCTATGCAGGACTGGTTGAAGAAGGAAAAGTGTTATTCGAGAGCATGAGATGTCTTTACAACTTGAGCCCATCTATTGAGCATTTTGGGTGTATGGTTGATCTTTTGGGTCGAGCTGGGCTGCTGGATAAAGCGAAGGAGTTGATCAAGAAGATGCCCATGAAACCTAATGCTGTAATATGGGGAGCTTTGCTAAAGGCCTGTTGGATTCATAGAGATTTTCTGGTGGGTAGCCAAATCGGAGCCCACCTGGTGGAAGTCGATTCAGATCATAGCGGGCGGTACATTCAGTTGGCTACCATTTTAGCTGCAGAAGGTAAATGGAAAGAAGCAGCTGAAGTGAGGTTGAAGATGAAGAATCTGAGAGTCCCAATTCCCCCCGGAAAGAGTTCAATAACTTTGAATGGCGTTGTTCATGAATTTCTTGCTGGGCATCAAGATCATCCACAGATGGAGCAGATTCATTTGAAACTGAAACAGGTTGCCGAGAGGCTACGACAAGATGAAAGTTATGAACCTGTAACTAAAGATTTATTACTTGATCTTGAGAATGAGGAGAAAGAGACTACGATGGCTCAACATAGCGAGAAGTTGGCTATTGCTTTTGGATTGATCAATACGAAACCAGGAGCGACGATTCGAGTTATTAAGAATCTTAGAGTCTGTAGAGATTGTCACACTGTTGCAAAGCTCATATCTCAAATCTATTGTAGAGAGATTATAATGCGAGATAGAGTTCGATTCCACCATTTTAGAGATGGGAATTGTTCTTGCAAAGATTATTGGTAG

Protein sequence

MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW
Homology
BLAST of ClCG01G019660 vs. NCBI nr
Match: XP_038882528.1 (pentatricopeptide repeat-containing protein At5g66520 [Benincasa hispida])

HSP 1 Score: 1161.7 bits (3004), Expect = 0.0e+00
Identity = 567/621 (91.30%), Postives = 588/621 (94.69%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKA+SPLQSTWAQTMSLL NCSNMKQLK+IHAQMIKTE  TEPKLATK LTLCTSPH
Sbjct: 1   MFTLKADSPLQSTWAQTMSLLENCSNMKQLKEIHAQMIKTETATEPKLATKLLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
           FGDL YAQ+VFNGIT PNTFMWNAIIRAY NS EPELAFLLYQQMLSSSVPHNSYTFPF+
Sbjct: 61  FGDLPYAQRVFNGITRPNTFMWNAIIRAYSNSKEPELAFLLYQQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRNLSAMGEALQ+HGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIP RDV
Sbjct: 121 LKACRNLSAMGEALQIHGLVIKLGFGSDVFALNALLHVYALCGDIYYARQLFDNIPVRDV 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYG+FLDMPLKNVVSWTSLISGLVEAG SVEAL+LCYEMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGQSVEALSLCYEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDG+AIASLLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMYLKCG+MEEA
Sbjct: 241 GFELDGIAIASLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            R+FGKLKSDQKDVYVWTAMIDGFAIHG GVEALEWFNRMQREGIRPNSITFTAVLRACS
Sbjct: 301 LRVFGKLKSDQKDVYVWTAMIDGFAIHGLGVEALEWFNRMQREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           YAGLV EGK LFESM  LYNL PSIEH+GCMVDLLGRAGLLD+AKELIKKMPMKPNAVIW
Sbjct: 361 YAGLVGEGKELFESMTSLYNLIPSIEHYGCMVDLLGRAGLLDEAKELIKKMPMKPNAVIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATI AAEGKWKEAAEVRLKMKNL
Sbjct: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATIFAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
            V IPPGKSSIT+NGVVHEFLAG QDHPQME+IHLKLKQ+AERLR+DE YEP TKDLLLD
Sbjct: 481 GVSIPPGKSSITVNGVVHEFLAGQQDHPQMEKIHLKLKQIAERLRRDEGYEPSTKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENEEKET MAQHSEKLAIAFGLINTKPG TIRVIKNLRVC DCH VAKLISQIYCR II
Sbjct: 541 LENEEKETAMAQHSEKLAIAFGLINTKPGMTIRVIKNLRVCEDCHVVAKLISQIYCRGII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFR+GNCSCKDYW
Sbjct: 601 MRDRVRFHHFRNGNCSCKDYW 621

BLAST of ClCG01G019660 vs. NCBI nr
Match: XP_008440725.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g66520 [Cucumis melo] >KAA0036221.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK12617.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1140.9 bits (2950), Expect = 0.0e+00
Identity = 552/621 (88.89%), Postives = 586/621 (94.36%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKAESPLQSTW    +LL NCSNMKQLKQI AQMIKT I++EPKLATKFLTLCTSPH
Sbjct: 1   MFTLKAESPLQSTW----TLLENCSNMKQLKQIQAQMIKTAILSEPKLATKFLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
            GDLLYAQ+VFNGITSPNT MWNAIIRAY NS EPELAFLLYQQMLSSSVPHNSYTFPF+
Sbjct: 61  VGDLLYAQRVFNGITSPNTVMWNAIIRAYSNSKEPELAFLLYQQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRNLSA+GEALQVHGLV KLGFGSDVFALNALLHVY LCG+I YARQ+FDNIPERD 
Sbjct: 121 LKACRNLSALGEALQVHGLVIKLGFGSDVFALNALLHVYALCGEIRYARQMFDNIPERDA 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYGIFLDMP KNVVSWTSLISGLV AGLSV+AL+LCYEMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPSKNVVSWTSLISGLVGAGLSVKALSLCYEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIA LLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Sbjct: 241 GFELDGVAIACLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYVKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            R+FGKLK DQKDV +WTAMIDGFAIHGRGVEALEWF+ M+REGIRPNSITFTAVLRACS
Sbjct: 301 LRVFGKLKGDQKDVCIWTAMIDGFAIHGRGVEALEWFDLMRREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           Y GLVEEGK LF+SM+CLYNLSPSIEH+GCMVDLLGR+G L++AKELIK MPMKPNAVIW
Sbjct: 361 YGGLVEEGKELFKSMKCLYNLSPSIEHYGCMVDLLGRSGRLNEAKELIKNMPMKPNAVIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GA LKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAA+GKWKEAAEVRLKMKNL
Sbjct: 421 GAFLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAQGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
            VPI PGKSSITLNG+VHEFLAGHQDHPQMEQIHLKLKQ+AERLRQDE YEP TKDLLLD
Sbjct: 481 GVPISPGKSSITLNGIVHEFLAGHQDHPQMEQIHLKLKQIAERLRQDEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENEEKET +AQHSEKLAIAFGLINTKPG TIRV+KNLR+CRDCHTVAKL+SQIYCREII
Sbjct: 541 LENEEKETAIAQHSEKLAIAFGLINTKPGTTIRVVKNLRICRDCHTVAKLVSQIYCREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFRDG+CSCKDYW
Sbjct: 601 MRDRVRFHHFRDGSCSCKDYW 617

BLAST of ClCG01G019660 vs. NCBI nr
Match: XP_004143583.2 (pentatricopeptide repeat-containing protein At5g66520 [Cucumis sativus] >KGN48837.1 hypothetical protein Csa_002803 [Cucumis sativus])

HSP 1 Score: 1140.2 bits (2948), Expect = 0.0e+00
Identity = 551/621 (88.73%), Postives = 587/621 (94.52%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+L AESPLQSTWA    LL NCSNMKQLKQI AQMIKT I+TEPKLATKFLTLCTSPH
Sbjct: 1   MFTLNAESPLQSTWA----LLENCSNMKQLKQIQAQMIKTAIITEPKLATKFLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
            GDLLYAQ+VFNGITSPNTFMWNAIIRAY NS+EPELAFL YQQMLSSSVPHNSYTFPF+
Sbjct: 61  VGDLLYAQRVFNGITSPNTFMWNAIIRAYSNSDEPELAFLSYQQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           L+ACRNL AMGEALQVHGLV KLGFGSDVFALNALLHVY LCG+I+ ARQLFDNIPERD 
Sbjct: 121 LRACRNLLAMGEALQVHGLVIKLGFGSDVFALNALLHVYALCGEIHCARQLFDNIPERDA 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYG+FLDMPLKNVVSWTSLISGLVEAG SVEAL+LCYEMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGQSVEALSLCYEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYVKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
             +FGKLK +QKDVY+WTAMIDGFAIHGRGVEALEWFNRM+REGIRPNSITFTAVLRACS
Sbjct: 301 LSVFGKLKGNQKDVYIWTAMIDGFAIHGRGVEALEWFNRMRREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           Y GLVEEGK LF+SM+C YN++PSIEH+GCMVDLLGR+G LD+AKELIKKMPMKP+AVIW
Sbjct: 361 YGGLVEEGKELFKSMKCFYNVNPSIEHYGCMVDLLGRSGRLDEAKELIKKMPMKPSAVIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GALLKACWIHRDFL+GSQ+GAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK+L
Sbjct: 421 GALLKACWIHRDFLLGSQVGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKSL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
            VPI PGKSS+TLNG+VHEFLAGHQDHPQMEQI LKLKQ+AERLRQDE YEP TKDLLLD
Sbjct: 481 GVPISPGKSSVTLNGIVHEFLAGHQDHPQMEQIQLKLKQIAERLRQDEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENEEKET MAQHSEKLAIAFGLINTKPG TIRVIKNLR+CRDCHTVAKL+SQIY REII
Sbjct: 541 LENEEKETAMAQHSEKLAIAFGLINTKPGTTIRVIKNLRICRDCHTVAKLVSQIYSREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFRDG+CSCKDYW
Sbjct: 601 MRDRVRFHHFRDGSCSCKDYW 617

BLAST of ClCG01G019660 vs. NCBI nr
Match: XP_022978438.1 (pentatricopeptide repeat-containing protein At5g66520 [Cucurbita maxima])

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 554/621 (89.21%), Postives = 583/621 (93.88%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLCTSPH
Sbjct: 1   MFALKAESPMQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
           FGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFLLY+QMLSSSVPHNSYTFPF+
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYG+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            + FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACS
Sbjct: 301 LQEFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           YAGLVEEGKVLFESM  +Y LSPSIEH+GCMVDLLGRAGLL++AKELIK MPMKPNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYILSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMKPNAIIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
           RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLD
Sbjct: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENE KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLIS+IY REII
Sbjct: 541 LENEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISRIYRREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFR G+CSCKDYW
Sbjct: 601 MRDRVRFHHFRGGSCSCKDYW 621

BLAST of ClCG01G019660 vs. NCBI nr
Match: XP_022949774.1 (pentatricopeptide repeat-containing protein At5g66520 [Cucurbita moschata])

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 553/621 (89.05%), Postives = 582/621 (93.72%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLC SPH
Sbjct: 1   MFALKAESPVQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCISPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
           FGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFLLY+QMLSSSVPHNSYTFPF+
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYG+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            R FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACS
Sbjct: 301 LREFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           YAGLVEEGKVLFESM  +YNLSPSIEH+GCMVDLLGRAGLL++AKELIK MPM+PNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYNLSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMEPNAIIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
           R+PIPPGKSSITLNGVVHEFLAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLD
Sbjct: 481 RLPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LE+E KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLISQIY REII
Sbjct: 541 LESEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISQIYRREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFR GNCSC DYW
Sbjct: 601 MRDRVRFHHFRGGNCSCNDYW 621

BLAST of ClCG01G019660 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 678.3 bits (1749), Expect = 8.1e-194
Identity = 331/614 (53.91%), Postives = 438/614 (71.34%), Query Frame = 0

Query: 10  LQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLL-YAQ 69
           L+    +TMS L  CS  ++LKQIHA+M+KT ++ +    TKFL+ C S    D L YAQ
Sbjct: 10  LEHNLYETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQ 69

Query: 70  KVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFVLKACRNLS 129
            VF+G   P+TF+WN +IR +  S+EPE + LLYQ+ML SS PHN+YTFP +LKAC NLS
Sbjct: 70  IVFDGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLS 129

Query: 130 AMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMID 189
           A  E  Q+H  + KLG+ +DV+A+N+L++ Y + G+   A  LFD IPE D VSWN +I 
Sbjct: 130 AFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIK 189

Query: 190 GYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVA 249
           GY+K+G +  A  +F  M  KN +SWT++ISG V+A ++ EAL L +EMQ++  E D V+
Sbjct: 190 GYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVS 249

Query: 250 IASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLK 309
           +A+ L+ACA LGAL+QG+W+H YL    + +D V GC L++MY KCGEMEEA  +F  +K
Sbjct: 250 LANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIK 309

Query: 310 SDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEG 369
             +K V  WTA+I G+A HG G EA+  F  MQ+ GI+PN ITFTAVL ACSY GLVEEG
Sbjct: 310 --KKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEG 369

Query: 370 KVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACW 429
           K++F SM   YNL P+IEH+GC+VDLLGRAGLLD+AK  I++MP+KPNAVIWGALLKAC 
Sbjct: 370 KLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACR 429

Query: 430 IHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGK 489
           IH++  +G +IG  L+ +D  H GRY+  A I A + KW +AAE R  MK   V   PG 
Sbjct: 430 IHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGC 489

Query: 490 SSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDL-ENEEKE 549
           S+I+L G  HEFLAG + HP++E+I  K + +  R  ++  Y P  +++LLDL +++E+E
Sbjct: 490 STISLEGTTHEFLAGDRSHPEIEKIQSKWR-IMRRKLEENGYVPELEEMLLDLVDDDERE 549

Query: 550 TTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRF 609
             + QHSEKLAI +GLI TKPG  IR++KNLRVC+DCH V KLIS+IY R+I+MRDR RF
Sbjct: 550 AIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRF 609

Query: 610 HHFRDGNCSCKDYW 622
           HHFRDG CSC DYW
Sbjct: 610 HHFRDGKCSCGDYW 620

BLAST of ClCG01G019660 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 1.6e-149
Identity = 274/643 (42.61%), Postives = 404/643 (62.83%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSL---LVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCT 60
           +FS    SP  S  +   SL   + NC  ++ L QIHA  IK+  + +   A + L  C 
Sbjct: 7   LFSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCA 66

Query: 61  SP--HFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPE--LAFLLYQQMLSSS-VPH 120
           +   H  DL YA K+FN +   N F WN IIR +  S+E +  +A  L+ +M+S   V  
Sbjct: 67  TSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEP 126

Query: 121 NSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLF 180
           N +TFP VLKAC     + E  Q+HGL  K GFG D F ++ L+ +Y +CG +  AR LF
Sbjct: 127 NRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLF 186

Query: 181 -DNIPERD-------------VVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLI 240
             NI E+D             +V WN+MIDGY++ GD K A  +F  M  ++VVSW ++I
Sbjct: 187 YKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMI 246

Query: 241 SGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVH 300
           SG    G   +A+ +  EM+      + V + S+L A + LG+L+ G WLH Y  ++G+ 
Sbjct: 247 SGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIR 306

Query: 301 IDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFN 360
           ID V G AL++MY KCG +E+A  +F +L   +++V  W+AMI+GFAIHG+  +A++ F 
Sbjct: 307 IDDVLGSALIDMYSKCGIIEKAIHVFERL--PRENVITWSAMINGFAIHGQAGDAIDCFC 366

Query: 361 RMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRA 420
           +M++ G+RP+ + +  +L ACS+ GLVEEG+  F  M  +  L P IEH+GCMVDLLGR+
Sbjct: 367 KMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRS 426

Query: 421 GLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLA 480
           GLLD+A+E I  MP+KP+ VIW ALL AC +  +  +G ++   L+++    SG Y+ L+
Sbjct: 427 GLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALS 486

Query: 481 TILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLK 540
            + A++G W E +E+RL+MK   +   PG S I ++GV+HEF+     HP+ ++I+  L 
Sbjct: 487 NMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLV 546

Query: 541 QVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNL 600
           +++++LR    Y P+T  +LL+LE E+KE  +  HSEK+A AFGLI+T PG  IR++KNL
Sbjct: 547 EISDKLRL-AGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNL 606

Query: 601 RVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW 622
           R+C DCH+  KLIS++Y R+I +RDR RFHHF+DG+CSC DYW
Sbjct: 607 RICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of ClCG01G019660 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 499.6 bits (1285), Expect = 5.1e-140
Identity = 264/708 (37.29%), Postives = 398/708 (56.21%), Query Frame = 0

Query: 18  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSP 77
           +SL+  C +++QLKQ H  MI+T   ++P  A+K   +     F  L YA+KVF+ I  P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 78  NTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH-NSYTFPFVLKACRNLSAMGEALQV 137
           N+F WN +IRAY +  +P L+   +  M+S S  + N YTFPF++KA   +S++     +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 138 HGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSG-- 197
           HG+  K   GSDVF  N+L+H Y  CGD++ A ++F  I E+DVVSWN MI+G+++ G  
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSP 213

Query: 198 ------------------------------------------------------------ 257
                                                                       
Sbjct: 214 DKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAM 273

Query: 258 ---------------------------------------DVKTAYGIFLDMPLKNVVSWT 317
                                                  D + A  +   MP K++V+W 
Sbjct: 274 LDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWN 333

Query: 318 SLISGLVEAGLSVEALNLCYEMQ-SAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN 377
           +LIS   + G   EAL + +E+Q     +L+ + + S L+ACA +GAL+ GRW+H Y+  
Sbjct: 334 ALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKK 393

Query: 378 NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEAL 437
           +G+ ++     AL++MY KCG++E++  +F  +  +++DV+VW+AMI G A+HG G EA+
Sbjct: 394 HGIRMNFHVTSALIHMYSKCGDLEKSREVFNSV--EKRDVFVWSAMIGGLAMHGCGNEAV 453

Query: 438 EWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDL 497
           + F +MQ   ++PN +TFT V  ACS+ GLV+E + LF  M   Y + P  +H+ C+VD+
Sbjct: 454 DMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDV 513

Query: 498 LGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRY 557
           LGR+G L+KA + I+ MP+ P+  +WGALL AC IH +  +       L+E++  + G +
Sbjct: 514 LGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAH 573

Query: 558 IQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIH 617
           + L+ I A  GKW+  +E+R  M+   +   PG SSI ++G++HEFL+G   HP  E+++
Sbjct: 574 VLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVY 633

Query: 618 LKLKQVAERLRQDESYEPVTKDLLLDLENEE-KETTMAQHSEKLAIAFGLINTKPGATIR 622
            KL +V E+L+ +  YEP    +L  +E EE KE ++  HSEKLAI +GLI+T+    IR
Sbjct: 634 GKLHEVMEKLKSN-GYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIR 693

BLAST of ClCG01G019660 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 490.3 bits (1261), Expect = 3.1e-137
Identity = 246/610 (40.33%), Postives = 391/610 (64.10%), Query Frame = 0

Query: 18  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGD-----LLYAQKVFN 77
           ++LL +CS+   LK IH  +++T ++++  +A++ L LC      +     L YA  +F+
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 78  GITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFVLKACRNLSAMGE 137
            I +PN F++N +IR +    EP  AF  Y QML S +  ++ TFPF++KA   +  +  
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 138 ALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIK 197
             Q H  + + GF +DV+  N+L+H+Y  CG I  A ++F  +  RDVVSW  M+ GY K
Sbjct: 136 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 195

Query: 198 SGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASL 257
            G V+ A  +F +MP +N+ +W+ +I+G  +     +A++L   M+  G   +   + S+
Sbjct: 196 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 255

Query: 258 LTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK 317
           +++CA+LGAL+ G   + Y++ + + ++ + G ALV+M+ +CG++E+A  +F  L   + 
Sbjct: 256 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGL--PET 315

Query: 318 DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLF 377
           D   W+++I G A+HG   +A+ +F++M   G  P  +TFTAVL ACS+ GLVE+G  ++
Sbjct: 316 DSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 375

Query: 378 ESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRD 437
           E+M+  + + P +EH+GC+VD+LGRAG L +A+  I KM +KPNA I GALL AC I+++
Sbjct: 376 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 435

Query: 438 FLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSIT 497
             V  ++G  L++V  +HSG Y+ L+ I A  G+W +   +R  MK   V  PPG S I 
Sbjct: 436 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 495

Query: 498 LNGVVHEFLAG-HQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMA 557
           ++G +++F  G  Q HP+M +I  K +++  ++R    Y+  T D   D++ EEKE+++ 
Sbjct: 496 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRL-IGYKGNTGDAFFDVDEEEKESSIH 555

Query: 558 QHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFR 617
            HSEKLAIA+G++ TKPG TIR++KNLRVC DCHTV KLIS++Y RE+I+RDR RFHHFR
Sbjct: 556 MHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFR 615

Query: 618 DGNCSCKDYW 622
           +G CSC+DYW
Sbjct: 616 NGVCSCRDYW 622

BLAST of ClCG01G019660 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 6.9e-137
Identity = 272/709 (38.36%), Postives = 389/709 (54.87%), Query Frame = 0

Query: 17  TMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLC-TSPHFGDLLYAQKVFNGIT 76
           ++SLL NC  ++ L+ IHAQMIK  +       +K +  C  SPHF  L YA  VF  I 
Sbjct: 36  SLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQ 95

Query: 77  SPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQ 136
            PN  +WN + R +  S++P  A  LY  M+S  +  NSYTFPFVLK+C    A  E  Q
Sbjct: 96  EPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQ 155

Query: 137 VHGLVFKLGFG-------------------------------SDVFALNALLHVYTLCGD 196
           +HG V KLG                                  DV +  AL+  Y   G 
Sbjct: 156 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 215

Query: 197 INYARQLFDNIPERDVVSWNIMIDGYI--------------------------------- 256
           I  A++LFD IP +DVVSWN MI GY                                  
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 257 -------------------------------------KSGDVKTAYGIFLDMPLKNVVSW 316
                                                K G+++TA G+F  +P K+V+SW
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 317 TSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN 376
            +LI G     L  EAL L  EM  +G   + V + S+L ACA+LGA+D GRW+H Y+  
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 395

Query: 377 --NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVE 436
              GV        +L++MY KCG++E A ++F  +    K +  W AMI GFA+HGR   
Sbjct: 396 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL--HKSLSSWNAMIFGFAMHGRADA 455

Query: 437 ALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMV 496
           + + F+RM++ GI+P+ ITF  +L ACS++G+++ G+ +F +M   Y ++P +EH+GCM+
Sbjct: 456 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 515

Query: 497 DLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSG 556
           DLLG +GL  +A+E+I  M M+P+ VIW +LLKAC +H +  +G     +L++++ ++ G
Sbjct: 516 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 575

Query: 557 RYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQ 616
            Y+ L+ I A+ G+W E A+ R  + +  +   PG SSI ++ VVHEF+ G + HP+  +
Sbjct: 576 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 635

Query: 617 IHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATI 622
           I+  L+++ E L +   + P T ++L ++E E KE  +  HSEKLAIAFGLI+TKPG  +
Sbjct: 636 IYGMLEEM-EVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKL 695

BLAST of ClCG01G019660 vs. ExPASy TrEMBL
Match: A0A5D3CKZ8 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G002030 PE=3 SV=1)

HSP 1 Score: 1140.9 bits (2950), Expect = 0.0e+00
Identity = 552/621 (88.89%), Postives = 586/621 (94.36%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKAESPLQSTW    +LL NCSNMKQLKQI AQMIKT I++EPKLATKFLTLCTSPH
Sbjct: 1   MFTLKAESPLQSTW----TLLENCSNMKQLKQIQAQMIKTAILSEPKLATKFLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
            GDLLYAQ+VFNGITSPNT MWNAIIRAY NS EPELAFLLYQQMLSSSVPHNSYTFPF+
Sbjct: 61  VGDLLYAQRVFNGITSPNTVMWNAIIRAYSNSKEPELAFLLYQQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRNLSA+GEALQVHGLV KLGFGSDVFALNALLHVY LCG+I YARQ+FDNIPERD 
Sbjct: 121 LKACRNLSALGEALQVHGLVIKLGFGSDVFALNALLHVYALCGEIRYARQMFDNIPERDA 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYGIFLDMP KNVVSWTSLISGLV AGLSV+AL+LCYEMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPSKNVVSWTSLISGLVGAGLSVKALSLCYEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIA LLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Sbjct: 241 GFELDGVAIACLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYVKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            R+FGKLK DQKDV +WTAMIDGFAIHGRGVEALEWF+ M+REGIRPNSITFTAVLRACS
Sbjct: 301 LRVFGKLKGDQKDVCIWTAMIDGFAIHGRGVEALEWFDLMRREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           Y GLVEEGK LF+SM+CLYNLSPSIEH+GCMVDLLGR+G L++AKELIK MPMKPNAVIW
Sbjct: 361 YGGLVEEGKELFKSMKCLYNLSPSIEHYGCMVDLLGRSGRLNEAKELIKNMPMKPNAVIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GA LKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAA+GKWKEAAEVRLKMKNL
Sbjct: 421 GAFLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAQGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
            VPI PGKSSITLNG+VHEFLAGHQDHPQMEQIHLKLKQ+AERLRQDE YEP TKDLLLD
Sbjct: 481 GVPISPGKSSITLNGIVHEFLAGHQDHPQMEQIHLKLKQIAERLRQDEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENEEKET +AQHSEKLAIAFGLINTKPG TIRV+KNLR+CRDCHTVAKL+SQIYCREII
Sbjct: 541 LENEEKETAIAQHSEKLAIAFGLINTKPGTTIRVVKNLRICRDCHTVAKLVSQIYCREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFRDG+CSCKDYW
Sbjct: 601 MRDRVRFHHFRDGSCSCKDYW 617

BLAST of ClCG01G019660 vs. ExPASy TrEMBL
Match: A0A1S3B1S8 (pentatricopeptide repeat-containing protein At5g66520 OS=Cucumis melo OX=3656 GN=LOC103485057 PE=3 SV=1)

HSP 1 Score: 1140.9 bits (2950), Expect = 0.0e+00
Identity = 552/621 (88.89%), Postives = 586/621 (94.36%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKAESPLQSTW    +LL NCSNMKQLKQI AQMIKT I++EPKLATKFLTLCTSPH
Sbjct: 1   MFTLKAESPLQSTW----TLLENCSNMKQLKQIQAQMIKTAILSEPKLATKFLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
            GDLLYAQ+VFNGITSPNT MWNAIIRAY NS EPELAFLLYQQMLSSSVPHNSYTFPF+
Sbjct: 61  VGDLLYAQRVFNGITSPNTVMWNAIIRAYSNSKEPELAFLLYQQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRNLSA+GEALQVHGLV KLGFGSDVFALNALLHVY LCG+I YARQ+FDNIPERD 
Sbjct: 121 LKACRNLSALGEALQVHGLVIKLGFGSDVFALNALLHVYALCGEIRYARQMFDNIPERDA 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYGIFLDMP KNVVSWTSLISGLV AGLSV+AL+LCYEMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPSKNVVSWTSLISGLVGAGLSVKALSLCYEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIA LLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Sbjct: 241 GFELDGVAIACLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYVKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            R+FGKLK DQKDV +WTAMIDGFAIHGRGVEALEWF+ M+REGIRPNSITFTAVLRACS
Sbjct: 301 LRVFGKLKGDQKDVCIWTAMIDGFAIHGRGVEALEWFDLMRREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           Y GLVEEGK LF+SM+CLYNLSPSIEH+GCMVDLLGR+G L++AKELIK MPMKPNAVIW
Sbjct: 361 YGGLVEEGKELFKSMKCLYNLSPSIEHYGCMVDLLGRSGRLNEAKELIKNMPMKPNAVIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GA LKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAA+GKWKEAAEVRLKMKNL
Sbjct: 421 GAFLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAQGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
            VPI PGKSSITLNG+VHEFLAGHQDHPQMEQIHLKLKQ+AERLRQDE YEP TKDLLLD
Sbjct: 481 GVPISPGKSSITLNGIVHEFLAGHQDHPQMEQIHLKLKQIAERLRQDEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENEEKET +AQHSEKLAIAFGLINTKPG TIRV+KNLR+CRDCHTVAKL+SQIYCREII
Sbjct: 541 LENEEKETAIAQHSEKLAIAFGLINTKPGTTIRVVKNLRICRDCHTVAKLVSQIYCREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFRDG+CSCKDYW
Sbjct: 601 MRDRVRFHHFRDGSCSCKDYW 617

BLAST of ClCG01G019660 vs. ExPASy TrEMBL
Match: A0A0A0KKE0 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G502750 PE=3 SV=1)

HSP 1 Score: 1140.2 bits (2948), Expect = 0.0e+00
Identity = 551/621 (88.73%), Postives = 587/621 (94.52%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+L AESPLQSTWA    LL NCSNMKQLKQI AQMIKT I+TEPKLATKFLTLCTSPH
Sbjct: 1   MFTLNAESPLQSTWA----LLENCSNMKQLKQIQAQMIKTAIITEPKLATKFLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
            GDLLYAQ+VFNGITSPNTFMWNAIIRAY NS+EPELAFL YQQMLSSSVPHNSYTFPF+
Sbjct: 61  VGDLLYAQRVFNGITSPNTFMWNAIIRAYSNSDEPELAFLSYQQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           L+ACRNL AMGEALQVHGLV KLGFGSDVFALNALLHVY LCG+I+ ARQLFDNIPERD 
Sbjct: 121 LRACRNLLAMGEALQVHGLVIKLGFGSDVFALNALLHVYALCGEIHCARQLFDNIPERDA 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYG+FLDMPLKNVVSWTSLISGLVEAG SVEAL+LCYEMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGQSVEALSLCYEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGV +DRV GCALVNMY+KCG+MEEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVDVDRVIGCALVNMYVKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
             +FGKLK +QKDVY+WTAMIDGFAIHGRGVEALEWFNRM+REGIRPNSITFTAVLRACS
Sbjct: 301 LSVFGKLKGNQKDVYIWTAMIDGFAIHGRGVEALEWFNRMRREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           Y GLVEEGK LF+SM+C YN++PSIEH+GCMVDLLGR+G LD+AKELIKKMPMKP+AVIW
Sbjct: 361 YGGLVEEGKELFKSMKCFYNVNPSIEHYGCMVDLLGRSGRLDEAKELIKKMPMKPSAVIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GALLKACWIHRDFL+GSQ+GAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMK+L
Sbjct: 421 GALLKACWIHRDFLLGSQVGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKSL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
            VPI PGKSS+TLNG+VHEFLAGHQDHPQMEQI LKLKQ+AERLRQDE YEP TKDLLLD
Sbjct: 481 GVPISPGKSSVTLNGIVHEFLAGHQDHPQMEQIQLKLKQIAERLRQDEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENEEKET MAQHSEKLAIAFGLINTKPG TIRVIKNLR+CRDCHTVAKL+SQIY REII
Sbjct: 541 LENEEKETAMAQHSEKLAIAFGLINTKPGTTIRVIKNLRICRDCHTVAKLVSQIYSREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFRDG+CSCKDYW
Sbjct: 601 MRDRVRFHHFRDGSCSCKDYW 617

BLAST of ClCG01G019660 vs. ExPASy TrEMBL
Match: A0A6J1IT43 (pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita maxima OX=3661 GN=LOC111478422 PE=3 SV=1)

HSP 1 Score: 1136.3 bits (2938), Expect = 0.0e+00
Identity = 554/621 (89.21%), Postives = 583/621 (93.88%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLCTSPH
Sbjct: 1   MFALKAESPMQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCTSPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
           FGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFLLY+QMLSSSVPHNSYTFPF+
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYG+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            + FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACS
Sbjct: 301 LQEFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           YAGLVEEGKVLFESM  +Y LSPSIEH+GCMVDLLGRAGLL++AKELIK MPMKPNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYILSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMKPNAIIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
           RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLD
Sbjct: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LENE KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLIS+IY REII
Sbjct: 541 LENEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISRIYRREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFR G+CSCKDYW
Sbjct: 601 MRDRVRFHHFRGGSCSCKDYW 621

BLAST of ClCG01G019660 vs. ExPASy TrEMBL
Match: A0A6J1GDX2 (pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita moschata OX=3662 GN=LOC111453066 PE=3 SV=1)

HSP 1 Score: 1135.6 bits (2936), Expect = 0.0e+00
Identity = 553/621 (89.05%), Postives = 582/621 (93.72%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPH 60
           MF+LKAESP+QSTWAQTMSLL NCSNMKQLK+IHAQMI+T   TEPKLATK LTLC SPH
Sbjct: 1   MFALKAESPVQSTWAQTMSLLENCSNMKQLKEIHAQMIRTGTATEPKLATKLLTLCISPH 60

Query: 61  FGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFV 120
           FGDL YAQ+VFNGI+SP TFMWNA+IRAY NSNEPELAFLLY+QMLSSSVPHNSYTFPF+
Sbjct: 61  FGDLHYAQRVFNGISSPTTFMWNAMIRAYSNSNEPELAFLLYRQMLSSSVPHNSYTFPFL 120

Query: 121 LKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDV 180
           LKACRN SAM EALQVHGLV KLGFGSDVFALNALLHVY LCGDI YARQLFDNIPERD+
Sbjct: 121 LKACRNFSAMSEALQVHGLVIKLGFGSDVFALNALLHVYALCGDIQYARQLFDNIPERDI 180

Query: 181 VSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSA 240
           VSWNIMIDGYIKSGDVKTAYG+FLDMPLKNVVSWTSLISGLVEAGL+VEAL+LC+EMQ+A
Sbjct: 181 VSWNIMIDGYIKSGDVKTAYGVFLDMPLKNVVSWTSLISGLVEAGLNVEALSLCHEMQNA 240

Query: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEA 300
           GFELDGVAIASLLTACANLGALDQGRWLHFY+LNNGVH+DRV GCALVNMYLKCG+MEEA
Sbjct: 241 GFELDGVAIASLLTACANLGALDQGRWLHFYVLNNGVHVDRVIGCALVNMYLKCGDMEEA 300

Query: 301 FRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACS 360
            R FGKLK DQKDVYVWTAMIDGFAIHGRGVEALEWF RM REGIRPNSITFTAVLRACS
Sbjct: 301 LREFGKLKGDQKDVYVWTAMIDGFAIHGRGVEALEWFKRMLREGIRPNSITFTAVLRACS 360

Query: 361 YAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIW 420
           YAGLVEEGKVLFESM  +YNLSPSIEH+GCMVDLLGRAGLL++AKELIK MPM+PNA+IW
Sbjct: 361 YAGLVEEGKVLFESMMSVYNLSPSIEHYGCMVDLLGRAGLLEEAKELIKTMPMEPNAIIW 420

Query: 421 GALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480
           GALLKAC IHRDFLVG QIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL
Sbjct: 421 GALLKACRIHRDFLVGGQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNL 480

Query: 481 RVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLD 540
           R+PIPPGKSSITLNGVVHEFLAGHQDHPQMEQI  KL QV ERLRQ E YEP TKDLLLD
Sbjct: 481 RLPIPPGKSSITLNGVVHEFLAGHQDHPQMEQILHKLNQVVERLRQHEGYEPATKDLLLD 540

Query: 541 LENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREII 600
           LE+E KET +AQHSEKLAIAFGLINTKPG+TIRV+KNLRVC DCH VAKLISQIY REII
Sbjct: 541 LESEAKETAVAQHSEKLAIAFGLINTKPGSTIRVVKNLRVCEDCHVVAKLISQIYRREII 600

Query: 601 MRDRVRFHHFRDGNCSCKDYW 622
           MRDRVRFHHFR GNCSC DYW
Sbjct: 601 MRDRVRFHHFRGGNCSCNDYW 621

BLAST of ClCG01G019660 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 678.3 bits (1749), Expect = 5.7e-195
Identity = 331/614 (53.91%), Postives = 438/614 (71.34%), Query Frame = 0

Query: 10  LQSTWAQTMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLL-YAQ 69
           L+    +TMS L  CS  ++LKQIHA+M+KT ++ +    TKFL+ C S    D L YAQ
Sbjct: 10  LEHNLYETMSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQ 69

Query: 70  KVFNGITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFVLKACRNLS 129
            VF+G   P+TF+WN +IR +  S+EPE + LLYQ+ML SS PHN+YTFP +LKAC NLS
Sbjct: 70  IVFDGFDRPDTFLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLS 129

Query: 130 AMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMID 189
           A  E  Q+H  + KLG+ +DV+A+N+L++ Y + G+   A  LFD IPE D VSWN +I 
Sbjct: 130 AFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIK 189

Query: 190 GYIKSGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVA 249
           GY+K+G +  A  +F  M  KN +SWT++ISG V+A ++ EAL L +EMQ++  E D V+
Sbjct: 190 GYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVS 249

Query: 250 IASLLTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLK 309
           +A+ L+ACA LGAL+QG+W+H YL    + +D V GC L++MY KCGEMEEA  +F  +K
Sbjct: 250 LANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIK 309

Query: 310 SDQKDVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEG 369
             +K V  WTA+I G+A HG G EA+  F  MQ+ GI+PN ITFTAVL ACSY GLVEEG
Sbjct: 310 --KKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEG 369

Query: 370 KVLFESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACW 429
           K++F SM   YNL P+IEH+GC+VDLLGRAGLLD+AK  I++MP+KPNAVIWGALLKAC 
Sbjct: 370 KLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACR 429

Query: 430 IHRDFLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGK 489
           IH++  +G +IG  L+ +D  H GRY+  A I A + KW +AAE R  MK   V   PG 
Sbjct: 430 IHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGC 489

Query: 490 SSITLNGVVHEFLAGHQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDL-ENEEKE 549
           S+I+L G  HEFLAG + HP++E+I  K + +  R  ++  Y P  +++LLDL +++E+E
Sbjct: 490 STISLEGTTHEFLAGDRSHPEIEKIQSKWR-IMRRKLEENGYVPELEEMLLDLVDDDERE 549

Query: 550 TTMAQHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRF 609
             + QHSEKLAI +GLI TKPG  IR++KNLRVC+DCH V KLIS+IY R+I+MRDR RF
Sbjct: 550 AIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRF 609

Query: 610 HHFRDGNCSCKDYW 622
           HHFRDG CSC DYW
Sbjct: 610 HHFRDGKCSCGDYW 620

BLAST of ClCG01G019660 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 531.2 bits (1367), Expect = 1.1e-150
Identity = 274/643 (42.61%), Postives = 404/643 (62.83%), Query Frame = 0

Query: 1   MFSLKAESPLQSTWAQTMSL---LVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCT 60
           +FS    SP  S  +   SL   + NC  ++ L QIHA  IK+  + +   A + L  C 
Sbjct: 7   LFSPGGNSPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCA 66

Query: 61  SP--HFGDLLYAQKVFNGITSPNTFMWNAIIRAYCNSNEPE--LAFLLYQQMLSSS-VPH 120
           +   H  DL YA K+FN +   N F WN IIR +  S+E +  +A  L+ +M+S   V  
Sbjct: 67  TSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEP 126

Query: 121 NSYTFPFVLKACRNLSAMGEALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLF 180
           N +TFP VLKAC     + E  Q+HGL  K GFG D F ++ L+ +Y +CG +  AR LF
Sbjct: 127 NRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLF 186

Query: 181 -DNIPERD-------------VVSWNIMIDGYIKSGDVKTAYGIFLDMPLKNVVSWTSLI 240
             NI E+D             +V WN+MIDGY++ GD K A  +F  M  ++VVSW ++I
Sbjct: 187 YKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMI 246

Query: 241 SGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLNNGVH 300
           SG    G   +A+ +  EM+      + V + S+L A + LG+L+ G WLH Y  ++G+ 
Sbjct: 247 SGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIR 306

Query: 301 IDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEALEWFN 360
           ID V G AL++MY KCG +E+A  +F +L   +++V  W+AMI+GFAIHG+  +A++ F 
Sbjct: 307 IDDVLGSALIDMYSKCGIIEKAIHVFERL--PRENVITWSAMINGFAIHGQAGDAIDCFC 366

Query: 361 RMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDLLGRA 420
           +M++ G+RP+ + +  +L ACS+ GLVEEG+  F  M  +  L P IEH+GCMVDLLGR+
Sbjct: 367 KMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRS 426

Query: 421 GLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRYIQLA 480
           GLLD+A+E I  MP+KP+ VIW ALL AC +  +  +G ++   L+++    SG Y+ L+
Sbjct: 427 GLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALS 486

Query: 481 TILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIHLKLK 540
            + A++G W E +E+RL+MK   +   PG S I ++GV+HEF+     HP+ ++I+  L 
Sbjct: 487 NMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLV 546

Query: 541 QVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATIRVIKNL 600
           +++++LR    Y P+T  +LL+LE E+KE  +  HSEK+A AFGLI+T PG  IR++KNL
Sbjct: 547 EISDKLRL-AGYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNL 606

Query: 601 RVCRDCHTVAKLISQIYCREIIMRDRVRFHHFRDGNCSCKDYW 622
           R+C DCH+  KLIS++Y R+I +RDR RFHHF+DG+CSC DYW
Sbjct: 607 RICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of ClCG01G019660 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 499.6 bits (1285), Expect = 3.7e-141
Identity = 264/708 (37.29%), Postives = 398/708 (56.21%), Query Frame = 0

Query: 18  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGDLLYAQKVFNGITSP 77
           +SL+  C +++QLKQ H  MI+T   ++P  A+K   +     F  L YA+KVF+ I  P
Sbjct: 34  ISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKP 93

Query: 78  NTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPH-NSYTFPFVLKACRNLSAMGEALQV 137
           N+F WN +IRAY +  +P L+   +  M+S S  + N YTFPF++KA   +S++     +
Sbjct: 94  NSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSL 153

Query: 138 HGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIKSG-- 197
           HG+  K   GSDVF  N+L+H Y  CGD++ A ++F  I E+DVVSWN MI+G+++ G  
Sbjct: 154 HGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSP 213

Query: 198 ------------------------------------------------------------ 257
                                                                       
Sbjct: 214 DKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAM 273

Query: 258 ---------------------------------------DVKTAYGIFLDMPLKNVVSWT 317
                                                  D + A  +   MP K++V+W 
Sbjct: 274 LDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWN 333

Query: 318 SLISGLVEAGLSVEALNLCYEMQ-SAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN 377
           +LIS   + G   EAL + +E+Q     +L+ + + S L+ACA +GAL+ GRW+H Y+  
Sbjct: 334 ALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKK 393

Query: 378 NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVEAL 437
           +G+ ++     AL++MY KCG++E++  +F  +  +++DV+VW+AMI G A+HG G EA+
Sbjct: 394 HGIRMNFHVTSALIHMYSKCGDLEKSREVFNSV--EKRDVFVWSAMIGGLAMHGCGNEAV 453

Query: 438 EWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMVDL 497
           + F +MQ   ++PN +TFT V  ACS+ GLV+E + LF  M   Y + P  +H+ C+VD+
Sbjct: 454 DMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDV 513

Query: 498 LGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSGRY 557
           LGR+G L+KA + I+ MP+ P+  +WGALL AC IH +  +       L+E++  + G +
Sbjct: 514 LGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAH 573

Query: 558 IQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQIH 617
           + L+ I A  GKW+  +E+R  M+   +   PG SSI ++G++HEFL+G   HP  E+++
Sbjct: 574 VLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVY 633

Query: 618 LKLKQVAERLRQDESYEPVTKDLLLDLENEE-KETTMAQHSEKLAIAFGLINTKPGATIR 622
            KL +V E+L+ +  YEP    +L  +E EE KE ++  HSEKLAI +GLI+T+    IR
Sbjct: 634 GKLHEVMEKLKSN-GYEPEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIR 693

BLAST of ClCG01G019660 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 490.3 bits (1261), Expect = 2.2e-138
Identity = 246/610 (40.33%), Postives = 391/610 (64.10%), Query Frame = 0

Query: 18  MSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLCTSPHFGD-----LLYAQKVFN 77
           ++LL +CS+   LK IH  +++T ++++  +A++ L LC      +     L YA  +F+
Sbjct: 16  LALLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFS 75

Query: 78  GITSPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFVLKACRNLSAMGE 137
            I +PN F++N +IR +    EP  AF  Y QML S +  ++ TFPF++KA   +  +  
Sbjct: 76  QIQNPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 135

Query: 138 ALQVHGLVFKLGFGSDVFALNALLHVYTLCGDINYARQLFDNIPERDVVSWNIMIDGYIK 197
             Q H  + + GF +DV+  N+L+H+Y  CG I  A ++F  +  RDVVSW  M+ GY K
Sbjct: 136 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 195

Query: 198 SGDVKTAYGIFLDMPLKNVVSWTSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASL 257
            G V+ A  +F +MP +N+ +W+ +I+G  +     +A++L   M+  G   +   + S+
Sbjct: 196 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 255

Query: 258 LTACANLGALDQGRWLHFYLLNNGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQK 317
           +++CA+LGAL+ G   + Y++ + + ++ + G ALV+M+ +CG++E+A  +F  L   + 
Sbjct: 256 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGL--PET 315

Query: 318 DVYVWTAMIDGFAIHGRGVEALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLF 377
           D   W+++I G A+HG   +A+ +F++M   G  P  +TFTAVL ACS+ GLVE+G  ++
Sbjct: 316 DSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 375

Query: 378 ESMRCLYNLSPSIEHFGCMVDLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRD 437
           E+M+  + + P +EH+GC+VD+LGRAG L +A+  I KM +KPNA I GALL AC I+++
Sbjct: 376 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 435

Query: 438 FLVGSQIGAHLVEVDSDHSGRYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSIT 497
             V  ++G  L++V  +HSG Y+ L+ I A  G+W +   +R  MK   V  PPG S I 
Sbjct: 436 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 495

Query: 498 LNGVVHEFLAG-HQDHPQMEQIHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMA 557
           ++G +++F  G  Q HP+M +I  K +++  ++R    Y+  T D   D++ EEKE+++ 
Sbjct: 496 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRL-IGYKGNTGDAFFDVDEEEKESSIH 555

Query: 558 QHSEKLAIAFGLINTKPGATIRVIKNLRVCRDCHTVAKLISQIYCREIIMRDRVRFHHFR 617
            HSEKLAIA+G++ TKPG TIR++KNLRVC DCHTV KLIS++Y RE+I+RDR RFHHFR
Sbjct: 556 MHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFR 615

Query: 618 DGNCSCKDYW 622
           +G CSC+DYW
Sbjct: 616 NGVCSCRDYW 622

BLAST of ClCG01G019660 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 489.2 bits (1258), Expect = 4.9e-138
Identity = 272/709 (38.36%), Postives = 389/709 (54.87%), Query Frame = 0

Query: 17  TMSLLVNCSNMKQLKQIHAQMIKTEIVTEPKLATKFLTLC-TSPHFGDLLYAQKVFNGIT 76
           ++SLL NC  ++ L+ IHAQMIK  +       +K +  C  SPHF  L YA  VF  I 
Sbjct: 36  SLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFEGLPYAISVFKTIQ 95

Query: 77  SPNTFMWNAIIRAYCNSNEPELAFLLYQQMLSSSVPHNSYTFPFVLKACRNLSAMGEALQ 136
            PN  +WN + R +  S++P  A  LY  M+S  +  NSYTFPFVLK+C    A  E  Q
Sbjct: 96  EPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPFVLKSCAKSKAFKEGQQ 155

Query: 137 VHGLVFKLGFG-------------------------------SDVFALNALLHVYTLCGD 196
           +HG V KLG                                  DV +  AL+  Y   G 
Sbjct: 156 IHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGY 215

Query: 197 INYARQLFDNIPERDVVSWNIMIDGYI--------------------------------- 256
           I  A++LFD IP +DVVSWN MI GY                                  
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 257 -------------------------------------KSGDVKTAYGIFLDMPLKNVVSW 316
                                                K G+++TA G+F  +P K+V+SW
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 317 TSLISGLVEAGLSVEALNLCYEMQSAGFELDGVAIASLLTACANLGALDQGRWLHFYLLN 376
            +LI G     L  EAL L  EM  +G   + V + S+L ACA+LGA+D GRW+H Y+  
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDK 395

Query: 377 --NGVHIDRVTGCALVNMYLKCGEMEEAFRLFGKLKSDQKDVYVWTAMIDGFAIHGRGVE 436
              GV        +L++MY KCG++E A ++F  +    K +  W AMI GFA+HGR   
Sbjct: 396 RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL--HKSLSSWNAMIFGFAMHGRADA 455

Query: 437 ALEWFNRMQREGIRPNSITFTAVLRACSYAGLVEEGKVLFESMRCLYNLSPSIEHFGCMV 496
           + + F+RM++ GI+P+ ITF  +L ACS++G+++ G+ +F +M   Y ++P +EH+GCM+
Sbjct: 456 SFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMI 515

Query: 497 DLLGRAGLLDKAKELIKKMPMKPNAVIWGALLKACWIHRDFLVGSQIGAHLVEVDSDHSG 556
           DLLG +GL  +A+E+I  M M+P+ VIW +LLKAC +H +  +G     +L++++ ++ G
Sbjct: 516 DLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPG 575

Query: 557 RYIQLATILAAEGKWKEAAEVRLKMKNLRVPIPPGKSSITLNGVVHEFLAGHQDHPQMEQ 616
            Y+ L+ I A+ G+W E A+ R  + +  +   PG SSI ++ VVHEF+ G + HP+  +
Sbjct: 576 SYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNRE 635

Query: 617 IHLKLKQVAERLRQDESYEPVTKDLLLDLENEEKETTMAQHSEKLAIAFGLINTKPGATI 622
           I+  L+++ E L +   + P T ++L ++E E KE  +  HSEKLAIAFGLI+TKPG  +
Sbjct: 636 IYGMLEEM-EVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKL 695

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038882528.10.0e+0091.30pentatricopeptide repeat-containing protein At5g66520 [Benincasa hispida][more]
XP_008440725.10.0e+0088.89PREDICTED: pentatricopeptide repeat-containing protein At5g66520 [Cucumis melo] ... [more]
XP_004143583.20.0e+0088.73pentatricopeptide repeat-containing protein At5g66520 [Cucumis sativus] >KGN4883... [more]
XP_022978438.10.0e+0089.21pentatricopeptide repeat-containing protein At5g66520 [Cucurbita maxima][more]
XP_022949774.10.0e+0089.05pentatricopeptide repeat-containing protein At5g66520 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Q9FJY78.1e-19453.91Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q9FI801.6e-14942.61Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
O823805.1e-14037.29Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Q9FG163.1e-13740.33Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX... [more]
Q9LN016.9e-13738.36Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A5D3CKZ80.0e+0088.89Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B1S80.0e+0088.89pentatricopeptide repeat-containing protein At5g66520 OS=Cucumis melo OX=3656 GN... [more]
A0A0A0KKE00.0e+0088.73DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G5027... [more]
A0A6J1IT430.0e+0089.21pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita maxima OX=366... [more]
A0A6J1GDX20.0e+0089.05pentatricopeptide repeat-containing protein At5g66520 OS=Cucurbita moschata OX=3... [more]
Match NameE-valueIdentityDescription
AT5G66520.15.7e-19553.91Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.11.1e-15042.61Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.13.7e-14137.29Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G06540.12.2e-13840.33Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.14.9e-13838.36Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 312..359
e-value: 5.3E-13
score: 48.9
coord: 77..124
e-value: 2.7E-11
score: 43.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 286..310
e-value: 0.004
score: 17.3
coord: 181..210
e-value: 2.0E-6
score: 27.7
coord: 388..412
e-value: 0.0032
score: 17.6
coord: 212..242
e-value: 8.7E-6
score: 25.6
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 181..210
e-value: 1.7E-5
score: 22.7
coord: 212..245
e-value: 6.1E-6
score: 24.1
coord: 315..348
e-value: 1.7E-8
score: 32.1
coord: 81..111
e-value: 1.7E-6
score: 25.8
coord: 350..383
e-value: 0.0015
score: 16.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 179..213
score: 10.753093
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 78..112
score: 10.544828
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 313..347
score: 12.309597
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 365..522
e-value: 7.0E-14
score: 53.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 8..125
e-value: 9.1E-13
score: 50.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 267..364
e-value: 3.3E-25
score: 90.5
coord: 147..266
e-value: 1.1E-28
score: 101.9
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 486..611
e-value: 7.8E-35
score: 119.5
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 15..595
NoneNo IPR availablePANTHERPTHR47928:SF136PPR CONTAINING PLANT-LIKE PROTEINcoord: 15..595

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
ClCG01G019660.2ClCG01G019660.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
biological_process GO:0016554 cytidine to uridine editing
cellular_component GO:0009507 chloroplast
molecular_function GO:0003729 mRNA binding
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding