ClCG07G005030 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG07G005030
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Pentatricopeptide repeat-containing protein
Location: CG_Chr07: 6688042 .. 6690099 (-)
RNA-Seq Expression: ClCG07G005030
Synteny: ClCG07G005030
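
The Location field packs a chromosome name, a start and end coordinate, and a strand into one string. The sketch below splits it apart and derives the span length. It assumes the coordinates are 1-based and inclusive, which is the usual convention for GFF-style annotation but is not stated on this page; `parse_location` is a hypothetical helper, not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CG_Chr07: 6688042 .. 6690099 (-)")
span = end - start + 1  # inclusive coordinates (assumption, see above)
print(chrom, strand, span)
```

A "-" strand means the sequences listed for the feature are reported as the reverse complement of the reference chromosome.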
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAGTGCATTTTCCAAAACAAGTTTCTGCTCCAAGCTCAACTTTGGAAGAAAAAGAGGATGTAGATATTTTGCAACAGCTAATTCTGCGTTGTCTTCTTTGAACTATGTCGATGACTGCTTTACTTTTGAATGTCCCGGTGCTACAAACTATAATGATGACTCTGTTGAACAAAGTAATTTTGGCTATGAAGTTCAAGTTTCTAAAGGTCAGAAAGCTGATGAGGATGAAATGAAAACGATAAAATTGATACTTGGGAACCATGGGTTTAATCTTGGTTCGCATCCGAAACAATTCGAGATTGTAAGGATTTTGGACATTCTATTTGAGGATAGTTCAGATGCCGGACTTTGTCTTTGCTACTTCAAATGGTCAGGATGTTTATCCGGATCTAATCAGTCGCTGGAGTCAATTTGTAAGATGATGCATATTTTGGTTACTGGGAATATGAATCATAGGGCTGTTGATTTAATGTCACAACTTGTTAGAACCTATGGTAGTAAAGAGGGATCTTCAATCATATTGCTGAAACTTTTGTATGAAACACATAATGAAAGGAAGACTTTGGAAACCACATGCAGCATGCTGGTTGACTGTTATATCAAGGAAAGAATGGTAACGGCTGCTCTTTTATTGATGGGTCAAATGAAGCACCTTAACATATTTCCTTCTATATGGGTATACAAGTCGGTGATACAGGCTTTATTACAAACCAATCAATCGGATTTAGCTTGGGATCTTCTAGAAGAAATGTACCGGCAAGGTATAAGTTTAAATTATTCAATTAATTTATTTATTCATCACTATTGTGCAAAAGGTGATCTGGGCAGGGGGTGGAAAGTGCTTTTGGAGTTGAGGAATTTCCTATCTAAGCCTGATGCAGTTGATTTCACAATTGTGATCAACTCACTTTGCAAAATTTCTCTTTTAAAAGAAGCCTCAGCCCTGTTGTTTAAAATGACTGCTTTTGGTGTTTCCCCTGATTCAGTGACGATGAGTTCTGTTATTGATGGTTATTGTAAAGTAGGAAAGTTGGATATAGCTTGTAAAATATTGAAGTATTTTAGGCTTCCCCTAAATATTTTCACATACAATAGCTTTATAACAAGGTTATGTACAGAAGGAGACATGGAAAAAGCTTCTGAAGTTTTTCTTGAAATGTCTGAGGTGGGCTTAGTTCCAGACTGTGTTAGTTACACAACCATGATAGGAGGCTATCTTAAAGTGGAAAACATAAACAGAGCATTTTCTTACCTATGCAAGATGTTAAAGAGTGGAACCCAACCATCTATTATCACGTATACTTTGTTCATTGATAACTTTTGCAAGTATGGAGATGTGGAAATGGCTGAAGTTATGTTCCAAAAGATGATTATTGAGGGTTTAAAGCCTGATGTTGTCATGTATAATATTTTGATGGATGGATATGGAAAGAAGGGGTACTTGCACAAGGCTTTTGAACTCCTTGATATGATGAGATCTACCAATGTTACCCCTGACGTTGTGACATATAACACTCTCATTAATGGTCTTGTTATGCGAGGGTTTCTTAAAGAGGCAAAGGATATACTAGATGAGCTCATCAGGAGGGGTTTCAGTATAGATGTTGTCACATACACTAATATCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGACTGACAATTGTGTAAAGCCTGATGTTGTTACTTGCAGTGCCCTTCTTAGTGGGTATTGCCGAGAACGGCGTATGGACGAAGCAAATGCTCTATTTTGTAAAATGCTGGACATTGGGTTAAATCCAGACTTGATATTGTACAATACTCTAATCCATGGATTTTGTAGTGTTGGTAATGTGGACGAAGGTTGCAATTTTGTAAAGAAGATGATTGAAAGCAGTATCATTCCAAACAATGTTACTCACTGTGCTCTTGTCCTCGGATTTCAGAGAAAGAGAGTTACCAATCCAATCAAGAGTGCCACTTCTAA
GCTTCAAGAAATCTTGCTTGCATATAATCTTCAGATTGATGCCAATGGATATATCTAA

mRNA sequence

ATGAAGAGTGCATTTTCCAAAACAAGTTTCTGCTCCAAGCTCAACTTTGGAAGAAAAAGAGGATGTAGATATTTTGCAACAGCTAATTCTGCGTTGTCTTCTTTGAACTATGTCGATGACTGCTTTACTTTTGAATGTCCCGGTGCTACAAACTATAATGATGACTCTGTTGAACAAAGTAATTTTGGCTATGAAGTTCAAGTTTCTAAAGGTCAGAAAGCTGATGAGGATGAAATGAAAACGATAAAATTGATACTTGGGAACCATGGGTTTAATCTTGGTTCGCATCCGAAACAATTCGAGATTGTAAGGATTTTGGACATTCTATTTGAGGATAGTTCAGATGCCGGACTTTGTCTTTGCTACTTCAAATGGTCAGGATGTTTATCCGGATCTAATCAGTCGCTGGAGTCAATTTGTAAGATGATGCATATTTTGGTTACTGGGAATATGAATCATAGGGCTGTTGATTTAATGTCACAACTTGTTAGAACCTATGGTAGTAAAGAGGGATCTTCAATCATATTGCTGAAACTTTTGTATGAAACACATAATGAAAGGAAGACTTTGGAAACCACATGCAGCATGCTGGTTGACTGTTATATCAAGGAAAGAATGGTAACGGCTGCTCTTTTATTGATGGGTCAAATGAAGCACCTTAACATATTTCCTTCTATATGGGTATACAAGTCGGTGATACAGGCTTTATTACAAACCAATCAATCGGATTTAGCTTGGGATCTTCTAGAAGAAATGTACCGGCAAGGTATAAGTTTAAATTATTCAATTAATTTATTTATTCATCACTATTGTGCAAAAGGTGATCTGGGCAGGGGGTGGAAAGTGCTTTTGGAGTTGAGGAATTTCCTATCTAAGCCTGATGCAGTTGATTTCACAATTGTGATCAACTCACTTTGCAAAATTTCTCTTTTAAAAGAAGCCTCAGCCCTGTTGTTTAAAATGACTGCTTTTGGTGTTTCCCCTGATTCAGTGACGATGAGTTCTGTTATTGATGGTTATTGTAAAGTAGGAAAGTTGGATATAGCTTGTAAAATATTGAAGTATTTTAGGCTTCCCCTAAATATTTTCACATACAATAGCTTTATAACAAGGTTATGTACAGAAGGAGACATGGAAAAAGCTTCTGAAGTTTTTCTTGAAATGTCTGAGGTGGGCTTAGTTCCAGACTGTGTTAGTTACACAACCATGATAGGAGGCTATCTTAAAGTGGAAAACATAAACAGAGCATTTTCTTACCTATGCAAGATGTTAAAGAGTGGAACCCAACCATCTATTATCACGTATACTTTGTTCATTGATAACTTTTGCAAGTATGGAGATGTGGAAATGGCTGAAGTTATGTTCCAAAAGATGATTATTGAGGGTTTAAAGCCTGATGTTGTCATGTATAATATTTTGATGGATGGATATGGAAAGAAGGGGTACTTGCACAAGGCTTTTGAACTCCTTGATATGATGAGATCTACCAATGTTACCCCTGACGTTGTGACATATAACACTCTCATTAATGGTCTTGTTATGCGAGGGTTTCTTAAAGAGGCAAAGGATATACTAGATGAGCTCATCAGGAGGGGTTTCAGTATAGATGTTGTCACATACACTAATATCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGACTGACAATTGTGTAAAGCCTGATGTTGTTACTTGCAGTGCCCTTCTTAGTGGGTATTGCCGAGAACGGCGTATGGACGAAGCAAATGCTCTATTTTGTAAAATGCTGGACATTGGGTTAAATCCAGACTTGATATTGTACAATACTCTAATCCATGGATTTTGTAGTGTTGGTAATGTGGACGAAGGTTGCAATTTTGTAAAGAAGATGATTGAAAGCAGTATCATTCCAAACAATGTTACTCACTGTGCTCTTGTCCTCGGATTTCAGAGAAAGAGAGTTACCAATCCAATCAAGAGTGCCACTTCTAA
GCTTCAAGAAATCTTGCTTGCATATAATCTTCAGATTGATGCCAATGGATATATCTAA

Coding sequence (CDS)

ATGAAGAGTGCATTTTCCAAAACAAGTTTCTGCTCCAAGCTCAACTTTGGAAGAAAAAGAGGATGTAGATATTTTGCAACAGCTAATTCTGCGTTGTCTTCTTTGAACTATGTCGATGACTGCTTTACTTTTGAATGTCCCGGTGCTACAAACTATAATGATGACTCTGTTGAACAAAGTAATTTTGGCTATGAAGTTCAAGTTTCTAAAGGTCAGAAAGCTGATGAGGATGAAATGAAAACGATAAAATTGATACTTGGGAACCATGGGTTTAATCTTGGTTCGCATCCGAAACAATTCGAGATTGTAAGGATTTTGGACATTCTATTTGAGGATAGTTCAGATGCCGGACTTTGTCTTTGCTACTTCAAATGGTCAGGATGTTTATCCGGATCTAATCAGTCGCTGGAGTCAATTTGTAAGATGATGCATATTTTGGTTACTGGGAATATGAATCATAGGGCTGTTGATTTAATGTCACAACTTGTTAGAACCTATGGTAGTAAAGAGGGATCTTCAATCATATTGCTGAAACTTTTGTATGAAACACATAATGAAAGGAAGACTTTGGAAACCACATGCAGCATGCTGGTTGACTGTTATATCAAGGAAAGAATGGTAACGGCTGCTCTTTTATTGATGGGTCAAATGAAGCACCTTAACATATTTCCTTCTATATGGGTATACAAGTCGGTGATACAGGCTTTATTACAAACCAATCAATCGGATTTAGCTTGGGATCTTCTAGAAGAAATGTACCGGCAAGGTATAAGTTTAAATTATTCAATTAATTTATTTATTCATCACTATTGTGCAAAAGGTGATCTGGGCAGGGGGTGGAAAGTGCTTTTGGAGTTGAGGAATTTCCTATCTAAGCCTGATGCAGTTGATTTCACAATTGTGATCAACTCACTTTGCAAAATTTCTCTTTTAAAAGAAGCCTCAGCCCTGTTGTTTAAAATGACTGCTTTTGGTGTTTCCCCTGATTCAGTGACGATGAGTTCTGTTATTGATGGTTATTGTAAAGTAGGAAAGTTGGATATAGCTTGTAAAATATTGAAGTATTTTAGGCTTCCCCTAAATATTTTCACATACAATAGCTTTATAACAAGGTTATGTACAGAAGGAGACATGGAAAAAGCTTCTGAAGTTTTTCTTGAAATGTCTGAGGTGGGCTTAGTTCCAGACTGTGTTAGTTACACAACCATGATAGGAGGCTATCTTAAAGTGGAAAACATAAACAGAGCATTTTCTTACCTATGCAAGATGTTAAAGAGTGGAACCCAACCATCTATTATCACGTATACTTTGTTCATTGATAACTTTTGCAAGTATGGAGATGTGGAAATGGCTGAAGTTATGTTCCAAAAGATGATTATTGAGGGTTTAAAGCCTGATGTTGTCATGTATAATATTTTGATGGATGGATATGGAAAGAAGGGGTACTTGCACAAGGCTTTTGAACTCCTTGATATGATGAGATCTACCAATGTTACCCCTGACGTTGTGACATATAACACTCTCATTAATGGTCTTGTTATGCGAGGGTTTCTTAAAGAGGCAAAGGATATACTAGATGAGCTCATCAGGAGGGGTTTCAGTATAGATGTTGTCACATACACTAATATCATATATGGATATTCCAAAAGGGGAAACTTTGAGGAAGCTTTTCTTCTTTGGTATCATATGACTGACAATTGTGTAAAGCCTGATGTTGTTACTTGCAGTGCCCTTCTTAGTGGGTATTGCCGAGAACGGCGTATGGACGAAGCAAATGCTCTATTTTGTAAAATGCTGGACATTGGGTTAAATCCAGACTTGATATTGTACAATACTCTAATCCATGGATTTTGTAGTGTTGGTAATGTGGACGAAGGTTGCAATTTTGTAAAGAAGATGATTGAAAGCAGTATCATTCCAAACAATGTTACTCACTGTGCTCTTGTCCTCGGATTTCAGAGAAAGAGAGTTACCAATCCAATCAAGAGTGCCACTTCTAA
GCTTCAAGAAATCTTGCTTGCATATAATCTTCAGATTGATGCCAATGGATATATCTAA

Protein sequence

MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQSNFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLCLCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLLYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIVINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPLNIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNPIKSATSKLQEILLAYNLQIDANGYI
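
The protein sequence can be cross-checked against the coding sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies; `translate` is an illustrative helper, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGAGTGCATTTTCCAAAACAAGTTTC"  # first 30 nt of the CDS
cds_end = "AATGGATATATCTAA"                   # last 15 nt of the CDS

print(translate(cds_start))  # MKSAFSKTSF
print(translate(cds_end))    # NGYI
```

Both fragments agree with the start (MKSAFSKTSF) and end (NGYI) of the protein sequence listed above.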
Homology
BLAST of ClCG07G005030 vs. NCBI nr
Match: XP_038892184.1 (pentatricopeptide repeat-containing protein At2g19280 [Benincasa hispida])

HSP 1 Score: 1275.4 bits (3299), Expect = 0.0e+00
Identity = 633/686 (92.27%), Positives = 655/686 (95.48%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVDD-CFTFECPGATNYNDDSVEQ 60
           MKSAFSK SFCSKLNFGRKRGCRY ATANSALSSLNYVDD CFTFECP ATNY++DSVEQ
Sbjct: 1   MKSAFSKISFCSKLNFGRKRGCRYSATANSALSSLNYVDDGCFTFECPMATNYDNDSVEQ 60

Query: 61  SNFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLC 120
           S+FG EVQVSKGQKAD+DEMKTIKLILGNHG NLGSHPKQFE VRILDILFEDSSDAGLC
Sbjct: 61  SHFGNEVQVSKGQKADQDEMKTIKLILGNHGINLGSHPKQFETVRILDILFEDSSDAGLC 120

Query: 121 LCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKL 180
           L YFKWSGCLSGSNQSLESICKM+HILVTGNMNHRAVDLMS L +TYGSKEGSS ILLKL
Sbjct: 121 LYYFKWSGCLSGSNQSLESICKMLHILVTGNMNHRAVDLMSHLAKTYGSKEGSSTILLKL 180

Query: 181 LYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQT 240
           LYETH ERKTLETTCSMLV CYIKERMVTAAL+LMGQMKHLNIFPSIWVYKSVIQALLQT
Sbjct: 181 LYETHTERKTLETTCSMLVHCYIKERMVTAALILMGQMKHLNIFPSIWVYKSVIQALLQT 240

Query: 241 NQSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFT 300
           NQ +LAWDLLEEMYR+GISLNYSINLFIHHYCAKG+LGRGWKVLLELRNF SKPDAVD+T
Sbjct: 241 NQLELAWDLLEEMYRRGISLNYSINLFIHHYCAKGNLGRGWKVLLELRNFGSKPDAVDYT 300

Query: 301 IVINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLP 360
           I+INSLCKISLLKEA+ALLFKM AFGVSPDSV MSSVIDGYCKVGK DIACKILKYFRLP
Sbjct: 301 IMINSLCKISLLKEATALLFKMIAFGVSPDSVMMSSVIDGYCKVGKSDIACKILKYFRLP 360

Query: 361 LNIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSY 420
           LNIFTYNSFITRLC EG+M KAS+VFLEM EVGLVPD VSYTTMIGGY KVENIN+AFSY
Sbjct: 361 LNIFTYNSFITRLCMEGNMAKASKVFLEMFEVGLVPDRVSYTTMIGGYCKVENINKAFSY 420

Query: 421 LCKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGK 480
           LCKM+KSG QPSIITYTLFIDNFCK GDVEMAEV+FQK+IIEGLKPDVV YNILMDGYGK
Sbjct: 421 LCKMIKSGIQPSIITYTLFIDNFCKCGDVEMAEVLFQKIIIEGLKPDVVTYNILMDGYGK 480

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGYLHK FELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRR FSIDVVT
Sbjct: 481 KGYLHKTFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRDFSIDVVT 540

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKML 600
           YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRE+RMDEANALFCKML
Sbjct: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCREQRMDEANALFCKML 600

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTN 660
           DIGL+PDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQ+KR  N
Sbjct: 601 DIGLSPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQKKRAIN 660

Query: 661 PIKSATSKLQEILLAYNLQIDANGYI 686
           PI SATSKLQEILLAYNLQIDANGYI
Sbjct: 661 PIGSATSKLQEILLAYNLQIDANGYI 686
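
The identity count in a BLAST report is the number of alignment columns in which query and subject carry the same residue. As a rough illustration, the sketch below counts identities and gap columns for the first 60-column block of the Benincasa hispida alignment above; `column_stats` is a hypothetical helper. The positives count is not reproduced here because it depends on the substitution matrix used for the search, which this page does not state.

```python
def column_stats(query_row, sbjct_row):
    """Return (identities, gap_columns, alignment_length) for two aligned rows."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have the same length")
    pairs = list(zip(query_row, sbjct_row))
    identities = sum(q == s and q != "-" for q, s in pairs)
    gap_columns = sum(q == "-" or s == "-" for q, s in pairs)
    return identities, gap_columns, len(pairs)

# First block of the alignment (query residues 1-60), copied from the report.
query = "MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVDD-CFTFECPGATNYNDDSVEQ"
sbjct = "MKSAFSKISFCSKLNFGRKRGCRYSATANSALSSLNYVDDGCFTFECPMATNYDNDSVEQ"

identities, gap_columns, length = column_stats(query, sbjct)
print(identities, gap_columns, length)  # 54 1 60
```

Summing the per-block identities over every block of a hit and dividing by the total alignment length gives the percentage reported in the Identity field.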

BLAST of ClCG07G005030 vs. NCBI nr
Match: XP_022957015.1 (pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957016.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957017.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957018.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata] >XP_022957019.1 pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1192.2 bits (3083), Expect = 0.0e+00
Identity = 594/686 (86.59%), Positives = 634/686 (92.42%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVD-DCFTFECPGATNYNDDSVEQ 60
           MKSAFS  +FCSKLNFGRKR CRYFATANSALSS NY D DCFT E P ATN + DS EQ
Sbjct: 1   MKSAFSIINFCSKLNFGRKRPCRYFATANSALSSFNYADEDCFTSELPAATNSDVDSEEQ 60

Query: 61  SNFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLC 120
           + FG +VQVSKG KAD+DEMK IKLILGNHGFNLGSHPKQ EIVRILDILFE+SSDA LC
Sbjct: 61  NYFGNDVQVSKGLKADDDEMKLIKLILGNHGFNLGSHPKQLEIVRILDILFEESSDARLC 120

Query: 121 LCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKL 180
           L YFKWSGCLSGSN+SLESIC+M+HILV GNMNHRAVDLMS LV+ YGSKEG S ILLKL
Sbjct: 121 LYYFKWSGCLSGSNRSLESICRMIHILVAGNMNHRAVDLMSHLVKNYGSKEGFSTILLKL 180

Query: 181 LYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQT 240
            YETH+ERKTLETTCSMLVDCYIKERMVTAAL+LMGQMK  +IFPSIWVYKSVIQALLQT
Sbjct: 181 FYETHHERKTLETTCSMLVDCYIKERMVTAALILMGQMKSFDIFPSIWVYKSVIQALLQT 240

Query: 241 NQSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFT 300
           NQS+ AWDLLEEM+RQGISLNYSINLFI+HYCAKG+L RGWKVLLELR F SKPDAVD+T
Sbjct: 241 NQSESAWDLLEEMHRQGISLNYSINLFIYHYCAKGNLSRGWKVLLELRKFGSKPDAVDYT 300

Query: 301 IVINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLP 360
           IVINSLCKISLLKEA+ALLFKMTAFGVSPDSVTMSSVIDGYCK+GKLDIACKILKYFR P
Sbjct: 301 IVINSLCKISLLKEATALLFKMTAFGVSPDSVTMSSVIDGYCKLGKLDIACKILKYFRRP 360

Query: 361 LNIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSY 420
           LNIF YNSFIT+LC EG+  KASEVFLEMSEVGLVPDCVSYTTMIGGY KV NINRAFSY
Sbjct: 361 LNIFIYNSFITKLCMEGNTVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGNINRAFSY 420

Query: 421 LCKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGK 480
           L KMLKSG +PS+ITYTLFID FCK  DVEMAEVM QKMIIEGL PDVV YNILMDGYGK
Sbjct: 421 LGKMLKSGIRPSVITYTLFIDYFCKRRDVEMAEVMLQKMIIEGLNPDVVTYNILMDGYGK 480

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGYLHKAFELLD MRSTN+TPDVVTYNTLINGLV RGFL+EAKD+LDEL RRGF+IDVVT
Sbjct: 481 KGYLHKAFELLDTMRSTNLTPDVVTYNTLINGLVTRGFLQEAKDMLDELNRRGFNIDVVT 540

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKML 600
           YTNII+GYSKRGNFEEAFL+W+HMTDNCVKPDVVTCSALLSGYCRERR+DEANALFCKML
Sbjct: 541 YTNIIHGYSKRGNFEEAFLVWFHMTDNCVKPDVVTCSALLSGYCRERRIDEANALFCKML 600

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTN 660
           DIGLNPDLILYNTLIHGFCSVGNVDEGCN VKKMIE+SI+PNNVTH ALVLGFQ+++V +
Sbjct: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNLVKKMIENSILPNNVTHRALVLGFQKRKVID 660

Query: 661 PIKSATSKLQEILLAYNLQIDANGYI 686
           PI+SATSKLQEILLAY+LQIDANGYI
Sbjct: 661 PIESATSKLQEILLAYDLQIDANGYI 686
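
Each hit carries a one-line summary of identities and positives as count/length pairs with a percentage. A small sketch for reading that line and recomputing the percentages is given below; the regular expression and the `parse_stats` helper are assumptions made for this example (the pattern accepts either spelling of the positives label), not a format guaranteed by BLAST.

```python
import re

# Matches e.g. "Identity = 594/686 (86.59%), Positives = 634/686 (92.42%)".
STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_stats(line):
    """Return {'identity': (n, length, pct), 'positives': (n, length, pct)}."""
    m = STATS.search(line)
    if m is None:
        raise ValueError(f"no identity/positives summary in: {line!r}")
    i_n, i_len, i_pct, p_n, p_len, p_pct = m.groups()
    return {
        "identity": (int(i_n), int(i_len), float(i_pct)),
        "positives": (int(p_n), int(p_len), float(p_pct)),
    }

line = "Identity = 594/686 (86.59%), Positives = 634/686 (92.42%), Query Frame = 0"
stats = parse_stats(line)
for name, (count, length, reported) in stats.items():
    recomputed = round(100 * count / length, 2)
    print(f"{name}: {count}/{length} = {recomputed}% (reported {reported}%)")
```

For the Cucurbita moschata hit the recomputed values, 86.59% and 92.42%, match the reported ones.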

BLAST of ClCG07G005030 vs. NCBI nr
Match: XP_011655513.1 (pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativus] >XP_011655514.1 pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativus])

HSP 1 Score: 1142.9 bits (2955), Expect = 0.0e+00
Identity = 566/685 (82.63%), Positives = 622/685 (90.80%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQS 60
           M+SAFS  SFCSKLNF RK  CRY ATANS LSS N++D+    +C   TNY+ +S E+S
Sbjct: 1   MRSAFSIISFCSKLNFRRKTPCRYSATANSELSSFNHMDE----DC---TNYDVNSDERS 60

Query: 61  NFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLCL 120
             G EV+VSKGQK DEDEM+TIKLILGN GFNLGS PKQ EI+RILD+LFEDSSDAGLCL
Sbjct: 61  YVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIRILDVLFEDSSDAGLCL 120

Query: 121 CYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLL 180
            YFKWSGCLSGSNQSLESIC+M HILV GNMNHRAVDL+S LV+ YG  EGSS ILLK+ 
Sbjct: 121 YYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKVF 180

Query: 181 YETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTN 240
            ETHN RKTLETTCSM+V+CYIKERMVT+AL+L+ QMKHLNIFPSIWVYKSVI+ALLQTN
Sbjct: 181 CETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQTN 240

Query: 241 QSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTI 300
           QS +AWDLLEEM+RQG+SLNYSINLFIHHYC++G+LG+GWKVLLELRNF SKPD VD+T 
Sbjct: 241 QSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYTT 300

Query: 301 VINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPL 360
           VINSLCK+SLLKEA+ALLFKM  FGVSPD VTMSS+IDG+CKVGK DIACKILKYFRLPL
Sbjct: 301 VINSLCKVSLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFRLPL 360

Query: 361 NIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYL 420
           NIF YNSFIT+L TEGDM KAS+VFLEM+EVGLVPDC+SYTTMIGGY KV NIN AFSYL
Sbjct: 361 NIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMIGGYCKVGNINIAFSYL 420

Query: 421 CKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKK 480
            KMLKSG QPS+ITYTLF+D FC+  DVEMAEVMF+KMI+EGLKPDVV+YNILMD YGKK
Sbjct: 421 SKMLKSGIQPSVITYTLFLDYFCECRDVEMAEVMFEKMIVEGLKPDVVVYNILMDAYGKK 480

Query: 481 GYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTY 540
           GY+HKAF+LLDMMRSTNVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRRGFS+DVVTY
Sbjct: 481 GYMHKAFKLLDMMRSTNVTPDVVTYNTLINGLVMRGFLQEAKDILDELIRRGFSVDVVTY 540

Query: 541 TNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLD 600
           TNII+GYS RGNFEEAFLLWYHM +NCV PDVVTCSALLSGYCRE+RMDEANALFCKMLD
Sbjct: 541 TNIIHGYSTRGNFEEAFLLWYHMAENCVTPDVVTCSALLSGYCREKRMDEANALFCKMLD 600

Query: 601 IGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNP 660
           IGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTH ALVLGFQ+KRVT+P
Sbjct: 601 IGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVTDP 660

Query: 661 IKSATSKLQEILLAYNLQIDANGYI 686
           I+SATSKLQEIL+AY+LQIDA GYI
Sbjct: 661 IQSATSKLQEILIAYDLQIDAIGYI 678

BLAST of ClCG07G005030 vs. NCBI nr
Match: XP_022139130.1 (pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica charantia] >XP_022139131.1 pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica charantia] >XP_022139132.1 pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica charantia])

HSP 1 Score: 1123.2 bits (2904), Expect = 0.0e+00
Identity = 558/686 (81.34%), Positives = 606/686 (88.34%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYV-DDCFTFECPGATNYNDDSVEQ 60
           MKS FS   FCSKLNFGRK  CRYFAT N+ALS  N V DDCFT+E P A NY+ D  E+
Sbjct: 28  MKSPFSIICFCSKLNFGRKIACRYFATTNAALSLFNCVDDDCFTYEYPVAANYDVDFDEK 87

Query: 61  SNFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLC 120
             F  E    KGQK D+D MK IKLIL NHG NLGSHPKQ EIVRILD LFEDSSDAGL 
Sbjct: 88  IYFRNE--DPKGQKVDDDRMKMIKLILRNHGLNLGSHPKQLEIVRILDTLFEDSSDAGLS 147

Query: 121 LCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKL 180
           L YFKWSGCLSGSNQSL+SIC+M+ IL+TGNMNHRAVDLMS +V  YGSKEGSS +LLKL
Sbjct: 148 LYYFKWSGCLSGSNQSLQSICRMIRILITGNMNHRAVDLMSHIVENYGSKEGSSSMLLKL 207

Query: 181 LYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQT 240
            +E  NERKTLET CSMLV CYIKERMVTAAL+LMGQMKHL IFPSIWVY+SVIQ LL+T
Sbjct: 208 FFEMVNERKTLETACSMLVYCYIKERMVTAALILMGQMKHLKIFPSIWVYRSVIQTLLET 267

Query: 241 NQSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFT 300
           NQ +LAWDLLEEMY QG+SLNYSINLFIHHYCA+G+LG GWKVLLELRNF SKPDAVD+T
Sbjct: 268 NQLELAWDLLEEMYIQGVSLNYSINLFIHHYCAEGNLGMGWKVLLELRNFGSKPDAVDYT 327

Query: 301 IVINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLP 360
           IVI+SLCK SLLKEA++LLFKM+AFGVSPDSVTMSSVIDGYCK+G LD+ACKILKYFRLP
Sbjct: 328 IVIDSLCKNSLLKEATSLLFKMSAFGVSPDSVTMSSVIDGYCKIGNLDVACKILKYFRLP 387

Query: 361 LNIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSY 420
           LNIFTYNSFIT+LCTEG+M  ASEVFLEMSEVGL+PDCVSYTTM+GGY KV +IN+AF Y
Sbjct: 388 LNIFTYNSFITKLCTEGNMVSASEVFLEMSEVGLLPDCVSYTTMMGGYCKVGDINKAFLY 447

Query: 421 LCKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGK 480
           L KMLKSG QPS+ITYTL IDN CK G+VEMAE+ FQKM+ EG+KPDVV +NILMDGYGK
Sbjct: 448 LGKMLKSGIQPSVITYTLLIDNLCKCGNVEMAEIFFQKMVTEGIKPDVVAFNILMDGYGK 507

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGL  RGFL+EAKDILDELIRRGFSIDVVT
Sbjct: 508 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLFTRGFLREAKDILDELIRRGFSIDVVT 567

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKML 600
           YTN IYGYSKRGNFEEAFL+WYHMTDNCVKPDVVTCSALLSGYCRE RMDEANALF KML
Sbjct: 568 YTNFIYGYSKRGNFEEAFLVWYHMTDNCVKPDVVTCSALLSGYCREHRMDEANALFYKML 627

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTN 660
           DIGLNPDLILYNTLIHGFCSVGNVDE CN V KMIESSI+PNNVTH ALV GFQ+K+V +
Sbjct: 628 DIGLNPDLILYNTLIHGFCSVGNVDEACNLVMKMIESSILPNNVTHRALVFGFQKKQVIS 687

Query: 661 PIKSATSKLQEILLAYNLQIDANGYI 686
           PI+SAT KLQEIL AY +++DA GYI
Sbjct: 688 PIESATCKLQEILRAYGIEVDAKGYI 711

BLAST of ClCG07G005030 vs. NCBI nr
Match: XP_008445921.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis melo])

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 549/685 (80.15%), Positives = 610/685 (89.05%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQS 60
           MKSAFS  SFCSKLNF RK  CRYFATAN  LSS N++D+    +C   TNY+ DS E+S
Sbjct: 1   MKSAFSIISFCSKLNFRRKTPCRYFATANYELSSFNHMDE----DC---TNYDVDSDERS 60

Query: 61  NFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLCL 120
            FG EV+VSKG+K DED+M+ IKLILGN GF LGS PKQ E VRILDILFEDSSD  LCL
Sbjct: 61  YFGNEVEVSKGKKTDEDKMEKIKLILGNRGFKLGSRPKQLETVRILDILFEDSSDPELCL 120

Query: 121 CYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLL 180
            YFKWSGCLSGSNQSLESIC+M HILV GN NH AVDL+S LV+ YG KEGSS ILL++ 
Sbjct: 121 YYFKWSGCLSGSNQSLESICRMAHILVAGNKNHGAVDLISHLVKNYGCKEGSSSILLEVF 180

Query: 181 YETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTN 240
           Y+THN+RKTLETTC M+++CYIKE MVT+A++L+ QM+ LN+FPSIWVYKSVI+ALLQTN
Sbjct: 181 YDTHNKRKTLETTCGMMINCYIKEGMVTSAVILIDQMRRLNVFPSIWVYKSVIKALLQTN 240

Query: 241 QSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTI 300
           + D+AWDLLEEM RQGISL+YSINLFIHHYC++G+LG+GWKVLLELRNF SKPD VD+T 
Sbjct: 241 RFDMAWDLLEEMQRQGISLHYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYTT 300

Query: 301 VINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPL 360
           VINSLCKISLLKEA+ALLFKM  FGVSPD VTMSS+IDG+CKVGK DIACKILKYF++PL
Sbjct: 301 VINSLCKISLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFKIPL 360

Query: 361 NIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYL 420
           NIF YNSFIT L  EGD  KAS+VFLEMSEVGLVPDCVSYTTMIGGY KV NIN AFSYL
Sbjct: 361 NIFIYNSFITELFMEGDTVKASKVFLEMSEVGLVPDCVSYTTMIGGYCKVGNINIAFSYL 420

Query: 421 CKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKK 480
            KMLKSG QPS+ITYTLF+D FC+ GDVEMAEVMF+KMI+E LKPDVVMYNILMD YGKK
Sbjct: 421 SKMLKSGIQPSVITYTLFVDYFCECGDVEMAEVMFEKMIVEDLKPDVVMYNILMDAYGKK 480

Query: 481 GYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTY 540
           GY+HKAF+LLDMMRSTNVTPDVVTYN+LI+GLVMRGFL+EAKDILDELIRRGFSIDVVTY
Sbjct: 481 GYMHKAFQLLDMMRSTNVTPDVVTYNSLIHGLVMRGFLQEAKDILDELIRRGFSIDVVTY 540

Query: 541 TNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLD 600
           TNI++GYSKRGNFEEAFLLWYHM DNCV PDVVTCSALLSGYCR + MDEANALFC+MLD
Sbjct: 541 TNIMHGYSKRGNFEEAFLLWYHMADNCVTPDVVTCSALLSGYCRAKHMDEANALFCRMLD 600

Query: 601 IGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNP 660
           IGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTH ALVLGFQ+KRV +P
Sbjct: 601 IGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVMDP 660

Query: 661 IKSATSKLQEILLAYNLQIDANGYI 686
           I+SATSKLQEIL+AY+LQIDA G+I
Sbjct: 661 IQSATSKLQEILIAYDLQIDAIGHI 678

BLAST of ClCG07G005030 vs. ExPASy Swiss-Prot
Match: Q6NKW7 (Pentatricopeptide repeat-containing protein At2g19280 OS=Arabidopsis thaliana OX=3702 GN=At2g19280 PE=2 SV=2)

HSP 1 Score: 580.5 bits (1495), Expect = 2.5e-164
Identity = 297/673 (44.13%), Positives = 444/673 (65.97%), Query Frame = 0

Query: 10  FCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQSNFGYE-VQV 69
           FC++    R   CR F+ A+ + ++  +  D       G+  Y+  S    +FG + V +
Sbjct: 16  FCTRTKAFRYFWCRTFSLASLSENNSRFQTDSSRLPYSGSRYYHSSS---KHFGEDFVSI 75

Query: 70  SKGQKADEDEMKTIKLILGNHGF------NLGSHPKQFEIVRILDILFEDSSDAGLCLCY 129
            K      D ++TI+ +L  H +         +   Q+ ++RILD LFE++ DA + L +
Sbjct: 76  LKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDDLFEETLDASIVLYF 135

Query: 130 FKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLLYE 189
           F+WS    G   S  SI +M+HILV+GNMN+RAVD++  LV+    +E S  +++K L+E
Sbjct: 136 FRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDLFE 195

Query: 190 THNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQS 249
           T  +R+ LET  S+L+DC I+ER V  AL L  ++    IFPS  V  S+++ +L+ +  
Sbjct: 196 TRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVHGL 255

Query: 250 DLAWDLLEEMYRQGISLNYSI-NLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIV 309
           +LA + +E M  +G  LN ++ +LFI  YC+ G   +GW++L+ ++++  +PD V FT+ 
Sbjct: 256 ELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFTVF 315

Query: 310 INSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPLN 369
           I+ LCK   LKEA+++LFK+  FG+S DSV++SSVIDG+CKVGK + A K++  FRL  N
Sbjct: 316 IDKLCKAGFLKEATSVLFKLKLFGISQDSVSVSSVIDGFCKVGKPEEAIKLIHSFRLRPN 375

Query: 370 IFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLC 429
           IF Y+SF++ +C+ GDM +AS +F E+ E+GL+PDCV YTTMI GY  +   ++AF Y  
Sbjct: 376 IFVYSSFLSNICSTGDMLRASTIFQEIFELGLLPDCVCYTTMIDGYCNLGRTDKAFQYFG 435

Query: 430 KMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKG 489
            +LKSG  PS+ T T+ I    ++G +  AE +F+ M  EGLK DVV YN LM GYGK  
Sbjct: 436 ALLKSGNPPSLTTSTILIGACSRFGSISDAESVFRNMKTEGLKLDVVTYNNLMHGYGKTH 495

Query: 490 YLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYT 549
            L+K FEL+D MRS  ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T
Sbjct: 496 QLNKVFELIDEMRSAGISPDVATYNILIHSMVVRGYIDEANEIISELIRRGFVPSTLAFT 555

Query: 550 NIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDI 609
           ++I G+SKRG+F+EAF+LW++M D  +KPDVVTCSALL GYC+ +RM++A  LF K+LD 
Sbjct: 556 DVIGGFSKRGDFQEAFILWFYMADLRMKPDVVTCSALLHGYCKAQRMEKAIVLFNKLLDA 615

Query: 610 GLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNPI 669
           GL PD++LYNTLIHG+CSVG++++ C  +  M++  ++PN  TH ALVLG + KR  N  
Sbjct: 616 GLKPDVVLYNTLIHGYCSVGDIEKACELIGLMVQRGMLPNESTHHALVLGLEGKRFVNSE 675

Query: 670 KSATSKLQEILLA 675
             A+  L+EI++A
Sbjct: 676 THASMLLEEIIVA 685
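
The bit score and Expect value of a hit are linked by the Karlin-Altschul relation E = N * 2^(-S'), where S' is the bit score and N is the effective search space (effective query length times effective database length). The page does not report N, so the sketch below only back-calculates the value implied by the At2g19280 hit; the result is approximate because both the score and the E-value are rounded in the report, and `implied_search_space` is a hypothetical helper.

```python
def implied_search_space(bit_score, e_value):
    """Solve E = N * 2**(-bit_score) for the effective search space N."""
    return e_value * 2.0 ** bit_score

def expected_hits(bit_score, search_space):
    """E-value for a given bit score and effective search space."""
    return search_space * 2.0 ** (-bit_score)

# Values reported for the Q6NKW7 (At2g19280) hit.
space = implied_search_space(580.5, 2.5e-164)
print(f"implied effective search space: {space:.2e}")

# Round trip: the same search space reproduces the reported E-value.
print(f"E-value at 580.5 bits: {expected_hits(580.5, space):.1e}")
```

Hits reported with Expect = 0.0e+00 cannot be used this way, because the E-value has underflowed to zero in the report.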

BLAST of ClCG07G005030 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 1.9e-71
Identity = 165/586 (28.16%), Positives = 294/586 (50.17%), Query Frame = 0

Query: 86  LGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLCLCYFKWSGCLSGSNQSLESICKMMHI 145
           L  H + L      F      ++L +  +D  L L +  W+        +L   C  +HI
Sbjct: 32  LKRHPYQLHHLSANFTPEAASNLLLKSQNDQALILKFLNWAN--PHQFFTLRCKCITLHI 91

Query: 146 LVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLLYETHNERKTLETTCSMLVDCYIKER 205
           L    + ++   ++++ V      +  + ++ K L ET++   +  +   ++V  Y +  
Sbjct: 92  LTKFKL-YKTAQILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLS 151

Query: 206 MVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEMYRQGISLNYSINL 265
           ++  AL ++   +     P +  Y +V+ A +++ +                +++++ N+
Sbjct: 152 LIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKR----------------NISFAENV 211

Query: 266 FIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIVINSLCKISLLKEASALLFKMTAFG 325
           F              K +LE +     P+   + I+I   C    +  A  L  KM   G
Sbjct: 212 F--------------KEMLESQ---VSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKG 271

Query: 326 VSPDSVTMSSVIDGYCKVGKLDIACKILKYFR---LPLNIFTYNSFITRLCTEGDMEKAS 385
             P+ VT +++IDGYCK+ K+D   K+L+      L  N+ +YN  I  LC EG M++ S
Sbjct: 272 CLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVS 331

Query: 386 EVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGTQPSIITYTLFIDNF 445
            V  EM+  G   D V+Y T+I GY K  N ++A     +ML+ G  PS+ITYT  I + 
Sbjct: 332 FVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSM 391

Query: 446 CKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDV 505
           CK G++  A     +M + GL P+   Y  L+DG+ +KGY+++A+ +L  M     +P V
Sbjct: 392 CKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSV 451

Query: 506 VTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYH 565
           VTYN LING  + G +++A  +L+++  +G S DVV+Y+ ++ G+ +  + +EA  +   
Sbjct: 452 VTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKRE 511

Query: 566 MTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGN 625
           M +  +KPD +T S+L+ G+C +RR  EA  L+ +ML +GL PD   Y  LI+ +C  G+
Sbjct: 512 MVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGD 571

Query: 626 VDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNPIKSATSKL 669
           +++      +M+E  ++P+ VT+  L+ G  ++  T   K    KL
Sbjct: 572 LEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKL 581

BLAST of ClCG07G005030 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 3.0e-64
Identity = 171/575 (29.74%), Positives = 277/575 (48.17%), Query Frame = 0

Query: 99  QFEIVRILDILFEDSSDAGLCLCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDL 158
           +F+   ++ +L +   D  L L +F W+     SN  LES+C ++H+ V       A  L
Sbjct: 84  KFKTDHLIWVLMKIKCDYRLVLDFFDWARSRRDSN--LESLCIVIHLAVASKDLKVAQSL 143

Query: 159 MSQLVRTYGSKEGSSIILLKLLYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMK 218
           +S                         ER  L  T          +  V    LL+   K
Sbjct: 144 ISSFW----------------------ERPKLNVT----------DSFVQFFDLLVYTYK 203

Query: 219 HLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEMYRQGISLNY-SINLFIHHYCAKGDLG 278
                P   V+    Q L+       A  + E+M   G+ L+  S N+++       D  
Sbjct: 204 DWGSDPR--VFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTR--LSKDCY 263

Query: 279 RGWKVLLELRNFLSKP---DAVDFTIVINSLCKISLLKEASALLFKMTAFGVSPDSVTMS 338
           +    ++  R F       +   + IVI+ +C++  +KEA  LL  M   G +PD ++ S
Sbjct: 264 KTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYS 323

Query: 339 SVIDGYCKVGKLDIACKILKYFR---LPLNIFTYNSFITRLCTEGDMEKASEVFLEMSEV 398
           +V++GYC+ G+LD   K+++  +   L  N + Y S I  LC    + +A E F EM   
Sbjct: 324 TVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQ 383

Query: 399 GLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGTQPSIITYTLFIDNFCKYGDVEMA 458
           G++PD V YTT+I G+ K  +I  A  +  +M      P ++TYT  I  FC+ GD+  A
Sbjct: 384 GILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEA 443

Query: 459 EVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLING 518
             +F +M  +GL+PD V +  L++GY K G++  AF + + M     +P+VVTY TLI+G
Sbjct: 444 GKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDG 503

Query: 519 LVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPD 578
           L   G L  A ++L E+ + G   ++ TY +I+ G  K GN EEA  L        +  D
Sbjct: 504 LCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNAD 563

Query: 579 VVTCSALLSGYCRERRMDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVK 638
            VT + L+  YC+   MD+A  +  +ML  GL P ++ +N L++GFC  G +++G   + 
Sbjct: 564 TVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLN 617

Query: 639 KMIESSIIPNNVTHCALVLGFQRKRVTNPIKSATS 667
            M+   I PN  T  +LV   ++  + N +K+AT+
Sbjct: 624 WMLAKGIAPNATTFNSLV---KQYCIRNNLKAATA 617

BLAST of ClCG07G005030 vs. ExPASy Swiss-Prot
Match: Q9CAN6 (Pentatricopeptide repeat-containing protein At1g63070, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g63070 PE=2 SV=1)

HSP 1 Score: 232.6 bits (592), Expect = 1.3e-59
Identity = 134/451 (29.71%), Positives = 233/451 (51.66%), Query Frame = 0

Query: 210 ALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEMYRQGISLN-YSINLFIH 269
           A+ L G M     FPSI  +  ++ A+ + N+ DL   L E+M   GIS N Y+ ++FI+
Sbjct: 59  AIGLFGDMVKSRPFPSIVEFSKLLSAIAKMNKFDLVISLGEQMQNLGISHNLYTYSIFIN 118

Query: 270 HYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIVINSLCKISLLKEASALLFKMTAFGVSP 329
           ++C +  L     +L ++      P  V    ++N  C  + + EA AL+ +M   G  P
Sbjct: 119 YFCRRSQLSLALAILGKMMKLGYGPSIVTLNSLLNGFCHGNRISEAVALVDQMVEMGYQP 178

Query: 330 DSVTMSSVIDGYCKVGKLDIACKILKYFRL---PLNIFTYNSFITRLCTEGDMEKASEVF 389
           D+VT ++++ G  +  K   A  +++   +     ++ TY + I  LC  G+ + A  + 
Sbjct: 179 DTVTFTTLVHGLFQHNKASEAVALVERMVVKGCQPDLVTYGAVINGLCKRGEPDLALNLL 238

Query: 390 LEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGTQPSIITYTLFIDNFCKY 449
            +M +  +  D V Y T+I G  K ++++ AF    KM   G +P + TY   I   C Y
Sbjct: 239 NKMEKGKIEADVVIYNTIIDGLCKYKHMDDAFDLFNKMETKGIKPDVFTYNPLISCLCNY 298

Query: 450 GDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFELLD-MMRSTNVTPDVVT 509
           G    A  +   M+ + + PD+V +N L+D + K+G L +A +L D M++S +  PDVV 
Sbjct: 299 GRWSDASRLLSDMLEKNINPDLVFFNALIDAFVKEGKLVEAEKLYDEMVKSKHCFPDVVA 358

Query: 510 YNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMT 569
           YNTLI G      ++E  ++  E+ +RG   + VTYT +I+G+ +  + + A +++  M 
Sbjct: 359 YNTLIKGFCKYKRVEEGMEVFREMSQRGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMV 418

Query: 570 DNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVD 629
            + V PD++T + LL G C    ++ A  +F  M    +  D++ Y T+I   C  G V+
Sbjct: 419 SDGVHPDIMTYNILLDGLCNNGNVETALVVFEYMQKRDMKLDIVTYTTMIEALCKAGKVE 478

Query: 630 EGCNFVKKMIESSIIPNNVTHCALVLGFQRK 656
           +G +    +    + PN VT+  ++ GF RK
Sbjct: 479 DGWDLFCSLSLKGVKPNVVTYTTMMSGFCRK 509

BLAST of ClCG07G005030 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 231.5 bits (589), Expect = 2.9e-59
Identity = 130/456 (28.51%), Positives = 237/456 (51.97%), Query Frame = 0

Query: 193 TCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEM 252
           T + L+  Y  + ++  A  LM  M      P ++ Y +VI  L +  + + A ++  EM
Sbjct: 272 TYNTLISAYSSKGLMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEM 331

Query: 253 YRQGISL-NYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIVINSLCKISLL 312
            R G+S  + +    +   C KGD+    KV  ++R+    PD V F+ +++   +   L
Sbjct: 332 LRSGLSPDSTTYRSLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNL 391

Query: 313 KEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKI---LKYFRLPLNIFTYNSF 372
            +A      +   G+ PD+V  + +I GYC+ G + +A  +   +      +++ TYN+ 
Sbjct: 392 DKALMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTI 451

Query: 373 ITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGT 432
           +  LC    + +A ++F EM+E  L PD  + T +I G+ K+ N+  A     KM +   
Sbjct: 452 LHGLCKRKMLGEADKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRI 511

Query: 433 QPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFE 492
           +  ++TY   +D F K GD++ A+ ++  M+ + + P  + Y+IL++    KG+L +AF 
Sbjct: 512 RLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFR 571

Query: 493 LLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYS 552
           + D M S N+ P V+  N++I G    G   + +  L+++I  GF  D ++Y  +IYG+ 
Sbjct: 572 VWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFV 631

Query: 553 KRGNFEEAFLLWYHMTD--NCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDIGLNPD 612
           +  N  +AF L   M +    + PDV T +++L G+CR+ +M EA  +  KM++ G+NPD
Sbjct: 632 REENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIERGVNPD 691

Query: 613 LILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNN 643
              Y  +I+GF S  N+ E      +M++    P++
Sbjct: 692 RSTYTCMINGFVSQDNLTEAFRIHDEMLQRGFSPDD 727

BLAST of ClCG07G005030 vs. ExPASy TrEMBL
Match: A0A6J1GY27 (pentatricopeptide repeat-containing protein At2g19280 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458516 PE=4 SV=1)

HSP 1 Score: 1192.2 bits (3083), Expect = 0.0e+00
Identity = 594/686 (86.59%), Positives = 634/686 (92.42%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVD-DCFTFECPGATNYNDDSVEQ 60
           MKSAFS  +FCSKLNFGRKR CRYFATANSALSS NY D DCFT E P ATN + DS EQ
Sbjct: 1   MKSAFSIINFCSKLNFGRKRPCRYFATANSALSSFNYADEDCFTSELPAATNSDVDSEEQ 60

Query: 61  SNFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLC 120
           + FG +VQVSKG KAD+DEMK IKLILGNHGFNLGSHPKQ EIVRILDILFE+SSDA LC
Sbjct: 61  NYFGNDVQVSKGLKADDDEMKLIKLILGNHGFNLGSHPKQLEIVRILDILFEESSDARLC 120

Query: 121 LCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKL 180
           L YFKWSGCLSGSN+SLESIC+M+HILV GNMNHRAVDLMS LV+ YGSKEG S ILLKL
Sbjct: 121 LYYFKWSGCLSGSNRSLESICRMIHILVAGNMNHRAVDLMSHLVKNYGSKEGFSTILLKL 180

Query: 181 LYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQT 240
            YETH+ERKTLETTCSMLVDCYIKERMVTAAL+LMGQMK  +IFPSIWVYKSVIQALLQT
Sbjct: 181 FYETHHERKTLETTCSMLVDCYIKERMVTAALILMGQMKSFDIFPSIWVYKSVIQALLQT 240

Query: 241 NQSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFT 300
           NQS+ AWDLLEEM+RQGISLNYSINLFI+HYCAKG+L RGWKVLLELR F SKPDAVD+T
Sbjct: 241 NQSESAWDLLEEMHRQGISLNYSINLFIYHYCAKGNLSRGWKVLLELRKFGSKPDAVDYT 300

Query: 301 IVINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLP 360
           IVINSLCKISLLKEA+ALLFKMTAFGVSPDSVTMSSVIDGYCK+GKLDIACKILKYFR P
Sbjct: 301 IVINSLCKISLLKEATALLFKMTAFGVSPDSVTMSSVIDGYCKLGKLDIACKILKYFRRP 360

Query: 361 LNIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSY 420
           LNIF YNSFIT+LC EG+  KASEVFLEMSEVGLVPDCVSYTTMIGGY KV NINRAFSY
Sbjct: 361 LNIFIYNSFITKLCMEGNTVKASEVFLEMSEVGLVPDCVSYTTMIGGYCKVGNINRAFSY 420

Query: 421 LCKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGK 480
           L KMLKSG +PS+ITYTLFID FCK  DVEMAEVM QKMIIEGL PDVV YNILMDGYGK
Sbjct: 421 LGKMLKSGIRPSVITYTLFIDYFCKRRDVEMAEVMLQKMIIEGLNPDVVTYNILMDGYGK 480

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGYLHKAFELLD MRSTN+TPDVVTYNTLINGLV RGFL+EAKD+LDEL RRGF+IDVVT
Sbjct: 481 KGYLHKAFELLDTMRSTNLTPDVVTYNTLINGLVTRGFLQEAKDMLDELNRRGFNIDVVT 540

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKML 600
           YTNII+GYSKRGNFEEAFL+W+HMTDNCVKPDVVTCSALLSGYCRERR+DEANALFCKML
Sbjct: 541 YTNIIHGYSKRGNFEEAFLVWFHMTDNCVKPDVVTCSALLSGYCRERRIDEANALFCKML 600

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTN 660
           DIGLNPDLILYNTLIHGFCSVGNVDEGCN VKKMIE+SI+PNNVTH ALVLGFQ+++V +
Sbjct: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNLVKKMIENSILPNNVTHRALVLGFQKRKVID 660

Query: 661 PIKSATSKLQEILLAYNLQIDANGYI 686
           PI+SATSKLQEILLAY+LQIDANGYI
Sbjct: 661 PIESATSKLQEILLAYDLQIDANGYI 686

BLAST of ClCG07G005030 vs. ExPASy TrEMBL
Match: A0A0A0KV38 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G581700 PE=4 SV=1)

HSP 1 Score: 1142.9 bits (2955), Expect = 0.0e+00
Identity = 566/685 (82.63%), Positives = 622/685 (90.80%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQS 60
           M+SAFS  SFCSKLNF RK  CRY ATANS LSS N++D+    +C   TNY+ +S E+S
Sbjct: 1   MRSAFSIISFCSKLNFRRKTPCRYSATANSELSSFNHMDE----DC---TNYDVNSDERS 60

Query: 61  NFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLCL 120
             G EV+VSKGQK DEDEM+TIKLILGN GFNLGS PKQ EI+RILD+LFEDSSDAGLCL
Sbjct: 61  YVGNEVEVSKGQKTDEDEMETIKLILGNRGFNLGSCPKQLEIIRILDVLFEDSSDAGLCL 120

Query: 121 CYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLL 180
            YFKWSGCLSGSNQSLESIC+M HILV GNMNHRAVDL+S LV+ YG  EGSS ILLK+ 
Sbjct: 121 YYFKWSGCLSGSNQSLESICRMAHILVAGNMNHRAVDLISHLVKNYGCTEGSSSILLKVF 180

Query: 181 YETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTN 240
            ETHN RKTLETTCSM+V+CYIKERMVT+AL+L+ QMKHLNIFPSIWVYKSVI+ALLQTN
Sbjct: 181 CETHNGRKTLETTCSMMVNCYIKERMVTSALILIDQMKHLNIFPSIWVYKSVIKALLQTN 240

Query: 241 QSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTI 300
           QS +AWDLLEEM+RQG+SLNYSINLFIHHYC++G+LG+GWKVLLELRNF SKPD VD+T 
Sbjct: 241 QSGMAWDLLEEMHRQGVSLNYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYTT 300

Query: 301 VINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPL 360
           VINSLCK+SLLKEA+ALLFKM  FGVSPD VTMSS+IDG+CKVGK DIACKILKYFRLPL
Sbjct: 301 VINSLCKVSLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFRLPL 360

Query: 361 NIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYL 420
           NIF YNSFIT+L TEGDM KAS+VFLEM+EVGLVPDC+SYTTMIGGY KV NIN AFSYL
Sbjct: 361 NIFIYNSFITKLSTEGDMVKASKVFLEMTEVGLVPDCISYTTMIGGYCKVGNINIAFSYL 420

Query: 421 CKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKK 480
            KMLKSG QPS+ITYTLF+D FC+  DVEMAEVMF+KMI+EGLKPDVV+YNILMD YGKK
Sbjct: 421 SKMLKSGIQPSVITYTLFLDYFCECRDVEMAEVMFEKMIVEGLKPDVVVYNILMDAYGKK 480

Query: 481 GYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTY 540
           GY+HKAF+LLDMMRSTNVTPDVVTYNTLINGLVMRGFL+EAKDILDELIRRGFS+DVVTY
Sbjct: 481 GYMHKAFKLLDMMRSTNVTPDVVTYNTLINGLVMRGFLQEAKDILDELIRRGFSVDVVTY 540

Query: 541 TNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLD 600
           TNII+GYS RGNFEEAFLLWYHM +NCV PDVVTCSALLSGYCRE+RMDEANALFCKMLD
Sbjct: 541 TNIIHGYSTRGNFEEAFLLWYHMAENCVTPDVVTCSALLSGYCREKRMDEANALFCKMLD 600

Query: 601 IGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNP 660
           IGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTH ALVLGFQ+KRVT+P
Sbjct: 601 IGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVTDP 660

Query: 661 IKSATSKLQEILLAYNLQIDANGYI 686
           I+SATSKLQEIL+AY+LQIDA GYI
Sbjct: 661 IQSATSKLQEILIAYDLQIDAIGYI 678

BLAST of ClCG07G005030 vs. ExPASy TrEMBL
Match: A0A6J1CBR5 (pentatricopeptide repeat-containing protein At2g19280-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010113 PE=4 SV=1)

HSP 1 Score: 1123.2 bits (2904), Expect = 0.0e+00
Identity = 558/686 (81.34%), Positives = 606/686 (88.34%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYV-DDCFTFECPGATNYNDDSVEQ 60
           MKS FS   FCSKLNFGRK  CRYFAT N+ALS  N V DDCFT+E P A NY+ D  E+
Sbjct: 28  MKSPFSIICFCSKLNFGRKIACRYFATTNAALSLFNCVDDDCFTYEYPVAANYDVDFDEK 87

Query: 61  SNFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLC 120
             F  E    KGQK D+D MK IKLIL NHG NLGSHPKQ EIVRILD LFEDSSDAGL 
Sbjct: 88  IYFRNE--DPKGQKVDDDRMKMIKLILRNHGLNLGSHPKQLEIVRILDTLFEDSSDAGLS 147

Query: 121 LCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKL 180
           L YFKWSGCLSGSNQSL+SIC+M+ IL+TGNMNHRAVDLMS +V  YGSKEGSS +LLKL
Sbjct: 148 LYYFKWSGCLSGSNQSLQSICRMIRILITGNMNHRAVDLMSHIVENYGSKEGSSSMLLKL 207

Query: 181 LYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQT 240
            +E  NERKTLET CSMLV CYIKERMVTAAL+LMGQMKHL IFPSIWVY+SVIQ LL+T
Sbjct: 208 FFEMVNERKTLETACSMLVYCYIKERMVTAALILMGQMKHLKIFPSIWVYRSVIQTLLET 267

Query: 241 NQSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFT 300
           NQ +LAWDLLEEMY QG+SLNYSINLFIHHYCA+G+LG GWKVLLELRNF SKPDAVD+T
Sbjct: 268 NQLELAWDLLEEMYIQGVSLNYSINLFIHHYCAEGNLGMGWKVLLELRNFGSKPDAVDYT 327

Query: 301 IVINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLP 360
           IVI+SLCK SLLKEA++LLFKM+AFGVSPDSVTMSSVIDGYCK+G LD+ACKILKYFRLP
Sbjct: 328 IVIDSLCKNSLLKEATSLLFKMSAFGVSPDSVTMSSVIDGYCKIGNLDVACKILKYFRLP 387

Query: 361 LNIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSY 420
           LNIFTYNSFIT+LCTEG+M  ASEVFLEMSEVGL+PDCVSYTTM+GGY KV +IN+AF Y
Sbjct: 388 LNIFTYNSFITKLCTEGNMVSASEVFLEMSEVGLLPDCVSYTTMMGGYCKVGDINKAFLY 447

Query: 421 LCKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGK 480
           L KMLKSG QPS+ITYTL IDN CK G+VEMAE+ FQKM+ EG+KPDVV +NILMDGYGK
Sbjct: 448 LGKMLKSGIQPSVITYTLLIDNLCKCGNVEMAEIFFQKMVTEGIKPDVVAFNILMDGYGK 507

Query: 481 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVT 540
           KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGL  RGFL+EAKDILDELIRRGFSIDVVT
Sbjct: 508 KGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLFTRGFLREAKDILDELIRRGFSIDVVT 567

Query: 541 YTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKML 600
           YTN IYGYSKRGNFEEAFL+WYHMTDNCVKPDVVTCSALLSGYCRE RMDEANALF KML
Sbjct: 568 YTNFIYGYSKRGNFEEAFLVWYHMTDNCVKPDVVTCSALLSGYCREHRMDEANALFYKML 627

Query: 601 DIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTN 660
           DIGLNPDLILYNTLIHGFCSVGNVDE CN V KMIESSI+PNNVTH ALV GFQ+K+V +
Sbjct: 628 DIGLNPDLILYNTLIHGFCSVGNVDEACNLVMKMIESSILPNNVTHRALVFGFQKKQVIS 687

Query: 661 PIKSATSKLQEILLAYNLQIDANGYI 686
           PI+SAT KLQEIL AY +++DA GYI
Sbjct: 688 PIESATCKLQEILRAYGIEVDAKGYI 711

BLAST of ClCG07G005030 vs. ExPASy TrEMBL
Match: A0A1S3BDC2 (pentatricopeptide repeat-containing protein At2g19280 OS=Cucumis melo OX=3656 GN=LOC103488803 PE=4 SV=1)

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 549/685 (80.15%), Positives = 610/685 (89.05%), Query Frame = 0

Query: 1   MKSAFSKTSFCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQS 60
           MKSAFS  SFCSKLNF RK  CRYFATAN  LSS N++D+    +C   TNY+ DS E+S
Sbjct: 1   MKSAFSIISFCSKLNFRRKTPCRYFATANYELSSFNHMDE----DC---TNYDVDSDERS 60

Query: 61  NFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLCL 120
            FG EV+VSKG+K DED+M+ IKLILGN GF LGS PKQ E VRILDILFEDSSD  LCL
Sbjct: 61  YFGNEVEVSKGKKTDEDKMEKIKLILGNRGFKLGSRPKQLETVRILDILFEDSSDPELCL 120

Query: 121 CYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLL 180
            YFKWSGCLSGSNQSLESIC+M HILV GN NH AVDL+S LV+ YG KEGSS ILL++ 
Sbjct: 121 YYFKWSGCLSGSNQSLESICRMAHILVAGNKNHGAVDLISHLVKNYGCKEGSSSILLEVF 180

Query: 181 YETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTN 240
           Y+THN+RKTLETTC M+++CYIKE MVT+A++L+ QM+ LN+FPSIWVYKSVI+ALLQTN
Sbjct: 181 YDTHNKRKTLETTCGMMINCYIKEGMVTSAVILIDQMRRLNVFPSIWVYKSVIKALLQTN 240

Query: 241 QSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTI 300
           + D+AWDLLEEM RQGISL+YSINLFIHHYC++G+LG+GWKVLLELRNF SKPD VD+T 
Sbjct: 241 RFDMAWDLLEEMQRQGISLHYSINLFIHHYCSEGNLGKGWKVLLELRNFGSKPDVVDYTT 300

Query: 301 VINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPL 360
           VINSLCKISLLKEA+ALLFKM  FGVSPD VTMSS+IDG+CKVGK DIACKILKYF++PL
Sbjct: 301 VINSLCKISLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIACKILKYFKIPL 360

Query: 361 NIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYL 420
           NIF YNSFIT L  EGD  KAS+VFLEMSEVGLVPDCVSYTTMIGGY KV NIN AFSYL
Sbjct: 361 NIFIYNSFITELFMEGDTVKASKVFLEMSEVGLVPDCVSYTTMIGGYCKVGNINIAFSYL 420

Query: 421 CKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKK 480
            KMLKSG QPS+ITYTLF+D FC+ GDVEMAEVMF+KMI+E LKPDVVMYNILMD YGKK
Sbjct: 421 SKMLKSGIQPSVITYTLFVDYFCECGDVEMAEVMFEKMIVEDLKPDVVMYNILMDAYGKK 480

Query: 481 GYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTY 540
           GY+HKAF+LLDMMRSTNVTPDVVTYN+LI+GLVMRGFL+EAKDILDELIRRGFSIDVVTY
Sbjct: 481 GYMHKAFQLLDMMRSTNVTPDVVTYNSLIHGLVMRGFLQEAKDILDELIRRGFSIDVVTY 540

Query: 541 TNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLD 600
           TNI++GYSKRGNFEEAFLLWYHM DNCV PDVVTCSALLSGYCR + MDEANALFC+MLD
Sbjct: 541 TNIMHGYSKRGNFEEAFLLWYHMADNCVTPDVVTCSALLSGYCRAKHMDEANALFCRMLD 600

Query: 601 IGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNP 660
           IGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTH ALVLGFQ+KRV +P
Sbjct: 601 IGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALVLGFQKKRVMDP 660

Query: 661 IKSATSKLQEILLAYNLQIDANGYI 686
           I+SATSKLQEIL+AY+LQIDA G+I
Sbjct: 661 IQSATSKLQEILIAYDLQIDAIGHI 678

BLAST of ClCG07G005030 vs. ExPASy TrEMBL
Match: A0A5D3CZD6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G003210 PE=4 SV=1)

HSP 1 Score: 1061.2 bits (2743), Expect = 1.8e-306
Identity = 519/636 (81.60%), Positives = 576/636 (90.57%), Query Frame = 0

Query: 50  TNYNDDSVEQSNFGYEVQVSKGQKADEDEMKTIKLILGNHGFNLGSHPKQFEIVRILDIL 109
           TNY+ DS E+S FG EV+VSKG+K DED+M+ IKLILGN GF LGS PKQ E VRILDIL
Sbjct: 6   TNYDVDSDERSYFGNEVEVSKGKKTDEDKMEKIKLILGNRGFKLGSRPKQLETVRILDIL 65

Query: 110 FEDSSDAGLCLCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSK 169
           FEDSSD  LCL YFKWSGCLSGSNQSLESIC+M HILV GN NH AVDL+S LV+ YG K
Sbjct: 66  FEDSSDPELCLYYFKWSGCLSGSNQSLESICRMAHILVAGNKNHGAVDLISHLVKNYGCK 125

Query: 170 EGSSIILLKLLYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVY 229
           EGSS ILL++ Y+THN+RKTLETTC M+++CYIKE MVT+A++L+ QM+ LN+FPSIWVY
Sbjct: 126 EGSSSILLEVFYDTHNKRKTLETTCGMMINCYIKEGMVTSAVILIDQMRRLNVFPSIWVY 185

Query: 230 KSVIQALLQTNQSDLAWDLLEEMYRQGISLNYSINLFIHHYCAKGDLGRGWKVLLELRNF 289
           KSVI+ALLQTN+ D+AWDLLEEM RQGISL+YSINLFIHHYC++G+LG+GWKVLLELRNF
Sbjct: 186 KSVIKALLQTNRFDMAWDLLEEMQRQGISLHYSINLFIHHYCSEGNLGKGWKVLLELRNF 245

Query: 290 LSKPDAVDFTIVINSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIA 349
            SKPD VD+T VINSLCKISLLKEA+ALLFKM  FGVSPD VTMSS+IDG+CKVGK DIA
Sbjct: 246 GSKPDVVDYTTVINSLCKISLLKEATALLFKMITFGVSPDLVTMSSIIDGHCKVGKSDIA 305

Query: 350 CKILKYFRLPLNIFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLK 409
           CKILKYF++PLNIF YNSFIT L  EGD  KAS+VFLEMSEVGLVPDCVSYTTMIGGY K
Sbjct: 306 CKILKYFKIPLNIFIYNSFITELFMEGDTVKASKVFLEMSEVGLVPDCVSYTTMIGGYCK 365

Query: 410 VENINRAFSYLCKMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVM 469
           V NIN AFSYL KMLKSG QPS+ITYTLF+D FC+ GDVEMAEVMF+KMI+E LKPDVVM
Sbjct: 366 VGNINIAFSYLSKMLKSGIQPSVITYTLFVDYFCECGDVEMAEVMFEKMIVEDLKPDVVM 425

Query: 470 YNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELI 529
           YNILMD YGKKGY+HKAF+LLDMMRSTNVTPDVVTYN+LI+GLVMRGFL+EAKDILDELI
Sbjct: 426 YNILMDAYGKKGYMHKAFQLLDMMRSTNVTPDVVTYNSLIHGLVMRGFLQEAKDILDELI 485

Query: 530 RRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMD 589
           RRGFSIDVVTYTNI++GYSKRGNFEEAFLLWYHM DNCV PDVVTCSALLSGYCR + MD
Sbjct: 486 RRGFSIDVVTYTNIMHGYSKRGNFEEAFLLWYHMADNCVTPDVVTCSALLSGYCRAKHMD 545

Query: 590 EANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALV 649
           EANALFC+MLDIGL PDLILYNTLIHGFCSVGNVDEGCN VKKMIESSIIPNNVTH ALV
Sbjct: 546 EANALFCRMLDIGLKPDLILYNTLIHGFCSVGNVDEGCNLVKKMIESSIIPNNVTHRALV 605

Query: 650 LGFQRKRVTNPIKSATSKLQEILLAYNLQIDANGYI 686
           LGFQ+KRV +PI+SATSKLQEIL+AY+LQIDA G+I
Sbjct: 606 LGFQKKRVMDPIQSATSKLQEILIAYDLQIDAIGHI 641

BLAST of ClCG07G005030 vs. TAIR 10
Match: AT2G19280.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 580.5 bits (1495), Expect = 1.8e-165
Identity = 297/673 (44.13%), Positives = 444/673 (65.97%), Query Frame = 0

Query: 10  FCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQSNFGYE-VQV 69
           FC++    R   CR F+ A+ + ++  +  D       G+  Y+  S    +FG + V +
Sbjct: 16  FCTRTKAFRYFWCRTFSLASLSENNSRFQTDSSRLPYSGSRYYHSSS---KHFGEDFVSI 75

Query: 70  SKGQKADEDEMKTIKLILGNHGF------NLGSHPKQFEIVRILDILFEDSSDAGLCLCY 129
            K      D ++TI+ +L  H +         +   Q+ ++RILD LFE++ DA + L +
Sbjct: 76  LKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDDLFEETLDASIVLYF 135

Query: 130 FKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLLYE 189
           F+WS    G   S  SI +M+HILV+GNMN+RAVD++  LV+    +E S  +++K L+E
Sbjct: 136 FRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDLFE 195

Query: 190 THNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQS 249
           T  +R+ LET  S+L+DC I+ER V  AL L  ++    IFPS  V  S+++ +L+ +  
Sbjct: 196 TRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVHGL 255

Query: 250 DLAWDLLEEMYRQGISLNYSI-NLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIV 309
           +LA + +E M  +G  LN ++ +LFI  YC+ G   +GW++L+ ++++  +PD V FT+ 
Sbjct: 256 ELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFTVF 315

Query: 310 INSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPLN 369
           I+ LCK   LKEA+++LFK+  FG+S DSV++SSVIDG+CKVGK + A K++  FRL  N
Sbjct: 316 IDKLCKAGFLKEATSVLFKLKLFGISQDSVSVSSVIDGFCKVGKPEEAIKLIHSFRLRPN 375

Query: 370 IFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLC 429
           IF Y+SF++ +C+ GDM +AS +F E+ E+GL+PDCV YTTMI GY  +   ++AF Y  
Sbjct: 376 IFVYSSFLSNICSTGDMLRASTIFQEIFELGLLPDCVCYTTMIDGYCNLGRTDKAFQYFG 435

Query: 430 KMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKG 489
            +LKSG  PS+ T T+ I    ++G +  AE +F+ M  EGLK DVV YN LM GYGK  
Sbjct: 436 ALLKSGNPPSLTTSTILIGACSRFGSISDAESVFRNMKTEGLKLDVVTYNNLMHGYGKTH 495

Query: 490 YLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYT 549
            L+K FEL+D MRS  ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T
Sbjct: 496 QLNKVFELIDEMRSAGISPDVATYNILIHSMVVRGYIDEANEIISELIRRGFVPSTLAFT 555

Query: 550 NIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDI 609
           ++I G+SKRG+F+EAF+LW++M D  +KPDVVTCSALL GYC+ +RM++A  LF K+LD 
Sbjct: 556 DVIGGFSKRGDFQEAFILWFYMADLRMKPDVVTCSALLHGYCKAQRMEKAIVLFNKLLDA 615

Query: 610 GLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNPI 669
           GL PD++LYNTLIHG+CSVG++++ C  +  M++  ++PN  TH ALVLG + KR  N  
Sbjct: 616 GLKPDVVLYNTLIHGYCSVGDIEKACELIGLMVQRGMLPNESTHHALVLGLEGKRFVNSE 675

Query: 670 KSATSKLQEILLA 675
             A+  L+EI++A
Sbjct: 676 THASMLLEEIIVA 685

BLAST of ClCG07G005030 vs. TAIR 10
Match: AT2G19280.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 580.5 bits (1495), Expect = 1.8e-165
Identity = 297/673 (44.13%), Positives = 444/673 (65.97%), Query Frame = 0

Query: 10  FCSKLNFGRKRGCRYFATANSALSSLNYVDDCFTFECPGATNYNDDSVEQSNFGYE-VQV 69
           FC++    R   CR F+ A+ + ++  +  D       G+  Y+  S    +FG + V +
Sbjct: 16  FCTRTKAFRYFWCRTFSLASLSENNSRFQTDSSRLPYSGSRYYHSSS---KHFGEDFVSI 75

Query: 70  SKGQKADEDEMKTIKLILGNHGF------NLGSHPKQFEIVRILDILFEDSSDAGLCLCY 129
            K      D ++TI+ +L  H +         +   Q+ ++RILD LFE++ DA + L +
Sbjct: 76  LKNIDVPRDCVETIRNVLVKHNWIQKYESGFSTELDQYTVIRILDDLFEETLDASIVLYF 135

Query: 130 FKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLLYE 189
           F+WS    G   S  SI +M+HILV+GNMN+RAVD++  LV+    +E S  +++K L+E
Sbjct: 136 FRWSELWIGVEHSSRSISRMIHILVSGNMNYRAVDMLLCLVKKCSGEERSLCLVMKDLFE 195

Query: 190 THNERKTLETTCSMLVDCYIKERMVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQS 249
           T  +R+ LET  S+L+DC I+ER V  AL L  ++    IFPS  V  S+++ +L+ +  
Sbjct: 196 TRIDRRVLETVFSILIDCCIRERKVNMALKLTYKVDQFGIFPSRGVCISLLKEILRVHGL 255

Query: 250 DLAWDLLEEMYRQGISLNYSI-NLFIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIV 309
           +LA + +E M  +G  LN ++ +LFI  YC+ G   +GW++L+ ++++  +PD V FT+ 
Sbjct: 256 ELAREFVEHMLSRGRHLNAAVLSLFIRKYCSDGYFDKGWELLMGMKHYGIRPDIVAFTVF 315

Query: 310 INSLCKISLLKEASALLFKMTAFGVSPDSVTMSSVIDGYCKVGKLDIACKILKYFRLPLN 369
           I+ LCK   LKEA+++LFK+  FG+S DSV++SSVIDG+CKVGK + A K++  FRL  N
Sbjct: 316 IDKLCKAGFLKEATSVLFKLKLFGISQDSVSVSSVIDGFCKVGKPEEAIKLIHSFRLRPN 375

Query: 370 IFTYNSFITRLCTEGDMEKASEVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLC 429
           IF Y+SF++ +C+ GDM +AS +F E+ E+GL+PDCV YTTMI GY  +   ++AF Y  
Sbjct: 376 IFVYSSFLSNICSTGDMLRASTIFQEIFELGLLPDCVCYTTMIDGYCNLGRTDKAFQYFG 435

Query: 430 KMLKSGTQPSIITYTLFIDNFCKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKG 489
            +LKSG  PS+ T T+ I    ++G +  AE +F+ M  EGLK DVV YN LM GYGK  
Sbjct: 436 ALLKSGNPPSLTTSTILIGACSRFGSISDAESVFRNMKTEGLKLDVVTYNNLMHGYGKTH 495

Query: 490 YLHKAFELLDMMRSTNVTPDVVTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYT 549
            L+K FEL+D MRS  ++PDV TYN LI+ +V+RG++ EA +I+ ELIRRGF    + +T
Sbjct: 496 QLNKVFELIDEMRSAGISPDVATYNILIHSMVVRGYIDEANEIISELIRRGFVPSTLAFT 555

Query: 550 NIIYGYSKRGNFEEAFLLWYHMTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDI 609
           ++I G+SKRG+F+EAF+LW++M D  +KPDVVTCSALL GYC+ +RM++A  LF K+LD 
Sbjct: 556 DVIGGFSKRGDFQEAFILWFYMADLRMKPDVVTCSALLHGYCKAQRMEKAIVLFNKLLDA 615

Query: 610 GLNPDLILYNTLIHGFCSVGNVDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNPI 669
           GL PD++LYNTLIHG+CSVG++++ C  +  M++  ++PN  TH ALVLG + KR  N  
Sbjct: 616 GLKPDVVLYNTLIHGYCSVGDIEKACELIGLMVQRGMLPNESTHHALVLGLEGKRFVNSE 675

Query: 670 KSATSKLQEILLA 675
             A+  L+EI++A
Sbjct: 676 THASMLLEEIIVA 685

BLAST of ClCG07G005030 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 271.9 bits (694), Expect = 1.4e-72
Identity = 165/586 (28.16%), Positives = 294/586 (50.17%), Query Frame = 0

Query: 86  LGNHGFNLGSHPKQFEIVRILDILFEDSSDAGLCLCYFKWSGCLSGSNQSLESICKMMHI 145
           L  H + L      F      ++L +  +D  L L +  W+        +L   C  +HI
Sbjct: 32  LKRHPYQLHHLSANFTPEAASNLLLKSQNDQALILKFLNWAN--PHQFFTLRCKCITLHI 91

Query: 146 LVTGNMNHRAVDLMSQLVRTYGSKEGSSIILLKLLYETHNERKTLETTCSMLVDCYIKER 205
           L    + ++   ++++ V      +  + ++ K L ET++   +  +   ++V  Y +  
Sbjct: 92  LTKFKL-YKTAQILAEDVAAKTLDDEYASLVFKSLQETYDLCYSTSSVFDLVVKSYSRLS 151

Query: 206 MVTAALLLMGQMKHLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEMYRQGISLNYSINL 265
           ++  AL ++   +     P +  Y +V+ A +++ +                +++++ N+
Sbjct: 152 LIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKR----------------NISFAENV 211

Query: 266 FIHHYCAKGDLGRGWKVLLELRNFLSKPDAVDFTIVINSLCKISLLKEASALLFKMTAFG 325
           F              K +LE +     P+   + I+I   C    +  A  L  KM   G
Sbjct: 212 F--------------KEMLESQ---VSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKG 271

Query: 326 VSPDSVTMSSVIDGYCKVGKLDIACKILKYFR---LPLNIFTYNSFITRLCTEGDMEKAS 385
             P+ VT +++IDGYCK+ K+D   K+L+      L  N+ +YN  I  LC EG M++ S
Sbjct: 272 CLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVS 331

Query: 386 EVFLEMSEVGLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGTQPSIITYTLFIDNF 445
            V  EM+  G   D V+Y T+I GY K  N ++A     +ML+ G  PS+ITYT  I + 
Sbjct: 332 FVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSM 391

Query: 446 CKYGDVEMAEVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDV 505
           CK G++  A     +M + GL P+   Y  L+DG+ +KGY+++A+ +L  M     +P V
Sbjct: 392 CKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSV 451

Query: 506 VTYNTLINGLVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYH 565
           VTYN LING  + G +++A  +L+++  +G S DVV+Y+ ++ G+ +  + +EA  +   
Sbjct: 452 VTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKRE 511

Query: 566 MTDNCVKPDVVTCSALLSGYCRERRMDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGN 625
           M +  +KPD +T S+L+ G+C +RR  EA  L+ +ML +GL PD   Y  LI+ +C  G+
Sbjct: 512 MVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGD 571

Query: 626 VDEGCNFVKKMIESSIIPNNVTHCALVLGFQRKRVTNPIKSATSKL 669
           +++      +M+E  ++P+ VT+  L+ G  ++  T   K    KL
Sbjct: 572 LEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKL 581

BLAST of ClCG07G005030 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 248.1 bits (632), Expect = 2.1e-65
Identity = 171/575 (29.74%), Positives = 277/575 (48.17%), Query Frame = 0

Query: 99  QFEIVRILDILFEDSSDAGLCLCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDL 158
           +F+   ++ +L +   D  L L +F W+     SN  LES+C ++H+ V       A  L
Sbjct: 84  KFKTDHLIWVLMKIKCDYRLVLDFFDWARSRRDSN--LESLCIVIHLAVASKDLKVAQSL 143

Query: 159 MSQLVRTYGSKEGSSIILLKLLYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMK 218
           +S                         ER  L  T          +  V    LL+   K
Sbjct: 144 ISSFW----------------------ERPKLNVT----------DSFVQFFDLLVYTYK 203

Query: 219 HLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEMYRQGISLNY-SINLFIHHYCAKGDLG 278
                P   V+    Q L+       A  + E+M   G+ L+  S N+++       D  
Sbjct: 204 DWGSDPR--VFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTR--LSKDCY 263

Query: 279 RGWKVLLELRNFLSKP---DAVDFTIVINSLCKISLLKEASALLFKMTAFGVSPDSVTMS 338
           +    ++  R F       +   + IVI+ +C++  +KEA  LL  M   G +PD ++ S
Sbjct: 264 KTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYS 323

Query: 339 SVIDGYCKVGKLDIACKILKYFR---LPLNIFTYNSFITRLCTEGDMEKASEVFLEMSEV 398
           +V++GYC+ G+LD   K+++  +   L  N + Y S I  LC    + +A E F EM   
Sbjct: 324 TVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQ 383

Query: 399 GLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGTQPSIITYTLFIDNFCKYGDVEMA 458
           G++PD V YTT+I G+ K  +I  A  +  +M      P ++TYT  I  FC+ GD+  A
Sbjct: 384 GILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEA 443

Query: 459 EVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLING 518
             +F +M  +GL+PD V +  L++GY K G++  AF + + M     +P+VVTY TLI+G
Sbjct: 444 GKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDG 503

Query: 519 LVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPD 578
           L   G L  A ++L E+ + G   ++ TY +I+ G  K GN EEA  L        +  D
Sbjct: 504 LCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNAD 563

Query: 579 VVTCSALLSGYCRERRMDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVK 638
            VT + L+  YC+   MD+A  +  +ML  GL P ++ +N L++GFC  G +++G   + 
Sbjct: 564 TVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLN 617

Query: 639 KMIESSIIPNNVTHCALVLGFQRKRVTNPIKSATS 667
            M+   I PN  T  +LV   ++  + N +K+AT+
Sbjct: 624 WMLAKGIAPNATTFNSLV---KQYCIRNNLKAATA 617

BLAST of ClCG07G005030 vs. TAIR 10
Match: AT1G05670.2 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 248.1 bits (632), Expect = 2.1e-65
Identity = 171/575 (29.74%), Positives = 277/575 (48.17%), Query Frame = 0

Query: 99  QFEIVRILDILFEDSSDAGLCLCYFKWSGCLSGSNQSLESICKMMHILVTGNMNHRAVDL 158
           +F+   ++ +L +   D  L L +F W+     SN  LES+C ++H+ V       A  L
Sbjct: 84  KFKTDHLIWVLMKIKCDYRLVLDFFDWARSRRDSN--LESLCIVIHLAVASKDLKVAQSL 143

Query: 159 MSQLVRTYGSKEGSSIILLKLLYETHNERKTLETTCSMLVDCYIKERMVTAALLLMGQMK 218
           +S                         ER  L  T          +  V    LL+   K
Sbjct: 144 ISSFW----------------------ERPKLNVT----------DSFVQFFDLLVYTYK 203

Query: 219 HLNIFPSIWVYKSVIQALLQTNQSDLAWDLLEEMYRQGISLNY-SINLFIHHYCAKGDLG 278
                P   V+    Q L+       A  + E+M   G+ L+  S N+++       D  
Sbjct: 204 DWGSDPR--VFDVFFQVLVDFGLLREARRVFEKMLNYGLVLSVDSCNVYLTR--LSKDCY 263

Query: 279 RGWKVLLELRNFLSKP---DAVDFTIVINSLCKISLLKEASALLFKMTAFGVSPDSVTMS 338
           +    ++  R F       +   + IVI+ +C++  +KEA  LL  M   G +PD ++ S
Sbjct: 264 KTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGRIKEAHHLLLLMELKGYTPDVISYS 323

Query: 339 SVIDGYCKVGKLDIACKILKYFR---LPLNIFTYNSFITRLCTEGDMEKASEVFLEMSEV 398
           +V++GYC+ G+LD   K+++  +   L  N + Y S I  LC    + +A E F EM   
Sbjct: 324 TVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYGSIIGLLCRICKLAEAEEAFSEMIRQ 383

Query: 399 GLVPDCVSYTTMIGGYLKVENINRAFSYLCKMLKSGTQPSIITYTLFIDNFCKYGDVEMA 458
           G++PD V YTT+I G+ K  +I  A  +  +M      P ++TYT  I  FC+ GD+  A
Sbjct: 384 GILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSRDITPDVLTYTAIISGFCQIGDMVEA 443

Query: 459 EVMFQKMIIEGLKPDVVMYNILMDGYGKKGYLHKAFELLDMMRSTNVTPDVVTYNTLING 518
             +F +M  +GL+PD V +  L++GY K G++  AF + + M     +P+VVTY TLI+G
Sbjct: 444 GKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDAFRVHNHMIQAGCSPNVVTYTTLIDG 503

Query: 519 LVMRGFLKEAKDILDELIRRGFSIDVVTYTNIIYGYSKRGNFEEAFLLWYHMTDNCVKPD 578
           L   G L  A ++L E+ + G   ++ TY +I+ G  K GN EEA  L        +  D
Sbjct: 504 LCKEGDLDSANELLHEMWKIGLQPNIFTYNSIVNGLCKSGNIEEAVKLVGEFEAAGLNAD 563

Query: 579 VVTCSALLSGYCRERRMDEANALFCKMLDIGLNPDLILYNTLIHGFCSVGNVDEGCNFVK 638
            VT + L+  YC+   MD+A  +  +ML  GL P ++ +N L++GFC  G +++G   + 
Sbjct: 564 TVTYTTLMDAYCKSGEMDKAQEILKEMLGKGLQPTIVTFNVLMNGFCLHGMLEDGEKLLN 617

Query: 639 KMIESSIIPNNVTHCALVLGFQRKRVTNPIKSATS 667
            M+   I PN  T  +LV   ++  + N +K+AT+
Sbjct: 624 WMLAKGIAPNATTFNSLV---KQYCIRNNLKAATA 617

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_038892184.1  0.0e+00   92.27         pentatricopeptide repeat-containing protein At2g19280 [Benincasa hispida]
XP_022957015.1  0.0e+00   86.59         pentatricopeptide repeat-containing protein At2g19280 isoform X1 [Cucurbita mosc...
XP_011655513.1  0.0e+00   82.63         pentatricopeptide repeat-containing protein At2g19280 [Cucumis sativus] >XP_0116...
XP_022139130.1  0.0e+00   81.34         pentatricopeptide repeat-containing protein At2g19280-like isoform X1 [Momordica...
XP_008445921.1  0.0e+00   80.15         PREDICTED: pentatricopeptide repeat-containing protein At2g19280 [Cucumis melo]
Match Name  E-value   Identity (%)  Description
Q6NKW7      2.5e-164  44.13         Pentatricopeptide repeat-containing protein At2g19280 OS=Arabidopsis thaliana OX...
Q9FIX3      1.9e-71   28.16         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q0WVK7      3.0e-64   29.74         Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...
Q9CAN6      1.3e-59   29.71         Pentatricopeptide repeat-containing protein At1g63070, mitochondrial OS=Arabidop...
Q9LFC5      2.9e-59   28.51         Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX...
Match Name  E-value   Identity (%)  Description
A0A6J1GY27  0.0e+00   86.59         pentatricopeptide repeat-containing protein At2g19280 isoform X1 OS=Cucurbita mo...
A0A0A0KV38  0.0e+00   82.63         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G581700 PE=4 SV=1
A0A6J1CBR5  0.0e+00   81.34         pentatricopeptide repeat-containing protein At2g19280-like isoform X1 OS=Momordi...
A0A1S3BDC2  0.0e+00   80.15         pentatricopeptide repeat-containing protein At2g19280 OS=Cucumis melo OX=3656 GN...
A0A5D3CZD6  1.8e-306  81.60         Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
Match Name   E-value   Identity (%)  Description
AT2G19280.1  1.8e-165  44.13         Pentatricopeptide repeat (PPR) superfamily protein
AT2G19280.2  1.8e-165  44.13         Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1  1.4e-72   28.16         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G05670.1  2.1e-65   29.74         Pentatricopeptide repeat (PPR-like) superfamily protein
AT1G05670.2  2.1e-65   29.74         Pentatricopeptide repeat (PPR-like) superfamily protein
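Each HSP in the BLAST reports above carries a statistics line giving identical and positive (conservatively substituted) residue counts over the alignment length, and the Identity column of the summary tables appears to be the same percentage. The following is a minimal Python sketch, not part of the database output, that parses such a line and recomputes both percentages from the raw counts. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration.

```python
import re

# Matches lines such as:
#   "Identity = 134/451 (29.71%), Positives = 233/451 (51.66%), Query Frame = 0"
# The positives label is matched loosely so a misspelled variant still parses.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*ives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages.

    Hypothetical helper: the name and the keys of the returned dict are
    assumptions made for this example.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages recomputed from the raw counts, rounded to 2 decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages exactly as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 134/451 (29.71%), Positives = 233/451 (51.66%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed values with the reported ones is a cheap consistency check when these reports are scraped in bulk.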
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
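IPR Term   IPR Description                                    Source   Source Term  Source Description               Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535      PPR                              coord: 228..257, e-value: 0.037, score: 14.3
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535      PPR                              coord: 193..221, e-value: 0.23, score: 11.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041      PPR_2                            coord: 361..406, e-value: 1.0E-12, score: 48.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041      PPR_2                            coord: 293..342, e-value: 8.9E-9, score: 35.4
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041      PPR_2                            coord: 536..581, e-value: 3.3E-10, score: 40.0
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041      PPR_2                            coord: 465..512, e-value: 2.8E-16, score: 59.4
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 363..397, e-value: 2.0E-8, score: 31.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 228..260, e-value: 2.9E-4, score: 18.8
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 298..330, e-value: 4.3E-4, score: 18.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 468..502, e-value: 4.1E-9, score: 34.1
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 609..641, e-value: 6.4E-8, score: 30.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 503..537, e-value: 5.2E-8, score: 30.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 573..606, e-value: 4.7E-7, score: 27.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 398..431, e-value: 3.9E-6, score: 24.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 538..572, e-value: 1.6E-7, score: 29.0
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 433..467, e-value: 4.6E-8, score: 30.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756    TIGR00756                        coord: 193..225, e-value: 1.5E-4, score: 19.7
IPR002885  Pentatricopeptide repeat                           PFAM     PF12854      PPR_1                            coord: 427..458, e-value: 1.5E-5, score: 24.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF12854      PPR_1                            coord: 602..633, e-value: 1.7E-10, score: 40.4
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 571..605, score: 12.824779
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 294..328, score: 9.63504
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 431..465, score: 11.301158
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 466..500, score: 12.495939
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 225..259, score: 9.251395
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 501..535, score: 11.73961
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 536..570, score: 11.355965
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 396..430, score: 10.446177
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 361..395, score: 12.33152
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375      PPR                              coord: 606..640, score: 11.849223
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10   Tetratricopeptide repeat domain  coord: 461..531, e-value: 4.6E-21, score: 77.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10   Tetratricopeptide repeat domain  coord: 95..257, e-value: 1.8E-9, score: 39.4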
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 357..460
e-value: 1.6E-26
score: 95.5
coord: 260..356
e-value: 2.4E-19
score: 71.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 547..673
e-value: 5.5E-31
score: 109.3
NoneNo IPR availablePANTHERPTHR47941:SF3REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 2..681
NoneNo IPR availablePANTHERPTHR47941PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIALcoord: 2..681
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 374..611

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG07G005030.1 | ClCG07G005030.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding