CcUC06G125840 (gene) Watermelon (PI 537277) v1

Overview
NameCcUC06G125840
Typegene
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCicolChr06: 28196469 .. 28198316 (-)
RNA-Seq ExpressionCcUC06G125840
SyntenyCcUC06G125840
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATGTATTTCCGGCTTCTGTCTTTCTCATATAGAATAATTCAAAGGTCTCGGCTCCAACAAATTTGTACAATCTCGAACTCGGTTTTTCTAGAATCAGAAATGTCGAAATTTGTACATACCCAAGCGATGGATCTTCCTTCTCAGAGAACTAACGAGAGAAAGATTCCTGATTACGAGGACGCCCTCCATGAGGAAGCCGATGGCGTGCGGAGGCGCGGGTATTTCCTCACGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCGTATTTTCCCCAAGTTTCGTGGTTCTATTGATTCTCAGCTGTGTAACTCGATGATTAGGCGTTATTTGGATTTGAATAAGCATTTACATTCACTCTTCATTTTTGCCCACATGCATCAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTTGTGTTCTTAAAGCAACTGCAAAGCTATGTGCTACTGAACTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATCCATGATGTCTACGTAAGTACCGCTCTTATTCATATGTACTGTTCTTGTTTGTCTACATCCGATGCTTCTCAGTTGTTCGACGAAATGCCTGAAAGAAATGCAGTTACTTGGAATGCTCTGATTACTGGTTATACCCATAATAGAAAGTTTATGGAAGCTACCAACGCTTTCAGAGGAATGCTGGCAGCTGGGGCTGAACCGAGTGAGAGAACCGTGGTGGTAGTTCTATCAGCTTGTGCTCATTTGGGAGCGTTGAATCAGGGAAAGTGGATCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCAGAGAAGGTCTTTGAAGAAATTTGGGAGAAGAATGTCCATACGTGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAGATTTCGAGCCAGATGAGGTTACCTTTCTAGGTCTCTTGTGTGCATGCTGTCACCAAGGTCTGGTCACAGAAGGGCGCAGGCAATTCATGAGCATGAAACAACAATTTGGACTGCAACCAAAGATCGAGCATTATGGGTGTATGGTTGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTACAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAATTAGAACCAAACAATGGCGAGAATTATGTCTTGCTGTCAAATCTATATACAAGGGAACAACGGTGGGCTGAAGTAGGGGAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGGAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTCGCAGCATCAGATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGATTGAAAAATTGAAAGAAAATGGTTACGTTATACGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACGTTCGGTGGTGTACCATAGCGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAGTTTTTCAAAGTTGTATCACTTGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTTCACCATTTTTCCGAAGGTTTCTGTTCGTGTCGCGACTATTGGTGA

mRNA sequence

ATGGAAATGTATTTCCGGCTTCTGTCTTTCTCATATAGAATAATTCAAAGGTCTCGGCTCCAACAAATTTGTACAATCTCGAACTCGGTTTTTCTAGAATCAGAAATGTCGAAATTTGTACATACCCAAGCGATGGATCTTCCTTCTCAGAGAACTAACGAGAGAAAGATTCCTGATTACGAGGACGCCCTCCATGAGGAAGCCGATGGCGTGCGGAGGCGCGGGTATTTCCTCACGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCGTATTTTCCCCAAGTTTCGTGGTTCTATTGATTCTCAGCTGTGTAACTCGATGATTAGGCGTTATTTGGATTTGAATAAGCATTTACATTCACTCTTCATTTTTGCCCACATGCATCAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTTGTGTTCTTAAAGCAACTGCAAAGCTATGTGCTACTGAACTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATCCATGATGTCTACGTAAGTACCGCTCTTATTCATATGTACTGTTCTTGTTTGTCTACATCCGATGCTTCTCAGTTGTTCGACGAAATGCCTGAAAGAAATGCAGTTACTTGGAATGCTCTGATTACTGGTTATACCCATAATAGAAAGTTTATGGAAGCTACCAACGCTTTCAGAGGAATGCTGGCAGCTGGGGCTGAACCGAGTGAGAGAACCGTGGTGGTAGTTCTATCAGCTTGTGCTCATTTGGGAGCGTTGAATCAGGGAAAGTGGATCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCAGAGAAGGTCTTTGAAGAAATTTGGGAGAAGAATGTCCATACGTGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAGATTTCGAGCCAGATGAGGTTACCTTTCTAGGTCTCTTGTGTGCATGCTGTCACCAAGGTCTGGTCACAGAAGGGCGCAGGCAATTCATGAGCATGAAACAACAATTTGGACTGCAACCAAAGATCGAGCATTATGGGTGTATGGTTGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTACAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAATTAGAACCAAACAATGGCGAGAATTATGTCTTGCTGTCAAATCTATATACAAGGGAACAACGGTGGGCTGAAGTAGGGGAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGGAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTCGCAGCATCAGATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGATTGAAAAATTGAAAGAAAATGGTTACGTTATACGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACGTTCGGTGGTGTACCATAGCGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAGTTTTTCAAAGTTGTATCACTTGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTTCACCATTTTTCCGAAGGTTTCTGTTCGTGTCGCGACTATTGGTGA

Coding sequence (CDS)

ATGGAAATGTATTTCCGGCTTCTGTCTTTCTCATATAGAATAATTCAAAGGTCTCGGCTCCAACAAATTTGTACAATCTCGAACTCGGTTTTTCTAGAATCAGAAATGTCGAAATTTGTACATACCCAAGCGATGGATCTTCCTTCTCAGAGAACTAACGAGAGAAAGATTCCTGATTACGAGGACGCCCTCCATGAGGAAGCCGATGGCGTGCGGAGGCGCGGGTATTTCCTCACGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCGTATTTTCCCCAAGTTTCGTGGTTCTATTGATTCTCAGCTGTGTAACTCGATGATTAGGCGTTATTTGGATTTGAATAAGCATTTACATTCACTCTTCATTTTTGCCCACATGCATCAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTTGTGTTCTTAAAGCAACTGCAAAGCTATGTGCTACTGAACTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATCCATGATGTCTACGTAAGTACCGCTCTTATTCATATGTACTGTTCTTGTTTGTCTACATCCGATGCTTCTCAGTTGTTCGACGAAATGCCTGAAAGAAATGCAGTTACTTGGAATGCTCTGATTACTGGTTATACCCATAATAGAAAGTTTATGGAAGCTACCAACGCTTTCAGAGGAATGCTGGCAGCTGGGGCTGAACCGAGTGAGAGAACCGTGGTGGTAGTTCTATCAGCTTGTGCTCATTTGGGAGCGTTGAATCAGGGAAAGTGGATCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCAGAGAAGGTCTTTGAAGAAATTTGGGAGAAGAATGTCCATACGTGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAGATTTCGAGCCAGATGAGGTTACCTTTCTAGGTCTCTTGTGTGCATGCTGTCACCAAGGTCTGGTCACAGAAGGGCGCAGGCAATTCATGAGCATGAAACAACAATTTGGACTGCAACCAAAGATCGAGCATTATGGGTGTATGGTTGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTACAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAATTAGAACCAAACAATGGCGAGAATTATGTCTTGCTGTCAAATCTATATACAAGGGAACAACGGTGGGCTGAAGTAGGGGAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGGAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTCGCAGCATCAGATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGATTGAAAAATTGAAAGAAAATGGTTACGTTATACGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACGTTCGGTGGTGTACCATAGCGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAGTTTTTCAAAGTTGTATCACTTGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTTCACCATTTTTCCGAAGGTTTCTGTTCGTGTCGCGACTATTGGTGA

Protein sequence

MEMYFRLLSFSYRIIQRSRLQQICTISNSVFLESEMSKFVHTQAMDLPSQRTNERKIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW
Homology
BLAST of CcUC06G125840 vs. NCBI nr
Match: XP_038878567.1 (pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida])

HSP 1 Score: 1138.6 bits (2944), Expect = 0.0e+00
Identity = 555/615 (90.24%), Postives = 585/615 (95.12%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQQICTISNSVFLESEMSKFVHTQAMDLPSQRTNERKIPDY 60
           M+MYFRLL  S  IIQRSRLQ+ICTI NSV LESEMSKFVHTQAMDLP  RTNERKIPDY
Sbjct: 1   MKMYFRLLPLSCGIIQRSRLQEICTILNSVILESEMSKFVHTQAMDLPPPRTNERKIPDY 60

Query: 61  EDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDL 120
           +DALH+E + VRR GYFL KLIDDSVSHNGFESIA IF KFR SI+SQLCNSMIR YLDL
Sbjct: 61  KDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLDL 120

Query: 121 NKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYVS 180
           NKHL+SL+IFAHMH+FSILPDSSTFP VLKATA+LC TE+GKMIHGTVIQMGFIHDVY S
Sbjct: 121 NKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTS 180

Query: 181 TALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAE 240
           TAL+HMYC+CLS SDAS++FDEMPERNAVTWNALITGYTHNRKFMEA NAFRGMLAAGAE
Sbjct: 181 TALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAE 240

Query: 241 PSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKV 300
           PSERT+VVVLSAC+HLGALNQGKW+HEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKV
Sbjct: 241 PSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKV 300

Query: 301 FEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLV 360
           FEEI EKNV+TWNVLISGYAMNGQGDAAL AFSRMLME+F+PDEVTFLG+LCACCHQGLV
Sbjct: 301 FEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLV 360

Query: 361 TEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLC 420
           TEGRRQFMSMKQ FGLQPKIEHYGCMVDLLGRAG L+EAL+LIQSMSMEPDPIIWRALLC
Sbjct: 361 TEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLC 420

Query: 421 ACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKV 480
           ACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLY+REQRWAEVG+LRGMMSLRGIGKV
Sbjct: 421 ACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKV 480

Query: 481 PGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEK 540
           PGCSSIEINNVVYEFAAS+DRKPEFEAIYKQLDNL EKLKENGYV  TDMALYDIEKEEK
Sbjct: 481 PGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEK 540

Query: 541 ERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNR 600
           E SV+YHSEKLALAFGLLNSPL CTLRIVKNLRICLDCHEFFKVVS+VY+RYIVVRDRNR
Sbjct: 541 EHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNR 600

Query: 601 FHHFSEGFCSCRDYW 616
           FHHFSEGFCSCRDYW
Sbjct: 601 FHHFSEGFCSCRDYW 615

BLAST of CcUC06G125840 vs. NCBI nr
Match: XP_004138309.2 (pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN63701.1 hypothetical protein Csa_014271 [Cucumis sativus])

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 532/616 (86.36%), Postives = 568/616 (92.21%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQQ-ICTISNSVFLESEMSKFVHTQAMDLPSQRTNERKIPD 60
           M+MY RLL FSYRII+RSR+QQ ICTISN  FLESEM KFVHTQAMDLP Q TN  KIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLD 120
           Y D          RRG+FL KLIDDSVS NGFESIARIF K+RGSI+SQ CNSMIR YLD
Sbjct: 61  YNDV---------RRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLD 120

Query: 121 LNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYV 180
           LNKHL+SL+IFA MH+FSILPDSSTFP VLKATA+LC T +GKMIHG VIQMGFI DVY 
Sbjct: 121 LNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYT 180

Query: 181 STALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGA 240
           STAL+H+YC+CLS SDASQLFDEMPERNAVTWNALITGYTHNRKF++A +AFRGMLA GA
Sbjct: 181 STALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGA 240

Query: 241 EPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEK 300
           +PSERTVVVVLSAC+HLGA NQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAV E EK
Sbjct: 241 QPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEK 300

Query: 301 VFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGL 360
           VFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRMLME+F+PDEVTFLG+LCACCHQGL
Sbjct: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGL 360

Query: 361 VTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALL 420
           VTEGR QFMSMKQQFGLQP+IEHYGCMVDLLGRAGLLEEAL+LIQSMS+EPDPIIWRALL
Sbjct: 361 VTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALL 420

Query: 421 CACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGK 480
           CACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+Y+RE+RWAEVG+LRGMM+LRGI K
Sbjct: 421 CACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRK 480

Query: 481 VPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEE 540
           VPGCSSIEINNVVYEF AS+DRKPEFEAIYKQLDNLI+KLKENGYV  TDMALYDIEKEE
Sbjct: 481 VPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEE 540

Query: 541 KERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRN 600
           KE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYIVVRDRN
Sbjct: 541 KEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRN 600

Query: 601 RFHHFSEGFCSCRDYW 616
           RFHHF EGFCSCRDYW
Sbjct: 601 RFHHFYEGFCSCRDYW 607

BLAST of CcUC06G125840 vs. NCBI nr
Match: XP_023529316.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1055.4 bits (2728), Expect = 1.8e-304
Identity = 517/617 (83.79%), Postives = 565/617 (91.57%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQQICTISNSVFL--ESEMSKFVHTQAMDLPSQRTNERKIP 60
           M+M  R L FS+R+I+R+RLQ  CTISN  FL  +S++S+FVHT+ M+LPSQ   ERKIP
Sbjct: 1   MKMDLRFLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYL 120
           D  DA  +E + +R  GYFL KLI+DSVS+NGFESIA IF KFRGSI+SQ+CNSMIR YL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVY 180
           DLN+HL+SL IFAHMH+FSILPDSSTFP VLKATA+LC  +LGKMIHG V+QMGFI DVY
Sbjct: 121 DLNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 VSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAG 240
            STAL+HMYCSCLS SDASQLFDEMPERN+VTWNALITGYTHNRKF EA NAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFKEAINAFRGMLAAG 240

Query: 241 AEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERTVVVVLSAC+HLGALNQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQG 360
           KVFEEI +KNV+TWNVLISGY MNGQGDAALQAFSRMLME+F+PD VTFLGLLCACCHQG
Sbjct: 301 KVFEEIRDKNVYTWNVLISGYGMNGQGDAALQAFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRAL 420
           LVTEGRRQF+SMKQQFGLQPKIEHYGCMVDLLGRAGLLEEAL+LI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLY+RE+RW EVG+LRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKE 540
           KVPGCSSIEINN V+EF AS+DRK EF AIYKQLDN+++KLKENGYV  TDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNSVHEFTASNDRKLEFNAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDR 600
           EKE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKVVSLVYKRYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of CcUC06G125840 vs. NCBI nr
Match: XP_022134759.1 (pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia])

HSP 1 Score: 1052.4 bits (2720), Expect = 1.6e-303
Identity = 512/611 (83.80%), Postives = 558/611 (91.33%), Query Frame = 0

Query: 8   LSFSYRIIQRSRLQQICTISNSVFL--ESEMSKFVHTQ-AMDLPSQRTNERKIPDYEDAL 67
           +  S+R+I+R+RLQ ICTISNS FL  +S++SKF+HTQ  M+LP Q TNERKIPDY D +
Sbjct: 43  IEMSFRLIRRARLQDICTISNSAFLANQSQISKFMHTQLTMNLPPQSTNERKIPDYMDVV 102

Query: 68  HEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHL 127
            +E + +R  GYFL KLIDDSVSH+GFESIA IF KFRG I+ QLCN MIR YLD NKHL
Sbjct: 103 RKEGNDMRSDGYFLMKLIDDSVSHDGFESIAPIFSKFRGVINCQLCNWMIRGYLDSNKHL 162

Query: 128 HSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYVSTALI 187
           +SL IFAHMH+FSILPDSSTFP V+KATA+ C  ELGKMIHGTVIQMGFI DVY STAL+
Sbjct: 163 NSLLIFAHMHKFSILPDSSTFPAVIKATARSCNVELGKMIHGTVIQMGFIRDVYTSTALV 222

Query: 188 HMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSER 247
           HMYC+CLS SDA QLFDEMPERN+VTWNALITGYTHNRKFMEA NAFRGMLAAGAEPSER
Sbjct: 223 HMYCTCLSISDAYQLFDEMPERNSVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSER 282

Query: 248 TVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEI 307
           TVVVVLSAC+HLGALNQG WIHEFIY N+LRLNVFVGTALIDMYAKCGAV+EAEKVFEEI
Sbjct: 283 TVVVVLSACSHLGALNQGTWIHEFIYQNKLRLNVFVGTALIDMYAKCGAVEEAEKVFEEI 342

Query: 308 WEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGR 367
            EKNV+TWNVLISGYAMNGQGD ALQAFS ML E+F+PDEVTFLG+LCACCHQGLVTEGR
Sbjct: 343 REKNVYTWNVLISGYAMNGQGDEALQAFSMMLRENFKPDEVTFLGVLCACCHQGLVTEGR 402

Query: 368 RQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRV 427
           RQF+SMKQ FGL+P+IEHYGCMVDLLGRAGLLEEAL+LIQSMSMEPDPIIWRALLCACRV
Sbjct: 403 RQFVSMKQHFGLRPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPIIWRALLCACRV 462

Query: 428 HGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGCS 487
           HGNTKLGEY IRRLI+LEPNNGENYVLLSNLY+RE+RW EVG+LRGMMSLRGIGKVPGCS
Sbjct: 463 HGNTKLGEYAIRRLIDLEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIGKVPGCS 522

Query: 488 SIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSV 547
           SIEI NVVYEFAAS+DRKPEF+AIYKQLDN+IEKLK NGY+  T MAL+DIE+EEKE  V
Sbjct: 523 SIEIKNVVYEFAASNDRKPEFDAIYKQLDNVIEKLKANGYITGTGMALFDIEEEEKEHCV 582

Query: 548 VYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHF 607
           +YHSEKLALAFGLLNSPLDC LRIVKNLRICLDCHEFFKV SLVYKR+IVVRDRNRFHHF
Sbjct: 583 MYHSEKLALAFGLLNSPLDCALRIVKNLRICLDCHEFFKVASLVYKRFIVVRDRNRFHHF 642

Query: 608 SEGFCSCRDYW 616
           SEGFCSCRDYW
Sbjct: 643 SEGFCSCRDYW 653

BLAST of CcUC06G125840 vs. NCBI nr
Match: XP_022925029.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata])

HSP 1 Score: 1051.2 bits (2717), Expect = 3.5e-303
Identity = 514/617 (83.31%), Postives = 564/617 (91.41%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQQICTISNSVFL--ESEMSKFVHTQAMDLPSQRTNERKIP 60
           M+M  RLL FS+R+I+R+RLQ  CTISN  FL  +S++S+FVHT+ M+LPSQ   ERKIP
Sbjct: 1   MKMDLRLLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYL 120
           D  DA  +E + +R  GYFL KLI+DSVS+NGFESIA IF KFRGSI+SQ+CNSMIR YL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVY 180
           D N+HL+SL IFAHMH+FSILPDSSTFP VLKATA+LC  +LGKMIHG V+QMGFI DVY
Sbjct: 121 DSNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 VSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAG 240
            STAL+HMYCSCLS SDASQLFDEMPERN+VTWNALITGYTHNRKF EA NAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFREAINAFRGMLAAG 240

Query: 241 AEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERTVVVVLSAC+HLGALNQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQG 360
           KVFEEI ++NV+TWNVLISGY MNGQG+AALQ FSRMLME+F+PD VTFLGLLCACCHQG
Sbjct: 301 KVFEEIRDRNVYTWNVLISGYGMNGQGNAALQVFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRAL 420
           LVTEGRRQF+SMKQQFGLQPKIEHYGCMVDLLGRAGLLEEAL+LI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLY+RE+RW EVG+LRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKE 540
           KVPGCSSIEINN V+EF AS+DRK EF AIYKQLDN+++KLKENGYV  TDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNAVHEFTASNDRKREFSAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDR 600
           EKE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKVVSLVYKRYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of CcUC06G125840 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 467.2 bits (1201), Expect = 2.8e-130
Identity = 245/587 (41.74%), Postives = 359/587 (61.16%), Query Frame = 0

Query: 32  LESEMSKFVHTQAMDLPSQRTNERKIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGF 91
           L  ++  +VHT  + +  Q  N R     EDA         R     T LI    S    
Sbjct: 163 LGCDLDLYVHTSLISMYVQ--NGR----LEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 222

Query: 92  ESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKA 151
           E+  ++F +     D    N+MI  Y +   +  +L +F  M + ++ PD ST   V+ A
Sbjct: 223 ENAQKLFDEIPVK-DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 282

Query: 152 TAKLCATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTW 211
            A+  + ELG+ +H  +   GF  ++ +  ALI +Y  C     A  LF+ +P ++ ++W
Sbjct: 283 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 342

Query: 212 NALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYH 271
           N LI GYTH   + EA   F+ ML +G  P++ T++ +L ACAHLGA++ G+WIH +I  
Sbjct: 343 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI-D 402

Query: 272 NRLR--LNV-FVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAA 331
            RL+   N   + T+LIDMYAKCG ++ A +VF  I  K++ +WN +I G+AM+G+ DA+
Sbjct: 403 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 462

Query: 332 LQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVD 391
              FSRM     +PD++TF+GLL AC H G++  GR  F +M Q + + PK+EHYGCM+D
Sbjct: 463 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 522

Query: 392 LLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGEN 451
           LLG +GL +EA ++I  M MEPD +IW +LL AC++HGN +LGE     LI++EP N  +
Sbjct: 523 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 582

Query: 452 YVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAI 511
           YVLLSN+Y    RW EV + R +++ +G+ KVPGCSSIEI++VV+EF   D   P    I
Sbjct: 583 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 642

Query: 512 YKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRI 571
           Y  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKLA+AFGL+++     L I
Sbjct: 643 YGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTI 702

Query: 572 VKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VKNLR+C +CHE  K++S +YKR I+ RDR RFHHF +G CSC DYW
Sbjct: 703 VKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CcUC06G125840 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 466.8 bits (1200), Expect = 3.7e-130
Identity = 223/522 (42.72%), Postives = 340/522 (65.13%), Query Frame = 0

Query: 96  RIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHQFSIL-PDSSTFPCVLKATAK 155
           ++F K    I+  + N++IR Y ++   + +  ++  M    ++ PD+ T+P ++KA   
Sbjct: 74  KVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTT 133

Query: 156 LCATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNAL 215
           +    LG+ IH  VI+ GF   +YV  +L+H+Y +C   + A ++FD+MPE++ V WN++
Sbjct: 134 MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 193

Query: 216 ITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRL 275
           I G+  N K  EA   +  M + G +P   T+V +LSACA +GAL  GK +H ++    L
Sbjct: 194 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 253

Query: 276 RLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSR 335
             N+     L+D+YA+CG V+EA+ +F+E+ +KN  +W  LI G A+NG G  A++ F  
Sbjct: 254 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 313

Query: 336 M-LMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRA 395
           M   E   P E+TF+G+L AC H G+V EG   F  M++++ ++P+IEH+GCMVDLL RA
Sbjct: 314 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 373

Query: 396 GLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLS 455
           G +++A + I+SM M+P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLS
Sbjct: 374 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 433

Query: 456 NLYTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLD 515
           N+Y  EQRW++V ++R  M   G+ KVPG S +E+ N V+EF   D   P+ +AIY +L 
Sbjct: 434 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 493

Query: 516 NLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLR 575
            +  +L+  GYV +      D+E+EEKE +VVYHSEK+A+AF L+++P    + +VKNLR
Sbjct: 494 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 553

Query: 576 ICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           +C DCH   K+VS VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 554 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CcUC06G125840 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 461.1 bits (1185), Expect = 2.0e-128
Identity = 237/543 (43.65%), Postives = 340/543 (62.62%), Query Frame = 0

Query: 77  FLTKLID---DSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHM 136
           F+ KLI+   +S + +   S AR   +     D  + NSM R Y      L    +F  +
Sbjct: 62  FVAKLINFCTESPTESSM-SYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEI 121

Query: 137 HQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLST 196
            +  ILPD+ TFP +LKA A   A E G+ +H   +++G   +VYV   LI+MY  C   
Sbjct: 122 LEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDV 181

Query: 197 SDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSAC 256
             A  +FD + E   V +NA+ITGY    +  EA + FR M     +P+E T++ VLS+C
Sbjct: 182 DSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSC 241

Query: 257 AHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWN 316
           A LG+L+ GKWIH++   +     V V TALIDM+AKCG++D+A  +FE++  K+   W+
Sbjct: 242 ALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWS 301

Query: 317 VLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQ 376
            +I  YA +G+ + ++  F RM  E+ +PDE+TFLGLL AC H G V EGR+ F  M  +
Sbjct: 302 AMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSK 361

Query: 377 FGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEY 436
           FG+ P I+HYG MVDLL RAG LE+A + I  + + P P++WR LL AC  H N  L E 
Sbjct: 362 FGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEK 421

Query: 437 TIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVY 496
              R+ EL+ ++G +YV+LSNLY R ++W  V  LR +M  R   KVPGCSSIE+NNVV+
Sbjct: 422 VSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVH 481

Query: 497 EFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALY-DIEKEEKERSVVYHSEKLA 556
           EF + D  K     +++ LD ++++LK +GYV  T M ++ ++  +EKE ++ YHSEKLA
Sbjct: 482 EFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLA 541

Query: 557 LAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCR 616
           + FGLLN+P   T+R+VKNLR+C DCH   K++SL++ R +V+RD  RFHHF +G CSC 
Sbjct: 542 ITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCG 601

BLAST of CcUC06G125840 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 2.1e-122
Identity = 234/552 (42.39%), Postives = 330/552 (59.78%), Query Frame = 0

Query: 105 IDSQLCNSMIRRYLD--LNKHLHS-LFIFAHMHQFSILPDSSTFPCVLKATAKLCATELG 164
           ++S L N +IR  +    +   HS + ++  M    + PD  TFP +L +        LG
Sbjct: 22  LESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLG 81

Query: 165 KMIHGTVIQMGFIHDVYVSTALIHMYCSCLS----------------------------- 224
           +  H  ++  G   D +V T+L++MY SC                               
Sbjct: 82  QRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKA 141

Query: 225 --TSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGML-----AAGAEPSERT 284
               DA +LFDEMPERN ++W+ LI GY    K+ EA + FR M       A   P+E T
Sbjct: 142 GLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFT 201

Query: 285 VVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIW 344
           +  VLSAC  LGAL QGKW+H +I    + +++ +GTALIDMYAKCG+++ A++VF  + 
Sbjct: 202 MSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALG 261

Query: 345 -EKNVHTWNVLISGYAMNGQGDAALQAFSRMLMED-FEPDEVTFLGLLCACCHQGLVTEG 404
            +K+V  ++ +I   AM G  D   Q FS M   D   P+ VTF+G+L AC H+GL+ EG
Sbjct: 262 SKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEG 321

Query: 405 RRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACR 464
           +  F  M ++FG+ P I+HYGCMVDL GR+GL++EA   I SM MEPD +IW +LL   R
Sbjct: 322 KSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSR 381

Query: 465 VHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGC 524
           + G+ K  E  ++RLIEL+P N   YVLLSN+Y +  RW EV  +R  M ++GI KVPGC
Sbjct: 382 MLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGC 441

Query: 525 SSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERS 584
           S +E+  VV+EF   D+ + E E IY  LD ++++L+E GYV  T   L D+ +++KE +
Sbjct: 442 SYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIA 501

Query: 585 VVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHH 616
           + YHSEKLA+AF L+ +     +RI+KNLRIC DCH   K++S ++ R IVVRD NRFHH
Sbjct: 502 LSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHH 561

BLAST of CcUC06G125840 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 431.4 bits (1108), Expect = 1.7e-119
Identity = 229/580 (39.48%), Postives = 333/580 (57.41%), Query Frame = 0

Query: 70  GVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRG--SIDSQLCNSMIRRYLDLNKHLHSL 129
           G+ +  Y +TK +   +S    + +      F G    D+ L N MIR +   ++   SL
Sbjct: 41  GLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDTFLWNLMIRGFSCSDEPERSL 100

Query: 130 FIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYVSTALIHMY 189
            ++  M   S   ++ TFP +LKA + L A E    IH  + ++G+ +DVY   +LI+ Y
Sbjct: 101 LLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQITKLGYENDVYAVNSLINSY 160

Query: 190 CSCLSTSDASQLFDEMP-------------------------------ERNAVTWNALIT 249
               +   A  LFD +P                               E+NA++W  +I+
Sbjct: 161 AVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMIS 220

Query: 250 GYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRL 309
           GY       EA   F  M  +  EP   ++   LSACA LGAL QGKWIH ++   R+R+
Sbjct: 221 GYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRM 280

Query: 310 NVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRML 369
           +  +G  LIDMYAKCG ++EA +VF+ I +K+V  W  LISGYA +G G  A+  F  M 
Sbjct: 281 DSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQ 340

Query: 370 MEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLL 429
               +P+ +TF  +L AC + GLV EG+  F SM++ + L+P IEHYGC+VDLLGRAGLL
Sbjct: 341 KMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLL 400

Query: 430 EEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLY 489
           +EA + IQ M ++P+ +IW ALL ACR+H N +LGE     LI ++P +G  YV  +N++
Sbjct: 401 DEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIH 460

Query: 490 TREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLI 549
             +++W +  E R +M  +G+ KVPGCS+I +    +EF A D   PE E I  +   + 
Sbjct: 461 AMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFLAGDRSHPEIEKIQSKWRIMR 520

Query: 550 EKLKENGYVIRTDMALYD-IEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRIC 609
            KL+ENGYV   +  L D ++ +E+E  V  HSEKLA+ +GL+ +     +RI+KNLR+C
Sbjct: 521 RKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITYGLIKTKPGTIIRIMKNLRVC 580

Query: 610 LDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
            DCH+  K++S +YKR IV+RDR RFHHF +G CSC DYW
Sbjct: 581 KDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of CcUC06G125840 vs. ExPASy TrEMBL
Match: A0A0A0LUK8 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G011530 PE=3 SV=1)

HSP 1 Score: 1075.1 bits (2779), Expect = 0.0e+00
Identity = 532/616 (86.36%), Postives = 568/616 (92.21%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQQ-ICTISNSVFLESEMSKFVHTQAMDLPSQRTNERKIPD 60
           M+MY RLL FSYRII+RSR+QQ ICTISN  FLESEM KFVHTQAMDLP Q TN  KIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLD 120
           Y D          RRG+FL KLIDDSVS NGFESIARIF K+RGSI+SQ CNSMIR YLD
Sbjct: 61  YNDV---------RRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLD 120

Query: 121 LNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYV 180
           LNKHL+SL+IFA MH+FSILPDSSTFP VLKATA+LC T +GKMIHG VIQMGFI DVY 
Sbjct: 121 LNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYT 180

Query: 181 STALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGA 240
           STAL+H+YC+CLS SDASQLFDEMPERNAVTWNALITGYTHNRKF++A +AFRGMLA GA
Sbjct: 181 STALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGA 240

Query: 241 EPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEK 300
           +PSERTVVVVLSAC+HLGA NQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAV E EK
Sbjct: 241 QPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEK 300

Query: 301 VFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGL 360
           VFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRMLME+F+PDEVTFLG+LCACCHQGL
Sbjct: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGL 360

Query: 361 VTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALL 420
           VTEGR QFMSMKQQFGLQP+IEHYGCMVDLLGRAGLLEEAL+LIQSMS+EPDPIIWRALL
Sbjct: 361 VTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALL 420

Query: 421 CACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGK 480
           CACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+Y+RE+RWAEVG+LRGMM+LRGI K
Sbjct: 421 CACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRK 480

Query: 481 VPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEE 540
           VPGCSSIEINNVVYEF AS+DRKPEFEAIYKQLDNLI+KLKENGYV  TDMALYDIEKEE
Sbjct: 481 VPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEE 540

Query: 541 KERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRN 600
           KE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYIVVRDRN
Sbjct: 541 KEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRN 600

Query: 601 RFHHFSEGFCSCRDYW 616
           RFHHF EGFCSCRDYW
Sbjct: 601 RFHHFYEGFCSCRDYW 607

BLAST of CcUC06G125840 vs. ExPASy TrEMBL
Match: A0A6J1BZP4 (pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charantia OX=3673 GN=LOC111006955 PE=3 SV=1)

HSP 1 Score: 1052.4 bits (2720), Expect = 7.5e-304
Identity = 512/611 (83.80%), Postives = 558/611 (91.33%), Query Frame = 0

Query: 8   LSFSYRIIQRSRLQQICTISNSVFL--ESEMSKFVHTQ-AMDLPSQRTNERKIPDYEDAL 67
           +  S+R+I+R+RLQ ICTISNS FL  +S++SKF+HTQ  M+LP Q TNERKIPDY D +
Sbjct: 43  IEMSFRLIRRARLQDICTISNSAFLANQSQISKFMHTQLTMNLPPQSTNERKIPDYMDVV 102

Query: 68  HEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHL 127
            +E + +R  GYFL KLIDDSVSH+GFESIA IF KFRG I+ QLCN MIR YLD NKHL
Sbjct: 103 RKEGNDMRSDGYFLMKLIDDSVSHDGFESIAPIFSKFRGVINCQLCNWMIRGYLDSNKHL 162

Query: 128 HSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYVSTALI 187
           +SL IFAHMH+FSILPDSSTFP V+KATA+ C  ELGKMIHGTVIQMGFI DVY STAL+
Sbjct: 163 NSLLIFAHMHKFSILPDSSTFPAVIKATARSCNVELGKMIHGTVIQMGFIRDVYTSTALV 222

Query: 188 HMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSER 247
           HMYC+CLS SDA QLFDEMPERN+VTWNALITGYTHNRKFMEA NAFRGMLAAGAEPSER
Sbjct: 223 HMYCTCLSISDAYQLFDEMPERNSVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSER 282

Query: 248 TVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEI 307
           TVVVVLSAC+HLGALNQG WIHEFIY N+LRLNVFVGTALIDMYAKCGAV+EAEKVFEEI
Sbjct: 283 TVVVVLSACSHLGALNQGTWIHEFIYQNKLRLNVFVGTALIDMYAKCGAVEEAEKVFEEI 342

Query: 308 WEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGR 367
            EKNV+TWNVLISGYAMNGQGD ALQAFS ML E+F+PDEVTFLG+LCACCHQGLVTEGR
Sbjct: 343 REKNVYTWNVLISGYAMNGQGDEALQAFSMMLRENFKPDEVTFLGVLCACCHQGLVTEGR 402

Query: 368 RQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRV 427
           RQF+SMKQ FGL+P+IEHYGCMVDLLGRAGLLEEAL+LIQSMSMEPDPIIWRALLCACRV
Sbjct: 403 RQFVSMKQHFGLRPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPIIWRALLCACRV 462

Query: 428 HGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGCS 487
           HGNTKLGEY IRRLI+LEPNNGENYVLLSNLY+RE+RW EVG+LRGMMSLRGIGKVPGCS
Sbjct: 463 HGNTKLGEYAIRRLIDLEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIGKVPGCS 522

Query: 488 SIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSV 547
           SIEI NVVYEFAAS+DRKPEF+AIYKQLDN+IEKLK NGY+  T MAL+DIE+EEKE  V
Sbjct: 523 SIEIKNVVYEFAASNDRKPEFDAIYKQLDNVIEKLKANGYITGTGMALFDIEEEEKEHCV 582

Query: 548 VYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHF 607
           +YHSEKLALAFGLLNSPLDC LRIVKNLRICLDCHEFFKV SLVYKR+IVVRDRNRFHHF
Sbjct: 583 MYHSEKLALAFGLLNSPLDCALRIVKNLRICLDCHEFFKVASLVYKRFIVVRDRNRFHHF 642

Query: 608 SEGFCSCRDYW 616
           SEGFCSCRDYW
Sbjct: 643 SEGFCSCRDYW 653

BLAST of CcUC06G125840 vs. ExPASy TrEMBL
Match: A0A6J1EAY2 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata OX=3662 GN=LOC111432395 PE=3 SV=1)

HSP 1 Score: 1051.2 bits (2717), Expect = 1.7e-303
Identity = 514/617 (83.31%), Postives = 564/617 (91.41%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQQICTISNSVFL--ESEMSKFVHTQAMDLPSQRTNERKIP 60
           M+M  RLL FS+R+I+R+RLQ  CTISN  FL  +S++S+FVHT+ M+LPSQ   ERKIP
Sbjct: 1   MKMDLRLLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYL 120
           D  DA  +E + +R  GYFL KLI+DSVS+NGFESIA IF KFRGSI+SQ+CNSMIR YL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVY 180
           D N+HL+SL IFAHMH+FSILPDSSTFP VLKATA+LC  +LGKMIHG V+QMGFI DVY
Sbjct: 121 DSNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 VSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAG 240
            STAL+HMYCSCLS SDASQLFDEMPERN+VTWNALITGYTHNRKF EA NAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFREAINAFRGMLAAG 240

Query: 241 AEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERTVVVVLSAC+HLGALNQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQG 360
           KVFEEI ++NV+TWNVLISGY MNGQG+AALQ FSRMLME+F+PD VTFLGLLCACCHQG
Sbjct: 301 KVFEEIRDRNVYTWNVLISGYGMNGQGNAALQVFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRAL 420
           LVTEGRRQF+SMKQQFGLQPKIEHYGCMVDLLGRAGLLEEAL+LI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLY+RE+RW EVG+LRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKE 540
           KVPGCSSIEINN V+EF AS+DRK EF AIYKQLDN+++KLKENGYV  TDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNAVHEFTASNDRKREFSAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDR 600
           EKE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKVVSLVYKRYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of CcUC06G125840 vs. ExPASy TrEMBL
Match: A0A5A7US40 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G002740 PE=3 SV=1)

HSP 1 Score: 1041.6 bits (2692), Expect = 1.3e-300
Identity = 504/580 (86.90%), Postives = 541/580 (93.28%), Query Frame = 0

Query: 36  MSKFVHTQAMDLPSQRTNERKIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIA 95
           M KFVHTQAMDLP Q TN+RK PDY D          RRG+F+ KLIDDSVSHNGFESIA
Sbjct: 1   MLKFVHTQAMDLPFQETNDRKTPDYNDV---------RRGHFVMKLIDDSVSHNGFESIA 60

Query: 96  RIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKL 155
           RIF K+RGSI+SQ CNSMIRRYLDLNKHL+SL+IFA MH+FSILPD STFP VLKATA+L
Sbjct: 61  RIFSKYRGSINSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQL 120

Query: 156 CATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNALI 215
           C TE+GKMIHG VIQMGFI DVY STAL+HMY +CLS SDASQ+FDEM ERNAVTWNALI
Sbjct: 121 CDTEVGKMIHGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALI 180

Query: 216 TGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLR 275
           TGYTHNRKFMEA +AFRGMLAAGA+PSERTVV+VLSAC+HLGALNQGKWIH+FIYHNRLR
Sbjct: 181 TGYTHNRKFMEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLR 240

Query: 276 LNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRM 335
           LNVFVGTALIDMYAKCGAVDE EKVFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRM
Sbjct: 241 LNVFVGTALIDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRM 300

Query: 336 LMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGL 395
           LME+F+PDEVTFLG+LCACCHQGLVTEGRRQFMSMKQQFGLQP+IEHYGCMVDLLGRAGL
Sbjct: 301 LMENFKPDEVTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGL 360

Query: 396 LEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNL 455
           LEEAL+LIQSMSMEPDPIIWRALLCACRVHGNTKLGEY ++RL+ELEPNNGENYVLLSN+
Sbjct: 361 LEEALELIQSMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNI 420

Query: 456 YTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNL 515
           Y RE+RWAEVG+LRGMM+LRGI KVPGCSSIEINNVVYEF AS+DRKPE+EAIYKQLDNL
Sbjct: 421 YARERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNL 480

Query: 516 IEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRIC 575
           I+KLKENGYV  TDMALYD+EKEEKE S++YHSEKLALAFGLLNSPLDCTLRIVKNLRIC
Sbjct: 481 IKKLKENGYVTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRIC 540

Query: 576 LDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           LDCHEFFKVVSLVYKRYIVVRDRNRFHHF EGFCSCRDYW
Sbjct: 541 LDCHEFFKVVSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 571

BLAST of CcUC06G125840 vs. ExPASy TrEMBL
Match: A0A1S4DZH3 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103494017 PE=3 SV=1)

HSP 1 Score: 1026.2 bits (2652), Expect = 5.8e-296
Identity = 496/571 (86.87%), Postives = 533/571 (93.35%), Query Frame = 0

Query: 45  MDLPSQRTNERKIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGS 104
           MDLP Q TN+RK PDY D          RRG+F+ KLIDDSVSHNGFESIARIF K+RGS
Sbjct: 1   MDLPFQETNDRKTPDYNDV---------RRGHFVMKLIDDSVSHNGFESIARIFSKYRGS 60

Query: 105 IDSQLCNSMIRRYLDLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKATAKLCATELGKMI 164
           I+SQ CNSMIRRYLDLNKHL+SL+IFA MH+FSILPD STFP VLKATA+LC TE+GKMI
Sbjct: 61  INSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMI 120

Query: 165 HGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYTHNRKF 224
           HG VIQMGFI DVY STAL+HMY +CLS SDASQ+FDEM ERNAVTWNALITGYTHNRKF
Sbjct: 121 HGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKF 180

Query: 225 MEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTAL 284
           MEA +AFRGMLAAGA+PSERTVV+VLSAC+HLGALNQGKWIH+FIYHNRLRLNVFVGTAL
Sbjct: 181 MEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTAL 240

Query: 285 IDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDE 344
           IDMYAKCGAVDE EKVFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRMLME+F+PDE
Sbjct: 241 IDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 300

Query: 345 VTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQ 404
           VTFLG+LCACCHQGLVTEGRRQFMSMKQQFGLQP+IEHYGCMVDLLGRAGLLEEAL+LIQ
Sbjct: 301 VTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 360

Query: 405 SMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAE 464
           SMSMEPDPIIWRALLCACRVHGNTKLGEY ++RL+ELEPNNGENYVLLSN+Y RE+RWAE
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAE 420

Query: 465 VGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGY 524
           VG+LRGMM+LRGI KVPGCSSIEINNVVYEF AS+DRKPE+EAIYKQLDNLI+KLKENGY
Sbjct: 421 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGY 480

Query: 525 VIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 584
           V  TDMALYD+EKEEKE S++YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV
Sbjct: 481 VTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 540

Query: 585 VSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VSLVYKRYIVVRDRNRFHHF EGFCSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 562

BLAST of CcUC06G125840 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 467.2 bits (1201), Expect = 2.0e-131
Identity = 245/587 (41.74%), Postives = 359/587 (61.16%), Query Frame = 0

Query: 32  LESEMSKFVHTQAMDLPSQRTNERKIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGF 91
           L  ++  +VHT  + +  Q  N R     EDA         R     T LI    S    
Sbjct: 163 LGCDLDLYVHTSLISMYVQ--NGR----LEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 222

Query: 92  ESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHQFSILPDSSTFPCVLKA 151
           E+  ++F +     D    N+MI  Y +   +  +L +F  M + ++ PD ST   V+ A
Sbjct: 223 ENAQKLFDEIPVK-DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 282

Query: 152 TAKLCATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTW 211
            A+  + ELG+ +H  +   GF  ++ +  ALI +Y  C     A  LF+ +P ++ ++W
Sbjct: 283 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 342

Query: 212 NALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYH 271
           N LI GYTH   + EA   F+ ML +G  P++ T++ +L ACAHLGA++ G+WIH +I  
Sbjct: 343 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI-D 402

Query: 272 NRLR--LNV-FVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAA 331
            RL+   N   + T+LIDMYAKCG ++ A +VF  I  K++ +WN +I G+AM+G+ DA+
Sbjct: 403 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 462

Query: 332 LQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVD 391
              FSRM     +PD++TF+GLL AC H G++  GR  F +M Q + + PK+EHYGCM+D
Sbjct: 463 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 522

Query: 392 LLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGEN 451
           LLG +GL +EA ++I  M MEPD +IW +LL AC++HGN +LGE     LI++EP N  +
Sbjct: 523 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 582

Query: 452 YVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAI 511
           YVLLSN+Y    RW EV + R +++ +G+ KVPGCSSIEI++VV+EF   D   P    I
Sbjct: 583 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 642

Query: 512 YKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRI 571
           Y  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKLA+AFGL+++     L I
Sbjct: 643 YGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTI 702

Query: 572 VKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VKNLR+C +CHE  K++S +YKR I+ RDR RFHHF +G CSC DYW
Sbjct: 703 VKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CcUC06G125840 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 466.8 bits (1200), Expect = 2.6e-131
Identity = 223/522 (42.72%), Postives = 340/522 (65.13%), Query Frame = 0

Query: 96  RIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHQFSIL-PDSSTFPCVLKATAK 155
           ++F K    I+  + N++IR Y ++   + +  ++  M    ++ PD+ T+P ++KA   
Sbjct: 74  KVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTT 133

Query: 156 LCATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNAL 215
           +    LG+ IH  VI+ GF   +YV  +L+H+Y +C   + A ++FD+MPE++ V WN++
Sbjct: 134 MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 193

Query: 216 ITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRL 275
           I G+  N K  EA   +  M + G +P   T+V +LSACA +GAL  GK +H ++    L
Sbjct: 194 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 253

Query: 276 RLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSR 335
             N+     L+D+YA+CG V+EA+ +F+E+ +KN  +W  LI G A+NG G  A++ F  
Sbjct: 254 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 313

Query: 336 M-LMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRA 395
           M   E   P E+TF+G+L AC H G+V EG   F  M++++ ++P+IEH+GCMVDLL RA
Sbjct: 314 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 373

Query: 396 GLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLS 455
           G +++A + I+SM M+P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLS
Sbjct: 374 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 433

Query: 456 NLYTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLD 515
           N+Y  EQRW++V ++R  M   G+ KVPG S +E+ N V+EF   D   P+ +AIY +L 
Sbjct: 434 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 493

Query: 516 NLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLR 575
            +  +L+  GYV +      D+E+EEKE +VVYHSEK+A+AF L+++P    + +VKNLR
Sbjct: 494 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 553

Query: 576 ICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           +C DCH   K+VS VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 554 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CcUC06G125840 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 461.1 bits (1185), Expect = 1.4e-129
Identity = 237/543 (43.65%), Postives = 340/543 (62.62%), Query Frame = 0

Query: 77  FLTKLID---DSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHM 136
           F+ KLI+   +S + +   S AR   +     D  + NSM R Y      L    +F  +
Sbjct: 62  FVAKLINFCTESPTESSM-SYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEI 121

Query: 137 HQFSILPDSSTFPCVLKATAKLCATELGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLST 196
            +  ILPD+ TFP +LKA A   A E G+ +H   +++G   +VYV   LI+MY  C   
Sbjct: 122 LEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDV 181

Query: 197 SDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSAC 256
             A  +FD + E   V +NA+ITGY    +  EA + FR M     +P+E T++ VLS+C
Sbjct: 182 DSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSC 241

Query: 257 AHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWN 316
           A LG+L+ GKWIH++   +     V V TALIDM+AKCG++D+A  +FE++  K+   W+
Sbjct: 242 ALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWS 301

Query: 317 VLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQ 376
            +I  YA +G+ + ++  F RM  E+ +PDE+TFLGLL AC H G V EGR+ F  M  +
Sbjct: 302 AMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSK 361

Query: 377 FGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEY 436
           FG+ P I+HYG MVDLL RAG LE+A + I  + + P P++WR LL AC  H N  L E 
Sbjct: 362 FGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEK 421

Query: 437 TIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVY 496
              R+ EL+ ++G +YV+LSNLY R ++W  V  LR +M  R   KVPGCSSIE+NNVV+
Sbjct: 422 VSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVH 481

Query: 497 EFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALY-DIEKEEKERSVVYHSEKLA 556
           EF + D  K     +++ LD ++++LK +GYV  T M ++ ++  +EKE ++ YHSEKLA
Sbjct: 482 EFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLA 541

Query: 557 LAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCR 616
           + FGLLN+P   T+R+VKNLR+C DCH   K++SL++ R +V+RD  RFHHF +G CSC 
Sbjct: 542 ITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCG 601

BLAST of CcUC06G125840 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 441.0 bits (1133), Expect = 1.5e-123
Identity = 234/552 (42.39%), Postives = 330/552 (59.78%), Query Frame = 0

Query: 105 IDSQLCNSMIRRYLD--LNKHLHS-LFIFAHMHQFSILPDSSTFPCVLKATAKLCATELG 164
           ++S L N +IR  +    +   HS + ++  M    + PD  TFP +L +        LG
Sbjct: 22  LESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLG 81

Query: 165 KMIHGTVIQMGFIHDVYVSTALIHMYCSCLS----------------------------- 224
           +  H  ++  G   D +V T+L++MY SC                               
Sbjct: 82  QRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKA 141

Query: 225 --TSDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGML-----AAGAEPSERT 284
               DA +LFDEMPERN ++W+ LI GY    K+ EA + FR M       A   P+E T
Sbjct: 142 GLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFT 201

Query: 285 VVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIW 344
           +  VLSAC  LGAL QGKW+H +I    + +++ +GTALIDMYAKCG+++ A++VF  + 
Sbjct: 202 MSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALG 261

Query: 345 -EKNVHTWNVLISGYAMNGQGDAALQAFSRMLMED-FEPDEVTFLGLLCACCHQGLVTEG 404
            +K+V  ++ +I   AM G  D   Q FS M   D   P+ VTF+G+L AC H+GL+ EG
Sbjct: 262 SKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEG 321

Query: 405 RRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACR 464
           +  F  M ++FG+ P I+HYGCMVDL GR+GL++EA   I SM MEPD +IW +LL   R
Sbjct: 322 KSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSR 381

Query: 465 VHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTREQRWAEVGELRGMMSLRGIGKVPGC 524
           + G+ K  E  ++RLIEL+P N   YVLLSN+Y +  RW EV  +R  M ++GI KVPGC
Sbjct: 382 MLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGC 441

Query: 525 SSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERS 584
           S +E+  VV+EF   D+ + E E IY  LD ++++L+E GYV  T   L D+ +++KE +
Sbjct: 442 SYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIA 501

Query: 585 VVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHH 616
           + YHSEKLA+AF L+ +     +RI+KNLRIC DCH   K++S ++ R IVVRD NRFHH
Sbjct: 502 LSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHH 561

BLAST of CcUC06G125840 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 440.7 bits (1132), Expect = 2.0e-123
Identity = 209/457 (45.73%), Postives = 307/457 (67.18%), Query Frame = 0

Query: 160 LGKMIHGTVIQMGFIHDVYVSTALIHMYCSCLSTSDASQLFDEMPERNAVTWNALITGYT 219
           LG+ IH  VI+ GF   +YV  +L+H+Y +C   + A ++FD+MPE++ V WN++I G+ 
Sbjct: 6   LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 65

Query: 220 HNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVF 279
            N K  EA   +  M + G +P   T+V +LSACA +GAL  GK +H ++    L  N+ 
Sbjct: 66  ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 125

Query: 280 VGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRM-LME 339
               L+D+YA+CG V+EA+ +F+E+ +KN  +W  LI G A+NG G  A++ F  M   E
Sbjct: 126 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 185

Query: 340 DFEPDEVTFLGLLCACCHQGLVTEGRRQFMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEE 399
              P E+TF+G+L AC H G+V EG   F  M++++ ++P+IEH+GCMVDLL RAG +++
Sbjct: 186 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 245

Query: 400 ALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYTR 459
           A + I+SM M+P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLSN+Y  
Sbjct: 246 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 305

Query: 460 EQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYEFAASDDRKPEFEAIYKQLDNLIEK 519
           EQRW++V ++R  M   G+ KVPG S +E+ N V+EF   D   P+ +AIY +L  +  +
Sbjct: 306 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 365

Query: 520 LKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDC 579
           L+  GYV +      D+E+EEKE +VVYHSEK+A+AF L+++P    + +VKNLR+C DC
Sbjct: 366 LRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADC 425

Query: 580 HEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           H   K+VS VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 426 HLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 462

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038878567.10.0e+0090.24pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida][more]
XP_004138309.20.0e+0086.36pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN6370... [more]
XP_023529316.11.8e-30483.79pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp... [more]
XP_022134759.11.6e-30383.80pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia][more]
XP_022925029.13.5e-30383.31pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Q9LN012.8e-13041.74Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
A8MQA33.7e-13042.72Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q8LK932.0e-12843.65Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q683I92.1e-12242.39Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX... [more]
Q9FJY71.7e-11939.48Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0LUK80.0e+0086.36DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G0115... [more]
A0A6J1BZP47.5e-30483.80pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charanti... [more]
A0A6J1EAY21.7e-30383.31pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata... [more]
A0A5A7US401.3e-30086.90Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4DZH35.8e-29686.87pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36... [more]
Match NameE-valueIdentityDescription
AT1G08070.12.0e-13141.74Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.12.6e-13142.72Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G02980.11.4e-12943.65Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G62890.11.5e-12342.39Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G21065.22.0e-12345.73Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 502..522
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 81..589
NoneNo IPR availablePANTHERPTHR47928:SF133SUBFAMILY NOT NAMEDcoord: 81..589
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 382..407
e-value: 0.0037
score: 17.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 383..407
e-value: 0.0022
score: 16.1
coord: 209..242
e-value: 3.8E-5
score: 21.6
coord: 310..344
e-value: 4.4E-8
score: 30.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 307..356
e-value: 1.5E-11
score: 44.3
coord: 207..254
e-value: 3.2E-9
score: 36.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 207..241
score: 10.98328
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 308..342
score: 11.783455
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 277..307
score: 8.889672
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 268..330
e-value: 3.1E-10
score: 41.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 331..500
e-value: 4.4E-29
score: 103.8
coord: 106..267
e-value: 1.2E-27
score: 99.1
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 288..464
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 481..604
e-value: 2.3E-36
score: 124.5

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CcUC06G125840.1CcUC06G125840.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding