CaUC06G122530 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC06G122530
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCiama_Chr06: 27588070 .. 27589917 (-)
RNA-Seq ExpressionCaUC06G122530
SyntenyCaUC06G122530
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATGTATTTCCGGCTTCTGTCTTTCTCATATAGAATAATTCAAAGGTCTCGGCTCCAACGAATTTGTACAAACTCGAACTCGGTTTTACTAGAATCAGAAATGTCGAAATTTGTACATACCCAAGCGATGGATCTTCCTTCTCAGAGAACTAACGAGAGAAGGATTCCTGATTACGAGGACGCCCTCCATGAGGAAGCCGATGGCGTGCGGAGGCGCGGGTATTTCCTCACGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCGTATTTTCCCCAAGTTTCGTGGTTCTATTGATTCTCAGCTGTGTAACTCGATGATTAGGCGTTATTTGGATTTGAATAAGCATTTACATTCACTCTTCATTTTTGCCCACATGCATAAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTTGTGTTCTTAAAGCAACTGCAAAGCTATGTGCTAGTGAACTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATTCGTGATGTCTACATAAGTACCGCTCTTGTTCATATGTACTGTTCTTGTTTGTCTATATCCGATGCTTCTCAGTTGTTCGACGAAATGCCTGAGAGAAATGCAGTTACTTGGAATGCTCTGATTACTGGTTATACTCATAATAGAAAGTTTATGGAAGCTACCAATGCTTTCAGAGGAATGTTGGCAGCTGGGGCTGAACCGAGTGAGAGAACCGTGGTGGTAGTTCTATCAGCTTGTGCTCATTTGGGAGCGTTGAATCAGGGAAAGTGGATCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCAGAGAAGGTCTTTGAAGAAATTTGGGAGAAGAATGTCCATACGTGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAGATTTCGAGCCAGATGAGGTTACCTTTCTAGGTCTCTTGTGTGCATGCTGTCACCAAGGTCTGGTCACAGAAGGGCGCAGGCAATACATGAGCATGAAACAACAGTTTGGACTGCAACCAAAGATCGAGCATTATGGGTGTATGGTCGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTACAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAATTAGAACCAAACAATGGCGAGAATTATGTCTTGCTGTCAAATCTGTACTCAAGGGAACAACGGTGGGCTGAAGTAGGGGAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGGAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTAGCAGCATCAGATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGATTGAAAAATTGAAAGAAAATGGTTACGTTATACGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACGTTCTGTGGTGTACCATAGCGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAGTTTTTCAAAGTTGTATCACTTGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTTCACCATTTTTCCGAAGGTTTCTGTTCGTGTCGCGACTATTGGTGA

mRNA sequence

ATGGAAATGTATTTCCGGCTTCTGTCTTTCTCATATAGAATAATTCAAAGGTCTCGGCTCCAACGAATTTGTACAAACTCGAACTCGGTTTTACTAGAATCAGAAATGTCGAAATTTGTACATACCCAAGCGATGGATCTTCCTTCTCAGAGAACTAACGAGAGAAGGATTCCTGATTACGAGGACGCCCTCCATGAGGAAGCCGATGGCGTGCGGAGGCGCGGGTATTTCCTCACGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCGTATTTTCCCCAAGTTTCGTGGTTCTATTGATTCTCAGCTGTGTAACTCGATGATTAGGCGTTATTTGGATTTGAATAAGCATTTACATTCACTCTTCATTTTTGCCCACATGCATAAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTTGTGTTCTTAAAGCAACTGCAAAGCTATGTGCTAGTGAACTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATTCGTGATGTCTACATAAGTACCGCTCTTGTTCATATGTACTGTTCTTGTTTGTCTATATCCGATGCTTCTCAGTTGTTCGACGAAATGCCTGAGAGAAATGCAGTTACTTGGAATGCTCTGATTACTGGTTATACTCATAATAGAAAGTTTATGGAAGCTACCAATGCTTTCAGAGGAATGTTGGCAGCTGGGGCTGAACCGAGTGAGAGAACCGTGGTGGTAGTTCTATCAGCTTGTGCTCATTTGGGAGCGTTGAATCAGGGAAAGTGGATCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCAGAGAAGGTCTTTGAAGAAATTTGGGAGAAGAATGTCCATACGTGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAGATTTCGAGCCAGATGAGGTTACCTTTCTAGGTCTCTTGTGTGCATGCTGTCACCAAGGTCTGGTCACAGAAGGGCGCAGGCAATACATGAGCATGAAACAACAGTTTGGACTGCAACCAAAGATCGAGCATTATGGGTGTATGGTCGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTACAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAATTAGAACCAAACAATGGCGAGAATTATGTCTTGCTGTCAAATCTGTACTCAAGGGAACAACGGTGGGCTGAAGTAGGGGAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGGAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTAGCAGCATCAGATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGATTGAAAAATTGAAAGAAAATGGTTACGTTATACGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACGTTCTGTGGTGTACCATAGCGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAGTTTTTCAAAGTTGTATCACTTGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTTCACCATTTTTCCGAAGGTTTCTGTTCGTGTCGCGACTATTGGTGA

Coding sequence (CDS)

ATGGAAATGTATTTCCGGCTTCTGTCTTTCTCATATAGAATAATTCAAAGGTCTCGGCTCCAACGAATTTGTACAAACTCGAACTCGGTTTTACTAGAATCAGAAATGTCGAAATTTGTACATACCCAAGCGATGGATCTTCCTTCTCAGAGAACTAACGAGAGAAGGATTCCTGATTACGAGGACGCCCTCCATGAGGAAGCCGATGGCGTGCGGAGGCGCGGGTATTTCCTCACGAAACTCATAGACGACTCTGTTTCGCATAATGGGTTCGAATCTATTGCTCGTATTTTCCCCAAGTTTCGTGGTTCTATTGATTCTCAGCTGTGTAACTCGATGATTAGGCGTTATTTGGATTTGAATAAGCATTTACATTCACTCTTCATTTTTGCCCACATGCATAAATTCAGTATTCTGCCCGATTCCTCCACTTTTCCTTGTGTTCTTAAAGCAACTGCAAAGCTATGTGCTAGTGAACTTGGAAAAATGATACATGGTACTGTTATTCAGATGGGTTTTATTCGTGATGTCTACATAAGTACCGCTCTTGTTCATATGTACTGTTCTTGTTTGTCTATATCCGATGCTTCTCAGTTGTTCGACGAAATGCCTGAGAGAAATGCAGTTACTTGGAATGCTCTGATTACTGGTTATACTCATAATAGAAAGTTTATGGAAGCTACCAATGCTTTCAGAGGAATGTTGGCAGCTGGGGCTGAACCGAGTGAGAGAACCGTGGTGGTAGTTCTATCAGCTTGTGCTCATTTGGGAGCGTTGAATCAGGGAAAGTGGATCCATGAGTTTATATATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCACTTATTGATATGTATGCTAAATGTGGGGCTGTTGATGAGGCAGAGAAGGTCTTTGAAGAAATTTGGGAGAAGAATGTCCATACGTGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAGATTTCGAGCCAGATGAGGTTACCTTTCTAGGTCTCTTGTGTGCATGCTGTCACCAAGGTCTGGTCACAGAAGGGCGCAGGCAATACATGAGCATGAAACAACAGTTTGGACTGCAACCAAAGATCGAGCATTATGGGTGTATGGTCGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTACAGTTAATCCAATCCATGAGCATGGAGCCAGACCCTATCATTTGGAGGGCTCTGCTTTGTGCTTGCAGAGTCCATGGGAATACAAAATTGGGTGAATATACTATCAGAAGACTTATAGAATTAGAACCAAACAATGGCGAGAATTATGTCTTGCTGTCAAATCTGTACTCAAGGGAACAACGGTGGGCTGAAGTAGGGGAGTTGAGAGGAATGATGAGTCTCAGGGGGATTGGGAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAATGTAGTTTATGAGTTAGCAGCATCAGATGACAGAAAACCAGAATTTGAAGCAATATACAAGCAGTTGGATAATTTGATTGAAAAATTGAAAGAAAATGGTTACGTTATACGCACTGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAACGTTCTGTGGTGTACCATAGCGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTGAGAATTTGCTTGGACTGCCATGAGTTTTTCAAAGTTGTATCACTTGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTTCACCATTTTTCCGAAGGTTTCTGTTCGTGTCGCGACTATTGGTGA

Protein sequence

MEMYFRLLSFSYRIIQRSRLQRICTNSNSVLLESEMSKFVHTQAMDLPSQRTNERRIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW
Homology
BLAST of CaUC06G122530 vs. NCBI nr
Match: XP_038878567.1 (pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida])

HSP 1 Score: 1130.9 bits (2924), Expect = 0.0e+00
Identity = 553/615 (89.92%), Postives = 583/615 (94.80%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQRICTNSNSVLLESEMSKFVHTQAMDLPSQRTNERRIPDY 60
           M+MYFRLL  S  IIQRSRLQ ICT  NSV+LESEMSKFVHTQAMDLP  RTNER+IPDY
Sbjct: 1   MKMYFRLLPLSCGIIQRSRLQEICTILNSVILESEMSKFVHTQAMDLPPPRTNERKIPDY 60

Query: 61  EDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDL 120
           +DALH+E + VRR GYFL KLIDDSVSHNGFESIA IF KFR SI+SQLCNSMIR YLDL
Sbjct: 61  KDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLDL 120

Query: 121 NKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYIS 180
           NKHL+SL+IFAHMHKFSILPDSSTFP VLKATA+LC +E+GKMIHGTVIQMGFI DVY S
Sbjct: 121 NKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYTS 180

Query: 181 TALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAE 240
           TALVHMYC+CLSISDAS++FDEMPERNAVTWNALITGYTHNRKFMEA NAFRGMLAAGAE
Sbjct: 181 TALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGAE 240

Query: 241 PSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKV 300
           PSERT+VVVLSAC+HLGALNQGKW+HEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKV
Sbjct: 241 PSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKV 300

Query: 301 FEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLV 360
           FEEI EKNV+TWNVLISGYAMNGQGDAAL AFSRMLME+F+PDEVTFLG+LCACCHQGLV
Sbjct: 301 FEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGLV 360

Query: 361 TEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLC 420
           TEGRRQ+MSMKQ FGLQPKIEHYGCMVDLLGRAG L+EAL+LIQSMSMEPDPIIWRALLC
Sbjct: 361 TEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALLC 420

Query: 421 ACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKV 480
           ACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVG+LRGMMSLRGIGKV
Sbjct: 421 ACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGKV 480

Query: 481 PGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEK 540
           PGCSSIEINNVVYE AAS+DRKPEFEAIYKQLDNL EKLKENGYV  TDMALYDIEKEEK
Sbjct: 481 PGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEEK 540

Query: 541 ERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNR 600
           E SV+YHSEKLALAFGLLNSPL CTLRIVKNLRICLDCHEFFKVVS+VY+RYIVVRDRNR
Sbjct: 541 EHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRNR 600

Query: 601 FHHFSEGFCSCRDYW 616
           FHHFSEGFCSCRDYW
Sbjct: 601 FHHFSEGFCSCRDYW 615

BLAST of CaUC06G122530 vs. NCBI nr
Match: XP_004138309.2 (pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN63701.1 hypothetical protein Csa_014271 [Cucumis sativus])

HSP 1 Score: 1066.6 bits (2757), Expect = 7.9e-308
Identity = 529/616 (85.88%), Postives = 566/616 (91.88%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQR-ICTNSNSVLLESEMSKFVHTQAMDLPSQRTNERRIPD 60
           M+MY RLL FSYRII+RSR+Q+ ICT SN   LESEM KFVHTQAMDLP Q TN  +IPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLD 120
           Y D          RRG+FL KLIDDSVS NGFESIARIF K+RGSI+SQ CNSMIR YLD
Sbjct: 61  YNDV---------RRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLD 120

Query: 121 LNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYI 180
           LNKHL+SL+IFA MHKFSILPDSSTFP VLKATA+LC + +GKMIHG VIQMGFI DVY 
Sbjct: 121 LNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYT 180

Query: 181 STALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGA 240
           STALVH+YC+CLSISDASQLFDEMPERNAVTWNALITGYTHNRKF++A +AFRGMLA GA
Sbjct: 181 STALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGA 240

Query: 241 EPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEK 300
           +PSERTVVVVLSAC+HLGA NQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAV E EK
Sbjct: 241 QPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEK 300

Query: 301 VFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGL 360
           VFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRMLME+F+PDEVTFLG+LCACCHQGL
Sbjct: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGL 360

Query: 361 VTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALL 420
           VTEGR Q+MSMKQQFGLQP+IEHYGCMVDLLGRAGLLEEAL+LIQSMS+EPDPIIWRALL
Sbjct: 361 VTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALL 420

Query: 421 CACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGK 480
           CACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRE+RWAEVG+LRGMM+LRGI K
Sbjct: 421 CACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRK 480

Query: 481 VPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEE 540
           VPGCSSIEINNVVYE  AS+DRKPEFEAIYKQLDNLI+KLKENGYV  TDMALYDIEKEE
Sbjct: 481 VPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEE 540

Query: 541 KERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRN 600
           KE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYIVVRDRN
Sbjct: 541 KEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRN 600

Query: 601 RFHHFSEGFCSCRDYW 616
           RFHHF EGFCSCRDYW
Sbjct: 601 RFHHFYEGFCSCRDYW 607

BLAST of CaUC06G122530 vs. NCBI nr
Match: XP_023529316.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1050.8 bits (2716), Expect = 4.5e-303
Identity = 517/617 (83.79%), Postives = 564/617 (91.41%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQRICTNSNSVLL--ESEMSKFVHTQAMDLPSQRTNERRIP 60
           M+M  R L FS+R+I+R+RLQ  CT SN   L  +S++S+FVHT+ M+LPSQ   ER+IP
Sbjct: 1   MKMDLRFLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYL 120
           D  DA  +E + +R  GYFL KLI+DSVS+NGFESIA IF KFRGSI+SQ+CNSMIR YL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVY 180
           DLN+HL+SL IFAHMHKFSILPDSSTFP VLKATA+LC  +LGKMIHG V+QMGFIRDVY
Sbjct: 121 DLNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 ISTALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAG 240
            STALVHMYCSCLSISDASQLFDEMPERN+VTWNALITGYTHNRKF EA NAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFKEAINAFRGMLAAG 240

Query: 241 AEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERTVVVVLSAC+HLGALNQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQG 360
           KVFEEI +KNV+TWNVLISGY MNGQGDAALQAFSRMLME+F+PD VTFLGLLCACCHQG
Sbjct: 301 KVFEEIRDKNVYTWNVLISGYGMNGQGDAALQAFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRAL 420
           LVTEGRRQ++SMKQQFGLQPKIEHYGCMVDLLGRAGLLEEAL+LI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRE+RW EVG+LRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKE 540
           KVPGCSSIEINN V+E  AS+DRK EF AIYKQLDN+++KLKENGYV  TDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNSVHEFTASNDRKLEFNAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDR 600
           EKE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKVVSLVYKRYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of CaUC06G122530 vs. NCBI nr
Match: XP_022134759.1 (pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia])

HSP 1 Score: 1047.3 bits (2707), Expect = 5.0e-302
Identity = 512/611 (83.80%), Postives = 557/611 (91.16%), Query Frame = 0

Query: 8   LSFSYRIIQRSRLQRICTNSNSVLL--ESEMSKFVHTQ-AMDLPSQRTNERRIPDYEDAL 67
           +  S+R+I+R+RLQ ICT SNS  L  +S++SKF+HTQ  M+LP Q TNER+IPDY D +
Sbjct: 43  IEMSFRLIRRARLQDICTISNSAFLANQSQISKFMHTQLTMNLPPQSTNERKIPDYMDVV 102

Query: 68  HEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHL 127
            +E + +R  GYFL KLIDDSVSH+GFESIA IF KFRG I+ QLCN MIR YLD NKHL
Sbjct: 103 RKEGNDMRSDGYFLMKLIDDSVSHDGFESIAPIFSKFRGVINCQLCNWMIRGYLDSNKHL 162

Query: 128 HSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYISTALV 187
           +SL IFAHMHKFSILPDSSTFP V+KATA+ C  ELGKMIHGTVIQMGFIRDVY STALV
Sbjct: 163 NSLLIFAHMHKFSILPDSSTFPAVIKATARSCNVELGKMIHGTVIQMGFIRDVYTSTALV 222

Query: 188 HMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSER 247
           HMYC+CLSISDA QLFDEMPERN+VTWNALITGYTHNRKFMEA NAFRGMLAAGAEPSER
Sbjct: 223 HMYCTCLSISDAYQLFDEMPERNSVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSER 282

Query: 248 TVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEI 307
           TVVVVLSAC+HLGALNQG WIHEFIY N+LRLNVFVGTALIDMYAKCGAV+EAEKVFEEI
Sbjct: 283 TVVVVLSACSHLGALNQGTWIHEFIYQNKLRLNVFVGTALIDMYAKCGAVEEAEKVFEEI 342

Query: 308 WEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGR 367
            EKNV+TWNVLISGYAMNGQGD ALQAFS ML E+F+PDEVTFLG+LCACCHQGLVTEGR
Sbjct: 343 REKNVYTWNVLISGYAMNGQGDEALQAFSMMLRENFKPDEVTFLGVLCACCHQGLVTEGR 402

Query: 368 RQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRV 427
           RQ++SMKQ FGL+P+IEHYGCMVDLLGRAGLLEEAL+LIQSMSMEPDPIIWRALLCACRV
Sbjct: 403 RQFVSMKQHFGLRPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPIIWRALLCACRV 462

Query: 428 HGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCS 487
           HGNTKLGEY IRRLI+LEPNNGENYVLLSNLYSRE+RW EVG+LRGMMSLRGIGKVPGCS
Sbjct: 463 HGNTKLGEYAIRRLIDLEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIGKVPGCS 522

Query: 488 SIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSV 547
           SIEI NVVYE AAS+DRKPEF+AIYKQLDN+IEKLK NGY+  T MAL+DIE+EEKE  V
Sbjct: 523 SIEIKNVVYEFAASNDRKPEFDAIYKQLDNVIEKLKANGYITGTGMALFDIEEEEKEHCV 582

Query: 548 VYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHF 607
           +YHSEKLALAFGLLNSPLDC LRIVKNLRICLDCHEFFKV SLVYKR+IVVRDRNRFHHF
Sbjct: 583 MYHSEKLALAFGLLNSPLDCALRIVKNLRICLDCHEFFKVASLVYKRFIVVRDRNRFHHF 642

Query: 608 SEGFCSCRDYW 616
           SEGFCSCRDYW
Sbjct: 643 SEGFCSCRDYW 653

BLAST of CaUC06G122530 vs. NCBI nr
Match: XP_022925029.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata])

HSP 1 Score: 1046.6 bits (2705), Expect = 8.5e-302
Identity = 514/617 (83.31%), Postives = 563/617 (91.25%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQRICTNSNSVLL--ESEMSKFVHTQAMDLPSQRTNERRIP 60
           M+M  RLL FS+R+I+R+RLQ  CT SN   L  +S++S+FVHT+ M+LPSQ   ER+IP
Sbjct: 1   MKMDLRLLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYL 120
           D  DA  +E + +R  GYFL KLI+DSVS+NGFESIA IF KFRGSI+SQ+CNSMIR YL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVY 180
           D N+HL+SL IFAHMHKFSILPDSSTFP VLKATA+LC  +LGKMIHG V+QMGFIRDVY
Sbjct: 121 DSNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 ISTALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAG 240
            STALVHMYCSCLSISDASQLFDEMPERN+VTWNALITGYTHNRKF EA NAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFREAINAFRGMLAAG 240

Query: 241 AEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERTVVVVLSAC+HLGALNQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQG 360
           KVFEEI ++NV+TWNVLISGY MNGQG+AALQ FSRMLME+F+PD VTFLGLLCACCHQG
Sbjct: 301 KVFEEIRDRNVYTWNVLISGYGMNGQGNAALQVFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRAL 420
           LVTEGRRQ++SMKQQFGLQPKIEHYGCMVDLLGRAGLLEEAL+LI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRE+RW EVG+LRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKE 540
           KVPGCSSIEINN V+E  AS+DRK EF AIYKQLDN+++KLKENGYV  TDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNAVHEFTASNDRKREFSAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDR 600
           EKE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKVVSLVYKRYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of CaUC06G122530 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 466.5 bits (1199), Expect = 4.8e-130
Identity = 244/587 (41.57%), Postives = 360/587 (61.33%), Query Frame = 0

Query: 32  LESEMSKFVHTQAMDLPSQRTNERRIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGF 91
           L  ++  +VHT  + +  Q  N R     EDA         R     T LI    S    
Sbjct: 163 LGCDLDLYVHTSLISMYVQ--NGR----LEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 222

Query: 92  ESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKA 151
           E+  ++F +     D    N+MI  Y +   +  +L +F  M K ++ PD ST   V+ A
Sbjct: 223 ENAQKLFDEIPVK-DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 282

Query: 152 TAKLCASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTW 211
            A+  + ELG+ +H  +   GF  ++ I  AL+ +Y  C  +  A  LF+ +P ++ ++W
Sbjct: 283 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 342

Query: 212 NALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYH 271
           N LI GYTH   + EA   F+ ML +G  P++ T++ +L ACAHLGA++ G+WIH +I  
Sbjct: 343 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI-D 402

Query: 272 NRLR--LNV-FVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAA 331
            RL+   N   + T+LIDMYAKCG ++ A +VF  I  K++ +WN +I G+AM+G+ DA+
Sbjct: 403 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 462

Query: 332 LQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVD 391
              FSRM     +PD++TF+GLL AC H G++  GR  + +M Q + + PK+EHYGCM+D
Sbjct: 463 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 522

Query: 392 LLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGEN 451
           LLG +GL +EA ++I  M MEPD +IW +LL AC++HGN +LGE     LI++EP N  +
Sbjct: 523 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 582

Query: 452 YVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAI 511
           YVLLSN+Y+   RW EV + R +++ +G+ KVPGCSSIEI++VV+E    D   P    I
Sbjct: 583 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 642

Query: 512 YKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRI 571
           Y  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKLA+AFGL+++     L I
Sbjct: 643 YGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTI 702

Query: 572 VKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VKNLR+C +CHE  K++S +YKR I+ RDR RFHHF +G CSC DYW
Sbjct: 703 VKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CaUC06G122530 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 463.4 bits (1191), Expect = 4.0e-129
Identity = 220/522 (42.15%), Postives = 341/522 (65.33%), Query Frame = 0

Query: 96  RIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHKFSIL-PDSSTFPCVLKATAK 155
           ++F K    I+  + N++IR Y ++   + +  ++  M    ++ PD+ T+P ++KA   
Sbjct: 74  KVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTT 133

Query: 156 LCASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTWNAL 215
           +    LG+ IH  VI+ GF   +Y+  +L+H+Y +C  ++ A ++FD+MPE++ V WN++
Sbjct: 134 MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 193

Query: 216 ITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRL 275
           I G+  N K  EA   +  M + G +P   T+V +LSACA +GAL  GK +H ++    L
Sbjct: 194 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 253

Query: 276 RLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSR 335
             N+     L+D+YA+CG V+EA+ +F+E+ +KN  +W  LI G A+NG G  A++ F  
Sbjct: 254 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 313

Query: 336 M-LMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRA 395
           M   E   P E+TF+G+L AC H G+V EG   +  M++++ ++P+IEH+GCMVDLL RA
Sbjct: 314 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 373

Query: 396 GLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLS 455
           G +++A + I+SM M+P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLS
Sbjct: 374 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 433

Query: 456 NLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLD 515
           N+Y+ EQRW++V ++R  M   G+ KVPG S +E+ N V+E    D   P+ +AIY +L 
Sbjct: 434 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 493

Query: 516 NLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLR 575
            +  +L+  GYV +      D+E+EEKE +VVYHSEK+A+AF L+++P    + +VKNLR
Sbjct: 494 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 553

Query: 576 ICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           +C DCH   K+VS VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 554 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CaUC06G122530 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 457.2 bits (1175), Expect = 2.9e-127
Identity = 233/543 (42.91%), Postives = 341/543 (62.80%), Query Frame = 0

Query: 77  FLTKLID---DSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHM 136
           F+ KLI+   +S + +   S AR   +     D  + NSM R Y      L    +F  +
Sbjct: 62  FVAKLINFCTESPTESSM-SYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEI 121

Query: 137 HKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSI 196
            +  ILPD+ TFP +LKA A   A E G+ +H   +++G   +VY+   L++MY  C  +
Sbjct: 122 LEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDV 181

Query: 197 SDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSAC 256
             A  +FD + E   V +NA+ITGY    +  EA + FR M     +P+E T++ VLS+C
Sbjct: 182 DSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSC 241

Query: 257 AHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWN 316
           A LG+L+ GKWIH++   +     V V TALIDM+AKCG++D+A  +FE++  K+   W+
Sbjct: 242 ALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWS 301

Query: 317 VLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQ 376
            +I  YA +G+ + ++  F RM  E+ +PDE+TFLGLL AC H G V EGR+ +  M  +
Sbjct: 302 AMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSK 361

Query: 377 FGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEY 436
           FG+ P I+HYG MVDLL RAG LE+A + I  + + P P++WR LL AC  H N  L E 
Sbjct: 362 FGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEK 421

Query: 437 TIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVY 496
              R+ EL+ ++G +YV+LSNLY+R ++W  V  LR +M  R   KVPGCSSIE+NNVV+
Sbjct: 422 VSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVH 481

Query: 497 ELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALY-DIEKEEKERSVVYHSEKLA 556
           E  + D  K     +++ LD ++++LK +GYV  T M ++ ++  +EKE ++ YHSEKLA
Sbjct: 482 EFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLA 541

Query: 557 LAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCR 616
           + FGLLN+P   T+R+VKNLR+C DCH   K++SL++ R +V+RD  RFHHF +G CSC 
Sbjct: 542 ITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCG 601

BLAST of CaUC06G122530 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 439.9 bits (1130), Expect = 4.8e-122
Identity = 232/552 (42.03%), Postives = 332/552 (60.14%), Query Frame = 0

Query: 105 IDSQLCNSMIRRYLD--LNKHLHS-LFIFAHMHKFSILPDSSTFPCVLKATAKLCASELG 164
           ++S L N +IR  +    +   HS + ++  M    + PD  TFP +L +        LG
Sbjct: 22  LESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLG 81

Query: 165 KMIHGTVIQMGFIRDVYISTALVHMYCSCLS----------------------------- 224
           +  H  ++  G  +D ++ T+L++MY SC                               
Sbjct: 82  QRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKA 141

Query: 225 --ISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGML-----AAGAEPSERT 284
             I DA +LFDEMPERN ++W+ LI GY    K+ EA + FR M       A   P+E T
Sbjct: 142 GLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFT 201

Query: 285 VVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIW 344
           +  VLSAC  LGAL QGKW+H +I    + +++ +GTALIDMYAKCG+++ A++VF  + 
Sbjct: 202 MSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALG 261

Query: 345 -EKNVHTWNVLISGYAMNGQGDAALQAFSRMLMED-FEPDEVTFLGLLCACCHQGLVTEG 404
            +K+V  ++ +I   AM G  D   Q FS M   D   P+ VTF+G+L AC H+GL+ EG
Sbjct: 262 SKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEG 321

Query: 405 RRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACR 464
           +  +  M ++FG+ P I+HYGCMVDL GR+GL++EA   I SM MEPD +IW +LL   R
Sbjct: 322 KSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSR 381

Query: 465 VHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGC 524
           + G+ K  E  ++RLIEL+P N   YVLLSN+Y++  RW EV  +R  M ++GI KVPGC
Sbjct: 382 MLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGC 441

Query: 525 SSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERS 584
           S +E+  VV+E    D+ + E E IY  LD ++++L+E GYV  T   L D+ +++KE +
Sbjct: 442 SYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIA 501

Query: 585 VVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHH 616
           + YHSEKLA+AF L+ +     +RI+KNLRIC DCH   K++S ++ R IVVRD NRFHH
Sbjct: 502 LSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHH 561

BLAST of CaUC06G122530 vs. ExPASy Swiss-Prot
Match: Q9SUH6 (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 429.5 bits (1103), Expect = 6.5e-119
Identity = 222/546 (40.66%), Postives = 322/546 (58.97%), Query Frame = 0

Query: 70  GVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFI 129
           G     Y LT  I         +  + +F +FR   D    N+MI  Y    +   SL +
Sbjct: 251 GCYSHDYVLTGFISLYSKCGKIKMGSALFREFR-KPDIVAYNAMIHGYTSNGETELSLSL 310

Query: 130 FAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYISTALVHMYCS 189
           F  +        SST   ++  +  L    L   IHG  ++  F+    +STAL  +Y  
Sbjct: 311 FKELMLSGARLRSSTLVSLVPVSGHLM---LIYAIHGYCLKSNFLSHASVSTALTTVYSK 370

Query: 190 CLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVV 249
              I  A +LFDE PE++  +WNA+I+GYT N    +A + FR M  +   P+  T+  +
Sbjct: 371 LNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISLFREMQKSEFSPNPVTITCI 430

Query: 250 LSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNV 309
           LSACA LGAL+ GKW+H+ +       +++V TALI MYAKCG++ EA ++F+ + +KN 
Sbjct: 431 LSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNE 490

Query: 310 HTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMS 369
            TWN +ISGY ++GQG  AL  F  ML     P  VTFL +L AC H GLV EG   + S
Sbjct: 491 VTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNS 550

Query: 370 MKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTK 429
           M  ++G +P ++HY CMVD+LGRAG L+ ALQ I++MS+EP   +W  LL ACR+H +T 
Sbjct: 551 MIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTN 610

Query: 430 LGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEIN 489
           L      +L EL+P+N   +VLLSN++S ++ + +   +R     R + K PG + IEI 
Sbjct: 611 LARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIG 670

Query: 490 NVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSE 549
              +   + D   P+ + IY++L+ L  K++E GY   T++AL+D+E+EE+E  V  HSE
Sbjct: 671 ETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSE 730

Query: 550 KLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFC 609
           +LA+AFGL+ +     +RI+KNLR+CLDCH   K++S + +R IVVRD NRFHHF +G C
Sbjct: 731 RLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVC 790

Query: 610 SCRDYW 616
           SC DYW
Sbjct: 791 SCGDYW 792

BLAST of CaUC06G122530 vs. ExPASy TrEMBL
Match: A0A0A0LUK8 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G011530 PE=3 SV=1)

HSP 1 Score: 1066.6 bits (2757), Expect = 3.8e-308
Identity = 529/616 (85.88%), Postives = 566/616 (91.88%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQR-ICTNSNSVLLESEMSKFVHTQAMDLPSQRTNERRIPD 60
           M+MY RLL FSYRII+RSR+Q+ ICT SN   LESEM KFVHTQAMDLP Q TN  +IPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLD 120
           Y D          RRG+FL KLIDDSVS NGFESIARIF K+RGSI+SQ CNSMIR YLD
Sbjct: 61  YNDV---------RRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLD 120

Query: 121 LNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYI 180
           LNKHL+SL+IFA MHKFSILPDSSTFP VLKATA+LC + +GKMIHG VIQMGFI DVY 
Sbjct: 121 LNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYT 180

Query: 181 STALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGA 240
           STALVH+YC+CLSISDASQLFDEMPERNAVTWNALITGYTHNRKF++A +AFRGMLA GA
Sbjct: 181 STALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGA 240

Query: 241 EPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEK 300
           +PSERTVVVVLSAC+HLGA NQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAV E EK
Sbjct: 241 QPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEK 300

Query: 301 VFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGL 360
           VFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRMLME+F+PDEVTFLG+LCACCHQGL
Sbjct: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGL 360

Query: 361 VTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALL 420
           VTEGR Q+MSMKQQFGLQP+IEHYGCMVDLLGRAGLLEEAL+LIQSMS+EPDPIIWRALL
Sbjct: 361 VTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALL 420

Query: 421 CACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGK 480
           CACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRE+RWAEVG+LRGMM+LRGI K
Sbjct: 421 CACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRK 480

Query: 481 VPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEE 540
           VPGCSSIEINNVVYE  AS+DRKPEFEAIYKQLDNLI+KLKENGYV  TDMALYDIEKEE
Sbjct: 481 VPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEE 540

Query: 541 KERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRN 600
           KE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYIVVRDRN
Sbjct: 541 KEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRN 600

Query: 601 RFHHFSEGFCSCRDYW 616
           RFHHF EGFCSCRDYW
Sbjct: 601 RFHHFYEGFCSCRDYW 607

BLAST of CaUC06G122530 vs. ExPASy TrEMBL
Match: A0A6J1BZP4 (pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charantia OX=3673 GN=LOC111006955 PE=3 SV=1)

HSP 1 Score: 1047.3 bits (2707), Expect = 2.4e-302
Identity = 512/611 (83.80%), Postives = 557/611 (91.16%), Query Frame = 0

Query: 8   LSFSYRIIQRSRLQRICTNSNSVLL--ESEMSKFVHTQ-AMDLPSQRTNERRIPDYEDAL 67
           +  S+R+I+R+RLQ ICT SNS  L  +S++SKF+HTQ  M+LP Q TNER+IPDY D +
Sbjct: 43  IEMSFRLIRRARLQDICTISNSAFLANQSQISKFMHTQLTMNLPPQSTNERKIPDYMDVV 102

Query: 68  HEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHL 127
            +E + +R  GYFL KLIDDSVSH+GFESIA IF KFRG I+ QLCN MIR YLD NKHL
Sbjct: 103 RKEGNDMRSDGYFLMKLIDDSVSHDGFESIAPIFSKFRGVINCQLCNWMIRGYLDSNKHL 162

Query: 128 HSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYISTALV 187
           +SL IFAHMHKFSILPDSSTFP V+KATA+ C  ELGKMIHGTVIQMGFIRDVY STALV
Sbjct: 163 NSLLIFAHMHKFSILPDSSTFPAVIKATARSCNVELGKMIHGTVIQMGFIRDVYTSTALV 222

Query: 188 HMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSER 247
           HMYC+CLSISDA QLFDEMPERN+VTWNALITGYTHNRKFMEA NAFRGMLAAGAEPSER
Sbjct: 223 HMYCTCLSISDAYQLFDEMPERNSVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSER 282

Query: 248 TVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEI 307
           TVVVVLSAC+HLGALNQG WIHEFIY N+LRLNVFVGTALIDMYAKCGAV+EAEKVFEEI
Sbjct: 283 TVVVVLSACSHLGALNQGTWIHEFIYQNKLRLNVFVGTALIDMYAKCGAVEEAEKVFEEI 342

Query: 308 WEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGR 367
            EKNV+TWNVLISGYAMNGQGD ALQAFS ML E+F+PDEVTFLG+LCACCHQGLVTEGR
Sbjct: 343 REKNVYTWNVLISGYAMNGQGDEALQAFSMMLRENFKPDEVTFLGVLCACCHQGLVTEGR 402

Query: 368 RQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRV 427
           RQ++SMKQ FGL+P+IEHYGCMVDLLGRAGLLEEAL+LIQSMSMEPDPIIWRALLCACRV
Sbjct: 403 RQFVSMKQHFGLRPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPIIWRALLCACRV 462

Query: 428 HGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCS 487
           HGNTKLGEY IRRLI+LEPNNGENYVLLSNLYSRE+RW EVG+LRGMMSLRGIGKVPGCS
Sbjct: 463 HGNTKLGEYAIRRLIDLEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIGKVPGCS 522

Query: 488 SIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSV 547
           SIEI NVVYE AAS+DRKPEF+AIYKQLDN+IEKLK NGY+  T MAL+DIE+EEKE  V
Sbjct: 523 SIEIKNVVYEFAASNDRKPEFDAIYKQLDNVIEKLKANGYITGTGMALFDIEEEEKEHCV 582

Query: 548 VYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHF 607
           +YHSEKLALAFGLLNSPLDC LRIVKNLRICLDCHEFFKV SLVYKR+IVVRDRNRFHHF
Sbjct: 583 MYHSEKLALAFGLLNSPLDCALRIVKNLRICLDCHEFFKVASLVYKRFIVVRDRNRFHHF 642

Query: 608 SEGFCSCRDYW 616
           SEGFCSCRDYW
Sbjct: 643 SEGFCSCRDYW 653

BLAST of CaUC06G122530 vs. ExPASy TrEMBL
Match: A0A6J1EAY2 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata OX=3662 GN=LOC111432395 PE=3 SV=1)

HSP 1 Score: 1046.6 bits (2705), Expect = 4.1e-302
Identity = 514/617 (83.31%), Postives = 563/617 (91.25%), Query Frame = 0

Query: 1   MEMYFRLLSFSYRIIQRSRLQRICTNSNSVLL--ESEMSKFVHTQAMDLPSQRTNERRIP 60
           M+M  RLL FS+R+I+R+RLQ  CT SN   L  +S++S+FVHT+ M+LPSQ   ER+IP
Sbjct: 1   MKMDLRLLPFSFRLIRRARLQDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKIP 60

Query: 61  DYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYL 120
           D  DA  +E + +R  GYFL KLI+DSVS+NGFESIA IF KFRGSI+SQ+CNSMIR YL
Sbjct: 61  DCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGYL 120

Query: 121 DLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVY 180
           D N+HL+SL IFAHMHKFSILPDSSTFP VLKATA+LC  +LGKMIHG V+QMGFIRDVY
Sbjct: 121 DSNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDVY 180

Query: 181 ISTALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAG 240
            STALVHMYCSCLSISDASQLFDEMPERN+VTWNALITGYTHNRKF EA NAFRGMLAAG
Sbjct: 181 TSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFREAINAFRGMLAAG 240

Query: 241 AEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAE 300
           AEPSERTVVVVLSAC+HLGALNQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V+EAE
Sbjct: 241 AEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEAE 300

Query: 301 KVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQG 360
           KVFEEI ++NV+TWNVLISGY MNGQG+AALQ FSRMLME+F+PD VTFLGLLCACCHQG
Sbjct: 301 KVFEEIRDRNVYTWNVLISGYGMNGQGNAALQVFSRMLMENFKPDAVTFLGLLCACCHQG 360

Query: 361 LVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRAL 420
           LVTEGRRQ++SMKQQFGLQPKIEHYGCMVDLLGRAGLLEEAL+LI+SMSMEPDPIIWRAL
Sbjct: 361 LVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRAL 420

Query: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIG 480
           LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRE+RW EVG+LRGMMSLRGI 
Sbjct: 421 LCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIE 480

Query: 481 KVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKE 540
           KVPGCSSIEINN V+E  AS+DRK EF AIYKQLDN+++KLKENGYV  TDM+L+DIEKE
Sbjct: 481 KVPGCSSIEINNAVHEFTASNDRKREFSAIYKQLDNVMKKLKENGYVTGTDMSLFDIEKE 540

Query: 541 EKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDR 600
           EKE SV+YHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKVVSLVYKRYIVVRDR
Sbjct: 541 EKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRDR 600

Query: 601 NRFHHFSEGFCSCRDYW 616
           NRFHHFSEG CSCRDYW
Sbjct: 601 NRFHHFSEGVCSCRDYW 617

BLAST of CaUC06G122530 vs. ExPASy TrEMBL
Match: A0A5A7US40 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G002740 PE=3 SV=1)

HSP 1 Score: 1038.9 bits (2685), Expect = 8.6e-300
Identity = 503/580 (86.72%), Postives = 542/580 (93.45%), Query Frame = 0

Query: 36  MSKFVHTQAMDLPSQRTNERRIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIA 95
           M KFVHTQAMDLP Q TN+R+ PDY D          RRG+F+ KLIDDSVSHNGFESIA
Sbjct: 1   MLKFVHTQAMDLPFQETNDRKTPDYNDV---------RRGHFVMKLIDDSVSHNGFESIA 60

Query: 96  RIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKL 155
           RIF K+RGSI+SQ CNSMIRRYLDLNKHL+SL+IFA MHKFSILPD STFP VLKATA+L
Sbjct: 61  RIFSKYRGSINSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQL 120

Query: 156 CASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTWNALI 215
           C +E+GKMIHG VIQMGFI DVY STALVHMY +CLSISDASQ+FDEM ERNAVTWNALI
Sbjct: 121 CDTEVGKMIHGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALI 180

Query: 216 TGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLR 275
           TGYTHNRKFMEA +AFRGMLAAGA+PSERTVV+VLSAC+HLGALNQGKWIH+FIYHNRLR
Sbjct: 181 TGYTHNRKFMEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLR 240

Query: 276 LNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRM 335
           LNVFVGTALIDMYAKCGAVDE EKVFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRM
Sbjct: 241 LNVFVGTALIDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRM 300

Query: 336 LMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGL 395
           LME+F+PDEVTFLG+LCACCHQGLVTEGRRQ+MSMKQQFGLQP+IEHYGCMVDLLGRAGL
Sbjct: 301 LMENFKPDEVTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGL 360

Query: 396 LEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNL 455
           LEEAL+LIQSMSMEPDPIIWRALLCACRVHGNTKLGEY ++RL+ELEPNNGENYVLLSN+
Sbjct: 361 LEEALELIQSMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNI 420

Query: 456 YSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNL 515
           Y+RE+RWAEVG+LRGMM+LRGI KVPGCSSIEINNVVYE  AS+DRKPE+EAIYKQLDNL
Sbjct: 421 YARERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNL 480

Query: 516 IEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRIC 575
           I+KLKENGYV  TDMALYD+EKEEKE S++YHSEKLALAFGLLNSPLDCTLRIVKNLRIC
Sbjct: 481 IKKLKENGYVTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRIC 540

Query: 576 LDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           LDCHEFFKVVSLVYKRYIVVRDRNRFHHF EGFCSCRDYW
Sbjct: 541 LDCHEFFKVVSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 571

BLAST of CaUC06G122530 vs. ExPASy TrEMBL
Match: A0A1S4DZH3 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103494017 PE=3 SV=1)

HSP 1 Score: 1023.8 bits (2646), Expect = 2.9e-295
Identity = 495/571 (86.69%), Postives = 534/571 (93.52%), Query Frame = 0

Query: 45  MDLPSQRTNERRIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGFESIARIFPKFRGS 104
           MDLP Q TN+R+ PDY D          RRG+F+ KLIDDSVSHNGFESIARIF K+RGS
Sbjct: 1   MDLPFQETNDRKTPDYNDV---------RRGHFVMKLIDDSVSHNGFESIARIFSKYRGS 60

Query: 105 IDSQLCNSMIRRYLDLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKATAKLCASELGKMI 164
           I+SQ CNSMIRRYLDLNKHL+SL+IFA MHKFSILPD STFP VLKATA+LC +E+GKMI
Sbjct: 61  INSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMI 120

Query: 165 HGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYTHNRKF 224
           HG VIQMGFI DVY STALVHMY +CLSISDASQ+FDEM ERNAVTWNALITGYTHNRKF
Sbjct: 121 HGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKF 180

Query: 225 MEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTAL 284
           MEA +AFRGMLAAGA+PSERTVV+VLSAC+HLGALNQGKWIH+FIYHNRLRLNVFVGTAL
Sbjct: 181 MEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTAL 240

Query: 285 IDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRMLMEDFEPDE 344
           IDMYAKCGAVDE EKVFEEI EKNV+TWNVLISGYAMNGQGDAALQAFSRMLME+F+PDE
Sbjct: 241 IDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 300

Query: 345 VTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQ 404
           VTFLG+LCACCHQGLVTEGRRQ+MSMKQQFGLQP+IEHYGCMVDLLGRAGLLEEAL+LIQ
Sbjct: 301 VTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 360

Query: 405 SMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAE 464
           SMSMEPDPIIWRALLCACRVHGNTKLGEY ++RL+ELEPNNGENYVLLSN+Y+RE+RWAE
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAE 420

Query: 465 VGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGY 524
           VG+LRGMM+LRGI KVPGCSSIEINNVVYE  AS+DRKPE+EAIYKQLDNLI+KLKENGY
Sbjct: 421 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGY 480

Query: 525 VIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 584
           V  TDMALYD+EKEEKE S++YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV
Sbjct: 481 VTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 540

Query: 585 VSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VSLVYKRYIVVRDRNRFHHF EGFCSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 562

BLAST of CaUC06G122530 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 466.5 bits (1199), Expect = 3.4e-131
Identity = 244/587 (41.57%), Postives = 360/587 (61.33%), Query Frame = 0

Query: 32  LESEMSKFVHTQAMDLPSQRTNERRIPDYEDALHEEADGVRRRGYFLTKLIDDSVSHNGF 91
           L  ++  +VHT  + +  Q  N R     EDA         R     T LI    S    
Sbjct: 163 LGCDLDLYVHTSLISMYVQ--NGR----LEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 222

Query: 92  ESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHKFSILPDSSTFPCVLKA 151
           E+  ++F +     D    N+MI  Y +   +  +L +F  M K ++ PD ST   V+ A
Sbjct: 223 ENAQKLFDEIPVK-DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 282

Query: 152 TAKLCASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTW 211
            A+  + ELG+ +H  +   GF  ++ I  AL+ +Y  C  +  A  LF+ +P ++ ++W
Sbjct: 283 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 342

Query: 212 NALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYH 271
           N LI GYTH   + EA   F+ ML +G  P++ T++ +L ACAHLGA++ G+WIH +I  
Sbjct: 343 NTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI-D 402

Query: 272 NRLR--LNV-FVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAA 331
            RL+   N   + T+LIDMYAKCG ++ A +VF  I  K++ +WN +I G+AM+G+ DA+
Sbjct: 403 KRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADAS 462

Query: 332 LQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVD 391
              FSRM     +PD++TF+GLL AC H G++  GR  + +M Q + + PK+EHYGCM+D
Sbjct: 463 FDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMID 522

Query: 392 LLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGEN 451
           LLG +GL +EA ++I  M MEPD +IW +LL AC++HGN +LGE     LI++EP N  +
Sbjct: 523 LLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGS 582

Query: 452 YVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAI 511
           YVLLSN+Y+   RW EV + R +++ +G+ KVPGCSSIEI++VV+E    D   P    I
Sbjct: 583 YVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREI 642

Query: 512 YKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRI 571
           Y  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKLA+AFGL+++     L I
Sbjct: 643 YGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTI 702

Query: 572 VKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           VKNLR+C +CHE  K++S +YKR I+ RDR RFHHF +G CSC DYW
Sbjct: 703 VKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CaUC06G122530 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 463.4 bits (1191), Expect = 2.9e-130
Identity = 220/522 (42.15%), Postives = 341/522 (65.33%), Query Frame = 0

Query: 96  RIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHMHKFSIL-PDSSTFPCVLKATAK 155
           ++F K    I+  + N++IR Y ++   + +  ++  M    ++ PD+ T+P ++KA   
Sbjct: 74  KVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTT 133

Query: 156 LCASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTWNAL 215
           +    LG+ IH  VI+ GF   +Y+  +L+H+Y +C  ++ A ++FD+MPE++ V WN++
Sbjct: 134 MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 193

Query: 216 ITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRL 275
           I G+  N K  EA   +  M + G +P   T+V +LSACA +GAL  GK +H ++    L
Sbjct: 194 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 253

Query: 276 RLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSR 335
             N+     L+D+YA+CG V+EA+ +F+E+ +KN  +W  LI G A+NG G  A++ F  
Sbjct: 254 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 313

Query: 336 M-LMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRA 395
           M   E   P E+TF+G+L AC H G+V EG   +  M++++ ++P+IEH+GCMVDLL RA
Sbjct: 314 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 373

Query: 396 GLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLS 455
           G +++A + I+SM M+P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLS
Sbjct: 374 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 433

Query: 456 NLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLD 515
           N+Y+ EQRW++V ++R  M   G+ KVPG S +E+ N V+E    D   P+ +AIY +L 
Sbjct: 434 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 493

Query: 516 NLIEKLKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLR 575
            +  +L+  GYV +      D+E+EEKE +VVYHSEK+A+AF L+++P    + +VKNLR
Sbjct: 494 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 553

Query: 576 ICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           +C DCH   K+VS VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 554 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CaUC06G122530 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 457.2 bits (1175), Expect = 2.1e-128
Identity = 233/543 (42.91%), Postives = 341/543 (62.80%), Query Frame = 0

Query: 77  FLTKLID---DSVSHNGFESIARIFPKFRGSIDSQLCNSMIRRYLDLNKHLHSLFIFAHM 136
           F+ KLI+   +S + +   S AR   +     D  + NSM R Y      L    +F  +
Sbjct: 62  FVAKLINFCTESPTESSM-SYARHLFEAMSEPDIVIFNSMARGYSRFTNPLEVFSLFVEI 121

Query: 137 HKFSILPDSSTFPCVLKATAKLCASELGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSI 196
            +  ILPD+ TFP +LKA A   A E G+ +H   +++G   +VY+   L++MY  C  +
Sbjct: 122 LEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECEDV 181

Query: 197 SDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGMLAAGAEPSERTVVVVLSAC 256
             A  +FD + E   V +NA+ITGY    +  EA + FR M     +P+E T++ VLS+C
Sbjct: 182 DSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLSSC 241

Query: 257 AHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWN 316
           A LG+L+ GKWIH++   +     V V TALIDM+AKCG++D+A  +FE++  K+   W+
Sbjct: 242 ALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQAWS 301

Query: 317 VLISGYAMNGQGDAALQAFSRMLMEDFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQ 376
            +I  YA +G+ + ++  F RM  E+ +PDE+TFLGLL AC H G V EGR+ +  M  +
Sbjct: 302 AMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMVSK 361

Query: 377 FGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEY 436
           FG+ P I+HYG MVDLL RAG LE+A + I  + + P P++WR LL AC  H N  L E 
Sbjct: 362 FGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLAEK 421

Query: 437 TIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVY 496
              R+ EL+ ++G +YV+LSNLY+R ++W  V  LR +M  R   KVPGCSSIE+NNVV+
Sbjct: 422 VSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNVVH 481

Query: 497 ELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALY-DIEKEEKERSVVYHSEKLA 556
           E  + D  K     +++ LD ++++LK +GYV  T M ++ ++  +EKE ++ YHSEKLA
Sbjct: 482 EFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEKLA 541

Query: 557 LAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCR 616
           + FGLLN+P   T+R+VKNLR+C DCH   K++SL++ R +V+RD  RFHHF +G CSC 
Sbjct: 542 ITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCSCG 601

BLAST of CaUC06G122530 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 439.9 bits (1130), Expect = 3.4e-123
Identity = 232/552 (42.03%), Postives = 332/552 (60.14%), Query Frame = 0

Query: 105 IDSQLCNSMIRRYLD--LNKHLHS-LFIFAHMHKFSILPDSSTFPCVLKATAKLCASELG 164
           ++S L N +IR  +    +   HS + ++  M    + PD  TFP +L +        LG
Sbjct: 22  LESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLG 81

Query: 165 KMIHGTVIQMGFIRDVYISTALVHMYCSCLS----------------------------- 224
           +  H  ++  G  +D ++ T+L++MY SC                               
Sbjct: 82  QRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKA 141

Query: 225 --ISDASQLFDEMPERNAVTWNALITGYTHNRKFMEATNAFRGML-----AAGAEPSERT 284
             I DA +LFDEMPERN ++W+ LI GY    K+ EA + FR M       A   P+E T
Sbjct: 142 GLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFT 201

Query: 285 VVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEKVFEEIW 344
           +  VLSAC  LGAL QGKW+H +I    + +++ +GTALIDMYAKCG+++ A++VF  + 
Sbjct: 202 MSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALG 261

Query: 345 -EKNVHTWNVLISGYAMNGQGDAALQAFSRMLMED-FEPDEVTFLGLLCACCHQGLVTEG 404
            +K+V  ++ +I   AM G  D   Q FS M   D   P+ VTF+G+L AC H+GL+ EG
Sbjct: 262 SKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEG 321

Query: 405 RRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALQLIQSMSMEPDPIIWRALLCACR 464
           +  +  M ++FG+ P I+HYGCMVDL GR+GL++EA   I SM MEPD +IW +LL   R
Sbjct: 322 KSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSR 381

Query: 465 VHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGELRGMMSLRGIGKVPGC 524
           + G+ K  E  ++RLIEL+P N   YVLLSN+Y++  RW EV  +R  M ++GI KVPGC
Sbjct: 382 MLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGC 441

Query: 525 SSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEKLKENGYVIRTDMALYDIEKEEKERS 584
           S +E+  VV+E    D+ + E E IY  LD ++++L+E GYV  T   L D+ +++KE +
Sbjct: 442 SYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIA 501

Query: 585 VVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYIVVRDRNRFHH 616
           + YHSEKLA+AF L+ +     +RI+KNLRIC DCH   K++S ++ R IVVRD NRFHH
Sbjct: 502 LSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHH 561

BLAST of CaUC06G122530 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 438.0 bits (1125), Expect = 1.3e-122
Identity = 206/457 (45.08%), Postives = 308/457 (67.40%), Query Frame = 0

Query: 160 LGKMIHGTVIQMGFIRDVYISTALVHMYCSCLSISDASQLFDEMPERNAVTWNALITGYT 219
           LG+ IH  VI+ GF   +Y+  +L+H+Y +C  ++ A ++FD+MPE++ V WN++I G+ 
Sbjct: 6   LGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFA 65

Query: 220 HNRKFMEATNAFRGMLAAGAEPSERTVVVVLSACAHLGALNQGKWIHEFIYHNRLRLNVF 279
            N K  EA   +  M + G +P   T+V +LSACA +GAL  GK +H ++    L  N+ 
Sbjct: 66  ENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLH 125

Query: 280 VGTALIDMYAKCGAVDEAEKVFEEIWEKNVHTWNVLISGYAMNGQGDAALQAFSRM-LME 339
               L+D+YA+CG V+EA+ +F+E+ +KN  +W  LI G A+NG G  A++ F  M   E
Sbjct: 126 SSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTE 185

Query: 340 DFEPDEVTFLGLLCACCHQGLVTEGRRQYMSMKQQFGLQPKIEHYGCMVDLLGRAGLLEE 399
              P E+TF+G+L AC H G+V EG   +  M++++ ++P+IEH+GCMVDLL RAG +++
Sbjct: 186 GLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKK 245

Query: 400 ALQLIQSMSMEPDPIIWRALLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSR 459
           A + I+SM M+P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLSN+Y+ 
Sbjct: 246 AYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYAS 305

Query: 460 EQRWAEVGELRGMMSLRGIGKVPGCSSIEINNVVYELAASDDRKPEFEAIYKQLDNLIEK 519
           EQRW++V ++R  M   G+ KVPG S +E+ N V+E    D   P+ +AIY +L  +  +
Sbjct: 306 EQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGR 365

Query: 520 LKENGYVIRTDMALYDIEKEEKERSVVYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDC 579
           L+  GYV +      D+E+EEKE +VVYHSEK+A+AF L+++P    + +VKNLR+C DC
Sbjct: 366 LRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADC 425

Query: 580 HEFFKVVSLVYKRYIVVRDRNRFHHFSEGFCSCRDYW 616
           H   K+VS VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 426 HLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 462

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038878567.10.0e+0089.92pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida][more]
XP_004138309.27.9e-30885.88pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN6370... [more]
XP_023529316.14.5e-30383.79pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp... [more]
XP_022134759.15.0e-30283.80pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia][more]
XP_022925029.18.5e-30283.31pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Q9LN014.8e-13041.57Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
A8MQA34.0e-12942.15Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q8LK932.9e-12742.91Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q683I94.8e-12242.03Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX... [more]
Q9SUH66.5e-11940.66Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0LUK83.8e-30885.88DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G0115... [more]
A0A6J1BZP42.4e-30283.80pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charanti... [more]
A0A6J1EAY24.1e-30283.31pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata... [more]
A0A5A7US408.6e-30086.72Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4DZH32.9e-29586.69pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36... [more]
Match NameE-valueIdentityDescription
AT1G08070.13.4e-13141.57Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.12.9e-13042.15Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G02980.12.1e-12842.91Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G62890.13.4e-12342.03Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G21065.21.3e-12245.08Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 502..522
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 81..589
NoneNo IPR availablePANTHERPTHR47928:SF133SUBFAMILY NOT NAMEDcoord: 81..589
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 307..356
e-value: 1.5E-11
score: 44.3
coord: 207..254
e-value: 3.2E-9
score: 36.9
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 382..407
e-value: 0.0037
score: 17.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 383..407
e-value: 0.0022
score: 16.1
coord: 209..242
e-value: 3.8E-5
score: 21.6
coord: 310..344
e-value: 4.4E-8
score: 30.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 308..342
score: 11.783455
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 277..307
score: 8.889672
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 207..241
score: 10.98328
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 481..604
e-value: 8.6E-36
score: 122.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 105..271
e-value: 4.6E-27
score: 97.2
coord: 326..500
e-value: 1.1E-28
score: 102.5
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 288..464

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC06G122530.1CaUC06G122530.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding