CaUC01G020330 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC01G020330
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Ciama_Chr01: 33252129 .. 33253652 (-)
RNA-Seq Expression: CaUC01G020330
Synteny: CaUC01G020330
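
The location field can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page); the function names `parse_location` and `span_length` are hypothetical and chosen only for illustration.

```python
import re

# Pattern for a location string of the form "<seqid>: <start> .. <end> (<strand>)".
LOCATION = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Ciama_Chr01: 33252129 .. 33253652 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 1524 bp on the minus strand of Ciama_Chr01.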
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGGGGCTTCGTCTTGTTGGGCCGAAGCCCGGCCCATTTATTTTCCCACGCAAATCTCCAAGTTCCGCAAAAACCCTAACATTGAAAAAACCCTTTCCAATTGATTCGAGTAGCGCAGGGGGGAATTTGAAAATGGGCAAATTATCGACGTCATTTCGTTCAGCTCTCTCCACCGCCGTCGTTAGCAAACCGCCCCATTCTCCGGCGGCGCCTCCTCTCTCATCCAAGAAACCAACCCCTAAACTTTCCCGGAAAACTCCATCGGGTCAGAGCTCCGGCCACCCGGAAAAGCCTAAACTCCCAACGGTGTTCAAATCGGCTAGTCTCGCAGATGCCAAGAAGCTCTACACCTCCTTCATCTCCACTACAAAAGCCCCTCTCGACCTTCGTTTCCATAATTCCCTCCTTCAGTCTTACGCTTCAATCGCCACACTCAATGACTCCATCTCTTTCCTCCGCCACATGTCCAAAGTTCAACCTTCCTTCTCGCCCGATCAATCGACCTTCCATGTCTTACTCTCCACCTCCGGGAATGGTCCTGATTGCTCTCTCGCCTCGATTCGGCAAATTCTCAATTTCATGGTCACCAATGGCTTCGATCCTGACAAGGTAACCACTGATCTTGCTGTGCGTTCGCTTTGTTCGGTAGGTCTGGTTGATGAAGCTGTAGAATTAGTTAAGGAATTATCGCAAAAACACTCGCCTCCTGATTCTTATACATATAATCATCTCGTTAAGCAACTCTGCAAGTCCAGAGCTCTGTCTACGGTTTATGATTTTATTGTTGAAATGCGTACTAGCTGTGGTGCGAAGCCCGATCTTGTTACTTTCACAATCTTGATAGATAACGTGTGTAATAGCAAGAATCTACGGGAGGCTATGCGGTTGGTAAGTTTGCTGTATAAGGAGGGTTTTAAGCCGGATTGCTTTGTTTATAACACAATTATGAAGGGTTATTGTATGATTGGCAGGGGCAGCGAGGCAATTGAAGTCTATAAGAAAATGAAGGAGGTGGGATTGGAGCCTGATGTTGTAACTTTTAATACGTTGATTTTTGGGTTATCGAAGTCGGGGCGAGTTAAGGAAGCCAGAAAGTTTTTGGACATTATGGCAGAGATGGGTCATTTCCCTGATGCAGTTACTTACACTTCATTGATGAATGGAATGTGTTGTGAGGGTGATGCTTTGAGAGCACTGTCATTGCTTGAGGAGATGGAGGCAAAGGGGTGCAGCCCCAATTCGTGCACATATAATACGTTGCTCCATGGGTTGTCAAAGTCTAGGCTTTTGGATAGAGCGATTGAATTGTATGGTTTGATGAAGTCTAGTGATATGAAGCTCGAAACAGCTTCGTATGCTACTTTTGTGAGGGTGCTTTGCAGGAGTGGTAGAATTGCTGAAGCCTATGAAGTGTTTGATTATGCAGTTGAGAGTAAAAGTCTGACTGATGTTGCTGCATATTCAACTTTAGAGACTACATTGAAGTCTCTGAAGAAAGCAAGGGAGCAAGCCTCTATATAA

mRNA sequence

ATGGTGGGGCTTCGTCTTGTTGGGCCGAAGCCCGGCCCATTTATTTTCCCACGCAAATCTCCAAGTTCCGCAAAAACCCTAACATTGAAAAAACCCTTTCCAATTGATTCGAGTAGCGCAGGGGGGAATTTGAAAATGGGCAAATTATCGACGTCATTTCGTTCAGCTCTCTCCACCGCCGTCGTTAGCAAACCGCCCCATTCTCCGGCGGCGCCTCCTCTCTCATCCAAGAAACCAACCCCTAAACTTTCCCGGAAAACTCCATCGGGTCAGAGCTCCGGCCACCCGGAAAAGCCTAAACTCCCAACGGTGTTCAAATCGGCTAGTCTCGCAGATGCCAAGAAGCTCTACACCTCCTTCATCTCCACTACAAAAGCCCCTCTCGACCTTCGTTTCCATAATTCCCTCCTTCAGTCTTACGCTTCAATCGCCACACTCAATGACTCCATCTCTTTCCTCCGCCACATGTCCAAAGTTCAACCTTCCTTCTCGCCCGATCAATCGACCTTCCATGTCTTACTCTCCACCTCCGGGAATGGTCCTGATTGCTCTCTCGCCTCGATTCGGCAAATTCTCAATTTCATGGTCACCAATGGCTTCGATCCTGACAAGGTAACCACTGATCTTGCTGTGCGTTCGCTTTGTTCGGTAGGTCTGGTTGATGAAGCTGTAGAATTAGTTAAGGAATTATCGCAAAAACACTCGCCTCCTGATTCTTATACATATAATCATCTCGTTAAGCAACTCTGCAAGTCCAGAGCTCTGTCTACGGTTTATGATTTTATTGTTGAAATGCGTACTAGCTGTGGTGCGAAGCCCGATCTTGTTACTTTCACAATCTTGATAGATAACGTGTGTAATAGCAAGAATCTACGGGAGGCTATGCGGTTGGTAAGTTTGCTGTATAAGGAGGGTTTTAAGCCGGATTGCTTTGTTTATAACACAATTATGAAGGGTTATTGTATGATTGGCAGGGGCAGCGAGGCAATTGAAGTCTATAAGAAAATGAAGGAGGTGGGATTGGAGCCTGATGTTGTAACTTTTAATACGTTGATTTTTGGGTTATCGAAGTCGGGGCGAGTTAAGGAAGCCAGAAAGTTTTTGGACATTATGGCAGAGATGGGTCATTTCCCTGATGCAGTTACTTACACTTCATTGATGAATGGAATGTGTTGTGAGGGTGATGCTTTGAGAGCACTGTCATTGCTTGAGGAGATGGAGGCAAAGGGGTGCAGCCCCAATTCGTGCACATATAATACGTTGCTCCATGGGTTGTCAAAGTCTAGGCTTTTGGATAGAGCGATTGAATTGTATGGTTTGATGAAGTCTAGTGATATGAAGCTCGAAACAGCTTCGTATGCTACTTTTGTGAGGGTGCTTTGCAGGAGTGGTAGAATTGCTGAAGCCTATGAAGTGTTTGATTATGCAGTTGAGAGTAAAAGTCTGACTGATGTTGCTGCATATTCAACTTTAGAGACTACATTGAAGTCTCTGAAGAAAGCAAGGGAGCAAGCCTCTATATAA

Coding sequence (CDS)

ATGGTGGGGCTTCGTCTTGTTGGGCCGAAGCCCGGCCCATTTATTTTCCCACGCAAATCTCCAAGTTCCGCAAAAACCCTAACATTGAAAAAACCCTTTCCAATTGATTCGAGTAGCGCAGGGGGGAATTTGAAAATGGGCAAATTATCGACGTCATTTCGTTCAGCTCTCTCCACCGCCGTCGTTAGCAAACCGCCCCATTCTCCGGCGGCGCCTCCTCTCTCATCCAAGAAACCAACCCCTAAACTTTCCCGGAAAACTCCATCGGGTCAGAGCTCCGGCCACCCGGAAAAGCCTAAACTCCCAACGGTGTTCAAATCGGCTAGTCTCGCAGATGCCAAGAAGCTCTACACCTCCTTCATCTCCACTACAAAAGCCCCTCTCGACCTTCGTTTCCATAATTCCCTCCTTCAGTCTTACGCTTCAATCGCCACACTCAATGACTCCATCTCTTTCCTCCGCCACATGTCCAAAGTTCAACCTTCCTTCTCGCCCGATCAATCGACCTTCCATGTCTTACTCTCCACCTCCGGGAATGGTCCTGATTGCTCTCTCGCCTCGATTCGGCAAATTCTCAATTTCATGGTCACCAATGGCTTCGATCCTGACAAGGTAACCACTGATCTTGCTGTGCGTTCGCTTTGTTCGGTAGGTCTGGTTGATGAAGCTGTAGAATTAGTTAAGGAATTATCGCAAAAACACTCGCCTCCTGATTCTTATACATATAATCATCTCGTTAAGCAACTCTGCAAGTCCAGAGCTCTGTCTACGGTTTATGATTTTATTGTTGAAATGCGTACTAGCTGTGGTGCGAAGCCCGATCTTGTTACTTTCACAATCTTGATAGATAACGTGTGTAATAGCAAGAATCTACGGGAGGCTATGCGGTTGGTAAGTTTGCTGTATAAGGAGGGTTTTAAGCCGGATTGCTTTGTTTATAACACAATTATGAAGGGTTATTGTATGATTGGCAGGGGCAGCGAGGCAATTGAAGTCTATAAGAAAATGAAGGAGGTGGGATTGGAGCCTGATGTTGTAACTTTTAATACGTTGATTTTTGGGTTATCGAAGTCGGGGCGAGTTAAGGAAGCCAGAAAGTTTTTGGACATTATGGCAGAGATGGGTCATTTCCCTGATGCAGTTACTTACACTTCATTGATGAATGGAATGTGTTGTGAGGGTGATGCTTTGAGAGCACTGTCATTGCTTGAGGAGATGGAGGCAAAGGGGTGCAGCCCCAATTCGTGCACATATAATACGTTGCTCCATGGGTTGTCAAAGTCTAGGCTTTTGGATAGAGCGATTGAATTGTATGGTTTGATGAAGTCTAGTGATATGAAGCTCGAAACAGCTTCGTATGCTACTTTTGTGAGGGTGCTTTGCAGGAGTGGTAGAATTGCTGAAGCCTATGAAGTGTTTGATTATGCAGTTGAGAGTAAAAGTCTGACTGATGTTGCTGCATATTCAACTTTAGAGACTACATTGAAGTCTCTGAAGAAAGCAAGGGAGCAAGCCTCTATATAA

Protein sequence

MVGLRLVGPKPGPFIFPRKSPSSAKTLTLKKPFPIDSSSAGGNLKMGKLSTSFRSALSTAVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTVFKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQASI
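
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGTGGGGCTTCGTCTTGTTGGGCCG"))
```

The first nine codons of the CDS translate to MVGLRLVGP, which matches the start of the protein sequence shown above.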
Homology
BLAST of CaUC01G020330 vs. NCBI nr
Match: XP_038883848.1 (pentatricopeptide repeat-containing protein At2g17670 [Benincasa hispida])

HSP 1 Score: 847.0 bits (2187), Expect = 8.2e-242
Identity = 434/460 (94.35%), Positives = 447/460 (97.17%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTVF 105
           MGKLS SFRSALST VV+KPPHSPAAPPLSSKKPT  +SRK+PSGQSSGHPEKPKLPTVF
Sbjct: 1   MGKLSPSFRSALSTVVVNKPPHSPAAPPLSSKKPTQSVSRKSPSGQSSGHPEKPKLPTVF 60

Query: 106 KSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 165
           KSA LADAKKLY SFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP
Sbjct: 61  KSAGLADAKKLYASFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 120

Query: 166 DQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE 225
           DQSTFHVLL+TSGN PD SLAS+RQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE
Sbjct: 121 DQSTFHVLLATSGNDPDFSLASVRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE 180

Query: 226 LVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNV 285
           LVKELSQKHSPPDSYTYNHLVKQLCKSR LSTVYDFIV+MR+SCGAKPDLVT+TILIDNV
Sbjct: 181 LVKELSQKHSPPDSYTYNHLVKQLCKSRDLSTVYDFIVQMRSSCGAKPDLVTYTILIDNV 240

Query: 286 CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDV 345
           CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAI VYKKMKEVGLEPDV
Sbjct: 241 CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIGVYKKMKEVGLEPDV 300

Query: 346 VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEE 405
           VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEG+AL ALSLL+E
Sbjct: 301 VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGNALGALSLLQE 360

Query: 406 MEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGR 465
           MEAKGCSPNSCTYNTLLHGLSKSRLLDR IELYGLMK+S+MKLETASYATFVR LCRSGR
Sbjct: 361 MEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKASNMKLETASYATFVRALCRSGR 420

Query: 466 IAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQA 506
           IAEAYEVFDYAVESKSLTDVAAYSTLE+TLKSLKKAREQA
Sbjct: 421 IAEAYEVFDYAVESKSLTDVAAYSTLESTLKSLKKAREQA 460
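
The percentages in each HSP summary line are the matched counts divided by the alignment length. The following sketch re-derives them from the counts; the regular expression and the function name `recompute_percentages` are assumptions made for illustration, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches the identity/positives part of an HSP summary line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = match.groups()
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


line = "Identity = 434/460 (94.35%), Positives = 447/460 (97.17%), Query Frame = 0"
print(recompute_percentages(line))
```

For the Benincasa hispida hit above, 434 of 460 aligned positions gives 94.35% identity and 447 of 460 gives 97.17% positives, in agreement with the reported values.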

BLAST of CaUC01G020330 vs. NCBI nr
Match: XP_004135005.1 (pentatricopeptide repeat-containing protein At2g17670 [Cucumis sativus] >KGN48978.1 hypothetical protein Csa_003999 [Cucumis sativus])

HSP 1 Score: 807.7 bits (2085), Expect = 5.5e-230
Identity = 414/461 (89.80%), Positives = 438/461 (95.01%), Query Frame = 0

Query: 46  MGKLSTSFRSALST-AVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTV 105
           MGKLS SFRS LS+  V++KPPHSPAAPPLSSKK TPK SRKTPSGQSSGHPEKPKLPTV
Sbjct: 1   MGKLSPSFRSILSSNTVINKPPHSPAAPPLSSKKATPKPSRKTPSGQSSGHPEKPKLPTV 60

Query: 106 FKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFS 165
           FKSASLADAKKLY+SF+S TKAP +LR HNSLLQSYASIATLNDSISFLRHMSKVQPSFS
Sbjct: 61  FKSASLADAKKLYSSFVSATKAPFNLRVHNSLLQSYASIATLNDSISFLRHMSKVQPSFS 120

Query: 166 PDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAV 225
           PDQSTFH+LLSTSGN PD +LAS++QILNFMVTNGF+PDKVT DLAVRSLCSVGLVDEAV
Sbjct: 121 PDQSTFHILLSTSGNRPDSTLASVQQILNFMVTNGFNPDKVTADLAVRSLCSVGLVDEAV 180

Query: 226 ELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDN 285
           ELVKELSQKH+PPD YTYNHLVKQLCKSRALSTVY+FIVEMR+SCGAKPDLVT+TILIDN
Sbjct: 181 ELVKELSQKHTPPDIYTYNHLVKQLCKSRALSTVYNFIVEMRSSCGAKPDLVTYTILIDN 240

Query: 286 VCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPD 345
           VCNS NLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCM+GRG+EAI VYKKMKEVGLEPD
Sbjct: 241 VCNSNNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMVGRGAEAIGVYKKMKEVGLEPD 300

Query: 346 VVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLE 405
           VVTFNTLIFGLSKSGRVKEAR FLDIMAEMGHFPDAVTYTSLMNGMC EG+AL ALSLL+
Sbjct: 301 VVTFNTLIFGLSKSGRVKEARNFLDIMAEMGHFPDAVTYTSLMNGMCREGNALGALSLLK 360

Query: 406 EMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSG 465
           EMEAKGC+PNSCTYNTLLHGLSKSRLLDR IELYGLMKS DMKLETASY+TFVR LCRSG
Sbjct: 361 EMEAKGCNPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSCDMKLETASYSTFVRALCRSG 420

Query: 466 RIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQA 506
           RIAEAYEVFDYAVESKSLTDV+AY +LE+TLKSLK AREQA
Sbjct: 421 RIAEAYEVFDYAVESKSLTDVSAYLSLESTLKSLKNAREQA 461

BLAST of CaUC01G020330 vs. NCBI nr
Match: XP_016899201.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g17670 [Cucumis melo])

HSP 1 Score: 791.6 bits (2043), Expect = 4.1e-225
Identity = 409/460 (88.91%), Positives = 433/460 (94.13%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTVF 105
           MGKLS SFRS LST+V+ KP  SPAAPPLSSKKPTPK SRKTPSGQSSGHP KPKLPTVF
Sbjct: 1   MGKLSPSFRSILSTSVIHKPTLSPAAPPLSSKKPTPKPSRKTPSGQSSGHPVKPKLPTVF 60

Query: 106 KSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 165
           KSASLADAKKLY+SFIST+KAP +LR HNSLLQSYASIATLNDSISFLRHMSKVQPSFSP
Sbjct: 61  KSASLADAKKLYSSFISTSKAPFNLRVHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 120

Query: 166 DQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE 225
           DQSTFH+LLSTS N PD SLAS+R+ILNFMVTNGF+PDKVT DLAVRSLCSVGLVDEAVE
Sbjct: 121 DQSTFHILLSTSENRPDSSLASVRKILNFMVTNGFNPDKVTADLAVRSLCSVGLVDEAVE 180

Query: 226 LVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNV 285
           LVKELSQKH+P D YTYNHLVKQLCKSRALSTVY+FIVEMR+SCGAKPDLVT+TILIDNV
Sbjct: 181 LVKELSQKHTPLDIYTYNHLVKQLCKSRALSTVYNFIVEMRSSCGAKPDLVTYTILIDNV 240

Query: 286 CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDV 345
           CNS NLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCM+GRG+EAI VYKKMKEVGLEPD+
Sbjct: 241 CNSNNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMVGRGAEAIGVYKKMKEVGLEPDL 300

Query: 346 VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEE 405
           VTFNTLIFGLSKSGRVKEA  FLDIMAEMGHFPD VTYTSLMNGMC EG+AL ALSLL+E
Sbjct: 301 VTFNTLIFGLSKSGRVKEAIDFLDIMAEMGHFPDTVTYTSLMNGMCREGNALGALSLLKE 360

Query: 406 MEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGR 465
           MEAKGC+PNS TYNTLLHGLSKSRLLDR IELYGLMKSSDMKLE+ASY+TFVR LCRSGR
Sbjct: 361 MEAKGCNPNSFTYNTLLHGLSKSRLLDRGIELYGLMKSSDMKLESASYSTFVRALCRSGR 420

Query: 466 IAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQA 506
           IAEAYEVFDYAVESKSLTDV+AY +LE+TLKSLK AREQA
Sbjct: 421 IAEAYEVFDYAVESKSLTDVSAYLSLESTLKSLKNAREQA 460

BLAST of CaUC01G020330 vs. NCBI nr
Match: XP_023518399.1 (pentatricopeptide repeat-containing protein At2g17670-like [Cucurbita pepo subsp. pepo] >XP_023518400.1 pentatricopeptide repeat-containing protein At2g17670-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 770.8 bits (1989), Expect = 7.4e-219
Identity = 397/469 (84.65%), Positives = 425/469 (90.62%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPL-------SSKKPTPKLSRKTPSGQSSGHPEK 105
           MGKLS SFRSA+STA+V+KPPH PAAP L        SKK  PK SR+  S QSSGH EK
Sbjct: 1   MGKLSPSFRSAISTAIVNKPPHPPAAPSLLSGEIRSLSKKRPPKHSRENQSAQSSGHGEK 60

Query: 106 PKLPTVFKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSK 165
           PKLPT+FKSASLADAKKLY+SFI+TTKAPLD+RF+NSLL+SYASIA+LNDSISFLRHMSK
Sbjct: 61  PKLPTLFKSASLADAKKLYSSFINTTKAPLDVRFYNSLLRSYASIASLNDSISFLRHMSK 120

Query: 166 VQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVG 225
           VQPSFSP++STFHVLLSTSG G D SLAS+RQILNFMVT GF+PDK TTD+AVRSLCS G
Sbjct: 121 VQPSFSPERSTFHVLLSTSGTGTDSSLASVRQILNFMVTQGFNPDKATTDIAVRSLCSAG 180

Query: 226 LVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTF 285
           L+DEAVELV+E SQKHSPPDSYTYNHLVKQLCKSR+LSTVYDFI EMR SCGA PDLVT+
Sbjct: 181 LIDEAVELVREFSQKHSPPDSYTYNHLVKQLCKSRSLSTVYDFIEEMRISCGATPDLVTY 240

Query: 286 TILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKE 345
           TILIDNVCN KNLRE  RLVS+L KEGFKPDCF+YN IMKGYCM+GRG EAI VYKKMKE
Sbjct: 241 TILIDNVCNGKNLREVTRLVSVLAKEGFKPDCFLYNIIMKGYCMLGRGVEAIGVYKKMKE 300

Query: 346 VGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALR 405
            GLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDAL 
Sbjct: 301 EGLEPDVVTFNTLIFGLSKSGRVKDARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALG 360

Query: 406 ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVR 465
           ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDR IELYGLMKSSDMKLE ASYATFVR
Sbjct: 361 ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSSDMKLEVASYATFVR 420

Query: 466 VLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQASI 508
            LCRSGRIAEAYEVFDYAVESKSLTDV AYSTLE TLK+LKKA E+ ++
Sbjct: 421 ALCRSGRIAEAYEVFDYAVESKSLTDVTAYSTLEITLKALKKAGEKGNV 469

BLAST of CaUC01G020330 vs. NCBI nr
Match: XP_022926982.1 (pentatricopeptide repeat-containing protein At2g17670 [Cucurbita moschata] >XP_022926983.1 pentatricopeptide repeat-containing protein At2g17670 [Cucurbita moschata] >XP_022926984.1 pentatricopeptide repeat-containing protein At2g17670 [Cucurbita moschata])

HSP 1 Score: 769.2 bits (1985), Expect = 2.2e-218
Identity = 398/469 (84.86%), Positives = 425/469 (90.62%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPL-------SSKKPTPKLSRKTPSGQSSGHPEK 105
           MGKLS SFRSA+ST +V+KPPH PAAP L        SKK  PK SR+  S QSS H EK
Sbjct: 1   MGKLSPSFRSAISTPIVNKPPHPPAAPSLLSGEIRSLSKKRPPKHSRENQSAQSSAHGEK 60

Query: 106 PKLPTVFKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSK 165
           PKLPT+FKSASLADAKKLY+SFI+TTKAPLD+RF+NSLLQSYASIA+LNDSISFLRHMSK
Sbjct: 61  PKLPTLFKSASLADAKKLYSSFINTTKAPLDVRFYNSLLQSYASIASLNDSISFLRHMSK 120

Query: 166 VQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVG 225
           VQPSFSP++STFHVLLSTSGNG D SLAS+RQILNFMVT GF+PDK TTD+AVRSLCS G
Sbjct: 121 VQPSFSPERSTFHVLLSTSGNGTDSSLASVRQILNFMVTQGFNPDKATTDIAVRSLCSAG 180

Query: 226 LVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTF 285
           L+DEAVELV+E SQKHSPPDSYTYNHLVKQLCKSR+LSTVYDFI EMR+SCGA PDLVT+
Sbjct: 181 LIDEAVELVREFSQKHSPPDSYTYNHLVKQLCKSRSLSTVYDFIEEMRSSCGATPDLVTY 240

Query: 286 TILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKE 345
           TILIDNVCN KNLREA RLVS+L KEGFKPDCFVYN IMKGYCM+GRG EAI VYKKMKE
Sbjct: 241 TILIDNVCNGKNLREATRLVSVLAKEGFKPDCFVYNIIMKGYCMLGRGVEAIGVYKKMKE 300

Query: 346 VGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALR 405
            GLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMC EGDAL 
Sbjct: 301 EGLEPDVVTFNTLIFGLSKSGRVKDARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALG 360

Query: 406 ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVR 465
           ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDR IELYGLMKSSDMKLE ASYATFVR
Sbjct: 361 ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSSDMKLEVASYATFVR 420

Query: 466 VLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQASI 508
            LCRSGRIAEAYEVFDYAVESKSLTDV AYSTLE TLK+LKKA E+ ++
Sbjct: 421 ALCRSGRIAEAYEVFDYAVESKSLTDVTAYSTLEITLKALKKAGEKGNV 469

BLAST of CaUC01G020330 vs. ExPASy Swiss-Prot
Match: Q84J71 (Pentatricopeptide repeat-containing protein At2g17670 OS=Arabidopsis thaliana OX=3702 GN=At2g17670 PE=1 SV=1)

HSP 1 Score: 553.9 bits (1426), Expect = 1.9e-156
Identity = 278/462 (60.17%), Positives = 355/462 (76.84%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTVF 105
           MGK+ +SFRS  +  +V K   SP APP   +  T          Q++  P +P L   F
Sbjct: 1   MGKVPSSFRSMPANLLVRKTTPSPPAPPRDFRNRTAVGGDSAKLPQNTQAPREPSLRNPF 60

Query: 106 KSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 165
           KS +L+DAK L+ S  +T++ PLDL+FHNS+LQSY SIA +ND++   +H+ K QP+F P
Sbjct: 61  KSPNLSDAKSLFNSIAATSRIPLDLKFHNSVLQSYGSIAVVNDTVKLFQHILKSQPNFRP 120

Query: 166 DQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE 225
            +STF +LLS +   PD S++++ ++LN MV NG +PD+VTTD+AVRSLC  G VDEA +
Sbjct: 121 GRSTFLILLSHACRAPDSSISNVHRVLNLMVNNGLEPDQVTTDIAVRSLCETGRVDEAKD 180

Query: 226 LVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNV 285
           L+KEL++KHSPPD+YTYN L+K LCK + L  VY+F+ EMR     KPDLV+FTILIDNV
Sbjct: 181 LMKELTEKHSPPDTYTYNFLLKHLCKCKDLHVVYEFVDEMRDDFDVKPDLVSFTILIDNV 240

Query: 286 CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDV 345
           CNSKNLREAM LVS L   GFKPDCF+YNTIMKG+C + +GSEA+ VYKKMKE G+EPD 
Sbjct: 241 CNSKNLREAMYLVSKLGNAGFKPDCFLYNTIMKGFCTLSKGSEAVGVYKKMKEEGVEPDQ 300

Query: 346 VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEE 405
           +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMC +G++L ALSLLEE
Sbjct: 301 ITYNTLIFGLSKAGRVEEARMYLKTMVDAGYEPDTATYTSLMNGMCRKGESLGALSLLEE 360

Query: 406 MEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGR 465
           MEA+GC+PN CTYNTLLHGL K+RL+D+ +ELY +MKSS +KLE+  YAT VR L +SG+
Sbjct: 361 MEARGCAPNDCTYNTLLHGLCKARLMDKGMELYEMMKSSGVKLESNGYATLVRSLVKSGK 420

Query: 466 IAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQASI 508
           +AEAYEVFDYAV+SKSL+D +AYSTLETTLK LKKA+EQ  +
Sbjct: 421 VAEAYEVFDYAVDSKSLSDASAYSTLETTLKWLKKAKEQGLV 462

BLAST of CaUC01G020330 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 189.5 bits (480), Expect = 9.3e-47
Identity = 102/346 (29.48%), Positives = 193/346 (55.78%), Query Frame = 0

Query: 129 DLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSPDQSTFHVLL-STSGNGPDCSLAS 188
           D+  +N ++  Y     +N+++S L  M     S SPD  T++ +L S   +G    L  
Sbjct: 171 DVITYNVMISGYCKAGEINNALSVLDRM-----SVSPDVVTYNTILRSLCDSG---KLKQ 230

Query: 189 IRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVELVKELSQKHSPPDSYTYNHLVK 248
             ++L+ M+     PD +T  + + + C    V  A++L+ E+  +   PD  TYN LV 
Sbjct: 231 AMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVN 290

Query: 249 QLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNVCNSKNLREAMRLVSLLYKEGFK 308
            +CK   L     F+ +M +S G +P+++T  I++ ++C++    +A +L++ + ++GF 
Sbjct: 291 GICKEGRLDEAIKFLNDMPSS-GCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFS 350

Query: 309 PDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDVVTFNTLIFGLSKSGRVKEARKF 368
           P    +N ++   C  G    AI++ +KM + G +P+ +++N L+ G  K  ++  A ++
Sbjct: 351 PSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEY 410

Query: 369 LDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEEMEAKGCSPNSCTYNTLLHGLSK 428
           L+ M   G +PD VTY +++  +C +G    A+ +L ++ +KGCSP   TYNT++ GL+K
Sbjct: 411 LERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAK 470

Query: 429 SRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGRIAEAYEVF 474
           +    +AI+L   M++ D+K +T +Y++ V  L R G++ EA + F
Sbjct: 471 AGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFF 507

BLAST of CaUC01G020330 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 188.0 bits (476), Expect = 2.7e-46
Identity = 106/339 (31.27%), Positives = 184/339 (54.28%), Query Frame = 0

Query: 134 NSLLQSYASIATLNDSISFLRHMSKVQPSFSPDQSTFHVLLSTSGNGPDCSLASIR---Q 193
           N ++  +     + D+++F++ MS  Q  F PDQ TF+ L+    NG  C    ++   +
Sbjct: 263 NVIVHGFCKEGRVEDALNFIQEMSN-QDGFFPDQYTFNTLV----NGL-CKAGHVKHAIE 322

Query: 194 ILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVELVKELSQKHSPPDSYTYNHLVKQLC 253
           I++ M+  G+DPD  T +  +  LC +G V EAVE++ ++  +   P++ TYN L+  LC
Sbjct: 323 IMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLC 382

Query: 254 KSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNVCNSKNLREAMRLVSLLYKEGFKPDC 313
           K   +    + +  + TS G  PD+ TF  LI  +C ++N R AM L   +  +G +PD 
Sbjct: 383 KENQVEEATE-LARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDE 442

Query: 314 FVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDVVTFNTLIFGLSKSGRVKEARKFLDI 373
           F YN ++   C  G+  EA+ + K+M+  G    V+T+NTLI G  K+ + +EA +  D 
Sbjct: 443 FTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDE 502

Query: 374 MAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRL 433
           M   G   ++VTY +L++G+C       A  L+++M  +G  P+  TYN+LL    +   
Sbjct: 503 MEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGD 562

Query: 434 LDRAIELYGLMKSSDMKLETASYATFVRVLCRSGRIAEA 470
           + +A ++   M S+  + +  +Y T +  LC++GR+  A
Sbjct: 563 IKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVA 594

BLAST of CaUC01G020330 vs. ExPASy Swiss-Prot
Match: Q9ASZ8 (Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana OX=3702 GN=At1g12620 PE=2 SV=1)

HSP 1 Score: 184.5 bits (467), Expect = 3.0e-45
Identity = 104/343 (30.32%), Positives = 181/343 (52.77%), Query Frame = 0

Query: 149 SISFLRHMSKVQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTD 208
           S++F      ++  + PD  TF  L+  +G   +  ++   ++++ MV  G  P  +T +
Sbjct: 124 SLAFSAMGKIIKLGYEPDTVTFSTLI--NGLCLEGRVSEALELVDRMVEMGHKPTLITLN 183

Query: 209 LAVRSLCSVGLVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTS 268
             V  LC  G V +AV L+  + +    P+  TY  ++K +CKS   +   + + +M   
Sbjct: 184 ALVNGLCLNGKVSDAVLLIDRMVETGFQPNEVTYGPVLKVMCKSGQTALAMELLRKMEER 243

Query: 269 CGAKPDLVTFTILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSE 328
              K D V ++I+ID +C   +L  A  L + +  +GFK D  +Y T+++G+C  GR  +
Sbjct: 244 -KIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIIIYTTLIRGFCYAGRWDD 303

Query: 329 AIEVYKKMKEVGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMN 388
             ++ + M +  + PDVV F+ LI    K G+++EA +    M + G  PD VTYTSL++
Sbjct: 304 GAKLLRDMIKRKITPDVVAFSALIDCFVKEGKLREAEELHKEMIQRGISPDTVTYTSLID 363

Query: 389 GMCCEGDALRALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKL 448
           G C E    +A  +L+ M +KGC PN  T+N L++G  K+ L+D  +EL+  M    +  
Sbjct: 364 GFCKENQLDKANHMLDLMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMSLRGVVA 423

Query: 449 ETASYATFVRVLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTL 492
           +T +Y T ++  C  G++  A E+F   V  +   D+ +Y  L
Sbjct: 424 DTVTYNTLIQGFCELGKLEVAKELFQEMVSRRVRPDIVSYKIL 463

BLAST of CaUC01G020330 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 183.3 bits (464), Expect = 6.7e-45
Identity = 99/293 (33.79%), Positives = 160/293 (54.61%), Query Frame = 0

Query: 183 CSLASIRQ---ILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVELVKELSQKHSPPDS 242
           C L  I++   +L  M   G+ PD ++    V   C  G +D+  +L++ + +K   P+S
Sbjct: 257 CQLGRIKEAHHLLLLMELKGYTPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNS 316

Query: 243 YTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNVCNSKNLREAMRLVS 302
           Y Y  ++  LC+   L+   +   EM    G  PD V +T LID  C   ++R A +   
Sbjct: 317 YIYGSIIGLLCRICKLAEAEEAFSEMIRQ-GILPDTVVYTTLIDGFCKRGDIRAASKFFY 376

Query: 303 LLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDVVTFNTLIFGLSKSG 362
            ++     PD   Y  I+ G+C IG   EA +++ +M   GLEPD VTF  LI G  K+G
Sbjct: 377 EMHSRDITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAG 436

Query: 363 RVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEEMEAKGCSPNSCTYN 422
            +K+A +  + M + G  P+ VTYT+L++G+C EGD   A  LL EM   G  PN  TYN
Sbjct: 437 HMKDAFRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELLHEMWKIGLQPNIFTYN 496

Query: 423 TLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGRIAEAYEV 473
           ++++GL KS  ++ A++L G  +++ +  +T +Y T +   C+SG + +A E+
Sbjct: 497 SIVNGLCKSGNIEEAVKLVGEFEAAGLNADTVTYTTLMDAYCKSGEMDKAQEI 548

BLAST of CaUC01G020330 vs. ExPASy TrEMBL
Match: A0A0A0KHF8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G507510 PE=4 SV=1)

HSP 1 Score: 807.7 bits (2085), Expect = 2.7e-230
Identity = 414/461 (89.80%), Positives = 438/461 (95.01%), Query Frame = 0

Query: 46  MGKLSTSFRSALST-AVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTV 105
           MGKLS SFRS LS+  V++KPPHSPAAPPLSSKK TPK SRKTPSGQSSGHPEKPKLPTV
Sbjct: 1   MGKLSPSFRSILSSNTVINKPPHSPAAPPLSSKKATPKPSRKTPSGQSSGHPEKPKLPTV 60

Query: 106 FKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFS 165
           FKSASLADAKKLY+SF+S TKAP +LR HNSLLQSYASIATLNDSISFLRHMSKVQPSFS
Sbjct: 61  FKSASLADAKKLYSSFVSATKAPFNLRVHNSLLQSYASIATLNDSISFLRHMSKVQPSFS 120

Query: 166 PDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAV 225
           PDQSTFH+LLSTSGN PD +LAS++QILNFMVTNGF+PDKVT DLAVRSLCSVGLVDEAV
Sbjct: 121 PDQSTFHILLSTSGNRPDSTLASVQQILNFMVTNGFNPDKVTADLAVRSLCSVGLVDEAV 180

Query: 226 ELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDN 285
           ELVKELSQKH+PPD YTYNHLVKQLCKSRALSTVY+FIVEMR+SCGAKPDLVT+TILIDN
Sbjct: 181 ELVKELSQKHTPPDIYTYNHLVKQLCKSRALSTVYNFIVEMRSSCGAKPDLVTYTILIDN 240

Query: 286 VCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPD 345
           VCNS NLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCM+GRG+EAI VYKKMKEVGLEPD
Sbjct: 241 VCNSNNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMVGRGAEAIGVYKKMKEVGLEPD 300

Query: 346 VVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLE 405
           VVTFNTLIFGLSKSGRVKEAR FLDIMAEMGHFPDAVTYTSLMNGMC EG+AL ALSLL+
Sbjct: 301 VVTFNTLIFGLSKSGRVKEARNFLDIMAEMGHFPDAVTYTSLMNGMCREGNALGALSLLK 360

Query: 406 EMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSG 465
           EMEAKGC+PNSCTYNTLLHGLSKSRLLDR IELYGLMKS DMKLETASY+TFVR LCRSG
Sbjct: 361 EMEAKGCNPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSCDMKLETASYSTFVRALCRSG 420

Query: 466 RIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQA 506
           RIAEAYEVFDYAVESKSLTDV+AY +LE+TLKSLK AREQA
Sbjct: 421 RIAEAYEVFDYAVESKSLTDVSAYLSLESTLKSLKNAREQA 461

BLAST of CaUC01G020330 vs. ExPASy TrEMBL
Match: A0A1S4DT84 (pentatricopeptide repeat-containing protein At2g17670 OS=Cucumis melo OX=3656 GN=LOC103485180 PE=4 SV=1)

HSP 1 Score: 791.6 bits (2043), Expect = 2.0e-225
Identity = 409/460 (88.91%), Positives = 433/460 (94.13%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTVF 105
           MGKLS SFRS LST+V+ KP  SPAAPPLSSKKPTPK SRKTPSGQSSGHP KPKLPTVF
Sbjct: 1   MGKLSPSFRSILSTSVIHKPTLSPAAPPLSSKKPTPKPSRKTPSGQSSGHPVKPKLPTVF 60

Query: 106 KSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 165
           KSASLADAKKLY+SFIST+KAP +LR HNSLLQSYASIATLNDSISFLRHMSKVQPSFSP
Sbjct: 61  KSASLADAKKLYSSFISTSKAPFNLRVHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 120

Query: 166 DQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE 225
           DQSTFH+LLSTS N PD SLAS+R+ILNFMVTNGF+PDKVT DLAVRSLCSVGLVDEAVE
Sbjct: 121 DQSTFHILLSTSENRPDSSLASVRKILNFMVTNGFNPDKVTADLAVRSLCSVGLVDEAVE 180

Query: 226 LVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNV 285
           LVKELSQKH+P D YTYNHLVKQLCKSRALSTVY+FIVEMR+SCGAKPDLVT+TILIDNV
Sbjct: 181 LVKELSQKHTPLDIYTYNHLVKQLCKSRALSTVYNFIVEMRSSCGAKPDLVTYTILIDNV 240

Query: 286 CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDV 345
           CNS NLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCM+GRG+EAI VYKKMKEVGLEPD+
Sbjct: 241 CNSNNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMVGRGAEAIGVYKKMKEVGLEPDL 300

Query: 346 VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEE 405
           VTFNTLIFGLSKSGRVKEA  FLDIMAEMGHFPD VTYTSLMNGMC EG+AL ALSLL+E
Sbjct: 301 VTFNTLIFGLSKSGRVKEAIDFLDIMAEMGHFPDTVTYTSLMNGMCREGNALGALSLLKE 360

Query: 406 MEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGR 465
           MEAKGC+PNS TYNTLLHGLSKSRLLDR IELYGLMKSSDMKLE+ASY+TFVR LCRSGR
Sbjct: 361 MEAKGCNPNSFTYNTLLHGLSKSRLLDRGIELYGLMKSSDMKLESASYSTFVRALCRSGR 420

Query: 466 IAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQA 506
           IAEAYEVFDYAVESKSLTDV+AY +LE+TLKSLK AREQA
Sbjct: 421 IAEAYEVFDYAVESKSLTDVSAYLSLESTLKSLKNAREQA 460

BLAST of CaUC01G020330 vs. ExPASy TrEMBL
Match: A0A6J1EGP9 (pentatricopeptide repeat-containing protein At2g17670 OS=Cucurbita moschata OX=3662 GN=LOC111433937 PE=4 SV=1)

HSP 1 Score: 769.2 bits (1985), Expect = 1.0e-218
Identity = 398/469 (84.86%), Positives = 425/469 (90.62%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPL-------SSKKPTPKLSRKTPSGQSSGHPEK 105
           MGKLS SFRSA+ST +V+KPPH PAAP L        SKK  PK SR+  S QSS H EK
Sbjct: 1   MGKLSPSFRSAISTPIVNKPPHPPAAPSLLSGEIRSLSKKRPPKHSRENQSAQSSAHGEK 60

Query: 106 PKLPTVFKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSK 165
           PKLPT+FKSASLADAKKLY+SFI+TTKAPLD+RF+NSLLQSYASIA+LNDSISFLRHMSK
Sbjct: 61  PKLPTLFKSASLADAKKLYSSFINTTKAPLDVRFYNSLLQSYASIASLNDSISFLRHMSK 120

Query: 166 VQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVG 225
           VQPSFSP++STFHVLLSTSGNG D SLAS+RQILNFMVT GF+PDK TTD+AVRSLCS G
Sbjct: 121 VQPSFSPERSTFHVLLSTSGNGTDSSLASVRQILNFMVTQGFNPDKATTDIAVRSLCSAG 180

Query: 226 LVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTF 285
           L+DEAVELV+E SQKHSPPDSYTYNHLVKQLCKSR+LSTVYDFI EMR+SCGA PDLVT+
Sbjct: 181 LIDEAVELVREFSQKHSPPDSYTYNHLVKQLCKSRSLSTVYDFIEEMRSSCGATPDLVTY 240

Query: 286 TILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKE 345
           TILIDNVCN KNLREA RLVS+L KEGFKPDCFVYN IMKGYCM+GRG EAI VYKKMKE
Sbjct: 241 TILIDNVCNGKNLREATRLVSVLAKEGFKPDCFVYNIIMKGYCMLGRGVEAIGVYKKMKE 300

Query: 346 VGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALR 405
            GLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPDAVTYTSLMNGMC EGDAL 
Sbjct: 301 EGLEPDVVTFNTLIFGLSKSGRVKDARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDALG 360

Query: 406 ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVR 465
           ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDR IELYGLMKSSDMKLE ASYATFVR
Sbjct: 361 ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRGIELYGLMKSSDMKLEVASYATFVR 420

Query: 466 VLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQASI 508
            LCRSGRIAEAYEVFDYAVESKSLTDV AYSTLE TLK+LKKA E+ ++
Sbjct: 421 ALCRSGRIAEAYEVFDYAVESKSLTDVTAYSTLEITLKALKKAGEKGNV 469

BLAST of CaUC01G020330 vs. ExPASy TrEMBL
Match: A0A6J1KT16 (pentatricopeptide repeat-containing protein At2g17670-like OS=Cucurbita maxima OX=3661 GN=LOC111497342 PE=4 SV=1)

HSP 1 Score: 764.2 bits (1972), Expect = 3.4e-217
Identity = 393/469 (83.80%), Positives = 427/469 (91.04%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPL-------SSKKPTPKLSRKTPSGQSSGHPEK 105
           MGKLS SFRSA+STA+V+KPP+ PAAP L        SKK  PK SR+  S QSSGH EK
Sbjct: 1   MGKLSPSFRSAISTAIVNKPPNPPAAPSLLSGEIRSLSKKRPPKHSRENQSAQSSGHGEK 60

Query: 106 PKLPTVFKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSK 165
           PKLPT+FKSASLA+AKKLY+SFI+TTKAPLD+RF+NSLL SY SIA+LNDSISFLRHMSK
Sbjct: 61  PKLPTLFKSASLAEAKKLYSSFINTTKAPLDVRFYNSLLHSYTSIASLNDSISFLRHMSK 120

Query: 166 VQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVG 225
           VQP+FSP++STFHVLLSTSGNG D SLAS+RQILNFMVT+GF+PDK T D+AVRSLCS G
Sbjct: 121 VQPTFSPERSTFHVLLSTSGNGTDSSLASVRQILNFMVTHGFNPDKATIDIAVRSLCSAG 180

Query: 226 LVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTF 285
           L+DEAVELV+E SQKHSPPDSYTYNHLVKQLCKSR+LSTVY FI EMR+SCGA PDLVT+
Sbjct: 181 LIDEAVELVREFSQKHSPPDSYTYNHLVKQLCKSRSLSTVYGFIHEMRSSCGANPDLVTY 240

Query: 286 TILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKE 345
           TILIDNVCN KNLREA RLVS+L +EGFKPDCFVYNTIMKGYCM+GRGSEAI VYKKMKE
Sbjct: 241 TILIDNVCNGKNLREATRLVSVLAEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMKE 300

Query: 346 VGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALR 405
            GLEPDVVTFNTLIFGLSKSGRVK+ARKFLDIMAEMGHFPD VTYTSLMNGMC EGDAL 
Sbjct: 301 EGLEPDVVTFNTLIFGLSKSGRVKDARKFLDIMAEMGHFPDVVTYTSLMNGMCREGDALG 360

Query: 406 ALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVR 465
           ALSLLEEMEAKGCSPNSC+YNTLLHGLSKSRLLD+ IELYGLMKS DMKLE ASYATFVR
Sbjct: 361 ALSLLEEMEAKGCSPNSCSYNTLLHGLSKSRLLDKGIELYGLMKSGDMKLEAASYATFVR 420

Query: 466 VLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQASI 508
            LCRSGRIAEAYEVFDYAVES+SLTDVAAYSTLETTLK+LKKAREQ ++
Sbjct: 421 ALCRSGRIAEAYEVFDYAVESRSLTDVAAYSTLETTLKALKKAREQGNV 469

BLAST of CaUC01G020330 vs. ExPASy TrEMBL
Match: A0A6J1BW58 (pentatricopeptide repeat-containing protein At2g17670 OS=Momordica charantia OX=3673 GN=LOC111005895 PE=4 SV=1)

HSP 1 Score: 740.3 bits (1910), Expect = 5.2e-210
Identity = 390/467 (83.51%), Positives = 420/467 (89.94%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHS-PAAPPL-------SSKKPTPKLSRKTPSGQSSGHPE 105
           MGKLS SFRSA+ST +++KPP    AAPPL        SKK  PK SRK  S QSSG  E
Sbjct: 1   MGKLSPSFRSAISTTILNKPPQPLAAAPPLLSGEPRSLSKKLPPKPSRKIQSAQSSGPLE 60

Query: 106 KPKLPTVFKSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMS 165
           KPK  T+FKS+SLADAKKLY+SFI+TT+APLDLRF+NSLLQSYASIATLNDSISFLR+MS
Sbjct: 61  KPKTATLFKSSSLADAKKLYSSFIATTRAPLDLRFYNSLLQSYASIATLNDSISFLRYMS 120

Query: 166 KVQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSV 225
           KVQPSFSPD+STFHVLLSTSGNG   SLAS++QILNFMV+NGF+PDKVTTD+AVRSLCS 
Sbjct: 121 KVQPSFSPDRSTFHVLLSTSGNGSGSSLASVQQILNFMVSNGFNPDKVTTDIAVRSLCSA 180

Query: 226 GLVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVT 285
           GL+DEAVELVKELS+K SPPDS+TYNHLVKQLCKSRALSTVY FI EMR+S G+KPDLVT
Sbjct: 181 GLIDEAVELVKELSRKQSPPDSFTYNHLVKQLCKSRALSTVYGFIDEMRSSFGSKPDLVT 240

Query: 286 FTILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMK 345
           +TILIDNVCN KNLREA RL+S+L +EGFKPDCFVYNTIMKGYCM+GRGSEAI VYKKMK
Sbjct: 241 YTILIDNVCNGKNLREATRLISVLGEEGFKPDCFVYNTIMKGYCMLGRGSEAIGVYKKMK 300

Query: 346 EVGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDAL 405
           E GLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMC EGDAL
Sbjct: 301 EEGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCREGDAL 360

Query: 406 RALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFV 465
            ALSLLE MEAKGCSPNSCTYNTLLHGL+KSRLLDR IELYGLMKS  MKLETASYAT V
Sbjct: 361 GALSLLEVMEAKGCSPNSCTYNTLLHGLAKSRLLDRGIELYGLMKSGGMKLETASYATLV 420

Query: 466 RVLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQ 505
           R LCRS RIAEAYEVFDYAVESKS+TDVAAYSTLE+TLKSLKK REQ
Sbjct: 421 RALCRSDRIAEAYEVFDYAVESKSMTDVAAYSTLESTLKSLKKVREQ 467

BLAST of CaUC01G020330 vs. TAIR 10
Match: AT2G17670.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 553.9 bits (1426), Expect = 1.3e-157
Identity = 278/462 (60.17%), Positives = 355/462 (76.84%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTVF 105
           MGK+ +SFRS  +  +V K   SP APP   +  T          Q++  P +P L   F
Sbjct: 1   MGKVPSSFRSMPANLLVRKTTPSPPAPPRDFRNRTAVGGDSAKLPQNTQAPREPSLRNPF 60

Query: 106 KSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 165
           KS +L+DAK L+ S  +T++ PLDL+FHNS+LQSY SIA +ND++   +H+ K QP+F P
Sbjct: 61  KSPNLSDAKSLFNSIAATSRIPLDLKFHNSVLQSYGSIAVVNDTVKLFQHILKSQPNFRP 120

Query: 166 DQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE 225
            +STF +LLS +   PD S++++ ++LN MV NG +PD+VTTD+AVRSLC  G VDEA +
Sbjct: 121 GRSTFLILLSHACRAPDSSISNVHRVLNLMVNNGLEPDQVTTDIAVRSLCETGRVDEAKD 180

Query: 226 LVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNV 285
           L+KEL++KHSPPD+YTYN L+K LCK + L  VY+F+ EMR     KPDLV+FTILIDNV
Sbjct: 181 LMKELTEKHSPPDTYTYNFLLKHLCKCKDLHVVYEFVDEMRDDFDVKPDLVSFTILIDNV 240

Query: 286 CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDV 345
           CNSKNLREAM LVS L   GFKPDCF+YNTIMKG+C + +GSEA+ VYKKMKE G+EPD 
Sbjct: 241 CNSKNLREAMYLVSKLGNAGFKPDCFLYNTIMKGFCTLSKGSEAVGVYKKMKEEGVEPDQ 300

Query: 346 VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEE 405
           +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMC +G++L ALSLLEE
Sbjct: 301 ITYNTLIFGLSKAGRVEEARMYLKTMVDAGYEPDTATYTSLMNGMCRKGESLGALSLLEE 360

Query: 406 MEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGR 465
           MEA+GC+PN CTYNTLLHGL K+RL+D+ +ELY +MKSS +KLE+  YAT VR L +SG+
Sbjct: 361 MEARGCAPNDCTYNTLLHGLCKARLMDKGMELYEMMKSSGVKLESNGYATLVRSLVKSGK 420

Query: 466 IAEAYEVFDYAVESKSLTDVAAYSTLETTLKSLKKAREQASI 508
           +AEAYEVFDYAV+SKSL+D +AYSTLETTLK LKKA+EQ  +
Sbjct: 421 VAEAYEVFDYAVDSKSLSDASAYSTLETTLKWLKKAKEQGLV 462

BLAST of CaUC01G020330 vs. TAIR 10
Match: AT2G17670.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 399.8 bits (1026), Expect = 3.2e-111
Identity = 199/349 (57.02%), Positives = 257/349 (73.64%), Query Frame = 0

Query: 46  MGKLSTSFRSALSTAVVSKPPHSPAAPPLSSKKPTPKLSRKTPSGQSSGHPEKPKLPTVF 105
           MGK+ +SFRS  +  +V K   SP APP   +  T          Q++  P +P L   F
Sbjct: 1   MGKVPSSFRSMPANLLVRKTTPSPPAPPRDFRNRTAVGGDSAKLPQNTQAPREPSLRNPF 60

Query: 106 KSASLADAKKLYTSFISTTKAPLDLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSP 165
           KS +L+DAK L+ S  +T++ PLDL+FHNS+LQSY SIA +ND++   +H+ K QP+F P
Sbjct: 61  KSPNLSDAKSLFNSIAATSRIPLDLKFHNSVLQSYGSIAVVNDTVKLFQHILKSQPNFRP 120

Query: 166 DQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVE 225
            +STF +LLS +   PD S++++ ++LN MV NG +PD+VTTD+AVRSLC  G VDEA +
Sbjct: 121 GRSTFLILLSHACRAPDSSISNVHRVLNLMVNNGLEPDQVTTDIAVRSLCETGRVDEAKD 180

Query: 226 LVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNV 285
           L+KEL++KHSPPD+YTYN L+K LCK + L  VY+F+ EMR     KPDLV+FTILIDNV
Sbjct: 181 LMKELTEKHSPPDTYTYNFLLKHLCKCKDLHVVYEFVDEMRDDFDVKPDLVSFTILIDNV 240

Query: 286 CNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDV 345
           CNSKNLREAM LVS L   GFKPDCF+YNTIMKG+C + +GSEA+ VYKKMKE G+EPD 
Sbjct: 241 CNSKNLREAMYLVSKLGNAGFKPDCFLYNTIMKGFCTLSKGSEAVGVYKKMKEEGVEPDQ 300

Query: 346 VTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGMCCEG 395
           +T+NTLIFGLSK+GRV+EAR +L  M + G+ PD  TYTSLMNGMC +G
Sbjct: 301 ITYNTLIFGLSKAGRVEEARMYLKTMVDAGYEPDTATYTSLMNGMCRKG 349

BLAST of CaUC01G020330 vs. TAIR 10
Match: AT1G09900.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 189.5 bits (480), Expect = 6.6e-48
Identity = 102/346 (29.48%), Positives = 193/346 (55.78%), Query Frame = 0

Query: 129 DLRFHNSLLQSYASIATLNDSISFLRHMSKVQPSFSPDQSTFHVLL-STSGNGPDCSLAS 188
           D+  +N ++  Y     +N+++S L  M     S SPD  T++ +L S   +G    L  
Sbjct: 171 DVITYNVMISGYCKAGEINNALSVLDRM-----SVSPDVVTYNTILRSLCDSG---KLKQ 230

Query: 189 IRQILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVELVKELSQKHSPPDSYTYNHLVK 248
             ++L+ M+     PD +T  + + + C    V  A++L+ E+  +   PD  TYN LV 
Sbjct: 231 AMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVN 290

Query: 249 QLCKSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNVCNSKNLREAMRLVSLLYKEGFK 308
            +CK   L     F+ +M +S G +P+++T  I++ ++C++    +A +L++ + ++GF 
Sbjct: 291 GICKEGRLDEAIKFLNDMPSS-GCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFS 350

Query: 309 PDCFVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDVVTFNTLIFGLSKSGRVKEARKF 368
           P    +N ++   C  G    AI++ +KM + G +P+ +++N L+ G  K  ++  A ++
Sbjct: 351 PSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEY 410

Query: 369 LDIMAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEEMEAKGCSPNSCTYNTLLHGLSK 428
           L+ M   G +PD VTY +++  +C +G    A+ +L ++ +KGCSP   TYNT++ GL+K
Sbjct: 411 LERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAK 470

Query: 429 SRLLDRAIELYGLMKSSDMKLETASYATFVRVLCRSGRIAEAYEVF 474
           +    +AI+L   M++ D+K +T +Y++ V  L R G++ EA + F
Sbjct: 471 AGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFF 507

BLAST of CaUC01G020330 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 188.0 bits (476), Expect = 1.9e-47
Identity = 106/339 (31.27%), Positives = 184/339 (54.28%), Query Frame = 0

Query: 134 NSLLQSYASIATLNDSISFLRHMSKVQPSFSPDQSTFHVLLSTSGNGPDCSLASIR---Q 193
           N ++  +     + D+++F++ MS  Q  F PDQ TF+ L+    NG  C    ++   +
Sbjct: 263 NVIVHGFCKEGRVEDALNFIQEMSN-QDGFFPDQYTFNTLV----NGL-CKAGHVKHAIE 322

Query: 194 ILNFMVTNGFDPDKVTTDLAVRSLCSVGLVDEAVELVKELSQKHSPPDSYTYNHLVKQLC 253
           I++ M+  G+DPD  T +  +  LC +G V EAVE++ ++  +   P++ TYN L+  LC
Sbjct: 323 IMDVMLQEGYDPDVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLC 382

Query: 254 KSRALSTVYDFIVEMRTSCGAKPDLVTFTILIDNVCNSKNLREAMRLVSLLYKEGFKPDC 313
           K   +    + +  + TS G  PD+ TF  LI  +C ++N R AM L   +  +G +PD 
Sbjct: 383 KENQVEEATE-LARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDE 442

Query: 314 FVYNTIMKGYCMIGRGSEAIEVYKKMKEVGLEPDVVTFNTLIFGLSKSGRVKEARKFLDI 373
           F YN ++   C  G+  EA+ + K+M+  G    V+T+NTLI G  K+ + +EA +  D 
Sbjct: 443 FTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDE 502

Query: 374 MAEMGHFPDAVTYTSLMNGMCCEGDALRALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRL 433
           M   G   ++VTY +L++G+C       A  L+++M  +G  P+  TYN+LL    +   
Sbjct: 503 MEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGD 562

Query: 434 LDRAIELYGLMKSSDMKLETASYATFVRVLCRSGRIAEA 470
           + +A ++   M S+  + +  +Y T +  LC++GR+  A
Sbjct: 563 IKKAADIVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVA 594

BLAST of CaUC01G020330 vs. TAIR 10
Match: AT1G12620.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 184.5 bits (467), Expect = 2.1e-46
Identity = 104/343 (30.32%), Positives = 181/343 (52.77%), Query Frame = 0

Query: 149 SISFLRHMSKVQPSFSPDQSTFHVLLSTSGNGPDCSLASIRQILNFMVTNGFDPDKVTTD 208
           S++F      ++  + PD  TF  L+  +G   +  ++   ++++ MV  G  P  +T +
Sbjct: 124 SLAFSAMGKIIKLGYEPDTVTFSTLI--NGLCLEGRVSEALELVDRMVEMGHKPTLITLN 183

Query: 209 LAVRSLCSVGLVDEAVELVKELSQKHSPPDSYTYNHLVKQLCKSRALSTVYDFIVEMRTS 268
             V  LC  G V +AV L+  + +    P+  TY  ++K +CKS   +   + + +M   
Sbjct: 184 ALVNGLCLNGKVSDAVLLIDRMVETGFQPNEVTYGPVLKVMCKSGQTALAMELLRKMEER 243

Query: 269 CGAKPDLVTFTILIDNVCNSKNLREAMRLVSLLYKEGFKPDCFVYNTIMKGYCMIGRGSE 328
              K D V ++I+ID +C   +L  A  L + +  +GFK D  +Y T+++G+C  GR  +
Sbjct: 244 -KIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIIIYTTLIRGFCYAGRWDD 303

Query: 329 AIEVYKKMKEVGLEPDVVTFNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMN 388
             ++ + M +  + PDVV F+ LI    K G+++EA +    M + G  PD VTYTSL++
Sbjct: 304 GAKLLRDMIKRKITPDVVAFSALIDCFVKEGKLREAEELHKEMIQRGISPDTVTYTSLID 363

Query: 389 GMCCEGDALRALSLLEEMEAKGCSPNSCTYNTLLHGLSKSRLLDRAIELYGLMKSSDMKL 448
           G C E    +A  +L+ M +KGC PN  T+N L++G  K+ L+D  +EL+  M    +  
Sbjct: 364 GFCKENQLDKANHMLDLMVSKGCGPNIRTFNILINGYCKANLIDDGLELFRKMSLRGVVA 423

Query: 449 ETASYATFVRVLCRSGRIAEAYEVFDYAVESKSLTDVAAYSTL 492
           +T +Y T ++  C  G++  A E+F   V  +   D+ +Y  L
Sbjct: 424 DTVTYNTLIQGFCELGKLEVAKELFQEMVSRRVRPDIVSYKIL 463

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038883848.1 | 8.2e-242 | 94.35 | pentatricopeptide repeat-containing protein At2g17670 [Benincasa hispida]
XP_004135005.1 | 5.5e-230 | 89.80 | pentatricopeptide repeat-containing protein At2g17670 [Cucumis sativus] >KGN4897...
XP_016899201.1 | 4.1e-225 | 88.91 | PREDICTED: pentatricopeptide repeat-containing protein At2g17670 [Cucumis melo]
XP_023518399.1 | 7.4e-219 | 84.65 | pentatricopeptide repeat-containing protein At2g17670-like [Cucurbita pepo subsp...
XP_022926982.1 | 2.2e-218 | 84.86 | pentatricopeptide repeat-containing protein At2g17670 [Cucurbita moschata] >XP_0...

Match Name | E-value | Identity | Description
Q84J71 | 1.9e-156 | 60.17 | Pentatricopeptide repeat-containing protein At2g17670 OS=Arabidopsis thaliana OX...
Q3EDF8 | 9.3e-47 | 29.48 | Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX...
Q9LFF1 | 2.7e-46 | 31.27 | Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Q9ASZ8 | 3.0e-45 | 30.32 | Pentatricopeptide repeat-containing protein At1g12620 OS=Arabidopsis thaliana OX...
Q0WVK7 | 6.7e-45 | 33.79 | Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop...

Match Name | E-value | Identity | Description
A0A0A0KHF8 | 2.7e-230 | 89.80 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G507510 PE=4 SV=1
A0A1S4DT84 | 2.0e-225 | 88.91 | pentatricopeptide repeat-containing protein At2g17670 OS=Cucumis melo OX=3656 GN...
A0A6J1EGP9 | 1.0e-218 | 84.86 | pentatricopeptide repeat-containing protein At2g17670 OS=Cucurbita moschata OX=3...
A0A6J1KT16 | 3.4e-217 | 83.80 | pentatricopeptide repeat-containing protein At2g17670-like OS=Cucurbita maxima O...
A0A6J1BW58 | 5.2e-210 | 83.51 | pentatricopeptide repeat-containing protein At2g17670 OS=Momordica charantia OX=...

Match Name | E-value | Identity | Description
AT2G17670.1 | 1.3e-157 | 60.17 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G17670.2 | 3.2e-111 | 57.02 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G09900.1 | 6.6e-48 | 29.48 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT3G53700.1 | 1.9e-47 | 31.27 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G12620.1 | 2.1e-46 | 30.32 | Pentatricopeptide repeat (PPR) superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 373..506; e-value: 4.9E-32; score: 113.5
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 103..300; e-value: 1.0E-26; score: 96.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 301..372; e-value: 1.2E-22; score: 82.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 452..474; e-value: 0.12; score: 12.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 211..232; e-value: 0.41; score: 11.0
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 381..415; e-value: 2.3E-9; score: 34.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 312..345; e-value: 6.0E-10; score: 36.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 416..447; e-value: 3.9E-7; score: 27.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 276..310; e-value: 2.3E-5; score: 22.2
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 452..480; e-value: 7.6E-4; score: 17.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 346..380; e-value: 6.6E-7; score: 27.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 378..427; e-value: 9.0E-15; score: 54.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 237..287; e-value: 1.4E-9; score: 38.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 308..357; e-value: 3.7E-15; score: 55.9
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 344..378; score: 12.24383
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 379..413; score: 12.539784
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 449..483; score: 8.604678
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 274..308; score: 10.402331
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 203..237; score: 8.944478
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 414..448; score: 10.928473
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 309..343; score: 13.109773
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..44
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 58..103
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 80..96
None | No IPR available | PANTHER | PTHR47933 | PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 1, MITOCHONDRIAL | coord: 46..505
None | No IPR available | PANTHER | PTHR47933:SF34 | PPR CONTAINING PLANT-LIKE PROTEIN | coord: 46..505
None | No IPR available | SUPERFAMILY | 81901 | HCP-like | coord: 289..478
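
The PROSITE PS51375 hits listed above are reported out of positional order, which makes it hard to see how the repeats are arranged along the protein. The sketch below sorts those seven coordinate pairs and merges directly adjacent ones into runs. It is an illustration only: the coordinates are copied from the PS51375 rows, and the function name `merge_adjacent` is a hypothetical helper, not part of InterProScan.

```python
# PROSITE PS51375 (PPR) coordinates, copied from the InterPro annotation rows.
PPR_HITS = [(344, 378), (379, 413), (449, 483), (274, 308),
            (203, 237), (414, 448), (309, 343)]

def merge_adjacent(hits):
    """Sort (start, end) pairs and merge those that abut or overlap into runs."""
    runs = []
    for start, end in sorted(hits):
        if runs and start <= runs[-1][1] + 1:
            # This hit begins right after (or inside) the previous run: extend it.
            runs[-1] = (runs[-1][0], max(runs[-1][1], end))
        else:
            runs.append((start, end))
    return runs

print(merge_adjacent(PPR_HITS))
# [(203, 237), (274, 483)]
```

Read this way, the PROSITE profile places one isolated 35-residue repeat at 203..237 and six back-to-back 35-residue repeats spanning 274..483.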

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CaUC01G020330.1 | CaUC01G020330.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding