CmaCh04G001800 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh04G001800
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr04: 874653 .. 878353 (+)
RNA-Seq Expression: CmaCh04G001800
Synteny: CmaCh04G001800
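
The location string in the overview can be turned into a genomic span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (common for GFF-style annotations, but not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cma_Chr04: 874653 .. 878353 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)  # Cma_Chr04 + 3701
```

Under that assumption the gene region spans 3701 bp on the plus strand of Cma_Chr04.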
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTGCTTACCGTGGAAGCTCAACTGGGTTCGATGCCCTCGTGCCGAAGGTAAGCAGGAAACTTCCAACTTCGCCAATCTCGTTTCAACAATATCTAAATGTTTAATCCCATAATTTTCCCCTTTCCCTTGAAGAAAGTTTGCATTTACAATTACAACAAATTGGCATTTAGAGCTGCCAGTGTCAAGTGTGTCCACAAGCAAGCTGCGCAGTCGCTTACAAGTTCCACCACAGCTGAGAGGTGTTCTTGTTTTTCTTCTCTAATATTGCTTTCTGATTTTGCTTTCAACTCCATAACTTTGATTTTGATGTTGCTTTGATTAAATATAGACGTATTGTTAAGAAGAAGGTTGGGAAGGAGGCCCACCATTTATGGAAGAAAAGAGATTCTGCTGGCTCTGGCCAAAAAGCTCTGAATCTTGTTAGAATTGTAAGCTGCATTTCGTTGAATCTTCTCTCTTAGTGCCTTTCTAATATTGAGAGGAAACAATGGGAATATTTAACATCTTGTAGGTTTCCCAATGCCCCAATGAGAAAGAAGCTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACGGAGTTTCCATTGATTGCAGCTTCTAAAGCTTTAAGAATACTGAGGAAGAGAAGTCAATGGAAGCGTGTCATTCAAGTATGAATAAGTATACTCCTACTCATTGTCCTAAATTTTTATGTATGTATTTAATATACTTAATAATTGTTTGTCTTACAAAATGTTGGACTTATATTGTCTTTGTACAATTAGTGTCTAGTGTCACGGGGTTGTGACATCTAGCTCATAGCTATTGTGCTAACAAGTGTTCGATATGTGTCCAACAAGTGTTGGAATGTCCGATTGTTAGTTTTGCACTTATTTGATAAGTTGTGGTTTGTTCAGTTTTTATATTGAAGTTTGATCTTGAAACTTGGTCATTTAGACTTCTTTGTGCTTAAAATATGACTCAAGAAACAAGACAACACTTCGGATGTGGTTGATACATCAAATTTTATTATTCCTGTTTTAGATTTAGCTTTATGGACAGAGAACTCATGGTTGTTGGTTCCATAATTGGGAATGATGAAACAACAATGAGTTCTTATCTTTGAGAAAAATGTTTGTCGAAATATAACACCGAACTACTTCTGGCATTGGAAACCTCAGTGTTGGACTTGCAAATTACTTGTGCGATTGGCATGTGAAAGGATCATCTCTGATTAGCCTAGTTCGGTAACATGGGCTCGTGACCTGAGTGGGGGTTCTCACTGGTTTGAATGGGTGGAGGGGTTGACGTTGGAACCCTTCGATTAGGGAAGATTCAAATTTAAAAAGATAATCTCAGATTAACAAGAAAGGTTGGAAGAAGAAAATCTGTTGTCTTTTGCCTATGTTGGTTGCTTAAAGTTCTTCCTTTTCCGTTGTAAAGTTTGAGCTTATAAATATTTATGAAGAGGTTTCTGTATACAATCAATGCTTCTTATTACTCAGGTGGCAAAGTGGATGCTAAGCAAGGGTCAAGGTGCCACAATGGGTACCTATGACACTCTTCTTCTGGCATTTGATATGGACAAGAGGGTGGATGAGGCCGAATCTTTATGGAACATGATTTTGCATGCACATACTCGTTCCATCTCTAAACGACTGTTTTCTAGGATGATCGCTTTGTATGACCATCATGACTTGCAAGATAAGATTATTGAGGTATTGGGAATGGGGACTCTCACATCTCGCTTGTACCACCCTATGCAATCTTCTCTTCATACCATATCAATGACCTGCTTCAATTTTTTCCATCTCTTGGTACAGATATTTGCAGACATGGAGGAGTTGGGAGTAAGGCCAGATGAAGACACAGTTAGAAGAGTAGCCCACGCCTTTCGAAAACTAGGTCAAGAAGAAAACGGGAAAACGGTCTATAAAAGATATGGCTGCAAATGGAAATACATACATTTCAAGAGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGAA
GATGATAAATGAACCAAACTGAAAGAAAAGATTTGTGGCTCAAACTATTGAAGGGCAGAGACTCCTGAACTTCAACATGGATTAGCTCACAAGGTATTTATCTACCAATTGCCGTTAATTATATGCGTTCTATTGGCAGTCTGGAAGTTAATTTGCAGTCTATTTTTCTCCTCTGTTCTATTTGAACATCTTAGCAGTAGTAACAGAGATAATGCAAACACTGTAATATATTAATATGATATGAAAAAGGTAGTGTATATGTTAATCAAGGAATTTATCTATCACGTTTGGAGCAAGTTTGTTTTCTAGAACCATGGTTCTCTTTTAGCTTGAGGTATTGTAAGAGCACAAGCCCACCGCTAGCAAATGTTGTCCTCTTTGGGCTTTCCCTTTCGAGCTTCCCTTCAAGGTTTCTAAAACGCGTTTGCTAGGAAGAGGTTTCCACACCCTTATAAAAAATGTTTCGTTCTCCTCCTCAACCATTTTACGGGGCCCAGCGTCCTCGCTAACATTCGTTTCCTTCTCTAATCGACGTGGGACTCCTCAATCCACCCCTCTTCAGGGCCAAGCGTCCTTGCTGGCGCATTGCCTTGTGCCCACCCCCTTTCAAGGCTCAGCCTCATCGCCGGCACATCGTCCAGTGTCTAGCTTTGATACTATGTGTAACATCTCAAGCCCACCGCTAGTAGATATTGTCTTCTTTGGGCTTTCCCTTTCGGATTTCTCCTCAAGGTTTTTAAAACGCATCTCTGCTAAGGAGAGATTTTCACACTCTTATAAAAAATGTTTCGTTCTCTTCCCTAACTAATGTAGGACCTCACAGGTATATAGAACAAAGCAGCTAACGGAGTCAGAATCAAGTAGATTTCATGGGGATGAACTACTCGTTTGAGGAATCAGATAAGATTTGCACATCTTTTGCTTTTGCCTACGTATGGAAACAGTAGAAAAGGTATAGTGAAGGTTGAGTAAATTTCACATCAAGCTTTAAATCTTCATTCTGAAGCATCAATTGTAGACAAGCTTTTGGTCAAATCACTGCTAGGATCATGGGAAAAAAAAAAAGAAAGAAAAGTGGAGTGGGTATAGTTAGGCAAGATAGATTTGAACAATCCCAACTAACTAAATAGATGACATAATATCATACTTTTGTTGGTCGTGAAGAGTAGATATTATGGTTGAGTTGCACCAAGTAGTGTGAATAGCCTTTAAGGACAGCAGCAGAAGATTTTGATACAGTCTTAATTATATAAACTAGATTTTGATGTTCTTGCAATGCTTTACTGCATAATTGTTCATTAGTTTAGATCCATTCCATGAAATTTGAAACAACATCCTAGATCCAGATTGAAATGGATGCCATTCCTGTTTAAGTACTGAGGACAGTCCCATTTCTCCGACAAAAAACTATTCATAAAACATACGGTTGGCATGGACAGATGGTATGGGCATGACCACCTACCTAACATTTAATATTCTGCAAGTTTCTTAGCAACTTAGGCAATTATCTAAGATTGATAGAAGTATGCACAAAGTATCTCGAACACTCACTAGAAGTTAGTATCATTGAGATTAGTCGTGAGGCTCATCGGTTTTTGTCTCATTAACCACTGAACTATGCCTAGATTAACAAACGAGTGTATCTTTATTAGTCCCTATCTTTAGAATAGTGTTCGACTTTAATTTGTAATGTAGTCTTCC

mRNA sequence

ATGCTTGCTTACCGTGGAAGCTCAACTGGGTTCGATGCCCTCGTGCCGAAGAAAGTTTGCATTTACAATTACAACAAATTGGCATTTAGAGCTGCCAGTGTCAAGTGTGTCCACAAGCAAGCTGCGCAGTCGCTTACAAGTTCCACCACAGCTGAGAGACGTATTGTTAAGAAGAAGGTTGGGAAGGAGGCCCACCATTTATGGAAGAAAAGAGATTCTGCTGGCTCTGGCCAAAAAGCTCTGAATCTTGTTAGAATTGTTTCCCAATGCCCCAATGAGAAAGAAGCTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACGGAGTTTCCATTGATTGCAGCTTCTAAAGCTTTAAGAATACTGAGGAAGAGAAGTCAATGGAAGCGTGTCATTCAAGTGGCAAAGTGGATGCTAAGCAAGGGTCAAGGTGCCACAATGGGTACCTATGACACTCTTCTTCTGGCATTTGATATGGACAAGAGGGTGGATGAGGCCGAATCTTTATGGAACATGATTTTGCATGCACATACTCGTTCCATCTCTAAACGACTGTTTTCTAGGATGATCGCTTTGTATGACCATCATGACTTGCAAGATAAGATTATTGAGATATTTGCAGACATGGAGGAGTTGGGAGTAAGGCCAGATGAAGACACAGTTAGAAGAGTAGCCCACGCCTTTCGAAAACTAGGTCAAGAAGAAAACGGGAAAACGGTCTATAAAAGATATGGCTGCAAATGGAAATACATACATTTCAAGAGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGAAGATGATAAATGAACCAAACTGAAAGAAAAGATTTGTGGCTCAAACTATTGAAGGGCAGAGACTCCTGAACTTCAACATGGATTAGCTCACAAGGTATATAGAACAAAGCAGCTAACGGAGTCAGAATCAAGTAGATTTCATGGGGATGAACTACTCGTTTGAGGAATCAGATAAGATTTGCACATCTTTTGCTTTTGCCTACGTATGGAAACAGTAGAAAAGGTATAGTGAAGGTTGAGTAAATTTCACATCAAGCTTTAAATCTTCATTCTGAAGCATCAATTGTAGACAAGCTTTTGGTCAAATCACTGCTAGGATCATGGGAAAAAAAAAAAGAAAGAAAAGTGGAGTGGGTATAGTTAGGCAAGATAGATTTGAACAATCCCAACTAACTAAATAGATGACATAATATCATACTTTTGTTGGTCGTGAAGAGTAGATATTATGGTTGAGTTGCACCAAGTAGTGTGAATAGCCTTTAAGGACAGCAGCAGAAGATTTTGATACAGTCTTAATTATATAAACTAGATTTTGATGTTCTTGCAATGCTTTACTGCATAATTGTTCATTAGTTTAGATCCATTCCATGAAATTTGAAACAACATCCTAGATCCAGATTGAAATGGATGCCATTCCTGTTTAAGTACTGAGGACAGTCCCATTTCTCCGACAAAAAACTATTCATAAAACATACGGTTGGCATGGACAGATGGTATGGGCATGACCACCTACCTAACATTTAATATTCTGCAAGTTTCTTAGCAACTTAGGCAATTATCTAAGATTGATAGAAGTATGCACAAAGTATCTCGAACACTCACTAGAAGTTAGTATCATTGAGATTAGTCGTGAGGCTCATCGGTTTTTGTCTCATTAACCACTGAACTATGCCTAGATTAACAAACGAGTGTATCTTTATTAGTCCCTATCTTTAGAATAGTGTTCGACTTTAATTTGTAATGTAGTCTTCC
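
The genomic and mRNA sequences above start identically and then diverge where the first intron begins. As a rough sketch (not the annotation pipeline's method), the end of exon 1 can be estimated as the longest prefix shared by the two sequences; only the first 68 bases of each sequence are copied here, and the function name is hypothetical.

```python
def shared_prefix_length(a, b):
    """Return the number of leading characters that a and b have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

# First 68 bases of the gene sequence (with intron) and of the mRNA sequence.
gene_start = "ATGCTTGCTTACCGTGGAAGCTCAACTGGGTTCGATGCCCTCGTGCCGAAGGTAAGCAGGAAACTTCC"
mrna_start = "ATGCTTGCTTACCGTGGAAGCTCAACTGGGTTCGATGCCCTCGTGCCGAAGAAAGTTTGCATTTACAA"

exon1_len = shared_prefix_length(gene_start, mrna_start)
donor = gene_start[exon1_len:exon1_len + 2]
print(exon1_len, donor)  # 51 GT
```

The shared prefix is 51 bases (17 codons), and the genomic sequence continues with GT, which is consistent with a canonical GT donor site at the start of the first intron. A prefix comparison alone cannot rule out a boundary shifted by a base or two, so this is an estimate rather than a definitive exon coordinate.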

Coding sequence (CDS)

ATGCTTGCTTACCGTGGAAGCTCAACTGGGTTCGATGCCCTCGTGCCGAAGAAAGTTTGCATTTACAATTACAACAAATTGGCATTTAGAGCTGCCAGTGTCAAGTGTGTCCACAAGCAAGCTGCGCAGTCGCTTACAAGTTCCACCACAGCTGAGAGACGTATTGTTAAGAAGAAGGTTGGGAAGGAGGCCCACCATTTATGGAAGAAAAGAGATTCTGCTGGCTCTGGCCAAAAAGCTCTGAATCTTGTTAGAATTGTTTCCCAATGCCCCAATGAGAAAGAAGCTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACGGAGTTTCCATTGATTGCAGCTTCTAAAGCTTTAAGAATACTGAGGAAGAGAAGTCAATGGAAGCGTGTCATTCAAGTGGCAAAGTGGATGCTAAGCAAGGGTCAAGGTGCCACAATGGGTACCTATGACACTCTTCTTCTGGCATTTGATATGGACAAGAGGGTGGATGAGGCCGAATCTTTATGGAACATGATTTTGCATGCACATACTCGTTCCATCTCTAAACGACTGTTTTCTAGGATGATCGCTTTGTATGACCATCATGACTTGCAAGATAAGATTATTGAGATATTTGCAGACATGGAGGAGTTGGGAGTAAGGCCAGATGAAGACACAGTTAGAAGAGTAGCCCACGCCTTTCGAAAACTAGGTCAAGAAGAAAACGGGAAAACGGTCTATAAAAGATATGGCTGCAAATGGAAATACATACATTTCAAGAGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGAAGATGATAAATGA

Protein sequence

MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKVGKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKTVYKRYGCKWKYIHFKSERVRVRRDGWDEDDK
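
The protein sequence corresponds to a translation of the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and, to stay short, translates only the first 30 and last 12 bases of the CDS shown above; `translate` is an illustrative helper, not a tool used by the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCTTGCTTACCGTGGAAGCTCAACTGGG"  # first 30 bases of the CDS
cds_end = "GATGATAAATGA"                      # last 12 bases of the CDS

print(translate(cds_start))  # MLAYRGSSTG
print(translate(cds_end))    # DDK
```

Both fragments agree with the protein sequence, which begins with MLAYRGSSTG and ends with DDK before the TGA stop codon.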
Homology
BLAST of CmaCh04G001800 vs. ExPASy Swiss-Prot
Match: Q2V3H0 (Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g18975 PE=2 SV=2)

HSP 1 Score: 331.3 bits (848), Expect = 1.1e-89
Identity = 160/206 (77.67%), Positives = 181/206 (87.86%), Query Frame = 0

Query: 58  KKVGKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAS 117
           KKVGK+ HHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAA+
Sbjct: 76  KKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAA 135

Query: 118 KALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHA 177
           KAL+ILRKRSQW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH 
Sbjct: 136 KALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHT 195

Query: 178 HTRSISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEEN 237
           HTRSI +RLF+RMIALY HHDL DK+IE+FADMEEL V PDED+ RRVA AFR+L QEEN
Sbjct: 196 HTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEEN 255

Query: 238 GKTVYKRYGCKWKYIHFKSERVRVRR 264
            K + +RY  ++KYI+F  ERVRV+R
Sbjct: 256 RKLILRRYLSEYKYIYFNGERVRVKR 281
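
The identity and positives percentages reported for each HSP are ratios over the aligned length. The sketch below recomputes them for the hit above and also counts matches from the midline of its last alignment row, assuming the usual BLAST convention that a letter on the midline marks an identical residue and `+` marks a conservative substitution. The function names are illustrative.

```python
def percent(matches, aligned_length):
    """Percentage of aligned positions that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

def midline_counts(midline):
    """Return (identities, positives) for one BLAST midline string."""
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    return identities, identities + conservative

# Totals reported for the HSP: 160 identities and 181 positives over 206 columns.
print(percent(160, 206))  # 77.67
print(percent(181, 206))  # 87.86

# Midline of the final row (query residues 238 to 263), copied with its spacing.
last_row_midline = " K + +RY  ++KYI+F  ERVRV+R"
print(midline_counts(last_row_midline))  # (13, 19)
```

Summing the per-row counts over all rows of an HSP should reproduce the reported totals, provided the midline spacing has survived copying intact.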

BLAST of CmaCh04G001800 vs. ExPASy Swiss-Prot
Match: Q8LG95 (Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana OX=3702 GN=EMB1417 PE=2 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 2.5e-43
Identity = 87/197 (44.16%), Positives = 123/197 (62.44%), Query Frame = 0

Query: 67  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKALRILRKR 126
           +WK R   G+  KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL   
Sbjct: 45  VWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDE 104

Query: 127 SQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRL 186
            +WK++IQV KWMLSKGQG TMGTY +LL A   D R+DEAE LWN +   H     ++ 
Sbjct: 105 KEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKF 164

Query: 187 FSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKTVYKRY- 246
           F++MI++Y   D+  K+ E+FADMEELGV+P+   V  V   F KL  ++  + + K+Y 
Sbjct: 165 FNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYP 224

Query: 247 GCKWKYIHFKSERVRVR 263
             +W++ + K  RV+V+
Sbjct: 225 PPQWEFRYIKGRRVKVK 241

BLAST of CmaCh04G001800 vs. ExPASy TrEMBL
Match: A0A6J1K6G2 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111491588 PE=4 SV=1)

HSP 1 Score: 546.2 bits (1406), Expect = 7.7e-152
Identity = 271/271 (100.00%), Positives = 271/271 (100.00%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV
Sbjct: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT
Sbjct: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK
Sbjct: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 271

BLAST of CmaCh04G001800 vs. ExPASy TrEMBL
Match: A0A6J1FWG3 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111447531 PE=4 SV=1)

HSP 1 Score: 539.7 bits (1389), Expect = 7.2e-150
Identity = 267/271 (98.52%), Positives = 268/271 (98.89%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLT STTAERRIVKKKV
Sbjct: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTGSTTAERRIVKKKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKRLFSRMI+LYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGK 
Sbjct: 181 SISKRLFSRMISLYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKM 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRYGCKWKYIHFK ERVRVRRDGWDEDDK
Sbjct: 241 VYKRYGCKWKYIHFKGERVRVRRDGWDEDDK 271

BLAST of CmaCh04G001800 vs. ExPASy TrEMBL
Match: A0A6J1CC95 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010251 PE=4 SV=1)

HSP 1 Score: 479.6 bits (1233), Expect = 8.9e-132
Identity = 237/271 (87.45%), Positives = 249/271 (91.88%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           M  YRG+S GFD LVPKK CI N+NKL FRAAS KC H QA Q  TSS T ERRIV KKV
Sbjct: 1   MSVYRGTSIGFDTLVPKKDCICNFNKLTFRAASFKCSHNQAVQPFTSSPTIERRIV-KKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYG LNKWIAWETEFPLIAA+KAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGALNKWIAWETEFPLIAAAKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYD+LLLAFDMDKRVDEAESLWNMILH HTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDSLLLAFDMDKRVDEAESLWNMILHTHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISK+LFSRMI+LYDHHDLQDK+IEIFADMEELGV+PDEDTVRRVA AF+KLGQEEN K 
Sbjct: 181 SISKQLFSRMISLYDHHDLQDKVIEIFADMEELGVKPDEDTVRRVARAFQKLGQEENRKM 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           V+KRYGCKWKYIHFK ERVRV+RDGWDEDD+
Sbjct: 241 VHKRYGCKWKYIHFKGERVRVKRDGWDEDDE 270

BLAST of CmaCh04G001800 vs. ExPASy TrEMBL
Match: A0A5A7UD21 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold174G00890 PE=4 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 5.7e-131
Identity = 237/271 (87.45%), Positives = 249/271 (91.88%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           ML   GSSTGFDALVPK  CIY +NK AFR ASV CVH QAAQ  TS TT ERR+V KKV
Sbjct: 1   MLVLHGSSTGFDALVPKIDCIYYHNKCAFRPASVICVHNQAAQPHTSFTTPERRVV-KKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAA+KAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKR+FSRMI+LY+HHDLQDKIIEIFADMEELGV+PDEDTVRR+  AF+KLGQEEN K 
Sbjct: 181 SISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRIGRAFQKLGQEENRKM 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRY C+WKYIHFK ERVRVR+DGWDEDD+
Sbjct: 241 VYKRYSCQWKYIHFKGERVRVRKDGWDEDDQ 270

BLAST of CmaCh04G001800 vs. ExPASy TrEMBL
Match: A0A1S3CP50 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103502655 PE=4 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 5.7e-131
Identity = 237/271 (87.45%), Positives = 249/271 (91.88%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           ML   GSSTGFDALVPK  CIY +NK AFR ASV CVH QAAQ  TS TT ERR+V KKV
Sbjct: 1   MLVLHGSSTGFDALVPKIDCIYYHNKCAFRPASVICVHNQAAQPHTSFTTPERRVV-KKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAA+KAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKR+FSRMI+LY+HHDLQDKIIEIFADMEELGV+PDEDTVRR+  AF+KLGQEEN K 
Sbjct: 181 SISKRVFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRIGRAFQKLGQEENRKM 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRY C+WKYIHFK ERVRVR+DGWDEDD+
Sbjct: 241 VYKRYSCQWKYIHFKGERVRVRKDGWDEDDQ 270

BLAST of CmaCh04G001800 vs. NCBI nr
Match: XP_022996325.1 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita maxima] >XP_022996334.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita maxima])

HSP 1 Score: 546.2 bits (1406), Expect = 1.6e-151
Identity = 271/271 (100.00%), Positives = 271/271 (100.00%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV
Sbjct: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT
Sbjct: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK
Sbjct: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 271

BLAST of CmaCh04G001800 vs. NCBI nr
Match: XP_023541595.1 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita pepo subsp. pepo] >XP_023541609.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 541.2 bits (1393), Expect = 5.1e-150
Identity = 268/271 (98.89%), Positives = 269/271 (99.26%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV
Sbjct: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKRLFSRMI+LYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGK 
Sbjct: 181 SISKRLFSRMISLYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKL 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRYGCKWKYIHFK ERVRVRRDGWDEDDK
Sbjct: 241 VYKRYGCKWKYIHFKGERVRVRRDGWDEDDK 271

BLAST of CmaCh04G001800 vs. NCBI nr
Match: XP_022942519.1 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita moschata] >XP_022942520.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita moschata])

HSP 1 Score: 539.7 bits (1389), Expect = 1.5e-149
Identity = 267/271 (98.52%), Positives = 268/271 (98.89%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLT STTAERRIVKKKV
Sbjct: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTGSTTAERRIVKKKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKRLFSRMI+LYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGK 
Sbjct: 181 SISKRLFSRMISLYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKM 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRYGCKWKYIHFK ERVRVRRDGWDEDDK
Sbjct: 241 VYKRYGCKWKYIHFKGERVRVRRDGWDEDDK 271

BLAST of CmaCh04G001800 vs. NCBI nr
Match: KAG6600024.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] >KAG7030693.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 538.9 bits (1387), Expect = 2.5e-149
Identity = 267/271 (98.52%), Positives = 268/271 (98.89%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV
Sbjct: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISKRLFSRMI+LYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQ ENGK 
Sbjct: 181 SISKRLFSRMISLYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQAENGKM 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           VYKRYGCKWKYIHFK ERVRVRRDGWDEDDK
Sbjct: 241 VYKRYGCKWKYIHFKGERVRVRRDGWDEDDK 271

BLAST of CmaCh04G001800 vs. NCBI nr
Match: XP_022139301.1 (pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Momordica charantia] >XP_022139302.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Momordica charantia])

HSP 1 Score: 479.6 bits (1233), Expect = 1.8e-131
Identity = 237/271 (87.45%), Positives = 249/271 (91.88%), Query Frame = 0

Query: 1   MLAYRGSSTGFDALVPKKVCIYNYNKLAFRAASVKCVHKQAAQSLTSSTTAERRIVKKKV 60
           M  YRG+S GFD LVPKK CI N+NKL FRAAS KC H QA Q  TSS T ERRIV KKV
Sbjct: 1   MSVYRGTSIGFDTLVPKKDCICNFNKLTFRAASFKCSHNQAVQPFTSSPTIERRIV-KKV 60

Query: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKAL 120
           GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYG LNKWIAWETEFPLIAA+KAL
Sbjct: 61  GKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGALNKWIAWETEFPLIAAAKAL 120

Query: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTR 180
           RILRKRSQWKRVIQVAKWMLSKGQGATMGTYD+LLLAFDMDKRVDEAESLWNMILH HTR
Sbjct: 121 RILRKRSQWKRVIQVAKWMLSKGQGATMGTYDSLLLAFDMDKRVDEAESLWNMILHTHTR 180

Query: 181 SISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKT 240
           SISK+LFSRMI+LYDHHDLQDK+IEIFADMEELGV+PDEDTVRRVA AF+KLGQEEN K 
Sbjct: 181 SISKQLFSRMISLYDHHDLQDKVIEIFADMEELGVKPDEDTVRRVARAFQKLGQEENRKM 240

Query: 241 VYKRYGCKWKYIHFKSERVRVRRDGWDEDDK 272
           V+KRYGCKWKYIHFK ERVRV+RDGWDEDD+
Sbjct: 241 VHKRYGCKWKYIHFKGERVRVKRDGWDEDDE 270

BLAST of CmaCh04G001800 vs. TAIR 10
Match: AT4G18975.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 331.3 bits (848), Expect = 7.5e-91
Identity = 160/206 (77.67%), Positives = 181/206 (87.86%), Query Frame = 0

Query: 58  KKVGKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAS 117
           KKVGK+ HHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAA+
Sbjct: 76  KKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAA 135

Query: 118 KALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHA 177
           KAL+ILRKRSQW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH 
Sbjct: 136 KALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHT 195

Query: 178 HTRSISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEEN 237
           HTRSI +RLF+RMIALY HHDL DK+IE+FADMEEL V PDED+ RRVA AFR+L QEEN
Sbjct: 196 HTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEEN 255

Query: 238 GKTVYKRYGCKWKYIHFKSERVRVRR 264
            K + +RY  ++KYI+F  ERVRV+R
Sbjct: 256 RKLILRRYLSEYKYIYFNGERVRVKR 281

BLAST of CmaCh04G001800 vs. TAIR 10
Match: AT4G18975.2 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 331.3 bits (848), Expect = 7.5e-91
Identity = 160/206 (77.67%), Positives = 181/206 (87.86%), Query Frame = 0

Query: 58  KKVGKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAS 117
           KKVGK+ HHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAA+
Sbjct: 49  KKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAA 108

Query: 118 KALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHA 177
           KAL+ILRKRSQW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH 
Sbjct: 109 KALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHT 168

Query: 178 HTRSISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEEN 237
           HTRSI +RLF+RMIALY HHDL DK+IE+FADMEEL V PDED+ RRVA AFR+L QEEN
Sbjct: 169 HTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEEN 228

Query: 238 GKTVYKRYGCKWKYIHFKSERVRVRR 264
            K + +RY  ++KYI+F  ERVRV+R
Sbjct: 229 RKLILRRYLSEYKYIYFNGERVRVKR 254

BLAST of CmaCh04G001800 vs. TAIR 10
Match: AT4G18975.3 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 331.3 bits (848), Expect = 7.5e-91
Identity = 160/206 (77.67%), Positives = 181/206 (87.86%), Query Frame = 0

Query: 58  KKVGKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAS 117
           KKVGK+ HHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAA+
Sbjct: 76  KKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAA 135

Query: 118 KALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHA 177
           KAL+ILRKRSQW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH 
Sbjct: 136 KALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHT 195

Query: 178 HTRSISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEEN 237
           HTRSI +RLF+RMIALY HHDL DK+IE+FADMEEL V PDED+ RRVA AFR+L QEEN
Sbjct: 196 HTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEEN 255

Query: 238 GKTVYKRYGCKWKYIHFKSERVRVRR 264
            K + +RY  ++KYI+F  ERVRV+R
Sbjct: 256 RKLILRRYLSEYKYIYFNGERVRVKR 281

BLAST of CmaCh04G001800 vs. TAIR 10
Match: AT4G18975.4 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 331.3 bits (848), Expect = 7.5e-91
Identity = 160/206 (77.67%), Positives = 181/206 (87.86%), Query Frame = 0

Query: 58  KKVGKEAHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAS 117
           KKVGK+ HHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAA+
Sbjct: 76  KKVGKKEHHLWKKNDSAGSGQKALNLVRMLSGLPNEKEAVYGALNKWVAWEVEFPIIAAA 135

Query: 118 KALRILRKRSQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHA 177
           KAL+ILRKRSQW RVIQ+AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH 
Sbjct: 136 KALQILRKRSQWHRVIQLAKWMLSKGQGATMGTYDILLLAFDMDERADEAESLWNMILHT 195

Query: 178 HTRSISKRLFSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEEN 237
           HTRSI +RLF+RMIALY HHDL DK+IE+FADMEEL V PDED+ RRVA AFR+L QEEN
Sbjct: 196 HTRSIPRRLFARMIALYAHHDLHDKVIEVFADMEELKVSPDEDSARRVARAFRELNQEEN 255

Query: 238 GKTVYKRYGCKWKYIHFKSERVRVRR 264
            K + +RY  ++KYI+F  ERVRV+R
Sbjct: 256 RKLILRRYLSEYKYIYFNGERVRVKR 281

BLAST of CmaCh04G001800 vs. TAIR 10
Match: AT4G21190.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 177.2 bits (448), Expect = 1.8e-44
Identity = 87/197 (44.16%), Positives = 123/197 (62.44%), Query Frame = 0

Query: 67  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAASKALRILRKR 126
           +WK R   G+  KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL   
Sbjct: 45  VWKTRKRIGTISKAAKMIACIKGLSNVKEEVYGALDSFIAWELEFPLVIVKKALVILEDE 104

Query: 127 SQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRL 186
            +WK++IQV KWMLSKGQG TMGTY +LL A   D R+DEAE LWN +   H     ++ 
Sbjct: 105 KEWKKIIQVTKWMLSKGQGRTMGTYFSLLNALAEDNRLDEAEELWNKLFMEHLEGTPRKF 164

Query: 187 FSRMIALYDHHDLQDKIIEIFADMEELGVRPDEDTVRRVAHAFRKLGQEENGKTVYKRY- 246
           F++MI++Y   D+  K+ E+FADMEELGV+P+   V  V   F KL  ++  + + K+Y 
Sbjct: 165 FNKMISIYYKRDMHQKLFEVFADMEELGVKPNVAIVSMVGKVFVKLEMKDKYEKLMKKYP 224

Query: 247 GCKWKYIHFKSERVRVR 263
             +W++ + K  RV+V+
Sbjct: 225 PPQWEFRYIKGRRVKVK 241

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name  E-value  Identity  Description
Q2V3H0      1.1e-89  77.67     Pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Arabidop...
Q8LG95      2.5e-43  44.16     Pentatricopeptide repeat-containing protein At4g21190 OS=Arabidopsis thaliana OX...

ExPASy TrEMBL
Match Name  E-value   Identity  Description
A0A6J1K6G2  7.7e-152  100.00    pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Cucurbit...
A0A6J1FWG3  7.2e-150  98.52     pentatricopeptide repeat-containing protein At4g18975, chloroplastic OS=Cucurbit...
A0A6J1CC95  8.9e-132  87.45     pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ...
A0A5A7UD21  5.7e-131  87.45     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3CP50  5.7e-131  87.45     pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ...

NCBI nr
Match Name      E-value   Identity  Description
XP_022996325.1  1.6e-151  100.00    pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita ...
XP_023541595.1  5.1e-150  98.89     pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita ...
XP_022942519.1  1.5e-149  98.52     pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita ...
KAG6600024.1    2.5e-149  98.52     Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a...
XP_022139301.1  1.8e-131  87.45     pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 ...

TAIR 10
Match Name   E-value  Identity  Description
AT4G18975.1  7.5e-91  77.67     Pentatricopeptide repeat (PPR) superfamily protein
AT4G18975.2  7.5e-91  77.67     Pentatricopeptide repeat (PPR) superfamily protein
AT4G18975.3  7.5e-91  77.67     Pentatricopeptide repeat (PPR) superfamily protein
AT4G18975.4  7.5e-91  77.67     Pentatricopeptide repeat (PPR) superfamily protein
AT4G21190.1  1.8e-44  44.16     Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                           Source   Source Term    Source Description               Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily         GENE3D   1.25.40.10     Tetratricopeptide repeat domain  coord: 100..269; e-value: 1.0E-13; score: 53.4
None       No IPR available                                          PANTHER  PTHR46782:SF2  OS07G0545900 PROTEIN             coord: 23..267
IPR044646  Pentatricopeptide repeat-containing protein EMB1417-like  PANTHER  PTHR46782      OS01G0757700 PROTEIN             coord: 23..267

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G001800.1  CmaCh04G001800.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding