CmaCh20G006590 (gene) Cucurbita maxima (Rimu)

NameCmaCh20G006590
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCma_Chr20 : 3041507 .. 3042775 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTTCTAACTATCTACGCTTGCTTTCCTACACTAAGCAATTGAGCTTTTATGCCAACCATGGCAACCACGAGCAGGCTCTTGCTCTTTTCCTCCACATGCAGGCCTCACTAGCCCTTGCTCTAGACGCCCACGTTTTCTCTCTCGTTCTCAAGTCCTGCACCGCCATCCGCCGACCCCTTTTGGGTACTGCCATCCATGCCCACGCTACTAAATCCTCCCTCCTCTCAAACCCATTTGTTGCCTGTGCCCTAGTTGACATGTATGGCAAATCCATGTCCGTTCCACTCGCACGGAAGCTGTTCGATGAAATTCCTCAAAGAACTGTCGTCGCCTGGAATACCATGTTATCACTATACACGCACTCAAACATGTTACCTGATGCTCTCCGGTTGTTTGAAGCTATGGATGTGCCCCCCAATACTTCATCCTTCAATCCAATAATTGCAGGGTTGTTGGATGATGGATTCAAGGCCATTTCTTTCTATCGCAAGATGCAACAATGTGGGTTGAAGCCTAATTTAATTACCCTTCTTGCTTTATTACCTGCAAGTGTTGGAGTTGCGGCTTTGGACTTGATTAAACAAATTCATGGTTTTGCCATGAGAAATGATCTTGGTTGTCATCCCCAGTTAAGCAGTGGCCTTGTTGAGGCCTATGGGCGATGCGGATGTCTCAATTATGCACACAATGTGTTCGATAAAATGCACGAAAGAGATGTAGTCGCATGGAGCAGTTTAATATCAGCACATGCTCTTCATGGGGAGGCTAGAACTGCTTTAAACATCTTTCAACAAATGGCATTGTGTAAAGTGCAGCCTGATGAGATTACATTCATAGGAGTGTTGAAGGCTTGTAGTTACGTGGGGTTAGCTGATGAAGCATTGTATTATTTCAGTCGTATGCAGAGAGATTACGGTCTACAAGCAAGCAGCGACCATTATTCTTGTCTAGTAGATGCATTGAGCAGAGCAGGGAGATTACACGAGGCATATGAGATTATCCGCGAGATGCCAGTGAGGGTGACGGCTAAAGCTTGGGGCGCCCTCCTTGGGGCTTGTCGAAACTATGGGGAGTTAGAGCTGGCGGAGATTGCAGGGAAGGCTTTGTTTGATATAGAGCCTGAGAATGCTGCCAATTATGTGCTGTTGGGTAAAATGTATGCAAGTGTAGGGAGGCATGAGGAGGCTCAAAGACTGAGAATGGAAATGAAAGAGAGGAGAGTTCAGGCAGTACCTGGAAGTAGCTGGGTTGTTTATCAGGATTGA

mRNA sequence

ATGGGTTCTAACTATCTACGCTTGCTTTCCTACACTAAGCAATTGAGCTTTTATGCCAACCATGGCAACCACGAGCAGGCTCTTGCTCTTTTCCTCCACATGCAGGCCTCACTAGCCCTTGCTCTAGACGCCCACGTTTTCTCTCTCGTTCTCAAGTCCTGCACCGCCATCCGCCGACCCCTTTTGGGTACTGCCATCCATGCCCACGCTACTAAATCCTCCCTCCTCTCAAACCCATTTGTTGCCTGTGCCCTAGTTGACATGTATGGCAAATCCATGTCCGTTCCACTCGCACGGAAGCTGTTCGATGAAATTCCTCAAAGAACTGTCGTCGCCTGGAATACCATGTTATCACTATACACGCACTCAAACATGTTACCTGATGCTCTCCGGTTGTTTGAAGCTATGGATGTGCCCCCCAATACTTCATCCTTCAATCCAATAATTGCAGGGTTGTTGGATGATGGATTCAAGGCCATTTCTTTCTATCGCAAGATGCAACAATGTGGGTTGAAGCCTAATTTAATTACCCTTCTTGCTTTATTACCTGCAAGTGTTGGAGTTGCGGCTTTGGACTTGATTAAACAAATTCATGGTTTTGCCATGAGAAATGATCTTGGTTGTCATCCCCAGTTAAGCAGTGGCCTTGTTGAGGCCTATGGGCGATGCGGATGTCTCAATTATGCACACAATGTGTTCGATAAAATGCACGAAAGAGATGTAGTCGCATGGAGCAGTTTAATATCAGCACATGCTCTTCATGGGGAGGCTAGAACTGCTTTAAACATCTTTCAACAAATGGCATTGTGTAAAGTGCAGCCTGATGAGATTACATTCATAGGAGTGTTGAAGGCTTGTAGTTACGTGGGGTTAGCTGATGAAGCATTGTATTATTTCAGTCGTATGCAGAGAGATTACGGTCTACAAGCAAGCAGCGACCATTATTCTTGTCTAGTAGATGCATTGAGCAGAGCAGGGAGATTACACGAGGCATATGAGATTATCCGCGAGATGCCAGTGAGGGTGACGGCTAAAGCTTGGGGCGCCCTCCTTGGGGCTTGTCGAAACTATGGGGAGTTAGAGCTGGCGGAGATTGCAGGGAAGGCTTTGTTTGATATAGAGCCTGAGAATGCTGCCAATTATGTGCTGTTGGGTAAAATGTATGCAAGTGTAGGGAGGCATGAGGAGGCTCAAAGACTGAGAATGGAAATGAAAGAGAGGAGAGTTCAGGCAGTACCTGGAAGTAGCTGGGTTGTTTATCAGGATTGA

Coding sequence (CDS)

ATGGGTTCTAACTATCTACGCTTGCTTTCCTACACTAAGCAATTGAGCTTTTATGCCAACCATGGCAACCACGAGCAGGCTCTTGCTCTTTTCCTCCACATGCAGGCCTCACTAGCCCTTGCTCTAGACGCCCACGTTTTCTCTCTCGTTCTCAAGTCCTGCACCGCCATCCGCCGACCCCTTTTGGGTACTGCCATCCATGCCCACGCTACTAAATCCTCCCTCCTCTCAAACCCATTTGTTGCCTGTGCCCTAGTTGACATGTATGGCAAATCCATGTCCGTTCCACTCGCACGGAAGCTGTTCGATGAAATTCCTCAAAGAACTGTCGTCGCCTGGAATACCATGTTATCACTATACACGCACTCAAACATGTTACCTGATGCTCTCCGGTTGTTTGAAGCTATGGATGTGCCCCCCAATACTTCATCCTTCAATCCAATAATTGCAGGGTTGTTGGATGATGGATTCAAGGCCATTTCTTTCTATCGCAAGATGCAACAATGTGGGTTGAAGCCTAATTTAATTACCCTTCTTGCTTTATTACCTGCAAGTGTTGGAGTTGCGGCTTTGGACTTGATTAAACAAATTCATGGTTTTGCCATGAGAAATGATCTTGGTTGTCATCCCCAGTTAAGCAGTGGCCTTGTTGAGGCCTATGGGCGATGCGGATGTCTCAATTATGCACACAATGTGTTCGATAAAATGCACGAAAGAGATGTAGTCGCATGGAGCAGTTTAATATCAGCACATGCTCTTCATGGGGAGGCTAGAACTGCTTTAAACATCTTTCAACAAATGGCATTGTGTAAAGTGCAGCCTGATGAGATTACATTCATAGGAGTGTTGAAGGCTTGTAGTTACGTGGGGTTAGCTGATGAAGCATTGTATTATTTCAGTCGTATGCAGAGAGATTACGGTCTACAAGCAAGCAGCGACCATTATTCTTGTCTAGTAGATGCATTGAGCAGAGCAGGGAGATTACACGAGGCATATGAGATTATCCGCGAGATGCCAGTGAGGGTGACGGCTAAAGCTTGGGGCGCCCTCCTTGGGGCTTGTCGAAACTATGGGGAGTTAGAGCTGGCGGAGATTGCAGGGAAGGCTTTGTTTGATATAGAGCCTGAGAATGCTGCCAATTATGTGCTGTTGGGTAAAATGTATGCAAGTGTAGGGAGGCATGAGGAGGCTCAAAGACTGAGAATGGAAATGAAAGAGAGGAGAGTTCAGGCAGTACCTGGAAGTAGCTGGGTTGTTTATCAGGATTGA

Protein sequence

MGSNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLLGTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDALRLFEAMDVPPNTSSFNPIIAGLLDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGELELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVVYQD
BLAST of CmaCh20G006590 vs. Swiss-Prot
Match: PPR7_ARATH (Putative pentatricopeptide repeat-containing protein At1g03510 OS=Arabidopsis thaliana GN=PCMP-E3 PE=3 SV=1)

HSP 1 Score: 549.3 bits (1414), Expect = 3.7e-155
Identity = 275/423 (65.01%), Postives = 325/423 (76.83%), Query Frame = 1

Query: 3   SNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLL 62
           S+  +L+S TKQLS YAN GNHEQAL LFL M +S AL LDAHVFSL LKSC A  RP+L
Sbjct: 7   SSCTKLISLTKQLSSYANQGNHEQALNLFLQMHSSFALPLDAHVFSLALKSCAAAFRPVL 66

Query: 63  GTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTH 122
           G ++HAH+ KS+ LSNPFV CAL+DMYGK +SV  ARKLFDEIPQR  V WN M+S YTH
Sbjct: 67  GGSVHAHSVKSNFLSNPFVGCALLDMYGKCLSVSHARKLFDEIPQRNAVVWNAMISHYTH 126

Query: 123 SNMLPDALRLFEAMDVPPNTSSFNPIIAGLL---DDGFKAISFYRKMQQCGLKPNLITLL 182
              + +A+ L+EAMDV PN SSFN II GL+   D  ++AI FYRKM +   KPNLITLL
Sbjct: 127 CGKVKEAVELYEAMDVMPNESSFNAIIKGLVGTEDGSYRAIEFYRKMIEFRFKPNLITLL 186

Query: 183 ALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHER 242
           AL+ A   + A  LIK+IH +A RN +  HPQL SGLVEAYGRCG + Y   VFD M +R
Sbjct: 187 ALVSACSAIGAFRLIKEIHSYAFRNLIEPHPQLKSGLVEAYGRCGSIVYVQLVFDSMEDR 246

Query: 243 DVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYF 302
           DVVAWSSLISA+ALHG+A +AL  FQ+M L KV PD+I F+ VLKACS+ GLADEAL YF
Sbjct: 247 DVVAWSSLISAYALHGDAESALKTFQEMELAKVTPDDIAFLNVLKACSHAGLADEALVYF 306

Query: 303 SRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGE 362
            RMQ DYGL+AS DHYSCLVD LSR GR  EAY++I+ MP + TAK WGALLGACRNYGE
Sbjct: 307 KRMQGDYGLRASKDHYSCLVDVLSRVGRFEEAYKVIQAMPEKPTAKTWGALLGACRNYGE 366

Query: 363 LELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVV 422
           +ELAEIA + L  +EPEN ANYVLLGK+Y SVGR EEA+RLR++MKE  V+  PGSSW +
Sbjct: 367 IELAEIAARELLMVEPENPANYVLLGKIYMSVGRQEEAERLRLKMKESGVKVSPGSSWCL 426

BLAST of CmaCh20G006590 vs. Swiss-Prot
Match: PP165_ARATH (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 276.9 bits (707), Expect = 3.6e-73
Identity = 140/378 (37.04%), Postives = 226/378 (59.79%), Query Frame = 1

Query: 43  DAHVFSLVLKSCTAIRRPLLGTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLF 102
           D   F  + KSC ++    LG  +H H  K     +     AL+DMY K   +  A K+F
Sbjct: 108 DRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVF 167

Query: 103 DEIPQRTVVAWNTMLSLYTHSNMLPDALRLFEAMDVPPNTSSFNPIIAGLLDDG--FKAI 162
           DE+ +R V++WN++LS Y     +  A  LF  M +     S+  +I+G    G   +A+
Sbjct: 168 DEMYERDVISWNSLLSGYARLGQMKKAKGLFHLM-LDKTIVSWTAMISGYTGIGCYVEAM 227

Query: 163 SFYRKMQQCGLKPNLITLLALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAY 222
            F+R+MQ  G++P+ I+L+++LP+   + +L+L K IH +A R        + + L+E Y
Sbjct: 228 DFFREMQLAGIEPDEISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMY 287

Query: 223 GRCGCLNYAHNVFDKMHERDVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFI 282
            +CG ++ A  +F +M  +DV++WS++IS +A HG A  A+  F +M   KV+P+ ITF+
Sbjct: 288 SKCGVISQAIQLFGQMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFL 347

Query: 283 GVLKACSYVGLADEALYYFSRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPV 342
           G+L ACS+VG+  E L YF  M++DY ++   +HY CL+D L+RAG+L  A EI + MP+
Sbjct: 348 GLLSACSHVGMWQEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPM 407

Query: 343 RVTAKAWGALLGACRNYGELELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRL 402
           +  +K WG+LL +CR  G L++A +A   L ++EPE+  NYVLL  +YA +G+ E+  RL
Sbjct: 408 KPDSKIWGSLLSSCRTPGNLDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRL 467

Query: 403 RMEMKERRVQAVPGSSWV 419
           R  ++   ++  PG S +
Sbjct: 468 RKMIRNENMKKTPGGSLI 484

BLAST of CmaCh20G006590 vs. Swiss-Prot
Match: PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 270.4 bits (690), Expect = 3.3e-71
Identity = 145/413 (35.11%), Postives = 234/413 (56.66%), Query Frame = 1

Query: 11  YTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLLGTAIHAHA 70
           +   ++ Y+ + + ++AL LF+ M+ S  L  ++   + V+ +C          AIH   
Sbjct: 372 WNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFV 431

Query: 71  TKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDAL 130
            K  L  + FV   L+DMY +   + +A ++F ++  R +V WNTM++ Y  S    DAL
Sbjct: 432 VKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDAL 491

Query: 131 RLFEAMDVPPNTSSFNPIIAGLLDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVAA 190
            L   M       S                   +   +  LKPN ITL+ +LP+   ++A
Sbjct: 492 LLLHKMQNLERKVS-------------------KGASRVSLKPNSITLMTILPSCAALSA 551

Query: 191 LDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLISA 250
           L   K+IH +A++N+L     + S LV+ Y +CGCL  +  VFD++ +++V+ W+ +I A
Sbjct: 552 LAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMA 611

Query: 251 HALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQA 310
           + +HG  + A+++ + M +  V+P+E+TFI V  ACS+ G+ DE L  F  M+ DYG++ 
Sbjct: 612 YGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEP 671

Query: 311 SSDHYSCLVDALSRAGRLHEAYEIIREMPVRVT-AKAWGALLGACRNYGELELAEIAGKA 370
           SSDHY+C+VD L RAGR+ EAY+++  MP     A AW +LLGA R +  LE+ EIA + 
Sbjct: 672 SSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQN 731

Query: 371 LFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVVYQD 423
           L  +EP  A++YVLL  +Y+S G  ++A  +R  MKE+ V+  PG SW+ + D
Sbjct: 732 LIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGD 765

BLAST of CmaCh20G006590 vs. Swiss-Prot
Match: PPR53_ARATH (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 270.0 bits (689), Expect = 4.4e-71
Identity = 147/416 (35.34%), Postives = 236/416 (56.73%), Query Frame = 1

Query: 8   LLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLLGTAIH 67
           ++S+   LS +   G H++A+ +F  +   L    D    S VL S        +G  IH
Sbjct: 217 IVSWNGILSGFNRSGYHKEAVVMFQKIH-HLGFCPDQVTVSSVLPSVGDSEMLNMGRLIH 276

Query: 68  AHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLP 127
            +  K  LL +  V  A++DMYGKS  V     LF++         N  ++  + + ++ 
Sbjct: 277 GYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVD 336

Query: 128 DALRLFEAMD---VPPNTSSFNPIIAGLLDDG--FKAISFYRKMQQCGLKPNLITLLALL 187
            AL +FE      +  N  S+  IIAG   +G   +A+  +R+MQ  G+KPN +T+ ++L
Sbjct: 337 KALEMFELFKEQTMELNVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNHVTIPSML 396

Query: 188 PASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVV 247
           PA   +AAL   +  HGFA+R  L  +  + S L++ Y +CG +N +  VF+ M  +++V
Sbjct: 397 PACGNIAALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLV 456

Query: 248 AWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRM 307
            W+SL++  ++HG+A+  ++IF+ +   +++PD I+F  +L AC  VGL DE   YF  M
Sbjct: 457 CWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMM 516

Query: 308 QRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGELEL 367
             +YG++   +HYSC+V+ L RAG+L EAY++I+EMP    +  WGALL +CR    ++L
Sbjct: 517 SEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDL 576

Query: 368 AEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWV 419
           AEIA + LF +EPEN   YVLL  +YA+ G   E   +R +M+   ++  PG SW+
Sbjct: 577 AEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSWI 631

BLAST of CmaCh20G006590 vs. Swiss-Prot
Match: PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana GN=CRR2 PE=2 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 1.7e-70
Identity = 146/401 (36.41%), Postives = 222/401 (55.36%), Query Frame = 1

Query: 22  GNHEQALALFLHMQASLALALDAHVFSLVLKSCTA----IRRPLLGTAIHAHATKSSLLS 81
           G+ E+ L L+  M   + +  D   ++ VLK+C A    +   + G  IHAH T+    S
Sbjct: 157 GHGEEVLGLYWKMNR-IGVESDRFTYTYVLKACVASECTVNHLMKGKEIHAHLTRRGYSS 216

Query: 82  NPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDALRLFEAMD 141
           + ++   LVDMY +   V  A  +F  +P R VV+W+ M++ Y  +    +ALR F  M 
Sbjct: 217 HVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMM 276

Query: 142 VPPNTSSFNPIIAGLLDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVAALDLIKQI 201
                SS                            PN +T++++L A   +AAL+  K I
Sbjct: 277 RETKDSS----------------------------PNSVTMVSVLQACASLAALEQGKLI 336

Query: 202 HGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLISAHALHGEA 261
           HG+ +R  L     + S LV  YGRCG L     VFD+MH+RDVV+W+SLIS++ +HG  
Sbjct: 337 HGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYG 396

Query: 262 RTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQASSDHYSC 321
           + A+ IF++M      P  +TF+ VL ACS+ GL +E    F  M RD+G++   +HY+C
Sbjct: 397 KKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYAC 456

Query: 322 LVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGELELAEIAGKALFDIEPEN 381
           +VD L RA RL EA +++++M      K WG+LLG+CR +G +ELAE A + LF +EP+N
Sbjct: 457 MVDLLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKN 516

Query: 382 AANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWV 419
           A NYVLL  +YA     +E +R++  ++ R +Q +PG  W+
Sbjct: 517 AGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWM 528

BLAST of CmaCh20G006590 vs. TrEMBL
Match: A0A0A0KE48_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G124020 PE=4 SV=1)

HSP 1 Score: 738.0 bits (1904), Expect = 6.3e-210
Identity = 371/424 (87.50%), Postives = 394/424 (92.92%), Query Frame = 1

Query: 1   MGSNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRP 60
           MGS+YLRLLSYTKQLSFYANHGNHEQ L+LF HMQASLAL LDAHVFSLVLKSCTA+RRP
Sbjct: 1   MGSSYLRLLSYTKQLSFYANHGNHEQTLSLFHHMQASLALGLDAHVFSLVLKSCTALRRP 60

Query: 61  LLGTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLY 120
            LG AIHAH+ KSSLLS+PFVACALVDMYGKS+SV LARKLFDEIP R+VV WN MLSLY
Sbjct: 61  HLGIAIHAHSAKSSLLSSPFVACALVDMYGKSLSVTLARKLFDEIPHRSVVVWNVMLSLY 120

Query: 121 THSNMLPDALRLFEAMDVPPNTSSFNPIIAGL--LDDGFKAISFYRKMQQCGLKPNLITL 180
            H+NML  AL+LFEAMDVPPN SSFN I+AGL  L+DGFKAI+FYR+MQQCGLKPNLITL
Sbjct: 121 VHANMLFGALQLFEAMDVPPNASSFNAIVAGLSKLEDGFKAIAFYRQMQQCGLKPNLITL 180

Query: 181 LALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHE 240
           LALLPASVGVA+LDLIKQIHGFAMRND+G H QLSSGLVEAYGRCGCL+YAHNVFD M E
Sbjct: 181 LALLPASVGVASLDLIKQIHGFAMRNDIGAHLQLSSGLVEAYGRCGCLSYAHNVFDNMTE 240

Query: 241 RDVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYY 300
           RDVVAWSSLISAHALHGEA TALNIFQQM  CKVQPDEITFIGVLKACS+VGLA+EAL Y
Sbjct: 241 RDVVAWSSLISAHALHGEASTALNIFQQMESCKVQPDEITFIGVLKACSHVGLANEALDY 300

Query: 301 FSRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYG 360
           F+RMQRDYGLQASSDHYSCLVD LSRAGRLHEAY+IIREMPVRVTAKAWGALLGACR YG
Sbjct: 301 FNRMQRDYGLQASSDHYSCLVDVLSRAGRLHEAYDIIREMPVRVTAKAWGALLGACRIYG 360

Query: 361 ELELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWV 420
           ELELAEIAGKALF+IEPENAANYVLL KMYASVGRHEEAQR+R EMKERRV+ VPGSSWV
Sbjct: 361 ELELAEIAGKALFEIEPENAANYVLLAKMYASVGRHEEAQRMRREMKERRVKVVPGSSWV 420

Query: 421 VYQD 423
           VYQD
Sbjct: 421 VYQD 424

BLAST of CmaCh20G006590 vs. TrEMBL
Match: E5GBQ5_CUCME (Pentatricopeptide (PPR) repeat-containing protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 680.6 bits (1755), Expect = 1.2e-192
Identity = 344/391 (87.98%), Postives = 364/391 (93.09%), Query Frame = 1

Query: 34  MQASLALALDAHVFSLVLKSCTAIRRPLLGTAIHAHATKSSLLSNPFVACALVDMYGKSM 93
           MQASLALALDAHVFSLVLKSCTA+RRP LG AIHAH+ KSSLLS+PFVACALVDMYGKS+
Sbjct: 1   MQASLALALDAHVFSLVLKSCTALRRPHLGIAIHAHSAKSSLLSSPFVACALVDMYGKSL 60

Query: 94  SVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDALRLFEAMDVPPNTSSFNPIIAGL- 153
           SV LARKLFDEIP R+VV WN MLSLY HSNML  AL+LFEAMDVPPN SSFNPI+AGL 
Sbjct: 61  SVSLARKLFDEIPHRSVVVWNVMLSLYVHSNMLFGALQLFEAMDVPPNASSFNPIVAGLS 120

Query: 154 -LDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVAALDLIKQIHGFAMRNDLGCHPQ 213
            L+DGFKAI+FYR+MQQCGLKPNLITLLALLPASVGVA+LDLIKQIHGFAMRND+G H Q
Sbjct: 121 KLEDGFKAIAFYRQMQQCGLKPNLITLLALLPASVGVASLDLIKQIHGFAMRNDIGGHLQ 180

Query: 214 LSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLISAHALHGEARTALNIFQQMALCK 273
           LSSGLVEAYGRCGCL+YAHNVFD M ERDVVAWSSLISAHALHGEA TALNIFQQM L K
Sbjct: 181 LSSGLVEAYGRCGCLSYAHNVFDNMTERDVVAWSSLISAHALHGEASTALNIFQQMELSK 240

Query: 274 VQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQASSDHYSCLVDALSRAGRLHEA 333
           VQPD ITFIGVLKACS+VGLADEAL YF+RMQRDYGLQASSDHYSCLVD LSRAGRLHEA
Sbjct: 241 VQPDAITFIGVLKACSHVGLADEALDYFNRMQRDYGLQASSDHYSCLVDVLSRAGRLHEA 300

Query: 334 YEIIREMPVRVTAKAWGALLGACRNYGELELAEIAGKALFDIEPENAANYVLLGKMYASV 393
           ++IIREMPVRVTAKAWGALLGACR YGELELAEIAGKALF+IEPENAANYVLL KMYASV
Sbjct: 301 HDIIREMPVRVTAKAWGALLGACRIYGELELAEIAGKALFEIEPENAANYVLLAKMYASV 360

Query: 394 GRHEEAQRLRMEMKERRVQAVPGSSWVVYQD 423
           GRHEEAQR+R EMKERRV+ VPGSSWVVYQD
Sbjct: 361 GRHEEAQRMRREMKERRVKVVPGSSWVVYQD 391

BLAST of CmaCh20G006590 vs. TrEMBL
Match: A0A061F553_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TCM_030737 PE=4 SV=1)

HSP 1 Score: 600.9 bits (1548), Expect = 1.2e-168
Identity = 304/423 (71.87%), Postives = 351/423 (82.98%), Query Frame = 1

Query: 3   SNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLL 62
           SN+LRLLS TKQL+ + N G H  AL+LF  MQAS +L LD +VF LVLKSC AI RP L
Sbjct: 6   SNHLRLLSLTKQLTSHVNQGRHADALSLFHSMQASPSLTLDPYVFPLVLKSCAAISRPRL 65

Query: 63  GTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTH 122
           G++IHAHATKSSLLSNPFVACALVDMYGK  S+  AR+LFDEIPQR VV WN+++SLYT 
Sbjct: 66  GSSIHAHATKSSLLSNPFVACALVDMYGKCFSISSARRLFDEIPQRNVVVWNSLISLYTR 125

Query: 123 SNMLPDALRLFEAMDVPPNTSSFNPIIAGL--LDDG-FKAISFYRKMQQCGLKPNLITLL 182
              + +AL LF++MDV PN S+FNPIIAGL  L+DG F A  FYR+MQ+ GL+PNLITLL
Sbjct: 126 CGRVDEALHLFQSMDVGPNESTFNPIIAGLSELEDGPFMATEFYRRMQRVGLRPNLITLL 185

Query: 183 ALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHER 242
           ALL A VGVAAL LIK+IH +A+R++   HPQL SGLVEAYGRCGCL YA NVF  M ER
Sbjct: 186 ALLRACVGVAALSLIKEIHSYALRSNTEPHPQLRSGLVEAYGRCGCLVYARNVFQCMEER 245

Query: 243 DVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYF 302
           DVVAWSSLISAHALHGEA+ AL +FQQM L KV PD+ITF+GVLKACS+ GLADEAL YF
Sbjct: 246 DVVAWSSLISAHALHGEAKAALEVFQQMELAKVWPDDITFLGVLKACSHAGLADEALGYF 305

Query: 303 SRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGE 362
            RM +DY L+AS+DHYSCLVDALSRAGRL+EAY +I+EMPV+ TAK WGALLGACR YGE
Sbjct: 306 DRMHKDYKLEASADHYSCLVDALSRAGRLYEAYRVIKEMPVKPTAKTWGALLGACRTYGE 365

Query: 363 LELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVV 422
           +ELAEIAG+ALF+IEP NAANYVLL ++YASVGR+EEAQ +RMEMKER V+  PG SWVV
Sbjct: 366 VELAEIAGRALFEIEPSNAANYVLLARIYASVGRYEEAQGMRMEMKERGVKVAPGGSWVV 425

BLAST of CmaCh20G006590 vs. TrEMBL
Match: D7SNQ6_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0029g00490 PE=4 SV=1)

HSP 1 Score: 585.1 bits (1507), Expect = 6.8e-164
Identity = 300/424 (70.75%), Postives = 344/424 (81.13%), Query Frame = 1

Query: 3   SNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLL 62
           SNYLRLLSYTK L+ + N G H  AL+LF HM AS A ALDA VF L LKSC A  RP L
Sbjct: 6   SNYLRLLSYTKLLASHVNQGRHHDALSLFHHMHASSAPALDAFVFPLALKSCAAAHRPNL 65

Query: 63  GTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTH 122
           G AIHAH TK SL+SNPFVACALVDMYGK +SV  AR LFDEIP R +V WN M+S+YTH
Sbjct: 66  GAAIHAHVTKFSLVSNPFVACALVDMYGKCVSVSSARHLFDEIPHRNIVVWNAMISIYTH 125

Query: 123 SNMLPDALRLFEAMDVPPNTSSFNPIIAGL--LDDG-FKAISFYRKMQQCGLKPNLITLL 182
           S  + DAL LFE MDV PN S+FN II+GL  L+DG FKA+SFYR+M + GLK NLITLL
Sbjct: 126 SGRVADALGLFEVMDVEPNASTFNAIISGLSGLEDGSFKALSFYRRMGEVGLKQNLITLL 185

Query: 183 ALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDK--MH 242
           ALLPA V +AAL LIK+IHG+A+RN +  HP L S LVEAYGRCGC+  +  VF    M 
Sbjct: 186 ALLPACVDLAALTLIKEIHGYAIRNGIDPHPHLRSCLVEAYGRCGCIVNSQCVFQSISMS 245

Query: 243 ERDVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALY 302
           ERDVVAWSSLISA+ALHG+ARTAL  F+QM + KVQPD ITF+GVLKACS+ GLADEAL 
Sbjct: 246 ERDVVAWSSLISAYALHGDARTALETFEQMEMAKVQPDGITFLGVLKACSHAGLADEALG 305

Query: 303 YFSRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNY 362
           YF RM +DYG++ASSDHYSC+VDALSRAGRL+EAYEII+ MPV+ TAK WGALLGACR Y
Sbjct: 306 YFGRMCKDYGVEASSDHYSCVVDALSRAGRLYEAYEIIQGMPVKSTAKTWGALLGACRTY 365

Query: 363 GELELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSW 422
           GE+ELAEIAG+ALF++EP+NAANYVLL ++YASVGRHEEAQR+R EM E  V+A PGSSW
Sbjct: 366 GEVELAEIAGRALFELEPDNAANYVLLARIYASVGRHEEAQRMRREMNEMGVKAAPGSSW 425

BLAST of CmaCh20G006590 vs. TrEMBL
Match: B9RB17_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1510500 PE=4 SV=1)

HSP 1 Score: 584.7 bits (1506), Expect = 8.9e-164
Identity = 292/420 (69.52%), Postives = 345/420 (82.14%), Query Frame = 1

Query: 3   SNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLL 62
           S YLRLLS+TKQL+ + N G H  AL LF HMQ SLAL+LD +VF LVLKSC+A+  P L
Sbjct: 22  STYLRLLSFTKQLTSHVNRGLHHDALTLFYHMQTSLALSLDPYVFPLVLKSCSAVLCPQL 81

Query: 63  GTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTH 122
           GT++HAH  K   LSNPFVA ALVDMYGK   +  ARKLFDEIPQR VV WN M+SLYTH
Sbjct: 82  GTSVHAHIVKMGFLSNPFVASALVDMYGKCACIFSARKLFDEIPQRNVVVWNAMISLYTH 141

Query: 123 SNMLPDALRLFEAMDVPPNTSSFNPIIAGLL---DDGFKAISFYRKMQQCGLKPNLITLL 182
           SN + DAL +F+AM++ PN S+FN +I GL    D   KAI+FY KM+Q GLKPNLITLL
Sbjct: 142 SNRVRDALDMFDAMEIEPNVSTFNALIYGLSGVKDGSIKAIAFYWKMRQLGLKPNLITLL 201

Query: 183 ALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHER 242
           ALLPA VG+AAL+LI++IHG+++RND+  HPQL SGL++AYGRCGCL  A NVF  M ER
Sbjct: 202 ALLPACVGIAALNLIREIHGYSIRNDIDRHPQLGSGLLDAYGRCGCLINASNVFCGMKER 261

Query: 243 DVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYF 302
           DVVAWSSLISA+ALHGEA+ AL IF+QM L KVQPD+ITF+ VLKACS+ GLADEAL YF
Sbjct: 262 DVVAWSSLISAYALHGEAKNALEIFRQMELAKVQPDDITFLAVLKACSHAGLADEALDYF 321

Query: 303 SRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGE 362
           ++MQ  Y LQA SDHYSCLVD LSRAGRL+EAY++I+EMPV+VTAKAWGALLGACR YGE
Sbjct: 322 TKMQEGYRLQAVSDHYSCLVDVLSRAGRLYEAYKVIQEMPVKVTAKAWGALLGACRTYGE 381

Query: 363 LELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVV 420
           +ELAEI G+ALF+IEP+N ANYVLL ++YASVGR++EAQR+R EMKER V+  PGSSWVV
Sbjct: 382 IELAEIVGRALFEIEPDNPANYVLLARIYASVGRYDEAQRIRREMKERGVKVSPGSSWVV 441

BLAST of CmaCh20G006590 vs. TAIR10
Match: AT1G03510.1 (AT1G03510.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 549.3 bits (1414), Expect = 2.1e-156
Identity = 275/423 (65.01%), Postives = 325/423 (76.83%), Query Frame = 1

Query: 3   SNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLL 62
           S+  +L+S TKQLS YAN GNHEQAL LFL M +S AL LDAHVFSL LKSC A  RP+L
Sbjct: 7   SSCTKLISLTKQLSSYANQGNHEQALNLFLQMHSSFALPLDAHVFSLALKSCAAAFRPVL 66

Query: 63  GTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTH 122
           G ++HAH+ KS+ LSNPFV CAL+DMYGK +SV  ARKLFDEIPQR  V WN M+S YTH
Sbjct: 67  GGSVHAHSVKSNFLSNPFVGCALLDMYGKCLSVSHARKLFDEIPQRNAVVWNAMISHYTH 126

Query: 123 SNMLPDALRLFEAMDVPPNTSSFNPIIAGLL---DDGFKAISFYRKMQQCGLKPNLITLL 182
              + +A+ L+EAMDV PN SSFN II GL+   D  ++AI FYRKM +   KPNLITLL
Sbjct: 127 CGKVKEAVELYEAMDVMPNESSFNAIIKGLVGTEDGSYRAIEFYRKMIEFRFKPNLITLL 186

Query: 183 ALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHER 242
           AL+ A   + A  LIK+IH +A RN +  HPQL SGLVEAYGRCG + Y   VFD M +R
Sbjct: 187 ALVSACSAIGAFRLIKEIHSYAFRNLIEPHPQLKSGLVEAYGRCGSIVYVQLVFDSMEDR 246

Query: 243 DVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYF 302
           DVVAWSSLISA+ALHG+A +AL  FQ+M L KV PD+I F+ VLKACS+ GLADEAL YF
Sbjct: 247 DVVAWSSLISAYALHGDAESALKTFQEMELAKVTPDDIAFLNVLKACSHAGLADEALVYF 306

Query: 303 SRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGE 362
            RMQ DYGL+AS DHYSCLVD LSR GR  EAY++I+ MP + TAK WGALLGACRNYGE
Sbjct: 307 KRMQGDYGLRASKDHYSCLVDVLSRVGRFEEAYKVIQAMPEKPTAKTWGALLGACRNYGE 366

Query: 363 LELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVV 422
           +ELAEIA + L  +EPEN ANYVLLGK+Y SVGR EEA+RLR++MKE  V+  PGSSW +
Sbjct: 367 IELAEIAARELLMVEPENPANYVLLGKIYMSVGRQEEAERLRLKMKESGVKVSPGSSWCL 426

BLAST of CmaCh20G006590 vs. TAIR10
Match: AT2G20540.1 (AT2G20540.1 mitochondrial editing factor 21)

HSP 1 Score: 276.9 bits (707), Expect = 2.0e-74
Identity = 140/378 (37.04%), Postives = 226/378 (59.79%), Query Frame = 1

Query: 43  DAHVFSLVLKSCTAIRRPLLGTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLF 102
           D   F  + KSC ++    LG  +H H  K     +     AL+DMY K   +  A K+F
Sbjct: 108 DRFTFPFMFKSCASLGSCYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVF 167

Query: 103 DEIPQRTVVAWNTMLSLYTHSNMLPDALRLFEAMDVPPNTSSFNPIIAGLLDDG--FKAI 162
           DE+ +R V++WN++LS Y     +  A  LF  M +     S+  +I+G    G   +A+
Sbjct: 168 DEMYERDVISWNSLLSGYARLGQMKKAKGLFHLM-LDKTIVSWTAMISGYTGIGCYVEAM 227

Query: 163 SFYRKMQQCGLKPNLITLLALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAY 222
            F+R+MQ  G++P+ I+L+++LP+   + +L+L K IH +A R        + + L+E Y
Sbjct: 228 DFFREMQLAGIEPDEISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMY 287

Query: 223 GRCGCLNYAHNVFDKMHERDVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFI 282
            +CG ++ A  +F +M  +DV++WS++IS +A HG A  A+  F +M   KV+P+ ITF+
Sbjct: 288 SKCGVISQAIQLFGQMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFL 347

Query: 283 GVLKACSYVGLADEALYYFSRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPV 342
           G+L ACS+VG+  E L YF  M++DY ++   +HY CL+D L+RAG+L  A EI + MP+
Sbjct: 348 GLLSACSHVGMWQEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPM 407

Query: 343 RVTAKAWGALLGACRNYGELELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRL 402
           +  +K WG+LL +CR  G L++A +A   L ++EPE+  NYVLL  +YA +G+ E+  RL
Sbjct: 408 KPDSKIWGSLLSSCRTPGNLDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRL 467

Query: 403 RMEMKERRVQAVPGSSWV 419
           R  ++   ++  PG S +
Sbjct: 468 RKMIRNENMKKTPGGSLI 484

BLAST of CmaCh20G006590 vs. TAIR10
Match: AT3G57430.1 (AT3G57430.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 270.4 bits (690), Expect = 1.9e-72
Identity = 145/413 (35.11%), Postives = 234/413 (56.66%), Query Frame = 1

Query: 11  YTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLLGTAIHAHA 70
           +   ++ Y+ + + ++AL LF+ M+ S  L  ++   + V+ +C          AIH   
Sbjct: 372 WNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFV 431

Query: 71  TKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDAL 130
            K  L  + FV   L+DMY +   + +A ++F ++  R +V WNTM++ Y  S    DAL
Sbjct: 432 VKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDAL 491

Query: 131 RLFEAMDVPPNTSSFNPIIAGLLDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVAA 190
            L   M       S                   +   +  LKPN ITL+ +LP+   ++A
Sbjct: 492 LLLHKMQNLERKVS-------------------KGASRVSLKPNSITLMTILPSCAALSA 551

Query: 191 LDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLISA 250
           L   K+IH +A++N+L     + S LV+ Y +CGCL  +  VFD++ +++V+ W+ +I A
Sbjct: 552 LAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMA 611

Query: 251 HALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQA 310
           + +HG  + A+++ + M +  V+P+E+TFI V  ACS+ G+ DE L  F  M+ DYG++ 
Sbjct: 612 YGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEP 671

Query: 311 SSDHYSCLVDALSRAGRLHEAYEIIREMPVRVT-AKAWGALLGACRNYGELELAEIAGKA 370
           SSDHY+C+VD L RAGR+ EAY+++  MP     A AW +LLGA R +  LE+ EIA + 
Sbjct: 672 SSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQN 731

Query: 371 LFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVVYQD 423
           L  +EP  A++YVLL  +Y+S G  ++A  +R  MKE+ V+  PG SW+ + D
Sbjct: 732 LIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGD 765

BLAST of CmaCh20G006590 vs. TAIR10
Match: AT1G20230.1 (AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 270.0 bits (689), Expect = 2.5e-72
Identity = 147/416 (35.34%), Postives = 236/416 (56.73%), Query Frame = 1

Query: 8   LLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLLGTAIH 67
           ++S+   LS +   G H++A+ +F  +   L    D    S VL S        +G  IH
Sbjct: 217 IVSWNGILSGFNRSGYHKEAVVMFQKIH-HLGFCPDQVTVSSVLPSVGDSEMLNMGRLIH 276

Query: 68  AHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLP 127
            +  K  LL +  V  A++DMYGKS  V     LF++         N  ++  + + ++ 
Sbjct: 277 GYVIKQGLLKDKCVISAMIDMYGKSGHVYGIISLFNQFEMMEAGVCNAYITGLSRNGLVD 336

Query: 128 DALRLFEAMD---VPPNTSSFNPIIAGLLDDG--FKAISFYRKMQQCGLKPNLITLLALL 187
            AL +FE      +  N  S+  IIAG   +G   +A+  +R+MQ  G+KPN +T+ ++L
Sbjct: 337 KALEMFELFKEQTMELNVVSWTSIIAGCAQNGKDIEALELFREMQVAGVKPNHVTIPSML 396

Query: 188 PASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVV 247
           PA   +AAL   +  HGFA+R  L  +  + S L++ Y +CG +N +  VF+ M  +++V
Sbjct: 397 PACGNIAALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMPTKNLV 456

Query: 248 AWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRM 307
            W+SL++  ++HG+A+  ++IF+ +   +++PD I+F  +L AC  VGL DE   YF  M
Sbjct: 457 CWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWKYFKMM 516

Query: 308 QRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGELEL 367
             +YG++   +HYSC+V+ L RAG+L EAY++I+EMP    +  WGALL +CR    ++L
Sbjct: 517 SEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNVDL 576

Query: 368 AEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWV 419
           AEIA + LF +EPEN   YVLL  +YA+ G   E   +R +M+   ++  PG SW+
Sbjct: 577 AEIAAEKLFHLEPENPGTYVLLSNIYAAKGMWTEVDSIRNKMESLGLKKNPGCSWI 631

BLAST of CmaCh20G006590 vs. TAIR10
Match: AT3G46790.1 (AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 268.1 bits (684), Expect = 9.3e-72
Identity = 146/401 (36.41%), Postives = 222/401 (55.36%), Query Frame = 1

Query: 22  GNHEQALALFLHMQASLALALDAHVFSLVLKSCTA----IRRPLLGTAIHAHATKSSLLS 81
           G+ E+ L L+  M   + +  D   ++ VLK+C A    +   + G  IHAH T+    S
Sbjct: 157 GHGEEVLGLYWKMNR-IGVESDRFTYTYVLKACVASECTVNHLMKGKEIHAHLTRRGYSS 216

Query: 82  NPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDALRLFEAMD 141
           + ++   LVDMY +   V  A  +F  +P R VV+W+ M++ Y  +    +ALR F  M 
Sbjct: 217 HVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMM 276

Query: 142 VPPNTSSFNPIIAGLLDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVAALDLIKQI 201
                SS                            PN +T++++L A   +AAL+  K I
Sbjct: 277 RETKDSS----------------------------PNSVTMVSVLQACASLAALEQGKLI 336

Query: 202 HGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLISAHALHGEA 261
           HG+ +R  L     + S LV  YGRCG L     VFD+MH+RDVV+W+SLIS++ +HG  
Sbjct: 337 HGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYG 396

Query: 262 RTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQASSDHYSC 321
           + A+ IF++M      P  +TF+ VL ACS+ GL +E    F  M RD+G++   +HY+C
Sbjct: 397 KKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYAC 456

Query: 322 LVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGELELAEIAGKALFDIEPEN 381
           +VD L RA RL EA +++++M      K WG+LLG+CR +G +ELAE A + LF +EP+N
Sbjct: 457 MVDLLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKN 516

Query: 382 AANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWV 419
           A NYVLL  +YA     +E +R++  ++ R +Q +PG  W+
Sbjct: 517 AGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWM 528

BLAST of CmaCh20G006590 vs. NCBI nr
Match: gi|778712721|ref|XP_004139961.2| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 [Cucumis sativus])

HSP 1 Score: 738.0 bits (1904), Expect = 9.0e-210
Identity = 371/424 (87.50%), Postives = 394/424 (92.92%), Query Frame = 1

Query: 1   MGSNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRP 60
           MGS+YLRLLSYTKQLSFYANHGNHEQ L+LF HMQASLAL LDAHVFSLVLKSCTA+RRP
Sbjct: 1   MGSSYLRLLSYTKQLSFYANHGNHEQTLSLFHHMQASLALGLDAHVFSLVLKSCTALRRP 60

Query: 61  LLGTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLY 120
            LG AIHAH+ KSSLLS+PFVACALVDMYGKS+SV LARKLFDEIP R+VV WN MLSLY
Sbjct: 61  HLGIAIHAHSAKSSLLSSPFVACALVDMYGKSLSVTLARKLFDEIPHRSVVVWNVMLSLY 120

Query: 121 THSNMLPDALRLFEAMDVPPNTSSFNPIIAGL--LDDGFKAISFYRKMQQCGLKPNLITL 180
            H+NML  AL+LFEAMDVPPN SSFN I+AGL  L+DGFKAI+FYR+MQQCGLKPNLITL
Sbjct: 121 VHANMLFGALQLFEAMDVPPNASSFNAIVAGLSKLEDGFKAIAFYRQMQQCGLKPNLITL 180

Query: 181 LALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHE 240
           LALLPASVGVA+LDLIKQIHGFAMRND+G H QLSSGLVEAYGRCGCL+YAHNVFD M E
Sbjct: 181 LALLPASVGVASLDLIKQIHGFAMRNDIGAHLQLSSGLVEAYGRCGCLSYAHNVFDNMTE 240

Query: 241 RDVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYY 300
           RDVVAWSSLISAHALHGEA TALNIFQQM  CKVQPDEITFIGVLKACS+VGLA+EAL Y
Sbjct: 241 RDVVAWSSLISAHALHGEASTALNIFQQMESCKVQPDEITFIGVLKACSHVGLANEALDY 300

Query: 301 FSRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYG 360
           F+RMQRDYGLQASSDHYSCLVD LSRAGRLHEAY+IIREMPVRVTAKAWGALLGACR YG
Sbjct: 301 FNRMQRDYGLQASSDHYSCLVDVLSRAGRLHEAYDIIREMPVRVTAKAWGALLGACRIYG 360

Query: 361 ELELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWV 420
           ELELAEIAGKALF+IEPENAANYVLL KMYASVGRHEEAQR+R EMKERRV+ VPGSSWV
Sbjct: 361 ELELAEIAGKALFEIEPENAANYVLLAKMYASVGRHEEAQRMRREMKERRVKVVPGSSWV 420

Query: 421 VYQD 423
           VYQD
Sbjct: 421 VYQD 424

BLAST of CmaCh20G006590 vs. NCBI nr
Match: gi|659094732|ref|XP_008448214.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 isoform X2 [Cucumis melo])

HSP 1 Score: 731.9 bits (1888), Expect = 6.5e-208
Identity = 369/424 (87.03%), Postives = 394/424 (92.92%), Query Frame = 1

Query: 1   MGSNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRP 60
           MGS+YLRL+SY+KQLSFYA HG+HEQ L+LF HMQASLALALDAHVFSLVLKSCTA+RRP
Sbjct: 1   MGSSYLRLVSYSKQLSFYAKHGSHEQTLSLFHHMQASLALALDAHVFSLVLKSCTALRRP 60

Query: 61  LLGTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLY 120
            LG AIHAH+ KSSLLS+PFVACALVDMYGKS+SV LARKLFDEIP R+VV WN MLSLY
Sbjct: 61  HLGIAIHAHSAKSSLLSSPFVACALVDMYGKSLSVSLARKLFDEIPHRSVVVWNVMLSLY 120

Query: 121 THSNMLPDALRLFEAMDVPPNTSSFNPIIAGL--LDDGFKAISFYRKMQQCGLKPNLITL 180
            HSNML  AL+LFEAMDVPPN SSFNPI+AGL  L+DGFKAI+FYR+MQQCGLKPNLITL
Sbjct: 121 VHSNMLFGALQLFEAMDVPPNASSFNPIVAGLSKLEDGFKAIAFYRQMQQCGLKPNLITL 180

Query: 181 LALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHE 240
           LALLPASVGVA+LDLIKQIHGFAMRND+G H QLSSGLVEAYGRCGCL+YAHNVFD M E
Sbjct: 181 LALLPASVGVASLDLIKQIHGFAMRNDIGGHLQLSSGLVEAYGRCGCLSYAHNVFDNMTE 240

Query: 241 RDVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYY 300
           RDVVAWSSLISAHALHGEA TALNIFQQM L KVQPD ITFIGVLKACS+VGLADEAL Y
Sbjct: 241 RDVVAWSSLISAHALHGEASTALNIFQQMELSKVQPDAITFIGVLKACSHVGLADEALDY 300

Query: 301 FSRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYG 360
           F+RMQRDYGLQASSDHYSCLVD LSRAGRLHEA++IIREMPVRVTAKAWGALLGACR YG
Sbjct: 301 FNRMQRDYGLQASSDHYSCLVDVLSRAGRLHEAHDIIREMPVRVTAKAWGALLGACRIYG 360

Query: 361 ELELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWV 420
           ELELAEIAGKALF+IEPENAANYVLL KMYASVGRHEEAQR+R EMKERRV+ VPGSSWV
Sbjct: 361 ELELAEIAGKALFEIEPENAANYVLLAKMYASVGRHEEAQRMRREMKERRVKVVPGSSWV 420

Query: 421 VYQD 423
           VYQD
Sbjct: 421 VYQD 424

BLAST of CmaCh20G006590 vs. NCBI nr
Match: gi|659094722|ref|XP_008448209.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 isoform X1 [Cucumis melo])

HSP 1 Score: 714.1 bits (1842), Expect = 1.4e-202
Identity = 360/413 (87.17%), Postives = 383/413 (92.74%), Query Frame = 1

Query: 12  TKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLLGTAIHAHAT 71
           +KQLSFYA HG+HEQ L+LF HMQASLALALDAHVFSLVLKSCTA+RRP LG AIHAH+ 
Sbjct: 19  SKQLSFYAKHGSHEQTLSLFHHMQASLALALDAHVFSLVLKSCTALRRPHLGIAIHAHSA 78

Query: 72  KSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDALR 131
           KSSLLS+PFVACALVDMYGKS+SV LARKLFDEIP R+VV WN MLSLY HSNML  AL+
Sbjct: 79  KSSLLSSPFVACALVDMYGKSLSVSLARKLFDEIPHRSVVVWNVMLSLYVHSNMLFGALQ 138

Query: 132 LFEAMDVPPNTSSFNPIIAGL--LDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVA 191
           LFEAMDVPPN SSFNPI+AGL  L+DGFKAI+FYR+MQQCGLKPNLITLLALLPASVGVA
Sbjct: 139 LFEAMDVPPNASSFNPIVAGLSKLEDGFKAIAFYRQMQQCGLKPNLITLLALLPASVGVA 198

Query: 192 ALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLIS 251
           +LDLIKQIHGFAMRND+G H QLSSGLVEAYGRCGCL+YAHNVFD M ERDVVAWSSLIS
Sbjct: 199 SLDLIKQIHGFAMRNDIGGHLQLSSGLVEAYGRCGCLSYAHNVFDNMTERDVVAWSSLIS 258

Query: 252 AHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQ 311
           AHALHGEA TALNIFQQM L KVQPD ITFIGVLKACS+VGLADEAL YF+RMQRDYGLQ
Sbjct: 259 AHALHGEASTALNIFQQMELSKVQPDAITFIGVLKACSHVGLADEALDYFNRMQRDYGLQ 318

Query: 312 ASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGELELAEIAGKA 371
           ASSDHYSCLVD LSRAGRLHEA++IIREMPVRVTAKAWGALLGACR YGELELAEIAGKA
Sbjct: 319 ASSDHYSCLVDVLSRAGRLHEAHDIIREMPVRVTAKAWGALLGACRIYGELELAEIAGKA 378

Query: 372 LFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVVYQD 423
           LF+IEPENAANYVLL KMYASVGRHEEAQR+R EMKERRV+ VPGSSWVVYQD
Sbjct: 379 LFEIEPENAANYVLLAKMYASVGRHEEAQRMRREMKERRVKVVPGSSWVVYQD 431

BLAST of CmaCh20G006590 vs. NCBI nr
Match: gi|659094738|ref|XP_008448217.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 isoform X3 [Cucumis melo])

HSP 1 Score: 680.6 bits (1755), Expect = 1.7e-192
Identity = 344/391 (87.98%), Postives = 364/391 (93.09%), Query Frame = 1

Query: 34  MQASLALALDAHVFSLVLKSCTAIRRPLLGTAIHAHATKSSLLSNPFVACALVDMYGKSM 93
           MQASLALALDAHVFSLVLKSCTA+RRP LG AIHAH+ KSSLLS+PFVACALVDMYGKS+
Sbjct: 1   MQASLALALDAHVFSLVLKSCTALRRPHLGIAIHAHSAKSSLLSSPFVACALVDMYGKSL 60

Query: 94  SVPLARKLFDEIPQRTVVAWNTMLSLYTHSNMLPDALRLFEAMDVPPNTSSFNPIIAGL- 153
           SV LARKLFDEIP R+VV WN MLSLY HSNML  AL+LFEAMDVPPN SSFNPI+AGL 
Sbjct: 61  SVSLARKLFDEIPHRSVVVWNVMLSLYVHSNMLFGALQLFEAMDVPPNASSFNPIVAGLS 120

Query: 154 -LDDGFKAISFYRKMQQCGLKPNLITLLALLPASVGVAALDLIKQIHGFAMRNDLGCHPQ 213
            L+DGFKAI+FYR+MQQCGLKPNLITLLALLPASVGVA+LDLIKQIHGFAMRND+G H Q
Sbjct: 121 KLEDGFKAIAFYRQMQQCGLKPNLITLLALLPASVGVASLDLIKQIHGFAMRNDIGGHLQ 180

Query: 214 LSSGLVEAYGRCGCLNYAHNVFDKMHERDVVAWSSLISAHALHGEARTALNIFQQMALCK 273
           LSSGLVEAYGRCGCL+YAHNVFD M ERDVVAWSSLISAHALHGEA TALNIFQQM L K
Sbjct: 181 LSSGLVEAYGRCGCLSYAHNVFDNMTERDVVAWSSLISAHALHGEASTALNIFQQMELSK 240

Query: 274 VQPDEITFIGVLKACSYVGLADEALYYFSRMQRDYGLQASSDHYSCLVDALSRAGRLHEA 333
           VQPD ITFIGVLKACS+VGLADEAL YF+RMQRDYGLQASSDHYSCLVD LSRAGRLHEA
Sbjct: 241 VQPDAITFIGVLKACSHVGLADEALDYFNRMQRDYGLQASSDHYSCLVDVLSRAGRLHEA 300

Query: 334 YEIIREMPVRVTAKAWGALLGACRNYGELELAEIAGKALFDIEPENAANYVLLGKMYASV 393
           ++IIREMPVRVTAKAWGALLGACR YGELELAEIAGKALF+IEPENAANYVLL KMYASV
Sbjct: 301 HDIIREMPVRVTAKAWGALLGACRIYGELELAEIAGKALFEIEPENAANYVLLAKMYASV 360

Query: 394 GRHEEAQRLRMEMKERRVQAVPGSSWVVYQD 423
           GRHEEAQR+R EMKERRV+ VPGSSWVVYQD
Sbjct: 361 GRHEEAQRMRREMKERRVKVVPGSSWVVYQD 391

BLAST of CmaCh20G006590 vs. NCBI nr
Match: gi|1021559025|ref|XP_016170805.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 [Arachis ipaensis])

HSP 1 Score: 601.7 bits (1550), Expect = 1.0e-168
Identity = 298/423 (70.45%), Postives = 363/423 (85.82%), Query Frame = 1

Query: 3   SNYLRLLSYTKQLSFYANHGNHEQALALFLHMQASLALALDAHVFSLVLKSCTAIRRPLL 62
           +N  RLLS+TK +S + + G HE+ALA+F HM +SLAL LDAHVFSL+LKSCTA+ RP L
Sbjct: 6   TNLTRLLSFTKHISSHISGGRHEEALAVFRHMHSSLALTLDAHVFSLILKSCTALNRPQL 65

Query: 63  GTAIHAHATKSSLLSNPFVACALVDMYGKSMSVPLARKLFDEIPQRTVVAWNTMLSLYTH 122
           GT+IHAH +K+S LSN FVA ALVD+YGK +SV  AR+LFDEIP R VV WN M+SLY+H
Sbjct: 66  GTSIHAHVSKASFLSNQFVASALVDLYGKCVSVSSARQLFDEIPSRNVVVWNAMISLYSH 125

Query: 123 SNMLPDALRLFEAMDVPPNTSSFNPIIAGL--LDDG-FKAISFYRKMQQCGLKPNLITLL 182
           S+ + +AL+LF+AMD+ PN S+FN IIAG+  LDDG FKAI+FYRKM + GLKP+LITLL
Sbjct: 126 SHEVGNALQLFQAMDIMPNESTFNSIIAGMARLDDGSFKAIAFYRKMIELGLKPHLITLL 185

Query: 183 ALLPASVGVAALDLIKQIHGFAMRNDLGCHPQLSSGLVEAYGRCGCLNYAHNVFDKMHER 242
           ALLPA+VGVAAL+LIK+IHG+A+RND+  HPQLSSGLVEAYGR GCL ++H+VF +M ER
Sbjct: 186 ALLPAAVGVAALNLIKEIHGYAIRNDIDSHPQLSSGLVEAYGRSGCLMHSHDVFWRMKER 245

Query: 243 DVVAWSSLISAHALHGEARTALNIFQQMALCKVQPDEITFIGVLKACSYVGLADEALYYF 302
           D+VAWSSLISA+ALHG AR AL IF+QM   KVQPD ITF+GVLKACS+ GLADEALYYF
Sbjct: 246 DLVAWSSLISAYALHGRARDALEIFKQMEFAKVQPDGITFLGVLKACSHAGLADEALYYF 305

Query: 303 SRMQRDYGLQASSDHYSCLVDALSRAGRLHEAYEIIREMPVRVTAKAWGALLGACRNYGE 362
           +RM +D+G++ +SDHYSCLVD LSRAGRLHEAY++I+EMPV+VTAKAWGALLGACRN+G+
Sbjct: 306 TRMYKDFGVEPNSDHYSCLVDVLSRAGRLHEAYQVIQEMPVKVTAKAWGALLGACRNFGD 365

Query: 363 LELAEIAGKALFDIEPENAANYVLLGKMYASVGRHEEAQRLRMEMKERRVQAVPGSSWVV 422
           LELAEIA +AL +IEP+N ANYVLL KMYASVGR EEA+++RM MKE+ V+A  GSSWVV
Sbjct: 366 LELAEIAARALSEIEPDNPANYVLLAKMYASVGRQEEAEKIRMLMKEQGVKATTGSSWVV 425

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PPR7_ARATH3.7e-15565.01Putative pentatricopeptide repeat-containing protein At1g03510 OS=Arabidopsis th... [more]
PP165_ARATH3.6e-7337.04Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana GN... [more]
PP285_ARATH3.3e-7135.11Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop... [more]
PPR53_ARATH4.4e-7135.34Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana GN... [more]
PP265_ARATH1.7e-7036.41Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0KE48_CUCSA6.3e-21087.50Uncharacterized protein OS=Cucumis sativus GN=Csa_6G124020 PE=4 SV=1[more]
E5GBQ5_CUCME1.2e-19287.98Pentatricopeptide (PPR) repeat-containing protein OS=Cucumis melo subsp. melo PE... [more]
A0A061F553_THECC1.2e-16871.87Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TC... [more]
D7SNQ6_VITVI6.8e-16470.75Putative uncharacterized protein OS=Vitis vinifera GN=VIT_05s0029g00490 PE=4 SV=... [more]
B9RB17_RICCO8.9e-16469.52Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
Match NameE-valueIdentityDescription
AT1G03510.12.1e-15665.01 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G20540.12.0e-7437.04 mitochondrial editing factor 21[more]
AT3G57430.11.9e-7235.11 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G20230.12.5e-7235.34 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G46790.19.3e-7236.41 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778712721|ref|XP_004139961.2|9.0e-21087.50PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 [Cucum... [more]
gi|659094732|ref|XP_008448214.1|6.5e-20887.03PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 isofor... [more]
gi|659094722|ref|XP_008448209.1|1.4e-20287.17PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 isofor... [more]
gi|659094738|ref|XP_008448217.1|1.7e-19287.98PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 isofor... [more]
gi|1021559025|ref|XP_016170805.1|1.0e-16870.45PREDICTED: putative pentatricopeptide repeat-containing protein At1g03510 [Arach... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh20G006590.1CmaCh20G006590.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 111..136
score: 4.1E-4coord: 386..408
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 308..338
score: 1.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 240..286
score: 2.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 315..338
score: 1.6E-4coord: 242..276
score: 5.0E-4coord: 216..242
score: 0
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 141..173
score: 5.196coord: 109..139
score: 7.794coord: 377..411
score: 8.374coord: 209..239
score: 6.654coord: 7..37
score: 6.719coord: 240..274
score: 9.778coord: 78..108
score: 6.171coord: 311..341
score: 9.076coord: 275..305
score:
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 100..134
score: 9.9E-4coord: 236..328
score: 9.9E-4coord: 18..35
score: 9.9E-4coord: 189..198
score: 9.9E-4coord: 329..399
score: 7.
IPR011990Tetratricopeptide-like helical domainunknownSSF48452TPR-likecoord: 227..398
score: 7.8
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 8..418
score: 9.9E
NoneNo IPR availablePANTHERPTHR24015:SF630SUBFAMILY NOT NAMEDcoord: 8..418
score: 9.9E

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CmaCh20G006590CmoCh20G005890Cucurbita moschata (Rifu)cmacmoB552
CmaCh20G006590Lsi10G008640Bottle gourd (USVL1VR-Ls)cmalsiB505
CmaCh20G006590Cp4.1LG16g04830Cucurbita pepo (Zucchini)cmacpeB562
CmaCh20G006590MELO3C034763.2Melon (DHL92) v3.6.1cmamedB586
CmaCh20G006590Carg03164Silver-seed gourdcarcmaB0472
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CmaCh20G006590Watermelon (97103) v1cmawmB525