Cp4.1LG10g06870 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG10g06870
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG10: 28318 .. 29628 (+)
RNA-Seq Expression: Cp4.1LG10g06870
Synteny: Cp4.1LG10g06870
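
The Location field can be checked against the sequences listed further down. The sketch below parses the location string and derives the feature length, assuming the coordinates are 1-based and inclusive (an assumption, although it agrees with the 436-residue protein reported in the BLAST alignments). The function names `parse_location` and `feature_length` are illustrative and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG10: 28318 .. 29628 (+)")
length = feature_length(start, end)
# 1311 nt = 437 codons; dropping the stop codon leaves 436 residues.
print(seqid, strand, length, length // 3 - 1)
```

Under that assumption the gene spans 1311 nt, which matches an intron-less CDS encoding 436 amino acids plus a stop codon.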
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTCTTAAGTCGCTCCTCTCACTTGCTCAATCCTGCTCGCCGTCAATTTGCTTTGTTCGCCAAAGGCTTCAACTCTTGGGCTTTACGCATTCGAAATGCTCCCTCCCTTCATAAAGCTCTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCTCACGACAGCTTCTCAATTCTGTTTGTGCTCAAAGCCTGCGCTCGTTCCAACAACCTCTCCATTCTTCACCATCTTCATGCCCATATTACTAAACTTGGTTTCACTACCCATGTCTTTGTCGCTACATCCCTACTCTATGCGTACGTCCTGAACTCCTTTGAACTTGCTTGTTTGTTGTTCGATGAAATGCCCCACAAAAACACTGTTACCTGGAACACTATGATTTTCGGGTATTCCAAGACAGGGGATGTAGACAGAGCTCGCCAACTGTTTGATCTGATGCCATCAAAAGATTTGGCATCTTGGTCCGCCACGATTGCTGCATACGTTAACAACCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGTTATTGGAATAACTCCCGACCAGATGGCGGTAGGTTCAATCTTAAAAGGGTGTGCTTATATGGGCTCTTTAGGATTGTTAGCTGGCAAATCAGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTAAACCTGGAACTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTTAAGTACGCTTGCCAGGTATTTCATTTGATGTCTGAAAAGAACGTCAGGACCTGGACTGCTCTGATATGTGGATTGGCACAGCATGGCTACTGCAAGGAGGCGTTGGATTTATTTGAGATGATGAGGAATGAATGTGTGGAACCGAATGAATTGACTTTCACTGGGATTTTAAGTGCTTGTGTTCATGCAGGATTTGTTCAAGAAGGCCGCAAATATTTCAACATGATTGAAGAATATGGCTTAGAAACAAGGATTCAACATTATGGTTGCATGGTTGATCTGCTGGGTAGGTCGGGATTGTTGGAGGAAGCTTATGGGGTTATTAAGAATATGAGACTCGAACCTAATATCATTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACATAAAAGCTTTGACATGGCTGAGAGAGTTATTGAGCAGATACTGGACAAGTTAGAACCCGAGAATCATGGTGGAATTTACTCTCTTATATCTGATTTGTATGTTCTAGAGGAAAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAACCAAAATGTGCGGAAGGTTAGGGCGTATAGCCTTATCAGAAGTGGATTATAG

mRNA sequence

ATGCTCTTAAGTCGCTCCTCTCACTTGCTCAATCCTGCTCGCCGTCAATTTGCTTTGTTCGCCAAAGGCTTCAACTCTTGGGCTTTACGCATTCGAAATGCTCCCTCCCTTCATAAAGCTCTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCTCACGACAGCTTCTCAATTCTGTTTGTGCTCAAAGCCTGCGCTCGTTCCAACAACCTCTCCATTCTTCACCATCTTCATGCCCATATTACTAAACTTGGTTTCACTACCCATGTCTTTGTCGCTACATCCCTACTCTATGCGTACGTCCTGAACTCCTTTGAACTTGCTTGTTTGTTGTTCGATGAAATGCCCCACAAAAACACTGTTACCTGGAACACTATGATTTTCGGGTATTCCAAGACAGGGGATGTAGACAGAGCTCGCCAACTGTTTGATCTGATGCCATCAAAAGATTTGGCATCTTGGTCCGCCACGATTGCTGCATACGTTAACAACCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGTTATTGGAATAACTCCCGACCAGATGGCGGTAGGTTCAATCTTAAAAGGGTGTGCTTATATGGGCTCTTTAGGATTGTTAGCTGGCAAATCAGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTAAACCTGGAACTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTTAAGTACGCTTGCCAGGTATTTCATTTGATGTCTGAAAAGAACGTCAGGACCTGGACTGCTCTGATATGTGGATTGGCACAGCATGGCTACTGCAAGGAGGCGTTGGATTTATTTGAGATGATGAGGAATGAATGTGTGGAACCGAATGAATTGACTTTCACTGGGATTTTAAGTGCTTGTGTTCATGCAGGATTTGTTCAAGAAGGCCGCAAATATTTCAACATGATTGAAGAATATGGCTTAGAAACAAGGATTCAACATTATGGTTGCATGGTTGATCTGCTGGGTAGGTCGGGATTGTTGGAGGAAGCTTATGGGGTTATTAAGAATATGAGACTCGAACCTAATATCATTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACATAAAAGCTTTGACATGGCTGAGAGAGTTATTGAGCAGATACTGGACAAGTTAGAACCCGAGAATCATGGTGGAATTTACTCTCTTATATCTGATTTGTATGTTCTAGAGGAAAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAACCAAAATGTGCGGAAGGTTAGGGCGTATAGCCTTATCAGAAGTGGATTATAG

Coding sequence (CDS)

ATGCTCTTAAGTCGCTCCTCTCACTTGCTCAATCCTGCTCGCCGTCAATTTGCTTTGTTCGCCAAAGGCTTCAACTCTTGGGCTTTACGCATTCGAAATGCTCCCTCCCTTCATAAAGCTCTTGCTATCTACTCCCAGATGCACCGCCAATCCGTTCCTCACGACAGCTTCTCAATTCTGTTTGTGCTCAAAGCCTGCGCTCGTTCCAACAACCTCTCCATTCTTCACCATCTTCATGCCCATATTACTAAACTTGGTTTCACTACCCATGTCTTTGTCGCTACATCCCTACTCTATGCGTACGTCCTGAACTCCTTTGAACTTGCTTGTTTGTTGTTCGATGAAATGCCCCACAAAAACACTGTTACCTGGAACACTATGATTTTCGGGTATTCCAAGACAGGGGATGTAGACAGAGCTCGCCAACTGTTTGATCTGATGCCATCAAAAGATTTGGCATCTTGGTCCGCCACGATTGCTGCATACGTTAACAACCGCAATTACAGGGGTGGTTTGCTTCTTTTCCAAGATATGATAGTTATTGGAATAACTCCCGACCAGATGGCGGTAGGTTCAATCTTAAAAGGGTGTGCTTATATGGGCTCTTTAGGATTGTTAGCTGGCAAATCAGTTCATGGTTTTGTGGTCAAGAATAGGTGGGAACTAAACCTGGAACTTGGTACAGTTTTGGTTGATATGTACGCCAAGTGTGGATTTTTTAAGTACGCTTGCCAGGTATTTCATTTGATGTCTGAAAAGAACGTCAGGACCTGGACTGCTCTGATATGTGGATTGGCACAGCATGGCTACTGCAAGGAGGCGTTGGATTTATTTGAGATGATGAGGAATGAATGTGTGGAACCGAATGAATTGACTTTCACTGGGATTTTAAGTGCTTGTGTTCATGCAGGATTTGTTCAAGAAGGCCGCAAATATTTCAACATGATTGAAGAATATGGCTTAGAAACAAGGATTCAACATTATGGTTGCATGGTTGATCTGCTGGGTAGGTCGGGATTGTTGGAGGAAGCTTATGGGGTTATTAAGAATATGAGACTCGAACCTAATATCATTGTGTGGAGCTCTCTTTTGTCGGCCTGTAAGCAACATAAAAGCTTTGACATGGCTGAGAGAGTTATTGAGCAGATACTGGACAAGTTAGAACCCGAGAATCATGGTGGAATTTACTCTCTTATATCTGATTTGTATGTTCTAGAGGAAAAGTGGGATGATGCAGAAAAGATAAGGAATTTACTGAACCAAAATGTGCGGAAGGTTAGGGCGTATAGCCTTATCAGAAGTGGATTATAG

Protein sequence

MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSILFVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKNTVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQNVRKVRAYSLIRSGL
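
The relationship between the coding sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the standard nuclear genetic code and uses only the first 30 and last 15 nucleotides of the CDS as input; the helper name `translate` is illustrative.

```python
# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCTCTTAAGTCGCTCCTCTCACTTGCTC"  # first 30 nt of the CDS
cds_end = "AGAAGTGGATTATAG"                   # last 15 nt of the CDS
print(translate(cds_start))  # MLLSRSSHLL
print(translate(cds_end))    # RSGL
```

Both fragments reproduce the corresponding ends of the listed protein sequence (`MLLSRSSHLL…` and `…RSGL`).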
Homology
BLAST of Cp4.1LG10g06870 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 305.4 bits (781), Expect = 1.0e-81
Identity = 162/412 (39.32%), Positives = 253/412 (61.41%), Query Frame = 0

Query: 27  WALRIRN---APSLHKALAIYSQMHRQSVPHDSFSILFVLKACARSNNLSILHHLHAHIT 86
           W L IR    +    ++L +Y +M   S PH++++   +LKAC+  +       +HA IT
Sbjct: 83  WNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQIT 142

Query: 87  KLGFTTHVFVATSLLYAY-VLNSFELACLLFDEMPHKNTVTWNTMIFGYSKTGDVDRARQ 146
           KLG+   V+   SL+ +Y V  +F+LA LLFD +P  + V+WN++I GY K G +D A  
Sbjct: 143 KLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALT 202

Query: 147 LFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQMAVGSILKGCAYMGS 206
           LF  M  K+  SW+  I+ YV     +  L LF +M    + PD +++ + L  CA +G+
Sbjct: 203 LFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGA 262

Query: 207 LGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFHLMSEKNVRTWTALI 266
           L    GK +H ++ K R  ++  LG VL+DMYAKCG  + A +VF  + +K+V+ WTALI
Sbjct: 263 LE--QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 322

Query: 267 CGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQEGRK-YFNMIEEYGL 326
            G A HG+ +EA+  F  M+   ++PN +TFT +L+AC + G V+EG+  +++M  +Y L
Sbjct: 323 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 382

Query: 327 ETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLSACKQHKSFDMAERVIE 386
           +  I+HYGC+VDLLGR+GLL+EA   I+ M L+PN ++W +LL AC+ HK+ ++ E  I 
Sbjct: 383 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE-IG 442

Query: 387 QILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLL-NQNVRKVRAYSLI 433
           +IL  ++P  HGG Y   ++++ +++KWD A + R L+  Q V KV   S I
Sbjct: 443 EILIAIDP-YHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490
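
The percentages in each HSP summary line follow directly from the reported counts over the alignment length. The sketch below parses one such line and recomputes them. The regular expression and the names `parse_hsp_stats` and `percent` are illustrative assumptions, not part of any BLAST tooling, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 162/412 (39.32%), Positives = 253/412 (61.41%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract identity and positive counts from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }

def percent(count, total):
    # Percentage rounded to two decimals, as displayed in the report.
    return round(100.0 * count / total, 2)

stats = parse_hsp_stats(
    "Identity = 162/412 (39.32%), Positives = 253/412 (61.41%), Query Frame = 0"
)
print(percent(stats["identical"], stats["length"]))  # 39.32
print(percent(stats["positives"], stats["length"]))  # 61.41
```
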

BLAST of Cp4.1LG10g06870 vs. ExPASy Swiss-Prot
Match: Q1PEU4 (Pentatricopeptide repeat-containing protein At2g44880 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E9 PE=2 SV=2)

HSP 1 Score: 278.9 bits (712), Expect = 1.0e-73
Identity = 136/315 (43.17%), Positives = 205/315 (65.08%), Query Frame = 0

Query: 112 LFDEMPHKNTVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGG 171
           LFDEM HK  +TW TMI GY    D+D AR+LFD MP ++L SW+  I  Y  N+  + G
Sbjct: 198 LFDEMTHKTVITWTTMIHGYCNIKDIDAARKLFDAMPERNLVSWNTMIGGYCQNKQPQEG 257

Query: 172 LLLFQDM-IVIGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVL 231
           + LFQ+M     + PD + + S+L   +  G+L L  G+  H FV + + +  +++ T +
Sbjct: 258 IRLFQEMQATTSLDPDDVTILSVLPAISDTGALSL--GEWCHCFVQRKKLDKKVKVCTAI 317

Query: 232 VDMYAKCGFFKYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNE 291
           +DMY+KCG  + A ++F  M EK V +W A+I G A +G  + ALDLF  M  E  +P+E
Sbjct: 318 LDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIHGYALNGNARAALDLFVTMMIE-EKPDE 377

Query: 292 LTFTGILSACVHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKN 351
           +T   +++AC H G V+EGRK+F+++ E GL  +I+HYGCMVDLLGR+G L+EA  +I N
Sbjct: 378 ITMLAVITACNHGGLVEEGRKWFHVMREMGLNAKIEHYGCMVDLLGRAGSLKEAEDLITN 437

Query: 352 MRLEPNIIVWSSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWD 411
           M  EPN I+ SS LSAC Q+K  + AER++++ ++ LEP+N G  Y L+ +LY  +++WD
Sbjct: 438 MPFEPNGIILSSFLSACGQYKDIERAERILKKAVE-LEPQNDGN-YVLLRNLYAADKRWD 497

Query: 412 DAEKIRNLLNQNVRK 426
           D   ++N++ +N  K
Sbjct: 498 DFGMVKNVMRKNQAK 507

BLAST of Cp4.1LG10g06870 vs. ExPASy Swiss-Prot
Match: Q9FLS9 (Pentatricopeptide repeat-containing protein At5g61800 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E8 PE=2 SV=1)

HSP 1 Score: 277.7 bits (709), Expect = 2.2e-73
Identity = 145/410 (35.37%), Positives = 237/410 (57.80%), Query Frame = 0

Query: 35  PSLHKALAIYSQMHRQSVPHDSFSILFVLKACARSNN--LSILHHLHAHITKLGFTTHVF 94
           PS   +   + +M R+SVP D  +  FV KACA   N  L+++  LH    + G  + +F
Sbjct: 94  PSSLSSKRFFVEMRRRSVPPDFHTFPFVFKACAAKKNGDLTLVKTLHCQALRFGLLSDLF 153

Query: 95  VATSLLYAY-VLNSFELACLLFDEMPHKNTVTWNTMIFGYSKTGDVDRARQLFDLMPSKD 154
              +L+  Y ++   + A  LFDE P ++ VT+N +I G  K  ++ RAR+LFD MP +D
Sbjct: 154 TLNTLIRVYSLIAPIDSALQLFDENPQRDVVTYNVLIDGLVKAREIVRARELFDSMPLRD 213

Query: 155 LASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQMAVGSILKGCAYMGSLGLLAGKSV 214
           L SW++ I+ Y    + R  + LF +M+ +G+ PD +A+ S L  CA  G      GK++
Sbjct: 214 LVSWNSLISGYAQMNHCREAIKLFDEMVALGLKPDNVAIVSTLSACAQSGD--WQKGKAI 273

Query: 215 HGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFHLMSEKNVRTWTALICGLAQHGYC 274
           H +  + R  ++  L T LVD YAKCGF   A ++F L S+K + TW A+I GLA HG  
Sbjct: 274 HDYTKRKRLFIDSFLATGLVDFYAKCGFIDTAMEIFELCSDKTLFTWNAMITGLAMHGNG 333

Query: 275 KEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQEGRKYFNMIEE-YGLETRIQHYGC 334
           +  +D F  M +  ++P+ +TF  +L  C H+G V E R  F+ +   Y +   ++HYGC
Sbjct: 334 ELTVDYFRKMVSSGIKPDGVTFISVLVGCSHSGLVDEARNLFDQMRSLYDVNREMKHYGC 393

Query: 335 MVDLLGRSGLLEEAYGVIKNMRLE----PNIIVWSSLLSACKQHKSFDMAERVIEQILDK 394
           M DLLGR+GL+EEA  +I+ M  +      ++ WS LL  C+ H + ++AE+   ++   
Sbjct: 394 MADLLGRAGLIEEAAEMIEQMPKDGGNREKLLAWSGLLGGCRIHGNIEIAEKAANRV-KA 453

Query: 395 LEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQN--VRKVRAYSLIRS 435
           L PE+ GG+Y ++ ++Y   E+W++  K+R +++++  V+K   +S + S
Sbjct: 454 LSPED-GGVYKVMVEMYANAERWEEVVKVREIIDRDKKVKKNVGFSKVLS 499

BLAST of Cp4.1LG10g06870 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 1.1e-72
Identity = 153/429 (35.66%), Positives = 252/429 (58.74%), Query Frame = 0

Query: 7   SHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSILFVLKAC 66
           S + NP    F L  + F++ A      PS  KA   Y+QM +  +  D+ +  F++KA 
Sbjct: 75  SQIQNPNLFVFNLLIRCFSTGA-----EPS--KAFGFYTQMLKSRIWPDNITFPFLIKAS 134

Query: 67  ARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELAC-LLFDEMPHKNTVTWN 126
           +    + +    H+ I + GF   V+V  SL++ Y    F  A   +F +M  ++ V+W 
Sbjct: 135 SEMECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWT 194

Query: 127 TMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITP 186
           +M+ GY K G V+ AR++FD MP ++L +WS  I  Y  N  +   + LF+ M   G+  
Sbjct: 195 SMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVA 254

Query: 187 DQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQ 246
           ++  + S++  CA++G+L    G+  + +VVK+   +NL LGT LVDM+ +CG  + A  
Sbjct: 255 NETVMVSVISSCAHLGALEF--GERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIH 314

Query: 247 VFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGF 306
           VF  + E +  +W+++I GLA HG+  +A+  F  M +    P ++TFT +LSAC H G 
Sbjct: 315 VFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGL 374

Query: 307 VQEGRK-YFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLL 366
           V++G + Y NM +++G+E R++HYGC+VD+LGR+G L EA   I  M ++PN  +  +LL
Sbjct: 375 VEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALL 434

Query: 367 SACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQN-V 426
            ACK +K+ ++AERV   +L K++PE H G Y L+S++Y    +WD  E +R+++ +  V
Sbjct: 435 GACKIYKNTEVAERV-GNMLIKVKPE-HSGYYVLLSNIYACAGQWDKIESLRDMMKEKLV 492

Query: 427 RKVRAYSLI 433
           +K   +SLI
Sbjct: 495 KKPPGWSLI 492

BLAST of Cp4.1LG10g06870 vs. ExPASy Swiss-Prot
Match: Q56X05 (Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX=3702 GN=EMB1444 PE=2 SV=2)

HSP 1 Score: 273.1 bits (697), Expect = 5.5e-72
Identity = 145/416 (34.86%), Positives = 233/416 (56.01%), Query Frame = 0

Query: 39  KALAIYSQMHRQSVPHDSFSILFVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLL 98
           ++L +Y +M R SV   S++   ++KA + ++       L AHI K GF  HV + T+L+
Sbjct: 109 RSLELYVRMLRDSVSPSSYTYSSLVKASSFASRFG--ESLQAHIWKFGFGFHVKIQTTLI 168

Query: 99  YAY-VLNSFELACLLFDEMPHKNTVTWNTM------------------------------ 158
             Y        A  +FDEMP ++ + W TM                              
Sbjct: 169 DFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNC 228

Query: 159 -IFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPD 218
            I GY   G++++A  LF+ MP KD+ SW+  I  Y  N+ YR  + +F  M+  GI PD
Sbjct: 229 LINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPD 288

Query: 219 QMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQV 278
           ++ + +++  CA++G L +  GK VH + ++N + L++ +G+ LVDMY+KCG  + A  V
Sbjct: 289 EVTMSTVISACAHLGVLEI--GKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLV 348

Query: 279 FHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFV 338
           F  + +KN+  W ++I GLA HG+ +EAL +F  M  E V+PN +TF  + +AC HAG V
Sbjct: 349 FFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLV 408

Query: 339 QEGRK-YFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLS 398
            EGR+ Y +MI++Y + + ++HYG MV L  ++GL+ EA  +I NM  EPN ++W +LL 
Sbjct: 409 DEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLD 468

Query: 399 ACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQ 422
            C+ HK+  +AE    +++  LEP N  G Y L+  +Y  + +W D  +IR  + +
Sbjct: 469 GCRIHKNLVIAEIAFNKLM-VLEPMN-SGYYFLLVSMYAEQNRWRDVAEIRGRMRE 518

BLAST of Cp4.1LG10g06870 vs. NCBI nr
Match: XP_023544798.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 890 bits (2300), Expect = 0.0
Identity = 436/436 (100.00%), Positives = 436/436 (100.00%), Query Frame = 0

Query: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60
           MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120
           FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180
           TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240
           IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300
           KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360
           VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420
           SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVRKVRAYSLIRSGL 436
           QNVRKVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cp4.1LG10g06870 vs. NCBI nr
Match: XP_022925591.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata] >KAG6581530.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 885 bits (2288), Expect = 0.0
Identity = 434/436 (99.54%), Positives = 435/436 (99.77%), Query Frame = 0

Query: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60
           MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSL KALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLQKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120
           FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180
           TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240
           IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRW+LNLELGTVLVDMYAKCGFF
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWKLNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300
           KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360
           VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420
           SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVRKVRAYSLIRSGL 436
           QNVRKVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cp4.1LG10g06870 vs. NCBI nr
Match: XP_022978962.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima])

HSP 1 Score: 877 bits (2266), Expect = 0.0
Identity = 429/436 (98.39%), Positives = 433/436 (99.31%), Query Frame = 0

Query: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60
           MLLSRSSHLLNPARRQFAL AKGFNSWALRIRNAPSL+KALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALSAKGFNSWALRIRNAPSLNKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120
           FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180
           TVTWNTMIFGYSKTGDVDRARQLFDLMPS+DLASWSATIAAYVNNRNYRGGLLLFQDMIV
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSRDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240
           IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300
           KYACQVFHLMSEKNVRTWTALICGLA+HGYCKEALDLFEMMRNECVEPNELTFTGILSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAKHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360
           VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVI NMRLEPNIIVW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVINNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420
           SSLLSACKQHKSFDMAERVIEQILDK EPENHGG+YSLISDLYVLEEKWDDAEKIRNLLN
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKSEPENHGGVYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVRKVRAYSLIRSGL 436
           QNVRKVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cp4.1LG10g06870 vs. NCBI nr
Match: XP_038877521.1 (pentatricopeptide repeat-containing protein At5g66520-like [Benincasa hispida])

HSP 1 Score: 796 bits (2055), Expect = 3.20e-289
Identity = 386/436 (88.53%), Positives = 409/436 (93.81%), Query Frame = 0

Query: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60
           ML++ S HLL PA RQFAL AKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLVNLSCHLLKPACRQFALSAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120
           F+LKACARSNN+S+LHHLHAHITKLGFT HVFVATSLLYAYVL+S +LACL+FDEMPHK+
Sbjct: 61  FMLKACARSNNVSVLHHLHAHITKLGFTAHVFVATSLLYAYVLHSIQLACLVFDEMPHKS 120

Query: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180
           TVTWNTMI  YSKTGDVD ARQLFD MPS+DLASWS+ I AYVNNRNYR GLL+FQDMIV
Sbjct: 121 TVTWNTMILRYSKTGDVDAARQLFDQMPSRDLASWSSMITAYVNNRNYRAGLLIFQDMIV 180

Query: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240
            GI+PDQ+A+GSIL GCA+MGSLGLLAGKSVHGFVVKNRWELNL+LGTVLVDMYAKCGF 
Sbjct: 181 NGISPDQIAIGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLQLGTVLVDMYAKCGFL 240

Query: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300
           KYACQ+FH MSEKNVRTWTALICG+AQHGY KEAL LFE MR E VEPNELTFTG+LSAC
Sbjct: 241 KYACQIFHFMSEKNVRTWTALICGMAQHGYGKEALLLFETMRREGVEPNELTFTGVLSAC 300

Query: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360
           VHAGFV+EGRKYFNMIEEYGLE RIQHYGCMVDLLGRSGLLEEAYGVIK+MRLEPN+IVW
Sbjct: 301 VHAGFVEEGRKYFNMIEEYGLEIRIQHYGCMVDLLGRSGLLEEAYGVIKSMRLEPNVIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420
           SSLLSACKQHK FDMAERVIEQIL K EPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN
Sbjct: 361 SSLLSACKQHKRFDMAERVIEQILKKTEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVRKVRAYSLIRSGL 436
           QNVRKVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cp4.1LG10g06870 vs. NCBI nr
Match: KAA0052417.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 768 bits (1984), Expect = 1.88e-278
Identity = 371/428 (86.68%), Positives = 396/428 (92.52%), Query Frame = 0

Query: 9   LLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSILFVLKACAR 68
           LLNP+ RQFA  AKGFNSWALRIRNAPSLHKALAI+SQMHRQSVPHDSFSILF+LKACA 
Sbjct: 6   LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSILFMLKACAS 65

Query: 69  SNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKNTVTWNTMI 128
           SNNLSILHHLHAHITKLGFTTHVFVATSLL++YVL+SF+LA L+FDEMPHKN+VTWNTMI
Sbjct: 66  SNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKNSVTWNTMI 125

Query: 129 FGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQM 188
            GYSKTGDV  ARQLFD MPS+DLASWSA IAAY+NNRNYRG LLLFQDMI+ GI PDQM
Sbjct: 126 SGYSKTGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRGALLLFQDMIINGINPDQM 185

Query: 189 AVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFH 248
           A GSIL GCA+MGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYA+CG  KYACQ+FH
Sbjct: 186 AAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYARCGLLKYACQIFH 245

Query: 249 LMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQE 308
           LMSE+NVRTWTALICGLA HG CKEAL LFE MR+E VEPNELTFTG+LSACVHAG VQE
Sbjct: 246 LMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSACVHAGLVQE 305

Query: 309 GRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLSACK 368
           GRKYFNMIEEYGLE RIQHYGC VDLLGRSGLLEEAYGVIK+MR EPN+IVWSSLLSACK
Sbjct: 306 GRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVWSSLLSACK 365

Query: 369 QHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQNVRKVRA 428
           QHKSFD+AERVIEQIL+K EP NHGG+YSL+SDLYVL+EKWDDAE IRNLLNQ VRKVRA
Sbjct: 366 QHKSFDLAERVIEQILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLNQKVRKVRA 425

Query: 429 YSLIRSGL 436
           YSLIRSGL
Sbjct: 426 YSLIRSGL 433

BLAST of Cp4.1LG10g06870 vs. ExPASy TrEMBL
Match: A0A6J1ECM3 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita moschata OX=3662 GN=LOC111432978 PE=4 SV=1)

HSP 1 Score: 885 bits (2288), Expect = 0.0
Identity = 434/436 (99.54%), Positives = 435/436 (99.77%), Query Frame = 0

Query: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60
           MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSL KALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLQKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120
           FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180
           TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240
           IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRW+LNLELGTVLVDMYAKCGFF
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWKLNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300
           KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360
           VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420
           SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVRKVRAYSLIRSGL 436
           QNVRKVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cp4.1LG10g06870 vs. ExPASy TrEMBL
Match: A0A6J1IMI1 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima OX=3661 GN=LOC111478755 PE=4 SV=1)

HSP 1 Score: 877 bits (2266), Expect = 0.0
Identity = 429/436 (98.39%), Positives = 433/436 (99.31%), Query Frame = 0

Query: 1   MLLSRSSHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSIL 60
           MLLSRSSHLLNPARRQFAL AKGFNSWALRIRNAPSL+KALAIYSQMHRQSVPHDSFSIL
Sbjct: 1   MLLSRSSHLLNPARRQFALSAKGFNSWALRIRNAPSLNKALAIYSQMHRQSVPHDSFSIL 60

Query: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120
           FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN
Sbjct: 61  FVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKN 120

Query: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180
           TVTWNTMIFGYSKTGDVDRARQLFDLMPS+DLASWSATIAAYVNNRNYRGGLLLFQDMIV
Sbjct: 121 TVTWNTMIFGYSKTGDVDRARQLFDLMPSRDLASWSATIAAYVNNRNYRGGLLLFQDMIV 180

Query: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240
           IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF
Sbjct: 181 IGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFF 240

Query: 241 KYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300
           KYACQVFHLMSEKNVRTWTALICGLA+HGYCKEALDLFEMMRNECVEPNELTFTGILSAC
Sbjct: 241 KYACQVFHLMSEKNVRTWTALICGLAKHGYCKEALDLFEMMRNECVEPNELTFTGILSAC 300

Query: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVW 360
           VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVI NMRLEPNIIVW
Sbjct: 301 VHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVINNMRLEPNIIVW 360

Query: 361 SSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLN 420
           SSLLSACKQHKSFDMAERVIEQILDK EPENHGG+YSLISDLYVLEEKWDDAEKIRNLLN
Sbjct: 361 SSLLSACKQHKSFDMAERVIEQILDKSEPENHGGVYSLISDLYVLEEKWDDAEKIRNLLN 420

Query: 421 QNVRKVRAYSLIRSGL 436
           QNVRKVRAYSLIRSGL
Sbjct: 421 QNVRKVRAYSLIRSGL 436

BLAST of Cp4.1LG10g06870 vs. ExPASy TrEMBL
Match: A0A5A7UD49 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold120G00120 PE=4 SV=1)

HSP 1 Score: 768 bits (1984), Expect = 9.10e-279
Identity = 371/428 (86.68%), Positives = 396/428 (92.52%), Query Frame = 0

Query: 9   LLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSILFVLKACAR 68
           LLNP+ RQFA  AKGFNSWALRIRNAPSLHKALAI+SQMHRQSVPHDSFSILF+LKACA 
Sbjct: 6   LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSILFMLKACAS 65

Query: 69  SNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKNTVTWNTMI 128
           SNNLSILHHLHAHITKLGFTTHVFVATSLL++YVL+SF+LA L+FDEMPHKN+VTWNTMI
Sbjct: 66  SNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKNSVTWNTMI 125

Query: 129 FGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQM 188
            GYSKTGDV  ARQLFD MPS+DLASWSA IAAY+NNRNYRG LLLFQDMI+ GI PDQM
Sbjct: 126 SGYSKTGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRGALLLFQDMIINGINPDQM 185

Query: 189 AVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFH 248
           A GSIL GCA+MGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYA+CG  KYACQ+FH
Sbjct: 186 AAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYARCGLLKYACQIFH 245

Query: 249 LMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQE 308
           LMSE+NVRTWTALICGLA HG CKEAL LFE MR+E VEPNELTFTG+LSACVHAG VQE
Sbjct: 246 LMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSACVHAGLVQE 305

Query: 309 GRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLSACK 368
           GRKYFNMIEEYGLE RIQHYGC VDLLGRSGLLEEAYGVIK+MR EPN+IVWSSLLSACK
Sbjct: 306 GRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVWSSLLSACK 365

Query: 369 QHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQNVRKVRA 428
           QHKSFD+AERVIEQIL+K EP NHGG+YSL+SDLYVL+EKWDDAE IRNLLNQ VRKVRA
Sbjct: 366 QHKSFDLAERVIEQILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLNQKVRKVRA 425

Query: 429 YSLIRSGL 436
           YSLIRSGL
Sbjct: 426 YSLIRSGL 433

BLAST of Cp4.1LG10g06870 vs. ExPASy TrEMBL
Match: A0A1S3AYC4 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=3656 GN=LOC103484239 PE=4 SV=1)

HSP 1 Score: 764 bits (1973), Expect = 4.31e-277
Identity = 370/428 (86.45%), Positives = 394/428 (92.06%), Query Frame = 0

Query: 9   LLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSILFVLKACAR 68
           LLNP+ RQFA  AKGFNSWALRIRNAPSLHKALAI+SQMHRQSVPHDSFSILF+LKACA 
Sbjct: 6   LLNPSCRQFAFSAKGFNSWALRIRNAPSLHKALAIFSQMHRQSVPHDSFSILFMLKACAS 65

Query: 69  SNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKNTVTWNTMI 128
           SNNLSILHHLHAHITKLGFTTHVFVATSLL++YVL+SF+LA L+FDEMPHKN+VTWNTMI
Sbjct: 66  SNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKNSVTWNTMI 125

Query: 129 FGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQM 188
            GYSKTGDV  ARQLFD MPS+DLASWSA IAAY+NNRNYR  LLLFQDMI+ GI PDQM
Sbjct: 126 SGYSKTGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRVALLLFQDMIINGINPDQM 185

Query: 189 AVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFH 248
           A GSIL GCA+MGSLG LAGKSVHGFVVKNRWELNLELGTVLV MYAKCG  KYACQ+FH
Sbjct: 186 AAGSILNGCAHMGSLGSLAGKSVHGFVVKNRWELNLELGTVLVHMYAKCGLLKYACQIFH 245

Query: 249 LMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQE 308
           LMSE+NVRTWTALICGLA HG CKEAL LFE MR+E VEPNELTFTG+LSACVHAG VQE
Sbjct: 246 LMSERNVRTWTALICGLAHHGCCKEALALFETMRHEGVEPNELTFTGVLSACVHAGLVQE 305

Query: 309 GRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLSACK 368
           GRKYFNMIEEYGLE RIQHYGC VDLLGRSGLLEEAYGVIK+MR EPN+IVWSSLLSACK
Sbjct: 306 GRKYFNMIEEYGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRFEPNVIVWSSLLSACK 365

Query: 369 QHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQNVRKVRA 428
           QHKSFD+AERVIE IL+K EP NHGG+YSL+SDLYVL+EKWDDAE IRNLLNQ VRKVRA
Sbjct: 366 QHKSFDLAERVIEHILEKTEPNNHGGVYSLVSDLYVLQEKWDDAENIRNLLNQKVRKVRA 425

Query: 429 YSLIRSGL 436
           YSLIRSGL
Sbjct: 426 YSLIRSGL 433

BLAST of Cp4.1LG10g06870 vs. ExPASy TrEMBL
Match: A0A0A0KIK9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G538690 PE=4 SV=1)

HSP 1 Score: 760 bits (1962), Expect = 2.29e-275
Identity = 367/428 (85.75%), Positives = 395/428 (92.29%), Query Frame = 0

Query: 9   LLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSILFVLKACAR 68
           LLNP+ R FA  AKG NSWALRIRNAPSLHKALA YSQMHRQSVPHDSFSILF+LKACA 
Sbjct: 9   LLNPSCRHFAFSAKGVNSWALRIRNAPSLHKALAFYSQMHRQSVPHDSFSILFMLKACAS 68

Query: 69  SNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELACLLFDEMPHKNTVTWNTMI 128
           SNNLSILHHLHAHITKLGFTTHVFVATSLL++YVL+SF+LA L+FDEMPHKN+VTWNTMI
Sbjct: 69  SNNLSILHHLHAHITKLGFTTHVFVATSLLHSYVLHSFQLARLVFDEMPHKNSVTWNTMI 128

Query: 129 FGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQM 188
            GYSK GDV  ARQLFD MPS+DLASWSA IAAY+NNRNYRG LLLFQDMI+ GI PDQM
Sbjct: 129 SGYSKAGDVHTARQLFDRMPSRDLASWSAMIAAYINNRNYRGALLLFQDMIINGINPDQM 188

Query: 189 AVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFH 248
           A GSIL GCA+MGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGF KYACQ+F+
Sbjct: 189 AAGSILNGCAHMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFLKYACQIFN 248

Query: 249 LMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQE 308
           LMSE+NVRTWTALICGLA HG CKEAL LFE MR+E VEPNE TFTG+LSACVHAG VQE
Sbjct: 249 LMSERNVRTWTALICGLAHHGCCKEALVLFETMRHEGVEPNEFTFTGVLSACVHAGLVQE 308

Query: 309 GRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLSACK 368
           GRKYFNMIEE GLE RIQHYGC VDLLGRSGLLEEAYGVIK+MRLEPN+IVWSSLLSACK
Sbjct: 309 GRKYFNMIEECGLEIRIQHYGCFVDLLGRSGLLEEAYGVIKSMRLEPNVIVWSSLLSACK 368

Query: 369 QHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQNVRKVRA 428
           QHKSFD+AERVIEQIL+K+EP+NH G+YSL+SDLYVL++KWDDAE IRNLLNQ+VRK RA
Sbjct: 369 QHKSFDLAERVIEQILEKIEPDNHAGVYSLVSDLYVLQDKWDDAENIRNLLNQHVRKGRA 428

Query: 429 YSLIRSGL 436
           YSLIRSGL
Sbjct: 429 YSLIRSGL 436
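
The Identity and Positives figures in an HSP header can be cross-checked against the alignment itself. In standard BLAST pairwise output, a letter on the midline marks an identical residue, a "+" marks a positive-scoring substitution, and a blank marks a mismatch or gap. The Python sketch below is illustrative only and is not part of the database's pipeline; the helper names (parse_hsp_header, percent, midline_counts) are hypothetical, and the header layout is assumed to be the one used in the reports on this page. The regular expression accepts both the "Positives" and "Postives" spellings.

```python
import re

# Hypothetical helpers; the header layout is assumed from the reports on this page.
HEADER_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_header(line):
    """Extract identity/positive counts and percentages from an HSP header line."""
    m = HEADER_RE.search(line)
    if m is None:
        raise ValueError("not an Identity/Positives header line")
    ident, length, ident_pct, pos, _pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, as printed in the header."""
    return round(100.0 * count / total, 2)


def midline_counts(query, midline):
    """Count identities and positives in one alignment block.

    The midline is padded to the query length because trailing blanks
    (mismatches at the end of a block) are often stripped from text exports.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitutions
    return identities, positives, len(query)


if __name__ == "__main__":
    header = "Identity = 367/428 (85.75%), Postives = 395/428 (92.29%), Query Frame = 0"
    hsp = parse_hsp_header(header)
    print(percent(hsp["identities"], hsp["length"]), hsp["identity_pct"])
    print(percent(hsp["positives"], hsp["length"]), hsp["positive_pct"])
    # Final block of the alignment above: query and midline are the same string.
    print(midline_counts("YSLIRSGL", "YSLIRSGL"))
```

Summing the per-block counts over every block of an HSP should reproduce the numerators in its header, provided the midline lines were exported without losing internal blanks.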

BLAST of Cp4.1LG10g06870 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 305.4 bits (781), Expect = 7.1e-83
Identity = 162/412 (39.32%), Positives = 253/412 (61.41%), Query Frame = 0

Query: 27  WALRIRN---APSLHKALAIYSQMHRQSVPHDSFSILFVLKACARSNNLSILHHLHAHIT 86
           W L IR    +    ++L +Y +M   S PH++++   +LKAC+  +       +HA IT
Sbjct: 83  WNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQIT 142

Query: 87  KLGFTTHVFVATSLLYAY-VLNSFELACLLFDEMPHKNTVTWNTMIFGYSKTGDVDRARQ 146
           KLG+   V+   SL+ +Y V  +F+LA LLFD +P  + V+WN++I GY K G +D A  
Sbjct: 143 KLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIALT 202

Query: 147 LFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQMAVGSILKGCAYMGS 206
           LF  M  K+  SW+  I+ YV     +  L LF +M    + PD +++ + L  CA +G+
Sbjct: 203 LFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQLGA 262

Query: 207 LGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFHLMSEKNVRTWTALI 266
           L    GK +H ++ K R  ++  LG VL+DMYAKCG  + A +VF  + +K+V+ WTALI
Sbjct: 263 LE--QGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 322

Query: 267 CGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQEGRK-YFNMIEEYGL 326
            G A HG+ +EA+  F  M+   ++PN +TFT +L+AC + G V+EG+  +++M  +Y L
Sbjct: 323 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 382

Query: 327 ETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLSACKQHKSFDMAERVIE 386
           +  I+HYGC+VDLLGR+GLL+EA   I+ M L+PN ++W +LL AC+ HK+ ++ E  I 
Sbjct: 383 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEE-IG 442

Query: 387 QILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLL-NQNVRKVRAYSLI 433
           +IL  ++P  HGG Y   ++++ +++KWD A + R L+  Q V KV   S I
Sbjct: 443 EILIAIDP-YHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTI 490

BLAST of Cp4.1LG10g06870 vs. TAIR 10
Match: AT2G44880.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 278.9 bits (712), Expect = 7.1e-75
Identity = 136/315 (43.17%), Positives = 205/315 (65.08%), Query Frame = 0

Query: 112 LFDEMPHKNTVTWNTMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGG 171
           LFDEM HK  +TW TMI GY    D+D AR+LFD MP ++L SW+  I  Y  N+  + G
Sbjct: 198 LFDEMTHKTVITWTTMIHGYCNIKDIDAARKLFDAMPERNLVSWNTMIGGYCQNKQPQEG 257

Query: 172 LLLFQDM-IVIGITPDQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVL 231
           + LFQ+M     + PD + + S+L   +  G+L L  G+  H FV + + +  +++ T +
Sbjct: 258 IRLFQEMQATTSLDPDDVTILSVLPAISDTGALSL--GEWCHCFVQRKKLDKKVKVCTAI 317

Query: 232 VDMYAKCGFFKYACQVFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNE 291
           +DMY+KCG  + A ++F  M EK V +W A+I G A +G  + ALDLF  M  E  +P+E
Sbjct: 318 LDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIHGYALNGNARAALDLFVTMMIE-EKPDE 377

Query: 292 LTFTGILSACVHAGFVQEGRKYFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKN 351
           +T   +++AC H G V+EGRK+F+++ E GL  +I+HYGCMVDLLGR+G L+EA  +I N
Sbjct: 378 ITMLAVITACNHGGLVEEGRKWFHVMREMGLNAKIEHYGCMVDLLGRAGSLKEAEDLITN 437

Query: 352 MRLEPNIIVWSSLLSACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWD 411
           M  EPN I+ SS LSAC Q+K  + AER++++ ++ LEP+N G  Y L+ +LY  +++WD
Sbjct: 438 MPFEPNGIILSSFLSACGQYKDIERAERILKKAVE-LEPQNDGN-YVLLRNLYAADKRWD 497

Query: 412 DAEKIRNLLNQNVRK 426
           D   ++N++ +N  K
Sbjct: 498 DFGMVKNVMRKNQAK 507

BLAST of Cp4.1LG10g06870 vs. TAIR 10
Match: AT5G61800.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 277.7 bits (709), Expect = 1.6e-74
Identity = 145/410 (35.37%), Positives = 237/410 (57.80%), Query Frame = 0

Query: 35  PSLHKALAIYSQMHRQSVPHDSFSILFVLKACARSNN--LSILHHLHAHITKLGFTTHVF 94
           PS   +   + +M R+SVP D  +  FV KACA   N  L+++  LH    + G  + +F
Sbjct: 94  PSSLSSKRFFVEMRRRSVPPDFHTFPFVFKACAAKKNGDLTLVKTLHCQALRFGLLSDLF 153

Query: 95  VATSLLYAY-VLNSFELACLLFDEMPHKNTVTWNTMIFGYSKTGDVDRARQLFDLMPSKD 154
              +L+  Y ++   + A  LFDE P ++ VT+N +I G  K  ++ RAR+LFD MP +D
Sbjct: 154 TLNTLIRVYSLIAPIDSALQLFDENPQRDVVTYNVLIDGLVKAREIVRARELFDSMPLRD 213

Query: 155 LASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPDQMAVGSILKGCAYMGSLGLLAGKSV 214
           L SW++ I+ Y    + R  + LF +M+ +G+ PD +A+ S L  CA  G      GK++
Sbjct: 214 LVSWNSLISGYAQMNHCREAIKLFDEMVALGLKPDNVAIVSTLSACAQSGD--WQKGKAI 273

Query: 215 HGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQVFHLMSEKNVRTWTALICGLAQHGYC 274
           H +  + R  ++  L T LVD YAKCGF   A ++F L S+K + TW A+I GLA HG  
Sbjct: 274 HDYTKRKRLFIDSFLATGLVDFYAKCGFIDTAMEIFELCSDKTLFTWNAMITGLAMHGNG 333

Query: 275 KEALDLFEMMRNECVEPNELTFTGILSACVHAGFVQEGRKYFNMIEE-YGLETRIQHYGC 334
           +  +D F  M +  ++P+ +TF  +L  C H+G V E R  F+ +   Y +   ++HYGC
Sbjct: 334 ELTVDYFRKMVSSGIKPDGVTFISVLVGCSHSGLVDEARNLFDQMRSLYDVNREMKHYGC 393

Query: 335 MVDLLGRSGLLEEAYGVIKNMRLE----PNIIVWSSLLSACKQHKSFDMAERVIEQILDK 394
           M DLLGR+GL+EEA  +I+ M  +      ++ WS LL  C+ H + ++AE+   ++   
Sbjct: 394 MADLLGRAGLIEEAAEMIEQMPKDGGNREKLLAWSGLLGGCRIHGNIEIAEKAANRV-KA 453

Query: 395 LEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQN--VRKVRAYSLIRS 435
           L PE+ GG+Y ++ ++Y   E+W++  K+R +++++  V+K   +S + S
Sbjct: 454 LSPED-GGVYKVMVEMYANAERWEEVVKVREIIDRDKKVKKNVGFSKVLS 499

BLAST of Cp4.1LG10g06870 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 275.4 bits (703), Expect = 7.9e-74
Identity = 153/429 (35.66%), Positives = 252/429 (58.74%), Query Frame = 0

Query: 7   SHLLNPARRQFALFAKGFNSWALRIRNAPSLHKALAIYSQMHRQSVPHDSFSILFVLKAC 66
           S + NP    F L  + F++ A      PS  KA   Y+QM +  +  D+ +  F++KA 
Sbjct: 75  SQIQNPNLFVFNLLIRCFSTGA-----EPS--KAFGFYTQMLKSRIWPDNITFPFLIKAS 134

Query: 67  ARSNNLSILHHLHAHITKLGFTTHVFVATSLLYAYVLNSFELAC-LLFDEMPHKNTVTWN 126
           +    + +    H+ I + GF   V+V  SL++ Y    F  A   +F +M  ++ V+W 
Sbjct: 135 SEMECVLVGEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWT 194

Query: 127 TMIFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITP 186
           +M+ GY K G V+ AR++FD MP ++L +WS  I  Y  N  +   + LF+ M   G+  
Sbjct: 195 SMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVA 254

Query: 187 DQMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQ 246
           ++  + S++  CA++G+L    G+  + +VVK+   +NL LGT LVDM+ +CG  + A  
Sbjct: 255 NETVMVSVISSCAHLGALEF--GERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIH 314

Query: 247 VFHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGF 306
           VF  + E +  +W+++I GLA HG+  +A+  F  M +    P ++TFT +LSAC H G 
Sbjct: 315 VFEGLPETDSLSWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGL 374

Query: 307 VQEGRK-YFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLL 366
           V++G + Y NM +++G+E R++HYGC+VD+LGR+G L EA   I  M ++PN  +  +LL
Sbjct: 375 VEKGLEIYENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALL 434

Query: 367 SACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQN-V 426
            ACK +K+ ++AERV   +L K++PE H G Y L+S++Y    +WD  E +R+++ +  V
Sbjct: 435 GACKIYKNTEVAERV-GNMLIKVKPE-HSGYYVLLSNIYACAGQWDKIESLRDMMKEKLV 492

Query: 427 RKVRAYSLI 433
           +K   +SLI
Sbjct: 495 KKPPGWSLI 492

BLAST of Cp4.1LG10g06870 vs. TAIR 10
Match: AT1G06150.1 (basic helix-loop-helix (bHLH) DNA-binding superfamily protein )

HSP 1 Score: 273.1 bits (697), Expect = 3.9e-73
Identity = 145/416 (34.86%), Positives = 233/416 (56.01%), Query Frame = 0

Query: 39   KALAIYSQMHRQSVPHDSFSILFVLKACARSNNLSILHHLHAHITKLGFTTHVFVATSLL 98
            ++L +Y +M R SV   S++   ++KA + ++       L AHI K GF  HV + T+L+
Sbjct: 854  RSLELYVRMLRDSVSPSSYTYSSLVKASSFASRFG--ESLQAHIWKFGFGFHVKIQTTLI 913

Query: 99   YAY-VLNSFELACLLFDEMPHKNTVTWNTM------------------------------ 158
              Y        A  +FDEMP ++ + W TM                              
Sbjct: 914  DFYSATGRIREARKVFDEMPERDDIAWTTMVSAYRRVLDMDSANSLANQMSEKNEATSNC 973

Query: 159  -IFGYSKTGDVDRARQLFDLMPSKDLASWSATIAAYVNNRNYRGGLLLFQDMIVIGITPD 218
             I GY   G++++A  LF+ MP KD+ SW+  I  Y  N+ YR  + +F  M+  GI PD
Sbjct: 974  LINGYMGLGNLEQAESLFNQMPVKDIISWTTMIKGYSQNKRYREAIAVFYKMMEEGIIPD 1033

Query: 219  QMAVGSILKGCAYMGSLGLLAGKSVHGFVVKNRWELNLELGTVLVDMYAKCGFFKYACQV 278
            ++ + +++  CA++G L +  GK VH + ++N + L++ +G+ LVDMY+KCG  + A  V
Sbjct: 1034 EVTMSTVISACAHLGVLEI--GKEVHMYTLQNGFVLDVYIGSALVDMYSKCGSLERALLV 1093

Query: 279  FHLMSEKNVRTWTALICGLAQHGYCKEALDLFEMMRNECVEPNELTFTGILSACVHAGFV 338
            F  + +KN+  W ++I GLA HG+ +EAL +F  M  E V+PN +TF  + +AC HAG V
Sbjct: 1094 FFNLPKKNLFCWNSIIEGLAAHGFAQEALKMFAKMEMESVKPNAVTFVSVFTACTHAGLV 1153

Query: 339  QEGRK-YFNMIEEYGLETRIQHYGCMVDLLGRSGLLEEAYGVIKNMRLEPNIIVWSSLLS 398
             EGR+ Y +MI++Y + + ++HYG MV L  ++GL+ EA  +I NM  EPN ++W +LL 
Sbjct: 1154 DEGRRIYRSMIDDYSIVSNVEHYGGMVHLFSKAGLIYEALELIGNMEFEPNAVIWGALLD 1213

Query: 399  ACKQHKSFDMAERVIEQILDKLEPENHGGIYSLISDLYVLEEKWDDAEKIRNLLNQ 422
             C+ HK+  +AE    +++  LEP N  G Y L+  +Y  + +W D  +IR  + +
Sbjct: 1214 GCRIHKNLVIAEIAFNKLM-VLEPMN-SGYYFLLVSMYAEQNRWRDVAEIRGRMRE 1263

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9FJY7	1.0e-81	39.32	Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX...
Q1PEU4	1.0e-73	43.17	Pentatricopeptide repeat-containing protein At2g44880 OS=Arabidopsis thaliana OX...
Q9FLS9	2.2e-73	35.37	Pentatricopeptide repeat-containing protein At5g61800 OS=Arabidopsis thaliana OX...
Q9FG16	1.1e-72	35.66	Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX...
Q56X05	5.5e-72	34.86	Pentatricopeptide repeat-containing protein At1g06143 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
XP_023544798.1	0.0	100.00	pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp...
XP_022925591.1	0.0	99.54	pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata] ...
XP_022978962.1	0.0	98.39	pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima]
XP_038877521.1	3.20e-289	88.53	pentatricopeptide repeat-containing protein At5g66520-like [Benincasa hispida]
KAA0052417.1	1.88e-278	86.68	pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]

Match Name	E-value	Identity	Description
A0A6J1ECM3	0.0	99.54	pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita moschata...
A0A6J1IMI1	0.0	98.39	pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima O...
A0A5A7UD49	9.10e-279	86.68	Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A1S3AYC4	4.31e-277	86.45	pentatricopeptide repeat-containing protein At5g66520-like OS=Cucumis melo OX=36...
A0A0A0KIK9	2.29e-275	85.75	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G538690 PE=4 SV=1

Match Name	E-value	Identity	Description
AT5G66520.1	7.1e-83	39.32	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G44880.1	7.1e-75	43.17	Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G61800.1	1.6e-74	35.37	Pentatricopeptide repeat (PPR) superfamily protein
AT5G06540.1	7.9e-74	35.66	Pentatricopeptide repeat (PPR) superfamily protein
AT1G06150.1	3.9e-73	34.86	basic helix-loop-helix (bHLH) DNA-binding superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 301..433; e-value: 5.7E-16; score: 60.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 154..300; e-value: 6.4E-31; score: 109.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	coord: 28..150; e-value: 8.1E-10; score: 40.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 328..352; e-value: 0.011; score: 15.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 122..151; e-value: 2.4E-8; score: 33.7
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 359..385; e-value: 0.083; score: 13.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 154..183; e-value: 0.072; score: 13.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 253..301; e-value: 6.5E-11; score: 42.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 122..152; e-value: 6.9E-8; score: 30.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 257..290; e-value: 1.1E-7; score: 29.6
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 254..288; score: 11.39981
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 120..154; score: 11.980759
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 289..323; score: 8.506026
None	No IPR available	PANTHER	PTHR47928:SF104	OS01G0800400 PROTEIN	coord: 22..432
None	No IPR available	PANTHER	PTHR47928	REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATED	coord: 22..432

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cp4.1LG10g06870.1	Cp4.1LG10g06870.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding