Cp4.1LG09g11050 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG09g11050
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG09: 9464618 .. 9466910 (-)
RNA-Seq Expression: Cp4.1LG09g11050
Synteny: Cp4.1LG09g11050
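The Location field packs the chromosome, start, end, and strand into one string. The sketch below shows one way to pull those parts out and derive the length of the span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and `parse_location` is a hypothetical helper written for this illustration only.

```python
import re

# Pattern for strings of the form "Cp4.1LG09: 9464618 .. 9466910 (-)".
LOCATION = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)

def parse_location(text):
    """Split a location string into its parts (hypothetical helper)."""
    match = LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "seqid": match["seqid"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: 1-based inclusive coordinates, so both endpoints count.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG09: 9464618 .. 9466910 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span covers 2293 bases. The `(-)` strand means the gene is read from the reverse complement of the reference chromosome.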
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGTCAATGTGCATAATACTACAACCACGCCAGATTTGACTTTATAGACCATGGACACTAGACAACTCCATAGATTTAATAAGTTCAAATGAGAAAAGATTTATTTATAAAATAACGGTTCAGATATAATATTTAGAGAATATACTAGTTATGGAATATTCTTGTAATATATTTTATTTTATTCTTAATATTTAATTAGTTTATATTTTATTTATTCGAGAGTGGTTAGTTGGTAGCTTGTATCCCGTTTATGAATATAAAAATGGGTGGGGAGTTATTATTGGGAATTAAAACTTATATTTAGATTAAAATATAAAAATGATTTAACCGAACTTGGGTGCGGTCGGGGCTAGGATTGGCTTTGAATATACCTCCGAATAGAGTTTGTTCCAGGATTTCAAATGATCATATAAATTAAATGCCTCTCCACATCGCTCGCCCATTAATTTGTGTTTCGAAATCAACAAGAACACGACTTTCAATTGCTTTGAGAATTTCTCACAAGTCTTTCATTTCAAAATCAGAGATTTCGTCCGTGAAGCTAGAAGATTTCTATGTCGATCTCTTGCATCGGTGTGTTCAAACCTCCGATTCCCGCCATGGATCGGCAATTCATGCGAAGTTTCTCAAAGGATTTCTTCCATGTTCTCTTTTCTTCCACAACCATGTACTTAATTTCTATGTCAAATGTGGAAGTCTCTCATGTGGTCTGCAACTGTTCGACGAAATGCCTGAGAGAAATGTTGTGTCCTGGTCTGCATTAATTGCTGGGTTCGTCCAACACGGCCGACCCAATGAAGCCCTCTCTCTGTTTAGCCGCATGCATTGTGATGGCACGATCATTCCCAACGAATTCACCCTTGTAAGTGCTCTTCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCGTACCAAATTTACGCATTAGTTCTTCGCTTAGGATATGGGTCGAATATTTTCCTCATGAATGCGTTCTTAACTGCTTTAATTAGGCATGAGAAATTGCTGGACGCTTTAGAAGTTTTTGAGAGTTCTTCATCCAAGGATATTGTATCATGGAATGCTATGATGGCTGGTTATTTGCAATTATCATATCTAGAACTGCCTAAGTTTTGGCGCCGGATGAATCTCGAGAACATTAAGCCTGATAATTTTACATTTGCTAGTATCTTAACTGGGTTGGCTGCTCTCTCTGAGTTTAAACTGGGTTTGCAAGTTCATGGACAGCTTGTGAAAAGTGGATATGGGAATGATATTTGTGTAGGGAATTCCTTGTGTGATATGTACATTAAGAATCAGAAATTGTTTGATGGTTTTAAAGCTTTTGATGAAATGCCTTCAAGTGATGTGTGTTCTTGGACTCAGATGGCTGCAGGGTGCCTCCATTGTGGTGAACCAATGAAGGCACTCGAGGTCATTTACGATATGAAAAACGTCGGTGTGAGGCTAAATAAGTTCACCCTTGCAACTGCCTTGAATGCTTCTGCTAATTTAGCATCCATAGAAGAAGGGAAGAAATTCCATGGATTGAGAATTAAACTTGGAGCTGATATTGATGTTTGTGTTGATAATGCTCTACTTGATATGTATGCAAAATGTGGATGTATGAGCAGTGCGAATGTTGTCTTTCGTTCAATGGATGAACAATCTGTTGTCTCATGGACTACCATGATTATGGGATTTGCACACAATGGCCAAGCAAAAGAAGCCCTTCAAATATTTGATGAAATGAGAAAAGAAGGAACTGAGCCTAATCACATCACTTTTGTTTGTGTCCTCTATGCTTGTAGCCAAGGAGGTTTCATTGATGAAGCATGGAAATACTTCTCTTCCATGAGTGCTGACCATGGGATTTCACCCTCTGAAGATCACTATGTATGTATGGTGAATCTCTTAGGCCGAGCTGGGTGTATAAAAGAAGCCGAGGATTTGATCGGACGAATGCCGTTTAAACCGGGTCCTTTGGTCTGGCAAACATTGCTTGGTGCATGCTTAGT
TCATGGCGACGTAGAGACAGGAAAACGAGCAGCCGAGCACGCGTTGAATTTGGATCAAAACGATCCATCGACTTATGTTTTGTTATCGAACATGTTTGCTGGTCGGAGTAACTGGGACGGTGTGGGAAGTTTGAGAGAGCTAATGGAAACTAGAGATGTAAAGAAAGTACCTGGATTCAGTTGGATGTAAACATGAGAATGGTTAGTTCTTCTCCTTTTAAAGATTCTTCGAGGAAATTCATGTTGGTGGGTGACTTGTTTTCCTATGATAGCTGTTGGGAGAGGTTATTTGA

mRNA sequence

ATGCAGATTTCGTCCGTGAAGCTAGAAGATTTCTATGTCGATCTCTTGCATCGGTGTGTTCAAACCTCCGATTCCCGCCATGGATCGGCAATTCATGCGAAGTTTCTCAAAGGATTTCTTCCATGTTCTCTTTTCTTCCACAACCATGTACTTAATTTCTATGTCAAATGTGGAAGTCTCTCATGTGGTCTGCAACTGTTCGACGAAATGCCTGAGAGAAATGTTGTGTCCTGGTCTGCATTAATTGCTGGGTTCGTCCAACACGGCCGACCCAATGAAGCCCTCTCTCTGTTTAGCCGCATGCATTGTGATGGCACGATCATTCCCAACGAATTCACCCTTGTAAGTGCTCTTCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCGTACCAAATTTACGCATTAGTTCTTCGCTTAGGATATGGGTCGAATATTTTCCTCATGAATGCGTTCTTAACTGCTTTAATTAGGCATGAGAAATTGCTGGACGCTTTAGAAGTTTTTGAGAGTTCTTCATCCAAGGATATTGTATCATGGAATGCTATGATGGCTGGTTATTTGCAATTATCATATCTAGAACTGCCTAAGTTTTGGCGCCGGATGAATCTCGAGAACATTAAGCCTGATAATTTTACATTTGCTAGTATCTTAACTGGGTTGGCTGCTCTCTCTGAGTTTAAACTGGGTTTGCAAGTTCATGGACAGCTTGTGAAAAGTGGATATGGGAATGATATTTGTGTAGGGAATTCCTTGTGTGATATGTACATTAAGAATCAGAAATTGTTTGATGGTTTTAAAGCTTTTGATGAAATGCCTTCAAGTGATGTGTGTTCTTGGACTCAGATGGCTGCAGGGTGCCTCCATTGTGGTGAACCAATGAAGGCACTCGAGCTGTTGGGAGAGGTTATTTGA

Coding sequence (CDS)

ATGCAGATTTCGTCCGTGAAGCTAGAAGATTTCTATGTCGATCTCTTGCATCGGTGTGTTCAAACCTCCGATTCCCGCCATGGATCGGCAATTCATGCGAAGTTTCTCAAAGGATTTCTTCCATGTTCTCTTTTCTTCCACAACCATGTACTTAATTTCTATGTCAAATGTGGAAGTCTCTCATGTGGTCTGCAACTGTTCGACGAAATGCCTGAGAGAAATGTTGTGTCCTGGTCTGCATTAATTGCTGGGTTCGTCCAACACGGCCGACCCAATGAAGCCCTCTCTCTGTTTAGCCGCATGCATTGTGATGGCACGATCATTCCCAACGAATTCACCCTTGTAAGTGCTCTTCATGCTTGTTCTTTAACTCAGAGGCTGATATGTTCGTACCAAATTTACGCATTAGTTCTTCGCTTAGGATATGGGTCGAATATTTTCCTCATGAATGCGTTCTTAACTGCTTTAATTAGGCATGAGAAATTGCTGGACGCTTTAGAAGTTTTTGAGAGTTCTTCATCCAAGGATATTGTATCATGGAATGCTATGATGGCTGGTTATTTGCAATTATCATATCTAGAACTGCCTAAGTTTTGGCGCCGGATGAATCTCGAGAACATTAAGCCTGATAATTTTACATTTGCTAGTATCTTAACTGGGTTGGCTGCTCTCTCTGAGTTTAAACTGGGTTTGCAAGTTCATGGACAGCTTGTGAAAAGTGGATATGGGAATGATATTTGTGTAGGGAATTCCTTGTGTGATATGTACATTAAGAATCAGAAATTGTTTGATGGTTTTAAAGCTTTTGATGAAATGCCTTCAAGTGATGTGTGTTCTTGGACTCAGATGGCTGCAGGGTGCCTCCATTGTGGTGAACCAATGAAGGCACTCGAGCTGTTGGGAGAGGTTATTTGA

Protein sequence

MQISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEVI
Homology
BLAST of Cp4.1LG09g11050 vs. ExPASy Swiss-Prot
Match: P93005 (Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E19 PE=3 SV=1)

HSP 1 Score: 174.9 bits (442), Expect = 1.4e-42
Identity = 98/278 (35.25%), Positives = 157/278 (56.47%), Query Frame = 0

Query: 28  GSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSALIAGFVQ 87
           G  IH   +K  L   +   N ++  Y KC SL+   ++FD   +RN ++WSA++ G+ Q
Sbjct: 240 GRQIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQ 299

Query: 88  HGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGSNIF 147
           +G   EA+ LFSRM   G I P+E+T+V  L+ACS    L    Q+++ +L+LG+  ++F
Sbjct: 300 NGESLEAVKLFSRMFSAG-IKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLF 359

Query: 148 LMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQLS-YLELPKFWRRMNLEN 207
              A +    +   L DA + F+    +D+  W ++++GY+Q S   E    +RRM    
Sbjct: 360 ATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAG 419

Query: 208 IKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLFDGF 267
           I P++ T AS+L   ++L+  +LG QVHG  +K G+G ++ +G++L  MY K   L DG 
Sbjct: 420 IIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGN 479

Query: 268 KAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEVI 305
             F   P+ DV SW  M +G  H G+  +ALEL  E++
Sbjct: 480 LVFRRTPNKDVVSWNAMISGLSHNGQGDEALELFEEML 516
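Each HSP summary line reports identical and positively scoring positions as a fraction of the alignment length, followed by the same value as a percentage. The sketch below recomputes those percentages from the raw counts as a consistency check. `parse_hsp_stats` is a hypothetical helper for this illustration, not part of any BLAST toolkit, and the regular expression assumes the `label = n/total (pct%)` layout used on this page.

```python
import re

# Matches fields such as "Identity = 98/278 (35.25%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label.lower(): (int(n), int(total), float(pct))
        for label, n, total, pct in STAT.findall(line)
    }

line = "Identity = 98/278 (35.25%), Positives = 157/278 (56.47%), Query Frame = 0"
stats = parse_hsp_stats(line)

for label, (n, total, reported) in stats.items():
    recomputed = round(100 * n / total, 2)
    print(f"{label}: {n}/{total} -> {recomputed}% (reported {reported}%)")
```

For the first Swiss-Prot hit, 98 of 278 aligned positions are identical (35.25%) and 157 of 278 score positively (56.47%), in agreement with the reported values.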

BLAST of Cp4.1LG09g11050 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 4.1e-42
Identity = 93/274 (33.94%), Positives = 152/274 (55.47%), Query Frame = 0

Query: 11  FYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEM 70
           FY  LL +C        G  +HA  L+      +   N +LN Y KCGSL    ++F++M
Sbjct: 62  FYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKM 121

Query: 71  PERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICS 130
           P+R+ V+W+ LI+G+ QH RP +AL  F++M   G   PNEFTL S + A +  +R  C 
Sbjct: 122 PQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFG-YSPNEFTLSSVIKAAAAERRGCCG 181

Query: 131 YQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQL 190
           +Q++   ++ G+ SN+ + +A L    R+  + DA  VF++  S++ VSWNA++AG+ + 
Sbjct: 182 HQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARR 241

Query: 191 SYLELP-KFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVG 250
           S  E   + ++ M  +  +P +F++AS+    ++    + G  VH  ++KSG       G
Sbjct: 242 SGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAG 301

Query: 251 NSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQM 284
           N+L DMY K+  + D  K FD +   DV SW  +
Sbjct: 302 NTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSL 334

BLAST of Cp4.1LG09g11050 vs. ExPASy Swiss-Prot
Match: P0C898 (Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H86 PE=3 SV=1)

HSP 1 Score: 167.9 bits (424), Expect = 1.7e-40
Identity = 98/293 (33.45%), Positives = 155/293 (52.90%), Query Frame = 0

Query: 13  VDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMPE 72
           V +L  C +   S  G  +H   LK     +L   N++++ Y KC       ++FD MPE
Sbjct: 10  VSILRVCTRKGLSDQGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPE 69

Query: 73  RNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQ 132
           RNVVSWSAL++G V +G    +LSLFS M   G I PNEFT  + L AC L   L    Q
Sbjct: 70  RNVVSWSALMSGHVLNGDLKGSLSLFSEMGRQG-IYPNEFTFSTNLKACGLLNALEKGLQ 129

Query: 133 IYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQLSY 192
           I+   L++G+   + + N+ +    +  ++ +A +VF     + ++SWNAM+AG++   Y
Sbjct: 130 IHGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGY 189

Query: 193 ----LELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGY--GNDI 252
               L+     +  N++  +PD FT  S+L   ++      G Q+HG LV+SG+   +  
Sbjct: 190 GSKALDTFGMMQEANIKE-RPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSA 249

Query: 253 CVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEL 300
            +  SL D+Y+K   LF   KAFD++    + SW+ +  G    GE ++A+ L
Sbjct: 250 TITGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGL 300

BLAST of Cp4.1LG09g11050 vs. ExPASy Swiss-Prot
Match: Q9LIC3 (Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H85 PE=3 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 1.9e-39
Identity = 101/291 (34.71%), Positives = 155/291 (53.26%), Query Frame = 0

Query: 12  YVDLLHRCVQTSDSRHGSAIHAKFLK-GFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEM 71
           Y  LL+ C+     R G  +HA  +K  +LP + +    +L FY KC  L    ++ DEM
Sbjct: 55  YDALLNACLDKRALRDGQRVHAHMIKTRYLPAT-YLRTRLLIFYGKCDCLEDARKVLDEM 114

Query: 72  PERNVVSWSALIAGFVQHGRPNEALSLFSR-MHCDGTIIPNEFTLVSALHACSLTQRLIC 131
           PE+NVVSW+A+I+ + Q G  +EAL++F+  M  DG   PNEFT  + L +C     L  
Sbjct: 115 PEKNVVSWTAMISRYSQTGHSSEALTVFAEMMRSDGK--PNEFTFATVLTSCIRASGLGL 174

Query: 132 SYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQ 191
             QI+ L+++  Y S+IF+ ++ L    +  ++ +A E+FE    +D+VS  A++AGY Q
Sbjct: 175 GKQIHGLIVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQ 234

Query: 192 LSY-LELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICV 251
           L    E  + + R++ E + P+  T+AS+LT L+ L+    G Q H  +++        +
Sbjct: 235 LGLDEEALEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVL 294

Query: 252 GNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEL 300
            NSL DMY K   L    + FD MP     SW  M  G    G   + LEL
Sbjct: 295 QNSLIDMYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLEL 342

BLAST of Cp4.1LG09g11050 vs. ExPASy Swiss-Prot
Match: Q9LFI1 (Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E86 PE=2 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 7.3e-39
Identity = 87/294 (29.59%), Positives = 152/294 (51.70%), Query Frame = 0

Query: 12  YVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMP 71
           Y+ L+  C  +     G  IH   L          +NH+L+ Y KCGSL    ++FD MP
Sbjct: 70  YISLICACSSSRSLAQGRKIHDHILNSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMP 129

Query: 72  ERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSY 131
           ERN+VS++++I G+ Q+G+  EA+ L+ +M     ++P++F   S + AC+ +  +    
Sbjct: 130 ERNLVSYTSVITGYSQNGQGAEAIRLYLKM-LQEDLVPDQFAFGSIIKACASSSDVGLGK 189

Query: 132 QIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQL- 191
           Q++A V++L   S++   NA +   +R  ++ DA  VF     KD++SW++++AG+ QL 
Sbjct: 190 QLHAQVIKLESSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLG 249

Query: 192 -SYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVG 251
             +  L      ++     P+ + F S L   ++L     G Q+HG  +KS    +   G
Sbjct: 250 FEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAG 309

Query: 252 NSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEV 304
            SLCDMY +   L    + FD++   D  SW  + AG  + G   +A+ +  ++
Sbjct: 310 CSLCDMYARCGFLNSARRVFDQIERPDTASWNVIIAGLANNGYADEAVSVFSQM 362

BLAST of Cp4.1LG09g11050 vs. NCBI nr
Match: XP_023542233.1 (pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 609 bits (1570), Expect = 2.72e-215
Identity = 296/302 (98.01%), Positives = 301/302 (99.67%), Query Frame = 0

Query: 2   QISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 61
           +ISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS
Sbjct: 36  EISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 95

Query: 62  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 121
           CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC
Sbjct: 96  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 155

Query: 122 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 181
           SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN
Sbjct: 156 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 215

Query: 182 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 241
           AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG
Sbjct: 216 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 275

Query: 242 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLG 301
           YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALE++ 
Sbjct: 276 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIY 335

Query: 302 EV 303
           ++
Sbjct: 336 DM 337

BLAST of Cp4.1LG09g11050 vs. NCBI nr
Match: KAG6572987.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 597 bits (1540), Expect = 9.83e-211
Identity = 290/302 (96.03%), Positives = 299/302 (99.01%), Query Frame = 0

Query: 2   QISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 61
           +ISSVKLEDFYV+LLHRCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCGSLS
Sbjct: 36  EISSVKLEDFYVNLLHRCVQTSDSRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLS 95

Query: 62  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 121
           CGLQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC
Sbjct: 96  CGLQLFDEMPERNVVSWSAVIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 155

Query: 122 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 181
           SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN
Sbjct: 156 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 215

Query: 182 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 241
           AMMAGYLQL+YLELPKFWRRMNLE+IKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG
Sbjct: 216 AMMAGYLQLAYLELPKFWRRMNLEDIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 275

Query: 242 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLG 301
           YGNDICVGNSLCDMYIKNQKL DGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALE++ 
Sbjct: 276 YGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIS 335

Query: 302 EV 303
           ++
Sbjct: 336 DM 337

BLAST of Cp4.1LG09g11050 vs. NCBI nr
Match: KAG7012171.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 596 bits (1537), Expect = 2.81e-210
Identity = 290/302 (96.03%), Positives = 299/302 (99.01%), Query Frame = 0

Query: 2   QISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 61
           +ISSVKLEDFYV+LLHRCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCGSLS
Sbjct: 36  EISSVKLEDFYVNLLHRCVQTSDSRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLS 95

Query: 62  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 121
           CGLQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC
Sbjct: 96  CGLQLFDEMPERNVVSWSAVIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 155

Query: 122 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 181
           SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN
Sbjct: 156 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 215

Query: 182 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 241
           AMMAGYLQL+YLELPKFWRRMNLE+IKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG
Sbjct: 216 AMMAGYLQLAYLELPKFWRRMNLEDIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 275

Query: 242 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLG 301
           YGNDICVGNSLCDMYIKNQKL DGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALE++ 
Sbjct: 276 YGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIY 335

Query: 302 EV 303
           ++
Sbjct: 336 DM 337

BLAST of Cp4.1LG09g11050 vs. NCBI nr
Match: XP_022954723.1 (pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata])

HSP 1 Score: 593 bits (1528), Expect = 6.54e-209
Identity = 289/302 (95.70%), Positives = 297/302 (98.34%), Query Frame = 0

Query: 2   QISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 61
           +IS VKLEDFYV+LLHRCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCGSLS
Sbjct: 36  EISYVKLEDFYVNLLHRCVQTSDSRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLS 95

Query: 62  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 121
           CGLQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC
Sbjct: 96  CGLQLFDEMPERNVVSWSAVIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 155

Query: 122 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 181
           SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLL+ALEVF SSSSKDIVSWN
Sbjct: 156 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLEALEVFGSSSSKDIVSWN 215

Query: 182 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 241
           AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG
Sbjct: 216 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 275

Query: 242 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLG 301
           YGNDICVGNSLCDMYIKNQKL DGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALE++ 
Sbjct: 276 YGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIY 335

Query: 302 EV 303
           ++
Sbjct: 336 DM 337

BLAST of Cp4.1LG09g11050 vs. NCBI nr
Match: XP_022994939.1 (pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita maxima])

HSP 1 Score: 590 bits (1522), Expect = 5.34e-208
Identity = 287/302 (95.03%), Positives = 297/302 (98.34%), Query Frame = 0

Query: 2   QISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 61
           +ISSVKLEDFYV+LLHRCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCGSLS
Sbjct: 36  EISSVKLEDFYVNLLHRCVQTSDSRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLS 95

Query: 62  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 121
           CGLQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC
Sbjct: 96  CGLQLFDEMPERNVVSWSAVIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 155

Query: 122 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 181
           SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLL+ALEVFE+SSSKDIVSWN
Sbjct: 156 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLEALEVFENSSSKDIVSWN 215

Query: 182 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 241
           AMMAGYLQLSY ELPKFWRRMNLE+IKPDNFTFASILTGLAALSEFKLGLQVHG LVKSG
Sbjct: 216 AMMAGYLQLSYFELPKFWRRMNLEDIKPDNFTFASILTGLAALSEFKLGLQVHGLLVKSG 275

Query: 242 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLG 301
           YGNDICVGNSLCDMYIKNQKL DGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALE++ 
Sbjct: 276 YGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIY 335

Query: 302 EV 303
           ++
Sbjct: 336 DM 337

BLAST of Cp4.1LG09g11050 vs. ExPASy TrEMBL
Match: A0A6J1GRX4 (pentatricopeptide repeat-containing protein At2g13600-like OS=Cucurbita moschata OX=3662 GN=LOC111456896 PE=4 SV=1)

HSP 1 Score: 593 bits (1528), Expect = 3.17e-209
Identity = 289/302 (95.70%), Positives = 297/302 (98.34%), Query Frame = 0

Query: 2   QISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 61
           +IS VKLEDFYV+LLHRCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCGSLS
Sbjct: 36  EISYVKLEDFYVNLLHRCVQTSDSRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLS 95

Query: 62  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 121
           CGLQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC
Sbjct: 96  CGLQLFDEMPERNVVSWSAVIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 155

Query: 122 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 181
           SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLL+ALEVF SSSSKDIVSWN
Sbjct: 156 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLEALEVFGSSSSKDIVSWN 215

Query: 182 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 241
           AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG
Sbjct: 216 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 275

Query: 242 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLG 301
           YGNDICVGNSLCDMYIKNQKL DGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALE++ 
Sbjct: 276 YGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIY 335

Query: 302 EV 303
           ++
Sbjct: 336 DM 337

BLAST of Cp4.1LG09g11050 vs. ExPASy TrEMBL
Match: A0A6J1K498 (pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita maxima OX=3661 GN=LOC111490529 PE=4 SV=1)

HSP 1 Score: 590 bits (1522), Expect = 2.58e-208
Identity = 287/302 (95.03%), Positives = 297/302 (98.34%), Query Frame = 0

Query: 2   QISSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLS 61
           +ISSVKLEDFYV+LLHRCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCGSLS
Sbjct: 36  EISSVKLEDFYVNLLHRCVQTSDSRHGSAIHAKFLKGFLPYSLFFHNHVLNFYVKCGSLS 95

Query: 62  CGLQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 121
           CGLQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC
Sbjct: 96  CGLQLFDEMPERNVVSWSAVIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHAC 155

Query: 122 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWN 181
           SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLL+ALEVFE+SSSKDIVSWN
Sbjct: 156 SLTQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLEALEVFENSSSKDIVSWN 215

Query: 182 AMMAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSG 241
           AMMAGYLQLSY ELPKFWRRMNLE+IKPDNFTFASILTGLAALSEFKLGLQVHG LVKSG
Sbjct: 216 AMMAGYLQLSYFELPKFWRRMNLEDIKPDNFTFASILTGLAALSEFKLGLQVHGLLVKSG 275

Query: 242 YGNDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLG 301
           YGNDICVGNSLCDMYIKNQKL DGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALE++ 
Sbjct: 276 YGNDICVGNSLCDMYIKNQKLLDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEVIY 335

Query: 302 EV 303
           ++
Sbjct: 336 DM 337

BLAST of Cp4.1LG09g11050 vs. ExPASy TrEMBL
Match: A0A0A0LVZ1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043090 PE=4 SV=1)

HSP 1 Score: 558 bits (1438), Expect = 2.07e-195
Identity = 269/300 (89.67%), Positives = 283/300 (94.33%), Query Frame = 0

Query: 4   SSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCG 63
           SSVKLEDFYV  L RCV TSDSRHGSAIHAKFLKGFLP SLFFHNHVLNFYVKCG LS G
Sbjct: 40  SSVKLEDFYVSFLQRCVLTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNFYVKCGRLSYG 99

Query: 64  LQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSL 123
           LQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSL
Sbjct: 100 LQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSL 159

Query: 124 TQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAM 183
           TQRLICSYQIYA ++RLGYGSN+FLMNAFLTALIRHEKLL+ALEVFES  SKD VSWNAM
Sbjct: 160 TQRLICSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAM 219

Query: 184 MAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYG 243
           MAGYLQL+Y ELPKFWRRMNLE++KPDNFTFASILTGLAALSEF+LGLQVHGQLVKSGYG
Sbjct: 220 MAGYLQLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYG 279

Query: 244 NDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEV 303
           NDICVGNSLCDMY+KNQKL DGFKAFDEM SSDVCSWTQMAAGCL CGEPMKALE++ E+
Sbjct: 280 NDICVGNSLCDMYVKNQKLLDGFKAFDEMSSSDVCSWTQMAAGCLQCGEPMKALEVIYEM 339

BLAST of Cp4.1LG09g11050 vs. ExPASy TrEMBL
Match: A0A1S4E4G2 (pentatricopeptide repeat-containing protein At2g13600-like OS=Cucumis melo OX=3656 GN=LOC103504641 PE=4 SV=1)

HSP 1 Score: 556 bits (1434), Expect = 8.38e-195
Identity = 268/300 (89.33%), Positives = 283/300 (94.33%), Query Frame = 0

Query: 4   SSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCG 63
           SSVKLEDFYV  L RCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLN Y+KCG LS G
Sbjct: 40  SSVKLEDFYVSFLQRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYLKCGRLSYG 99

Query: 64  LQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSL 123
           LQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSL
Sbjct: 100 LQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSL 159

Query: 124 TQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAM 183
           TQRLICSYQIYA ++RLGYGSN+FLMNAFLTALIRHEKLL+ALEVFES  SKD VSWNAM
Sbjct: 160 TQRLICSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAM 219

Query: 184 MAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYG 243
           MAGYLQL+Y ELPKFWRRMNLE++KPDNFTFASILTGLAALSEF+LGLQVHGQLVKSGYG
Sbjct: 220 MAGYLQLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYG 279

Query: 244 NDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEV 303
           NDICVGNSLCDMYIKNQKL DGFKAFDEM SSDVCSWTQMA+GCL CGEPMKALE++ E+
Sbjct: 280 NDICVGNSLCDMYIKNQKLLDGFKAFDEMSSSDVCSWTQMASGCLQCGEPMKALEVIYEM 339

BLAST of Cp4.1LG09g11050 vs. ExPASy TrEMBL
Match: A0A5D3BH26 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002660 PE=4 SV=1)

HSP 1 Score: 556 bits (1434), Expect = 5.39e-193
Identity = 268/300 (89.33%), Positives = 283/300 (94.33%), Query Frame = 0

Query: 4   SSVKLEDFYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCG 63
           SSVKLEDFYV  L RCVQTSDSRHGSAIHAKFLKGFLP SLFFHNHVLN Y+KCG LS G
Sbjct: 167 SSVKLEDFYVSFLQRCVQTSDSRHGSAIHAKFLKGFLPFSLFFHNHVLNLYLKCGRLSYG 226

Query: 64  LQLFDEMPERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSL 123
           LQLFDEMPERNVVSWSA+IAGFVQHGRPNEALSLF RMHCDGTI+PNEFTLVSALHACSL
Sbjct: 227 LQLFDEMPERNVVSWSAIIAGFVQHGRPNEALSLFGRMHCDGTIMPNEFTLVSALHACSL 286

Query: 124 TQRLICSYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAM 183
           TQRLICSYQIYA ++RLGYGSN+FLMNAFLTALIRHEKLL+ALEVFES  SKD VSWNAM
Sbjct: 287 TQRLICSYQIYAFIVRLGYGSNVFLMNAFLTALIRHEKLLEALEVFESCLSKDTVSWNAM 346

Query: 184 MAGYLQLSYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYG 243
           MAGYLQL+Y ELPKFWRRMNLE++KPDNFTFASILTGLAALSEF+LGLQVHGQLVKSGYG
Sbjct: 347 MAGYLQLAYFELPKFWRRMNLESVKPDNFTFASILTGLAALSEFRLGLQVHGQLVKSGYG 406

Query: 244 NDICVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEV 303
           NDICVGNSLCDMYIKNQKL DGFKAFDEM SSDVCSWTQMA+GCL CGEPMKALE++ E+
Sbjct: 407 NDICVGNSLCDMYIKNQKLLDGFKAFDEMSSSDVCSWTQMASGCLQCGEPMKALEVIYEM 466

BLAST of Cp4.1LG09g11050 vs. TAIR 10
Match: AT2G33680.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 174.9 bits (442), Expect = 1.0e-43
Identity = 98/278 (35.25%), Positives = 157/278 (56.47%), Query Frame = 0

Query: 28  GSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMPERNVVSWSALIAGFVQ 87
           G  IH   +K  L   +   N ++  Y KC SL+   ++FD   +RN ++WSA++ G+ Q
Sbjct: 240 GRQIHCITIKNGLLGFVALSNALVTMYSKCESLNEACKMFDSSGDRNSITWSAMVTGYSQ 299

Query: 88  HGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQIYALVLRLGYGSNIF 147
           +G   EA+ LFSRM   G I P+E+T+V  L+ACS    L    Q+++ +L+LG+  ++F
Sbjct: 300 NGESLEAVKLFSRMFSAG-IKPSEYTIVGVLNACSDICYLEEGKQLHSFLLKLGFERHLF 359

Query: 148 LMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQLS-YLELPKFWRRMNLEN 207
              A +    +   L DA + F+    +D+  W ++++GY+Q S   E    +RRM    
Sbjct: 360 ATTALVDMYAKAGCLADARKGFDCLQERDVALWTSLISGYVQNSDNEEALILYRRMKTAG 419

Query: 208 IKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVGNSLCDMYIKNQKLFDGF 267
           I P++ T AS+L   ++L+  +LG QVHG  +K G+G ++ +G++L  MY K   L DG 
Sbjct: 420 IIPNDPTMASVLKACSSLATLELGKQVHGHTIKHGFGLEVPIGSALSTMYSKCGSLEDGN 479

Query: 268 KAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEVI 305
             F   P+ DV SW  M +G  H G+  +ALEL  E++
Sbjct: 480 LVFRRTPNKDVVSWNAMISGLSHNGQGDEALELFEEML 516

BLAST of Cp4.1LG09g11050 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 173.3 bits (438), Expect = 2.9e-43
Identity = 93/274 (33.94%), Positives = 152/274 (55.47%), Query Frame = 0

Query: 11  FYVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEM 70
           FY  LL +C        G  +HA  L+      +   N +LN Y KCGSL    ++F++M
Sbjct: 62  FYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKM 121

Query: 71  PERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICS 130
           P+R+ V+W+ LI+G+ QH RP +AL  F++M   G   PNEFTL S + A +  +R  C 
Sbjct: 122 PQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFG-YSPNEFTLSSVIKAAAAERRGCCG 181

Query: 131 YQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQL 190
           +Q++   ++ G+ SN+ + +A L    R+  + DA  VF++  S++ VSWNA++AG+ + 
Sbjct: 182 HQLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARR 241

Query: 191 SYLELP-KFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVG 250
           S  E   + ++ M  +  +P +F++AS+    ++    + G  VH  ++KSG       G
Sbjct: 242 SGTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAG 301

Query: 251 NSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQM 284
           N+L DMY K+  + D  K FD +   DV SW  +
Sbjct: 302 NTLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSL 334

BLAST of Cp4.1LG09g11050 vs. TAIR 10
Match: AT3G15130.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 167.9 bits (424), Expect = 1.2e-41
Identity = 98/293 (33.45%), Positives = 155/293 (52.90%), Query Frame = 0

Query: 13  VDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMPE 72
           V +L  C +   S  G  +H   LK     +L   N++++ Y KC       ++FD MPE
Sbjct: 10  VSILRVCTRKGLSDQGGQVHCYLLKSGSGLNLITSNYLIDMYCKCREPLMAYKVFDSMPE 69

Query: 73  RNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSYQ 132
           RNVVSWSAL++G V +G    +LSLFS M   G I PNEFT  + L AC L   L    Q
Sbjct: 70  RNVVSWSALMSGHVLNGDLKGSLSLFSEMGRQG-IYPNEFTFSTNLKACGLLNALEKGLQ 129

Query: 133 IYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQLSY 192
           I+   L++G+   + + N+ +    +  ++ +A +VF     + ++SWNAM+AG++   Y
Sbjct: 130 IHGFCLKIGFEMMVEVGNSLVDMYSKCGRINEAEKVFRRIVDRSLISWNAMIAGFVHAGY 189

Query: 193 ----LELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGY--GNDI 252
               L+     +  N++  +PD FT  S+L   ++      G Q+HG LV+SG+   +  
Sbjct: 190 GSKALDTFGMMQEANIKE-RPDEFTLTSLLKACSSTGMIYAGKQIHGFLVRSGFHCPSSA 249

Query: 253 CVGNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEL 300
            +  SL D+Y+K   LF   KAFD++    + SW+ +  G    GE ++A+ L
Sbjct: 250 TITGSLVDLYVKCGYLFSARKAFDQIKEKTMISWSSLILGYAQEGEFVEAMGL 300

BLAST of Cp4.1LG09g11050 vs. TAIR 10
Match: AT3G13770.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 164.5 bits (415), Expect = 1.4e-40
Identity = 101/291 (34.71%), Positives = 155/291 (53.26%), Query Frame = 0

Query: 12  YVDLLHRCVQTSDSRHGSAIHAKFLK-GFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEM 71
           Y  LL+ C+     R G  +HA  +K  +LP + +    +L FY KC  L    ++ DEM
Sbjct: 55  YDALLNACLDKRALRDGQRVHAHMIKTRYLPAT-YLRTRLLIFYGKCDCLEDARKVLDEM 114

Query: 72  PERNVVSWSALIAGFVQHGRPNEALSLFSR-MHCDGTIIPNEFTLVSALHACSLTQRLIC 131
           PE+NVVSW+A+I+ + Q G  +EAL++F+  M  DG   PNEFT  + L +C     L  
Sbjct: 115 PEKNVVSWTAMISRYSQTGHSSEALTVFAEMMRSDGK--PNEFTFATVLTSCIRASGLGL 174

Query: 132 SYQIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQ 191
             QI+ L+++  Y S+IF+ ++ L    +  ++ +A E+FE    +D+VS  A++AGY Q
Sbjct: 175 GKQIHGLIVKWNYDSHIFVGSSLLDMYAKAGQIKEAREIFECLPERDVVSCTAIIAGYAQ 234

Query: 192 LSY-LELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICV 251
           L    E  + + R++ E + P+  T+AS+LT L+ L+    G Q H  +++        +
Sbjct: 235 LGLDEEALEMFHRLHSEGMSPNYVTYASLLTALSGLALLDHGKQAHCHVLRRELPFYAVL 294

Query: 252 GNSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALEL 300
            NSL DMY K   L    + FD MP     SW  M  G    G   + LEL
Sbjct: 295 QNSLIDMYSKCGNLSYARRLFDNMPERTAISWNAMLVGYSKHGLGREVLEL 342

BLAST of Cp4.1LG09g11050 vs. TAIR 10
Match: AT3G53360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 162.5 bits (410), Expect = 5.2e-40
Identity = 87/294 (29.59%), Positives = 152/294 (51.70%), Query Frame = 0

Query: 12  YVDLLHRCVQTSDSRHGSAIHAKFLKGFLPCSLFFHNHVLNFYVKCGSLSCGLQLFDEMP 71
           Y+ L+  C  +     G  IH   L          +NH+L+ Y KCGSL    ++FD MP
Sbjct: 70  YISLICACSSSRSLAQGRKIHDHILNSNCKYDTILNNHILSMYGKCGSLRDAREVFDFMP 129

Query: 72  ERNVVSWSALIAGFVQHGRPNEALSLFSRMHCDGTIIPNEFTLVSALHACSLTQRLICSY 131
           ERN+VS++++I G+ Q+G+  EA+ L+ +M     ++P++F   S + AC+ +  +    
Sbjct: 130 ERNLVSYTSVITGYSQNGQGAEAIRLYLKM-LQEDLVPDQFAFGSIIKACASSSDVGLGK 189

Query: 132 QIYALVLRLGYGSNIFLMNAFLTALIRHEKLLDALEVFESSSSKDIVSWNAMMAGYLQL- 191
           Q++A V++L   S++   NA +   +R  ++ DA  VF     KD++SW++++AG+ QL 
Sbjct: 190 QLHAQVIKLESSSHLIAQNALIAMYVRFNQMSDASRVFYGIPMKDLISWSSIIAGFSQLG 249

Query: 192 -SYLELPKFWRRMNLENIKPDNFTFASILTGLAALSEFKLGLQVHGQLVKSGYGNDICVG 251
             +  L      ++     P+ + F S L   ++L     G Q+HG  +KS    +   G
Sbjct: 250 FEFEALSHLKEMLSFGVFHPNEYIFGSSLKACSSLLRPDYGSQIHGLCIKSELAGNAIAG 309

Query: 252 NSLCDMYIKNQKLFDGFKAFDEMPSSDVCSWTQMAAGCLHCGEPMKALELLGEV 304
            SLCDMY +   L    + FD++   D  SW  + AG  + G   +A+ +  ++
Sbjct: 310 CSLCDMYARCGFLNSARRVFDQIERPDTASWNVIIAGLANNGYADEAVSVFSQM 362
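
The Identity and Positives percentages in the HSP summary lines above are match counts divided by the alignment length. As a minimal sketch, they can be recomputed from the counts as follows. The helper name `parse_hsp_summary` and the regular expression are illustrative assumptions and are not part of any BLAST tool.

```python
import re

# Hypothetical pattern for one HSP summary line as printed in the reports above.
# "Posi?tives" also accepts the "Postives" spelling that some report renderings use.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Extract the counts and recompute both percentages from them."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        # Percentage = matches / alignment length, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

line = "Identity = 101/291 (34.71%), Positives = 155/291 (53.26%), Query Frame = 0"
print(parse_hsp_summary(line))
```

For the AT3G13770.1 hit this gives 34.71 and 53.26, matching the reported values.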

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
P93005          1.4e-42    35.25     Pentatricopeptide repeat-containing protein At2g33680 OS=Arabidopsis thaliana OX...
Q9LIQ7          4.1e-42    33.94     Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
P0C898          1.7e-40    33.45     Putative pentatricopeptide repeat-containing protein At3g15130 OS=Arabidopsis th...
Q9LIC3          1.9e-39    34.71     Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial OS...
Q9LFI1          7.3e-39    29.59     Pentatricopeptide repeat-containing protein At3g53360, mitochondrial OS=Arabidop...

Match Name      E-value    Identity  Description
XP_023542233.1  2.72e-215  98.01     pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp...
KAG6572987.1    9.83e-211  96.03     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG7012171.1    2.81e-210  96.03     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022954723.1  6.54e-209  95.70     pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata]
XP_022994939.1  5.34e-208  95.03     pentatricopeptide repeat-containing protein At4g33170-like [Cucurbita maxima]

Match Name      E-value    Identity  Description
A0A6J1GRX4      3.17e-209  95.70     pentatricopeptide repeat-containing protein At2g13600-like OS=Cucurbita moschata...
A0A6J1K498      2.58e-208  95.03     pentatricopeptide repeat-containing protein At4g33170-like OS=Cucurbita maxima O...
A0A0A0LVZ1      2.07e-195  89.67     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043090 PE=4 SV=1
A0A1S4E4G2      8.38e-195  89.33     pentatricopeptide repeat-containing protein At2g13600-like OS=Cucumis melo OX=36...
A0A5D3BH26      5.39e-193  89.33     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name      E-value    Identity  Description
AT2G33680.1     1.0e-43    35.25     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G24000.1     2.9e-43    33.94     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G15130.1     1.2e-41    33.45     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G13770.1     1.4e-40    34.71     Pentatricopeptide repeat (PPR) superfamily protein
AT3G53360.1     5.2e-40    29.59     Tetratricopeptide repeat (TPR)-like superfamily protein
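
As a minimal sketch of how the summary rows above could be screened, the snippet below filters the five TAIR 10 matches (the rows with AT... match names) by E-value and percent identity. The function name `filter_hits` and both cutoffs are assumptions chosen only for illustration; they are not thresholds used by this database.

```python
# Rows copied from the TAIR 10 summary table: (match name, E-value, percent identity).
tair_hits = [
    ("AT2G33680.1", "1.0e-43", 35.25),
    ("AT3G24000.1", "2.9e-43", 33.94),
    ("AT3G15130.1", "1.2e-41", 33.45),
    ("AT3G13770.1", "1.4e-40", 34.71),
    ("AT3G53360.1", "5.2e-40", 29.59),
]

def filter_hits(hits, max_evalue, min_identity):
    """Keep the names of hits whose E-value and identity both pass the cutoffs."""
    return [
        name
        for name, evalue, identity in hits
        # float() parses scientific notation such as "1.0e-43" directly.
        if float(evalue) <= max_evalue and identity >= min_identity
    ]

# Arbitrary illustrative cutoffs, not recommended values.
print(filter_hits(tair_hits, max_evalue=1e-40, min_identity=30.0))
```

With these cutoffs only the first three matches are kept, because the last two have E-values above 1e-40.
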
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                    Source   Source Term       Source Description                                       Alignment
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                                                      coord: 279..302; e-value: 0.14; score: 12.5
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                                coord: 76..106; e-value: 5.0E-7; score: 27.5
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756                                                coord: 47..76; e-value: 9.0E-4; score: 17.3
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                                                    coord: 73..121; e-value: 1.2E-8; score: 35.1
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                                                      coord: 74..108; score: 10.796938
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10        Tetratricopeptide repeat domain                          coord: 4..132; e-value: 3.4E-21; score: 77.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   1.25.40.10        Tetratricopeptide repeat domain                          coord: 191..304; e-value: 7.0E-16; score: 60.5
None       No IPR available                                   PANTHER  PTHR24015         OS07G0578800 PROTEIN-RELATED                             coord: 5..302
None       No IPR available                                   PANTHER  PTHR24015:SF1725  TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN  coord: 5..302
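
Several of the IPR002885 (pentatricopeptide repeat) hits listed above overlap one another. As a minimal sketch, the coordinates can be merged into non-overlapping regions to see which stretches of the protein carry at least one PPR hit. The function name `merge_intervals` is an illustrative assumption, and coordinates are treated as inclusive residue positions.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs into sorted, non-overlapping regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# Coordinates of the five IPR002885 hits (PFAM, TIGRFAM, PROSITE) from the table.
ppr_hits = [(279, 302), (76, 106), (47, 76), (73, 121), (74, 108)]

regions = merge_intervals(ppr_hits)
print(regions)  # [(47, 121), (279, 302)]

# Inclusive length of each merged region.
print([end - start + 1 for start, end in regions])  # [75, 24]
```

The four hits between residues 47 and 121 collapse into one region, while the weaker PF01535 hit at 279..302 stays separate.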

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG09g11050.1  Cp4.1LG09g11050.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009451      RNA modification
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003723      RNA binding