Cp4.1LG20g02840 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g02840
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein, putative
Location: Cp4.1LG20 : 1573937 .. 1576172 (-)
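
The location field above can be unpacked programmatically. The following is a minimal sketch, not part of the database itself: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page).

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG20 : 1573937 .. 1576172 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG20 - 2236
```

Under that assumption the gene spans 2236 bp on the minus strand of Cp4.1LG20.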
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGACGATAAGGCAACGGCTTCTTCCCCTCCACCGTCGTCCACTTCCACCGCAGCTCGTTCACGATCAAATTCCGGCCATTGGTACTCCATTTTCCTCTTCCTCCTCCTCCTCCGTTTATGATTCCGCAGCCAGTAAAATCAATACGACTCTTACCCAAGACGAGCTCACGAAGATCAGCCTCCTTCTTCCTCGTCTCTGCCTCGACAACCACCACTCCACCGCCATCACACTTCTGGATGCGGCGCTCCTCACTAATCTTTCACTTCAATCGCTTCCCCTCTCGATTCTTTCCCATTCCCTTGCTTCCCAATCCGATTTCGCCCTCACGATGTCCCTCCTCACCCGTCTCAATCACCATCGGAATGCGCTTCTCTATTCAAGCCCTATTATCACCATGCTTATTTTCTCTTATTGTAAGCAGCGGAAATTTAAGGAGGCCTTGAAGATTTTCCATTGGATGCTAAGACCAGGGTCGCCATGTAAGCCGGATGAGAGGGTTTATAAAACCCTAATCGCGGGACTGTGTAGGAAGGGTATGGTACTTGATGCTTTGAAGATTCTGAAGAACATGATTGACTCGAATTTGGTTCCGGATTACGATCTGAGGAGTTGGGTTTTTAGGTGTTTGCTGAGAGAAGCGATGATCGGCGAAGCAACAGAGTTGAGTGAAACTTTGAATTCCATTGGCGATCGGAATACCAGTGAGCATCTGAAAAAGGTGTCGGAATTGTTGGACGGTATCATCAACAATTGGATTGAATAGAGAAGATCTTCTGGTAAGCCATTTGCTCTCTGTGACTCAATTATGTTATTACTACCTCGAAATTTGATGATATTCAGAATAAACTTGATGCTTGATTCGCATGGTTCTTTCTTGTGTTTTTTAGTTCATTCCAATCTATCAATTATCAATTATCAATTATCAATTAAAGGGCAAGCTTCTAAACCACATCATTCTTCTTGCTTGTCTATCTGCCTGAACCAGTGCATTAGTTTGTGTTGTGACATCTATGATACATTTAACTTAGAAATGTGTCACAACTCAAGTCCCAAGTCCACCGTGACGGTAGGTGTTGTCATCTTTAAACTTTCCATTTTGAGCTTTCCTTCAAGGTTTTTGAAACGCATCTCCTAGGAAGAGGTTTCCACACCCTTATAAAGAATGTTTCGTCCTCACTGGCACACCGCCCGATGTATGTCTGGCTCTGATACCATTTGTAATGACCTAAACCCACCGCTTGCAGATATTTTCTCTTTAGGCTTTTCCTTACGGACTTCCCTTCAAGATTTTTAAAACGTGTCTACTAGGGAGTCTACTACTAAGGAGAGGTTTCCACACCCTCGTAAAACCCCTTGCCAGACATCAAGGGTTGTGCCTCAAAGAGGGTGGGGATTGTGAGATTCCACGTCGGTTAAAAATGAGTCTTACAAGGATACGGAAAACTCTCCTTAGTAGACACGTTTTTAAAACTTATATGGATGCACATCATGGAAAGCTCAAAGCTTTAACAACATATTAAAGGATTGTTCTATGTTTACCCGTCAGCTTTCCCCATATAAACAAGATGCTTAAGCTGATGGGTTTTAGTAAATTTAATATGATGTCTGAGTAAGTTTTTTGTTCGAACATCTATTCTGTCATTGAGCGAAGAAAAGCGTTCTTTAACCTATCAAGTTAAGCTTTTAGATCGTAGTTTCTTTTGTTATATGGCGGGAAGTTAATCGGAGAGTTCAAAATCTTGTACTTTCAGGTTCCTTCGTTGGAGAAGGAAGAGAAATGTTGGTGGTTATTGGTGTCTTCAGTTCACAGGGACAAGAAAGCCATTGACACTTCCTATAGTCTTCAGAAATCTTCAGAGCAGCAGAAGGCTTGAATGAACATCAAGTTCATATCAAATGCTATTAACTCGAAGAAACTGCCATAGAAATTGAGGTCATAATGTAACATAATGCTGTTTTGGGTTTATCTACACAATTTTTTTTTCTCACGAATTT
TTCGAGCGGAGTCTACCGCTAGCAGATATTTTGGCCCAGCAGATATTATTTATTTTGTCCATTTTGGCCCATTACGTATCACCGTAAGCCTCATGATTTTAAAACGCGTCTATTAGGGAGAGATTTTCACACCCTTATAATGCTTCGTTCTTCTCTTCAACTAGTGTGAGATCTCACGGTTTAGTGTCATTATCTTGGATTTGAGCATTTTATAAGAAATGTTTCGTTCCTCTCTT

mRNA sequence

ATGAAGACGATAAGGCAACGGCTTCTTCCCCTCCACCGTCGTCCACTTCCACCGCAGCTCGTTCACGATCAAATTCCGGCCATTGGTACTCCATTTTCCTCTTCCTCCTCCTCCTCCGTTTATGATTCCGCAGCCAGTAAAATCAATACGACTCTTACCCAAGACGAGCTCACGAAGATCAGCCTCCTTCTTCCTCGTCTCTGCCTCGACAACCACCACTCCACCGCCATCACACTTCTGGATGCGGCGCTCCTCACTAATCTTTCACTTCAATCGCTTCCCCTCTCGATTCTTTCCCATTCCCTTGCTTCCCAATCCGATTTCGCCCTCACGATGTCCCTCCTCACCCGTCTCAATCACCATCGGAATGCGCTTCTCTATTCAAGCCCTATTATCACCATGCTTATTTTCTCTTATTGTAAGCAGCGGAAATTTAAGGAGGCCTTGAAGATTTTCCATTGGATGCTAAGACCAGGGTCGCCATGTAAGCCGGATGAGAGGGTTTATAAAACCCTAATCGCGGGACTGTGTAGGAAGGGTATGGTACTTGATGCTTTGAAGATTCTGAAGAACATGATTGACTCGAATTTGGTTCCGGATTACGATCTGAGGAGTTGGGTTTTTAGGTGTTTGCTGAGAGAAGCGATGATCGGCGAAGCAACAGAGTTGAGTGAAACTTTGAATTCCATTGGCGATCGGAATACCAGTGAGCATCTGAAAAAGGTTCCTTCGTTGGAGAAGGAAGAGAAATGTTGGTGGTTATTGGTGTCTTCAGTTCACAGGGACAAGAAAGCCATTGACACTTCCTATAGTCTTCAGAAATCTTCAGAGCAGCAGAAGGCTTGAATGAACATCAAGTTCATATCAAATGCTATTAACTCGAAGAAACTGCCATAGAAATTGAGGTCATAATGTAACATAATGCTGTTTTGGGTTTATCTACACAATTTTTTTTTCTCACGAATTTTTCGAGCGGAGTCTACCGCTAGCAGATATTTTGGCCCAGCAGATATTATTTATTTTGTCCATTTTGGCCCATTACGTATCACCGTAAGCCTCATGATTTTAAAACGCGTCTATTAGGGAGAGATTTTCACACCCTTATAATGCTTCGTTCTTCTCTTCAACTAGTGTGAGATCTCACGGTTTAGTGTCATTATCTTGGATTTGAGCATTTTATAAGAAATGTTTCGTTCCTCTCTT

Coding sequence (CDS)

ATGAAGACGATAAGGCAACGGCTTCTTCCCCTCCACCGTCGTCCACTTCCACCGCAGCTCGTTCACGATCAAATTCCGGCCATTGGTACTCCATTTTCCTCTTCCTCCTCCTCCTCCGTTTATGATTCCGCAGCCAGTAAAATCAATACGACTCTTACCCAAGACGAGCTCACGAAGATCAGCCTCCTTCTTCCTCGTCTCTGCCTCGACAACCACCACTCCACCGCCATCACACTTCTGGATGCGGCGCTCCTCACTAATCTTTCACTTCAATCGCTTCCCCTCTCGATTCTTTCCCATTCCCTTGCTTCCCAATCCGATTTCGCCCTCACGATGTCCCTCCTCACCCGTCTCAATCACCATCGGAATGCGCTTCTCTATTCAAGCCCTATTATCACCATGCTTATTTTCTCTTATTGTAAGCAGCGGAAATTTAAGGAGGCCTTGAAGATTTTCCATTGGATGCTAAGACCAGGGTCGCCATGTAAGCCGGATGAGAGGGTTTATAAAACCCTAATCGCGGGACTGTGTAGGAAGGGTATGGTACTTGATGCTTTGAAGATTCTGAAGAACATGATTGACTCGAATTTGGTTCCGGATTACGATCTGAGGAGTTGGGTTTTTAGGTGTTTGCTGAGAGAAGCGATGATCGGCGAAGCAACAGAGTTGAGTGAAACTTTGAATTCCATTGGCGATCGGAATACCAGTGAGCATCTGAAAAAGGTTCCTTCGTTGGAGAAGGAAGAGAAATGTTGGTGGTTATTGGTGTCTTCAGTTCACAGGGACAAGAAAGCCATTGACACTTCCTATAGTCTTCAGAAATCTTCAGAGCAGCAGAAGGCTTGA

Protein sequence

MKTIRQRLLPLHRRPLPPQLVHDQIPAIGTPFSSSSSSSVYDSAASKINTTLTQDELTKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQSDFALTMSLLTRLNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIGDRNTSEHLKKVPSLEKEEKCWWLLVSSVHRDKKAIDTSYSLQKSSEQQKA
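
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database. Only the first 33 nucleotides of the CDS are used here to keep the example short.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 33 nt of the CDS listed above.
cds_start = "ATGAAGACGATAAGGCAACGGCTTCTTCCCCTC"
print(translate(cds_start))  # MKTIRQRLLPL
```

The output matches the first eleven residues of the protein sequence. Passing the full CDS string to `translate` should reproduce the complete protein in the same way.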
BLAST of Cp4.1LG20g02840 vs. Swiss-Prot
Match: PP444_ARATH (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 1.3e-07
Identity = 32/89 (35.96%), Positives = 50/89 (56.18%), Query Frame = 1

Query: 135 LIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMID 194
           LI ++CK+ +  EA++IF  M R G  CKPD   + +LI+GLC    +  AL +L++MI 
Sbjct: 465 LISAFCKEHRIPEAVEIFREMPRKG--CKPDVYTFNSLISGLCEVDEIKHALWLLRDMIS 524

Query: 195 SNLVPDYDLRSWVFRCLLREAMIGEATEL 224
             +V +    + +    LR   I EA +L
Sbjct: 525 EGVVANTVTYNTLINAFLRRGEIKEARKL 551
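
The percentages in each HSP summary line are simple ratios over the alignment length. As a hedged illustration (the helper name `hsp_percentages` is hypothetical and not part of BLAST), the sketch below recomputes them from the two fractions in the line, relying only on the `matches/length` pattern rather than on the field labels.

```python
import re

def hsp_percentages(stats_line):
    """Recompute identity and positive percentages from an HSP stats line.

    The first 'n/m' fraction is taken as identities and the second as
    positives (identical plus conservatively substituted residues).
    """
    fractions = re.findall(r"(\d+)/(\d+)", stats_line)
    if len(fractions) < 2:
        raise ValueError("expected two 'matches/length' fractions")
    (ident, length), (positive, _) = fractions[0], fractions[1]
    length = int(length)
    return (
        round(100 * int(ident) / length, 2),
        round(100 * int(positive) / length, 2),
    )

line = "Identity = 32/89 (35.96%), Positives = 50/89 (56.18%), Query Frame = 1"
print(hsp_percentages(line))  # (35.96, 56.18)
```

For the hit above, 32 identical and 50 positive positions over an 89-column alignment give 35.96% and 56.18%, in agreement with the reported values.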

BLAST of Cp4.1LG20g02840 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 3.0e-07
Identity = 33/97 (34.02%), Positives = 52/97 (53.61%), Query Frame = 1

Query: 135 LIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMID 194
           L+   CK    K A++I   ML+ G    PD   Y ++I+GLC+ G V +A+++L  MI 
Sbjct: 301 LVNGLCKAGHVKHAIEIMDVMLQEGYD--PDVYTYNSVISGLCKLGEVKEAVEVLDQMIT 360

Query: 195 SNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIG 232
            +  P+    + +   L +E  + EATEL+  L S G
Sbjct: 361 RDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKG 395

BLAST of Cp4.1LG20g02840 vs. Swiss-Prot
Match: PP327_ARATH (Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana GN=EMB1025 PE=3 SV=1)

HSP 1 Score: 57.4 bits (137), Expect = 3.0e-07
Identity = 46/172 (26.74%), Positives = 81/172 (47.09%), Query Frame = 1

Query: 63  LLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQ---SDFALTMSLLTRLN 122
           L+  LCL      A++LL+  + +      +    L + L  Q   +D    +S +    
Sbjct: 298 LIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERG 357

Query: 123 HHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRK 182
           +H N  +YS     +LI    K+ K +EA+ ++  M   G  CKP+  VY  L+ GLCR+
Sbjct: 358 YHLNQHIYS-----VLISGLFKEGKAEEAMSLWRKMAEKG--CKPNIVVYSVLVDGLCRE 417

Query: 183 GMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIG 232
           G   +A +IL  MI S  +P+    S + +   +  +  EA ++ + ++  G
Sbjct: 418 GKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTG 462

BLAST of Cp4.1LG20g02840 vs. Swiss-Prot
Match: PP407_ARATH (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 1.5e-06
Identity = 34/101 (33.66%), Positives = 53/101 (52.48%), Query Frame = 1

Query: 123 NALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMV 182
           + + YSS     LI  +C+QR+ KEA  ++  MLR G P  PDE  Y  LI   C +G +
Sbjct: 484 DTITYSS-----LIQGFCEQRRTKEACDLYEEMLRVGLP--PDEFTYTALINAYCMEGDL 543

Query: 183 LDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATEL 224
             AL++   M++  ++PD    S +   L +++   EA  L
Sbjct: 544 EKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRL 577

BLAST of Cp4.1LG20g02840 vs. Swiss-Prot
Match: PP306_ARATH (Pentatricopeptide repeat-containing protein At4g11690 OS=Arabidopsis thaliana GN=At4g11690 PE=2 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 2.5e-06
Identity = 46/181 (25.41%), Positives = 81/181 (44.75%), Query Frame = 1

Query: 57  LTKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQSDFALTMSLLT 116
           L   ++L+   C     S A  ++       +    +  +IL  + A   +    + L  
Sbjct: 373 LVTYNILVSGFCRKGDTSGAAKMVKEMEERGIKPSKVTYTILIDTFARSDNMEKAIQL-- 432

Query: 117 RLNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGL 176
           RL+     L+      ++LI  +C + +  EA ++F  M+     C+P+E +Y T+I G 
Sbjct: 433 RLSMEELGLVPDVHTYSVLIHGFCIKGQMNEASRLFKSMVEKN--CEPNEVIYNTMILGY 492

Query: 177 CRKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIG-DRNT 236
           C++G    ALK+LK M +  L P+     ++   L +E    EA  L E +   G D +T
Sbjct: 493 CKEGSSYRALKLLKEMEEKELAPNVASYRYMIEVLCKERKSKEAERLVEKMIDSGIDPST 549

BLAST of Cp4.1LG20g02840 vs. TrEMBL
Match: A0A0A0LRQ9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G051900 PE=4 SV=1)

HSP 1 Score: 335.9 bits (860), Expect = 4.8e-89
Identity = 181/248 (72.98%), Positives = 206/248 (83.06%), Query Frame = 1

Query: 1   MKTIRQRLLPLHRRPLPPQLVHDQIPAIGTPFSSSSS---SSVYDSAASKINTTLTQDEL 60
           MK IRQ+LL L R PL P+L+  QIPAI +PFSSSSS    S   S A+K NTTLT DEL
Sbjct: 1   MKRIRQQLLTLQRLPLSPELLRFQIPAILSPFSSSSSFISDSPSASIATKPNTTLTHDEL 60

Query: 61  TKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQSDFALTMSLLTR 120
           T+I+LLLPRLCL NH STAI+LL A LLTN SL SL LS+LSHSLASQSDFALTMSLLTR
Sbjct: 61  TRINLLLPRLCLHNHLSTAISLLHATLLTNPSLHSLSLSVLSHSLASQSDFALTMSLLTR 120

Query: 121 LNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLC 180
           L HH NALLYS+PI+TMLI SYCK+RK KEALK+FHWMLRPGSPCKP+ERVYKTLIAGL 
Sbjct: 121 LKHHPNALLYSTPIVTMLISSYCKRRKSKEALKLFHWMLRPGSPCKPEERVYKTLIAGLY 180

Query: 181 RKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIGDRNTSE 240
           RKGM  DALK+L+NMIDSNLVPD DLR+WVFR LL+EAMI EA E ++ LN +GD+NT +
Sbjct: 181 RKGMTFDALKVLRNMIDSNLVPDCDLRNWVFRSLLKEAMIPEAMEFNDALNFVGDQNTID 240

Query: 241 HLKKVPSL 246
           HL++V  L
Sbjct: 241 HLRRVSEL 248

BLAST of Cp4.1LG20g02840 vs. TrEMBL
Match: M5WIJ3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010317mg PE=4 SV=1)

HSP 1 Score: 204.5 bits (519), Expect = 1.7e-49
Identity = 121/225 (53.78%), Positives = 154/225 (68.44%), Query Frame = 1

Query: 30  TPFSSSSSSSVYDSAASKINT------TLTQDELTKISLLLPRLCLDNHHSTAITLLDAA 89
           TP + S+S+S  DS  +   T      +LTQ+E TKI+LLLPRLCL NH  TA  L   A
Sbjct: 20  TPRTFSTSTSAIDSITAPKPTNQTQTQSLTQEEHTKINLLLPRLCLLNHLDTATHLTITA 79

Query: 90  LLTNLSLQSLPLSILSHSLASQSDFALTMSLLTRLNHHRNALLYSSPIITMLIFSYCKQR 149
           LLTN  L+SL LSIL HS  SQ D A  MSLLTRL H+  +  Y +PI TM I SY K+ 
Sbjct: 80  LLTNPPLKSLSLSILIHSFTSQPDMARPMSLLTRLRHNPPSHPYLTPITTMFIASYFKKN 139

Query: 150 KFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMIDSNLVPDYDL 209
           K KEALK+F+W++RPGSPC  DERV + L+ G C+ GMVL+ALK+L+ M+ +N+VP  DL
Sbjct: 140 KPKEALKMFNWLVRPGSPCVLDERVCEVLVNGFCKNGMVLEALKVLRAMLSTNIVPGCDL 199

Query: 210 RSWVFRCLLREAMIGEATELSETLNSIGDR---NTSEHLKKVPSL 246
           + WV++ LLREA I EA EL+E L  +GDR   + SE +KKV +L
Sbjct: 200 KKWVYKVLLREARIKEAVELNEALGCVGDREKGDESECVKKVLAL 244

BLAST of Cp4.1LG20g02840 vs. TrEMBL
Match: A0A067LG57_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26667 PE=4 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 1.3e-41
Identity = 105/213 (49.30%), Positives = 139/213 (65.26%), Query Frame = 1

Query: 33  SSSSSSSVYDSAASKINTTLTQDELTKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQS 92
           +SSSSS +  S  S++  +LTQ ELTKI+LL+PRLCL +H +TAI L   +LLTN   +S
Sbjct: 32  ASSSSSEIEKSKNSEL--SLTQQELTKINLLIPRLCLSDHLTTAIHLTTTSLLTNPPQKS 91

Query: 93  LPLSILSHSLASQSDFALTMSLLTRLNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIF 152
           +  SIL H L SQ D A +MS LT L H      + +PI TMLI SY K+R+ KEALK++
Sbjct: 92  ISFSILIHFLTSQPDMAKSMSFLTILRHTPQVHCHLTPITTMLITSYVKKRRPKEALKVY 151

Query: 153 HWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLL 212
            WM RPGSPCK +  VY+ L+   C  G+VL+ L+ILK+M+    VP   LR  V+R LL
Sbjct: 152 QWMQRPGSPCKVERIVYEVLVNRFCGFGLVLEGLRILKDMVAVGFVPKNGLRRTVYRSLL 211

Query: 213 REAMIGEATELSETLNSIGDRNTSEHLKKVPSL 246
           REA +G+A EL+E L    + +  E +KKV  L
Sbjct: 212 REARVGKAVELNEALYGCFEDDNGEGVKKVREL 242

BLAST of Cp4.1LG20g02840 vs. TrEMBL
Match: B9GN11_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s12210g PE=4 SV=2)

HSP 1 Score: 174.5 bits (441), Expect = 1.8e-40
Identity = 101/203 (49.75%), Positives = 135/203 (66.50%), Query Frame = 1

Query: 43  SAASKINTTLTQDELTKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSL 102
           S  S++ TTLTQ+E+TKI+LL+PRLCL NH +TAI L+  +LL N   +SL  SIL+HSL
Sbjct: 46  STNSEVVTTLTQEEVTKINLLIPRLCLLNHLTTAIQLITTSLLANPPPKSLSFSILTHSL 105

Query: 103 ASQSDFALTMSLLTRLNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPC 162
            SQ D    MSLLT L H   A  + SP+ TMLI SY K+++ KEALK+++WMLRPGSPC
Sbjct: 106 TSQPDMTKPMSLLTILRHTPQAHSHLSPMNTMLITSYIKKKRPKEALKVYNWMLRPGSPC 165

Query: 163 KPDERVYKTLIAGLCRKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATE 222
           K ++ V+  L+ GLC  G VL+ LK+LK+M+    +P   L+  V+R LL EA + EA E
Sbjct: 166 KVEKIVFCVLVNGLCEIGWVLEGLKVLKDMVSVGFLPIGGLKERVYRSLLSEARVKEAVE 225

Query: 223 LSETLNSIGDRNTSEHLKKVPSL 246
           L + L    +  + E  KKV  L
Sbjct: 226 LDKALCDCFEDVSGEGGKKVIDL 248

BLAST of Cp4.1LG20g02840 vs. TrEMBL
Match: A0A061DVU2_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_005982 PE=4 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 4.7e-36
Identity = 100/219 (45.66%), Positives = 134/219 (61.19%), Query Frame = 1

Query: 16  LPPQLVHDQIPAIGTPFSSSSSS-----SVYDSAASKINTTLTQDELTKISLLLPRLCLD 75
           LPP       P+    FS S S+       Y +   K   TL+Q++++KI+LL+PRLCL 
Sbjct: 13  LPPIAASKTQPSKQQIFSYSFSAIPNLTDTYLNTRPKNFPTLSQEQVSKINLLIPRLCLS 72

Query: 76  NHHSTAITLLDAALLTNLSL--QSLPLSILSHSLASQSDFALTMSLLTRLNHHRNALLYS 135
           NH +TAI L   ALLTN S   +SL +SIL HSL  Q D  L+MSLLTRLNH   A  + 
Sbjct: 73  NHLTTAIQLTTTALLTNASPNPKSLSVSILIHSLTLQPDLKLSMSLLTRLNHIPQAHPHL 132

Query: 136 SPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKI 195
           +P+ TMLI SY K+ + K+ALK+++WM RPGSPC  D+  Y  L+   C  G+VL+ L +
Sbjct: 133 TPVSTMLIASYLKKGRHKDALKVYNWMRRPGSPCTVDKDAYGILVGRFCASGVVLEGLMV 192

Query: 196 LKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETL 228
           L++M+  +L+P   LR  V R LLREA + EA    E L
Sbjct: 193 LRDMLKVHLLPGEGLRKKVVRSLLREARVREAEAFEELL 231

BLAST of Cp4.1LG20g02840 vs. TAIR10
Match: AT5G64320.1 (AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 58.5 bits (140), Expect = 7.5e-09
Identity = 32/89 (35.96%), Positives = 50/89 (56.18%), Query Frame = 1

Query: 135 LIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMID 194
           LI ++CK+ +  EA++IF  M R G  CKPD   + +LI+GLC    +  AL +L++MI 
Sbjct: 465 LISAFCKEHRIPEAVEIFREMPRKG--CKPDVYTFNSLISGLCEVDEIKHALWLLRDMIS 524

Query: 195 SNLVPDYDLRSWVFRCLLREAMIGEATEL 224
             +V +    + +    LR   I EA +L
Sbjct: 525 EGVVANTVTYNTLINAFLRRGEIKEARKL 551

BLAST of Cp4.1LG20g02840 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 57.4 bits (137), Expect = 1.7e-08
Identity = 33/97 (34.02%), Positives = 52/97 (53.61%), Query Frame = 1

Query: 135 LIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMID 194
           L+   CK    K A++I   ML+ G    PD   Y ++I+GLC+ G V +A+++L  MI 
Sbjct: 301 LVNGLCKAGHVKHAIEIMDVMLQEGYD--PDVYTYNSVISGLCKLGEVKEAVEVLDQMIT 360

Query: 195 SNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIG 232
            +  P+    + +   L +E  + EATEL+  L S G
Sbjct: 361 RDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKG 395

BLAST of Cp4.1LG20g02840 vs. TAIR10
Match: AT4G20090.1 (AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 57.4 bits (137), Expect = 1.7e-08
Identity = 46/172 (26.74%), Positives = 81/172 (47.09%), Query Frame = 1

Query: 63  LLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQ---SDFALTMSLLTRLN 122
           L+  LCL      A++LL+  + +      +    L + L  Q   +D    +S +    
Sbjct: 298 LIHGLCLKGKLDKAVSLLERMVSSKCIPNDVTYGTLINGLVKQRRATDAVRLLSSMEERG 357

Query: 123 HHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRK 182
           +H N  +YS     +LI    K+ K +EA+ ++  M   G  CKP+  VY  L+ GLCR+
Sbjct: 358 YHLNQHIYS-----VLISGLFKEGKAEEAMSLWRKMAEKG--CKPNIVVYSVLVDGLCRE 417

Query: 183 GMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIG 232
           G   +A +IL  MI S  +P+    S + +   +  +  EA ++ + ++  G
Sbjct: 418 GKPNEAKEILNRMIASGCLPNAYTYSSLMKGFFKTGLCEEAVQVWKEMDKTG 462

BLAST of Cp4.1LG20g02840 vs. TAIR10
Match: AT5G39710.1 (AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 55.1 bits (131), Expect = 8.3e-08
Identity = 34/101 (33.66%), Positives = 53/101 (52.48%), Query Frame = 1

Query: 123 NALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMV 182
           + + YSS     LI  +C+QR+ KEA  ++  MLR G P  PDE  Y  LI   C +G +
Sbjct: 484 DTITYSS-----LIQGFCEQRRTKEACDLYEEMLRVGLP--PDEFTYTALINAYCMEGDL 543

Query: 183 LDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATEL 224
             AL++   M++  ++PD    S +   L +++   EA  L
Sbjct: 544 EKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRL 577

BLAST of Cp4.1LG20g02840 vs. TAIR10
Match: AT4G11690.1 (AT4G11690.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 54.3 bits (129), Expect = 1.4e-07
Identity = 46/181 (25.41%), Positives = 81/181 (44.75%), Query Frame = 1

Query: 57  LTKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQSDFALTMSLLT 116
           L   ++L+   C     S A  ++       +    +  +IL  + A   +    + L  
Sbjct: 373 LVTYNILVSGFCRKGDTSGAAKMVKEMEERGIKPSKVTYTILIDTFARSDNMEKAIQL-- 432

Query: 117 RLNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGL 176
           RL+     L+      ++LI  +C + +  EA ++F  M+     C+P+E +Y T+I G 
Sbjct: 433 RLSMEELGLVPDVHTYSVLIHGFCIKGQMNEASRLFKSMVEKN--CEPNEVIYNTMILGY 492

Query: 177 CRKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIG-DRNT 236
           C++G    ALK+LK M +  L P+     ++   L +E    EA  L E +   G D +T
Sbjct: 493 CKEGSSYRALKLLKEMEEKELAPNVASYRYMIEVLCKERKSKEAERLVEKMIDSGIDPST 549

BLAST of Cp4.1LG20g02840 vs. NCBI nr
Match: gi|659067591|ref|XP_008440255.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g02060, chloroplastic-like [Cucumis melo])

HSP 1 Score: 340.5 bits (872), Expect = 2.8e-90
Identity = 184/248 (74.19%), Positives = 207/248 (83.47%), Query Frame = 1

Query: 1   MKTIRQRLLPLHRRPLPPQLVHDQIPAIGTPFSSSSS---SSVYDSAASKINTTLTQDEL 60
           MK IRQ+LL L R PL P+L+  QIP I +PFSSSSS    S   S A++ NTTLT DEL
Sbjct: 1   MKRIRQQLLTLQRLPLSPELLRFQIPPILSPFSSSSSFISGSPSASIATEPNTTLTHDEL 60

Query: 61  TKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQSDFALTMSLLTR 120
           T+I+LLLPRLCL NH STAITLL A LLTN SLQSL LS+LSHSLASQSDFALTMSLLTR
Sbjct: 61  TRINLLLPRLCLYNHLSTAITLLHATLLTNPSLQSLSLSVLSHSLASQSDFALTMSLLTR 120

Query: 121 LNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLC 180
           L HH NALLYS+PI+TMLI SYCK+RK KEALKIFHWMLRPGSPCKP+ERVYKTLIAGL 
Sbjct: 121 LKHHPNALLYSTPIVTMLISSYCKRRKSKEALKIFHWMLRPGSPCKPEERVYKTLIAGLY 180

Query: 181 RKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIGDRNTSE 240
           RKGM  DALK+L+NMIDSNLVPD DLR+WVFRCLLREAMI EA E +ET N +GD++T +
Sbjct: 181 RKGMTFDALKVLRNMIDSNLVPDCDLRNWVFRCLLREAMIPEAMEFNETFNFVGDQDTID 240

Query: 241 HLKKVPSL 246
           HL++V  L
Sbjct: 241 HLRRVSEL 248

BLAST of Cp4.1LG20g02840 vs. NCBI nr
Match: gi|449474047|ref|XP_004154059.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At5g59900 [Cucumis sativus])

HSP 1 Score: 335.9 bits (860), Expect = 6.9e-89
Identity = 181/248 (72.98%), Positives = 206/248 (83.06%), Query Frame = 1

Query: 1   MKTIRQRLLPLHRRPLPPQLVHDQIPAIGTPFSSSSS---SSVYDSAASKINTTLTQDEL 60
           MK IRQ+LL L R PL P+L+  QIPAI +PFSSSSS    S   S A+K NTTLT DEL
Sbjct: 1   MKRIRQQLLTLQRLPLSPELLRFQIPAILSPFSSSSSFISDSPSASIATKPNTTLTHDEL 60

Query: 61  TKISLLLPRLCLDNHHSTAITLLDAALLTNLSLQSLPLSILSHSLASQSDFALTMSLLTR 120
           T+I+LLLPRLCL NH STAI+LL A LLTN SL SL LS+LSHSLASQSDFALTMSLLTR
Sbjct: 61  TRINLLLPRLCLHNHLSTAISLLHATLLTNPSLHSLSLSVLSHSLASQSDFALTMSLLTR 120

Query: 121 LNHHRNALLYSSPIITMLIFSYCKQRKFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLC 180
           L HH NALLYS+PI+TMLI SYCK+RK KEALK+FHWMLRPGSPCKP+ERVYKTLIAGL 
Sbjct: 121 LKHHPNALLYSTPIVTMLISSYCKRRKSKEALKLFHWMLRPGSPCKPEERVYKTLIAGLY 180

Query: 181 RKGMVLDALKILKNMIDSNLVPDYDLRSWVFRCLLREAMIGEATELSETLNSIGDRNTSE 240
           RKGM  DALK+L+NMIDSNLVPD DLR+WVFR LL+EAMI EA E ++ LN +GD+NT +
Sbjct: 181 RKGMTFDALKVLRNMIDSNLVPDCDLRNWVFRSLLKEAMIPEAMEFNDALNFVGDQNTID 240

Query: 241 HLKKVPSL 246
           HL++V  L
Sbjct: 241 HLRRVSEL 248

BLAST of Cp4.1LG20g02840 vs. NCBI nr
Match: gi|595865128|ref|XP_007211897.1| (hypothetical protein PRUPE_ppa010317mg [Prunus persica])

HSP 1 Score: 204.5 bits (519), Expect = 2.4e-49
Identity = 121/225 (53.78%), Positives = 154/225 (68.44%), Query Frame = 1

Query: 30  TPFSSSSSSSVYDSAASKINT------TLTQDELTKISLLLPRLCLDNHHSTAITLLDAA 89
           TP + S+S+S  DS  +   T      +LTQ+E TKI+LLLPRLCL NH  TA  L   A
Sbjct: 20  TPRTFSTSTSAIDSITAPKPTNQTQTQSLTQEEHTKINLLLPRLCLLNHLDTATHLTITA 79

Query: 90  LLTNLSLQSLPLSILSHSLASQSDFALTMSLLTRLNHHRNALLYSSPIITMLIFSYCKQR 149
           LLTN  L+SL LSIL HS  SQ D A  MSLLTRL H+  +  Y +PI TM I SY K+ 
Sbjct: 80  LLTNPPLKSLSLSILIHSFTSQPDMARPMSLLTRLRHNPPSHPYLTPITTMFIASYFKKN 139

Query: 150 KFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMIDSNLVPDYDL 209
           K KEALK+F+W++RPGSPC  DERV + L+ G C+ GMVL+ALK+L+ M+ +N+VP  DL
Sbjct: 140 KPKEALKMFNWLVRPGSPCVLDERVCEVLVNGFCKNGMVLEALKVLRAMLSTNIVPGCDL 199

Query: 210 RSWVFRCLLREAMIGEATELSETLNSIGDR---NTSEHLKKVPSL 246
           + WV++ LLREA I EA EL+E L  +GDR   + SE +KKV +L
Sbjct: 200 KKWVYKVLLREARIKEAVELNEALGCVGDREKGDESECVKKVLAL 244

BLAST of Cp4.1LG20g02840 vs. NCBI nr
Match: gi|645238977|ref|XP_008225928.1| (PREDICTED: uncharacterized protein LOC103325527 [Prunus mume])

HSP 1 Score: 204.1 bits (518), Expect = 3.1e-49
Identity = 121/225 (53.78%), Positives = 153/225 (68.00%), Query Frame = 1

Query: 30  TPFSSSSSSSVYDSAASKINT------TLTQDELTKISLLLPRLCLDNHHSTAITLLDAA 89
           TP S S+S+S  DS  +   T      +LTQ+E TKI+LLLPRLCL NH  TA  L   A
Sbjct: 20  TPRSFSTSTSAIDSITAPKPTNQAQTQSLTQEEHTKINLLLPRLCLLNHLDTATHLTITA 79

Query: 90  LLTNLSLQSLPLSILSHSLASQSDFALTMSLLTRLNHHRNALLYSSPIITMLIFSYCKQR 149
           LLTN  L+SL LSIL HS  SQ D A  MSLLTRL H+  +  Y +PI TM I SY K+ 
Sbjct: 80  LLTNPPLKSLSLSILIHSFTSQPDMARPMSLLTRLRHNPPSHPYLTPITTMFIASYFKKN 139

Query: 150 KFKEALKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMIDSNLVPDYDL 209
           K KEALK+F+W++RPGSPC  DERV + L+ G C+ GMVL+ LK+L+ M+ +N+VP  DL
Sbjct: 140 KPKEALKMFNWLVRPGSPCVLDERVCEVLVNGFCKNGMVLEVLKVLRAMLSTNIVPGCDL 199

Query: 210 RSWVFRCLLREAMIGEATELSETLNSIGDR---NTSEHLKKVPSL 246
           + WV++ LLREA I EA EL+E L  +GDR   + SE +KKV +L
Sbjct: 200 KKWVYKVLLREARIKEAVELNEALGCVGDREKGDESECVKKVLAL 244

BLAST of Cp4.1LG20g02840 vs. NCBI nr
Match: gi|764554710|ref|XP_004293884.2| (PREDICTED: uncharacterized protein LOC101313880 isoform X2 [Fragaria vesca subsp. vesca])

HSP 1 Score: 197.2 bits (500), Expect = 3.8e-47
Identity = 116/217 (53.46%), Positives = 148/217 (68.20%), Query Frame = 1

Query: 31  PFSSSSSS--SVYDSAASKINTTLTQDELTKISLLLPRLCLDNHHSTAITLLDAALLTNL 90
           PFSSSSS+  S+     +    TLTQ ++T I+LLLPRLCL ++ +TA  L   ALLTN 
Sbjct: 26  PFSSSSSTIASITSPKPTTQTQTLTQQDVTNINLLLPRLCLSDNLNTATHLTITALLTNP 85

Query: 91  SLQSLPLSILSHSLASQSDFALTMSLLTRLNHHRNALLYSSPIITMLIFSYCKQRKFKEA 150
            L SL LSIL HS  SQ D A  MSLLTRL HH  +  + +PI TMLI SY K+++ +EA
Sbjct: 86  PLHSLSLSILIHSFTSQPDMARPMSLLTRLRHHPPSHSHLTPITTMLIASYFKRKRPREA 145

Query: 151 LKIFHWMLRPGSPCKPDERVYKTLIAGLCRKGMVLDALKILKNMIDSNLVPDYDLRSWVF 210
           LK+F+WM+RPGSP   DERV   L+ G CR GMVL+AL +L+ M+  N+VP  DLR WV+
Sbjct: 146 LKVFNWMVRPGSPVVLDERVCGVLVCGFCRNGMVLEALNVLRAMLGVNIVPGCDLRKWVY 205

Query: 211 RCLLREAMIGEATELSETLNSIGDRNTSEHLKKVPSL 246
           R LLREA I EA EL++ L+ +GD   SE  +KV +L
Sbjct: 206 RGLLREARIKEALELNKALDCVGD-GESEGFRKVLAL 241

The following BLAST results are available for this feature:

Swiss-Prot
Match Name        E-value   Identity (%)  Description
PP444_ARATH       1.3e-07   35.96         Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop...
PP281_ARATH       3.0e-07   34.02         Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
PP327_ARATH       3.0e-07   26.74         Pentatricopeptide repeat-containing protein At4g20090 OS=Arabidopsis thaliana GN...
PP407_ARATH       1.5e-06   33.66         Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana GN...
PP306_ARATH       2.5e-06   25.41         Pentatricopeptide repeat-containing protein At4g11690 OS=Arabidopsis thaliana GN...

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LRQ9_CUCSA  4.8e-89   72.98         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G051900 PE=4 SV=1
M5WIJ3_PRUPE      1.7e-49   53.78         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa010317mg PE=4 SV=1
A0A067LG57_JATCU  1.3e-41   49.30         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_26667 PE=4 SV=1
B9GN11_POPTR      1.8e-40   49.75         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s12210g PE=4 SV=2
A0A061DVU2_THECC  4.7e-36   45.66         Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_...

TAIR10
Match Name        E-value   Identity (%)  Description
AT5G64320.1       7.5e-09   35.96         Pentatricopeptide repeat (PPR) superfamily protein
AT3G53700.1       1.7e-08   34.02         Pentatricopeptide repeat (PPR) superfamily protein
AT4G20090.1       1.7e-08   26.74         Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1       8.3e-08   33.66         Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G11690.1       1.4e-07   25.41         Pentatricopeptide repeat (PPR-like) superfamily protein

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659067591|ref|XP_008440255.1|  2.8e-90   74.19         PREDICTED: pentatricopeptide repeat-containing protein At1g02060, chloroplastic-...
gi|449474047|ref|XP_004154059.1|  6.9e-89   72.98         PREDICTED: putative pentatricopeptide repeat-containing protein At5g59900 [Cucum...
gi|595865128|ref|XP_007211897.1|  2.4e-49   53.78         hypothetical protein PRUPE_ppa010317mg [Prunus persica]
gi|645238977|ref|XP_008225928.1|  3.1e-49   53.78         PREDICTED: uncharacterized protein LOC103325527 [Prunus mume]
gi|764554710|ref|XP_004293884.2|  3.8e-47   53.46         PREDICTED: uncharacterized protein LOC101313880 isoform X2 [Fragaria vesca subsp...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g02840.1  Cp4.1LG20g02840.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description           Source    Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF13041      PPR_2               coord: 133..178, score: 2.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756    TIGR00756           coord: 168..200, score: 1.2E-7; coord: 133..153, score: 0.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375      PPR                 coord: 128..162, score: 8.89; coord: 165..199, score: 11
None       No IPR available          PANTHER   PTHR24015    FAMILY NOT NAMED    coord: 224..230, score: 3.7E-25; coord: 127..200, score: 3.7E-25; coord: 21..31, score: 3.7

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                   Block
Cp4.1LG20g02840  Cucumber (Gy14) v2         cgybcpeB080
Cp4.1LG20g02840  Melon (DHL92) v3.6.1       cpemedB552
Cp4.1LG20g02840  Silver-seed gourd          carcpeB0698
Cp4.1LG20g02840  Cucurbita pepo (Zucchini)  cpecpeB045
Cp4.1LG20g02840  Cucurbita maxima (Rimu)    cmacpeB440
Cp4.1LG20g02840  Cucurbita moschata (Rifu)  cmocpeB405