CmoCh14G011150 (gene) Cucurbita moschata (Rifu)

Name: CmoCh14G011150
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Pentatricopeptide repeat
Location: Cmo_Chr14 : 7572480 .. 7575463 (-)
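The location field gives the chromosome, start and end coordinates, and strand. As a minimal sketch, the span length can be derived from that string. The `parse_location` helper and its regular expression are illustrative and not part of the database, and the coordinates are assumed to be 1-based and inclusive:

```python
import re

def parse_location(text):
    """Parse 'Chrom : start .. end (strand)' into a dict (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
    return {"chrom": chrom, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("Cmo_Chr14 : 7572480 .. 7575463 (-)")
print(loc["strand"], loc["length"])  # - 2984
```

Under that assumption the feature spans 2984 bases on the minus strand.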
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAACAGTAATGGAGATAAATCCTATAACAAAGGAAAATATTCTTAAAGGTAGAGCCAAGGAGCATAATGCCAATGTCATTTTTTCAAACCTTCTGTGTTTCCTTGGCCACTCTGCCATCCAAAGTTAAGACCAACTTTGAAGCTTCTATCCAAGAAGTATCCGATGGGACTTCATTCTTTTCACTGCCTCCCCTTGGAACTGTAGTTCAGAGTTATGTTTTGCTTTTGCTTTTGTTTTTGAATTATGATGCGTCTTCTATTCTCCTAAACTGGAGGGCTTTCATGTTAATCTCTCTGGATTTTTATTCATCCTTTCTTTGTCTATCAGTTAAATTGTTTCTTATATCAGACAAATGCTATTGACAGTTCTTTGCTAGTAATGATATTAATTTTTTTTTTCTAGCATGGCCATTGCAGTAAAAAAAATTGTGTTGCTTTGTCAGTTGCTCAATTTGTGTAGACATCCATCATATTTCAACGATGTATGTATTTAGTCCGTGACTTACTACTATGTCATAAAAGTTGCAACGATATTGTGATAGGCCTCCTATTAGAAATGCATGTTAAAGCAAGCGTATGAGGAGAACAGAGGGGGTTTGGGGAATGTAATTAGTTGGTAAGGGGGTACTTATGTGATTAGATATCCAGTTGTCTTGACAGTCTTGACCCATTCTATGATTCTTCCCTTGGTTGTTGCCATTCTTGACAAATAATATTGAGAGCATTTCACATTGTGGAGAAAGAATAAGTGATTTTTAATTGTTACTGTAATTAAAATGTGTTGTAAAGATTTCATTCTCGAATTTTATACTGAATTGTATGGTATTCTTTGTGGCGTTGAATTGTCCTTGTTTATTAGTATTATTTTCTCGTTCTAACAAAGTCTGCTGTATTATCCTTTATTTAAAGGATCTATTTACGCCAAGAATTTTGTGATGTTATTATTAACAGGAATCTTGATCTGTTCATGAGTGGAGAATGGAGAGTTACACTTTTAATTGATATGTTCTATTGTTAATATGATGAAAAATAAAAATTTTCGTCTTGTCTACTTTCAACAGATGCTTACTGGTTGTATCAAGGTTGAACATGGTTATGCCAAAGCATTGGAGCTTCTAAAGGAGTCGCAAAACAATGGACTATGCATGGATTCTGTGTCGTATGGGACACTGATAGCTGTTTGTGCTTCACATAATAGATTGGAAGACGCAGAGAGTTTCTTCAACCAGATGAAAAATGAAGGCTATTCGCCAAATATGTTTCATTATGGCTCGCTACTCAATGCTTATTCAATGAGTGGAGATTATAAAAAGGCTGATGAGCTGATCGAGGATATGAAATTGAAGGGGTTAGTACCAAATAAGGTTTGTTTTTTTGTTATGGTGCCTTCGAAACTTTAGATTGTATTTATATATGAATAATTTTTATGAGAAGAATCGAGAAGAAAGCATAGTAATTTGTTCGGGAAATATACGAAGTTATATGGTTAAAGTCCTATTAATTGTAAAAGAATGAGTTGTAAAGACTTCCATGGAGTTGTAAAGACTACAGAAAAGATTGGGAAGAAATTGCCTAGAGAGAGGAAGTTCACCATGGATTTAATAGATAAAGTGCCATCATTTTCAAGGTTCCAATATAGTTGGTCTGATAGGATGCGGACGCCTTTCCTGAAAAGGTTAACAGCAAAAGCATTCTAGTCTTCTGCATCTCTATCCTTGAGAGCACCCCAGTGATCGTACACATTCTACCATTGTTGGTGGAAAGCACACAGAGTCTGGATATTTGAGTTTGAGGTAGCGTGGACCTAACCTGTCATCATATTAGAAGAGAGAGCTGGAGCGATCATCAATTTTAATGGAAATGTTAGCATGCAGTATTTCTGGGTACAAATAGTTATATACTTGTTTGTTAAAATTATTTCAATATGCCATGGAAGACAACTCTACAGTTATAAGATATTTGACTTGATGGCACAAGTATTGAAATCAATGTTTAA
GTACATCTGAATGTGTTTATTTACATCATACCGGTCCATGTTACTCTGGAGAATCTGGAAATCATTAACTTTTCCCTCAAAAAGTTTAGGGTTAGGCTTCAAAAGTACATGTGAGAGAAGTTATAAAAATGTGGTTAAAAAATACGACTATAGTTTCTCTGGCTTATTTAGAAATATATATATATATATTCTTTTTTTTTAATGCTTACAATGGAATTTTTTTTTTCATTTCTAAAAAATTGCACACGGTTGATATTCAAGTTTTCTGTCTACTTGCAGTTGTTTGTTTCTTATAAAAAAGTATCTACTTGCAGGTGATTTTAACAACATTGCTGAAGGTTTATGTCAGGGGAGGTTTGTTTGAGAAATCAAGGAAACTCTTATCAGAACTGGAAGCCCTTGGCTACGGTGAAAATGAGGTCTGTGCCAATGATTTGTGGTAGTATTACTTTTTGTACATTTGCAGCTGATGTTTGCTAATTACCATTTGTTTATATGATTCTCTCCACACTAGAAAGATGGTCATATTGTTGGTTTTCATGTTTGTACTTTCCAAAATGGTTAAATTACAAGTTTAGTCTGTAAACCGTGAAGTTTGTGTTTATTTGACATTTGAACTTTCAAAAGTGTTTAATAGGTCCTTAATTTTTCAATTTTGTGTCTAACAGGTCCTTGAGCTTTCAATTTTGTGCCCAGTAGATCTGTGAATTTAAAAAATATCGGGAGAGACTAATCTGTAGGGATCAAACTATACACAAACTTCAAAGTTTAAGTACTAAATTTGTAATTTAATGTTTCAAAATTTGTTAATTTCTAGTTTGGTAGTTCCGTTGGGTTGAGTAATTGGTAGGGCTTAAGAAGTGAATACGATGAAGGATGCAAAGGTTTCATGACATTGACTTGGAAGATCACTACTTGGCTGCCGTTCTTCTCTGGTTATATTTTTTTCTCCTCCAATAGGAGTTGATGAAGCTGAGCTATTGA

mRNA sequence

ATGGGAACAGTAATGGAGATAAATCCTATAACAAAGGAAAATATTCTTAAAGGTAGAGCCAAGGAGCATAATGCCAATATGCTTACTGGTTGTATCAAGGTTGAACATGGTTATGCCAAAGCATTGGAGCTTCTAAAGGAGTCGCAAAACAATGGACTATGCATGGATTCTGTGTCGTATGGGACACTGATAGCTGTTTGTGCTTCACATAATAGATTGGAAGACGCAGAGAGTTTCTTCAACCAGATGAAAAATGAAGGCTATTCGCCAAATATGTTTCATTATGGCTCGCTACTCAATGCTTATTCAATGAGTGGAGATTATAAAAAGGCTGATGAGCTGATCGAGGATATGAAATTGAAGGGGTTAGTACCAAATAAGGTGATTTTAACAACATTGCTGAAGGTTTATGTCAGGGGAGGTTTGTTTGAGAAATCAAGGAAACTCTTATCAGAACTGGAAGCCCTTGGCTACGGTGAAAATGAGGAGTTGATGAAGCTGAGCTATTGA

Coding sequence (CDS)

ATGGGAACAGTAATGGAGATAAATCCTATAACAAAGGAAAATATTCTTAAAGGTAGAGCCAAGGAGCATAATGCCAATATGCTTACTGGTTGTATCAAGGTTGAACATGGTTATGCCAAAGCATTGGAGCTTCTAAAGGAGTCGCAAAACAATGGACTATGCATGGATTCTGTGTCGTATGGGACACTGATAGCTGTTTGTGCTTCACATAATAGATTGGAAGACGCAGAGAGTTTCTTCAACCAGATGAAAAATGAAGGCTATTCGCCAAATATGTTTCATTATGGCTCGCTACTCAATGCTTATTCAATGAGTGGAGATTATAAAAAGGCTGATGAGCTGATCGAGGATATGAAATTGAAGGGGTTAGTACCAAATAAGGTGATTTTAACAACATTGCTGAAGGTTTATGTCAGGGGAGGTTTGTTTGAGAAATCAAGGAAACTCTTATCAGAACTGGAAGCCCTTGGCTACGGTGAAAATGAGGAGTTGATGAAGCTGAGCTATTGA
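
The coding sequence can be translated conceptually to obtain the protein used as the BLAST query. The sketch below assumes the standard genetic code (NCBI translation table 1). The `translate` helper is illustrative, and only the first 90 nucleotides of the CDS are used:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 90 nucleotides (30 codons) of the CDS listed above.
head = ("ATGGGAACAGTAATGGAGATAAATCCTATAACAAAGGAAAATATTCTT"
        "AAAGGTAGAGCCAAGGAGCATAATGCCAATATGCTTACTGGT")
print(translate(head))  # MGTVMEINPITKENILKGRAKEHNANMLTG
```

Residues 12 onward of this translation (KENILKGRAKEHNANMLTG) agree with the query lines of the alignments that start at query position 12.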
BLAST of CmoCh14G011150 vs. Swiss-Prot
Match: PPR31_ARATH (Pentatricopeptide repeat-containing protein At1g10910, chloroplastic OS=Arabidopsis thaliana GN=At1g10910 PE=2 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 1.0e-47
Identity = 89/136 (65.44%), Positives = 116/136 (85.29%), Query Frame = 1

Query: 27  MLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNE 86
           +L GCIKV++GY KA+EL+ E  +NG+ MDSV YGT++A+CAS+ R E+AE+F  QMK E
Sbjct: 207 LLAGCIKVKNGYPKAIELIGELPHNGIQMDSVMYGTVLAICASNGRSEEAENFIQQMKVE 266

Query: 87  GYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKS 146
           G+SPN++HY SLLN+YS  GDYKKADEL+ +MK  GLVPNKV++TTLLKVY++GGLF++S
Sbjct: 267 GHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIKGGLFDRS 326

Query: 147 RKLLSELEALGYGENE 163
           R+LLSELE+ GY ENE
Sbjct: 327 RELLSELESAGYAENE 342
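
The percentages on an HSP statistics line are the match counts divided by the alignment length. A minimal sketch that recomputes them from such a line follows; the `hsp_fractions` helper is an illustrative assumption, not part of BLAST or of this database:

```python
import re

def hsp_fractions(stat_line):
    """Return {label: (matches, length, percent)} for each 'Label = a/b' field."""
    out = {}
    for label, a, b in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stat_line):
        a, b = int(a), int(b)
        out[label] = (a, b, round(100 * a / b, 2))
    return out

stats = hsp_fractions("Identity = 89/136 (65.44%), Positives = 116/136 (85.29%)")
print(stats["Identity"])   # (89, 136, 65.44)
print(stats["Positives"])  # (116, 136, 85.29)
```

Both values reproduce the percentages reported for the PPR31_ARATH hit.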

BLAST of CmoCh14G011150 vs. Swiss-Prot
Match: PP442_ARATH (Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidopsis thaliana GN=At5g61990 PE=2 SV=1)

HSP 1 Score: 79.0 bits (193), Expect = 5.7e-14
Identity = 38/122 (31.15%), Positives = 70/122 (57.38%), Query Frame = 1

Query: 40  KALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNEGYSPNMFHYGSLL 99
           +A    +   + G+  D+ +Y  L+     +++++DAE  F +M+ +G +P++F YG L+
Sbjct: 575 EACSAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGKGIAPDVFSYGVLI 634

Query: 100 NAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKSRKLLSELEALGYG 159
           N +S  G+ +KA  + ++M  +GL PN +I   LL  + R G  EK+++LL E+   G  
Sbjct: 635 NGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGLH 694

Query: 160 EN 162
            N
Sbjct: 695 PN 696

BLAST of CmoCh14G011150 vs. Swiss-Prot
Match: PP325_ARATH (Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidopsis thaliana GN=At4g19440 PE=2 SV=2)

HSP 1 Score: 79.0 bits (193), Expect = 5.7e-14
Identity = 39/117 (33.33%), Positives = 70/117 (59.83%), Query Frame = 1

Query: 41  ALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNEGYSPNMFHYGSLLN 100
           ALEL ++ ++ G+  +S +Y +LI   +  +R+E+A+  F +M+ EG  PN+FHY +L++
Sbjct: 677 ALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEEMRMEGLEPNVFHYTALID 736

Query: 101 AYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKSRKLLSELEALG 158
            Y   G   K + L+ +M  K + PNK+  T ++  Y R G   ++ +LL+E+   G
Sbjct: 737 GYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGNVTEASRLLNEMREKG 793

BLAST of CmoCh14G011150 vs. Swiss-Prot
Match: PP241_ARATH (Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidopsis thaliana GN=EMB1270 PE=2 SV=2)

HSP 1 Score: 74.7 bits (182), Expect = 1.1e-12
Identity = 35/122 (28.69%), Positives = 72/122 (59.02%), Query Frame = 1

Query: 41  ALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNEGYSPNMFHYGSLLN 100
           A+ELL   +N+GL  D+++Y TL++ C+  + L+ A   F  M+     P+++ Y ++++
Sbjct: 281 AVELLDMVRNSGLRPDAITYNTLLSACSRDSNLDGAVKVFEDMEAHRCQPDLWTYNAMIS 340

Query: 101 AYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKSRKLLSELEALGYGE 160
            Y   G   +A+ L  +++LKG  P+ V   +LL  + R    EK +++  +++ +G+G+
Sbjct: 341 VYGRCGLAAEAERLFMELELKGFFPDAVTYNSLLYAFARERNTEKVKEVYQQMQKMGFGK 400

Query: 161 NE 163
           +E
Sbjct: 401 DE 402

BLAST of CmoCh14G011150 vs. Swiss-Prot
Match: PP178_ARATH (Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidopsis thaliana GN=At2g31400 PE=2 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 3.1e-12
Identity = 41/146 (28.08%), Positives = 76/146 (52.05%), Query Frame = 1

Query: 12  KENILKGRAKEHNANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHN 71
           KE  L+     +NA ++  C K    + +  +   E Q NG+  D +++ +L+AVC+   
Sbjct: 295 KEYGLRPNLVTYNA-VIDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGG 354

Query: 72  RLEDAESFFNQMKNEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILT 131
             E A + F++M N     ++F Y +LL+A    G    A E++  M +K ++PN V  +
Sbjct: 355 LWEAARNLFDEMTNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYS 414

Query: 132 TLLKVYVRGGLFEKSRKLLSELEALG 158
           T++  + + G F+++  L  E+  LG
Sbjct: 415 TVIDGFAKAGRFDEALNLFGEMRYLG 439

BLAST of CmoCh14G011150 vs. TrEMBL
Match: A0A0A0LCK2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842670 PE=4 SV=1)

HSP 1 Score: 250.0 bits (637), Expect = 2.1e-63
Identity = 121/138 (87.68%), Positives = 132/138 (95.65%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + MLTGCI+V+HGYAKA+ELLKE Q+NGLCMD VSYGTLIA+CASHNRLEDAE FFNQM+
Sbjct: 127 STMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVSYGTLIAICASHNRLEDAERFFNQMR 186

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
            EG+SPNMFHYGSLLNAYS++GDYKKADELIEDMKL GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 187 AEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFE 246

Query: 145 KSRKLLSELEALGYGENE 163
           KSRKLLSELE+LGYGENE
Sbjct: 247 KSRKLLSELESLGYGENE 264

BLAST of CmoCh14G011150 vs. TrEMBL
Match: M5XK80_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002505mg PE=4 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 2.4e-51
Identity = 100/138 (72.46%), Positives = 123/138 (89.13%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + +L GC KV+HGY+KALEL++E Q N L MDSV YGTL+AVCAS+N+LE+AE +F QMK
Sbjct: 208 STLLAGCNKVKHGYSKALELVQELQRNELQMDSVIYGTLLAVCASNNKLEEAEGYFKQMK 267

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
           NEGY PN+FHY ++LNAYS+SG+YK+AD+L++DMK  GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 268 NEGYLPNVFHYSAMLNAYSISGNYKEADDLVQDMKSAGLVPNKVILTTLLKVYVRGGLFE 327

Query: 145 KSRKLLSELEALGYGENE 163
           KSR+LL+ELEALGY E+E
Sbjct: 328 KSRELLAELEALGYAEDE 345

BLAST of CmoCh14G011150 vs. TrEMBL
Match: D7SZY3_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0034g01820 PE=4 SV=1)

HSP 1 Score: 204.5 bits (519), Expect = 1.0e-49
Identity = 98/138 (71.01%), Positives = 121/138 (87.68%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + +L GC+KV+HGY+KALEL++E + + L MDSV YGTL+AVCAS+NR ++AE++FNQMK
Sbjct: 209 STLLAGCMKVKHGYSKALELVQEMERSRLPMDSVIYGTLLAVCASNNRCKEAENYFNQMK 268

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
           +EG+ PN+FHY SLLNAYS  GDYKKAD L++DMK  GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 269 DEGHLPNVFHYSSLLNAYSADGDYKKADMLVQDMKSAGLVPNKVILTTLLKVYVRGGLFE 328

Query: 145 KSRKLLSELEALGYGENE 163
           KSR+LL+ELE LGY E+E
Sbjct: 329 KSRELLAELEDLGYAEDE 346

BLAST of CmoCh14G011150 vs. TrEMBL
Match: B9GEW2_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s21880g PE=4 SV=2)

HSP 1 Score: 202.6 bits (514), Expect = 3.8e-49
Identity = 95/138 (68.84%), Positives = 120/138 (86.96%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + +L GC+K++ GY+KAL+L++E   NGL MDS+ YGTL+AVCAS+NR E+A+S+FNQMK
Sbjct: 223 STLLAGCMKIKDGYSKALDLVQELNYNGLQMDSIMYGTLLAVCASNNRCEEAQSYFNQMK 282

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
           +EG+SPN+FHY SLLNAYS  G+YKKA+EL++DMK  GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 283 DEGHSPNIFHYSSLLNAYSSDGNYKKAEELVQDMKSSGLVPNKVILTTLLKVYVRGGLFE 342

Query: 145 KSRKLLSELEALGYGENE 163
           KSR LL EL+ LG+ +NE
Sbjct: 343 KSRDLLVELDTLGFAKNE 360

BLAST of CmoCh14G011150 vs. TrEMBL
Match: A0A061DZ53_THECC (Pentatricopeptide repeat (PPR) superfamily protein isoform 2 OS=Theobroma cacao GN=TCM_006909 PE=4 SV=1)

HSP 1 Score: 201.1 bits (510), Expect = 1.1e-48
Identity = 97/136 (71.32%), Positives = 117/136 (86.03%), Query Frame = 1

Query: 27  MLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNE 86
           +L GCIK++HG++KALEL+KE + NGL MDSV YGTL+AVCAS    E+A+++FNQM+ E
Sbjct: 205 LLAGCIKIKHGHSKALELIKELKYNGLKMDSVMYGTLLAVCASSGLHEEAQNYFNQMREE 264

Query: 87  GYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKS 146
           G+SPN++HY SLLNAYS  G+Y KADEL+E MK  GLVPNKVILTTLLKVYVRGGLFEKS
Sbjct: 265 GHSPNLYHYSSLLNAYSYDGNYCKADELVEQMKSSGLVPNKVILTTLLKVYVRGGLFEKS 324

Query: 147 RKLLSELEALGYGENE 163
            KLL+ELEALGY E+E
Sbjct: 325 TKLLAELEALGYAEDE 340

BLAST of CmoCh14G011150 vs. TAIR10
Match: AT1G10910.1 (AT1G10910.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 191.0 bits (484), Expect = 5.8e-49
Identity = 89/136 (65.44%), Positives = 116/136 (85.29%), Query Frame = 1

Query: 27  MLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNE 86
           +L GCIKV++GY KA+EL+ E  +NG+ MDSV YGT++A+CAS+ R E+AE+F  QMK E
Sbjct: 207 LLAGCIKVKNGYPKAIELIGELPHNGIQMDSVMYGTVLAICASNGRSEEAENFIQQMKVE 266

Query: 87  GYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKS 146
           G+SPN++HY SLLN+YS  GDYKKADEL+ +MK  GLVPNKV++TTLLKVY++GGLF++S
Sbjct: 267 GHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLLKVYIKGGLFDRS 326

Query: 147 RKLLSELEALGYGENE 163
           R+LLSELE+ GY ENE
Sbjct: 327 RELLSELESAGYAENE 342

BLAST of CmoCh14G011150 vs. TAIR10
Match: AT4G19440.1 (AT4G19440.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 79.0 bits (193), Expect = 3.2e-15
Identity = 39/117 (33.33%), Positives = 70/117 (59.83%), Query Frame = 1

Query: 41  ALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNEGYSPNMFHYGSLLN 100
           ALEL ++ ++ G+  +S +Y +LI   +  +R+E+A+  F +M+ EG  PN+FHY +L++
Sbjct: 664 ALELREDMKHKGISPNSATYTSLIKGMSIISRVEEAKLLFEEMRMEGLEPNVFHYTALID 723

Query: 101 AYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKSRKLLSELEALG 158
            Y   G   K + L+ +M  K + PNK+  T ++  Y R G   ++ +LL+E+   G
Sbjct: 724 GYGKLGQMVKVECLLREMHSKNVHPNKITYTVMIGGYARDGNVTEASRLLNEMREKG 780

BLAST of CmoCh14G011150 vs. TAIR10
Match: AT5G61990.1 (AT5G61990.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 79.0 bits (193), Expect = 3.2e-15
Identity = 38/122 (31.15%), Positives = 70/122 (57.38%), Query Frame = 1

Query: 40  KALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNEGYSPNMFHYGSLL 99
           +A    +   + G+  D+ +Y  L+     +++++DAE  F +M+ +G +P++F YG L+
Sbjct: 575 EACSAYRSMVDQGILGDAKTYTVLMNGLFKNDKVDDAEEIFREMRGKGIAPDVFSYGVLI 634

Query: 100 NAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKSRKLLSELEALGYG 159
           N +S  G+ +KA  + ++M  +GL PN +I   LL  + R G  EK+++LL E+   G  
Sbjct: 635 NGFSKLGNMQKASSIFDEMVEEGLTPNVIIYNMLLGGFCRSGEIEKAKELLDEMSVKGLH 694

Query: 160 EN 162
            N
Sbjct: 695 PN 696

BLAST of CmoCh14G011150 vs. TAIR10
Match: AT3G18110.1 (AT3G18110.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 74.7 bits (182), Expect = 6.1e-14
Identity = 35/122 (28.69%), Positives = 72/122 (59.02%), Query Frame = 1

Query: 41  ALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMKNEGYSPNMFHYGSLLN 100
           A+ELL   +N+GL  D+++Y TL++ C+  + L+ A   F  M+     P+++ Y ++++
Sbjct: 281 AVELLDMVRNSGLRPDAITYNTLLSACSRDSNLDGAVKVFEDMEAHRCQPDLWTYNAMIS 340

Query: 101 AYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFEKSRKLLSELEALGYGE 160
            Y   G   +A+ L  +++LKG  P+ V   +LL  + R    EK +++  +++ +G+G+
Sbjct: 341 VYGRCGLAAEAERLFMELELKGFFPDAVTYNSLLYAFARERNTEKVKEVYQQMQKMGFGK 400

Query: 161 NE 163
           +E
Sbjct: 401 DE 402

BLAST of CmoCh14G011150 vs. TAIR10
Match: AT2G31400.1 (AT2G31400.1 genomes uncoupled 1)

HSP 1 Score: 73.2 bits (178), Expect = 1.8e-13
Identity = 41/146 (28.08%), Positives = 76/146 (52.05%), Query Frame = 1

Query: 12  KENILKGRAKEHNANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHN 71
           KE  L+     +NA ++  C K    + +  +   E Q NG+  D +++ +L+AVC+   
Sbjct: 295 KEYGLRPNLVTYNA-VIDACGKGGMEFKQVAKFFDEMQRNGVQPDRITFNSLLAVCSRGG 354

Query: 72  RLEDAESFFNQMKNEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILT 131
             E A + F++M N     ++F Y +LL+A    G    A E++  M +K ++PN V  +
Sbjct: 355 LWEAARNLFDEMTNRRIEQDVFSYNTLLDAICKGGQMDLAFEILAQMPVKRIMPNVVSYS 414

Query: 132 TLLKVYVRGGLFEKSRKLLSELEALG 158
           T++  + + G F+++  L  E+  LG
Sbjct: 415 TVIDGFAKAGRFDEALNLFGEMRYLG 439

BLAST of CmoCh14G011150 vs. NCBI nr
Match: gi|449446895|ref|XP_004141206.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic [Cucumis sativus])

HSP 1 Score: 250.0 bits (637), Expect = 3.0e-63
Identity = 121/138 (87.68%), Positives = 132/138 (95.65%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + MLTGCI+V+HGYAKA+ELLKE Q+NGLCMD VSYGTLIA+CASHNRLEDAE FFNQM+
Sbjct: 211 STMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVSYGTLIAICASHNRLEDAERFFNQMR 270

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
            EG+SPNMFHYGSLLNAYS++GDYKKADELIEDMKL GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 271 AEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFE 330

Query: 145 KSRKLLSELEALGYGENE 163
           KSRKLLSELE+LGYGENE
Sbjct: 331 KSRKLLSELESLGYGENE 348

BLAST of CmoCh14G011150 vs. NCBI nr
Match: gi|700204612|gb|KGN59745.1| (hypothetical protein Csa_3G842670 [Cucumis sativus])

HSP 1 Score: 250.0 bits (637), Expect = 3.0e-63
Identity = 121/138 (87.68%), Positives = 132/138 (95.65%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + MLTGCI+V+HGYAKA+ELLKE Q+NGLCMD VSYGTLIA+CASHNRLEDAE FFNQM+
Sbjct: 127 STMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVSYGTLIAICASHNRLEDAERFFNQMR 186

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
            EG+SPNMFHYGSLLNAYS++GDYKKADELIEDMKL GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 187 AEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFE 246

Query: 145 KSRKLLSELEALGYGENE 163
           KSRKLLSELE+LGYGENE
Sbjct: 247 KSRKLLSELESLGYGENE 264

BLAST of CmoCh14G011150 vs. NCBI nr
Match: gi|659130042|ref|XP_008464970.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 249.6 bits (636), Expect = 3.9e-63
Identity = 121/138 (87.68%), Positives = 132/138 (95.65%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + MLTGCI+V+HGYAKA+ELLKE Q+NGLCMD V YGTLIA+CASHNRLEDAESFFNQM+
Sbjct: 97  STMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVLYGTLIAICASHNRLEDAESFFNQMR 156

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
            EG+SPNMFHYGSLLNAYS++GDYKKADELIEDMKL GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 157 AEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFE 216

Query: 145 KSRKLLSELEALGYGENE 163
           KSRKLLSELE+LGYGENE
Sbjct: 217 KSRKLLSELESLGYGENE 234

BLAST of CmoCh14G011150 vs. NCBI nr
Match: gi|659130040|ref|XP_008464969.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 249.6 bits (636), Expect = 3.9e-63
Identity = 121/138 (87.68%), Positives = 132/138 (95.65%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + MLTGCI+V+HGYAKA+ELLKE Q+NGLCMD V YGTLIA+CASHNRLEDAESFFNQM+
Sbjct: 211 STMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVLYGTLIAICASHNRLEDAESFFNQMR 270

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
            EG+SPNMFHYGSLLNAYS++GDYKKADELIEDMKL GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 271 AEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLLKVYVRGGLFE 330

Query: 145 KSRKLLSELEALGYGENE 163
           KSRKLLSELE+LGYGENE
Sbjct: 331 KSRKLLSELESLGYGENE 348

BLAST of CmoCh14G011150 vs. NCBI nr
Match: gi|645223772|ref|XP_008218792.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic [Prunus mume])

HSP 1 Score: 209.9 bits (533), Expect = 3.4e-51
Identity = 100/138 (72.46%), Positives = 123/138 (89.13%), Query Frame = 1

Query: 25  ANMLTGCIKVEHGYAKALELLKESQNNGLCMDSVSYGTLIAVCASHNRLEDAESFFNQMK 84
           + +L GC KV+HGY+KALEL++E Q N L MDSV YGTL+AVCAS+N+LE+AE +F QMK
Sbjct: 208 STLLAGCNKVKHGYSKALELVQELQRNELQMDSVIYGTLLAVCASNNKLEEAEGYFKQMK 267

Query: 85  NEGYSPNMFHYGSLLNAYSMSGDYKKADELIEDMKLKGLVPNKVILTTLLKVYVRGGLFE 144
           NEGY PN+FHY S+LNAYS+SG+YK+AD+L++DMK  GLVPNKVILTTLLKVYVRGGLFE
Sbjct: 268 NEGYLPNVFHYSSMLNAYSISGNYKEADDLVQDMKSAGLVPNKVILTTLLKVYVRGGLFE 327

Query: 145 KSRKLLSELEALGYGENE 163
           KSR+LL+ELE+LGY E+E
Sbjct: 328 KSRELLAELESLGYAEDE 345

The following BLAST results are available for this feature:
BLAST vs. Swiss-Prot
Match Name   E-value  Identity (%)  Description
PPR31_ARATH  1.0e-47  65.44         Pentatricopeptide repeat-containing protein At1g10910, chloroplastic OS=Arabidop...
PP442_ARATH  5.7e-14  31.15         Pentatricopeptide repeat-containing protein At5g61990, mitochondrial OS=Arabidop...
PP325_ARATH  5.7e-14  33.33         Pentatricopeptide repeat-containing protein At4g19440, chloroplastic OS=Arabidop...
PP241_ARATH  1.1e-12  28.69         Pentatricopeptide repeat-containing protein At3g18110, chloroplastic OS=Arabidop...
PP178_ARATH  3.1e-12  28.08         Pentatricopeptide repeat-containing protein At2g31400, chloroplastic OS=Arabidop...

BLAST vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0LCK2_CUCSA  2.1e-63  87.68         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G842670 PE=4 SV=1
M5XK80_PRUPE      2.4e-51  72.46         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa002505mg PE=4 SV=1
D7SZY3_VITVI      1.0e-49  71.01         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0034g01820 PE=4 SV=...
B9GEW2_POPTR      3.8e-49  68.84         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s21880g PE=4 SV=2
A0A061DZ53_THECC  1.1e-48  71.32         Pentatricopeptide repeat (PPR) superfamily protein isoform 2 OS=Theobroma cacao ...

BLAST vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT1G10910.1  5.8e-49  65.44         Pentatricopeptide repeat (PPR) superfamily protein
AT4G19440.1  3.2e-15  33.33         Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G61990.1  3.2e-15  31.15         Pentatricopeptide repeat (PPR) superfamily protein
AT3G18110.1  6.1e-14  28.69         Pentatricopeptide repeat (PPR) superfamily protein
AT2G31400.1  1.8e-13  28.08         genomes uncoupled 1

BLAST vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|449446895|ref|XP_004141206.1|  3.0e-63  87.68         PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic ...
gi|700204612|gb|KGN59745.1|       3.0e-63  87.68         hypothetical protein Csa_3G842670 [Cucumis sativus]
gi|659130042|ref|XP_008464970.1|  3.9e-63  87.68         PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic ...
gi|659130040|ref|XP_008464969.1|  3.9e-63  87.68         PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic ...
gi|645223772|ref|XP_008218792.1|  3.4e-51  72.46         PREDICTED: pentatricopeptide repeat-containing protein At1g10910, chloroplastic ...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh14G011150.1  CmoCh14G011150.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                        Source    Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 58..91    score: 8.4E-9
IPR002885  Pentatricopeptide repeat               TIGRFAMs  TIGR00756         TIGR00756            coord: 95..126   score: 2.
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 20..55    score: 5.086
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 126..160  score: 8.079
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 56..90    score: 12.156
IPR002885  Pentatricopeptide repeat               PROFILE   PS51375           PPR                  coord: 91..125   score: 11
IPR011990  Tetratricopeptide-like helical domain  GENE3D    G3DSA:1.25.40.10  -                    coord: 38..155   score: 3.
None       No IPR available                       PANTHER   PTHR24015         FAMILY NOT NAMED     coord: 27..162   score: 4.9
None       No IPR available                       PANTHER   PTHR24015:SF367   SUBFAMILY NOT NAMED  coord: 27..162   score: 4.9
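
The four PROFILE (PS51375) hits tile the protein in consecutive windows of about 35 residues, the length that gives the pentatricopeptide repeat its name. The sketch below parses the `coord:` fields and computes each window length. The `parse_coords` helper is illustrative, and the coordinates are assumed to be 1-based and inclusive:

```python
import re

def parse_coords(text):
    """Extract 'coord: start..end' pairs as sorted (start, end) tuples."""
    pairs = [(int(s), int(e))
             for s, e in re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)]
    return sorted(pairs)

# PS51375 hits, in the order they appear in the annotation table.
ppr_hits = "coord: 20..55 coord: 126..160 coord: 56..90 coord: 91..125"

for start, end in parse_coords(ppr_hits):
    # Inclusive coordinates assumed, so length is end - start + 1.
    print(start, end, end - start + 1)
```

Sorted by start position, the windows are 20..55, 56..90, 91..125 and 126..160, with lengths 36, 35, 35 and 35.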

The following gene(s) are orthologous to this gene:
Gene            Orthologue    Organism                   Block
CmoCh14G011150  Cucsa.340390  Cucumber (Gy14) v1         cgycmoB0929
CmoCh14G011150  Lsi03G013120  Bottle gourd (USVL1VR-Ls)  cmolsiB231
The following gene(s) are paralogous to this gene:

None