CmoCh04G030070.1 (mRNA) Cucurbita moschata (Rifu)

Name: CmoCh04G030070.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: (Pentatricopeptide repeat-containing protein) (3.4.24.-) (3.6.4.3)
Location: Cmo_Chr04 : 21146076 .. 21147169 (-)
Sequence length: 576
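
The location and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state explicitly); the helper name `span_length` is illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive (GFF-style).
GENE_START = 21146076
GENE_END = 21147169
MRNA_LENGTH = 576  # "Sequence length" reported for this mRNA


def span_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive, 1-based interval."""
    return end - start + 1


gene_span = span_length(GENE_START, GENE_END)
non_exonic = gene_span - MRNA_LENGTH        # bases in the span that are absent from the mRNA
codons, remainder = divmod(MRNA_LENGTH, 3)  # whole codons, leftover bases

print(gene_span, non_exonic, codons, remainder)
```

Under that assumption the genomic span is 1094 bp, so 518 bp of it are not part of the 576 bp mRNA, which would be consistent with intronic sequence between the annotated exons. A length of 576 bp corresponds to 192 codons, the last of which is the stop codon.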
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATTTGACCTTAGCAACAACAACCAAACGGATGAGGGTATCTCCTAGTAAGAAGTAAAGAGCCACAAATTCCAAGGCGGAACAGGAAGCAATGGTGGCAACGGCGCTACTTGCAGAGCTAAGGCTCCACGATAAACCATTATGAGTTTATCCAACTTTGCCGTTGCTCAACCTGCTCTTTCCTTCTCGAACGCGCCAATGTTCAGTCTAATCTCTCATTCCCCAAGCAATGTCTTCTTCTTCTTCCAGAATTCGGATTTATTTTCGTAATGGCTACAAATTTTTGACAGATAAGGAATTTAGCGTTAACACTAACGCCTCCTGCGTGATTGTGTGGACGTAAGATTTCTGAAACAAGCCAAGACTGTTCATGTGTTTTTGTTAAAATCGAAATTTTAAAACCATGATTCTCTGGTCTTGCTTAATATTGTTGCTCACGCTCACTCGAAATGCTCCGATATTGGTGCTGCATGCCACCTGTTTGATCAAATGTCCCAGAGAAACATCTTTTCTTGGATTGGCTGATAATGATTTTTTCCTCGATTGGTTTTGAGTAATTCTGTGAAATGCAGAGTCAGGGAGTTTTTCCAGATCAGTTTGCGTATTCTTGTATCTTGGTTTGGAGACCATTATATTGGGCAAAATGGTTCATCCCAGATTGTTATTAGAGGCTTTGCATCTCTTTCTTTTGTGTATACTGCTCTTCTTAATATGTATGCAAGGTTACAAGAGATTCAGGATTCATTCAAGGTGTTTAACACCATGACTGAATTTAATGTAGTCTCGTGGAATGCCATGATCTCAGGGTTCACATTAAATGGTCTTTACTCAGAGGCATGTGATCATTTTCTCAGAATGAAGGGAGAAGGAGTAACACCCGATGCCCAATCGTTTATTGGCATTGCAAAAGCTATGGGTATGTTAGGAGACGTGAGCATGGCAAAAGAAGTTAGCCATTTTGCTTCAGAGTTAGGTGTGGACTCCAATACTCTCGTTGATATGCATTCTAAATGTAGATCTTTGCAAGAGGCAGATCCATCTTTGGCCCTCATTTTACAAATTGTCGGGTTAACGTCCCGCGGAATGCAATGA

mRNA sequence

ATGCATTTGACCTTAGCAACAACAACCAAACGGATGAGGGTATCTCCTAGTAAGAAAGTCAGGGAGTTTTTCCAGATCAGTTTGCGTATTCTTGTATCTTGGTTTGGAGACCATTATATTGGGCAAAATGGTTCATCCCAGATTGTTATTAGAGGCTTTGCATCTCTTTCTTTTGTGTATACTGCTCTTCTTAATATGTATGCAAGGTTACAAGAGATTCAGGATTCATTCAAGGTGTTTAACACCATGACTGAATTTAATGTAGTCTCGTGGAATGCCATGATCTCAGGGTTCACATTAAATGGTCTTTACTCAGAGGCATGTGATCATTTTCTCAGAATGAAGGGAGAAGGAGTAACACCCGATGCCCAATCGTTTATTGGCATTGCAAAAGCTATGGGTATGTTAGGAGACGTGAGCATGGCAAAAGAAGTTAGCCATTTTGCTTCAGAGTTAGGTGTGGACTCCAATACTCTCGTTGATATGCATTCTAAATGTAGATCTTTGCAAGAGGCAGATCCATCTTTGGCCCTCATTTTACAAATTGTCGGGTTAACGTCCCGCGGAATGCAATGA

Coding sequence (CDS)

ATGCATTTGACCTTAGCAACAACAACCAAACGGATGAGGGTATCTCCTAGTAAGAAAGTCAGGGAGTTTTTCCAGATCAGTTTGCGTATTCTTGTATCTTGGTTTGGAGACCATTATATTGGGCAAAATGGTTCATCCCAGATTGTTATTAGAGGCTTTGCATCTCTTTCTTTTGTGTATACTGCTCTTCTTAATATGTATGCAAGGTTACAAGAGATTCAGGATTCATTCAAGGTGTTTAACACCATGACTGAATTTAATGTAGTCTCGTGGAATGCCATGATCTCAGGGTTCACATTAAATGGTCTTTACTCAGAGGCATGTGATCATTTTCTCAGAATGAAGGGAGAAGGAGTAACACCCGATGCCCAATCGTTTATTGGCATTGCAAAAGCTATGGGTATGTTAGGAGACGTGAGCATGGCAAAAGAAGTTAGCCATTTTGCTTCAGAGTTAGGTGTGGACTCCAATACTCTCGTTGATATGCATTCTAAATGTAGATCTTTGCAAGAGGCAGATCCATCTTTGGCCCTCATTTTACAAATTGTCGGGTTAACGTCCCGCGGAATGCAATGA
BLAST of CmoCh04G030070.1 vs. Swiss-Prot
Match: PP330_ARATH (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 90.5 bits (223), Expect = 2.1e-17
Identity = 44/146 (30.14%), Positives = 85/146 (58.22%), Query Frame = 1

Query: 32  VSWFGDHYIGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSW 91
           V+   D  +G+   S ++  GF SL +V  +LL++YA   ++  ++KVF+ M E ++V+W
Sbjct: 131 VTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAW 190

Query: 92  NAMISGFTLNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASE 151
           N++I+GF  NG   EA   +  M  +G+ PD  + + +  A   +G +++ K V  +  +
Sbjct: 191 NSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIK 250

Query: 152 LGV-----DSNTLVDMHSKCRSLQEA 173
           +G+      SN L+D++++C  ++EA
Sbjct: 251 VGLTRNLHSSNVLLDLYARCGRVEEA 276
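
The percentages on each HSP statistics line follow directly from the counts next to them. As a minimal sketch (the function name `parse_hsp_stats` and the regular expression are illustrative, not part of any BLAST tool), the line for the first Swiss-Prot hit can be parsed and the percentages recomputed:

```python
import re

# Statistics line copied from the first Swiss-Prot HSP above.
# The pattern accepts both "Postives" (as some pages print it) and "Positives".
HSP_LINE = "Identity = 44/146 (30.14%), Postives = 85/146 (58.22%), Query Frame = 1"

PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts from an HSP statistics line and recompute the percentages."""
    match = PATTERN.search(line)
    if match is None:
        raise ValueError(f"unrecognised HSP line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_reported": float(ident_pct),
        "identity_computed": round(100 * int(ident) / int(ident_len), 2),
        "positives_reported": float(pos_pct),
        "positives_computed": round(100 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(HSP_LINE)
print(stats)
```

Identity counts exact residue matches over the alignment length; positives additionally count conservative substitutions (the `+` symbols in the middle line of the alignment).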

BLAST of CmoCh04G030070.1 vs. Swiss-Prot
Match: PP182_ARATH (Pentatricopeptide repeat-containing protein At2g33760 OS=Arabidopsis thaliana GN=PCMP-H6 PE=3 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 2.6e-15
Identity = 42/138 (30.43%), Positives = 77/138 (55.80%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           IG+      V+ GF   ++V  AL+  Y++  +++ + +VF+ M E ++V+WN+++SGF 
Sbjct: 125 IGKGVHCHAVVSGFGLDTYVQAALVTFYSKCGDMEGARQVFDRMPEKSIVAWNSLVSGFE 184

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-- 159
            NGL  EA   F +M+  G  PD+ +F+ +  A    G VS+   V  +    G+D N  
Sbjct: 185 QNGLADEAIQVFYQMRESGFEPDSATFVSLLSACAQTGAVSLGSWVHQYIISEGLDLNVK 244

Query: 160 ---TLVDMHSKCRSLQEA 173
               L++++S+C  + +A
Sbjct: 245 LGTALINLYSRCGDVGKA 262

BLAST of CmoCh04G030070.1 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 82.0 bits (201), Expect = 7.6e-15
Identity = 45/138 (32.61%), Positives = 77/138 (55.80%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+     +V  GF+   F  T L NMYA+ +++ ++ KVF+ M E ++VSWN +++G++
Sbjct: 153 VGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYS 212

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVD---- 159
            NG+   A +    M  E + P   + + +  A+  L  +S+ KE+  +A   G D    
Sbjct: 213 QNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVN 272

Query: 160 -SNTLVDMHSKCRSLQEA 173
            S  LVDM++KC SL+ A
Sbjct: 273 ISTALVDMYAKCGSLETA 290

BLAST of CmoCh04G030070.1 vs. Swiss-Prot
Match: PP184_ARATH (Pentatricopeptide repeat-containing protein At2g34400 OS=Arabidopsis thaliana GN=PCMP-E23 PE=3 SV=2)

HSP 1 Score: 81.6 bits (200), Expect = 9.9e-15
Identity = 52/166 (31.33%), Positives = 85/166 (51.20%), Query Frame = 1

Query: 18  KKVREFFQISLRILVSWFG------DHYIGQNGSSQIVIRGFASLSFVYTALLNMYARLQ 77
           K   E F+   R LVS  G      D   G+      + +     +F+ + L++MY +  
Sbjct: 223 KMEEEGFEPDERTLVSMLGACSHLGDLRTGRLLEEMAITKKIGLSTFLGSKLISMYGKCG 282

Query: 78  EIQDSFKVFNTMTEFNVVSWNAMISGFTLNGLYSEACDHFLRMKGEGVTPDAQSFIGIAK 137
           ++  + +VFN M + + V+W AMI+ ++ NG  SEA   F  M+  GV+PDA +   +  
Sbjct: 283 DLDSARRVFNQMIKKDRVAWTAMITVYSQNGKSSEAFKLFFEMEKTGVSPDAGTLSTVLS 342

Query: 138 AMGMLGDVSMAKEVSHFASELGVDSNT-----LVDMHSKCRSLQEA 173
           A G +G + + K++   ASEL +  N      LVDM+ KC  ++EA
Sbjct: 343 ACGSVGALELGKQIETHASELSLQHNIYVATGLVDMYGKCGRVEEA 388

BLAST of CmoCh04G030070.1 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 1.1e-13
Identity = 46/131 (35.11%), Positives = 71/131 (54.20%), Query Frame = 1

Query: 48  IVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFTLNGLYSEA 107
           IV  G+A   FV  +L++ YA   E+  + KVF+ M+E NVVSW +MI G+       +A
Sbjct: 160 IVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDA 219

Query: 108 CDHFLRM-KGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-----TLVD 167
            D F RM + E VTP++ + + +  A   L D+   ++V  F    G++ N      LVD
Sbjct: 220 VDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVD 279

Query: 168 MHSKCRSLQEA 173
           M+ KC ++  A
Sbjct: 280 MYMKCNAIDVA 290

BLAST of CmoCh04G030070.1 vs. TrEMBL
Match: A0A0A0KBQ4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G013890 PE=4 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 8.4e-45
Identity = 100/138 (72.46%), Positives = 114/138 (82.61%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G    +QIVIRGF S +FV TALLNMYA+LQEI+DS+KVFNTMTE NVVSWNAMI+GFT
Sbjct: 176 LGNMVHAQIVIRGFTSHTFVSTALLNMYAKLQEIEDSYKVFNTMTEVNVVSWNAMITGFT 235

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSNTL 159
            N LY +A D FLRM GEGVTPDAQ+FIG+AKA+GML DV+ AKEVS +A ELGVDSNTL
Sbjct: 236 SNDLYLDAFDLFLRMMGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTL 295

Query: 160 V-----DMHSKCRSLQEA 173
           V     DM+SKC SLQEA
Sbjct: 296 VGTALIDMNSKCGSLQEA 313

BLAST of CmoCh04G030070.1 vs. TrEMBL
Match: V4SCU5_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004388mg PE=4 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 7.1e-36
Identity = 81/138 (58.70%), Positives = 104/138 (75.36%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+   +QI+I+GFAS + V T+LLNMYA+L  ++DS K+FNTMTE N VSWNAMISGFT
Sbjct: 186 LGKMVHAQIIIKGFASHTVVTTSLLNMYAKLGRVEDSHKMFNTMTEHNEVSWNAMISGFT 245

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-- 159
            NGL+SEA DHFL MK EGVTP+  + IG++KA+G L DV   KE+  FAS+LG+DSN  
Sbjct: 246 SNGLHSEAFDHFLLMKSEGVTPNMLTIIGVSKAIGQLRDVDKGKELQSFASKLGMDSNVE 305

Query: 160 ---TLVDMHSKCRSLQEA 173
                +DM+SKC SL +A
Sbjct: 306 VETAFIDMYSKCGSLCDA 323

BLAST of CmoCh04G030070.1 vs. TrEMBL
Match: M5XS64_PRUPE (Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa014747mg PE=4 SV=1)

HSP 1 Score: 150.2 bits (378), Expect = 2.5e-33
Identity = 76/138 (55.07%), Positives = 104/138 (75.36%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+   +Q+ +RGFAS +FV T+LLNMYA+  +I+DS K+FNTMTE N VSWNAMISG T
Sbjct: 109 LGKMVHAQVFVRGFASDTFVSTSLLNMYAKFGKIEDSCKMFNTMTEHNKVSWNAMISGLT 168

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-- 159
            NGL+ EA D+FLRMK EG+TP+  + I ++KA G LGDV+ +K V  +ASEL ++S+  
Sbjct: 169 SNGLHFEAFDYFLRMKKEGITPNMYTLISVSKAAGKLGDVNKSKVVHSYASELEMESSVQ 228

Query: 160 ---TLVDMHSKCRSLQEA 173
               L+DM+SKC+SL +A
Sbjct: 229 VGTALIDMYSKCKSLSDA 246

BLAST of CmoCh04G030070.1 vs. TrEMBL
Match: F6HC58_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0067g02260 PE=4 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 4.0e-31
Identity = 76/138 (55.07%), Positives = 99/138 (71.74%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+   +QIV+RGFA+  FV T+LLNMYA+L  I+DS+ VFN MTE N VSWNAMISG T
Sbjct: 185 LGKMVHAQIVMRGFATHIFVSTSLLNMYAKLGSIEDSYWVFNMMTEHNQVSWNAMISGCT 244

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSNTL 159
            NGL+ EA D F+RMK    TP+  + + ++KA+G L DV+M KEV + ASELG++ N L
Sbjct: 245 SNGLHLEAFDLFVRMKNGACTPNMYTLVSVSKAVGKLVDVNMGKEVQNCASELGIEGNVL 304

Query: 160 V-----DMHSKCRSLQEA 173
           V     DM+SKC SL +A
Sbjct: 305 VGTALIDMYSKCGSLHDA 322

BLAST of CmoCh04G030070.1 vs. TrEMBL
Match: A0A068V210_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00042072001 PE=4 SV=1)

HSP 1 Score: 142.1 bits (357), Expect = 6.9e-31
Identity = 72/138 (52.17%), Positives = 97/138 (70.29%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+   ++I+I GFAS  FV T+LLNMYA+L ++++S KVF++M E N VSWNAMISGFT
Sbjct: 193 LGEMVHARILITGFASHVFVSTSLLNMYAKLGDVEESLKVFDSMNEHNEVSWNAMISGFT 252

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-- 159
            NGLY EA +HFL M      PD  S I + KA+GMLGD    K+V ++AS LG+DSN  
Sbjct: 253 ANGLYLEAFNHFLMMMEHRYAPDMYSIISVLKAVGMLGDAGKGKQVHNYASNLGLDSNVR 312

Query: 160 ---TLVDMHSKCRSLQEA 173
               L+DM++KC +L +A
Sbjct: 313 VGTALIDMYAKCGALSDA 330

BLAST of CmoCh04G030070.1 vs. TAIR10
Match: AT4G21065.1 (AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 90.5 bits (223), Expect = 1.2e-18
Identity = 44/146 (30.14%), Positives = 85/146 (58.22%), Query Frame = 1

Query: 32  VSWFGDHYIGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSW 91
           V+   D  +G+   S ++  GF SL +V  +LL++YA   ++  ++KVF+ M E ++V+W
Sbjct: 131 VTTMADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAW 190

Query: 92  NAMISGFTLNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASE 151
           N++I+GF  NG   EA   +  M  +G+ PD  + + +  A   +G +++ K V  +  +
Sbjct: 191 NSVINGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIK 250

Query: 152 LGV-----DSNTLVDMHSKCRSLQEA 173
           +G+      SN L+D++++C  ++EA
Sbjct: 251 VGLTRNLHSSNVLLDLYARCGRVEEA 276
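
The same Arabidopsis protein (At4g21065) is reported in both the Swiss-Prot and TAIR10 searches with an identical bit score (90.5) but different E-values (2.1e-17 and 1.2e-18). In standard BLAST statistics the E-value is the effective search space multiplied by 2 raised to the negative bit score, so for equal bit scores the E-value ratio reflects the relative size of the two search spaces. The sketch below illustrates this; the function name is hypothetical, and because the page rounds the bit score to one decimal place the absolute numbers are only approximate.

```python
# E = N * 2**(-S), where S is the bit score and N the effective search space.
def implied_search_space(e_value: float, bit_score: float) -> float:
    """Rearrange E = N * 2**(-S) to estimate N from a reported E-value and bit score."""
    return e_value * 2.0 ** bit_score


BIT_SCORE = 90.5        # identical in both searches
E_SWISSPROT = 2.1e-17   # PP330_ARATH, Swiss-Prot search
E_TAIR10 = 1.2e-18      # AT4G21065.1, TAIR10 search

ratio = E_SWISSPROT / E_TAIR10
print(f"E-value ratio: {ratio:.1f}")
print(f"Implied search space (Swiss-Prot): {implied_search_space(E_SWISSPROT, BIT_SCORE):.2e}")
print(f"Implied search space (TAIR10):     {implied_search_space(E_TAIR10, BIT_SCORE):.2e}")
```

The ratio comes out at about 17.5, which suggests the Swiss-Prot search space was roughly that many times larger than the TAIR10 one. An E-value is therefore comparable between hits within one database search, but not directly across databases.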

BLAST of CmoCh04G030070.1 vs. TAIR10
Match: AT2G33760.1 (AT2G33760.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 83.6 bits (205), Expect = 1.5e-16
Identity = 42/138 (30.43%), Positives = 77/138 (55.80%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           IG+      V+ GF   ++V  AL+  Y++  +++ + +VF+ M E ++V+WN+++SGF 
Sbjct: 125 IGKGVHCHAVVSGFGLDTYVQAALVTFYSKCGDMEGARQVFDRMPEKSIVAWNSLVSGFE 184

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-- 159
            NGL  EA   F +M+  G  PD+ +F+ +  A    G VS+   V  +    G+D N  
Sbjct: 185 QNGLADEAIQVFYQMRESGFEPDSATFVSLLSACAQTGAVSLGSWVHQYIISEGLDLNVK 244

Query: 160 ---TLVDMHSKCRSLQEA 173
               L++++S+C  + +A
Sbjct: 245 LGTALINLYSRCGDVGKA 262

BLAST of CmoCh04G030070.1 vs. TAIR10
Match: AT1G11290.1 (AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 82.0 bits (201), Expect = 4.3e-16
Identity = 45/138 (32.61%), Positives = 77/138 (55.80%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+     +V  GF+   F  T L NMYA+ +++ ++ KVF+ M E ++VSWN +++G++
Sbjct: 153 VGKEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYS 212

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVD---- 159
            NG+   A +    M  E + P   + + +  A+  L  +S+ KE+  +A   G D    
Sbjct: 213 QNGMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVN 272

Query: 160 -SNTLVDMHSKCRSLQEA 173
            S  LVDM++KC SL+ A
Sbjct: 273 ISTALVDMYAKCGSLETA 290

BLAST of CmoCh04G030070.1 vs. TAIR10
Match: AT2G34400.1 (AT2G34400.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 81.6 bits (200), Expect = 5.6e-16
Identity = 52/166 (31.33%), Positives = 85/166 (51.20%), Query Frame = 1

Query: 18  KKVREFFQISLRILVSWFG------DHYIGQNGSSQIVIRGFASLSFVYTALLNMYARLQ 77
           K   E F+   R LVS  G      D   G+      + +     +F+ + L++MY +  
Sbjct: 223 KMEEEGFEPDERTLVSMLGACSHLGDLRTGRLLEEMAITKKIGLSTFLGSKLISMYGKCG 282

Query: 78  EIQDSFKVFNTMTEFNVVSWNAMISGFTLNGLYSEACDHFLRMKGEGVTPDAQSFIGIAK 137
           ++  + +VFN M + + V+W AMI+ ++ NG  SEA   F  M+  GV+PDA +   +  
Sbjct: 283 DLDSARRVFNQMIKKDRVAWTAMITVYSQNGKSSEAFKLFFEMEKTGVSPDAGTLSTVLS 342

Query: 138 AMGMLGDVSMAKEVSHFASELGVDSNT-----LVDMHSKCRSLQEA 173
           A G +G + + K++   ASEL +  N      LVDM+ KC  ++EA
Sbjct: 343 ACGSVGALELGKQIETHASELSLQHNIYVATGLVDMYGKCGRVEEA 388

BLAST of CmoCh04G030070.1 vs. TAIR10
Match: AT3G61170.1 (AT3G61170.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 80.9 bits (198), Expect = 9.5e-16
Identity = 50/140 (35.71%), Positives = 76/140 (54.29%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           I  +    IV  G+A+   V  AL++MYA+   +  + KVF  M E +V+SW A+++G T
Sbjct: 347 IASSAHCLIVKTGYATYKLVNNALVDMYAKRGIMDSALKVFEGMIEKDVISWTALVTGNT 406

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEV------SHFASELG 159
            NG Y EA   F  M+  G+TPD      +  A   L  +   ++V      S F S L 
Sbjct: 407 HNGSYDEALKLFCNMRVGGITPDKIVTASVLSASAELTLLEFGQQVHGNYIKSGFPSSLS 466

Query: 160 VDSNTLVDMHSKCRSLQEAD 174
           V +N+LV M++KC SL++A+
Sbjct: 467 V-NNSLVTMYTKCGSLEDAN 485

BLAST of CmoCh04G030070.1 vs. NCBI nr
Match: gi|659113785|ref|XP_008456754.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like [Cucumis melo])

HSP 1 Score: 193.0 bits (489), Expect = 4.9e-46
Identity = 102/138 (73.91%), Positives = 115/138 (83.33%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+   +QIVIRGF S +FV TALLNMYA+LQEI+DS KVFNTMTE NVVSWNAMI+GFT
Sbjct: 186 LGKMVHAQIVIRGFTSHTFVSTALLNMYAKLQEIEDSCKVFNTMTEVNVVSWNAMITGFT 245

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSNTL 159
            NG Y +A D FLRMKGEGVTPDAQ+FIG+AKA+GML DV+ AKEVS +A ELGVDSNTL
Sbjct: 246 SNGFYLDAFDLFLRMKGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTL 305

Query: 160 V-----DMHSKCRSLQEA 173
           V     DMHSKC SLQEA
Sbjct: 306 VGTALIDMHSKCGSLQEA 323

BLAST of CmoCh04G030070.1 vs. NCBI nr
Match: gi|778709607|ref|XP_011656423.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Cucumis sativus])

HSP 1 Score: 188.3 bits (477), Expect = 1.2e-44
Identity = 100/138 (72.46%), Positives = 114/138 (82.61%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G    +QIVIRGF S +FV TALLNMYA+LQEI+DS+KVFNTMTE NVVSWNAMI+GFT
Sbjct: 186 LGNMVHAQIVIRGFTSHTFVSTALLNMYAKLQEIEDSYKVFNTMTEVNVVSWNAMITGFT 245

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSNTL 159
            N LY +A D FLRM GEGVTPDAQ+FIG+AKA+GML DV+ AKEVS +A ELGVDSNTL
Sbjct: 246 SNDLYLDAFDLFLRMMGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTL 305

Query: 160 V-----DMHSKCRSLQEA 173
           V     DM+SKC SLQEA
Sbjct: 306 VGTALIDMNSKCGSLQEA 323

BLAST of CmoCh04G030070.1 vs. NCBI nr
Match: gi|700190613|gb|KGN45817.1| (hypothetical protein Csa_6G013890 [Cucumis sativus])

HSP 1 Score: 188.3 bits (477), Expect = 1.2e-44
Identity = 100/138 (72.46%), Positives = 114/138 (82.61%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G    +QIVIRGF S +FV TALLNMYA+LQEI+DS+KVFNTMTE NVVSWNAMI+GFT
Sbjct: 176 LGNMVHAQIVIRGFTSHTFVSTALLNMYAKLQEIEDSYKVFNTMTEVNVVSWNAMITGFT 235

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSNTL 159
            N LY +A D FLRM GEGVTPDAQ+FIG+AKA+GML DV+ AKEVS +A ELGVDSNTL
Sbjct: 236 SNDLYLDAFDLFLRMMGEGVTPDAQTFIGVAKAIGMLRDVNKAKEVSGYALELGVDSNTL 295

Query: 160 V-----DMHSKCRSLQEA 173
           V     DM+SKC SLQEA
Sbjct: 296 VGTALIDMNSKCGSLQEA 313

BLAST of CmoCh04G030070.1 vs. NCBI nr
Match: gi|568874190|ref|XP_006490200.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Citrus sinensis])

HSP 1 Score: 164.1 bits (414), Expect = 2.4e-37
Identity = 83/138 (60.14%), Positives = 106/138 (76.81%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+   +QI+I+GFAS + V T+LLNMYA+L  ++DS+K+FNTMTE N VSWNAMISGFT
Sbjct: 186 LGKMVHAQIIIKGFASHTVVTTSLLNMYAKLGRVEDSYKMFNTMTEHNEVSWNAMISGFT 245

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-- 159
            NGL+SEA DHFL MK EGVTP+  + IG++KA+G L DV   KEV  FASELG++SN  
Sbjct: 246 SNGLHSEAFDHFLLMKSEGVTPNMLTIIGVSKAVGQLHDVDRGKEVQSFASELGLESNVQ 305

Query: 160 ---TLVDMHSKCRSLQEA 173
               L+DM+SKC SL +A
Sbjct: 306 VGTALIDMYSKCGSLNDA 323

BLAST of CmoCh04G030070.1 vs. NCBI nr
Match: gi|567857726|ref|XP_006421546.1| (hypothetical protein CICLE_v10004388mg [Citrus clementina])

HSP 1 Score: 158.7 bits (400), Expect = 1.0e-35
Identity = 81/138 (58.70%), Positives = 104/138 (75.36%), Query Frame = 1

Query: 40  IGQNGSSQIVIRGFASLSFVYTALLNMYARLQEIQDSFKVFNTMTEFNVVSWNAMISGFT 99
           +G+   +QI+I+GFAS + V T+LLNMYA+L  ++DS K+FNTMTE N VSWNAMISGFT
Sbjct: 186 LGKMVHAQIIIKGFASHTVVTTSLLNMYAKLGRVEDSHKMFNTMTEHNEVSWNAMISGFT 245

Query: 100 LNGLYSEACDHFLRMKGEGVTPDAQSFIGIAKAMGMLGDVSMAKEVSHFASELGVDSN-- 159
            NGL+SEA DHFL MK EGVTP+  + IG++KA+G L DV   KE+  FAS+LG+DSN  
Sbjct: 246 SNGLHSEAFDHFLLMKSEGVTPNMLTIIGVSKAIGQLRDVDKGKELQSFASKLGMDSNVE 305

Query: 160 ---TLVDMHSKCRSLQEA 173
                +DM+SKC SL +A
Sbjct: 306 VETAFIDMYSKCGSLCDA 323

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity (%)  Description
PP330_ARATH  2.1e-17  30.14         Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana GN...
PP182_ARATH  2.6e-15  30.43         Pentatricopeptide repeat-containing protein At2g33760 OS=Arabidopsis thaliana GN...
PPR32_ARATH  7.6e-15  32.61         Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP184_ARATH  9.9e-15  31.33         Pentatricopeptide repeat-containing protein At2g34400 OS=Arabidopsis thaliana GN...
PP249_ARATH  1.1e-13  35.11         Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN...

TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0KBQ4_CUCSA  8.4e-45  72.46         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G013890 PE=4 SV=1
V4SCU5_9ROSI      7.1e-36  58.70         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004388mg PE=4 SV=1
M5XS64_PRUPE      2.5e-33  55.07         Uncharacterized protein (Fragment) OS=Prunus persica GN=PRUPE_ppa014747mg PE=4 S...
F6HC58_VITVI      4.0e-31  55.07         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_13s0067g02260 PE=4 SV=...
A0A068V210_COFCA  6.9e-31  52.17         Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00042072001 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT4G21065.1  1.2e-18  30.14         Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G33760.1  1.5e-16  30.43         Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1  4.3e-16  32.61         Pentatricopeptide repeat (PPR) superfamily protein
AT2G34400.1  5.6e-16  31.33         Pentatricopeptide repeat (PPR-like) superfamily protein
AT3G61170.1  9.5e-16  35.71         Tetratricopeptide repeat (TPR)-like superfamily protein

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|659113785|ref|XP_008456754.1|  4.9e-46  73.91         PREDICTED: pentatricopeptide repeat-containing protein At2g27610-like [Cucumis m...
gi|778709607|ref|XP_011656423.1|  1.2e-44  72.46         PREDICTED: putative pentatricopeptide repeat-containing protein At3g25970 [Cucum...
gi|700190613|gb|KGN45817.1|       1.2e-44  72.46         hypothetical protein Csa_6G013890 [Cucumis sativus]
gi|568874190|ref|XP_006490200.1|  2.4e-37  60.14         PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Citrus si...
gi|567857726|ref|XP_006421546.1|  1.0e-35  58.70         hypothetical protein CICLE_v10004388mg [Citrus clementina]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name    Unique Name     Type
CmoCh04G030070  CmoCh04G030070  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name      Unique Name               Type
CmoCh04G030070.1  CmoCh04G030070.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name            Unique Name             Type
CmoCh04G030070.1.CDS.2  CmoCh04G030070.1.CDS.2  CDS
CmoCh04G030070.1.CDS.1  CmoCh04G030070.1.CDS.1  CDS


The following exon feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
CmoCh04G030070.1.exon.2  CmoCh04G030070.1.exon.2  exon
CmoCh04G030070.1.exon.1  CmoCh04G030070.1.exon.1  exon


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 87..126, score: 3.
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 59..88, score: 6.7E-4; coord: 89..123, score: 2.
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 87..121, score: 11.345; coord: 122..156, score: 5.196; coord: 56..86, score:
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 47..172, score: 5.3
None       No IPR available          PANTHER   PTHR24015:SF661  SUBFAMILY NOT NAMED  coord: 47..172, score: 5.3
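
The three PROFILE (PS51375) hits listed above tile the protein back to back, as would be expected for tandem pentatricopeptide repeats. The sketch below checks this by grouping the reported intervals into runs of directly adjacent hits; it assumes the coordinates are inclusive residue positions, and the helper `tandem_runs` is illustrative rather than part of InterProScan.

```python
# PS51375 (PPR) profile hits copied from the table above.
# Assumption: coordinates are inclusive residue positions on the protein.
PPR_HITS = [(87, 121), (122, 156), (56, 86)]


def tandem_runs(hits):
    """Group inclusive intervals into runs where each hit starts right after the previous one ends."""
    runs = []
    for start, end in sorted(hits):
        if runs and start == runs[-1][-1][1] + 1:
            runs[-1].append((start, end))   # directly adjacent: extend the current run
        else:
            runs.append([(start, end)])     # gap or first hit: start a new run
    return runs


runs = tandem_runs(PPR_HITS)
lengths = [end - start + 1 for start, end in sorted(PPR_HITS)]

print(runs)
print(lengths)
```

All three hits fall into a single run covering residues 56 to 156, with lengths of 31, 35 and 35 residues; the canonical PPR motif is about 35 amino acids long.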