CmaCh19G010790 (gene) Cucurbita maxima (Rimu)

Name: CmaCh19G010790
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Pentatricopeptide repeat-containing family protein
Location: Cma_Chr19 : 9002545 .. 9004300 (+)
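
The location string can be split into sequence ID, start, end and strand, and the genomic span follows from the coordinates. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its four parts.

    Hypothetical helper; the layout is taken from the Location field above.
    """
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cma_Chr19 : 9002545 .. 9004300 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, strand, span)
```

Under the inclusive-coordinate assumption the feature spans 1756 bp on the plus strand of Cma_Chr19.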
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAATTCGATGGCGTTGCCTGGAATTCCATGATTATGTGTGTGGCTAAATGTGGGGAAATTGGGGAGTCGAGGAAGCTGTTCGATAAAATGCCTGTTAGAAACCCGATCTCTTGGAATTCTATGATTGGTGGGTATGTTAGAAATGGGATGTTCAAAGAAGCTCTCAAGCTGTTTGTCAAAATGCAAGAAGAGAGAATCCAGCCTAGTGAGTTTACTATGGTGAGCTTGTTAAATGCTTCAACTCAAATTGGAGCACTTTCGGCAGGAGGAATGAATAACGAGTACATAAAGAAGAACAATTTAGAGTTGAGTGCCAATGTAGCAACAGCGATAATTGACATGTACAGCAAGCATAGGGAAGGCCCTTCAGGTGTTTGAGAAAATGCCCAACAGAGGTTATTTAGTTGGAATTCCATGATCTTTGGCCTTGCAGTGAATGGCTGCGAAAAGGAAGTAGTCTCAAACCAGATTCTGTTAGCTTTATGGAGAACGAAGGCAGCGGATTTTTCTCACTGACAAAAAACACATACAGAATTGATCCATCAACAAAACACTACAATTTAACGGTGGACATGTTCAGCCGAGCTGGGTTTCTTGAAGAAGCAGAGCAGCTCCTAAAAACCATGCCAATTAAGGCAGTTACAGTCATACGAGGGGTCACAGTTTGTAGAACATATGAGAACACAGAGATGGTGAAGAGGGCAGCAGAAAAAGTTAATGAACGGGTTATATTCTTATGGAGGATGTTCATGCTTGACGATGAAGGTCTGACGGAAGCGCAGGCAGAGTTTACCGGTACGATTAGGCCTGACATTTTGGCCCAAACGTTGCCTTGGGCTCGAACTGTAAGGAAGCATAGACAACTGGGCCTGGAATTTCAGCCCAAATGTTGTCCAGGGCTGGAAGTCAAGGAAGTATGGATAATTGGGCCTAAGATTTCAGCCCAAATAATTGGGCTAGGGCTGCCCAGTTCCCACACACGCACGCTCTTCTCACGTGTTCTACCTCCTTCTATCTTCAATTCGAATTTCTAAATTCTATAGCAGCAGCAGCCTCTCCTCTTTGTAGAGGGAAAGTAGTAAGGTTTCGGTTGGGTTCGGCTTGGATTGAGTGGCTTCTTGAAGGAATGACCGGTCGGTTCTCCAGTTCAAGGGGAAATTTTCGGGTCAGATCGAACCGCACCGGTTATATATATATTTAAGGGTTTTACGTGCCTTCAATTGTTCATTTTCTTCGGCCTTCACTTCCGTCTTCCGATTCCACTAGGCACCAAATCAGTCGCAGCGCCAGAGCTCTCCCTCGTTCGTTCTCTTCTCCCCGCCTGGTCGCCGTGCCACTCGCCCTCGTTCGTCGTTGCCTCCTCTTCTCCTCTTCTCCCCTTGACTCGCCGTCGTTCCTCGTCTCTTCATATCAGACTTCAACCTTCAGCCGACCGTCGCCATCGCCTCAAAACTTTTTACTCTTCTTTCTCTGCAGTCAGCCACCGTCCACTGCCAGCGCCGGTAACATTTACTCTCCTCAAAATTTCTTTTTGATCTTTTAGCTCAAGTCTTTCCTGTACCCAATGATTTTAGTCACAAGGTTTGTTGGTTTCTACTGTACAGATACCTGGGGTTTTGTCGAAACTTTGTCCAGCATTTTTTATTCATTGTGTTCACTGTTATTGTTTATGCATTTTGTGTTGGGAATGGTTTCAGATAGCCCGGTTTTCATTATTTTTGAAGCTCAATCAACGAAACAACAAAGAAAGTGA

mRNA sequence

ATGGAATTCGATGGCGTTGCCTGGAATTCCATGATTATGTGTGTGGCTAAATGTGGGGAAATTGGGGAGTCGAGGAAGCTGTTCGATAAAATGCCTGTTAGAAACCCGATCTCTTGGAATTCTATGATTGGTGGGTATGTTAGAAATGGGATGTTCAAAGAAGCTCTCAAGCTGTTTGTCAAAATGCAAGAAGAGAGAATCCAGCCTAGTGAGTTTACTATGGTGAGCTTGTTAAATGCTTCAACTCAAATTGGAGCACTTTCGGCAGGAGGAATGAATAACGAGTACATAAAGAAGAACAATTTAGAGTTGAGTGCCAATGTAGCAACAGCGATAATTGACATTGAATGGCTGCGAAAAGGAAGTAGTCTCAAACCAGATTCTGTTAGCTTTATGGAGAACGAAGGCAGCGGATTTTTCTCACTGACAAAAAACACATACAGAATTGATCCATCAACAAAACACTACAATTTAACGGTGGACATGTTCAGCCGAGCTGGGTTTCTTGAAGAAGCAGAGCAGCTCCTAAAAACCATGCCAATTAAGGCAGTTACAGTCATACGAGGGGTCACAGTTTGTAGAACATATGAGAACACAGAGATGGTGAAGAGGGCAGCAGAAAAAGTTAATGAACGGGTTATATTCTTATGGAGGATGTTCATGCTTGACGATGAAGGTCTGACGGAAGCGCAGGCAGAGTTTACCGGGTTTTACGTGCCTTCAATTGTTCATTTTCTTCGGCCTTCACTTCCGTCTTCCGATTCCACTAGGCACCAAATCAGTCGCAGCGCCAGAGCTCTCCCTCGTTCGTTCTCTTCTCCCCGCCTGGTCGCCGTGCCACTCGCCCTCGTTCGTCGTTGCCTCCTCTTCTCCTCTTCTCCCCTTGACTCGCCGTCGTTCCTCGTCTCTTCATATCAGACTTCAACCTTCAGCCGACCGTCGCCATCGCCTCAAAACTTTTTACTCTTCTTTCTCTGCAGTCAGCCACCGTCCACTGCCAGCGCCGATAGCCCGGTTTTCATTATTTTTGAAGCTCAATCAACGAAACAACAAAGAAAGTGA

Coding sequence (CDS)

ATGGAATTCGATGGCGTTGCCTGGAATTCCATGATTATGTGTGTGGCTAAATGTGGGGAAATTGGGGAGTCGAGGAAGCTGTTCGATAAAATGCCTGTTAGAAACCCGATCTCTTGGAATTCTATGATTGGTGGGTATGTTAGAAATGGGATGTTCAAAGAAGCTCTCAAGCTGTTTGTCAAAATGCAAGAAGAGAGAATCCAGCCTAGTGAGTTTACTATGGTGAGCTTGTTAAATGCTTCAACTCAAATTGGAGCACTTTCGGCAGGAGGAATGAATAACGAGTACATAAAGAAGAACAATTTAGAGTTGAGTGCCAATGTAGCAACAGCGATAATTGACATTGAATGGCTGCGAAAAGGAAGTAGTCTCAAACCAGATTCTGTTAGCTTTATGGAGAACGAAGGCAGCGGATTTTTCTCACTGACAAAAAACACATACAGAATTGATCCATCAACAAAACACTACAATTTAACGGTGGACATGTTCAGCCGAGCTGGGTTTCTTGAAGAAGCAGAGCAGCTCCTAAAAACCATGCCAATTAAGGCAGTTACAGTCATACGAGGGGTCACAGTTTGTAGAACATATGAGAACACAGAGATGGTGAAGAGGGCAGCAGAAAAAGTTAATGAACGGGTTATATTCTTATGGAGGATGTTCATGCTTGACGATGAAGGTCTGACGGAAGCGCAGGCAGAGTTTACCGGGTTTTACGTGCCTTCAATTGTTCATTTTCTTCGGCCTTCACTTCCGTCTTCCGATTCCACTAGGCACCAAATCAGTCGCAGCGCCAGAGCTCTCCCTCGTTCGTTCTCTTCTCCCCGCCTGGTCGCCGTGCCACTCGCCCTCGTTCGTCGTTGCCTCCTCTTCTCCTCTTCTCCCCTTGACTCGCCGTCGTTCCTCGTCTCTTCATATCAGACTTCAACCTTCAGCCGACCGTCGCCATCGCCTCAAAACTTTTTACTCTTCTTTCTCTGCAGTCAGCCACCGTCCACTGCCAGCGCCGATAGCCCGGTTTTCATTATTTTTGAAGCTCAATCAACGAAACAACAAAGAAAGTGA

Protein sequence

MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDIEWLRKGSSLKPDSVSFMENEGSGFFSLTKNTYRIDPSTKHYNLTVDMFSRAGFLEEAEQLLKTMPIKAVTVIRGVTVCRTYENTEMVKRAAEKVNERVIFLWRMFMLDDEGLTEAQAEFTGFYVPSIVHFLRPSLPSSDSTRHQISRSARALPRSFSSPRLVAVPLALVRRCLLFSSSPLDSPSFLVSSYQTSTFSRPSPSPQNFLLFFLCSQPPSTASADSPVFIIFEAQSTKQQRK
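
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1), the function below translates a CDS codon by codon and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative and not part of the database; ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nucleotides of the CDS listed above.
print(translate("ATGGAATTCGATGGCGTTGCC"))
```

Applied to the first seven codons of the CDS, this yields `MEFDGVA`, which matches the start of the protein sequence shown above.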
BLAST of CmaCh19G010790 vs. Swiss-Prot
Match: PP200_ARATH (Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidopsis thaliana GN=PCMP-E75 PE=2 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 3.1e-30
Identity = 65/113 (57.52%), Positives = 84/113 (74.34%), Query Frame = 1

Query: 3   FDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKM 62
           FD VAWNSMIM  AKCG I +++ LFD+MP RN +SWNSMI G+VRNG FK+AL +F +M
Sbjct: 190 FDVVAWNSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREM 249

Query: 63  QEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           QE+ ++P  FTMVSLLNA   +GA   G   +EYI +N  EL++ V TA+ID+
Sbjct: 250 QEKDVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDM 302
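
The percentages in each HSP header are the identical (or positive-scoring) columns divided by the alignment length. The sketch below recomputes them from the header line; the names `HSP_STATS` and `hsp_percentages` are hypothetical, and the pattern is an assumption based on the header layout used on this page. It accepts both "Positives" and the "Postives" spelling that appears in some renderings of this output.

```python
import re

# Matches e.g. "Identity = 65/113 (57.52%), Positives = 84/113 (74.34%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = (int(x) for x in m.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )

line = "Identity = 65/113 (57.52%), Positives = 84/113 (74.34%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For the PP200_ARATH hit this reproduces the reported 57.52% identity and 74.34% positives over 113 aligned columns.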

BLAST of CmaCh19G010790 vs. Swiss-Prot
Match: PP295_ARATH (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 102.8 bits (255), Expect = 7.7e-21
Identity = 53/114 (46.49%), Positives = 77/114 (67.54%), Query Frame = 1

Query: 7   AWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ--- 66
           AWNS++   AK G I ++RKLFD+MP RN ISW+ +I GYV  G +KEAL LF +MQ   
Sbjct: 130 AWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPK 189

Query: 67  --EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
             E  ++P+EFTM ++L+A  ++GAL  G   + YI K ++E+   + TA+ID+
Sbjct: 190 PNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDM 243

BLAST of CmaCh19G010790 vs. Swiss-Prot
Match: PP410_ARATH (Putative pentatricopeptide repeat-containing protein At5g40405 OS=Arabidopsis thaliana GN=PCMP-H14 PE=3 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 6.5e-20
Identity = 45/112 (40.18%), Positives = 76/112 (67.86%), Query Frame = 1

Query: 4   DGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ 63
           D V   +M+   A+CG++  +RKLF+ MP R+PI+WN+MI GY + G  +EAL +F  MQ
Sbjct: 173 DFVCRTAMVTACARCGDVVFARKLFEGMPERDPIAWNAMISGYAQVGESREALNVFHLMQ 232

Query: 64  EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
            E ++ +   M+S+L+A TQ+GAL  G   + YI++N ++++  +AT ++D+
Sbjct: 233 LEGVKVNGVAMISVLSACTQLGALDQGRWAHSYIERNKIKITVRLATTLVDL 284

BLAST of CmaCh19G010790 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 9.4e-19
Identity = 42/114 (36.84%), Positives = 73/114 (64.04%), Query Frame = 1

Query: 4   DGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ 63
           D V+W SM+    KCG +  +R++FD+MP RN  +W+ MI GY +N  F++A+ LF  M+
Sbjct: 182 DVVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMK 241

Query: 64  EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDIEW 118
            E +  +E  MVS++++   +GAL  G    EY+ K+++ ++  + TA++D+ W
Sbjct: 242 REGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFW 295

BLAST of CmaCh19G010790 vs. Swiss-Prot
Match: PP235_ARATH (Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis thaliana GN=PCMP-E51 PE=3 SV=2)

HSP 1 Score: 95.1 bits (235), Expect = 1.6e-18
Identity = 50/126 (39.68%), Positives = 74/126 (58.73%), Query Frame = 1

Query: 4   DGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ 63
           D ++W S++    + G +  +R  FD+MPVR+ ISW  MI GY+R G F E+L++F +MQ
Sbjct: 301 DVISWTSIVKGYVERGNLKLARTYFDQMPVRDRISWTIMIDGYLRAGCFNESLEIFREMQ 360

Query: 64  EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDIEWLRKGSS 123
              + P EFTMVS+L A   +G+L  G     YI KN ++    V  A+ID+ + + G S
Sbjct: 361 SAGMIPDEFTMVSVLTACAHLGSLEIGEWIKTYIDKNKIKNDVVVGNALIDM-YFKCGCS 420

Query: 124 LKPDSV 130
            K   V
Sbjct: 421 EKAQKV 425

BLAST of CmaCh19G010790 vs. TrEMBL
Match: A0A0A0K7F6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G235760 PE=4 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 1.4e-45
Identity = 95/115 (82.61%), Positives = 104/115 (90.43%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           MEFD V+WNSMI+ +AKCGEI ESRKLFDKMPV+NPISWNSMIGGYVRNGMFKEALKLF+
Sbjct: 186 MEFDVVSWNSMILGLAKCGEIDESRKLFDKMPVKNPISWNSMIGGYVRNGMFKEALKLFI 245

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           KMQEERIQPSEFTMVSLLNAS QIGAL  G   +EYIKKNNL+L+A V TAIID+
Sbjct: 246 KMQEERIQPSEFTMVSLLNASAQIGALRQGVWIHEYIKKNNLQLNAIVVTAIIDM 300

BLAST of CmaCh19G010790 vs. TrEMBL
Match: F6H5C7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g00060 PE=4 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 9.1e-37
Identity = 78/115 (67.83%), Positives = 95/115 (82.61%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           M+FD VAWNSMIM +AKCGE+ ESRKLFD+MP+RN +SWNSMI GYVRNG  +EAL LF 
Sbjct: 151 MDFDIVAWNSMIMGLAKCGEVDESRKLFDEMPLRNTVSWNSMISGYVRNGRLREALDLFG 210

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           +MQEERI+PSEFTMVSLLNAS ++GAL  G   ++YI+KNN EL+  V  +IID+
Sbjct: 211 QMQEERIKPSEFTMVSLLNASARLGALKQGEWIHDYIRKNNFELNVIVTASIIDM 265

BLAST of CmaCh19G010790 vs. TrEMBL
Match: W9RHE2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000760 PE=4 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 1.2e-33
Identity = 80/128 (62.50%), Positives = 98/128 (76.56%), Query Frame = 1

Query: 2   EFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVK 61
           E D VAWNSMIM ++KCGE+GESR+LFD+MP+RN +SWNSMI GYVRNG   EAL+LF K
Sbjct: 189 ELDLVAWNSMIMGLSKCGEVGESRRLFDRMPLRNSVSWNSMISGYVRNGKCVEALELFGK 248

Query: 62  MQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDIEWLRKG 121
           MQ E I+ SEFTMVSLLNAS ++GA+  G   +EYI KN +EL+  V TAIID+ + + G
Sbjct: 249 MQGEGIKASEFTMVSLLNASGRLGAIRQGEWIHEYITKNGIELNVIVVTAIIDM-YCKCG 308

Query: 122 SSLKPDSV 130
           S  K  SV
Sbjct: 309 SVNKALSV 315

BLAST of CmaCh19G010790 vs. TrEMBL
Match: A0A061DLR3_THECC (Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao GN=TCM_001877 PE=4 SV=1)

HSP 1 Score: 149.1 bits (375), Expect = 1.0e-32
Identity = 74/115 (64.35%), Positives = 90/115 (78.26%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           ME D VAWNSMI+ +AKCGE+ ESR+LF+KM  RN +SWNSMI GYVRNG F EAL+LF 
Sbjct: 185 MELDIVAWNSMIIGLAKCGEVDESRRLFNKMVSRNTVSWNSMISGYVRNGRFLEALELFQ 244

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           +MQEE I+PSEFTMVSLLNA   +GA++ G   ++YI K N EL+  V TAIID+
Sbjct: 245 EMQEEHIRPSEFTMVSLLNACACLGAITQGKWIHDYILKQNFELNGIVVTAIIDM 299

BLAST of CmaCh19G010790 vs. TrEMBL
Match: A0A059DH34_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A01980 PE=4 SV=1)

HSP 1 Score: 148.7 bits (374), Expect = 1.4e-32
Identity = 75/121 (61.98%), Positives = 94/121 (77.69%), Query Frame = 1

Query: 2   EFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVK 61
           E D VAWNSMIM +AKCGEI ESR+LFD+ P+RN I+W+SMI GYVRNG F +A+KLF +
Sbjct: 152 ERDVVAWNSMIMGLAKCGEIDESRRLFDQAPLRNAITWSSMISGYVRNGRFVDAMKLFEE 211

Query: 62  MQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDIEWLRKG 121
           MQE  I+P+EFT+VSLLNAS  +GAL  G   +EYIKK   +L+A + TAIID+ + R G
Sbjct: 212 MQERGIEPNEFTLVSLLNASAHLGALKQGQWVHEYIKKGGAQLNAIITTAIIDM-YCRCG 271

Query: 122 S 123
           S
Sbjct: 272 S 271

BLAST of CmaCh19G010790 vs. TAIR10
Match: AT2G42920.1 (AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 134.0 bits (336), Expect = 1.8e-31
Identity = 65/113 (57.52%), Positives = 84/113 (74.34%), Query Frame = 1

Query: 3   FDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKM 62
           FD VAWNSMIM  AKCG I +++ LFD+MP RN +SWNSMI G+VRNG FK+AL +F +M
Sbjct: 190 FDVVAWNSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFREM 249

Query: 63  QEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           QE+ ++P  FTMVSLLNA   +GA   G   +EYI +N  EL++ V TA+ID+
Sbjct: 250 QEKDVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIVVTALIDM 302

BLAST of CmaCh19G010790 vs. TAIR10
Match: AT3G62890.1 (AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 102.8 bits (255), Expect = 4.3e-22
Identity = 53/114 (46.49%), Positives = 77/114 (67.54%), Query Frame = 1

Query: 7   AWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ--- 66
           AWNS++   AK G I ++RKLFD+MP RN ISW+ +I GYV  G +KEAL LF +MQ   
Sbjct: 130 AWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPK 189

Query: 67  --EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
             E  ++P+EFTM ++L+A  ++GAL  G   + YI K ++E+   + TA+ID+
Sbjct: 190 PNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDM 243

BLAST of CmaCh19G010790 vs. TAIR10
Match: AT5G40405.1 (AT5G40405.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 99.8 bits (247), Expect = 3.7e-21
Identity = 45/112 (40.18%), Positives = 76/112 (67.86%), Query Frame = 1

Query: 4   DGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ 63
           D V   +M+   A+CG++  +RKLF+ MP R+PI+WN+MI GY + G  +EAL +F  MQ
Sbjct: 173 DFVCRTAMVTACARCGDVVFARKLFEGMPERDPIAWNAMISGYAQVGESREALNVFHLMQ 232

Query: 64  EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
            E ++ +   M+S+L+A TQ+GAL  G   + YI++N ++++  +AT ++D+
Sbjct: 233 LEGVKVNGVAMISVLSACTQLGALDQGRWAHSYIERNKIKITVRLATTLVDL 284

BLAST of CmaCh19G010790 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 95.9 bits (237), Expect = 5.3e-20
Identity = 42/114 (36.84%), Positives = 73/114 (64.04%), Query Frame = 1

Query: 4   DGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ 63
           D V+W SM+    KCG +  +R++FD+MP RN  +W+ MI GY +N  F++A+ LF  M+
Sbjct: 182 DVVSWTSMVAGYCKCGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMK 241

Query: 64  EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDIEW 118
            E +  +E  MVS++++   +GAL  G    EY+ K+++ ++  + TA++D+ W
Sbjct: 242 REGVVANETVMVSVISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFW 295

BLAST of CmaCh19G010790 vs. TAIR10
Match: AT3G15930.1 (AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 95.1 bits (235), Expect = 9.0e-20
Identity = 50/126 (39.68%), Positives = 74/126 (58.73%), Query Frame = 1

Query: 4   DGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFVKMQ 63
           D ++W S++    + G +  +R  FD+MPVR+ ISW  MI GY+R G F E+L++F +MQ
Sbjct: 301 DVISWTSIVKGYVERGNLKLARTYFDQMPVRDRISWTIMIDGYLRAGCFNESLEIFREMQ 360

Query: 64  EERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDIEWLRKGSS 123
              + P EFTMVS+L A   +G+L  G     YI KN ++    V  A+ID+ + + G S
Sbjct: 361 SAGMIPDEFTMVSVLTACAHLGSLEIGEWIKTYIDKNKIKNDVVVGNALIDM-YFKCGCS 420

Query: 124 LKPDSV 130
            K   V
Sbjct: 421 EKAQKV 425

BLAST of CmaCh19G010790 vs. NCBI nr
Match: gi|449467271|ref|XP_004151347.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic [Cucumis sativus])

HSP 1 Score: 191.8 bits (486), Expect = 2.0e-45
Identity = 95/115 (82.61%), Positives = 104/115 (90.43%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           MEFD V+WNSMI+ +AKCGEI ESRKLFDKMPV+NPISWNSMIGGYVRNGMFKEALKLF+
Sbjct: 186 MEFDVVSWNSMILGLAKCGEIDESRKLFDKMPVKNPISWNSMIGGYVRNGMFKEALKLFI 245

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           KMQEERIQPSEFTMVSLLNAS QIGAL  G   +EYIKKNNL+L+A V TAIID+
Sbjct: 246 KMQEERIQPSEFTMVSLLNASAQIGALRQGVWIHEYIKKNNLQLNAIVVTAIIDM 300

BLAST of CmaCh19G010790 vs. NCBI nr
Match: gi|659092604|ref|XP_008447140.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic [Cucumis melo])

HSP 1 Score: 189.1 bits (479), Expect = 1.3e-44
Identity = 94/115 (81.74%), Positives = 103/115 (89.57%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           MEFD V+WNSMI+ +AKCGEI ESRKLFDKMPV+N ISWNSMIGGYVRNGMFKEALKLF+
Sbjct: 186 MEFDVVSWNSMILGLAKCGEIDESRKLFDKMPVKNSISWNSMIGGYVRNGMFKEALKLFI 245

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           KMQEERIQPSEFTMVSLLNAS QIGAL  G   +EYIKKNNL+L+A V TAIID+
Sbjct: 246 KMQEERIQPSEFTMVSLLNASAQIGALRQGEWIHEYIKKNNLQLNAIVVTAIIDM 300

BLAST of CmaCh19G010790 vs. NCBI nr
Match: gi|225446849|ref|XP_002279693.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic [Vitis vinifera])

HSP 1 Score: 162.5 bits (410), Expect = 1.3e-36
Identity = 78/115 (67.83%), Positives = 95/115 (82.61%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           M+FD VAWNSMIM +AKCGE+ ESRKLFD+MP+RN +SWNSMI GYVRNG  +EAL LF 
Sbjct: 187 MDFDIVAWNSMIMGLAKCGEVDESRKLFDEMPLRNTVSWNSMISGYVRNGRLREALDLFG 246

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           +MQEERI+PSEFTMVSLLNAS ++GAL  G   ++YI+KNN EL+  V  +IID+
Sbjct: 247 QMQEERIKPSEFTMVSLLNASARLGALKQGEWIHDYIRKNNFELNVIVTASIIDM 301

BLAST of CmaCh19G010790 vs. NCBI nr
Match: gi|1009126437|ref|XP_015880153.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 161.8 bits (408), Expect = 2.2e-36
Identity = 76/115 (66.09%), Positives = 96/115 (83.48%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           +E+D VAWNSMIM ++KCGE+GESR+LFDKMP++N +SWNSMI GYVRNGMF EAL+LF 
Sbjct: 184 LEYDVVAWNSMIMGLSKCGEVGESRRLFDKMPLKNSVSWNSMISGYVRNGMFIEALELFG 243

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           KMQEERI+PSE+TMVSLLNA   +GA+  G   ++YI+KN +EL+    TAIID+
Sbjct: 244 KMQEERIKPSEYTMVSLLNACGSLGAIRQGKWIHDYIRKNGIELNVLAITAIIDM 298

BLAST of CmaCh19G010790 vs. NCBI nr
Match: gi|470140203|ref|XP_004305832.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic [Fragaria vesca subsp. vesca])

HSP 1 Score: 161.0 bits (406), Expect = 3.8e-36
Identity = 77/115 (66.96%), Positives = 98/115 (85.22%), Query Frame = 1

Query: 1   MEFDGVAWNSMIMCVAKCGEIGESRKLFDKMPVRNPISWNSMIGGYVRNGMFKEALKLFV 60
           +EFD VAWNSMIM ++KCGE+GESR+LFDKMP RN ISWNSMIGG VRNGM+ EAL LF 
Sbjct: 183 LEFDIVAWNSMIMGLSKCGEVGESRRLFDKMPQRNSISWNSMIGGSVRNGMYTEALDLFG 242

Query: 61  KMQEERIQPSEFTMVSLLNASTQIGALSAGGMNNEYIKKNNLELSANVATAIIDI 116
           +MQ+++I+PSEFTMVSLLNAS Q+GA+  G   +EYI+KN+++L+  V TAII++
Sbjct: 243 EMQKQKIKPSEFTMVSLLNASAQLGAIRQGEWIHEYIRKNHIQLNPIVVTAIINM 297

The following BLAST results are available for this feature:
Match Name        E-value  Identity  Description
PP200_ARATH       3.1e-30  57.52     Pentatricopeptide repeat-containing protein At2g42920, chloroplastic OS=Arabidop...
PP295_ARATH       7.7e-21  46.49     Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana GN...
PP410_ARATH       6.5e-20  40.18     Putative pentatricopeptide repeat-containing protein At5g40405 OS=Arabidopsis th...
PP367_ARATH       9.4e-19  36.84     Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN...
PP235_ARATH       1.6e-18  39.68     Putative pentatricopeptide repeat-containing protein At3g15930 OS=Arabidopsis th...

Match Name        E-value  Identity  Description
A0A0A0K7F6_CUCSA  1.4e-45  82.61     Uncharacterized protein OS=Cucumis sativus GN=Csa_7G235760 PE=4 SV=1
F6H5C7_VITVI      9.1e-37  67.83     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0028g00060 PE=4 SV=...
W9RHE2_9ROSA      1.2e-33  62.50     Uncharacterized protein OS=Morus notabilis GN=L484_000760 PE=4 SV=1
A0A061DLR3_THECC  1.0e-32  64.35     Pentatricopeptide repeat (PPR-like) superfamily protein OS=Theobroma cacao GN=TC...
A0A059DH34_EUCGR  1.4e-32  61.98     Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_A01980 PE=4 SV=1

Match Name        E-value  Identity  Description
AT2G42920.1       1.8e-31  57.52     Pentatricopeptide repeat (PPR-like) superfamily protein
AT3G62890.1       4.3e-22  46.49     Pentatricopeptide repeat (PPR) superfamily protein
AT5G40405.1       3.7e-21  40.18     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G06540.1       5.3e-20  36.84     Pentatricopeptide repeat (PPR) superfamily protein
AT3G15930.1       9.0e-20  39.68     Pentatricopeptide repeat (PPR) superfamily protein

Match Name                         E-value  Identity  Description
gi|449467271|ref|XP_004151347.1|   2.0e-45  82.61     PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic ...
gi|659092604|ref|XP_008447140.1|   1.3e-44  81.74     PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic ...
gi|225446849|ref|XP_002279693.1|   1.3e-36  67.83     PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic ...
gi|1009126437|ref|XP_015880153.1|  2.2e-36  66.09     PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic ...
gi|470140203|ref|XP_004305832.1|   3.8e-36  66.96     PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic ...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh19G010790.1  CmaCh19G010790.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description           Source    Source Term      Source Description   Alignment
IPR002885  Pentatricopeptide repeat  PFAM      PF01535          PPR                  coord: 156..179 (score: 0.024); coord: 6..33 (score: 2.)
IPR002885  Pentatricopeptide repeat  PFAM      PF13041          PPR_2                coord: 35..80 (score: 1.9)
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756        TIGR00756            coord: 6..32 (score: 0.0015); coord: 37..71 (score: 1.3)
IPR002885  Pentatricopeptide repeat  PROFILE   PS51375          PPR                  coord: 4..34 (score: 8.912); coord: 152..186 (score: 8.539); coord: 35..69 (score: 13)
None       No IPR available          PANTHER   PTHR24015        FAMILY NOT NAMED     coord: 1..211 (score: 1.3)
None       No IPR available          PANTHER   PTHR24015:SF485  SUBFAMILY NOT NAMED  coord: 1..211 (score: 1.3)

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh19G010790  CmaCh11G019600  Cucurbita maxima (Rimu)  cmacmaB148