CmaCh12G011450 (gene) Cucurbita maxima (Rimu)

Name: CmaCh12G011450
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Prolyl 4-hydroxylase alpha-like protein
Location: Cma_Chr12 : 9027893 .. 9031082 (-)
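
The Location field above can be read mechanically. The following is a minimal Python sketch, not part of the database's own tooling: the `Location` type and `parse_location` function are hypothetical names, and the sketch assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (assumed 1-based, inclusive on both ends)."""
    seqid: str
    start: int
    end: int
    strand: str  # "+" or "-"

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count as part of the feature.
        return self.end - self.start + 1


# Matches strings shaped like "Cma_Chr12 : 9027893 .. 9031082 (-)".
_LOCATION_RE = re.compile(
    r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text: str) -> Location:
    """Parse a 'seqid : start .. end (strand)' string into a Location."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cma_Chr12 : 9027893 .. 9031082 (-)")
print(loc.seqid, loc.length, loc.strand)
```

Under the inclusive-coordinate assumption the feature spans 3190 bp, and the `(-)` suffix marks the gene as lying on the reverse strand of the chromosome.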
The following sequences are available for this feature:

Gene sequence (with intron)

GTTAGCTGTCGTTACCATAACAGTTATGTCTACATCTCCCAAAACCAGAAAAATCCGAAACCCATTAATTCAGTTCCTCGTTCTTCGATCTCCCTGTGGGCGCGTACAAATCCAGATTACCCACCAAAAATTCCCCCCCAATTTCCTGCATGGCCAAGCCTCGCCCAAACCCCGTTTTATATATTAATCTTTGTTTCAATTTCAGAAGTAGCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGTTCTTTAATGGCGTGTTGAAAGGGAAATACATGAAGTCTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAGTCATGGCCTTCCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGTCTCGCTTCCGTCCAGCATAGAGCCGTTCATAGGTACTTTGGCAATGCTTATTAGCGGAATCAAATGATTTTGATTTTGATTTTGATTTTTGAACTTGGAATCGAACCCCCTTTTTCTTGTTGCAGTGATGGGTTGGGGAAGAGAGAGGATCAGTGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTCGTTTATCACAATTTCTTGGTAAGTTCTTAGAATTTTGGATTCCTTTTGGCTTTTTGTTTGGTTTTTGTCATGTTTTTGGTTGAAATGCTTGCTGATATGAATGATTCTGGACAGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAAAACCTTACATGAAAAAATCAACTGTGGTTGATGTAAAAACTGGCAAGATTAAAGATAGCAGGTCTGTATGTGTCATTTTCATTTTTGTCATCAATAGAATTTGACAATCCTGACTATGAACTAACGTTGGTGGGGCTTACAATGTCATGTCCGCAGGACGCGCACCAGTTCCGGGATGTTTCTGAATAGAGAGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGGTAATGATTTCCAAATTTGCATGTTACCTTGGCCAATTTTTCATGCATCTTCTAATGATTCTTATCTATAAGGGCTTCAAAACATGTTGGAATCATTGTCTTTGAAATCTTGCTTCTATGTATTAGTCTTGTCTGGTGGAACATGATAGTGATTTGATTGACGACCTTACTTAAACAAGTGTCGTTGAGTTGTGTAGATAGTTATTTCACTGTGTTCAATAACATCACGTATAGTTAGAGTAGCCAAATTATGAGATCCCGCCTCGGTTGTAGAGGGAAACAAAGCATTCCTTATAAGAGTGTGGAAACATATTCCTAGTAAGCACGTTTTAAAACTGTGAGGCTAACAATGATATGTAATGGGCCAAAGCAAACAACATCTACTAGCGGTGGGCTTGGGTTGTTACAAATGGTATCAGAGACAGTTATTGGACGGTTCGAAGCACTGTCTAGACCTTCTCCATTGTAGATGCATTTTAAAACCGTGAGGATGATGGTAATGTGCAACCGGTCAAAACGGACAATATTAGCTAGCGGTGGACTTGGTTTGTTATAAATGGTATCAAAGCCACATATCGGGCGATGTGCTAGCGAGGAGACTGGGTCCCAAGGGGGTGGATTGTGAGATCCCACATCAGTTGGAGAAGGGAACAAAACTTCCTTATAAGGGTGTGAAAACTTCTCCCTAGTAGACACATTTCAAAACTGTGAGGCTGACGACGATATGTAACGAGCAAGAGTAGACAATATAGGAGTGGGCTTGGGCTGATGAAAAATGAGCTAATTTTCTCGTGGGTTTGTTCTCCATTCTAGTTTTGAGGCCCAAACCAAGCAGGGCTTTGGTGAACACTACTCATTTGCTTGTTGATACTTTGTTGACATCACTGACATTCTTGGTCTCAATACATTTTGGTATTAATGTTGATATTCACTATTCCAAACAGAGCATGGAGAAGCACTGACAAGGAAGGGAACGCT
TAATTTGGAAGTTTTTTTTTTTTATTAAGCCACTGACAAGGAAGGGAACGTGCACCTTGTTGGTATATGTAGTAACTTCGGTTCTTTTTTAGTCTGTCTCAATCACTTTCATACAAATATGGTCTTATCTTGTCATTTGTGTGCTTTTTTTTACTTGGGTATCATAACTTTGAGAGGCGGATTTGAAATAATATTCTACAAATCTAGTGTTTTATTAGCAATATATGACATGATTTTAAGTGAGATCATTTCCAAATTGCAGGGTGGAATGAAGGTATACATTTTAACTGGCTACTCATTTGGATGATGATGTCTGTTTTTTTCCTTCCCGAAGTTTGCTTATGAATGCATTATTATAACTATTTGCAATTTAATGTTACTACCGTTTCGAGATTGTTTTGTTATGTGGATCTTATGTACGAGCTGACATGGAGGAAAGACCGGTGGTCTGGTCTTAGCGTATGTTTGTTGAGTTCTGATGGCCATATCATAGGAGGGGGAGTTGCAGGAGTTGGTGGACCATTGAAGGCTGCTGGACCCATGCAGGTACCGTAACCAACAAATTGTTTTTCCAGTATTTTCCCTAAGAAGTTCAGTCGATGTCTGAATGTTTTATTGTATAAAGAGTTTTAGATGTATTATTCTTTTGCTTAGTTCATTCTTGTATGGACCCATCCACATCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGCAATCTTCGCTTCACTGTGTTAGCTAATTCTGTCGTCGTAGTATGCTCCATACTGTACAGGTGATAGCAAACTCATCAGAAGTTATTATCCATCCACTTTTCATGGTGCATTTGCATGTTCTAATTCTAATTCTCCTGTTTTTCTCCGTTTAGATGAACAAGTTTGGTGTTGTTGTTACAGGTTGCAATCTGCAATTTGATCATATCAACAAAAACCTCTGGTGATTTAGTATCAAATCTCACCTTTTTATCACATAATATTTGGTTTGAACGTTTTGGCCGACCATCTCGAAGTTCGGTGGTCGTTGGGGTGAGCTAAAATACTTTACTCTCGGTTTGATCTACATACATTATAAAATAGCGAGTAAATAACTTCACTGTATCAACTTTAACAGTATTCAGTGTTCCACCCGACCAATTTCAATAGAGTCCACTGTTAAAGTTTGCCC

mRNA sequence

GTTAGCTGTCGTTACCATAACAGTTATGTCTACATCTCCCAAAACCAGAAAAATCCGAAACCCATTAATTCAGTTCCTCGTTCTTCGATCTCCCTGTGGGCGCGTACAAATCCAGATTACCCACCAAAAATTCCCCCCCAATTTCCTGCATGGCCAAGCCTCGCCCAAACCCCGTTTTATATATTAATCTTTGTTTCAATTTCAGAAGTAGCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGTTCTTTAATGGCGTGTTGAAAGGGAAATACATGAAGTCTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAGTCATGGCCTTCCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGTCTCGCTTCCGTCCAGCATAGAGCCGTTCATAGTGATGGGTTGGGGAAGAGAGAGGATCAGTGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTCGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAAAACCTTACATGAAAAAATCAACTGTGGTTGATGTAAAAACTGGCAAGATTAAAGATAGCAGGACGCGCACCAGTTCCGGGATGTTTCTGAATAGAGAGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGGGTGGAATGAAGGTATACATTTTAACTGGCTACTCATTTGGATGATGATATTGTTTTGTTATGTGGATCTTATGTACGAGCTGACATGGAGGAAAGACCGGTGGTCTGGTCTTAGCGTATGTTTGTTGAGTTCTGATGGCCATATCATAGGAGGGGGAGTTGCAGGAGTTGGTGGACCATTGAAGGCTGCTGGACCCATGCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGCAATCTTCGCTTCACTGTGTTAGCTAATTCTGTCGTCGTAGTATGCTCCATACTGTACAGGTTGCAATCTGCAATTTGATCATATCAACAAAAACCTCTGGTGATTTAGTATCAAATCTCACCTTTTTATCACATAATATTTGGTTTGAACGTTTTGGCCGACCATCTCGAAGTTCGGTGGTCGTTGGGGTGAGCTAAAATACTTTACTCTCGGTTTGATCTACATACATTATAAAATAGCGAGTAAATAACTTCACTGTATCAACTTTAACAGTATTCAGTGTTCCACCCGACCAATTTCAATAGAGTCCACTGTTAAAGTTTGCCC

Coding sequence (CDS)

ATGGCCAAGCCTCGCCCAAACCCCGTTTTATATATTAATCTTTGTTTCAATTTCAGAAGTAGCAGCTTATCGGAATCGGGGTTTGATTCTTCTGAAGCTTCTTCGTTCTTTAATGGCGTGTTGAAAGGGAAATACATGAAGTCTCAGGGCCGGAAATGGTCCACATTCAAGCTTTCCAAGATAGTCATGGCCTTCCTTTTGGCACTTGGGATTTCCATGTTCATCGCTTTCCGATTCTTCTCTCCTACTGAAAGTTCTCATAGCAATCTACTCCACCGTCTCGCTTCCGTCCAGCATAGAGCCGTTCATAGTGATGGGTTGGGGAAGAGAGAGGATCAGTGGGTTGAGTTCATTTCATGGGAGCCTAGGGCTTTCGTTTATCACAATTTCTTGTCCAAGGAAGAATGCTTGTATTTGATTAGTCTTGCAAAACCTTACATGAAAAAATCAACTGTGGTTGATGTAAAAACTGGCAAGATTAAAGATAGCAGGACGCGCACCAGTTCCGGGATGTTTCTGAATAGAGAGCAGAACAAAATTGTCAGCAACATAGAGAAAAGAATAGCAGATTTTACATTCATTCCCGTAGGGTGGAATGAAGGTATACATTTTAACTGGCTACTCATTTGGATGATGATATTGTTTTGTTATGTGGATCTTATGTACGAGCTGACATGGAGGAAAGACCGGTGGTCTGGTCTTAGCGTATGTTTGTTGAGTTCTGATGGCCATATCATAGGAGGGGGAGTTGCAGGAGTTGGTGGACCATTGAAGGCTGCTGGACCCATGCAGGTTAACATACAGGACGAGACGAGACAAGACGTGACTGCACAACCGCTGCAACGCCAGCAATCTTCGCTTCACTGTGTTAGCTAA

Protein sequence

MAKPRPNPVLYINLCFNFRSSSLSESGFDSSEASSFFNGVLKGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGIHFNWLLIWMMILFCYVDLMYELTWRKDRWSGLSVCLLSSDGHIIGGGVAGVGGPLKAAGPMQVNIQDETRQDVTAQPLQRQQSSLHCVS
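
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under the assumption that the standard genetic code (NCBI translation table 1) applies; `CODON_TABLE` and `translate` are illustrative names, not part of any tool referenced on this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence codon by codon; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGCCAAGCCTCGCCCAAACCCCGTT"))  # MAKPRPNPV
```

The nine residues produced match the start of the listed protein sequence; passing the full CDS string to `translate` gives a way to confirm the rest.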
BLAST of CmaCh12G011450 vs. Swiss-Prot
Match: P4H3_ARATH (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 4.2e-41
Identity = 91/162 (56.17%), Positives = 112/162 (69.14%), Query Frame = 1

Query: 42  KGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHR 101
           K ++ + Q RKWST  L  + M F+L + + M +AF  FS P  +  S+ +      +  
Sbjct: 3   KLRHSRFQARKWSTLMLV-LFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRAA 62

Query: 102 AVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKI 161
              S+GLGKR DQW E +SWEPRAFVYHNFLSKEEC YLISLAKP+M KSTVVD +TGK 
Sbjct: 63  TERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKS 122

Query: 162 KDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           KDSR RTSSG FL R ++KI+  IEKRIAD+TFIP    EG+
Sbjct: 123 KDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGL 163
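
The percentages in each HSP header are the identical and positive-scoring positions divided by the alignment length. A minimal Python sketch of that arithmetic follows; `parse_hsp_stats` is a hypothetical helper, and its regular expression is an assumption about the line layout shown on this page.

```python
import re

# Accepts both "Positives" and the "Postives" spelling.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity/positives percentages from an HSP header line."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Recomputed values: count / alignment length, as a percentage.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 91/162 (56.17%), Positives = 112/162 (69.14%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

For the P4H3_ARATH hit, 91 of 162 aligned positions are identical and 112 of 162 score positively, which reproduces the reported 56.17% and 69.14%.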

BLAST of CmaCh12G011450 vs. Swiss-Prot
Match: P4H10_ARATH (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 2.1e-32
Identity = 76/146 (52.05%), Positives = 97/146 (66.44%), Query Frame = 1

Query: 58  LSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLG-KREDQWVE 117
           +S  V+  LLA GI          P+ ++ S+  + L S+  + +   G    + ++WVE
Sbjct: 27  MSTFVILILLAFGILSV-------PSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERWVE 86

Query: 118 FISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKDSRTRTSSGMFLNRE 177
            ISWEPRA VYHNFL+KEEC YLI LAKP+M+KSTVVD KTGK  DSR RTSSG FL R 
Sbjct: 87  IISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARG 146

Query: 178 QNKIVSNIEKRIADFTFIPVGWNEGI 203
           ++K +  IEKRI+DFTFIPV   EG+
Sbjct: 147 RDKTIREIEKRISDFTFIPVEHGEGL 165

BLAST of CmaCh12G011450 vs. Swiss-Prot
Match: P4H8_ARATH (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 5.2e-31
Identity = 80/165 (48.48%), Positives = 106/165 (64.24%), Query Frame = 1

Query: 42  KGKYMKSQGRK-WSTFKLSKIVMAF---LLALGISMFIAFRFFSPTESSHSNLLHRLASV 101
           K K ++++ RK +ST   + +V+     L+ +G+ +F +    + T S   +L   + ++
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVILILVGLGIF-SLPSTNKTSSMPMDLTTIVQTI 63

Query: 102 QHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKT 161
           Q R    D      D+W+E ISWEPRAFVYHNFL+ EEC +LISLAKP M KS VVDVKT
Sbjct: 64  QERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKT 123

Query: 162 GKIKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           GK  DSR RTSSG FLNR  ++IV  IE RI+DFTFIP    EG+
Sbjct: 124 GKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGL 167

BLAST of CmaCh12G011450 vs. Swiss-Prot
Match: P4H5_ARATH (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1)

HSP 1 Score: 131.3 bits (329), Expect = 1.7e-29
Identity = 79/160 (49.38%), Positives = 99/160 (61.88%), Query Frame = 1

Query: 47  KSQGRKWSTFK---LSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVH 106
           KS  R    F    L  +V+  LL LGI          P  + +S+  + L ++  ++  
Sbjct: 15  KSVSRSTQAFTVLILLLVVILILLGLGILSL-------PNANRNSSKTNDLTNIVRKSET 74

Query: 107 SDGLGKRE-DQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKD 166
           S G  +   ++WVE ISWEPRA VYHNFL+ EEC +LISLAKP M KSTVVD KTG  KD
Sbjct: 75  SSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKD 134

Query: 167 SRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           SR RTSSG FL R  +++V  IEKRI+DFTFIPV   EG+
Sbjct: 135 SRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGL 167

BLAST of CmaCh12G011450 vs. Swiss-Prot
Match: P4H11_ARATH (Probable prolyl 4-hydroxylase 11 OS=Arabidopsis thaliana GN=P4H11 PE=3 SV=1)

HSP 1 Score: 92.8 bits (229), Expect = 6.6e-18
Identity = 49/92 (53.26%), Positives = 64/92 (69.57%), Query Frame = 1

Query: 112 DQWVEFISWEPRAFVYHNFLS--------KEECLYLISLAKPYMKKSTVVDVKTGKIKDS 171
           ++W+E I+ EPRAFVYHNFL+         EEC +LISLAKP M +S V +  TG  ++S
Sbjct: 85  ERWLEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKVRNALTGLGEES 144

Query: 172 RTRTSSGMFLNREQNKIVSNIEKRIADFTFIP 196
            +RTSSG F+    +KIV  IEKRI++FTFIP
Sbjct: 145 SSRTSSGTFIRSGHDKIVKEIEKRISEFTFIP 176

BLAST of CmaCh12G011450 vs. TrEMBL
Match: A0A0A0LFF5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009620 PE=4 SV=1)

HSP 1 Score: 204.5 bits (519), Expect = 1.7e-49
Identity = 104/140 (74.29%), Positives = 115/140 (82.14%), Query Frame = 1

Query: 63  MAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEP 122
           MA +LALG  M IA RF SP E+SH    HR +SV+H A  SDGLGKR DQWVEFISWEP
Sbjct: 1   MALVLALGFFMLIALRFLSPPETSH----HRFSSVRHTAFLSDGLGKRGDQWVEFISWEP 60

Query: 123 RAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKDSRTRTSSGMFLNREQNKIVS 182
           RAFVYHNFLSKEECLYLISLAKP+M+KSTVVD KTG+  DSR RTSSGMFLNR Q+KI+ 
Sbjct: 61  RAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIR 120

Query: 183 NIEKRIADFTFIPVGWNEGI 203
           NIEKRIADFTFIP+   EG+
Sbjct: 121 NIEKRIADFTFIPIEHGEGL 136

BLAST of CmaCh12G011450 vs. TrEMBL
Match: M5XZ46_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009548mg PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 5.9e-42
Identity = 97/162 (59.88%), Positives = 118/162 (72.84%), Query Frame = 1

Query: 42  KGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRA 101
           KG+Y + Q +KWSTF L  + M F+L + + M +AF   S    +  +  + L+S +   
Sbjct: 3   KGRYGRLQSKKWSTFTLV-LSMLFMLIVVLLMLLAFGIVSLPVITDESSPNDLSSFRRST 62

Query: 102 VH-SDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKI 161
           V  +DG G+REDQW E ISWEPRAF+YHNFLSKEEC YLI+LAKP M KSTVVD KTGK 
Sbjct: 63  VERTDGFGEREDQWTEVISWEPRAFIYHNFLSKEECDYLINLAKPDMVKSTVVDSKTGKS 122

Query: 162 KDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           KDSR RTSSGMFL R ++KIVS+IEKRIADFTFIPV   EG+
Sbjct: 123 KDSRVRTSSGMFLKRGRDKIVSDIEKRIADFTFIPVEHGEGL 163

BLAST of CmaCh12G011450 vs. TrEMBL
Match: W9S0S9_9ROSA (Prolyl 4-hydroxylase subunit alpha-2 OS=Morus notabilis GN=L484_014315 PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 6.6e-41
Identity = 93/162 (57.41%), Positives = 118/162 (72.84%), Query Frame = 1

Query: 42  KGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRA 101
           KG+Y +  GRKWSTF L   ++ F+L++ + M +A    S   SS  +  + L+S + R 
Sbjct: 3   KGRYTRLHGRKWSTFTLVFSIL-FMLSVVLLMLLALGIVSLPVSSDDSPPNDLSSFRRRI 62

Query: 102 VH-SDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKI 161
           V   D LGKRE+QW E +SWEPRAF+YHN LSKEEC YLIS+A+P+M KSTVVD KTG+ 
Sbjct: 63  VERGDELGKREEQWTEVLSWEPRAFIYHNVLSKEECEYLISIAEPHMVKSTVVDSKTGRS 122

Query: 162 KDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           KDSR RTSSGMFL R ++KI+ +IEKRIADFTFIPV   EG+
Sbjct: 123 KDSRVRTSSGMFLKRGRDKIIRDIEKRIADFTFIPVEHGEGL 163

BLAST of CmaCh12G011450 vs. TrEMBL
Match: E0CQW5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g00950 PE=4 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 3.3e-40
Identity = 92/169 (54.44%), Positives = 116/169 (68.64%), Query Frame = 1

Query: 42  KGKYMKSQGRKWSTFKLS-------KIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRL 101
           KG+Y +  G++WST  L         +V+  LLALGI          P  +  S+  + L
Sbjct: 3   KGRYSRGHGKRWSTLALVLSLLLMLTVVLLMLLALGIVSL-------PIGTVDSDAANDL 62

Query: 102 ASVQHRAVHS-DGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVV 161
           +S + +     +GLGKR +QW E +SWEPRAF+YHNFLSKEEC Y+ISLAKPYMKKSTVV
Sbjct: 63  SSFRRKTFDGGEGLGKRGEQWTEIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVV 122

Query: 162 DVKTGKIKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           D +TG+ KDSR RTSSGMFL R ++KI+ +IEKRIADFTFIPV   EG+
Sbjct: 123 DSETGRSKDSRVRTSSGMFLRRGRDKIIRDIEKRIADFTFIPVEHGEGL 164

BLAST of CmaCh12G011450 vs. TrEMBL
Match: A0A067KYE3_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08165 PE=4 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 3.6e-39
Identity = 91/164 (55.49%), Positives = 114/164 (69.51%), Query Frame = 1

Query: 42  KGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHR 101
           K ++ + Q RKWST  L  + M F+L + + M +A   FS P  +  S  +    S +  
Sbjct: 3   KVRHSRFQARKWSTMTLI-LTMLFMLTVVLLMLLALGIFSLPISNEDSTPIDLTTSYRRM 62

Query: 102 AVHSDGLG--KREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTG 161
            V  DG G  KRE+QW E +SWEPRAF+YHNFLSKEEC YLI+LA+P+M KSTVVD KTG
Sbjct: 63  TVERDGDGQEKREEQWTEIVSWEPRAFLYHNFLSKEECEYLIALARPHMVKSTVVDSKTG 122

Query: 162 KIKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           + KDSR RTSSGMFL R ++KI+ NIEKRIADF+FIPV   EG+
Sbjct: 123 RSKDSRVRTSSGMFLRRGRDKIIRNIEKRIADFSFIPVEHGEGL 165

BLAST of CmaCh12G011450 vs. TAIR10
Match: AT1G20270.1 (AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 169.9 bits (429), Expect = 2.4e-42
Identity = 91/162 (56.17%), Positives = 112/162 (69.14%), Query Frame = 1

Query: 42  KGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFS-PTESSHSNLLHRLASVQHR 101
           K ++ + Q RKWST  L  + M F+L + + M +AF  FS P  +  S+ +      +  
Sbjct: 3   KLRHSRFQARKWSTLMLV-LFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRAA 62

Query: 102 AVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKI 161
              S+GLGKR DQW E +SWEPRAFVYHNFLSKEEC YLISLAKP+M KSTVVD +TGK 
Sbjct: 63  TERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETGKS 122

Query: 162 KDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           KDSR RTSSG FL R ++KI+  IEKRIAD+TFIP    EG+
Sbjct: 123 KDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGL 163

BLAST of CmaCh12G011450 vs. TAIR10
Match: AT5G66060.1 (AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 141.0 bits (354), Expect = 1.2e-33
Identity = 76/146 (52.05%), Positives = 97/146 (66.44%), Query Frame = 1

Query: 58  LSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLG-KREDQWVE 117
           +S  V+  LLA GI          P+ ++ S+  + L S+  + +   G    + ++WVE
Sbjct: 27  MSTFVILILLAFGILSV-------PSNNAGSSKANDLTSIVRKTLQRSGEDDSKNERWVE 86

Query: 118 FISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKDSRTRTSSGMFLNRE 177
            ISWEPRA VYHNFL+KEEC YLI LAKP+M+KSTVVD KTGK  DSR RTSSG FL R 
Sbjct: 87  IISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFLARG 146

Query: 178 QNKIVSNIEKRIADFTFIPVGWNEGI 203
           ++K +  IEKRI+DFTFIPV   EG+
Sbjct: 147 RDKTIREIEKRISDFTFIPVEHGEGL 165

BLAST of CmaCh12G011450 vs. TAIR10
Match: AT4G35810.1 (AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 136.3 bits (342), Expect = 2.9e-32
Identity = 80/165 (48.48%), Positives = 106/165 (64.24%), Query Frame = 1

Query: 42  KGKYMKSQGRK-WSTFKLSKIVMAF---LLALGISMFIAFRFFSPTESSHSNLLHRLASV 101
           K K ++++ RK +ST   + +V+     L+ +G+ +F +    + T S   +L   + ++
Sbjct: 4   KPKQLRNKPRKSFSTQTFTVVVLVLFVILILVGLGIF-SLPSTNKTSSMPMDLTTIVQTI 63

Query: 102 QHRAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKT 161
           Q R    D      D+W+E ISWEPRAFVYHNFL+ EEC +LISLAKP M KS VVDVKT
Sbjct: 64  QERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKT 123

Query: 162 GKIKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           GK  DSR RTSSG FLNR  ++IV  IE RI+DFTFIP    EG+
Sbjct: 124 GKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGL 167

BLAST of CmaCh12G011450 vs. TAIR10
Match: AT2G17720.1 (AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 131.3 bits (329), Expect = 9.4e-31
Identity = 79/160 (49.38%), Positives = 99/160 (61.88%), Query Frame = 1

Query: 47  KSQGRKWSTFK---LSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVH 106
           KS  R    F    L  +V+  LL LGI          P  + +S+  + L ++  ++  
Sbjct: 15  KSVSRSTQAFTVLILLLVVILILLGLGILSL-------PNANRNSSKTNDLTNIVRKSET 74

Query: 107 SDGLGKRE-DQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKD 166
           S G  +   ++WVE ISWEPRA VYHNFL+ EEC +LISLAKP M KSTVVD KTG  KD
Sbjct: 75  SSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKD 134

Query: 167 SRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           SR RTSSG FL R  +++V  IEKRI+DFTFIPV   EG+
Sbjct: 135 SRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGL 167

BLAST of CmaCh12G011450 vs. TAIR10
Match: AT4G35820.1 (AT4G35820.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 92.8 bits (229), Expect = 3.7e-19
Identity = 49/92 (53.26%), Positives = 64/92 (69.57%), Query Frame = 1

Query: 112 DQWVEFISWEPRAFVYHNFLS--------KEECLYLISLAKPYMKKSTVVDVKTGKIKDS 171
           ++W+E I+ EPRAFVYHNFL+         EEC +LISLAKP M +S V +  TG  ++S
Sbjct: 85  ERWLEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKVRNALTGLGEES 144

Query: 172 RTRTSSGMFLNREQNKIVSNIEKRIADFTFIP 196
            +RTSSG F+    +KIV  IEKRI++FTFIP
Sbjct: 145 SSRTSSGTFIRSGHDKIVKEIEKRISEFTFIP 176

BLAST of CmaCh12G011450 vs. NCBI nr
Match: gi|659070729|ref|XP_008456383.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X1 [Cucumis melo])

HSP 1 Score: 243.8 bits (621), Expect = 3.7e-61
Identity = 123/163 (75.46%), Positives = 137/163 (84.05%), Query Frame = 1

Query: 40  VLKGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQH 99
           V KGKY+K QG+KWSTF+LSK++MA +LALG  M IA RFFSP E+SH    HRL SV+ 
Sbjct: 3   VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRLPSVRR 62

Query: 100 RAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGK 159
            A  SDGLGKR DQWVEFISWEPRAFVYHNFLSKEECLYLISLAKP+M+KSTVVD KTGK
Sbjct: 63  TAFQSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGK 122

Query: 160 IKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
             DSR RTSSGMFLNR Q+KI+SNIEKRIADFTFIP+   EG+
Sbjct: 123 SVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGL 161

BLAST of CmaCh12G011450 vs. NCBI nr
Match: gi|659070731|ref|XP_008456388.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X2 [Cucumis melo])

HSP 1 Score: 243.8 bits (621), Expect = 3.7e-61
Identity = 123/163 (75.46%), Positives = 137/163 (84.05%), Query Frame = 1

Query: 40  VLKGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQH 99
           V KGKY+K QG+KWSTF+LSK++MA +LALG  M IA RFFSP E+SH    HRL SV+ 
Sbjct: 3   VSKGKYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRFFSPPETSH----HRLPSVRR 62

Query: 100 RAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGK 159
            A  SDGLGKR DQWVEFISWEPRAFVYHNFLSKEECLYLISLAKP+M+KSTVVD KTGK
Sbjct: 63  TAFQSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGK 122

Query: 160 IKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
             DSR RTSSGMFLNR Q+KI+SNIEKRIADFTFIP+   EG+
Sbjct: 123 SVDSRVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEGL 161

BLAST of CmaCh12G011450 vs. NCBI nr
Match: gi|778666404|ref|XP_011648735.1| (PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus])

HSP 1 Score: 240.0 bits (611), Expect = 5.3e-60
Identity = 120/163 (73.62%), Positives = 136/163 (83.44%), Query Frame = 1

Query: 40  VLKGKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQH 99
           + KGKY+K QGRKWSTF+LSK++MA +LALG  M IA RF SP E+SH    HR +SV+H
Sbjct: 3   ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFLSPPETSH----HRFSSVRH 62

Query: 100 RAVHSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGK 159
            A  SDGLGKR DQWVEFISWEPRAFVYHNFLSKEECLYLISLAKP+M+KSTVVD KTG+
Sbjct: 63  TAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGE 122

Query: 160 IKDSRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
             DSR RTSSGMFLNR Q+KI+ NIEKRIADFTFIP+   EG+
Sbjct: 123 SVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGL 161

BLAST of CmaCh12G011450 vs. NCBI nr
Match: gi|659070723|ref|XP_008456352.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo])

HSP 1 Score: 232.3 bits (591), Expect = 1.1e-57
Identity = 118/160 (73.75%), Positives = 133/160 (83.12%), Query Frame = 1

Query: 43  GKYMKSQGRKWSTFKLSKIVMAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAV 102
           GKY+K QG+KWSTF+LSK++MA +LALG  M  A  FFSP E+SH    HRL+SV+H A 
Sbjct: 6   GKYIKLQGKKWSTFQLSKMIMALVLALGFFMLSALWFFSPPETSH----HRLSSVRHTAF 65

Query: 103 HSDGLGKREDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKD 162
            SDGLGKR DQWVEFISWEPRAFVYHNFLSKEECLYLISLAKP+M+KSTVVD +TGK  D
Sbjct: 66  LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSETGKSVD 125

Query: 163 SRTRTSSGMFLNREQNKIVSNIEKRIADFTFIPVGWNEGI 203
           S  RTSSGMFLNR Q+KI+SNIEKRIADFTFIP+   E I
Sbjct: 126 SSVRTSSGMFLNRGQDKIISNIEKRIADFTFIPIEHGEDI 161

BLAST of CmaCh12G011450 vs. NCBI nr
Match: gi|700205656|gb|KGN60775.1| (hypothetical protein Csa_2G009620 [Cucumis sativus])

HSP 1 Score: 204.5 bits (519), Expect = 2.5e-49
Identity = 104/140 (74.29%), Positives = 115/140 (82.14%), Query Frame = 1

Query: 63  MAFLLALGISMFIAFRFFSPTESSHSNLLHRLASVQHRAVHSDGLGKREDQWVEFISWEP 122
           MA +LALG  M IA RF SP E+SH    HR +SV+H A  SDGLGKR DQWVEFISWEP
Sbjct: 1   MALVLALGFFMLIALRFLSPPETSH----HRFSSVRHTAFLSDGLGKRGDQWVEFISWEP 60

Query: 123 RAFVYHNFLSKEECLYLISLAKPYMKKSTVVDVKTGKIKDSRTRTSSGMFLNREQNKIVS 182
           RAFVYHNFLSKEECLYLISLAKP+M+KSTVVD KTG+  DSR RTSSGMFLNR Q+KI+ 
Sbjct: 61  RAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTSSGMFLNRGQDKIIR 120

Query: 183 NIEKRIADFTFIPVGWNEGI 203
           NIEKRIADFTFIP+   EG+
Sbjct: 121 NIEKRIADFTFIPIEHGEGL 136

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
P4H3_ARATH    4.2e-41   56.17     Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1
P4H10_ARATH   2.1e-32   52.05     Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1
P4H8_ARATH    5.2e-31   48.48     Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1
P4H5_ARATH    1.7e-29   49.38     Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1
P4H11_ARATH   6.6e-18   53.26     Probable prolyl 4-hydroxylase 11 OS=Arabidopsis thaliana GN=P4H11 PE=3 SV=1

Match Name          E-value   Identity  Description
A0A0A0LFF5_CUCSA    1.7e-49   74.29     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009620 PE=4 SV=1
M5XZ46_PRUPE        5.9e-42   59.88     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa009548mg PE=4 SV=1
W9S0S9_9ROSA        6.6e-41   57.41     Prolyl 4-hydroxylase subunit alpha-2 OS=Morus notabilis GN=L484_014315 PE=4 SV=1
E0CQW5_VITVI        3.3e-40   54.44     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g00950 PE=4 SV=...
A0A067KYE3_JATCU    3.6e-39   55.49     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_08165 PE=4 SV=1

Match Name    E-value   Identity  Description
AT1G20270.1   2.4e-42   56.17     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT5G66060.1   1.2e-33   52.05     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT4G35810.1   2.9e-32   48.48     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT2G17720.1   9.4e-31   49.38     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT4G35820.1   3.7e-19   53.26     2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...

Match Name                          E-value   Identity  Description
gi|659070729|ref|XP_008456383.1|    3.7e-61   75.46     PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X1 [Cucumis melo]
gi|659070731|ref|XP_008456388.1|    3.7e-61   75.46     PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform X2 [Cucumis melo]
gi|778666404|ref|XP_011648735.1|    5.3e-60   73.62     PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus]
gi|659070723|ref|XP_008456352.1|    1.1e-57   73.75     PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo]
gi|700205656|gb|KGN60775.1|         2.5e-49   74.29     hypothetical protein Csa_2G009620 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0055114      oxidation-reduction process
cellular_component  GO:0005794      Golgi apparatus
molecular_function  GO:0005506      iron ion binding
molecular_function  GO:0031418      L-ascorbic acid binding
molecular_function  GO:0016706      oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh12G011450.1  CmaCh12G011450.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term     Source Description                                                       Alignment
None      No IPR available  PANTHER  PTHR10869       PROLYL 4-HYDROXYLASE ALPHA SUBUNIT                                       coord: 46..203, score: 7.4
None      No IPR available  PANTHER  PTHR10869:SF78  2-OXOGLUTARATE (2OG) AND FE(II)-DEPENDENT OXYGENASE SUPERFAMILY PROTEIN  coord: 46..203, score: 7.4

The following gene(s) are orthologous to this gene:
Gene            Orthologue    Organism                   Block
CmaCh12G011450  CSPI02G02650  Wild cucumber (PI 183967)  cmacpiB174
The following gene(s) are paralogous to this gene:

None