CSPI02G02630 (gene) Wild cucumber (PI 183967)

Name: CSPI02G02630
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Prolyl 4-hydroxylase alpha-like protein
Location: Chr2 : 1817334 .. 1818617 (+)
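
The length of the feature follows from the location line. A minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state); the function name `parse_location` is hypothetical:

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr2 : 1817334 .. 1818617 (+)'.

    Returns (chromosome, start, end, strand, length). The length assumes
    1-based inclusive coordinates; that is an assumption, not a fact
    stated in the record.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, strand, end - start + 1

print(parse_location("Chr2 : 1817334 .. 1818617 (+)"))
# ('Chr2', 1817334, 1818617, '+', 1284)
```

Under that assumption the gene spans 1284 bases on the forward strand of Chr2.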
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAGATCACTTGTGAGTGCCCAAAATACTAATTGGGAAGGTGTAGTGAGCAGGTCGATCTACATTTATTTCTCTCACTTCCTTATTTTATAAAAATTGTATAGTTTTCATTGATATATATTACTTATTTGTTAATATTCAATAATCTCATGTGTTCGTGATCACAGCCGTCGCACCAGTTCAGGTAGGTTTCTTGCTAAAGGGCAGAACCAACTCGTCCGTAGAATAGAGAAAAGAATAGCAGAATTTACATTCATTCCCGTAGGTAATGAAAAACAAAATAATTAAAATTCTAAACTTAATTCATAATAACGTCAACCCAAACATACATTTTAGTATTGATCATATGTTCACAAATTTCAAACAGAAAATGGAGAAGGATTGAGTATTCTACATTATGAAGTTGGGCAGAAGTTTGAACCTCATCATGATTACACTCATCCTGATTCATTCAGCTTTAAAAGTTTGGGCCAAAAAATGCCACCCTCGTCATGTATCTGTAAGATTCTACTCACCCCTAATCTGATTGAAATGTAACACAATAATCCAATAATGCATGTTGAATTGTAACACATTAAAATCGTTTCAGTTTCCTTGGTACTGATACGAATGCAGGTCGGGTGTCAAAGAAGGGGTTGCGACGGTATTCCCGGAGGCGAAAAAATGCGCCAGCTCTGCACGACGATGGTGGAAGAAACTGCCTGAATATGGTAAAGATAATGGACTCTCCGTAAAACCAAAGATGGGAGATGCTTTATTGTTTTGGAGCGTGAAACCTGATGGTACATTGGATCCTACAAGTTTGCATGGTTAGTGATCTTTACTTCACCAAAATCTTTTATATGTATAATTCCATCTGTTTTATTATTATTATTTACATATGATGAGAGTTACAAAATCAGAATACTGTCATAAGGTATGGAAGAAATTATGAATCTGAAATCCATCTTTGTATGGTTGTTTTTGGTAAAAGAACAAAAATATTGTTTTGTGGCTTATTTATTTTTGAGTGTACGTTCATCACAACTTCTGATATATATATATGTATATTGGTGGTATATAGCTTCTTCTCCAGTTGTAAAGGGAGACAAATGGGTTGGTGTAAAGCTGATGCATGTTAAAGCTAAAGATTTAACTCAAGAAGTTATGTATATATTTATAGTTCTTAATTGTATTTAGATTAAATTTTCGTTTTATTTTGTGTGATATTTGATCTGATTTTAAGAGGGATTATTTTGAAATTGCAGGGCTAATAGATGCAGAAGTACATCTAATAGATGA

mRNA sequence

ATGGAGAGATCACTTGTGAGTGCCCAAAATACTAATTGGGAAGGTGTAGTGAGCAGCCGTCGCACCAGTTCAGGTAGGTTTCTTGCTAAAGGGCAGAACCAACTCGTCCGTAGAATAGAGAAAAGAATAGCAGAATTTACATTCATTCCCGTAGAAAATGGAGAAGGATTGAGTATTCTACATTATGAAGTTGGGCAGAAGTTTGAACCTCATCATGATTACACTCATCCTGATTCATTCAGCTTTAAAAGTTTGGGCCAAAAAATGCCACCCTCGTCATGTATCTGTAAGATTCTACTCACCCCTAATCTGATTGAAATGTCGGGTGTCAAAGAAGGGGTTGCGACGGTATTCCCGGAGGCGAAAAAATGCGCCAGCTCTGCACGACGATGGTGGAAGAAACTGCCTGAATATGGTAAAGATAATGGACTCTCCGTAAAACCAAAGATGGGAGATGCTTTATTGTTTTGGAGCGTGAAACCTGATGGTACATTGGATCCTACAAGTTTGCATGCTTCTTCTCCAGTTGTAAAGGGAGACAAATGGGTTGGTGTAAAGCTGATGCATGTTAAAGCTAAAGATTTAACTCAAGAAGTTATGGCTAATAGATGCAGAAGTACATCTAATAGATGA

Coding sequence (CDS)

ATGGAGAGATCACTTGTGAGTGCCCAAAATACTAATTGGGAAGGTGTAGTGAGCAGCCGTCGCACCAGTTCAGGTAGGTTTCTTGCTAAAGGGCAGAACCAACTCGTCCGTAGAATAGAGAAAAGAATAGCAGAATTTACATTCATTCCCGTAGAAAATGGAGAAGGATTGAGTATTCTACATTATGAAGTTGGGCAGAAGTTTGAACCTCATCATGATTACACTCATCCTGATTCATTCAGCTTTAAAAGTTTGGGCCAAAAAATGCCACCCTCGTCATGTATCTGTAAGATTCTACTCACCCCTAATCTGATTGAAATGTCGGGTGTCAAAGAAGGGGTTGCGACGGTATTCCCGGAGGCGAAAAAATGCGCCAGCTCTGCACGACGATGGTGGAAGAAACTGCCTGAATATGGTAAAGATAATGGACTCTCCGTAAAACCAAAGATGGGAGATGCTTTATTGTTTTGGAGCGTGAAACCTGATGGTACATTGGATCCTACAAGTTTGCATGCTTCTTCTCCAGTTGTAAAGGGAGACAAATGGGTTGGTGTAAAGCTGATGCATGTTAAAGCTAAAGATTTAACTCAAGAAGTTATGGCTAATAGATGCAGAAGTACATCTAATAGATGA
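
The protein used as the BLAST query below can be derived from the CDS by translating it in frame 1. A minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, not part of any database tooling:

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, listed in codon order
# with each base position cycling through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons and last three codons of the CDS above.
print(translate("ATGGAGAGATCACTTGTG"))  # MERSLV
print(translate("AATAGATGA"))           # NR
```

The two fragments agree with the start (`MERSLV`) and end (`NR`) of the query sequence shown in the alignments.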
BLAST of CSPI02G02630 vs. Swiss-Prot
Match: P4H10_ARATH (Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 2.3e-44
Identity = 98/190 (51.58%), Positives = 128/190 (67.37%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           ME+S V  + T  +   S  RTSSG FLA+G+++ +R IEKRI++FTFIPVE+GEGL +L
Sbjct: 110 MEKSTVVDEKTG-KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVL 169

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYE+GQK+EPH+DY   D ++ ++ GQ+           +   L+ +S V+EG  TVFP 
Sbjct: 170 HYEIGQKYEPHYDY-FMDEYNTRNGGQR-----------IATVLMYLSDVEEGGETVFPA 229

Query: 121 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 180
           AK    SA  WW +L E GK  GLSVKPKMGDALLFWS+ PD TLDP+SLH    V+KG+
Sbjct: 230 AKG-NYSAVPWWNELSECGK-GGLSVKPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGN 284

Query: 181 KWVGVKLMHV 191
           KW   K + V
Sbjct: 290 KWSSTKWLRV 284
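
The percentages in each HSP header are the match counts divided by the alignment length, gaps included. "Positives" counts identical residues plus substitutions that score above zero in the substitution matrix. A minimal sketch of the arithmetic, using the counts from the P4H10_ARATH hit; the helper name `percent` is hypothetical:

```python
def percent(matches, aligned_length):
    """Share of aligned columns as a percentage, rounded to two decimals as in the report."""
    return round(100.0 * matches / aligned_length, 2)

# P4H10_ARATH HSP: 98 identical and 128 positive columns out of 190 aligned columns.
print(percent(98, 190))   # 51.58
print(percent(128, 190))  # 67.37
```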

BLAST of CSPI02G02630 vs. Swiss-Prot
Match: P4H3_ARATH (Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 3.9e-44
Identity = 91/170 (53.53%), Positives = 119/170 (70.00%), Query Frame = 1

Query: 21  RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSF 80
           RTSSG FL +G++++++ IEKRIA++TFIP ++GEGL +LHYE GQK+EPH+DY   D F
Sbjct: 127 RTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDY-FVDEF 186

Query: 81  SFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWKKLPEYGK 140
           + K+ GQ+M              L+ +S V+EG  TVFP A    SS   W+ +L E GK
Sbjct: 187 NTKNGGQRM-----------ATMLMYLSDVEEGGETVFPAANMNFSSV-PWYNELSECGK 246

Query: 141 DNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
             GLSVKP+MGDALLFWS++PD TLDPTSLH   PV++G+KW   K MHV
Sbjct: 247 -KGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHV 282

BLAST of CSPI02G02630 vs. Swiss-Prot
Match: P4H8_ARATH (Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 4.7e-42
Identity = 90/177 (50.85%), Positives = 115/177 (64.97%), Query Frame = 1

Query: 14  EGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHD 73
           + + S  RTSSG FL +G +++V  IE RI++FTFIP ENGEGL +LHYEVGQ++EPHHD
Sbjct: 124 KSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHD 183

Query: 74  YTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWK 133
           Y   D F+ +  GQ+           +   L+ +S V EG  TVFP AK   S    WW 
Sbjct: 184 YFF-DEFNVRKGGQR-----------IATVLMYLSDVDEGGETVFPAAKGNVSDV-PWWD 243

Query: 134 KLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
           +L + GK+ GLSV PK  DALLFWS+KPD +LDP+SLH   PV+KG+KW   K  HV
Sbjct: 244 ELSQCGKE-GLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHV 286

BLAST of CSPI02G02630 vs. Swiss-Prot
Match: P4H5_ARATH (Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1)

HSP 1 Score: 169.1 bits (427), Expect = 5.2e-41
Identity = 96/191 (50.26%), Positives = 125/191 (65.45%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSR-RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSI 60
           M +S V  + T   G   SR RTSSG FL +G +++V  IEKRI++FTFIPVENGEGL +
Sbjct: 112 MVKSTVVDEKTG--GSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQV 171

Query: 61  LHYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFP 120
           LHY+VGQK+EPH+DY   D F+ K+ GQ+           +   L+ +S V +G  TVFP
Sbjct: 172 LHYQVGQKYEPHYDY-FLDEFNTKNGGQR-----------IATVLMYLSDVDDGGETVFP 231

Query: 121 EAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKG 180
            A+    SA  WW +L + GK+ GLSV PK  DALLFW+++PD +LDP+SLH   PVVKG
Sbjct: 232 AARG-NISAVPWWNELSKCGKE-GLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKG 286

Query: 181 DKWVGVKLMHV 191
           +KW   K  HV
Sbjct: 292 NKWSSTKWFHV 286

BLAST of CSPI02G02630 vs. Swiss-Prot
Match: P4H7_ARATH (Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana GN=P4H7 PE=2 SV=1)

HSP 1 Score: 140.6 bits (353), Expect = 2.0e-32
Identity = 80/193 (41.45%), Positives = 117/193 (60.62%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           +E+S+V A N + E V S  RTSSG FL+K Q+ +V  +E ++A +TF+P ENGE + IL
Sbjct: 88  LEKSMV-ADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQIL 147

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYE GQK+EPH DY H D  + +  G +           +   L+ +S V++G  TVFP 
Sbjct: 148 HYENGQKYEPHFDYFH-DQANLELGGHR-----------IATVLMYLSNVEKGGETVFPM 207

Query: 121 AKKCASSAR-RWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKG 180
            K  A+  +   W +  +     G +VKP+ GDALLF+++ P+ T D  SLH S PVV+G
Sbjct: 208 WKGKATQLKDDSWTECAK----QGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEG 263

Query: 181 DKWVGVKLMHVKA 193
           +KW   + +HVK+
Sbjct: 268 EKWSATRWIHVKS 263

BLAST of CSPI02G02630 vs. TrEMBL
Match: A0A0A0LIA3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009610 PE=4 SV=1)

HSP 1 Score: 381.3 bits (978), Expect = 7.5e-103
Identity = 190/210 (90.48%), Positives = 192/210 (91.43%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL
Sbjct: 37  MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 96

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYEVGQKFEPHHDYTHPDSFSFKSLGQ+                + MSGVKEG ATVFPE
Sbjct: 97  HYEVGQKFEPHHDYTHPDSFSFKSLGQRNAT-------------LVMSGVKEGGATVFPE 156

Query: 121 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 180
           AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD
Sbjct: 157 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 216

Query: 181 KWVGVKLMHVKAKDLTQEVMANRCRSTSNR 211
           KWVGVKLMHVKAKDLTQEVMANRCRSTSNR
Sbjct: 217 KWVGVKLMHVKAKDLTQEVMANRCRSTSNR 233

BLAST of CSPI02G02630 vs. TrEMBL
Match: A0A151S4M6_CAJCA (Prolyl 4-hydroxylase subunit alpha-1 OS=Cajanus cajan GN=KK1_028532 PE=4 SV=1)

HSP 1 Score: 194.1 bits (492), Expect = 1.7e-46
Identity = 102/170 (60.00%), Positives = 123/170 (72.35%), Query Frame = 1

Query: 21  RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSF 80
           RTSSG FLA+G++Q+VR IEKRIA+FTFIPVENGEGL +LHYEVGQK+EPH DY   D+F
Sbjct: 133 RTSSGTFLARGRDQVVRNIEKRIADFTFIPVENGEGLQVLHYEVGQKYEPHFDY-FMDAF 192

Query: 81  SFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWKKLPEYGK 140
           + K+ GQ+           +   L+ +S V+EG  TVFP+AK   SS   WW +L E GK
Sbjct: 193 NTKNGGQR-----------IATMLMYLSDVEEGGETVFPDAKGNFSSV-PWWNELSECGK 252

Query: 141 DNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
             GLS+KPKMGDALLFWSVKPD TLDP+SLH   PV+KG+KW   K M V
Sbjct: 253 -KGLSIKPKMGDALLFWSVKPDATLDPSSLHGGCPVIKGNKWSCTKWMRV 288

BLAST of CSPI02G02630 vs. TrEMBL
Match: G7LB39_MEDTR (Prolyl 4-hydroxylase alpha-like protein OS=Medicago truncatula GN=MTR_8g074880 PE=4 SV=1)

HSP 1 Score: 188.7 bits (478), Expect = 7.1e-45
Identity = 100/190 (52.63%), Positives = 130/190 (68.42%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           M +S V  + T   GV S  RTSSG FL +G +++V+ IE+RIA+FTFIPVE+GE  ++L
Sbjct: 147 MHKSAVIDEETG-NGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVL 206

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYEVGQK+EPH+DY   D+FS    GQ+           +   L+ +S V+EG  TVFP 
Sbjct: 207 HYEVGQKYEPHYDY-FMDTFSTTYAGQR-----------IATMLMYLSDVEEGGETVFPN 266

Query: 121 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 180
           AK   SS   WW +L + GK  GLS+KPKMG+A+LFWS+KPD TLDP+SLH + PV+KGD
Sbjct: 267 AKGNFSSV-PWWNELSDCGK-GGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGD 321

Query: 181 KWVGVKLMHV 191
           KW+  K MHV
Sbjct: 327 KWLCAKWMHV 321

BLAST of CSPI02G02630 vs. TrEMBL
Match: M0YUR9_HORVD (Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 2.1e-44
Identity = 98/170 (57.65%), Positives = 122/170 (71.76%), Query Frame = 1

Query: 21  RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSF 80
           RTSSG FL +GQ+++VR IEKRI++FTFIPVENGEGL +LHYEVGQK+EPH DY H D F
Sbjct: 20  RTSSGTFLRRGQDKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFH-DDF 79

Query: 81  SFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWKKLPEYGK 140
           + K+ GQ+           +   L+ +S V+EG  TVFP A K  SS+  ++ +L E  K
Sbjct: 80  NTKNGGQR-----------IATVLMYLSDVEEGGETVFPSA-KVNSSSIPFYNELSECAK 139

Query: 141 DNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
             G+SVKPKMGDALLFWS++PDGTLDPTSLH   PV+KGDKW   K + V
Sbjct: 140 -RGISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRV 175

BLAST of CSPI02G02630 vs. TrEMBL
Match: W4ZTH7_WHEAT (Uncharacterized protein OS=Triticum aestivum PE=4 SV=1)

HSP 1 Score: 187.2 bits (474), Expect = 2.1e-44
Identity = 99/170 (58.24%), Positives = 121/170 (71.18%), Query Frame = 1

Query: 21  RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSF 80
           RTSSG FL +GQ+++VR IEKRI++FTFIPVENGEGL +LHYEVGQK+EPH DY H D F
Sbjct: 20  RTSSGTFLKRGQDKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFH-DDF 79

Query: 81  SFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWKKLPEYGK 140
           + K+ GQ+           +   L+ +S V+EG  TVFP A K  SS+  +  KL E  K
Sbjct: 80  NTKNGGQR-----------IATVLMYLSDVEEGGETVFPSA-KVNSSSIPFHNKLSECAK 139

Query: 141 DNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
             G+SVKPKMGDALLFWS++PDGTLDPTSLH   PV+KGDKW   K + V
Sbjct: 140 -RGISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRV 175

BLAST of CSPI02G02630 vs. TAIR10
Match: AT5G66060.1 (AT5G66060.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 180.3 bits (456), Expect = 1.3e-45
Identity = 98/190 (51.58%), Positives = 128/190 (67.37%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           ME+S V  + T  +   S  RTSSG FLA+G+++ +R IEKRI++FTFIPVE+GEGL +L
Sbjct: 110 MEKSTVVDEKTG-KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVL 169

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYE+GQK+EPH+DY   D ++ ++ GQ+           +   L+ +S V+EG  TVFP 
Sbjct: 170 HYEIGQKYEPHYDY-FMDEYNTRNGGQR-----------IATVLMYLSDVEEGGETVFPA 229

Query: 121 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 180
           AK    SA  WW +L E GK  GLSVKPKMGDALLFWS+ PD TLDP+SLH    V+KG+
Sbjct: 230 AKG-NYSAVPWWNELSECGK-GGLSVKPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGN 284

Query: 181 KWVGVKLMHV 191
           KW   K + V
Sbjct: 290 KWSSTKWLRV 284

BLAST of CSPI02G02630 vs. TAIR10
Match: AT1G20270.1 (AT1G20270.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 179.5 bits (454), Expect = 2.2e-45
Identity = 91/170 (53.53%), Positives = 119/170 (70.00%), Query Frame = 1

Query: 21  RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSF 80
           RTSSG FL +G++++++ IEKRIA++TFIP ++GEGL +LHYE GQK+EPH+DY   D F
Sbjct: 127 RTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYDY-FVDEF 186

Query: 81  SFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWKKLPEYGK 140
           + K+ GQ+M              L+ +S V+EG  TVFP A    SS   W+ +L E GK
Sbjct: 187 NTKNGGQRM-----------ATMLMYLSDVEEGGETVFPAANMNFSSV-PWYNELSECGK 246

Query: 141 DNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
             GLSVKP+MGDALLFWS++PD TLDPTSLH   PV++G+KW   K MHV
Sbjct: 247 -KGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHV 282

BLAST of CSPI02G02630 vs. TAIR10
Match: AT4G35810.1 (AT4G35810.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 172.6 bits (436), Expect = 2.7e-43
Identity = 90/177 (50.85%), Positives = 115/177 (64.97%), Query Frame = 1

Query: 14  EGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHD 73
           + + S  RTSSG FL +G +++V  IE RI++FTFIP ENGEGL +LHYEVGQ++EPHHD
Sbjct: 124 KSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHD 183

Query: 74  YTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWK 133
           Y   D F+ +  GQ+           +   L+ +S V EG  TVFP AK   S    WW 
Sbjct: 184 YFF-DEFNVRKGGQR-----------IATVLMYLSDVDEGGETVFPAAKGNVSDV-PWWD 243

Query: 134 KLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
           +L + GK+ GLSV PK  DALLFWS+KPD +LDP+SLH   PV+KG+KW   K  HV
Sbjct: 244 ELSQCGKE-GLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHV 286

BLAST of CSPI02G02630 vs. TAIR10
Match: AT2G17720.1 (AT2G17720.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein)

HSP 1 Score: 169.1 bits (427), Expect = 2.9e-42
Identity = 96/191 (50.26%), Positives = 125/191 (65.45%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSR-RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSI 60
           M +S V  + T   G   SR RTSSG FL +G +++V  IEKRI++FTFIPVENGEGL +
Sbjct: 112 MVKSTVVDEKTG--GSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQV 171

Query: 61  LHYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFP 120
           LHY+VGQK+EPH+DY   D F+ K+ GQ+           +   L+ +S V +G  TVFP
Sbjct: 172 LHYQVGQKYEPHYDY-FLDEFNTKNGGQR-----------IATVLMYLSDVDDGGETVFP 231

Query: 121 EAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKG 180
            A+    SA  WW +L + GK+ GLSV PK  DALLFW+++PD +LDP+SLH   PVVKG
Sbjct: 232 AARG-NISAVPWWNELSKCGKE-GLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKG 286

Query: 181 DKWVGVKLMHV 191
           +KW   K  HV
Sbjct: 292 NKWSSTKWFHV 286

BLAST of CSPI02G02630 vs. TAIR10
Match: AT3G28490.1 (AT3G28490.1 Oxoglutarate/iron-dependent oxygenase)

HSP 1 Score: 135.2 bits (339), Expect = 4.7e-32
Identity = 77/193 (39.90%), Positives = 112/193 (58.03%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           +E+S+V A   + E   S  RTSSG FL K Q+ +V  +E ++A +TF+P ENGE L IL
Sbjct: 64  LEKSMVVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQIL 123

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYE GQK++PH DY + D  + +  G +           +   L+ +S V +G  TVFP 
Sbjct: 124 HYENGQKYDPHFDYFY-DKKALELGGHR-----------IATVLMYLSNVTKGGETVFPN 183

Query: 121 AK-KCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKG 180
            K K        W K  +     G +VKP+ GDALLF+++  +GT DP SLH S PV++G
Sbjct: 184 WKGKTPQLKDDSWSKCAK----QGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEG 240

Query: 181 DKWVGVKLMHVKA 193
           +KW   + +HV++
Sbjct: 244 EKWSATRWIHVRS 240

BLAST of CSPI02G02630 vs. NCBI nr
Match: gi|700205655|gb|KGN60774.1| (hypothetical protein Csa_2G009610 [Cucumis sativus])

HSP 1 Score: 381.3 bits (978), Expect = 1.1e-102
Identity = 190/210 (90.48%), Positives = 192/210 (91.43%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL
Sbjct: 37  MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 96

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYEVGQKFEPHHDYTHPDSFSFKSLGQ+                + MSGVKEG ATVFPE
Sbjct: 97  HYEVGQKFEPHHDYTHPDSFSFKSLGQRNAT-------------LVMSGVKEGGATVFPE 156

Query: 121 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 180
           AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD
Sbjct: 157 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 216

Query: 181 KWVGVKLMHVKAKDLTQEVMANRCRSTSNR 211
           KWVGVKLMHVKAKDLTQEVMANRCRSTSNR
Sbjct: 217 KWVGVKLMHVKAKDLTQEVMANRCRSTSNR 233

BLAST of CSPI02G02630 vs. NCBI nr
Match: gi|778673969|ref|XP_011650097.1| (PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus])

HSP 1 Score: 356.3 bits (913), Expect = 3.7e-95
Identity = 177/198 (89.39%), Positives = 181/198 (91.41%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL
Sbjct: 29  MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 88

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYEVGQKFEPHHDYTHPDSFSFKSLGQ+               ++ +SGVKEG ATVFPE
Sbjct: 89  HYEVGQKFEPHHDYTHPDSFSFKSLGQRNATL-----------VMYLSGVKEGGATVFPE 148

Query: 121 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 180
           AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD
Sbjct: 149 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 208

Query: 181 KWVGVKLMHVKAKDLTQE 199
           KWVGVKLMHVKAKDLTQE
Sbjct: 209 KWVGVKLMHVKAKDLTQE 215

BLAST of CSPI02G02630 vs. NCBI nr
Match: gi|659071604|ref|XP_008460837.1| (PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo])

HSP 1 Score: 280.8 bits (717), Expect = 2.0e-72
Identity = 145/203 (71.43%), Positives = 166/203 (81.77%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           M+RSLVS Q + W+G+VSS RTS+GRFL KGQNQLVRRIEKRIAEFTFIPVENGEGLSIL
Sbjct: 46  MKRSLVSDQKS-WKGLVSSHRTSTGRFLVKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 105

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYEVGQKF+PH+DY+ P+SF+FK+LGQ+           +   ++ +S VKEG A VFP 
Sbjct: 106 HYEVGQKFDPHYDYSRPESFNFKTLGQR-----------IATLVMYLSDVKEGGAKVFPA 165

Query: 121 AKK------CASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASS 180
           AKK      CASS+RRWWKKLPEYG + GLSVKPKMGDALLFWS+KPD TLDPTSLH SS
Sbjct: 166 AKKCPVSKQCASSSRRWWKKLPEYGGE-GLSVKPKMGDALLFWSLKPDSTLDPTSLHGSS 225

Query: 181 PVVKGDKWVGVKLMHVKAKDLTQ 198
           PV++GDKWVGVKLMHV  KDLTQ
Sbjct: 226 PVIEGDKWVGVKLMHV--KDLTQ 233

BLAST of CSPI02G02630 vs. NCBI nr
Match: gi|1012338504|gb|KYP49756.1| (Prolyl 4-hydroxylase subunit alpha-1 [Cajanus cajan])

HSP 1 Score: 194.1 bits (492), Expect = 2.4e-46
Identity = 102/170 (60.00%), Positives = 123/170 (72.35%), Query Frame = 1

Query: 21  RTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSF 80
           RTSSG FLA+G++Q+VR IEKRIA+FTFIPVENGEGL +LHYEVGQK+EPH DY   D+F
Sbjct: 133 RTSSGTFLARGRDQVVRNIEKRIADFTFIPVENGEGLQVLHYEVGQKYEPHFDY-FMDAF 192

Query: 81  SFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPEAKKCASSARRWWKKLPEYGK 140
           + K+ GQ+           +   L+ +S V+EG  TVFP+AK   SS   WW +L E GK
Sbjct: 193 NTKNGGQR-----------IATMLMYLSDVEEGGETVFPDAKGNFSSV-PWWNELSECGK 252

Query: 141 DNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 191
             GLS+KPKMGDALLFWSVKPD TLDP+SLH   PV+KG+KW   K M V
Sbjct: 253 -KGLSIKPKMGDALLFWSVKPDATLDPSSLHGGCPVIKGNKWSCTKWMRV 288

BLAST of CSPI02G02630 vs. NCBI nr
Match: gi|778666404|ref|XP_011648735.1| (PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus])

HSP 1 Score: 193.0 bits (489), Expect = 5.4e-46
Identity = 106/190 (55.79%), Positives = 130/190 (68.42%), Query Frame = 1

Query: 1   MERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSIL 60
           ME+S V    T  E V S  RTSSG FL +GQ++++R IEKRIA+FTFIP+E+GEGL IL
Sbjct: 106 MEKSTVVDSKTG-ESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQIL 165

Query: 61  HYEVGQKFEPHHDYTHPDSFSFKSLGQKMPPSSCICKILLTPNLIEMSGVKEGVATVFPE 120
           HYEVGQK++ H+DY   D ++ K  GQ+M              L+ +S V+EG  TVFP 
Sbjct: 166 HYEVGQKYDAHYDY-FVDEYNIKKGGQRM-----------ATLLMYLSDVEEGGETVFPA 225

Query: 121 AKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGD 180
           AK   SS   WW +L E GK  GLSVKPKMGDALLFWS+KPD TLDPTSLH + PV++G+
Sbjct: 226 AKGNFSSV-PWWNELSECGK-GGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGN 280

Query: 181 KWVGVKLMHV 191
           KW   K MHV
Sbjct: 286 KWSCTKWMHV 280

The following BLAST results are available for this feature:
BLAST vs. Swiss-Prot
Match Name   E-value  Identity (%)  Description
P4H10_ARATH  2.3e-44  51.58         Probable prolyl 4-hydroxylase 10 OS=Arabidopsis thaliana GN=P4H10 PE=2 SV=1
P4H3_ARATH   3.9e-44  53.53         Probable prolyl 4-hydroxylase 3 OS=Arabidopsis thaliana GN=P4H3 PE=2 SV=1
P4H8_ARATH   4.7e-42  50.85         Probable prolyl 4-hydroxylase 8 OS=Arabidopsis thaliana GN=P4H8 PE=3 SV=1
P4H5_ARATH   5.2e-41  50.26         Prolyl 4-hydroxylase 5 OS=Arabidopsis thaliana GN=P4H5 PE=2 SV=1
P4H7_ARATH   2.0e-32  41.45         Probable prolyl 4-hydroxylase 7 OS=Arabidopsis thaliana GN=P4H7 PE=2 SV=1

BLAST vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LIA3_CUCSA  7.5e-103  90.48         Uncharacterized protein OS=Cucumis sativus GN=Csa_2G009610 PE=4 SV=1
A0A151S4M6_CAJCA  1.7e-46   60.00         Prolyl 4-hydroxylase subunit alpha-1 OS=Cajanus cajan GN=KK1_028532 PE=4 SV=1
G7LB39_MEDTR      7.1e-45   52.63         Prolyl 4-hydroxylase alpha-like protein OS=Medicago truncatula GN=MTR_8g074880 P...
M0YUR9_HORVD      2.1e-44   57.65         Uncharacterized protein OS=Hordeum vulgare var. distichum PE=4 SV=1
W4ZTH7_WHEAT      2.1e-44   58.24         Uncharacterized protein OS=Triticum aestivum PE=4 SV=1

BLAST vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT5G66060.1  1.3e-45  51.58         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT1G20270.1  2.2e-45  53.53         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT4G35810.1  2.7e-43  50.85         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT2G17720.1  2.9e-42  50.26         2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily prot...
AT3G28490.1  4.7e-32  39.90         Oxoglutarate/iron-dependent oxygenase

BLAST vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|700205655|gb|KGN60774.1|       1.1e-102  90.48         hypothetical protein Csa_2G009610 [Cucumis sativus]
gi|778673969|ref|XP_011650097.1|  3.7e-95   89.39         PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus]
gi|659071604|ref|XP_008460837.1|  2.0e-72   71.43         PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis melo]
gi|1012338504|gb|KYP49756.1|      2.4e-46   60.00         Prolyl 4-hydroxylase subunit alpha-1 [Cajanus cajan]
gi|778666404|ref|XP_011648735.1|  5.4e-46   55.79         PREDICTED: probable prolyl 4-hydroxylase 3 [Cucumis sativus]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR005123   Oxoglu/Fe-dep_dioxygenase
IPR006620   Pro_4_hyd_alph
Vocabulary: Molecular Function
Term        Definition
GO:0016491  oxidoreductase activity
GO:0016705  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
GO:0005506  iron ion binding
GO:0031418  L-ascorbic acid binding
Vocabulary: Biological Process
Term        Definition
GO:0055114  oxidation-reduction process
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006525 arginine metabolic process
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0018401 peptidyl-proline hydroxylation to 4-hydroxy-L-proline
biological_process GO:0006560 proline metabolic process
biological_process GO:0019511 peptidyl-proline hydroxylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0005768 endosome
cellular_component GO:0005802 trans-Golgi network
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005506 iron ion binding
molecular_function GO:0031418 L-ascorbic acid binding
molecular_function GO:0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
molecular_function GO:0016491 oxidoreductase activity
molecular_function GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen
molecular_function GO:0004656 procollagen-proline 4-dioxygenase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G02630.1  CSPI02G02630.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                          Source   Source Term     Source Description                  Alignment
IPR005123  Oxoglutarate/iron-dependent dioxygenase  PFAM     PF13640         2OG-FeII_Oxy_3                      coord: 58..186; score: 2.
IPR006620  Prolyl 4-hydroxylase, alpha subunit      SMART    SM00702         p4hc                                coord: 1..189; score: 2.6
None       No IPR available                         PANTHER  PTHR10869       PROLYL 4-HYDROXYLASE ALPHA SUBUNIT  coord: 21..190; score: 5.0
None       No IPR available                         PANTHER  PTHR10869:SF81  SUBFAMILY NOT NAMED                 coord: 21..190; score: 5.0