7

My question is related to a previous one: Dedicated solver for convex problems

I want to minimize a convex function of the form $~f(x_i) = \left[ C + mx_i + \frac{s}{x_i+t} \right]^p $

with various parameters $~C$, $m$, $s$, $t~$ and $~p$, $~$where $~0 \le x_i \le 1~$ and $~p \ge 1$.

(By plotting the function, I have verified that $f(x_i)$ is convex. $~$The constraints are all linear.)

Edit (5:30am GMT January 25): Apologies for not bringing this up earlier; I assumed that it wouldn't make a difference. The variable is actually a vector $X = (x_1, x_2, \cdots, x_n)$. The objective function is:

Minimise $~Z(X) = \sum_{i=1}^n Z_i = \sum_{i=1}^n f(x_i)$.

If each $f(x_i)$ is convex, then their sum $Z(X)$ is also convex.

My question is, which solvers are well-suited to minimize such a convex function (subject to linear constraints)?

Of all the solvers at NEOS, only MINOS seems to come close, and even it is not perfect (it sometimes returns errors on instances for which I can obtain optimal feasible solutions by hand).
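For a quick sanity check without any solver, a single term of the objective can be minimized over $[0,1]$ by ternary search, since each $f(x_i)$ is convex there. The following sketch uses the illustrative parameter values $C=m=s=1$, $t=0.1$, $p=3$ (assumptions for the example, not part of the question):

```python
def f(x, C=1.0, m=1.0, s=1.0, t=0.1, p=3.0):
    """One term of the objective: (C + m*x + s/(x + t))**p."""
    return (C + m * x + s / (x + t)) ** p

def ternary_min(g, lo=0.0, hi=1.0, tol=1e-9):
    """Minimize a convex function g on [lo, hi] by ternary search."""
    while hi - lo > tol:
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if g(m1) < g(m2):
            hi = m2
        else:
            lo = m1
    return (lo + hi) / 2

x_star = ternary_min(f)
print(round(x_star, 4), round(f(x_star), 3))  # expect x* ≈ 0.9, f(x*) ≈ 24.389
```

This only handles one variable with a box constraint; the coupled linear constraints between different $x_i$ are what make a real solver necessary.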

  • Could you please clarify: (1) x is the only variable, all the other letters are parameters; (2) Is x a single number? If so, you can forget about the exponent, let g(x) = C + mx + s/(x+t), and solve directly the equation g'(x) = 0, i.e., x = -t ± sqrt(s/m). – Stef Jan 24 '23 at 14:26
  • If x is not a real number, but a vector, then I don't know what s/(x + t) means. But if x is a real number, then I don't know what you mean when you say "the constraints are linear", as linear constraints in one dimension are nothing more than just setting an interval. – Stef Jan 24 '23 at 14:35
  • To Stef -- good point! Let me be honest! There are several $x_i$ (that is, $~x_i$, $~$1 $\le i \le n$). The objective function is actually (Minimise) $Z$ $= \sum_i Z_i$ $= \sum_i (~C + mx_i + s/(x_i + t) ~)^p $. So I have linear constraints that connect different $x$'s, such as $x_i + x_j \le K_{i,j}$, etc. If each $Z_i$ is convex, then the entire sum $Z$ is also convex. (I assume that the answers provided so far don't change with this extra info?) – Shuxue Jiaoshou Jan 24 '23 at 20:35
  • Ah, it makes a lot more sense then! I suggest editing your question to include what you just said in a comment. As for Henrik Friberg's answer below, the way I understand it, it looks like it assumed that the objective function was f(x) and x was a real number and one of the variables, but there might be other variables involved in the constraints without being involved in the objective function. Note that the answer includes a caveat: "Hence, if f(x) really is all there is to your objective function". – Stef Jan 24 '23 at 23:05
  • The answers of Oscar Dowson and Sutanu now don't make much sense in the new light that x is a vector and the objective function is a sum of functions of the x_i. – Stef Jan 24 '23 at 23:05
  • To Stef -- Done. Edited the question. – Shuxue Jiaoshou Jan 25 '23 at 05:32
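The closed form suggested in Stef's first comment is easy to verify: setting $g'(x) = m - \frac{s}{(x+t)^2} = 0$ gives $x = -t + \sqrt{s/m}$ (taking the root with $x + t > 0$). A small check with assumed parameter values:

```python
import math

# Assumed illustrative parameters (m, s > 0 so the stationary point exists)
m, s, t = 1.0, 1.0, 0.1

# Stationary point of g(x) = C + m*x + s/(x + t): g'(x) = m - s/(x + t)**2 = 0
x_star = -t + math.sqrt(s / m)

# The derivative should vanish at x_star (up to floating point)
deriv = m - s / (x_star + t) ** 2
print(x_star, deriv)  # ≈ 0.9, ≈ 0.0
```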

4 Answers

12

You are missing some details for the claimed convexity of $f(x)$. I will assume that you meant to say convex on the domain $C+mx+\frac{s}{x+t} \geq 0$ and $x+t \geq 0$. In this case the epigraph of $f(x)$ has conic normal form: $$ \begin{array}{ll} &f \geq \left(C+mx+\frac{s}{x+t}\right)^p\\\Leftrightarrow &f \geq \left(C+mx+y\right)^p,\quad y \geq \frac{s}{x+t},\\\Leftrightarrow &f \geq z^p,\quad z=C+mx+y,\quad y \geq \frac{s}{x+t},\\\Leftrightarrow &f \geq |z|^p,\quad z=C+mx+y \geq 0,\quad y (x+t) \geq s. \end{array} $$

where the last deduction adds in the missing convexity details. You can now input the representation in any Disciplined Convex Programming language using one power cone for $f \geq |z|^p$ (see, e.g., https://docs.mosek.com/modeling-cookbook/powo.html#powers), and one rotated quadratic cone for $y (x+t) \geq s$ (see, e.g., https://docs.mosek.com/modeling-cookbook/cqo.html#rotated-quadratic-cones). These two cones are so simple that their presence will hardly affect the interior-point algorithm from a complexity point of view.

As a last remark, note that minimizing $\left(C+mx+\frac{s}{x+t}\right)^p$ on $p\geq 1$ is equivalent to just minimizing $C+mx+\frac{s}{x+t}$. Hence, if $f(x)$ really is all there is to your objective function, you will get the same optimal solution by just minimizing $z$. This will allow you to get rid of the epigraph variable $f$ and the power cone that constrain it.
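This last equivalence (raising a nonnegative convex expression to a power $p \ge 1$ does not change the minimizer, because $z \mapsto z^p$ is nondecreasing on $z \ge 0$) can be confirmed numerically; the parameter values below are illustrative assumptions:

```python
def g(x, C=1.0, m=1.0, s=1.0, t=0.1):
    """The inner expression z = C + m*x + s/(x + t)."""
    return C + m * x + s / (x + t)

# Grid-search both g and g**p over [0, 1]; the argmins should coincide
xs = [i / 10000 for i in range(10001)]
argmin_g = min(xs, key=g)
argmin_f = min(xs, key=lambda x: g(x) ** 3)  # p = 3
print(argmin_g, argmin_f)  # both ≈ 0.9
```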

As to the question of which solver to use with Disciplined Convex Programming, I refer to this answer to your previous question: https://or.stackexchange.com/a/2727.

  • Thanks. Yes, you're correct -- certainly we need $~\left(C + mx + \frac{s}{x+t} \right) \ge 0~$. Just to make sure -- your $t$ (on the left side), is it the same as my $t$ (on the right side)? $~$Also, any reason why $~y \ge \frac{s}{x+t}$, and not $~y = \frac{s}{x+t}~$? – Shuxue Jiaoshou Jan 24 '23 at 04:08
  • Pleased to report that after cleaning my model with some DCP rules, the number of iterations in MINOS to reach an optimal solution (for a particular instance) dropped from 600 to 118 -- Great! $~$Now I should try MOSEK or other solvers. – Shuxue Jiaoshou Jan 24 '23 at 06:29
  • @ShuxueJiaoshou (1) Yes that variable should not have been called $t$. I changed it so the function $f(x)$ is now represented by the epigraph variable $f$. (2) You want $y=\frac{s}{x+t}$ but that is nonconvex. Fortunately, the convex relaxation $y \geq \frac{s}{x+t}$ is exact because $(C+mx+y)^p$ is nondecreasing in $y$ (see https://docs.mosek.com/modeling-cookbook/practical.html#composite-functions). (3) Great achievement, the DCP rules are indeed powerful. I can highly recommend MOSEK. – Henrik Alsing Friberg Jan 24 '23 at 08:25
  • @HenrikAlsingFriberg, could you please explain what happens to the parameter $t$ if we replace it with a new variable $t'$? Do we simply change the parameter into a variable? – A.Omidi Jan 25 '23 at 10:20
  • @A.Omidi: In my reformulation it doesn't matter whether $t$ is a parameter or a variable, as long as the convexity requirement $x+t \geq 0$ is valid for the problem you are trying to solve. – Henrik Alsing Friberg Jan 25 '23 at 13:09
  • @HenrikAlsingFriberg, thanks for your explanation. – A.Omidi Jan 26 '23 at 09:43
6

Use Ipopt and a modeling language. For example, JuMP: https://jump.dev/JuMP.jl/stable/tutorials/nonlinear/introduction/

```julia
julia> using JuMP, Ipopt

julia> model = Model(Ipopt.Optimizer)
A JuMP Model
Feasibility problem with:
Variables: 0
Model mode: AUTOMATIC
CachingOptimizer state: EMPTY_OPTIMIZER
Solver name: Ipopt

julia> set_silent(model)

julia> @variable(model, 0 <= x <= 1)
x

julia> C, m, s, t, p = 1, 1, 1, 0.1, 3
(1, 1, 1, 0.1, 3)

julia> @NLobjective(model, Min, (C + m * x + s / (x + t))^p)

julia> optimize!(model)

julia> value(x)
0.8999994186940709

julia> objective_value(model)
24.389000000008522

julia> (C + m * value(x) + s / (value(x) + t))^p
24.389000000008522
```

Oscar Dowson
5

The best approach is to apply a log transform, since the minimizer of the log of a (positive) function is the same as that of the function itself:
$ \log f = p[\log (C(x+t)+ mx(x+t)+s)-\log(x+t)]$
You can then recover the actual objective value by exponentiating with whatever $\log$ base was used ($e$, 2 or 10).

However, the log terms need to be modeled through auxiliary variables. You could also keep the power $p$ directly, but exponentials likewise have to be expressed through additional variables and constraints. Gurobi handles these functions via piecewise-linear approximation and provides them as model methods such as addGenConstrLog() and addGenConstrExp(); details are in Gurobi's reference guide. Solvers have no trouble with second-degree terms such as $x^2$.
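The identity above, and the claim that the log transform preserves the minimizer of a positive objective, can both be checked numerically; the parameter values here are illustrative assumptions:

```python
import math

C, m, s, t, p = 1.0, 1.0, 1.0, 0.1, 3.0

def f(x):
    """Original objective (C + m*x + s/(x + t))**p."""
    return (C + m * x + s / (x + t)) ** p

def log_f(x):
    """The transformed form p*[log(C*(x+t) + m*x*(x+t) + s) - log(x+t)]."""
    return p * (math.log(C * (x + t) + m * x * (x + t) + s) - math.log(x + t))

# The identity log(f(x)) == log_f(x) holds pointwise on (0, 1]
x = 0.5
print(abs(math.log(f(x)) - log_f(x)))  # ≈ 0

# And both forms have the same minimizer on a grid
xs = [i / 10000 for i in range(1, 10001)]
print(min(xs, key=f), min(xs, key=log_f))  # both ≈ 0.9
```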

Sutanu Majumdar
  • Doesn't it require log(f(x)) to be convex on the domain for this to work? – Henrik Alsing Friberg Jan 23 '23 at 09:53
  • @HenrikAlsingFriberg: log preserves convexity – Ben Voigt Jan 23 '23 at 16:26
  • Thanks, but one worry is that taking $\log$ weakens the degree of convexity. However, as Henrik said in his answer, the exponent $p$ may be unnecessary for the minimisation. – Shuxue Jiaoshou Jan 23 '23 at 23:00
  • I don't think log weakens convexity, otherwise maximum log-likelihood wouldn't use a log to optimize in logistic/probit regression. Also not sure if $f(x)=x$ and $f(x)=x^2$ will have the same solution for $x \in [-2,2]$ – Sutanu Majumdar Jan 23 '23 at 23:38
  • @Sutanu: You are able to use log-transformation of the likelihood objective because $\log(x)$ is strictly increasing (doesn't change the optimal solution set), and you prefer to do it because log-likelihood is concave and likelihood alone isn't (you want concave objectives when maximizing, and convex objectives when minimizing). Please post a new question on this forum if you need more details, as the comment section isn't suited for a proper explanation. – Henrik Alsing Friberg Jan 24 '23 at 08:59
1

You can use Mathematica to solve the optimization problem

$$ \min\limits_{x} f(x) = (C + mx + \frac{s}{x + t})^p \\ \text{ where } C = 1,m = 1,s = 1,t = 0.1,p = 3 \\ \\ \text{s.t. }\quad 0 \leq x \leq 1 $$

```mathematica
Clear["Global`*"];
(*Define the parameters*)
C1 = 1; m = 1; s = 1; t = 0.1; p = 3;

(*Define the objective function*)
objectiveFunction[x_] := (C1 + m*x + s/(x + t))^p;

(*Perform the optimization*)
solution = NMinimize[{objectiveFunction[x], 0 <= x <= 1}, x]

(*Extract the optimal value of x*)
optimalX = x /. Last[solution]

(*Calculate the objective function value at the optimal point*)
objectiveValue = First[solution]

(*Optionally, verify the objective function value by direct substitution*)
verificationValue = objectiveFunction[optimalX]
```

{24.389, {x -> 0.9}}
Aspen