How to specify constraints#

Constraints vs bounds#

Estimagic distinguishes between bounds and constraints. Bounds are lower and upper bounds for parameters. In the literature they are sometimes called box constraints. Bounds are specified as lower_bounds and upper_bounds argument to maximize and minimize.

Examples with bounds can be found in this tutorial.

To specify more general constraints on the parameters you use can use the argument constraints. This ranges from rather simple ones (e.g. parameters are fixed to a value, a group of parameters is required to be equal) to more complex ones (like general linear constraints, or even nonlinear constraints).

Can you use constraints with all optimizers?#

With the exception of general nonlinear constraints, we implement constraints via reparametrizations. Details are explained here. This means that you can use all of the constraints with any optimizer that supports bounds. Some constraints (e.g. fixing parameters) can even be used with optimizers that do not support bounds.

Example criterion function#

Let’s look at a variation of the sphere function to illustrate which consraints are implemented and how you specify them in estimagic:

def criterion(params):
    offset = np.linspace(1, 0, len(params))
    x = params - offset
    return x @ x

The unconstrained optimum of a six-dimensional version of this problem is:

[1.0, 0.8, 0.6, 0.4, 0.2, 0.0]

The constrained optimum is usually also easy to see because all parameters enter the criterion function in a additively separable way.

Imposing multiple constraints at once#

The above examples all just impose one constraint at a time. To impose multiple constraints simultaneously, simple pass in a list of constraints. Example:

res = minimize(
    criterion=criterion,
    params=np.ones(6),
    algorithm="scipy_lbfgsb",
    constraints=[
        {"loc": [0, 1], "type": "equality"},
        {"loc": [2, 3, 4], "type": "linear", "weights": 1, "value": 3},
    ],
)

This yields:

>>> array([0.9, 0.9, 1.2, 1. , 0.8, 0. ])

There are limits regarding the compatibility of constraints that overlap. You will get a descriptive error message if your constraints are not compatible.

How to select the parameters?#

All the above examples use a loc entry in the constraint dictionary to select the subset of params on which the constraint is imposed. This is just one out of several ways to do it. Which ways are available also depends on whether your parameters are a numpy array, DataFrame or general pytree.

	loc	query	selector
1d-array	✅ (positions)	❌	✅
DataFrame	✅ (labels)	✅	✅
Pytree	❌	❌	✅

Below we show how to use each of these selection methods in simple examples

loc

You can look at any of the above examples to see constraints where params are a numpy array and loc is used to select parameters. So now, we focus on DataFrame params.

Let’s assume our params are a DataFrame with a two level index. The names of the index levels are category and name. Something like this could for example be the params of an Ordered Logit model.

		value
category	name
betas	a	0.95
betas	b	0.9
cutoffs	a	0
cutoffs	b	0.4

Now let’s impose the constraint that the cutoffs (i.e. the last two parameters) are increasing.

res = minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"loc": "cutoffs", "type": "increasing"},
)

The value corresponding to loc can be anything that you could pass into the DataFrame.loc method. This can be extremely powerful if you have a well designed MultiIndex, as you can easily select groups of parameters or single paramaters.

query

		value
category	name
betas	a	0.95
betas	b	0.9
cutoffs	a	0
cutoffs	b	0.4

This time we want to fix all betas as well as all parameters where the second index level is equal to "a". If we wanted to do that using loc, we would have to type out three index tuples. So let’s do it with query:

res = minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"query": "category == 'betas' | name == 'a'", "type": "fixed"},
)

The value corresponding to query can be anything you could pass to the DataFrame.query method.

selector

Using selector to select the parameters is the most general way and works for all params. Let’s assume we have defined parameters in a nested dictionary:

params = {"a": np.ones(2), "b": {"c": 3, "d": pd.Series([4, 5])}}

It is probably not a good idea to use a nested dictionary for so few parameters, but let’s ignore that.

Now assume, we want to fix the parameters in the pandas Series at their start values. We can do so as follows:

res = minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"selector": lambda params: params["b"]["d"], "type": "fixed"},
)

I.e. the value corresponding to selector is a python function that takes the full params and returns a subset. The selected subset does not have to be a numpy array, it can be an arbitrary pytree.

Using lambda functions if often convenient, but we could have just as well defined the selector function using def.

def my_selector(params):
    return params["b"]["d"]


res = minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"selector": my_selector, "type": "fixed"},
)

Previous topic

Next topic

How to specify constraints#

Constraints vs bounds#

Can you use constraints with all optimizers?#

Example criterion function#

Types of constraints#

Imposing multiple constraints at once#

How to select the parameters?#

Previous topic

Next topic

Quick search

How to specify constraints#

Constraints vs bounds#

Can you use constraints with all optimizers?#

Example criterion function#

Types of constraints#

Imposing multiple constraints at once#

How to select the parameters?#