How to specify constraints#

Constraints vs bounds#

Estimagic distinguishes between bounds and constraints. Bounds are lower and upper bounds for parameters; in the literature, they are sometimes called box constraints. Bounds are specified via the lower_bounds and upper_bounds arguments of maximize and minimize.

Examples with bounds can be found in this tutorial.

To specify more general constraints on your parameters, you can use the argument constraints. The variety of constraints you can impose ranges from rather simple ones (e.g. parameters are fixed to a value, a group of parameters is required to be equal) to more complex ones (like general linear constraints, or even nonlinear constraints).

Can you use constraints with all optimizers?#

With the exception of general nonlinear constraints, we implement constraints via reparametrizations. Details are explained here. This means that you can use all of the constraints with any optimizer that supports bounds. Some constraints (e.g. fixing parameters) can even be used with optimizers that do not support bounds.

Example criterion function#

Let’s look at a variation of the sphere function to illustrate what kinds of constraints you can impose and how you specify them in estimagic:

>>> import numpy as np
>>> import estimagic as em
>>> def criterion(params):
...     # squared distance of params to a linearly decreasing offset vector
...     offset = np.linspace(1, 0, len(params))
...     x = params - offset
...     return x @ x

The unconstrained optimum of a six-dimensional version of this problem is:

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.array([2.5, 1, 1, 1, 1, -2.5]),
...    algorithm="scipy_lbfgsb",
...    )
>>> res.params.round(3)
array([1. , 0.8, 0.6, 0.4, 0.2, 0. ])

The unconstrained optimum is usually easy to see because all parameters enter the criterion function in an additively separable way.
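To make this concrete: the offset vector itself is the minimizer and yields a criterion value of zero, which we can check directly:

>>> np.linspace(1, 0, 6)
array([1. , 0.8, 0.6, 0.4, 0.2, 0. ])
>>> float(criterion(np.linspace(1, 0, 6)))
0.0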

Types of constraints#

Below, we show a very simple example of each type of constraint implemented in estimagic. For each constraint, we will select a subset of the parameters on which the constraint is imposed via the “loc” key. Generalizations for selecting subsets of params that are not a flat numpy array are explained in the next section.

The simplest (but very useful) constraint fixes parameters at their start values.

Let’s take the above example and fix the first and last parameter to 2.5 and -2.5, respectively.

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.array([2.5, 1, 1, 1, 1, -2.5]),
...    algorithm="scipy_lbfgsb",
...    constraints={"loc": [0, 5], "type": "fixed"},
...    )

Looking at the optimization result, we get:

>>> res.params.round(3)
array([ 2.5,  0.8,  0.6,  0.4,  0.2, -2.5])

This is indeed the correct constrained optimum. Fixes are compatible with all optimizers.

In our unconstrained example, the optimal parameters are decreasing from left to right. Let’s impose the constraint that the second, third and fourth parameter increase (weakly):

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.array([1, 1, 1, 1, 1, 1]),
...    algorithm="scipy_lbfgsb",
...    constraints={"loc": [1, 2, 3], "type": "increasing"},
...    )

Imposing the constraint on positions "loc": [1, 2, 3] means that the parameter value at index position 2 has to be (weakly) greater than the value at position 1. Likewise, the parameter value at index position 3 has to be (weakly) greater than the value at position 2. Hence, imposing an increasing constraint with only one entry in "loc" has no effect; we need to select at least two parameters to make a meaningful relative comparison. Note that the increasing constraint affects all three parameters, i.e. params[1], params[2], and params[3], because the optimal parameters in the unconstrained case are decreasing from left to right.

Looking at the optimization result, we get:

>>> res.params.round(3)
array([1. , 0.6, 0.6, 0.6, 0.2, 0. ])

This is indeed the correct constrained optimum. Increasing constraints are only compatible with optimizers that support bounds.

In our unconstrained example, the optimal parameters are already decreasing from left to right, without imposing any constraints. If we imposed a decreasing constraint without changing the order, it would simply have no effect.

So let’s impose one in a different order:

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.array([1, 1, 1, 1, 1, 1]),
...    algorithm="scipy_lbfgsb",
...    constraints={"loc": [3, 0, 4], "type": "decreasing"},
...    )

Imposing the constraint on positions "loc": [3, 0, 4] means that the parameter value at index position 0 has to be (weakly) smaller than the value at position 3. Likewise, the parameter value at index position 4 has to be (weakly) smaller than the value at position 0. As before, a decreasing constraint with only one entry in "loc" has no effect; we need to select at least two parameters. Note that the decreasing constraint should have no effect on params[4], because it is smaller than the other two in the unconstrained optimum anyway, but it will change the optimal values of params[3] and params[0]. Indeed we get:

>>> res.params.round(3)
array([ 0.7,  0.8,  0.6,  0.7,  0.2, -0. ])

This is the correct optimum. Decreasing constraints are only compatible with optimizers that support bounds.

In our example, all optimal parameters are different. Let’s constrain the first and last to be equal to each other:

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.array([1, 1, 1, 1, 1, 1]),
...    algorithm="scipy_lbfgsb",
...    constraints={"loc": [0, 5], "type": "equality"},
...    )

This yields:

>>> res.params.round(3)
array([0.5, 0.8, 0.6, 0.4, 0.2, 0.5])

This is the correct solution: with an additively separable criterion, the two equalized parameters settle at the average of their unconstrained optima, (1 + 0) / 2 = 0.5. Equality constraints are compatible with all optimizers.

Pairwise equality constraints are similar to equality constraints but impose that two or more groups of parameters are pairwise equal. Let’s look at an example:

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.array([1, 1, 1, 1, 1, 1]),
...    algorithm="scipy_lbfgsb",
...    constraints={"locs": [[0, 1], [2, 3]], "type": "pairwise_equality"},
...    )

This constraint imposes that params[0] == params[2] and params[1] == params[3]. The optimal parameters with this constraint are:

>>> res.params.round(3)
array([ 0.8,  0.6,  0.8,  0.6,  0.2, -0. ])
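Because the constraint is implemented via reparametrization, the paired parameters are equal at the solution, which we can confirm directly:

>>> bool(np.isclose(res.params[0], res.params[2]))
True
>>> bool(np.isclose(res.params[1], res.params[3]))
True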

Let’s impose the constraint that the first four parameters form valid probabilities, i.e. they should add up to one and be between zero and one.

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.array([0.3, 0.2, 0.25, 0.25, 1, 1]),
...    algorithm="scipy_lbfgsb",
...    constraints={"loc": [0, 1, 2, 3], "type": "probability"},
...    )

This again yields the correct result:

>>> res.params.round(2) 
array([0.53, 0.33, 0.13, 0.  , 0.2 , 0.  ])
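As a quick sanity check, the first four parameters indeed sum to one:

>>> bool(np.isclose(res.params[:4].sum(), 1))
True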

In many estimation problems, particularly in maximum likelihood estimation, one has to estimate the covariance matrix of a random variable. The covariance constraint ensures that such a covariance matrix is always valid, i.e. positive semi-definite and symmetric. Due to its symmetry, only the lower triangle of a covariance matrix actually has to be estimated.

Let’s look at an example. We want to impose that the first three elements form the lower triangle of a valid covariance matrix.

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_lbfgsb",
...    constraints={"loc": [0, 1, 2], "type": "covariance"},
...    )

Note that the constraint is actually binding here: the first three parameters of the unconstrained optimum would imply the matrix [[1, 0.8], [0.8, 0.6]], which has a negative determinant and is therefore not a valid covariance matrix. The constrained solution thus deviates slightly and lies on the boundary of the set of positive semi-definite matrices:

>>> res.params.round(3)
array([ 1.006,  0.784,  0.61 ,  0.4  ,  0.2  , -0.   ])

We can now use one of estimagic’s utility functions to actually build the covariance matrix out of the first three parameters:

>>> from estimagic.utilities import cov_params_to_matrix
>>> cov_params_to_matrix(res.params[:3]).round(2) 
array([[1.01, 0.78],
       [0.78, 0.61]])

sdcorr constraints are very similar to covariance constraints. The only difference is that instead of estimating a covariance matrix, we estimate standard deviations and the correlation matrix of random variables.

Let’s look at an example. We want to impose that the first three elements form valid standard deviations and a correlation matrix.

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_lbfgsb",
...    constraints={"loc": [0, 1, 2], "type": "sdcorr"},
...    )

This yields the same solution as an unconstrained estimation because the constraint is not binding:

>>> res.params.round(3)
array([ 1. ,  0.8,  0.6,  0.4,  0.2, -0. ])

We can now use one of estimagic’s utility functions to actually build the standard deviations and the correlation matrix:

>>> from estimagic.utilities import sdcorr_params_to_sds_and_corr
>>> sd, corr = sdcorr_params_to_sds_and_corr(res.params[:3])
>>> sd.round(2)
array([1. , 0.8])
>>> corr.round(2) 
array([[1. , 0.6],
       [0.6, 1. ]])

Linear constraints are the most difficult but also the most powerful constraints in your toolkit. They can be used to express constraints of the form lower_bound <= weights.dot(x) <= upper_bound or weights.dot(x) = value where x are the selected parameters.

Linear constraints have many of the other constraint types as special cases, but typically it is more convenient to use the special cases instead of expressing them as a linear constraint. Internally, it will make no difference.
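To see the connection, here is a sketch of how the equality constraint on the first and last parameter from above could be expressed as a linear constraint (weights of 1 and -1 together with a value of 0 impose params[0] - params[5] == 0):

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_lbfgsb",
...    constraints={
...    "loc": [0, 5],
...    "type": "linear",
...    "weights": [1, -1],
...    "value": 0,
...    },
...    )

This should reproduce the solution of the equality example above.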

Let’s impose the constraint that the average of the first four parameters is at least 0.95.

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_lbfgsb",
...    constraints={
...    "loc": [0, 1, 2, 3],
...    "type": "linear",
...    "lower_bound": 0.95,
...    "weights": 0.25,
...    },
...    )

This yields:

>>> res.params.round(2)
array([ 1.25,  1.05,  0.85,  0.65,  0.2 , -0.  ])

Here the first four parameters have an average of exactly 0.95, i.e. the constraint is binding.

In the above example, lower_bound and weights are scalars. They may, however, also be arrays (or even pytrees) with bounds and weights for each selected parameter.
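As an illustration of array-valued weights combined with both bounds, here is a sketch of a restriction that keeps a weighted sum of the first two parameters in an interval (the weights and bound values are purely illustrative):

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_lbfgsb",
...    constraints={
...    "loc": [0, 1],
...    "type": "linear",
...    "weights": [2, 1],
...    "lower_bound": 1,
...    "upper_bound": 3,
...    },
...    )

Here the unconstrained optimum already satisfies 1 <= 2 * 1 + 0.8 <= 3, so the constraint is not binding.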

Warning

General nonlinear constraints that are specified via a black-box constraint function can only be used if you choose an optimizer that supports them. This feature is currently supported by the following algorithms:

  • ipopt

  • nlopt: cobyla, slsqp, isres, mma

  • scipy: cobyla, slsqp, trust_constr

You can use nonlinear constraints to express restrictions of the form lower_bound <= func(x) <= upper_bound or func(x) = value where x are the selected parameters and func is the constraint function.

Let’s impose the constraint that the product of all but the last parameter is 1.

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_slsqp",
...    constraints={
...    "type": "nonlinear",
...    "selector": lambda x: x[:-1],
...    "func": lambda x: np.prod(x),
...    "value": 1.0,
...    },
...    )

This yields:

>>> res.params.round(2)
array([ 1.31,  1.16,  1.01,  0.87,  0.75, -0.  ])

Here the product of all but the last parameter is equal to 1.

If you have a function that calculates the derivative of your constraint, you can add this under the key “derivative” to the constraint dictionary. Otherwise, numerical derivatives are calculated for you if needed.
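For the product constraint from above, such an analytic derivative could look as follows. This is a minimal sketch; the elementwise gradient formula np.prod(x) / x assumes that no selected parameter is zero:

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_slsqp",
...    constraints={
...    "type": "nonlinear",
...    "selector": lambda x: x[:-1],
...    "func": lambda x: np.prod(x),
...    "derivative": lambda x: np.prod(x) / x,
...    "value": 1.0,
...    },
...    )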

Imposing multiple constraints at once#

The above examples all impose just one constraint at a time. To impose multiple constraints simultaneously, simply pass in a list of constraints. For example:

>>> res = em.minimize(
...    criterion=criterion,
...    params=np.ones(6),
...    algorithm="scipy_lbfgsb",
...    constraints=[
...    {"loc": [0, 1], "type": "equality"},
...    {"loc": [2, 3, 4], "type": "linear", "weights": 1, "value": 3},
...    ],
...    )

This yields:

>>> res.params.round(2)
array([0.9, 0.9, 1.2, 1. , 0.8, 0. ])
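Both restrictions hold at the solution, as we can verify:

>>> bool(np.isclose(res.params[0], res.params[1]))
True
>>> bool(np.isclose(res.params[2:5].sum(), 3))
True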

There are limits regarding the compatibility of overlapping constraints. You will get a descriptive error message if your constraints are not compatible.

How to select the parameters?#

All the above examples use a "loc" entry in the constraint dictionary to select the subset of params on which the constraint is imposed. This is just one out of several ways to do it. Which methods are available also depends on whether your parameters are a numpy array, DataFrame, or general pytree.

|           | loc            | query | selector |
|-----------|----------------|-------|----------|
| 1d-array  | ✅ (positions) | ❌    | ✅       |
| DataFrame | ✅ (labels)    | ✅    | ✅       |
| Pytree    | ❌             | ❌    | ✅       |

Below we show how to use each of these selection methods in simple examples.

In all the examples above, our params were a numpy array and the loc method was used to select the constrained parameters. Now we turn to DataFrame params.

Let’s assume our params are a DataFrame with a two level index. The names of the index levels are category and name. Something like this could, for example, be the params of an Ordered Logit model.

| category | name | value |
|----------|------|-------|
| betas    | a    | 0.95  |
| betas    | b    | 0.9   |
| cutoffs  | a    | 0     |
| cutoffs  | b    | 0.4   |
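For concreteness, here is a sketch of how such a params DataFrame could be constructed (any construction that yields this index and column works just as well):

import pandas as pd

params = pd.DataFrame(
    {"value": [0.95, 0.9, 0, 0.4]},
    index=pd.MultiIndex.from_product(
        [["betas", "cutoffs"], ["a", "b"]], names=["category", "name"]
    ),
)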

Now, let’s impose the constraint that the cutoffs (i.e. the last two parameters) are increasing.

res = em.minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"loc": "cutoffs", "type": "increasing"},
)

The value corresponding to "loc" can be anything you would pass to pandas’ DataFrame.loc method. So if you know pandas, imposing constraints in estimagic via "loc" should already feel familiar. This way of imposing constraints can be extremely powerful if you have a well designed MultiIndex, as you can easily select groups of parameters or single parameters.
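For instance, a single parameter could be selected with an index tuple; this is an illustrative sketch, since anything that DataFrame.loc accepts works here:

res = em.minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"loc": [("betas", "a")], "type": "fixed"},
)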

We continue with the same params DataFrame with a two-level index (category and name) as in the loc example above.

This time, we want to fix all betas as well as all parameters where the second index level is equal to "a". If we wanted to do that using loc, we would have to type out three index tuples. So let’s do that with a query instead:

res = em.minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"query": "category == 'betas' | name == 'a'", "type": "fixed"},
)

The value corresponding to "query" can be anything you would pass to pandas’ DataFrame.query method. So if you know pandas, imposing constraints in estimagic via "query" should feel just the same.

Using selector to select the parameters is the most general way and works for all params. Let’s assume we have defined parameters in a nested dictionary:

params = {"a": np.ones(2), "b": {"c": 3, "d": pd.Series([4, 5])}}

It is probably not a good idea to use a nested dictionary for so few parameters, but let’s ignore that.

Now assume we want to fix the parameters in the pandas Series at their start values. We can do so as follows:

res = em.minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"selector": lambda params: params["b"]["d"], "type": "fixed"},
)

That is, the value corresponding to selector is a Python function that takes the full params and returns a subset. The selected subset does not have to be a numpy array; it can be an arbitrary pytree.
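For example, the selector may return a newly built pytree that combines several parts of params; a hypothetical sketch for the nested dictionary above:

constraints = {
    # fix both entries of the array "a" and the scalar "c" at once
    "selector": lambda params: {"a": params["a"], "c": params["b"]["c"]},
    "type": "fixed",
}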

Using lambda functions is often convenient, but we could just as well have defined the selector function using def.

def my_selector(params):
    return params["b"]["d"]


res = em.minimize(
    criterion=some_criterion,
    params=params,
    algorithm="scipy_lbfgsb",
    constraints={"selector": my_selector, "type": "fixed"},
)