Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nonlinear subexpressions #3738

Open
odow opened this issue Apr 25, 2024 · 10 comments
Open

Nonlinear subexpressions #3738

odow opened this issue Apr 25, 2024 · 10 comments
Labels
Category: Nonlinear Related to nonlinear programming Type: Performance

Comments

@odow
Copy link
Member

odow commented Apr 25, 2024

Background

  • JuMP has long supported nonlinear subexpressions created with @NLexpression.
  • The AD system in MOI.Nonlinear has support for representing expressions, and dealing with them efficiently
  • When we created ScalarNonlinearFunction, we did not create an analogous ScalarNonlinearExpression object.
  • In most cases, nonlinear expressions make little difference, or are a minor win.
  • An upside and a downside is that it's up to users to decide which expressions should be common. I have seen, anecdotally, that users on the forum either use no @NLexpression, or they made every possible expression a @NLexpression
  • Best performance is to judiciously use common subexpressions where it would make a difference.
  • The best real-world example we have is @mitchphillipson's MCP model: Poor performance in complementarity models #3729 (comment)

The question is how to achieve this in JuMP.

I opened an issue in MOI: jump-dev/MathOptInterface.jl#2488

Short of rewriting much of the MOI.Nonlinear module to use a single DAG of expressions (instead of the current tree), we could pass the expressions through to MOI, and then attempt to detect common subexpressions. This would rely on a heuristic of when it was beneficial to do so.

A simpler approach would be to add a "nonlinear expression" set to MOI (and JuMP), just like we've done for Parameter.

A crude API would be:

@variable(model, y in CommonSubexpression())
@constraint(model, [y; f(x)] in CommonSubexpression())

with the fallback to a bridge

@variable(model, y)
@constraint(model, y == f(x))

and maybe one for MCP:

@variable(model, y)
@constraint(model, f(x) - y  y)

We could come up with nicer syntax, for example:

@expression(model, y, f(x), NonlinearSubexpression())
@odow odow added Category: Nonlinear Related to nonlinear programming Type: Performance labels Apr 25, 2024
@odow
Copy link
Member Author

odow commented May 23, 2024

@common_expression(model, y, f(x))

@variable(model, y)
@constraint(model, y :== f(x))

@Robbybp
Copy link

Robbybp commented May 24, 2024

What is stopping @expression(model, y, f(x)) from adopting the common-subexpression behavior you're describing? Would this require breaking changes to MOI?

@odow
Copy link
Member Author

odow commented May 25, 2024

What is stopping @expression(model, y, f(x)) from adopting the common-subexpression behavior you're describing?

Yes, it's a breaking change. We discussed this on the monthly developer call. @mlubin is in favor of looking into a :subexpression node. I need to just try out some options.

@odow
Copy link
Member Author

odow commented May 25, 2024

Some related discussion from jump-dev/MathOptInterface.jl#2488:

Some possible options:

  1. Create a single unified expression graph (as opposed to the current expression tree). Change quite a lot of how ReverseAD processes functions etc.
  2. Don't do anything. Add a new decision variable with y = f(x) for expressions. This makes things very explicit.
  3. Add something new type/index to JuMP/MOI. But we're breaking the abstraction quite a bit.
  4. Add a univariate :subexpression operator to MOI. So: @expression(model, expr, subexpression(sin(x) + 1)). Solvers would see this during parsing, and could choose to ignore, or store it in a common expression.

I don't have a strong opinion yet. But our current approach requires us to smack the modeler on the hand and tell them they're holding it wrong and that they should do (2): #3729 (comment)

@odow
Copy link
Member Author

odow commented Nov 12, 2024

I started looking at this again. subexpression(expression) requires the solver to detect which subexpression it is by hashing the subexpression. And we wouldn't have an easy fallback for solvers that don't support it.

I do wonder if the JuMP-level

@expression(model, y[a in 1:3], sin(x[a]), Subexpression())
# lowers to
@variable(model, y[a in 1:3])
@constraint(model, [a in 1:3], y[a] == sin(x[a]))

isn't the right way to go about this.

We could add an additional bridge from EqualTo to Complements for PATH.

This is super explicit. And it works for linear/conic/nonlinear.

@Robbybp
Copy link

Robbybp commented Nov 12, 2024

Now you have to teach users about the Subexpression() option. This seems like about the same amount of work as just using the variable and constraint. Personally, I would prefer to have a unified expression DAG.

@mitchphillipson
Copy link

mitchphillipson commented Nov 12, 2024 via email

@odow
Copy link
Member Author

odow commented Nov 12, 2024

I'd be willing to test if there is a performance increase.

My current approach won't be a performance increase. It's just a syntactic sugar to make the fake variable and == constraint easier to write.

Perhaps we just need documentation for this.

@odow
Copy link
Member Author

odow commented Nov 13, 2024

I added some docs here: #3879

@odow
Copy link
Member Author

odow commented Nov 23, 2024

The vibe for this seems to be "no." It does add a lot of complexity for limited benefit.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Category: Nonlinear Related to nonlinear programming Type: Performance
Development

No branches or pull requests

3 participants