parallel snstop #420

awccopp · 2024-12-17T20:03:23Z

Purpose

The purpose of this PR is to parallelize the SNSTOP user callback function.
See #417 for commit history.
A --timeout option has been added to the testflo arguments to avoid tests to hang.

Expected time until merged

1 month

Type of change

Bugfix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (non-backwards-compatible fix or feature)
Code style update (formatting, renaming)
Refactoring (no functional changes, no API changes)
Documentation update
Maintenance update
Other (please describe)

Testing

Tests were added to ensure other processors are calling the snstop function.

Checklist

I have run flake8 and black to make sure the Python code adheres to PEP-8 and is consistently formatted
I have formatted the Fortran code with fprettify or C/C++ code with clang-format as applicable
I have run unit and regression tests which pass locally with my changes
I have added new tests that prove my fix is effective or that my feature works
I have added necessary documentation

* parallel snstop * formatting fixes * added comments * added test - does it make sense? * fixed MPI check * actually fixing MPI check * iSort fix * maybe this time the test will be skipped? * maybe like this? * what about this * cleanup * add timeout option to testflo * rerun tests * updated test with send/receive --------- Co-authored-by: Marco Mangano <[email protected]>

codecov · 2024-12-17T20:10:59Z

Codecov Report

❌ Patch coverage is 73.91304% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 86.41%. Comparing base (b687be3) to head (a8132d8).

Files with missing lines	Patch %	Lines
pyoptsparse/pyOpt_optimizer.py	0.00%	5 Missing ⚠️
pyoptsparse/pySNOPT/pySNOPT.py	94.44%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #420      +/-   ##
==========================================
+ Coverage   86.22%   86.41%   +0.19%     
==========================================
  Files          24       24              
  Lines        3418     3438      +20     
==========================================
+ Hits         2947     2971      +24     
+ Misses        471      467       -4

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

marcomangano · 2025-02-05T21:14:55Z

@ewu63 you talked with @awccopp about this PR, but might still want to take a look?
@eirikurj does this look good to you?

lamkina

This looks good to me. I would still wait for @ewu63 or @eirikurj to have a look before merging if possible.

marcomangano · 2025-07-31T16:28:47Z

@ewu63 bumping this up

marcomangano

It looks like it is finally working. Is 120s a reasonable timeout timeout for tests?

ewu63 · 2025-08-11T22:49:03Z

My main concern with this PR is that I think the behaviour of whether to call snSTOP on only the root proc or all procs should be a user configurable option, and I would prefer the default being the prior behaviour (i.e. only on the root proc).

marcomangano · 2025-08-12T14:47:28Z

I see your point @ewu63 . Would that be as simple as adding an additional bool option (say parallelSnStop) and then only run these lines if 'parallelSnStop is True'?

The rest of the edits in the _waitLoop() function are just to make the mode variable more explicit (before it was just checking for the -1 flag to break the wait), and add the parallel option just within pySNOPT.

ewu63 · 2025-08-12T16:29:28Z

Yeah, something like that. The rest of the changes look OK.

ewu63 · 2025-08-17T20:14:46Z

I thought about this PR some more and I have some more reservations

we currently don't have any MPI-based tests and those probably need to be added first to make sure we are not causing any regressions. Add more test coverage #256 lists some of the tests I had in mind
the MPI imports need to be adjusted. There is an implicit contract with OpenMDAO and the wider community that mpi4py is not imported unless instructed -- this is why we have the special pyOpt_MPI module. So the internal code needs to be changed here.

marcomangano · 2025-08-18T17:00:25Z

I see. About the second point though, that would only involve the testing script. The rest of the changes are just ineffective if the code is not run in parallel.
We should probably add some kind of flag though, if someone toggles the proposed parallel snStop flag without having PYOPTSPARSE_REQUIRE_MPI the code will just import the mock class

ewu63 · 2025-08-20T01:09:53Z

Yes, I misread that part of the code I think that's all OK. I can try to address some of this soon but I would prefer holding off on this PR for a bit longer. Trying to be diligent to not break people's stuff downstream.

marcomangano · 2025-08-20T16:12:25Z

I can try to address some of this soon but I would prefer holding off on this PR for a bit longer. Trying to be diligent to not break people's stuff downstream.

Agreed, there is no rush. I can add that flag we mentioned soon, so that the default behavior is unchanged. Down to brainstorm additional tests.

awccopp requested a review from a team as a code owner December 17, 2024 20:03

awccopp requested review from ArshSaja and lamkina December 17, 2024 20:03

marcomangano changed the title ~~parallel snstop (#417)~~ parallel snstop Dec 17, 2024

marcomangano previously approved these changes Feb 5, 2025

View reviewed changes

lamkina approved these changes Feb 5, 2025

View reviewed changes

Merge branch 'main' into ParSNSTOP

59a2901

marcomangano dismissed their stale review via 59a2901 July 21, 2025 21:40

marcomangano added 3 commits July 21, 2025 16:41

Update test_real.sh

01b0107

Fixing error type

c5ed422

Missed an error

6eec953

ewu63 and others added 6 commits August 7, 2025 18:56

Merge branch 'main' into ParSNSTOP

e30a019

Updating OptTest import for hs015_parallel

195249d

isort fix

6ff13b4

set MPI env vars

c35453e

increased timeout now that the tests are run isolated

4e6c920

Merge branch 'ParSNSTOP' of github.com:mdolab/pyoptsparse into ParSNSTOP

feee7a6

marcomangano approved these changes Aug 8, 2025

View reviewed changes

marcomangano added 2 commits August 13, 2025 09:39

Merge branch 'main' into ParSNSTOP

b15d3be

Merge branch 'main' into ParSNSTOP

58d6d01

Merge branch 'main' into ParSNSTOP

9651192

Merge branch 'main' into ParSNSTOP

a8132d8

parallel snstop #420

Are you sure you want to change the base?

parallel snstop #420

Uh oh!

Conversation

awccopp commented Dec 17, 2024 • edited by marcomangano Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Expected time until merged

Type of change

Testing

Checklist

Uh oh!

codecov bot commented Dec 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

marcomangano commented Feb 5, 2025

Uh oh!

lamkina left a comment

Choose a reason for hiding this comment

Uh oh!

marcomangano commented Jul 31, 2025

Uh oh!

marcomangano left a comment

Choose a reason for hiding this comment

Uh oh!

ewu63 commented Aug 11, 2025

Uh oh!

marcomangano commented Aug 12, 2025

Uh oh!

ewu63 commented Aug 12, 2025

Uh oh!

ewu63 commented Aug 17, 2025

Uh oh!

marcomangano commented Aug 18, 2025

Uh oh!

ewu63 commented Aug 20, 2025

Uh oh!

marcomangano commented Aug 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

awccopp commented Dec 17, 2024 •

edited by marcomangano

Loading

codecov bot commented Dec 17, 2024 •

edited

Loading