Skip to content

Conversation

@whoisj
Copy link
Contributor

@whoisj whoisj commented Nov 10, 2025

This change contains the minimal change to avoid SEGFAULT failures during the BLS Model Loading test.

The crash itself is cause by deleting a shared-memory region's control allocation which can happen when we somehow endup with handle{1} (the control region) in our accounting, and then delete it when its refcount reaches zero.

This change does fix the root cause of how we're accounting for handle{1} (which we should never have).

@whoisj whoisj requested review from pskiran1 and yinggeh November 10, 2025 16:24
@whoisj whoisj added the PR: fix A bug fix label Nov 10, 2025
This change contains the minimal change to avoid SEGFAULT failures during the BLS Model Loading test.

The crash itself is cause by deleting a shared-memory region's control allocation which can happen
when we somehow endup with handle{1} (the control region) in our accounting, and then delete it when
its refcount reaches zero.

This change does fix the root cause of how we're accounting for handle{1} (which we should never have).
@whoisj whoisj force-pushed the jwyman/tri-187-exception-in-bls_model_loading-test branch from 0461377 to b49d9e2 Compare November 10, 2025 18:26
@whoisj whoisj requested a review from yinggeh November 10, 2025 22:22
@whoisj whoisj removed the request for review from pskiran1 November 12, 2025 17:09
@whoisj whoisj merged commit cbb42dd into main Nov 12, 2025
3 checks passed
@whoisj whoisj deleted the jwyman/tri-187-exception-in-bls_model_loading-test branch November 12, 2025 17:10
whoisj added a commit that referenced this pull request Nov 12, 2025
* fix: Fix SEGFAULT in BLS Model Loading

This change contains the minimal change to avoid SEGFAULT failures during the BLS Model Loading test.

The crash itself is cause by deleting a shared-memory region's control allocation which can happen
when we somehow endup with handle{1} (the control region) in our accounting, and then delete it when
its refcount reaches zero.

This change does fix the root cause of how we're accounting for handle{1} (which we should never have).

* rename variable
mc-nv pushed a commit that referenced this pull request Nov 12, 2025
* fix: Fix SEGFAULT in BLS Model Loading

This change contains the minimal change to avoid SEGFAULT failures during the BLS Model Loading test.

The crash itself is cause by deleting a shared-memory region's control allocation which can happen
when we somehow endup with handle{1} (the control region) in our accounting, and then delete it when
its refcount reaches zero.

This change does fix the root cause of how we're accounting for handle{1} (which we should never have).

* rename variable
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

PR: fix A bug fix

Development

Successfully merging this pull request may close these issues.

3 participants