Add a parallel/regional Compressor #1372

no-defun-allowed · 2025-08-20T07:12:03Z

This PR adds regions to the Compressor. The benefits are that the collector compacts regions in parallel with one another, and that it supports discontiguous heaps (as required for compressed oops in OpenJDK).

no-defun-allowed · 2025-08-20T07:15:43Z

Two issues come to mind:

In the original PR for the Compressor, I modified the CI script to skip some tests on garbage collectors which don't support discontiguous heaps (i.e. only the Compressor). There are no such collectors now, so should I remove that special casing?

I added a RegionPageResource to allow the BumpAllocator to allocate from a region-structured heap. All allocations take a lock, which I'm somewhat unhappy with, as other page resources appear to be lock-free until they need to grow the heap. But the Compressor still has the best mutator time on DaCapo, so I'm not sure if it is worthwhile to remove the lock.

qinsoon · 2025-08-20T22:44:59Z

> Two issues come to mind:
>
> In the original PR for the Compressor, I modified the CI script to skip some tests on garbage collectors which don't support discontiguous heaps (i.e. only the Compressor). There are no such collectors now, so should I remove that special casing?

Yeah. If no other code needs 'discontiguous', preferably it can be cleaned up.

> I added a RegionPageResource to allow the BumpAllocator to allocate from a region-structured heap. All allocations take a lock, which I'm somewhat unhappy with, as other page resources appear to be lock-free until they need to grow the heap. But the Compressor still has the best mutator time on DaCapo, so I'm not sure if it is worthwhile to remove the lock.

If there is no measurable performance issue, I would think it is fine for now.

Just to clarify, how is a RegionPageResource different from other existing page resources? To my understanding, it returns whatever number of pages the space requests, but it 'internally' organizes all the used memory as regions of the same size. BlockPageResource only returns blocks (which are regions of the same size), and the other page resources do not organize used memory as regions of the same size. Is this understanding correct?

no-defun-allowed · 2025-08-21T03:30:18Z

> Just to clarify, how is a RegionPageResource different from other existing page resources?

I think your understanding is right. To elaborate, the RegionPageResource structures the heap into regions, and has a bump allocator for each region. The RegionPageResource can handle requests for any number of pages smaller than a region; but requests probably should be smaller than the region size by a decent margin, to avoid fragmentation. The regions in the Compressor are 1MiB, though that size was picked arbitrarily, and I don't know if that size is a good size for any idea of "good".

RegionPageResource allows the GC design to assume that objects never span multiple regions when they are allocated, which is necessary to compact each region separately; and that the objects in a region will be contiguously allocated, which we utilise by scanning for objects only between the start and the allocation cursor of a region (though I haven't measured how much this matters). The Compressor and RegionPageResource together also maintain allocation order in each region.
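To make the description above concrete, here is a minimal sketch of a region-structured page allocator with one bump cursor per region. It is not the code in this PR: `ToyRegionPages`, `RegionCursor`, `alloc_pages` and the 4 KiB page size are hypothetical names and values chosen for illustration, and for simplicity the sketch accepts requests up to a full region. It only shows the two properties discussed here: every allocation takes a lock, and no allocation spans two regions.

```rust
use std::sync::Mutex;

const PAGE_BYTES: usize = 4096; // assumed page size for this sketch
const REGION_BYTES: usize = 1 << 20; // 1 MiB regions, as in the comment above

/// Hypothetical stand-in for one region and its allocation cursor.
struct RegionCursor {
    start: usize,
    cursor: usize,
}

struct Inner {
    regions: Vec<RegionCursor>,
    next_region_start: usize,
}

/// Hypothetical stand-in for a region-structured page resource.
struct ToyRegionPages {
    // A single lock guards every allocation (not only heap growth).
    inner: Mutex<Inner>,
}

impl ToyRegionPages {
    fn new(heap_start: usize) -> Self {
        let inner = Inner { regions: Vec::new(), next_region_start: heap_start };
        Self { inner: Mutex::new(inner) }
    }

    /// Returns the start address of `pages` contiguous pages, or `None` if
    /// the request is empty or cannot fit inside a single region.
    fn alloc_pages(&self, pages: usize) -> Option<usize> {
        let bytes = pages.checked_mul(PAGE_BYTES)?;
        if bytes == 0 || bytes > REGION_BYTES {
            return None;
        }
        let mut inner = self.inner.lock().unwrap();
        // Fast case: bump the cursor of the most recently opened region.
        if let Some(r) = inner.regions.last_mut() {
            if r.cursor + bytes <= r.start + REGION_BYTES {
                let result = r.cursor;
                r.cursor += bytes;
                return Some(result);
            }
        }
        // Otherwise open a fresh region. The unused tail of the previous
        // region is wasted, which is why requests close to the region size
        // cause fragmentation.
        let start = inner.next_region_start;
        inner.next_region_start += REGION_BYTES;
        inner.regions.push(RegionCursor { start, cursor: start + bytes });
        Some(start)
    }
}
```

In this sketch, a 200-page request followed by a 100-page request lands in two different regions, leaving 56 pages unused at the end of the first one.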

qinsoon · 2025-08-25T06:42:57Z

Just out of curiosity, what is performance like for this PR? It makes the offset calculation and the actual copying parallelized. I would expect it to improve STW time.
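As a rough illustration of why per-region work parallelises, the following toy models each region as a vector of slots, computes a per-region offset (the number of live slots before each slot), and slides live slots down in allocation order. This is only a sketch under simplifying assumptions: `offsets`, `compact` and `compact_all` are hypothetical names, one thread is spawned per region for brevity, and the actual collector works on real heap memory and side metadata instead of vectors.

```rust
use std::thread;

/// Pass 1: for each slot, count the live slots that precede it in the same
/// region. This plays the role of a per-region offset vector in the toy.
fn offsets(region: &[Option<u32>]) -> Vec<usize> {
    let mut live_before = 0;
    region
        .iter()
        .map(|slot| {
            let offset = live_before;
            if slot.is_some() {
                live_before += 1;
            }
            offset
        })
        .collect()
}

/// Pass 2: slide every live slot down to its offset. Because an offset is
/// never larger than the slot's own index, the destination has already been
/// vacated, and allocation order is preserved.
fn compact(region: &mut [Option<u32>]) {
    let offs = offsets(region);
    for i in 0..region.len() {
        if let Some(value) = region[i].take() {
            region[offs[i]] = Some(value);
        }
    }
}

/// Regions share no state, so each one can be compacted by its own thread.
fn compact_all(regions: &mut [Vec<Option<u32>>]) {
    thread::scope(|s| {
        for region in regions.iter_mut() {
            s.spawn(move || compact(region));
        }
    });
}
```

The property that objects never span regions is what makes the per-region passes independent in this toy: no thread ever reads or writes another region's slots.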

k-sareen · 2025-08-25T11:56:14Z

src/util/heap/regionpageresource.rs

+/// A region in a RegionPageResource and its allocation cursor.
+pub struct RegionAllocator<R: Region> {
+    pub region: R,
+    cursor: AtomicUsize,


I'm pretty sure Atomic<Address> also works. It'll simplify the back-and-forth you do with usize. Though hopefully Atomic<Address> doesn't do anything silly on other platforms.
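For readers unfamiliar with the suggestion, the sketch below shows the general idea of a typed atomic cursor using only the standard library. It does not use the `Atomic<Address>` type mentioned above; `Addr`, `AtomicAddr` and `bump` are hypothetical stand-ins, and the lock-free `bump` is an assumed use of such a cursor, not a description of how the PR updates it.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

/// Hypothetical stand-in for an address newtype.
#[derive(Clone, Copy, Debug, PartialEq, Eq, PartialOrd, Ord)]
struct Addr(usize);

/// A typed atomic cell: the usize <-> Addr conversion lives in one place
/// instead of at every load and store site.
struct AtomicAddr(AtomicUsize);

impl AtomicAddr {
    fn new(a: Addr) -> Self {
        Self(AtomicUsize::new(a.0))
    }

    fn load(&self) -> Addr {
        Addr(self.0.load(Ordering::Relaxed))
    }

    /// Advances the cursor by `bytes` without taking a lock. Returns the old
    /// cursor on success, or `None` if the new cursor would pass `limit`.
    fn bump(&self, bytes: usize, limit: Addr) -> Option<Addr> {
        self.0
            .fetch_update(Ordering::Relaxed, Ordering::Relaxed, |cur| {
                let new = cur.checked_add(bytes)?;
                (new <= limit.0).then_some(new)
            })
            .ok()
            .map(Addr)
    }
}
```

With a wrapper like this, callers only ever see `Addr` values, which is the simplification the comment is pointing at.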

no-defun-allowed · 2025-08-26T06:54:25Z

> Just out of curiosity, what is performance like for this PR? It makes the offset calculation and the actual copying parallelized. I would expect it to improve STW time.

I don't have apples-to-apples results handy - I have benchmark results with compressed oops for the regional Compressor, but none without compressed oops.

k-sareen · 2025-08-26T07:08:17Z

> > Just out of curiosity, what is performance like for this PR? It makes the offset calculation and the actual copying parallelized. I would expect it to improve STW time.
>
> I don't have apples-to-apples results handy - I have benchmark results with compressed oops for the regional Compressor, but none without compressed oops.

The comparisons against SemiSpace, Immix, etc. would still be interesting to see. I don't think Yi would have seen those results.

no-defun-allowed · 2025-08-26T07:14:17Z

> > > Just out of curiosity, what is performance like for this PR? It makes the offset calculation and the actual copying parallelized. I would expect it to improve STW time.
> >
> > I don't have apples-to-apples results handy - I have benchmark results with compressed oops for the regional Compressor, but none without compressed oops.
>
> The comparisons against SemiSpace, Immix, etc. would still be interesting to see. I don't think Yi would have seen those results.

Right. I have such comparisons on plotty: http://squirrel.anu.edu.au/plotty/hayleyp/plots/p/CDBtDw

(Edited because I put the wrong logs in - I re-benchmarked with the minheaps for Compressor, as those are the smallest.)

no-defun-allowed and others added 30 commits July 15, 2025 15:23

Add a vaguely Compressor-esque GC

1ddbb5e

Clean up some

b80ba55

rustfmt

cd136b1

clippy fix

43b094a

cargo fmt

4e02f12

Review comments

10e5439

Capitalise sentences

e568453

Allow running MockVM with side metadata

b30d542

Add a feature to disable LOS and immortal space for Compressor

237fbbf

Use MMTk metadata for the offset vector and a separate mark bitmap

efcf713

Revert "Allow running MockVM with side metadata"

def9e36

This reverts commit ac90d4b.

Fix some comments

4608807

Skip MockVM tests with the Compressor on unsupported configurations

2470de7

Include common plan for create_space_mapping

cd3bde3

cargo fmt

a004cd6

i686 is spelled x86 in this instance

dfe9185

Clean up some more

b420216

Start breaking up the offset vector

6c272ff

Fix up some names and comments, and add SideMetadataSpec::are_different_metadata

1ef15cf

Add a warning about the compressor_single_space feature

b819763

More spellings of things

fd84cbf

Use Region and RegionIterator for blocks

5845836

Actually use the LOS

438f248

cargo fmt and clippy

e32d7a3

Skip mock VM tests for Compressor

ce4e1f4

Merge branch 'master' into parallel-compressor

e5f449a

Do something very wrong with side metadata

55d8cd0

Be less silly with side metadata

86346a6

Regions work

1154d80

Generify CompressorPageResource -> RegionPageResource

617c5c6

no-defun-allowed added 8 commits August 5, 2025 07:27

Relax allocator requirements on RegionPageResource

7eceb44

Hand out immutable references to RegionAllocators

f9d6939

Parallel regional Compressor

c6eeeab

Parallelise computing the offset vector, cargo fmt

d40ab18

Merge https://github.com/mmtk/mmtk-core into parallel-compressor

6c0cd6c

Remove OffsetVectorRegion

5257232

Write better documentation

001adae

cargo clippy

2c2bde5

no-defun-allowed added 2 commits August 21, 2025 05:55

Fix broken rustdoc link

ddd646e

Remove MMTK_PLAN=discontiguous from CI

fefe788

no-defun-allowed marked this pull request as ready for review August 22, 2025 06:18

qinsoon self-requested a review August 25, 2025 05:29

qinsoon added the PR-extended-testing Run extended tests for the pull request label Aug 25, 2025

qinsoon reviewed Aug 25, 2025

k-sareen reviewed Aug 25, 2025

no-defun-allowed added 2 commits August 26, 2025 03:39

Report the right number of required pages

de59733

Create CalculateOffsetVector and Compact work just before they're needed

238f33c

Clean up uses and documentation

30d3dcb
