swizzle_dyn: 64 byte swizzle_dyn for AVX2 #480

87flowers · 2025-09-08T09:49:17Z

Implemented a 64 bit swizzle_dyn for AVX2.

Use case: Encountered this while implementing a chess move generator in rust with portable simd.

Would like an alternative suggestion for use of mem::transmute here.

programmerjake · 2025-09-09T10:09:24Z

crates/core_simd/src/swizzle_dyn.rs

+    use x86::_mm256_permute2x128_si256 as avx2_cross_shuffle;
+    use x86::_mm256_shuffle_epi8 as avx2_half_pshufb;
+    let high = Simd::splat(64u8);
+    // SAFETY: Caller promised AVX2


you should probably add more safety comments, e.g. answer why is the transmute sound?

Added one for the transmute. Can't really think of anywhere else where its required.

jhorstmann · 2025-09-09T12:08:01Z

The concat swizzle from #335 would be real nice to replace that transmute.

87flowers · 2025-09-11T16:37:15Z

@jhorstmann Agreed.

programmerjake · 2025-09-11T21:09:47Z

crates/core_simd/src/swizzle_dyn.rs

@@ -220,6 +220,8 @@ unsafe fn avx2_pshufb512(bytes: Simd<u8, 64>, idxs: Simd<u8, 64>) -> Simd<u8, 64

        let z0 = half_swizzler(bytes0, bytes1, idxs0);
        let z1 = half_swizzler(bytes0, bytes1, idxs1);
+
+        // SAFETY: Concatenation of two 32-element vectors to one 64-element vector


this says what your doing, what it should say is why it's safe to use transmute like this. e.g.:
[Simd<u8, 32>; 2] and Simd<u8, 64> both have the same size (64) and no padding bytes, so transmuting is safe

swizzle_dyn: 64 byte swizzle_dyn for AVX2

c045fbe

87flowers force-pushed the avx2_pshufb512 branch from 3bdb528 to c045fbe Compare September 8, 2025 09:54

programmerjake reviewed Sep 9, 2025

View reviewed changes

Add safety comment for mem::transmute

3bb811b

programmerjake reviewed Sep 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

swizzle_dyn: 64 byte swizzle_dyn for AVX2 #480

swizzle_dyn: 64 byte swizzle_dyn for AVX2 #480

Uh oh!

87flowers commented Sep 8, 2025

Uh oh!

programmerjake Sep 9, 2025 •

edited

Loading

Uh oh!

87flowers Sep 11, 2025

Uh oh!

jhorstmann commented Sep 9, 2025

Uh oh!

87flowers commented Sep 11, 2025

Uh oh!

programmerjake Sep 11, 2025

Uh oh!

Uh oh!

swizzle_dyn: 64 byte swizzle_dyn for AVX2 #480

Are you sure you want to change the base?

swizzle_dyn: 64 byte swizzle_dyn for AVX2 #480

Uh oh!

Conversation

87flowers commented Sep 8, 2025

Uh oh!

programmerjake Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

87flowers Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

jhorstmann commented Sep 9, 2025

Uh oh!

87flowers commented Sep 11, 2025

Uh oh!

programmerjake Sep 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

programmerjake Sep 9, 2025 •

edited

Loading