Releases: gfx-rs/wgpu
v27.0.4
This release includes wgpu-hal version 27.0.4. All other crates remain at their previous versions.
Bug Fixes
General
- Remove fragile dependency constraint on ordered-floatthat prevented semver-compatible changes above5.0.0. By @kpreid in #8371.
Vulkan
- Work around extremely poor frame pacing from AMD and Nvidia cards on Windows in FifoandFifoRelaxedpresent modes. This is due to the drivers implicitly using a DXGI (Direct3D) swapchain to implement these modes and it having vastly different timing properties. See #8310 and #8354 for more information. By @cwfitzgerald in #8420.
v26.0.6
This release includes wgpu-hal version 26.0.6. All other crates remain at their previous versions.
Bug Fixes
Vulkan
- Work around extremely poor frame pacing from AMD and Nvidia cards on Windows in FifoandFifoRelaxedpresent modes. This is due to the drivers implicitly using a DXGI (Direct3D) swapchain to implement these modes and it having vastly different timing properties. See #8310 and #8354 for more information. By @cwfitzgerald in #8420.
v27.0.3
This release includes naga, wgpu-core and wgpu-hal version 27.0.3. All other crates remain at their previous versions.
Bug Fixes
naga
- Fix a bug that resulted in the Metal error program scope variable must reside in constant address spacein some cases. Backport of #8311 by @teoxoy.
General
- Remove an assertion that causes problems if CommandEncoder::as_hal_mutis used. By @andyleiserson in #8387.
DX12
- Align copies b/w textures and buffers via a single intermediate buffer per copy when D3D12_FEATURE_DATA_D3D12_OPTIONS13.UnrestrictedBufferTextureCopyPitchSupportedisfalse. By @ErichDonGubler in #7721, backported in #8374.
v26.0.5
This release includes wgpu-hal version 26.0.5. All other crates remain at their previous versions.
Bug Fixes
DX12
- Align copies b/w textures and buffers via a single intermediate buffer per copy when D3D12_FEATURE_DATA_D3D12_OPTIONS13.UnrestrictedBufferTextureCopyPitchSupportedisfalse. By @ErichDonGubler in #7721, backported in #8375.
v27.0.2
This release includes wgpu-hal version 27.0.2. All other crates remain at their previous versions.
Bug Fixes
DX12
- Fix device creation failures for devices that do not support mesh shaders. By @vorporeal in #8297.
v27.0.1
This release includes wgpu, wgpu-core, wgpu-hal, and wgpu-types version 27.0.1. All other crates remain at their previous versions.
Bug Fixes
- Fixed the build on docs.rs. By @cwfitzgerald in #8292.
v27.0.0
Major Changes
Deferred command buffer actions: map_buffer_on_submit and on_submitted_work_done
You may schedule buffer mapping and a submission-complete callback to run automatically after you submit, directly from encoders, command buffers, and passes.
// Record some GPU work so the submission isn't empty and touches `buffer`.
encoder.clear_buffer(&buffer, 0, None);
// Defer mapping until this encoder is submitted.
encoder.map_buffer_on_submit(&buffer, wgpu::MapMode::Read, 0..size, |result| { .. });
// Fires after the command buffer's work is finished.
encoder.on_submitted_work_done(|| { .. });
// Automatically calls `map_async` and `on_submitted_work_done` after this submission finishes.
queue.submit([encoder.finish()]);Available on CommandEncoder, CommandBuffer, RenderPass, and ComputePass.
By @cwfitzgerald in #8125.
Builtin Support for DXGI swapchains on top of of DirectComposition Visuals in DX12
By enabling DirectComposition support, the dx12 backend can now support transparent windows.
This creates a single IDCompositionVisual over the entire window that is used by the mfSurface. If a user wants to manage the composition tree themselves, they should create their own device and composition, and pass the relevant visual down into wgpu via SurfaceTargetUnsafe::CompositionVisual.
let instance = wgpu::Instance::new(&wgpu::InstanceDescriptor {
    backend_options: wgpu::BackendOptions {
        dx12: wgpu::Dx12BackendOptions {
            presentation_system: wgpu::Dx12SwapchainKind::DxgiFromVisual,
            ..
        },
        ..
    },
    ..
});By @n1ght-hunter in #7550.
EXPERIMENTAL_RAY_TRACING_ACCELERATION_STRUCTURE has been merged into EXPERIMENTAL_RAY_QUERY
We have merged the acceleration structure feature into the RayQuery feature. This is to help work around an AMD driver bug and reduce the feature complexity of ray tracing. In the future when ray tracing pipelines are implemented, if either feature is enabled, acceleration structures will be available.
- Features::EXPERIMENTAL_RAY_TRACING_ACCELERATION_STRUCTURE
+ Features::EXPERIMENTAL_RAY_QUERYNew EXPERIMENTAL_PRECOMPILED_SHADERS API
We have added Features::EXPERIMENTAL_PRECOMPILED_SHADERS, replacing existing passthrough types with a unified CreateShaderModuleDescriptorPassthrough which allows passing multiple shader codes for different backends. By @SupaMaggie70Incorporated in #7834
Difference for SPIR-V passthrough:
- device.create_shader_module_passthrough(wgpu::ShaderModuleDescriptorPassthrough::SpirV(
-     wgpu::ShaderModuleDescriptorSpirV {
-         label: None,
-         source: spirv_code,
-     },
- ))
+ device.create_shader_module_passthrough(wgpu::ShaderModuleDescriptorPassthrough {
+     entry_point: "main".into(),
+     label: None,
+     spirv: Some(spirv_code),
+     ..Default::default()
})This allows using precompiled shaders without manually checking which backend's code to pass, for example if you have shaders precompiled for both DXIL and SPIR-V.
Buffer mapping apis no longer have lifetimes
Buffer::get_mapped_range(), Buffer::get_mapped_range_mut(), and Queue::write_buffer_with() now return guard objects without any lifetimes. This
makes it significantly easier to store these types in structs, which is useful for building utilities that build the contents of a buffer over time.
- let buffer_mapping_ref: wgpu::BufferView<'_>           = buffer.get_mapped_range(..);
- let buffer_mapping_mut: wgpu::BufferViewMut<'_>        = buffer.get_mapped_range_mut(..);
- let queue_write_with:   wgpu::QueueWriteBufferView<'_> = queue.write_buffer_with(..);
+ let buffer_mapping_ref: wgpu::BufferView               = buffer.get_mapped_range(..);
+ let buffer_mapping_mut: wgpu::BufferViewMut            = buffer.get_mapped_range_mut(..);
+ let queue_write_with:   wgpu::QueueWriteBufferView     = queue.write_buffer_with(..);By @sagudev in #8046 and @cwfitzgerald in #8070.
EXPERIMENTAL_* features now require unsafe code to enable
We want to be able to expose potentially experimental features to our users before we have ensured that they are fully sound to use.
As such, we now require any feature that is prefixed with EXPERIMENTAL to have a special unsafe token enabled in the device descriptor
acknowledging that the features may still have bugs in them and to report any they find.
adapter.request_device(&wgpu::DeviceDescriptor {
    features: wgpu::Features::EXPERIMENTAL_MESH_SHADER,
    experimental_features: unsafe { wgpu::ExperimentalFeatures::enabled() }
    ..
})By @cwfitzgerald in #8163.
Multi-draw indirect is now unconditionally supported when indirect draws are supported
We have removed Features::MULTI_DRAW_INDIRECT as it was unconditionally available on all platforms.
RenderPass::multi_draw_indirect is now available if the device supports downlevel flag DownlevelFlags::INDIRECT_EXECUTION.
The Feature::MULTI_DRAW_INDIRECT_COUNT feature can be used to determine if multi-draw is supported natively on the device. This is helpful to know if you are using spirv-passthrough and gl_DrawID in your shaders.
By @cwfitzgerald in #8162.
wgpu::PollType::Wait has now an optional timeout
We removed wgpu::PollType::WaitForSubmissionIndex and added fields to wgpu::PollType::Wait in order to express timeouts.
Before/after for wgpu::PollType::Wait:
-device.poll(wgpu::PollType::Wait).unwrap();
-device.poll(wgpu::PollType::wait_indefinitely()).unwrap();
+device.poll(wgpu::PollType::Wait {
+      submission_index: None, // Wait for most recent submission
+      timeout: Some(std::time::Duration::from_secs(60)), // Previous behavior, but more likely you want `None` instead.
+  })
+  .unwrap();Before/after for wgpu::PollType::WaitForSubmissionIndex:
-device.poll(wgpu::PollType::WaitForSubmissionIndex(index_to_wait_on))
+device.poll(wgpu::PollType::Wait {
+      submission_index: Some(index_to_wait_on),
+      timeout: Some(std::time::Duration::from_secs(60)), // Previous behavior, but more likely you want `None` instead.
+  })
+  .unwrap();wgpu::PollType::WaitForSubmissionIndex and wgpu::PollType::Wait had a hard-coded timeout of 60 seconds.
To wait indefinitely on the latest submission, you can also use the wait_indefinitely convenience function:
device.poll(wgpu::PollType::wait_indefinitely());New Features
General
- Added mesh shader support to wgpu, with examples. Requires passthrough. By @SupaMaggie70Incorporated in #7345.
- Added support for external textures based on WebGPU's GPUExternalTexture. These allow shaders to transparently operate on potentially multiplanar source texture data in either RGB or YCbCr formats via WGSL'stexture_externaltype. This is gated behind theFeatures::EXTERNAL_TEXTUREfeature, which is currently only supported on DX12. By @jamienicol in #4386.
- wgpu::Device::pollcan now specify a timeout via- wgpu::PollType::Wait. By @Wumpf in #8282 & #8285
naga
- Expose naga::front::wgsl::UnimplementedEnableExtension. By @ErichDonGubler in #8237.
Changes
General
- Command encoding now happens when CommandEncoder::finishis called, not when the individual operations are requested. This does not affect the API, but may affect performance characteristics. By @andyleiserson in #8220.
- Prevent resources for acceleration structures being created if acceleration structures are not enabled. By @Vecvec in #8036.
- Validate that each push_debug_grouppairs with exactly onepop_debug_group. By @andyleiserson in #8048.
- set_viewportnow requires that the supplied minimum depth value is less than the maximum depth value. By @andyleiserson in #8040.
- Validation of copy_texture_to_buffer,copy_buffer_to_texture, andcopy_texture_to_textureoperations more closely follows the WebGPU specification. By @andyleiserson in various PRs.- Copies within the same texture must not overlap.
- Copies of multisampled or depth/stencil formats must span an entire subresource (layer).
- Copies of depth/stencil formats must be 4B aligned.
- For texture-buffer copies, bytes_per_rowon the buffer side must be 256B-aligned, even if the transfer is a single row.
 
- The offset for set_vertex_bufferandset_index_buffermust be 4B aligned. By @andyleiserson in #7929.
- The offset and size of bindings are validated as fitting within the underlying buffer in more cases. By @andyleiserson in #7911.
- The function you pass to Device::on_uncaptured_error()must now implementSyncin addition toSend, and be wrapped inArc...
v26.0.4
v26.0.3
This release includes wgpu-hal version v26.0.3. All other crates remain at their previous versions.
Bug Fixes
- Fixed memory leak in vulkan backend. By @cwfitzgerald in #8031.
v26.0.2
This release includes wgpu-hal version v26.0.2. All other crates remain at their previous versions.
Bug Fixes
- Fixed vulkan validation error regarding the swapchain in latest SDK. By @cwfitzgerald in #7971.
- Fixed flickering on AMD devices and crashes inside Renderdoc due to incorrect caching of VkFramebuffers when the driver re-used image view handles. By @cwfitzgerald in #7972.
Warning
There is formally a breaking change in wgpu_hal::vulkan::Device::texture_from_raw
as there is now a &self receiver where there previously wasn't one. This will not
affect you unless you explicitly use this api. We have gone ahead with the release
as the bug was pervasive and made wgpu unusable for the affected people on v26.