Releases: rmatif/llama.cpp

b6360

02 Sep 21:01
3de0082

fix: resolve unsigned int initialization warning for n_dims/size in g…

b6277

25 Aug 22:20
74f52f7

CUDA: Accelerate MXFP4 table lookup using `__byte_perm` (#15451)

* CUDA: optimize get_int_from_table_16

* CUDA: use v_perm_b32 to replace byte_perm on AMD GPUs

* revise documentation

---------

Co-authored-by: xix <[email protected]>
Co-authored-by: Johannes Gäßler <[email protected]>
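
For readers unfamiliar with the byte-permute trick named in the b6277 title, the following is a minimal host-side sketch of how a 16-entry byte table can be indexed with three permute operations. It is not the code from the PR: `byte_perm_emulated`, `kTable`, `lookup4`, and `lookup4_reference` are hypothetical names, the table values are illustrative, and the emulation only models the plain byte-select behaviour of CUDA's `__byte_perm` (low 3 bits of each selector nibble).

```cpp
#include <array>
#include <cstdint>

// Host-side stand-in for CUDA's __byte_perm(x, y, s): selects four bytes from
// the eight bytes {x: bytes 0-3, y: bytes 4-7}, one per selector nibble.
// Only the low 3 bits of each nibble are honoured here (an assumption of this sketch).
static uint32_t byte_perm_emulated(uint32_t x, uint32_t y, uint32_t s) {
    const uint64_t pool = (static_cast<uint64_t>(y) << 32) | x;
    uint32_t out = 0;
    for (int k = 0; k < 4; ++k) {
        const uint32_t idx  = (s >> (4 * k)) & 0x7u;
        const uint32_t byte = static_cast<uint32_t>((pool >> (8 * idx)) & 0xFFu);
        out |= byte << (8 * k);
    }
    return out;
}

// Illustrative 16-entry signed byte table (hypothetical, not taken from the release notes).
static const std::array<int8_t, 16> kTable = {
    0, 1, 2, 3, 4, 6, 8, 12, 0, -1, -2, -3, -4, -6, -8, -12};

// Looks up four 4-bit indices packed in the low 16 bits of q
// (index k lives in bits [4k+3 : 4k]) and returns the four table bytes packed
// into one 32-bit word.
static uint32_t lookup4(uint32_t q) {
    // Pack the table into four 32-bit words, byte 0 in the lowest bits.
    uint32_t t[4] = {0, 0, 0, 0};
    for (int i = 0; i < 16; ++i) {
        t[i / 4] |= static_cast<uint32_t>(static_cast<uint8_t>(kTable[i])) << (8 * (i % 4));
    }
    // The low 3 bits of each index pick a byte within one half of the table.
    const uint32_t sel = q & 0x7777u;
    const uint32_t lo  = byte_perm_emulated(t[0], t[1], sel);  // entries 0..7
    const uint32_t hi  = byte_perm_emulated(t[2], t[3], sel);  // entries 8..15
    // Bit 3 of each index chooses, per output byte, between lo (bytes 0-3)
    // and hi (bytes 4-7) in a final permute.
    const uint32_t pick = 0x3210u | ((q & 0x8888u) >> 1);
    return byte_perm_emulated(lo, hi, pick);
}

// Straightforward per-index lookup, used only to check the permute version.
static uint32_t lookup4_reference(uint32_t q) {
    uint32_t out = 0;
    for (int k = 0; k < 4; ++k) {
        const uint32_t idx = (q >> (4 * k)) & 0xFu;
        out |= static_cast<uint32_t>(static_cast<uint8_t>(kTable[idx])) << (8 * k);
    }
    return out;
}
```

The point of the design is that each permute resolves four indices at once, so the table lookup needs no per-byte memory access. On a GPU the emulated helper would presumably be replaced by the hardware intrinsic.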

b6082

04 Aug 14:55
5aa1105

vulkan: fix build when using glslang that does not support coopmat2 (…