Pull requests: ggml-org/llama.cpp

Pull requests list

[WIP] s390x ci: debug build issue
Labels: devops
#17053 opened Nov 6, 2025 by AlekseiNikiforovIBM

# Add Megrez-MoE Architecture Support ggml-org#16724
#17052 opened Nov 6, 2025 by tamarPal

cuda: extended MMF_ROWS_PER_BLOCK
Labels: ggml, Nvidia GPU
#17051 opened Nov 6, 2025 by zhang-hui-yulo

Add MoE dynamic routing with expert caching
Labels: build, documentation, examples
#17044 opened Nov 6, 2025 by jmangold23 (Draft)

ggml-hexagon: fix test-backend-ops failures on specific binary ops
Labels: ggml
#17042 opened Nov 6, 2025 by chraac (Draft)

CUDA: only use moe_expert_reduce when n_tokens=1
Labels: ggml, Nvidia GPU
#17032 opened Nov 5, 2025 by am17an

ggml webgpu: faster matrix multiplication/matrix-vector multiplication
Labels: devops, ggml, python
#17031 opened Nov 5, 2025 by reeselevine

ggml-cpu: handle 3d tensors in repack mat_mul
Labels: ggml
#17030 opened Nov 5, 2025 by Alcpz

tests(test-backend-ops): Test backend ops verbosity
Labels: testing
#17029 opened Nov 5, 2025 by gabe-l-hart

vulkan: Fix test-thread-safety crashes
Labels: ggml, Vulkan
#17024 opened Nov 5, 2025 by jeffbolznv

cuda/vulkan : bicubic interpolation
Labels: ggml, Nvidia GPU, OpenCL, testing, Vulkan
#17022 opened Nov 5, 2025 by Acly

ci: add Arm-hosted Graviton4 runner
Labels: devops
#17021 opened Nov 5, 2025 by sudhiarm (Draft)

memory: Hybrid context shift
Labels: examples
#17009 opened Nov 4, 2025 by gabe-l-hart

sampling : add support for GPU sampling (wip)
Labels: ggml, Nvidia GPU, testing
#17004 opened Nov 4, 2025 by danbev (Draft)
9 tasks

Q4/Q8 Tiled Gemm Optimization.
Labels: ggml
#16999 opened Nov 4, 2025 by shalinib-ibm

kleidiai: add optimized per-channel kernels for Q8_0
Labels: ggml
#16993 opened Nov 4, 2025 by chaxu01

CUDA: add stream-based concurrency
Labels: ggml, Nvidia GPU
#16991 opened Nov 4, 2025 by am17an (Draft)
2 tasks

Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures)
Labels: ggml, Nvidia GPU, testing, Vulkan
#16985 opened Nov 4, 2025 by Phylliida

Mamba2 SSD
Labels: Apple Metal, examples, ggml, model, Nvidia GPU, testing
#16982 opened Nov 3, 2025 by gabe-l-hart (Draft)