Pull requests: ggml-org/llama.cpp
Pull requests list
cmake : fix ARM feature verification
ggml
changes relating to the ggml tensor library for machine learning
#17170
opened Nov 11, 2025 by
angt
[SYCL]fix ci crash about SSM_CONV
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17169
opened Nov 11, 2025 by
NeoZhangJianyu
server: move res_error/res_ok to static function
examples
server
#17167
opened Nov 11, 2025 by
ngxson
vulkan: change graph_compute to be async and enable get_tensor_async
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17158
opened Nov 10, 2025 by
jeffbolznv
HIP: WMMA-MMQ kernels for RDNA 4
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17156
opened Nov 10, 2025 by
jiachengjason
Draft
llama.android : Rewrite Android binding
android
Issues specific to Android
documentation
Improvements or additions to documentation
examples
ggml
changes relating to the ggml tensor library for machine learning
#17152
opened Nov 10, 2025 by
hanyin-arm
vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17147
opened Nov 10, 2025 by
SavicStefan
metal : make the FA extra sizes consistent
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17143
opened Nov 10, 2025 by
ggerganov
Add complete Megrez-MoE support: GGUF conversion + inference.
model
Model specific
python
python script changes
#17141
opened Nov 10, 2025 by
tamarPal
hexagon: various Op fixes
ggml
changes relating to the ggml tensor library for machine learning
#17135
opened Nov 10, 2025 by
max-krasnyansky
vulkan: disable rms_norm + mul + rope for old gpus
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17134
opened Nov 10, 2025 by
netrunnereve
llama: introduce support for model-embedded sampling parameters
python
python script changes
#17120
opened Nov 9, 2025 by
taronaeo
rpc : fix alloc size logic
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17116
opened Nov 9, 2025 by
ggerganov
2 tasks
CPU SIMD and pipeline optimizations across vec/mmq/ops/kv-cache/repack
ggml
changes relating to the ggml tensor library for machine learning
#17113
opened Nov 8, 2025 by
NoahOksuz
webui : add keyboard shortcut to toggle sidebar
examples
server
#17099
opened Nov 8, 2025 by
danbev
Add Metal-4 Tensor API test harness for iOS
examples
#17098
opened Nov 8, 2025 by
ArjunDivecha
CUDA: support F32 kernel type for CONV_TRANSPOSE_2D
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#17094
opened Nov 8, 2025 by
AgainstEntropy
HIP: RDNA4 tensor core support for MMF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17077
opened Nov 7, 2025 by
zhang-hui-yulo
[RFC] ggml: new backend for API Remoting
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#17072
opened Nov 7, 2025 by
kpouget