Releases: EAddario/llama.cpp

b5707

19 Jun 12:58
600e3e9
sycl: Cleanup codepaths in Get Rows in sycl backend (#14215)

Addresses unused reorder path

b5672

15 Jun 20:45
30e5b01
quantize : change int to unsigned int for KV overrides (#14197)

b5669

15 Jun 12:50
5fce5f9
kv-cache : fix use-after-move of defrag info (#14189)

ggml-ci

b5663

14 Jun 14:00
2e42be4
compare-llama-bench: add option to plot (#14169)

* compare-llama-bench: add option to plot

* Address review comments: convert case + add type hints

* Add matplotlib to requirements

* fix tests

* Improve comment and fix assert condition for test

* Add back default test_name, add --plot_log_scale

* use log_scale regardless of x_values

b5649

13 Jun 07:47
c33fe8b
vocab : prevent heap overflow when vocab is too small (#14145)

ggml-ci

b5530

29 May 08:37
6385b84
llama : add RobertaForSequenceClassification reranker support (#13875)

b5497

26 May 15:46
03f582a
server: fix streaming crashes (#13786)

* add preludes to content on partial regex match

* allow all parsers to parse non-tool-call content.

* tweak order of <|python_tag|> vs <function= parsing for the functionary v3.1 format; still not ideal, but hopefully less prone to crashing

b5478

25 May 07:14
f5cd27b
`server`: streaming of tool calls and thoughts when `--jinja` is on (…

b5476

24 May 22:21
17fc817
releases : enable openmp in windows cpu backend build (#13756)

b5373

14 May 08:06
be1d4a1
scripts : fix compare-llama-bench.py show parameter (#13514)