llama.cpp

History

Georgi Gerganov de2ef53a4b kv-cache : rework kv_cell (#13706 ) * kv-cache : rework kv_cell ggml-ci * kv-cells : use "shift" instead of "delta" consistently ggml-ci * llama : add llama_max_parallel_sequences() ggml-ci * kv-cells : update comments [no ci] * context : fail upon construction if sequences exceed max value ggml-ci * kv-cells : get_pos() -> pos_get() + comments ggml-ci * kv-cells : fix tracking of "used" cells ggml-ci	2025-05-25 16:34:36 +03:00
..
llama-cpp.h	llama : add `llama_vocab`, functions -> methods, naming (#11110 )	2025-01-12 11:32:42 +02:00
llama.h	kv-cache : rework kv_cell (#13706 )	2025-05-25 16:34:36 +03:00

Georgi Gerganov de2ef53a4b

* kv-cache : rework kv_cell

ggml-ci

* kv-cells : use "shift" instead of "delta" consistently

ggml-ci

* llama : add llama_max_parallel_sequences()

ggml-ci

* kv-cells : update comments [no ci]

* context : fail upon construction if sequences exceed max value

ggml-ci

* kv-cells : get_pos() -> pos_get() + comments

ggml-ci

* kv-cells : fix tracking of "used" cells

ggml-ci

2025-05-25 16:34:36 +03:00

llama-cpp.h llama : add llama_vocab, functions -> methods, naming (#11110 ) 2025-01-12 11:32:42 +02:00

llama.h kv-cache : rework kv_cell (#13706 ) 2025-05-25 16:34:36 +03:00