llama.cpp

Xuan-Son Nguyen 1466621e73 llama : Support llama 4 text-only (#12791)	2025-04-07 23:06:44 +02:00

* llama4 conversion
* initial support, no chat template
* clean up a bit
* fix tokenizer conversion
* correct hparams
* try this
* fix shexp
* ffn_inp_normed
* chat template
* clean up model conversion
* add_bos
* add scale_before_ffn
* fix order
* weight_before_ffn
* llm_graph_input_attn_temp
* add chunk attn mask
* build_inp_attn_scale()
* add comment about ggml_repeat
* clarify comments
* fix build
..
llama-cpp.h	llama : add `llama_vocab`, functions -> methods, naming (#11110)	2025-01-12 11:32:42 +02:00
llama.h	llama : Support llama 4 text-only (#12791)	2025-04-07 23:06:44 +02:00
