llama : Support Qwen3 and Qwen3MoE (#12828)

* add qwen3 & qwen3moe support.

* fix

---------

Co-authored-by: bozheng-hit <dsoul0621@gmail.com>
Bo Zheng 2025-04-09 17:47:36 +08:00 committed by GitHub
parent d9a63b2f2e
commit d3bd7193ba
5 changed files with 441 additions and 0 deletions

@@ -30,6 +30,8 @@ enum llm_arch {
     LLM_ARCH_QWEN2,
     LLM_ARCH_QWEN2MOE,
     LLM_ARCH_QWEN2VL,
+    LLM_ARCH_QWEN3,
+    LLM_ARCH_QWEN3MOE,
     LLM_ARCH_PHI2,
     LLM_ARCH_PHI3,
     LLM_ARCH_PHIMOE,