[BUG]: Unknown pre-tokenizer type: 'gpt-4o' #1128

koenigst · 2025-03-14T09:47:52Z

Description

Loading a Microsoft Phi-4-mini-instruct (4bit quantization) model fails with:
unknown pre-tokenizer type: 'gpt-4o'
This issue was already addressed in llama.cpp b4792.

Reproduction Steps

LLamaWeights.LoadFromFile(new ModelParams("Phi-4-mini-instruct-Q4_K_M.gguf"));

Environment & Configuration

Operating system: Windows 11
.NET runtime version: 9
LLamaSharp version: 0.21.0
CUDA version (if you are using cuda backend): 12

Known Workarounds

Download newer llama.cpp release b4792
Use NativeLibraryConfig.LLama.WithLibrary to use the downloaded llama.dll

The text was updated successfully, but these errors were encountered:

sangyuxiaowu · 2025-03-14T14:21:20Z

This is a normal occurrence as LLamaSharp relies on llama.cpp, which updates very frequently, sometimes multiple times a day. You can directly download and use the updated DLL from the llama.cpp repository.

martindevans · 2025-03-14T14:26:16Z

LLamaSharp is not generally compatible with any version of llama.cpp except the exact version it was updated to (see the table in the readme for the versions). The llama.cpp API frequently has breaking changes, so downloading DLLs from a different version will cause all kinds of issues.

There's a WIP PR (#1126) which will update llama.cpp to a newer version, that should include the new tokenizer type.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG]: Unknown pre-tokenizer type: 'gpt-4o' #1128

[BUG]: Unknown pre-tokenizer type: 'gpt-4o' #1128

koenigst commented Mar 14, 2025 •

edited

Loading

sangyuxiaowu commented Mar 14, 2025

martindevans commented Mar 14, 2025

[BUG]: Unknown pre-tokenizer type: 'gpt-4o' #1128

[BUG]: Unknown pre-tokenizer type: 'gpt-4o' #1128

Comments

koenigst commented Mar 14, 2025 • edited Loading

Description

Reproduction Steps

Environment & Configuration

Known Workarounds

sangyuxiaowu commented Mar 14, 2025

martindevans commented Mar 14, 2025

koenigst commented Mar 14, 2025 •

edited

Loading