
Conversation

@warpdev (Contributor) commented Aug 23, 2025

Fixes #2610

This PR sorts the tools in get_openai_tools by name to ensure a consistent MCP tool order.

Currently, MCP servers are stored in a HashMap, which does not guarantee ordering. As a result, the tool order changes across turns, effectively breaking prompt caching in multi-turn sessions.

An alternative solution would be to replace the HashMap with an ordered structure, but that would require a much larger code change. Since no realistic session has enough MCP tools for sorting to be a performance concern, this lightweight fix is chosen instead.
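As a rough sketch of the idea (the `Tool` type here is a hypothetical stand-in; the real code lives in `get_openai_tools`), the fix amounts to collecting the tools out of the HashMap and sorting them by name before emitting them:

```rust
use std::collections::HashMap;

// Hypothetical stand-in for the real tool type.
#[derive(Clone)]
struct Tool {
    name: String,
}

// Collect MCP tools from the HashMap, then sort by name so the order
// is stable across turns (HashMap iteration order is not guaranteed).
fn get_openai_tools(mcp_tools: &HashMap<String, Tool>) -> Vec<Tool> {
    let mut tools: Vec<Tool> = mcp_tools.values().cloned().collect();
    tools.sort_by(|a, b| a.name.cmp(&b.name));
    tools
}
```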

By ensuring deterministic tool order, this change should significantly improve cache hit rates and prevent users from hitting usage limits too quickly. (For reference, my own sessions last week reached the limit unusually fast, with cache hit rates falling below 1%.)

Result

After this fix, sessions with MCP servers now show caching behavior almost identical to sessions without MCP servers.

[Screenshots comparing token-usage breakdowns: Without MCP | With MCP]


github-actions bot commented Aug 23, 2025

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

@warpdev (Contributor, Author) commented Aug 23, 2025

I have read the CLA Document and I hereby sign the CLA

github-actions bot added a commit that referenced this pull request Aug 23, 2025
@warpdev changed the title from "Make MCP tools order deterministic to cache input tokens" to "Fix cache hit rate by making MCP tools order deterministic" Aug 23, 2025
@bolinfest (Collaborator) commented

Or should we switch to IndexMap so the order is consistent, but matches the order users list them in config.toml (assuming our TOML parser preserves order...)?
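For illustration, a minimal sketch of the IndexMap behavior being suggested (assuming the indexmap crate; whether the TOML parser actually preserves order remains the open question above):

```rust
use indexmap::IndexMap;

fn main() {
    // Unlike HashMap, IndexMap iterates in insertion order, so tools
    // would come out in the order users list their MCP servers.
    let mut servers: IndexMap<&str, &str> = IndexMap::new();
    servers.insert("zeta-server", "...");
    servers.insert("alpha-server", "...");

    // Prints "zeta-server" then "alpha-server": insertion order,
    // not alphabetical or hash order.
    for name in servers.keys() {
        println!("{name}");
    }
}
```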

@warpdev (Contributor, Author) commented Aug 25, 2025

@bolinfest I agree. Switching to IndexMap would make the intent clearer and ensure the order is preserved. The change is a bit broader than my current patch. Would that be okay?

@dylan-hurd-oai (Collaborator) commented

@warpdev @bolinfest given the impact here, I think it would be reasonable to land the fix with the current sort implementation and leave 2 things as follow-ups:

  1. the IndexMap change
  2. adding an integration test to prompt_caching.rs to test this end to end and ensure any serialization changes don't impact our cache hit rate (a rough sketch of the idea follows below)
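For illustration, a rough sketch of what such a test could assert, reusing the hypothetical `Tool` and `get_openai_tools` names from the earlier sketch (the real test would live in prompt_caching.rs and exercise serialization end to end):

```rust
// Build the same HashMap twice; each HashMap instance gets its own
// hasher state, so iteration order can differ between the two.
fn make_test_tools() -> HashMap<String, Tool> {
    ["beta", "alpha", "gamma"]
        .iter()
        .map(|n| (n.to_string(), Tool { name: n.to_string() }))
        .collect()
}

#[test]
fn mcp_tool_order_is_deterministic() {
    let first = get_openai_tools(&make_test_tools());
    let second = get_openai_tools(&make_test_tools());
    let names = |tools: &[Tool]| -> Vec<String> {
        tools.iter().map(|t| t.name.clone()).collect()
    };
    // Any drift in tool order between turns would invalidate the
    // prompt cache, so the two listings must match exactly.
    assert_eq!(names(&first), names(&second));
}
```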

@bolinfest (Collaborator) commented

Ok I'll drive the IndexMap change after...thanks for taking the initiative on this!

@bolinfest bolinfest self-assigned this Aug 25, 2025
@bolinfest bolinfest self-requested a review August 25, 2025 02:29
@dylan-hurd-oai (Collaborator) commented

@bolinfest I'm happy to take on the IndexMap change! I did suggest this approach, after all 😅

@dylan-hurd-oai (Collaborator) commented

Also documenting that I tested this change with MCP servers enabled: I reproduced the issue and confirmed the cache hit rate is back up on this branch.

@bolinfest bolinfest removed their assignment Aug 25, 2025
@bolinfest bolinfest merged commit ee2ccb5 into openai:main Aug 25, 2025
15 of 18 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Aug 25, 2025
Linked issue: Token cache rate drops drastically when using more than one MCP tool (#2610)