A plug-and-play server that speaks OpenAI’s Responses API, no matter which AI backend you’re running.
Ollama? vLLM? LiteLLM? Even OpenAI itself?
This server bridges any ChatCompletions-compatible backend to OpenAI’s Responses API interface.
In plain words:
👉 Want to run OpenAI’s Coding Assistant (Codex) or other OpenAI API clients against your own models?
👉 Want to experiment with self-hosted LLMs but keep OpenAI’s API compatibility?
This project makes it happen.
It handles stateful chat and tool calls today, with features like file search and a code interpreter on the roadmap, all behind the familiar OpenAI API.
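For instance, stateful chat means a client can chain turns by referencing an earlier response. A minimal sketch of such a Responses API call against this server (the model name and response ID are placeholders, and the server is assumed to be on localhost:8080):
curl http://localhost:8080/v1/responses \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-mockapikey123456789" \
  -d '{
    "model": "llama3",
    "input": "And in French?",
    "previous_response_id": "resp_abc123"
  }'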
⸻
✅ Acts as a drop-in replacement for OpenAI’s Responses API.
✅ Lets you run any backend AI (Ollama, vLLM, Groq, etc.) with OpenAI-compatible clients.
✅ Supports OpenAI’s new Coding Assistant (Codex), which requires the Responses API.
✅ Built for innovators, researchers, and OSS enthusiasts.
✅ Production-minded: containerized, configurable, and covered by tests.
⸻
🔥 What’s in & what’s next?
✅ Done · 📝 Coming soon

- ✅ Tool call support
- ✅ Manual & pipeline tests
- ✅ Docker image build
- ✅ PyPI release
- ✅ CLI validation
- 📝 .env file support
- 📝 Persistent state (not just in-memory)
- 📝 Hosted tools:
  - 📝 MCP support
  - 📝 Web search: crawl4ai
  - 📝 File upload + search: graphiti
  - 📝 Code interpreter
  - 📝 Computer use APIs
⸻
Install the latest release from PyPI:
pip install openai-responses-server
Or install from source:
uv venv
uv pip install .
uv pip install -e ".[dev]" # dev dependencies
Run the server:
# Using CLI tool (after installation)
otc start
# Or directly from source
uv run src/openai_responses_server/cli.py start
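Once the server is running, point any OpenAI client at it via environment variables. A minimal sketch (the official OpenAI Python SDK reads OPENAI_BASE_URL and OPENAI_API_KEY; other clients may need equivalent settings in their own config):
# Route an OpenAI SDK client through this server instead of api.openai.com
export OPENAI_BASE_URL=http://localhost:8080/v1   # the SDK appends endpoint paths such as /responses
export OPENAI_API_KEY=sk-mockapikey123456789      # forwarded to your backend
# then launch your client (Codex, a script, etc.) as usual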
Docker deployment:
# Run with Docker
docker run -p 8080:8080 \
  -e OPENAI_BASE_URL_INTERNAL=http://your-llm-api:8000 \
  -e OPENAI_BASE_URL=http://localhost:8080 \
  -e OPENAI_API_KEY=your-api-key \
  openai-responses-server
Works great with a docker-compose.yaml that runs Codex against your own model.
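As an illustration, a sketch of wiring this server to a local Ollama container over a shared Docker network (the network name, model, and image are assumptions; adjust to your setup):
# Sketch: Ollama + this server on one Docker network
docker network create llm-net
docker run -d --name ollama --network llm-net ollama/ollama
docker exec ollama ollama pull llama3              # fetch a model for the backend
docker run -d --network llm-net -p 8080:8080 \
  -e OPENAI_BASE_URL_INTERNAL=http://ollama:11434 \
  -e OPENAI_BASE_URL=http://localhost:8080 \
  -e OPENAI_API_KEY=sk-mockapikey123456789 \
  openai-responses-server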
⸻
Minimal config to connect your AI backend:
OPENAI_BASE_URL_INTERNAL=http://localhost:11434 # Ollama, vLLM, Groq, etc.
OPENAI_BASE_URL=http://localhost:8080 # This server's endpoint
OPENAI_API_KEY=sk-mockapikey123456789 # Forwarded to the backend; a mock value works if none is required
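Typical defaults for the backends mentioned above (a sketch; the ports are each project’s usual defaults, and whether a /v1 suffix is needed depends on your deployment):
# OPENAI_BASE_URL_INTERNAL=http://localhost:11434  # Ollama default
# OPENAI_BASE_URL_INTERNAL=http://localhost:8000   # vLLM's OpenAI-compatible server default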
Server binding:
API_ADAPTER_HOST=0.0.0.0
API_ADAPTER_PORT=8080
Optional logging:
LOG_LEVEL=INFO
LOG_FILE_PATH=./log/api_adapter.log
Configure with the CLI tool:
# Interactive configuration setup
otc configure
Verify setup:
# Check if the server is working
curl http://localhost:8080/v1/models
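To exercise the full translation path, send a minimal Responses API request (the model name is a placeholder; use one your backend actually serves):
# End-to-end check against the Responses endpoint
curl http://localhost:8080/v1/responses \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-mockapikey123456789" \
  -d '{"model": "llama3", "input": "Say hello."}'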
⸻
If you think this is cool:
⭐ Star the repo.
🐛 Open an issue if something’s broken.
🤝 Suggest a feature or submit a pull request!
This is early-stage but already usable in real-world demos.
Let’s build something powerful—together.
Example setups:
- NVIDIA Jetson devices: docker-compose with Ollama
⸻
- Crawl4AI (LLM-friendly web crawler): UncleCode. (2024). Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper [Computer software]. GitHub. https://github.com/unclecode/crawl4ai
@software{openai-responses-server,
  author       = {TeaBranch},
  title        = {openai-responses-server: Open-source server bridging any AI provider to OpenAI’s Responses API},
  year         = {2025},
  publisher    = {GitHub},
  howpublished = {\url{https://github.com/teabranch/openai-responses-server}},
  commit       = {use the commit hash you’re working with}
}
TeaBranch. (2025). openai-responses-server: Open-source server that serves any OpenAI-ChatCompletions-compatible AI provider through OpenAI’s Responses API and hosted tools [Computer software]. GitHub. https://github.com/teabranch/openai-responses-server