
feat: add model discovery / direct connection #1865


Open · wants to merge 5 commits into main

Conversation


@nzlz nzlz commented May 29, 2025

  1. Add -d/--discover-models flag to discover and select models from local APIs
  2. Add -dd/--discover-direct flag for direct connection to last used model
  3. Support for Ollama, LM Studio, and other OpenAI-compatible endpoints
  4. Persistent configuration storage in ~/.pydantic-ai/discovery.json
  5. Interactive model selection with last-used indicators
  6. Update README with new CLI options
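The discovery flow described above (querying local OpenAI-compatible servers such as Ollama or LM Studio for their available models) could be sketched roughly like this; the endpoint path and helper names are illustrative assumptions, not the PR's actual code:

```python
import json
from urllib.request import urlopen


def parse_models_response(payload: dict) -> list[str]:
    """Extract model IDs from an OpenAI-style GET /models response body."""
    return [m["id"] for m in payload.get("data", [])]


def discover_models(base_url: str) -> list[str]:
    """List models from an OpenAI-compatible server,
    e.g. base_url='http://localhost:11434/v1' for Ollama (assumed default port)."""
    with urlopen(f"{base_url.rstrip('/')}/models") as resp:
        return parse_models_response(json.load(resp))
```

Ollama and LM Studio both expose an OpenAI-compatible models endpoint, which is what lets a single discovery path work across them.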

What do you think?


Contributor

hyperlint-ai bot commented May 29, 2025

PR Change Summary

Enhanced the CLI with model discovery and direct connection features for improved user experience.

  • Introduced the -d/--discover-models flag for discovering models from local APIs.
  • Added the -dd/--discover-direct flag for direct connection to the last used model.
  • Implemented support for Ollama, LM Studio, and other OpenAI-compatible endpoints.
  • Enabled persistent configuration storage for model discovery.

Modified Files

  • clai/README.md

How can I customize these reviews?

Check out the Hyperlint AI Reviewer docs for more information on how to customize the review.

If you just want to ignore it on this PR, you can add the hyperlint-ignore label to the PR. Future changes won't trigger a Hyperlint review.

Note that for link checks, we only check the first 30 links in a file and cache the results for several hours, so a freshly added page may show stale results. In that case, we recommend adding the hyperlint-ignore label to skip the link check for this PR.

@nzlz
Author

nzlz commented Jun 12, 2025

ping @DouweM @Kludex, any thoughts? I hope this or an alternative solution gets merged so that we can use it locally.

I see older related PRs lost in limbo: https://github.com/pydantic/pydantic-ai/pull/1103/files

@DouweM
Contributor

DouweM commented Jun 12, 2025

@nzlz Thanks for the ping, sorry for having missed this originally.

I like adding OpenAI-compatible API base URL support and model discovery to the CLI, as well as a way to reuse the most recently used model instead of having to specify it again.

I think we can clean up the implementation and UX a bit though, and implement it in a few stages so this is easier to reason about and review:

Stage 1

  1. We add a --provider/-p argument that takes one of the known provider names (see the infer_provider method in providers/__init__.py)
  2. We change --model/-m to take the base model name, without the provider: prefix, if a provider is specified
  3. We change --list/-l to return only model names for that provider (by matching on prefix, and stripping it because -m wouldn't need it)

Stage 2

  1. We extend --provider/-p to take an OpenAI-compatible base URL.
  2. We add a discover_models method to OpenAIProvider that returns a list of model names
  3. If the provider is a base URL, we call OpenAIProvider.discover_models and list those names
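One way the Stage 2 dispatch could look (illustrative only; the proposed OpenAIProvider.discover_models is represented by a placeholder):

```python
def is_base_url(provider: str) -> bool:
    """Decide whether -p/--provider is a known name or a base URL."""
    return provider.startswith(("http://", "https://"))


def list_models(provider: str, known: dict[str, list[str]]) -> list[str]:
    """Return known model names for a provider name, or discover them
    from an OpenAI-compatible base URL."""
    if is_base_url(provider):
        # Here the proposed OpenAIProvider.discover_models would be called,
        # i.e. a GET {provider}/models request against the base URL.
        raise NotImplementedError("network discovery not sketched here")
    return known.get(provider, [])
```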

Stage 3

  1. We store the last used provider+model or agent in .pydantic-ai
  2. We add a --repeat/-r flag to use the last used model/agent

Stage 4

  1. We add a --choose/-c option that lets the user interactively pick a provider (by number or string, or by entering a URL) and then a model (known or discovered, by number or name) and then launch them into the chat. If a provider name/URL is already specified, we only let them pick a model.
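The "by number or name" selection in Stage 4 could resolve user input with a small helper like this (hypothetical, shown only to pin down the intended behavior):

```python
def pick(options: list[str], entry: str) -> str:
    """Resolve interactive input as a 1-based index or an exact option name."""
    if entry.isdigit() and 1 <= int(entry) <= len(options):
        return options[int(entry) - 1]
    if entry in options:
        return entry
    raise ValueError(f"unknown choice: {entry!r}")
```

The same helper would work for both steps: first over provider names, then over the known or discovered model names.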

I suggest doing each of those stages in a separate PR.

Would that give you the features you're looking for, even if a bit different than you've implemented it currently?
