AbstractJsonExtractorOutputGuardrail behavior #1417

lpedrov · 2025-04-09T11:34:35Z

The AbstractJsonExtractorOutputGuardrail class injects the JsonGuardrailsUtils class, which uses the ObjectMapper without any kind of configuration. As a result, any DTO will always be considered valid as long as the JSON itself is valid.
Is this the expected behavior?

Shouldn't the ObjectMapper be configured with FAIL_ON_UNKNOWN_PROPERTIES set to true?

new ObjectMapper() .configure(DeserializationFeature.FAIL_ON_UNKNOWN_PROPERTIES, true) .readValue(responseFromLLM.text(), ExampleDto.class);

The text was updated successfully, but these errors were encountered:

geoand · 2025-04-09T11:37:55Z

Good question. @cescoffier @mariofusco WDYT?

cescoffier · 2025-04-09T11:42:13Z

Hum, I would have used the Quarkus mapper. Why don't we do so?

Anyway, yes, I agree.

geoand · 2025-04-09T11:43:53Z

Yeah, probably just an oversight. But even so, I am not sure the problem here would be addressed (and TBH, I am still not convinced it should be, because do we honestly want to fail if the LLM returns more fields than we expected?)

cescoffier · 2025-04-09T11:46:59Z

That’s a good point. How strict do we want to be?

…

On Wed 9 Apr 2025 at 13:44, Georgios Andrianakis ***@***.***> wrote: Yeah, probably just an oversight. But even so, I am not sure the problem here would be addressed (and TBH, I am still not convinced it should be, because do we honestly want to fail if the LLM returns more fields than we expected?) — Reply to this email directly, view it on GitHub <#1417 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AADCG7LAJH76Z7BAME5WDCL2YUBY7AVCNFSM6AAAAAB2YXMRNSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOOBZGQYDGOBUG4> . You are receiving this because you were mentioned.Message ID: ***@***.***> *geoand* left a comment (quarkiverse/quarkus-langchain4j#1417) <#1417 (comment)> Yeah, probably just an oversight. But even so, I am not sure the problem here would be addressed (and TBH, I am still not convinced it should be, because do we honestly want to fail if the LLM returns more fields than we expected?) — Reply to this email directly, view it on GitHub <#1417 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AADCG7LAJH76Z7BAME5WDCL2YUBY7AVCNFSM6AAAAAB2YXMRNSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOOBZGQYDGOBUG4> . You are receiving this because you were mentioned.Message ID: ***@***.***>

geoand · 2025-04-09T11:56:40Z

I personally think that what is proposed here should not be done (or if it is, should not be the default) as we have no way of knowing whether an LLM will decide to include additional fields that might be totally irrelevant.

lpedrov · 2025-04-09T13:24:07Z

I understand. But then the purpose of the class might be misleading, because in reality it’s only validating the JSON. And it seemingly appears to be redundant, since—as documented—this can be done in a simpler way.

mariofusco · 2025-04-09T13:24:29Z

I personally think that what is proposed here should not be done (or if it is, should not be the default) as we have no way of knowing whether an LLM will decide to include additional fields that might be totally irrelevant.

I agree on this, I would at least keep the current behavior as default. We could make this configurable, or allow to plug your own ObjectMapper.

mariofusco · 2025-04-09T13:35:41Z

I understand. But then the purpose of the class might be misleading, because in reality it’s only validating the JSON. And it seemingly appears to be redundant, since—as documented—this can be done in a simpler way.

The AbstractJsonExtractorOutputGuardrail is an utility class. It's perfectly fine if you want to implement something similar in your own OutputGuardrail with less, more or different features. I disagree on the fact that it is totally trivial or redundant as you wrote, even because it also programmatically tries to recover from a quite common hallucination where the LLM responds with a valid json, but prepend or append to it some explanation on how it generated that json, thus breaking the parser.

lpedrov · 2025-04-09T14:00:25Z

When I read the documentation, and without seeing the class source code, the first thing I thought was that it would validate both the JSON and the deserialization.
After running a couple of tests, I realized that wasn’t exactly the case.
But I agree - if a custom ObjectMapper could be used, that would be fantastic.

geoand · 2025-04-09T14:01:37Z

It definitely could be made so

mariofusco · 2025-04-09T14:06:27Z

When I read the documentation, and without seeing the class source code, the first thing I thought was that it would validate both the JSON and the deserialization. After running a couple of tests, I realized that wasn’t exactly the case. But I agree - if a custom ObjectMapper could be used, that would be fantastic.

At this point I'm afraid that I don't understand what you mean with "validate the deserialization". The guardrail tries to deserialize the json into an instance of the target class and fails if it is not able to do so. Isn't this equivalent to validate the deserialization (and actually also performing it)?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AbstractJsonExtractorOutputGuardrail behavior #1417

AbstractJsonExtractorOutputGuardrail behavior #1417

lpedrov commented Apr 9, 2025

geoand commented Apr 9, 2025

cescoffier commented Apr 9, 2025

geoand commented Apr 9, 2025

cescoffier commented Apr 9, 2025 via email

geoand commented Apr 9, 2025

lpedrov commented Apr 9, 2025

mariofusco commented Apr 9, 2025

mariofusco commented Apr 9, 2025

lpedrov commented Apr 9, 2025

geoand commented Apr 9, 2025

mariofusco commented Apr 9, 2025

AbstractJsonExtractorOutputGuardrail behavior #1417

AbstractJsonExtractorOutputGuardrail behavior #1417

Comments

lpedrov commented Apr 9, 2025

geoand commented Apr 9, 2025

cescoffier commented Apr 9, 2025

geoand commented Apr 9, 2025

cescoffier commented Apr 9, 2025 via email

geoand commented Apr 9, 2025

lpedrov commented Apr 9, 2025

mariofusco commented Apr 9, 2025

mariofusco commented Apr 9, 2025

lpedrov commented Apr 9, 2025

geoand commented Apr 9, 2025

mariofusco commented Apr 9, 2025