Tests for causal prediction #1321

maartenvanhooftds · 2025-06-02T05:48:28Z

Inspired by Issue 1313.

Changes:

Added some unit tests
Created a test for ERM and CACM based on this notebook, but with some changes here and there, e.g. such that we don't have to download data during tests.

First contribution here, please review critically for any mistakes or inconsistencies!

tests/causal_prediction/test_causal_prediction_algorithms.py

Signed-off-by: maartenvanhooft <[email protected]>

tests/conftest.py

amit-sharma

thanks for adding this PR, @maartenvanhooftds . The tests make sense, but I'm wondering if we can have a stronger test that compares CACM and ERM.
How about the following property: difference in accuracy between a test dataset from the same distribution as train, and the main test dataset? That would be higher for ERM and we can check as a comparison assert. Can you add this for your setup?

tests/causal_prediction/test_causal_prediction_algorithms.py

amit-sharma · 2025-06-08T11:57:19Z

tests/causal_prediction/test_causal_prediction_algorithms.py

+    results = trainer.test(algorithm, dataloaders=loaders["test_loaders"])
+    assert isinstance(results, list)
+    assert len(results) > 0
+    for r in results:


does the code assume that both ERM and CACM will lead to >0.7 accuracy? What is the exclusive property that we can testing for CACM?

tests/conftest.py

maartenvanhooftds · 2025-06-10T18:37:27Z

Great feedback, thanks! Will implement it later this week.

Edit: Sorry, it has been taking a bit longer, just started a new job. It's still on my mind though.

Signed-off-by: maartenvanhooft <[email protected]>

maartenvanhooftds force-pushed the causal-prediction-tests branch 2 times, most recently from 5df95b8 to 58a2504 Compare June 2, 2025 05:56

maartenvanhooftds changed the title ~~Tests for causal prediction algorithms~~ Tests for causal prediction Jun 2, 2025

maartenvanhooftds commented Jun 2, 2025

View reviewed changes

tests/causal_prediction/test_causal_prediction_algorithms.py Outdated Show resolved Hide resolved

maartenvanhooftds marked this pull request as ready for review June 2, 2025 06:00

maartenvanhooftds force-pushed the causal-prediction-tests branch from 58a2504 to 424755d Compare June 2, 2025 12:43

Tests for causal prediction

17a97cb

Signed-off-by: maartenvanhooft <[email protected]>

maartenvanhooftds force-pushed the causal-prediction-tests branch from 424755d to 17a97cb Compare June 2, 2025 12:44

maartenvanhooftds commented Jun 2, 2025

View reviewed changes

tests/conftest.py Show resolved Hide resolved

amit-sharma reviewed Jun 8, 2025

View reviewed changes

Add comparison test for ERM to degrade more than CACM

f228a4e

Signed-off-by: maartenvanhooft <[email protected]>

maartenvanhooftds force-pushed the causal-prediction-tests branch from 2fec8f0 to f228a4e Compare June 22, 2025 07:13

maartenvanhooftds requested a review from amit-sharma June 22, 2025 07:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Tests for causal prediction #1321

Tests for causal prediction #1321

Uh oh!

maartenvanhooftds commented Jun 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

amit-sharma left a comment

Uh oh!

Uh oh!

amit-sharma Jun 8, 2025

Uh oh!

Uh oh!

maartenvanhooftds commented Jun 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

Tests for causal prediction #1321

Are you sure you want to change the base?

Tests for causal prediction #1321

Uh oh!

Conversation

maartenvanhooftds commented Jun 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

amit-sharma left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

amit-sharma Jun 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

maartenvanhooftds commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

maartenvanhooftds commented Jun 2, 2025 •

edited

Loading

maartenvanhooftds commented Jun 10, 2025 •

edited

Loading