ODSC 68580- Update Evaluation SDK to Support Multi-Model Deployment #1085

elizjo · 2025-02-25T23:47:03Z

The "name" parameter in the model parameters previously used the same default name for every deployment. It now has to carry the model's specific name.

This PR adds a method that validates the model name a user supplies when targeting a single model for evaluation within a multi-model deployment.

Added method validate_name_multi_model():

Used in AquaEvaluationApp.create() in evaluation.py.
Assumes that the UI and the evaluation handler send back the user's input model parameters, which include the "name" parameter whose value is the model-specific name.
Checks the DataScienceModel metadata (configuration setting) for the model-specific names and verifies that the user's input matches one of them.
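
The check described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code in evaluation.py: AquaValueError is a local stand-in class, the function takes a plain list of names and a dict of model parameters (both assumptions made for the example), and the "model" key follows the diff excerpt quoted later in this thread.

```python
class AquaValueError(ValueError):
    """Local stand-in for the SDK's error type (assumption for this sketch)."""


def validate_model_name(valid_names, model_parameters):
    """Raise AquaValueError unless the requested name is one of valid_names.

    valid_names: names read from the deployment metadata (hypothetical input).
    model_parameters: the user's model parameters as a plain dict.
    """
    requested = model_parameters.get("model")
    listing = ", ".join(valid_names)

    # Case 1: the user did not supply a name at all.
    if not requested:
        raise AquaValueError(
            "Provide the model name. The valid model names for this "
            f"Model Deployment are {listing}."
        )

    # Case 2: the supplied name does not match any deployed model.
    if requested not in valid_names:
        raise AquaValueError(
            f"Provide a valid model name; '{requested}' does not match. "
            f"The valid model names for this Model Deployment are {listing}."
        )
```

Calling validate_model_name(["model_a", "model_b"], {"model": "model_a"}) returns without error, while an empty dict or an unknown name raises with the valid names listed in the message.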

Wrote test cases in test_deployment() in the test_validate_multi_model_evaluation() method.

See the Jira ticket for screenshots of the passing test cases.

github-actions · 2025-02-26T00:16:35Z

📌 Cov diff with main:

📌 Overall coverage:

mrDzurb · 2025-02-26T00:35:34Z

ads/aqua/evaluation/evaluation.py

+                        aqua_model, create_aqua_evaluation_details
+                    )
+
+            except (AquaRuntimeError, AquaValueError) as err:


I think we don't need to add AquaValueError to the except section. It will work by itself.

I see. I will remove except (AquaRuntimeError, AquaValueError) as err: and the try block. The evaluation_handler.py post method (that calls .create() method here) has the @handle_exceptions decorator, which has the except AquaError as error. I believe that the decorator will catch the exception, so this code is redundant here.
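
To illustrate why the inner try/except is redundant, here is a minimal sketch of the pattern. It assumes only what the reply states, namely that a decorator on the handler catches AquaError. The class definitions, the status attribute, and the dict-shaped response are hypothetical stand-ins, not the SDK's actual implementation.

```python
import functools


class AquaError(Exception):
    """Stand-in base error carrying an HTTP-style status (assumption)."""

    status = 500


class AquaValueError(AquaError):
    """Stand-in for a validation error; a subclass of AquaError."""

    status = 400


def handle_exceptions(func):
    """Convert any AquaError raised below the handler into a response."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        try:
            return func(*args, **kwargs)
        except AquaError as error:
            # Subclasses such as AquaValueError are caught here too.
            return {"status": error.status, "message": str(error)}

    return wrapper


def create(model_name):
    """Plays the role of create(): raises and does not catch anything."""
    if not model_name:
        raise AquaValueError("Provide the model name.")
    return {"status": 200, "model": model_name}


@handle_exceptions
def post(model_name):
    """Plays the role of the handler's post() method."""
    return create(model_name)
```

Because the exception propagates from create() up to the decorated post(), a second except clause inside create() would add nothing in this arrangement.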

mrDzurb · 2025-02-26T00:36:41Z

ads/aqua/evaluation/evaluation.py

@@ -550,6 +571,43 @@ def create(
            parameters=AquaEvalParams(),
        )

+    @staticmethod
+    def validate_name_multi_model(


Let's name it validate_model_name? Also, please add a description for this method.

fixed the name to validate_model_name

mrDzurb · 2025-02-26T00:37:36Z

ads/aqua/evaluation/evaluation.py

+    def validate_name_multi_model(
+        evaluation_source: DataScienceModel,
+        create_aqua_evaluation_details: CreateAquaEvaluationDetails,
+    ):


NIT: add -> None as the return type?

mrDzurb · 2025-02-26T00:40:40Z

ads/aqua/evaluation/evaluation.py

+                f"User did not input model name for multi model deployment evaluation with evaluation source ID: {create_aqua_evaluation_details.evaluation_source_id}"
+            )
+            raise AquaValueError(
+                "Provide the model name. For evaluation, a single model needs to be targeted using the name in the multi model deployment."


I think we can show the valid names in this message.

Provide the model name. For evaluation, a single model needs to be targeted using the name in the multi model deployment. The valid model names for this Model Deployment are {valid_model_names}.

mrDzurb · 2025-02-26T00:41:34Z

tests/unitary/with_extras/aqua/test_evaluation.py

@@ -449,7 +451,7 @@ def test_create_evaluation(
        mock_from_id.return_value = foundation_model

        experiment = MagicMock()
-        experiment.id = "test_experiment_id"
+        experiment.id = "ocid1.datasciencemodelversionset.oc1.iad.amaaaaaav66vvniakngdzelb5hcgjd6yvfejksu2excidvvi3s5s5whtmdea"


Let's not use real IDs?

darenr · 2025-02-26T02:01:17Z

If someone (correctly) specifies a model name for a single-model deployment, is that valid, or does that case require the old/current "odsc_model" (or whatever it is)?

mrDzurb · 2025-02-26T04:52:32Z

ads/aqua/evaluation/evaluation.py

+        custom_metadata_list = evaluation_source.custom_metadata_list
+        user_model_name = user_model_parameters.get("model")
+
+        model_group_count = int(


Wouldn't this fail if ModelCustomMetadataFields.MULTIMODEL_GROUP_COUNT is missing from the custom metadata?
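
One defensive way to handle the concern raised here is sketched below. It is a minimal illustration under stated assumptions: the metadata is modelled as a plain dict rather than the SDK's custom metadata list, and the key string "multimodel_group_count" is a hypothetical placeholder for ModelCustomMetadataFields.MULTIMODEL_GROUP_COUNT.

```python
def read_group_count(metadata, key="multimodel_group_count"):
    """Return the model group count as an int, with explicit error messages.

    metadata: plain dict standing in for the custom metadata (assumption).
    key: hypothetical key name used only for this sketch.
    """
    raw = metadata.get(key)
    if raw is None:
        # Missing key: fail with a clear message instead of int(None).
        raise ValueError(f"Custom metadata is missing the '{key}' entry.")
    try:
        return int(raw)
    except (TypeError, ValueError) as err:
        # Present but not numeric, for example an empty string.
        raise ValueError(
            f"Custom metadata entry '{key}' is not an integer: {raw!r}"
        ) from err
```

In the real method the raised error would more likely be the SDK's own error type so that the handler can report it to the user.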

mrDzurb · 2025-02-26T04:55:29Z

ads/aqua/evaluation/evaluation.py

+            for idx in range(model_group_count)
+        ]
+
+        valid_model_names = ", ".join(map(str, model_names))


Why do we need map() here? Is it for the case where a name is None?

Yes. I have changed it so that None values are no longer included.
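
A minimal sketch of the change described in this reply, assuming the names arrive as a plain list that may contain None. The helper name join_model_names is hypothetical. Passing None directly to str.join raises TypeError, and map(str, ...) would print the literal text "None", so filtering first avoids both.

```python
def join_model_names(names):
    """Join model names with ', ', skipping entries that are None."""
    return ", ".join(str(name) for name in names if name is not None)


valid_model_names = join_model_names(["model_a", None, "model_b"])
print(valid_model_names)  # prints: model_a, model_b
```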

elizjo · 2025-02-26T19:32:02Z

@darenr we would only allow the user to enter the name if the Model Deployment selected is a multi model deployment (we use a freeform tag to ID these deployments).

Otherwise, the user will not be able to enter the name and we would be keeping the odsc_llm (default name for MD).

Hope this clarifies your question!

added unit tests and finished validation method in evaluation.py

fd292ab

elizjo requested review from darenr, mayoor, mrDzurb, VipulMascarenhas, qiuosier and ahosler as code owners February 25, 2025 23:47

oracle-contributor-agreement bot added the OCA Verified label (All contributors have signed the Oracle Contributor Agreement.) Feb 25, 2025

elizjo changed the base branch from main to feature/multi_model_deployment February 25, 2025 23:47

fixed model parameter name

a4e2da0

mrDzurb reviewed Feb 26, 2025

added docstring, fixed PR comments

b25fa2e

mrDzurb reviewed Feb 26, 2025

fixed PR comments

7e9d46e

mrDzurb approved these changes Feb 26, 2025

elizjo merged commit 67c6891 into feature/multi_model_deployment Feb 26, 2025
2 checks passed
