Enable AQUA SDK & CLI to Deploy Fine-Tuned LLMs in Multi-Model Deployment #1175

elizjo · 2025-05-05T20:49:25Z

This PR aims to add support for using fine-tuned models in a Multi-Model deployment.

When deploying a model group that includes a fine-tuned model:

Added the FT_MODEL parameter (the output path in the OCI bucket for the fine-tuned model).
We extract FT_MODEL from the fine-tuned single-model deployment. The model_path is assumed to be the base model of the fine-tuned model.

{ "models": [ { "params": "--served-model-name Llama-3.2-11B-Vision --enforce-eager --max-num-seqs 16 --tensor-parallel-size 2 --max-model-len 16000", "FT_MODEL": "oci://model-path", "model_path": "Llama-3.2-11B-Vision", "model_task": "image-text-to-text" }, ...
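For illustration, one entry of the "models" list could be assembled as in the sketch below. The helper name `build_model_entry` is hypothetical and not part of this PR; only the keys shown in the example config are taken from it. Omitting FT_MODEL for base models is an assumption, not something the description states.

```python
import json
from typing import Dict, Optional


def build_model_entry(
    model_path: str,
    model_task: str,
    params: str,
    ft_model: Optional[str] = None,
) -> Dict[str, str]:
    """Build one entry of the "models" list (hypothetical helper)."""
    entry = {"params": params, "model_path": model_path, "model_task": model_task}
    # Assumption: FT_MODEL is added only for fine-tuned models and points at
    # the fine-tuned output path; base models simply leave the key out.
    if ft_model:
        entry["FT_MODEL"] = ft_model
    return entry


config = {
    "models": [
        build_model_entry(
            model_path="Llama-3.2-11B-Vision",
            model_task="image-text-to-text",
            params="--served-model-name Llama-3.2-11B-Vision --tensor-parallel-size 2",
            ft_model="oci://model-path",
        )
    ]
}
print(json.dumps(config, indent=2))
```

Here model_path still names the base model, matching the assumption stated in the description, while FT_MODEL carries the fine-tuned output location.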

-actions · 2025-05-05T21:19:41Z

📌 Cov diff with main:

📌 Overall coverage:

mrDzurb · 2025-05-06T22:27:10Z

ads/aqua/model/model.py

@@ -311,6 +312,12 @@ def create_multi(
 # "Currently only service models are supported for multi model deployment."
 # )

+is_fine_tuned_model = Tags.AQUA_FINE_TUNED_MODEL_TAG in source_model.freeform_tags


Could you add a comment explaining what we are doing here?
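A commented version of the check might look like the sketch below. It is only a stand-alone illustration: the constant is a placeholder for `Tags.AQUA_FINE_TUNED_MODEL_TAG` (its value here is assumed), `is_fine_tuned` is a hypothetical helper, and the `None` guard is an extra precaution that the diff itself does not include.

```python
from typing import Dict, Optional

# Placeholder standing in for Tags.AQUA_FINE_TUNED_MODEL_TAG; the value is assumed.
AQUA_FINE_TUNED_MODEL_TAG = "aqua_fine_tuned_model"


def is_fine_tuned(freeform_tags: Optional[Dict[str, str]]) -> bool:
    """Return True when the model's freeform tags mark it as fine-tuned."""
    # Fine-tuned models carry a dedicated freeform tag. Its presence (not its
    # value) tells us the fine-tuned weights need separate handling.
    # `or {}` guards against a model whose tags are None.
    return AQUA_FINE_TUNED_MODEL_TAG in (freeform_tags or {})


print(is_fine_tuned({AQUA_FINE_TUNED_MODEL_TAG: "ocid#name"}))  # True
print(is_fine_tuned(None))  # False
```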

mrDzurb · 2025-05-06T22:40:06Z

ads/aqua/finetuning/entities.py

@@ -179,3 +185,42 @@ class CreateFineTuningDetails(Serializable):

 class Config:
 extra = "ignore"
+
+
+def extract_base_model_ocid(aqua_model: DataScienceModel) -> Tuple[str, str]:


The name of the function is a bit confusing: it says that it extracts the base model OCID, but it returns both the name and the OCID.
Also, I think it would be better to create a model/utils.py module and move this logic there. The same goes for set_finetune_env_var.

elizjo · 2025-05-09

Moved the logic as suggested.

mrDzurb · 2025-05-06T22:56:34Z

ads/aqua/finetuning/entities.py

+
+env_var.update({"FT_MODEL": f"{fine_tune_output_path}"})
+
+return env_var


Looks like we are already modifying the original env_var, so we probably don't need to return anything from this function?

elizjo · 2025-05-09

Changed, see the new commit.
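The point about in-place modification can be sketched as follows. The function name `set_ft_model` and the example path are hypothetical; the snippet only demonstrates that updating the dictionary the caller passed in makes a return value redundant.

```python
from typing import Dict


def set_ft_model(env_var: Dict[str, str], fine_tune_output_path: str) -> None:
    """Record the fine-tuned output path on the caller's env_var dict."""
    # dict.update mutates env_var in place, so the caller sees the change
    # without the function returning anything.
    env_var.update({"FT_MODEL": fine_tune_output_path})


env = {"MODEL": "Llama-3.2-11B-Vision"}
result = set_ft_model(env, "oci://bucket@namespace/prefix")  # example path only
print(env["FT_MODEL"])
print(result)
```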

mrDzurb · 2025-05-06T22:59:14Z

Could you add more description to the PR? It would also be helpful to include details on what the new config will look like.

mrDzurb · 2025-05-12T19:00:12Z

ads/aqua/common/entities.py

@@ -157,6 +157,8 @@ class AquaMultiModelRef(Serializable):
 Optional environment variables to override during deployment.
 artifact_location : Optional[str]
 Artifact path of model in the multimodel group.
+fine_tune_artifact : Optional[str]


I think it would be better to be more specific. According to the comment, this is more about fine_tuned_weights_location, right?

mrDzurb · 2025-05-13T19:11:47Z

ads/aqua/model/utils.py

+return config_source_id, model_name
+
+
+def set_fine_tune_env_var(aqua_model: DataScienceModel, env_var: Optional[Dict[str,str]], model: Optional[AquaMultiModelRef] = None) -> None:


I'm not sure we really want to combine two flows in one scope. Do we really need to set up the FT_MODEL env variable for the multi-model deployment case? If not, I would prefer to keep the single-model and multi-model deployment logic separate.

added inital code for supporting FT models in multi model

b38db75

elizjo requested review from darenr, mayoor, mrDzurb, VipulMascarenhas, qiuosier and ahosler as code owners May 5, 2025 20:49

oracle-contributor-agreement bot added the "OCA Verified" label (all contributors have signed the Oracle Contributor Agreement) May 5, 2025

fixed error caused by @staticmethod

8e94a86

mrDzurb reviewed May 6, 2025

addressed comments

1661f1a

added artifact location for fine tuned model

deae57b

modified unit tests to include fine tuned models

f5e1bb0

Merge branch 'main' into ODSC-72158/FT_multi_model_support

e9fb724

mrDzurb reviewed May 13, 2025

Merge branch 'main' into ODSC-72158/FT_multi_model_support

ec1b162
