Databricks Deployment

Deploy Scalable, Production-Ready ML Models with Databricks and ZenML

Integrate Databricks Model Serving with ZenML to deploy and serve AI models effortlessly. This integration provides a unified interface to deploy, govern, and query models, leveraging Databricks' managed infrastructure for scalability and enterprise security.

Features with ZenML

  • Seamless model deployment to Databricks Inference Endpoints directly from ZenML pipelines
  • Switch between MLflow and Databricks Model Deployers without changing pipeline code
  • Secure model deployment into VPC-accessible endpoints for enterprise security
  • Scale model serving with dedicated, autoscaling infrastructure managed by Databricks
  • Turn models into production-ready APIs with minimal infrastructure or MLOps overhead

Main Features

  • Unified interface to deploy, govern, and query models
  • Dedicated and autoscaling infrastructure for model serving
  • Secure model deployment into VPC-accessible endpoints
  • Support for a variety of workload sizes and types (CPU, GPU)
  • Integration with Databricks Model Registry for model versioning and management
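
Under the hood, a Databricks serving endpoint is queried over REST. The sketch below shows how an invocation request is typically shaped, assuming the documented `POST /serving-endpoints/<name>/invocations` route and the `dataframe_records` input format; the workspace URL and endpoint name are hypothetical placeholders.

```python
import json


def build_invocation_request(
    workspace_url: str, endpoint_name: str, records: list
) -> tuple:
    """Build the URL and JSON body for a Databricks Model Serving call.

    Assumes the documented invocations route and the 'dataframe_records'
    input format; other formats (e.g. 'inputs', 'dataframe_split') also exist.
    """
    url = f"{workspace_url}/serving-endpoints/{endpoint_name}/invocations"
    body = json.dumps({"dataframe_records": records}).encode("utf-8")
    return url, body


# Hypothetical workspace and endpoint names:
url, body = build_invocation_request(
    "https://my-workspace.cloud.databricks.com",
    "databricks_deployment",
    [{"feature_a": 1.0, "feature_b": 2.0}],
)
```

When a model is deployed through ZenML's Databricks Model Deployer, this request/response cycle is handled for you by the returned deployment service.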

How to use ZenML with Databricks Deployment


from typing import Optional

from typing_extensions import Annotated

from zenml import ArtifactConfig, get_step_context, pipeline, step
from zenml.client import Client
from zenml.integrations.databricks.services.databricks_deployment import (
    DatabricksDeploymentConfig,
    DatabricksDeploymentService,
)
from zenml.logger import get_logger

logger = get_logger(__name__)


@step(enable_cache=False)
def deployment_deploy() -> (
    Annotated[
        Optional[DatabricksDeploymentService],
        ArtifactConfig(
            name="databricks_deployment", is_deployment_artifact=True
        ),
    ]
):
    """Deploy the model version attached to this run as a predictor service."""
    zenml_client = Client()
    model_deployer = zenml_client.active_stack.model_deployer
    # The ZenML model version associated with the current pipeline run.
    model = get_step_context().model
    databricks_deployment_config = DatabricksDeploymentConfig(
        model_name=model.name,
        model_version=model.run_metadata["model_registry_version"].value,
        workload_size="Small",
        workload_type="CPU",
        scale_to_zero_enabled=True,
        endpoint_secret_name="databricks_token",
    )
    deployment_service = model_deployer.deploy_model(
        config=databricks_deployment_config,
        service_type=DatabricksDeploymentService.SERVICE_TYPE,
        timeout=1200,
    )
    logger.info(
        f"The deployed service info: {model_deployer.get_model_server_info(deployment_service)}"
    )
    return deployment_service


# notify_on_failure and notify_on_success are user-defined notification
# steps (not shown here).
@pipeline(on_failure=notify_on_failure)
def databricks_deploy_pipeline():
    deployment_deploy()
    notify_on_success(after=["deployment_deploy"])


if __name__ == "__main__":
    databricks_deploy_pipeline()
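
Once the endpoint is live, its responses follow the standard Model Serving shape: a JSON object with a `predictions` field. Below is a minimal parsing helper; the response shape is assumed from Databricks' serving documentation, and real responses may carry additional metadata.

```python
import json


def parse_predictions(response_body: str) -> list:
    """Extract the predictions list from a Databricks serving response.

    Assumes the documented response shape: {"predictions": [...]}.
    Raises on anything else rather than guessing at variants.
    """
    payload = json.loads(response_body)
    if "predictions" not in payload:
        raise ValueError(f"Unexpected response shape: {list(payload)}")
    return payload["predictions"]


# Example response body a serving endpoint might return:
print(parse_predictions('{"predictions": [0.12, 0.87]}'))  # → [0.12, 0.87]
```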
