Skip to main content

databricks.labs.dqx.anomaly.single_model_scorer

Single-model anomaly scoring (distributed UDF and driver-local).

create_scoring_udf

def create_scoring_udf(model_bytes: bytes, engineered_feature_cols: list[str],
schema: StructType)

Create pandas UDF for distributed scoring.

create_scoring_udf_with_contributions

def create_scoring_udf_with_contributions(model_bytes: bytes,
engineered_feature_cols: list[str],
schema: StructType)

Create pandas UDF for distributed scoring with SHAP contributions.

score_with_sklearn_model

def score_with_sklearn_model(model_uri: str,
df: DataFrame,
feature_cols: list[str],
feature_metadata_json: str,
merge_columns: list[str],
enable_contributions: bool = False,
*,
model_record: AnomalyModelRecord) -> DataFrame

Score DataFrame using scikit-learn model with distributed pandas UDF.

score_with_sklearn_model_local

def score_with_sklearn_model_local(
model_uri: str,
df: DataFrame,
feature_cols: list[str],
feature_metadata_json: str,
merge_columns: list[str],
enable_contributions: bool = False,
*,
model_record: AnomalyModelRecord) -> DataFrame

Score DataFrame using scikit-learn model locally on the driver.