Command-Line Interface

The MLflow command-line interface (CLI) provides a simple interface to various functionality in MLflow. You can use the CLI to run projects, start the tracking UI, create and list experiments, download run artifacts, serve MLflow Python Function and scikit-learn models, serve MLflow Python Function and scikit-learn models, and serve models on Microsoft Azure Machine Learning and Amazon SageMaker.

Each individual command has a detailed help screen accessible via mlflow command_name --help.

Attention

It is advisable to set the MLFLOW_TRACKING_URI environment variable by default, as the CLI does not automatically connect to a tracking server. Without this, the CLI will default to using the local filesystem where the command is executed, rather than connecting to a localhost or remote HTTP server. Setting MLFLOW_TRACKING_URI to the URL of your desired tracking server is required for most of the commands below.

Table of Contents

mlflow
- agent
- ai-commands
- artifacts
- assistant
- autolog
- crypto
- datasets
- db
- demo
- deployments
- doctor
- experiments
- gateway
- gc
- mcp
- migrate-filestore
- models
- run
- runs
- sagemaker
- scorers
- server
- skills
- traces

mlflow

Usage

mlflow [OPTIONS] COMMAND [ARGS]...

Options

--version: Show the version and exit.

--env-file <env_file>: Load environment variables from a dotenv file before executing the command. Variables in the file will be loaded but won’t override existing environment variables.

agent

Coding-agent integrations for MLflow (prototype).

Usage

mlflow agent [OPTIONS] COMMAND [ARGS]...

setup

[Experimental] Install MLflow skills and launch a coding agent to instrument this repo.

Usage

mlflow agent setup [OPTIONS]

Options

--agent <agent_name>

Coding agent to set up. If omitted, picks from installed agents.

Options: claude | codex | opencode

--print: Print the composed task prompt to stdout and exit without launching the agent. Useful for passing the prompt into a custom invocation, e.g. claude –permission-mode auto “$(mlflow agent setup –agent claude –print)”.

ai-commands

Manage MLflow AI commands for LLMs.

Usage

mlflow ai-commands [OPTIONS] COMMAND [ARGS]...

get

Get a specific AI command by key.

Usage

mlflow ai-commands get [OPTIONS] KEY

Arguments

KEY: Required argument

list

List all available AI commands.

Usage

mlflow ai-commands list [OPTIONS]

Options

--namespace <namespace>: Filter commands by namespace

run

Get a command formatted for execution by an AI assistant.

Usage

mlflow ai-commands run [OPTIONS] KEY

Arguments

KEY: Required argument

artifacts

Upload, list, and download artifacts from an MLflow artifact repository.

To manage artifacts for a run associated with a tracking server, set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

Usage

mlflow artifacts [OPTIONS] COMMAND [ARGS]...

download

Download an artifact file or directory to a local directory. The output is the name of the file or directory on the local filesystem.

Either --artifact-uri or --run-id must be provided.

Usage

mlflow artifacts download [OPTIONS]

Options

-r, --run-id <run_id>: Run ID from which to download

-a, --artifact-path <artifact_path>: For use with Run ID: if specified, a path relative to the run’s root directory to download

-u, --artifact-uri <artifact_uri>: URI pointing to the artifact file or artifacts directory; use as an alternative to specifying –run_id and –artifact-path

-d, --dst-path <dst_path>: Path of the local filesystem destination directory to which to download the specified artifacts. If the directory does not exist, it is created. If unspecified the artifacts are downloaded to a new uniquely-named directory on the local filesystem, unless the artifacts already exist on the local filesystem, in which case their local path is returned directly

list

Return all the artifacts directly under run’s root artifact directory, or a sub-directory. The output is a JSON-formatted list.

Usage

mlflow artifacts list [OPTIONS]

Options

-r, --run-id <run_id>: Required Run ID to be listed

-a, --artifact-path <artifact_path>: If specified, a path relative to the run’s root directory to list.

log-artifact

Log a local file as an artifact of a run, optionally within a run-specific artifact path. Run artifacts can be organized into directories, so you can place the artifact in a directory this way.

Usage

mlflow artifacts log-artifact [OPTIONS]

Options

-l, --local-file <local_file>: Required Local path to artifact to log

-r, --run-id <run_id>: Required Run ID into which we should log the artifact.

-a, --artifact-path <artifact_path>: If specified, we will log the artifact into this subdirectory of the run’s artifact directory.

log-artifacts

Log the files within a local directory as an artifact of a run, optionally within a run-specific artifact path. Run artifacts can be organized into directories, so you can place the artifact in a directory this way.

Usage

mlflow artifacts log-artifacts [OPTIONS]

Options

-l, --local-dir <local_dir>: Required Directory of local artifacts to log

-r, --run-id <run_id>: Required Run ID into which we should log the artifact.

-a, --artifact-path <artifact_path>: If specified, we will log the artifact into this subdirectory of the run’s artifact directory.

assistant

MLflow Assistant - AI-powered trace analysis.

Run ‘mlflow assistant –configure’ to set up the assistant.

Usage

mlflow assistant [OPTIONS]

Options

--configure: Configure or reconfigure the assistant settings

autolog

Commands for autologging with MLflow.

Usage

mlflow autolog [OPTIONS] COMMAND [ARGS]...

claude

Set up Claude Code tracing in a directory.

This command installs the MLflow Claude plugin into Claude Code and writes MLflow configuration into .claude/settings.json. After setup, use the regular claude command and traces will be created by the plugin runtime.

Examples:

# Set up tracing in current directory with local storage mlflow autolog claude

# Set up tracing in a specific project directory mlflow autolog claude -d ~/my-project

# Set up tracing with Databricks mlflow autolog claude -u databricks -e 123456789

# Set up tracing with custom tracking URI mlflow autolog claude -u file://./custom-mlruns

# Disable tracing in current directory mlflow autolog claude –disable

Usage

mlflow autolog claude [OPTIONS] COMMAND [ARGS]...

Options

-d, --directory <directory>: Directory to set up tracing in (default: current directory)

-u, --tracking-uri <tracking_uri>: MLflow tracking URI (e.g., ‘databricks’ or ‘file://mlruns’)

-e, --experiment-id <experiment_id>: MLflow experiment ID

-n, --experiment-name <experiment_name>: MLflow experiment name

--disable: Disable Claude tracing (removes config from both settings.json and settings.local.json)

--status: Show current tracing status

--local: Write config to settings.local.json instead of settings.json during setup.

-y, --non-interactive: Skip prompts and use flags, environment variables, or defaults.

--mlflow-cmd <mlflow_cmd>: Deprecated and ignored. Python-based Claude hooks were replaced by the marketplace plugin runtime.

crypto

Commands for managing MLflow’s cryptographic passphrase.

Usage

mlflow crypto [OPTIONS] COMMAND [ARGS]...

rotate-kek

Rotate the KEK passphrase that is used for encryption and decryption.

Usage

mlflow crypto rotate-kek [OPTIONS]

Options

--new-passphrase <new_passphrase>: Required New KEK passphrase to use for encrypting and decrypting sensitive data.

--backend-store-uri <backend_store_uri>: URI of the backend store. If not specified, uses MLFLOW_TRACKING_URI.

-y, --yes: Skip confirmation prompt.

Environment variables

MLFLOW_BACKEND_STORE_URI: Provide a default for --backend-store-uri

datasets

Manage GenAI evaluation datasets.

Usage

mlflow datasets [OPTIONS] COMMAND [ARGS]...

list

List GenAI evaluation datasets associated with an experiment.

Examples:
# List datasets in experiment 1
mlflow datasets list –experiment-id 1

# Using environment variable
export MLFLOW_EXPERIMENT_ID=1
mlflow datasets list –max-results 10

# Filter datasets by name pattern

mlflow datasets list –experiment-id 1 –filter-string “name LIKE ‘qa_%’”

# Order results by last update time

mlflow datasets list –experiment-id 1 –order-by “last_update_time DESC”

# Output as JSON

mlflow datasets list –experiment-id 1 –output json

Usage

mlflow datasets list [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required Experiment ID to list datasets for. Can be set via MLFLOW_EXPERIMENT_ID env var.

--filter-string <filter_string>: Filter string (e.g., “name LIKE ‘qa_%’”).

--max-results <max_results>: Maximum results (default: 50).

--order-by <order_by>: Columns to order by (e.g., ‘last_update_time DESC’).

--page-token <page_token>: Pagination token.

--output <output>

Output format.

Options: table | json

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

db

Commands for managing an MLflow tracking database.

Usage

mlflow db [OPTIONS] COMMAND [ARGS]...

migrate-to-default-workspace

Move workspace-scoped resources into the default workspace.

IMPORTANT: This operation runs in a single transaction, but can still be long-running. Always take a backup of your database before running this command.

Usage

mlflow db migrate-to-default-workspace [OPTIONS] URL

Options

--dry-run, --no-dry-run

Check for conflicts and report how many rows would be moved.

Default: False

-v, --verbose: List all conflicts instead of truncating the output.

-y, --yes: Skip the confirmation prompt.

Arguments

URL: Required argument

move-resources

Move resources from one workspace to another.

Selectively move workspace-scoped resources between workspaces by name or tag filter (mutually exclusive). When neither –name nor –tag is specified, all resources of the given type in the source workspace are moved.

The –resource-type value is the database table name (e.g. experiments, registered_models, evaluation_datasets, webhooks, jobs).

Tag filtering (–tag) is supported for experiments and registered_models only. When multiple –tag flags are given, only resources matching ALL tags are included (AND logic).

Examples:

# Move specific experiments by name

mlflow db move-resources sqlite:///mlflow.db

–from default –to team-a –resource-type experiments

–name training-v1 –name training-v2

# Move experiments matching ALL specified tags

mlflow db move-resources sqlite:///mlflow.db

–from default –to team-a –resource-type experiments

–tag team=team-a –tag env=prod

# Move all registered models from one workspace to another

mlflow db move-resources sqlite:///mlflow.db

–from default –to team-a –resource-type registered_models

IMPORTANT: Always take a backup of your database before running this command.

Usage

mlflow db move-resources [OPTIONS] URL

Options

--from <source_workspace>: Required Source workspace name.

--to <target_workspace>: Required Target workspace name.

--resource-type <resource_type>: Required Table name of the resource type to move (e.g. experiments, registered_models).

--name <name>: Resource name(s) to move. Repeatable.

--tag <tag>: Tag filter as key=value. Repeatable. When multiple tags are given, only resources matching ALL tags are included.

--dry-run, --no-dry-run

Show what would be moved without making changes.

Default: False

-v, --verbose: List all conflicts instead of truncating the output.

-y, --yes: Skip the confirmation prompt.

Arguments

URL: Required argument

upgrade

Upgrade the schema of an MLflow tracking database to the latest supported version.

IMPORTANT: Schema migrations can be slow and are not guaranteed to be transactional - always take a backup of your database before running migrations. The migrations README, which is located at https://github.com/mlflow/mlflow/blob/master/mlflow/store/db_migrations/README.md, describes large migrations and includes information about how to estimate their performance and recover from failures.

Usage

mlflow db upgrade [OPTIONS] URL

Arguments

URL: Required argument

demo

Launch MLflow with pre-populated demo data for exploring GenAI features.

By default, creates a persistent environment in ./mlflow-demo/ with SQLite database and file-based artifacts, generates demo data, and opens the browser to the demo experiment. Data persists across restarts; use –refresh to regenerate.

To populate an existing MLflow server with demo data, use –tracking-uri:

mlflow demo # Launch new demo server mlflow demo –no-browser # Launch without opening browser mlflow demo –port 5001 # Use custom port mlflow demo –tracking-uri http://localhost:5000 # Use existing server

Usage

mlflow demo [OPTIONS]

Options

--port <port>: Port to run demo server on (only used when starting a new server).

--tracking-uri <tracking_uri>: Tracking URI of an existing MLflow server to populate with demo data.

--no-browser: Don’t automatically open browser to demo experiment.

--debug: Enable verbose logging output.

--refresh: Force regenerate demo data by deleting existing data first.

deployments

Deploy MLflow models to custom targets. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions in https://mlflow.org/docs/latest/plugins.html#community-plugins

You can also write your own plugin for deployment to a custom target. For instructions on writing and distributing a plugin, see https://mlflow.org/docs/latest/plugins.html#writing-your-own-mlflow-plugins.

Usage

mlflow deployments [OPTIONS] COMMAND [ARGS]...

create

Deploy the model at model_uri to the specified target.

Additional plugin-specific arguments may also be passed to this command, via -C key=value

Usage

mlflow deployments create [OPTIONS]

Options

--endpoint <endpoint>: Name of the endpoint

-C, --config <NAME=VALUE>: Extra target-specific config for the model deployment, of the form -C name=value. See documentation/help for your deployment target for a list of supported config options.

--name <name>: Required Name of the deployment

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-f, --flavor <flavor>: Which flavor to be deployed. This will be auto inferred if it’s not given

create-endpoint

Create an endpoint with the specified name at the specified target.

Additional plugin-specific arguments may also be passed to this command, via -C key=value

Usage

mlflow deployments create-endpoint [OPTIONS]

Options

-C, --config <NAME=VALUE>: Extra target-specific config for the endpoint, of the form -C name=value. See documentation/help for your deployment target for a list of supported config options.

--endpoint <endpoint>: Required Name of the endpoint

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

delete

Delete the deployment with name given at –name from the specified target.

Usage

mlflow deployments delete [OPTIONS]

Options

--endpoint <endpoint>: Name of the endpoint

-C, --config <NAME=VALUE>: Extra target-specific config for the model deployment, of the form -C name=value. See documentation/help for your deployment target for a list of supported config options.

--name <name>: Required Name of the deployment

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

delete-endpoint

Delete the specified endpoint at the specified target

Usage

mlflow deployments delete-endpoint [OPTIONS]

Options

--endpoint <endpoint>: Required Name of the endpoint

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

explain

Generate explanations of model predictions on the specified input for the deployed model for the given input(s). Explanation output formats vary by deployment target, and can include details like feature importance for understanding/debugging predictions. Run mlflow deployments help or consult the documentation for your plugin for details on explanation format. For information about the input data formats accepted by this function, see the following documentation: https://www.mlflow.org/docs/latest/models.html#built-in-deployment-tools

Usage

mlflow deployments explain [OPTIONS]

Options

--name <name>: Name of the deployment. Exactly one of –name or –endpoint must be specified.

--endpoint <endpoint>: Name of the endpoint. Exactly one of –name or –endpoint must be specified.

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

-I, --input-path <input_path>: Required Path to input prediction payload file. The file canbe a JSON (Python Dict) or CSV (pandas DataFrame). If the file is a CSV, the user must specifythe –content-type csv option.

-O, --output-path <output_path>: File to output results to as a JSON file. If not provided, prints output to stdout.

get

Print a detailed description of the deployment with name given at --name in the specified target.

Usage

mlflow deployments get [OPTIONS]

Options

--endpoint <endpoint>: Name of the endpoint

--name <name>: Required Name of the deployment

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

get-endpoint

Get details for the specified endpoint at the specified target

Usage

mlflow deployments get-endpoint [OPTIONS]

Options

--endpoint <endpoint>: Required Name of the endpoint

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

help

Display additional help for a specific deployment target, e.g. info on target-specific config options and the target’s URI format.

Usage

mlflow deployments help [OPTIONS]

Options

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

list

List the names of all model deployments in the specified target. These names can be used with the delete, update, and get commands.

Usage

mlflow deployments list [OPTIONS]

Options

--endpoint <endpoint>: Name of the endpoint

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

list-endpoints

List all endpoints at the specified target

Usage

mlflow deployments list-endpoints [OPTIONS]

Options

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

predict

Predict the results for the deployed model for the given input(s)

Usage

mlflow deployments predict [OPTIONS]

Options

--name <name>: Name of the deployment. Exactly one of –name or –endpoint must be specified.

--endpoint <endpoint>: Name of the endpoint. Exactly one of –name or –endpoint must be specified.

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

-I, --input-path <input_path>: Required Path to input prediction payload file. The file canbe a JSON (Python Dict) or CSV (pandas DataFrame). If the file is a CSV, the user must specifythe –content-type csv option.

-O, --output-path <output_path>: File to output results to as a JSON file. If not provided, prints output to stdout.

run-local

Deploy the model locally. This has very similar signature to create API

Usage

mlflow deployments run-local [OPTIONS]

Options

-C, --config <NAME=VALUE>: Extra target-specific config for the model deployment, of the form -C name=value. See documentation/help for your deployment target for a list of supported config options.

--name <name>: Required Name of the deployment

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-f, --flavor <flavor>: Which flavor to be deployed. This will be auto inferred if it’s not given

update

Update the deployment with ID deployment_id in the specified target. You can update the URI of the model and/or the flavor of the deployed model (in which case the model URI must also be specified).

Additional plugin-specific arguments may also be passed to this command, via -C key=value.

Usage

mlflow deployments update [OPTIONS]

Options

--endpoint <endpoint>: Name of the endpoint

-C, --config <NAME=VALUE>: Extra target-specific config for the model deployment, of the form -C name=value. See documentation/help for your deployment target for a list of supported config options.

--name <name>: Required Name of the deployment

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

-m, --model-uri <URI>: URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-f, --flavor <flavor>: Which flavor to be deployed. This will be auto inferred if it’s not given

update-endpoint

Update the specified endpoint at the specified target.

Additional plugin-specific arguments may also be passed to this command, via -C key=value

Usage

mlflow deployments update-endpoint [OPTIONS]

Options

-C, --config <NAME=VALUE>: Extra target-specific config for the endpoint, of the form -C name=value. See documentation/help for your deployment target for a list of supported config options.

--endpoint <endpoint>: Required Name of the endpoint

-t, --target <target>

Required Deployment target URI. Run mlflow deployments help –target-name <target-name> for more details on the supported URI format and config options for a given target. Support is currently installed for deployment to: faketarget, databricks, http, https, openai, sagemaker

See all supported deployment targets and installation instructions at https://mlflow.org/docs/latest/plugins.html#community-plugins

doctor

Prints out useful information for debugging issues with MLflow.

Usage

mlflow doctor [OPTIONS]

Options

--mask-envs: If set (the default behavior without setting this flag is not to obfuscate information), mask the MLflow environment variable values (e.g. “MLFLOW_ENV_VAR”: “***”) in the output to prevent leaking sensitive information.

experiments

Manage experiments. To manage experiments associated with a tracking server, set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

Usage

mlflow experiments [OPTIONS] COMMAND [ARGS]...

create

Create an experiment.

All artifacts generated by runs related to this experiment will be stored under artifact location, organized under specific run_id sub-directories.

Implementation of experiment and metadata store is dependent on backend storage. FileStore creates a folder for each experiment ID and stores metadata in meta.yaml. Runs are stored as subfolders.

Usage

mlflow experiments create [OPTIONS]

Options

-n, --experiment-name <experiment_name>: Required

-l, --artifact-location <artifact_location>: Base location for runs to store artifact results. Artifacts will be stored at $artifact_location/$run_id/artifacts. See https://mlflow.org/docs/latest/tracking.html#where-runs-are-recorded for more info on the properties of artifact location. If no location is provided, the tracking server will pick a default.

--trace-archival-retention <trace_archival_retention>: Configure the experiment-level trace archival retention override as a duration like ‘30d’ or ‘12h’. This only configures server-owned archival policy; it does not execute archival directly.

csv

Generate CSV with all runs for an experiment

Usage

mlflow experiments csv [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required

-o, --filename <filename>

delete

Mark an active experiment for deletion. This also applies to experiment’s metadata, runs and associated data, and artifacts if they are store in default location. Use list command to view artifact location. Command will throw an error if experiment is not found or already marked for deletion.

Experiments marked for deletion can be restored using restore command, unless they are permanently deleted.

Specific implementation of deletion is dependent on backend stores. FileStore moves experiments marked for deletion under a .trash folder under the main folder used to instantiate FileStore. Experiments marked for deletion can be permanently deleted by clearing the .trash folder. It is recommended to use a cron job or an alternate workflow mechanism to clear .trash folder.

Usage

mlflow experiments delete [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required

get

Get details of an experiment by ID or name.

Displays experiment information including name, artifact location, lifecycle stage, tags, creation time, and last update time.

Examples:

# Get experiment by ID in table format (default)
mlflow experiments get --experiment-id 1

# Get experiment by name
mlflow experiments get --experiment-name "My Experiment"

# Get experiment in JSON format
mlflow experiments get --experiment-name "My Experiment" --output json

# Using short options
mlflow experiments get -x 0
mlflow experiments get -n "Default"

Usage

mlflow experiments get [OPTIONS]

Options

-x, --experiment-id <experiment_id>: ID of the experiment to retrieve.

-n, --experiment-name <experiment_name>: Name of the experiment to retrieve.

--output <output>

Output format: ‘table’ (default) or ‘json’.

Options: json | table

rename

Renames an active experiment. Returns an error if the experiment is inactive.

Usage

mlflow experiments rename [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required

--new-name <new_name>: Required

restore

Restore a deleted experiment. This also applies to experiment’s metadata, runs and associated data. The command throws an error if the experiment is already active, cannot be found, or permanently deleted.

Usage

mlflow experiments restore [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required

search

Search for experiments in the configured tracking server.

Usage

mlflow experiments search [OPTIONS]

Options

-v, --view <view>: Select view type for experiments. Valid view types are ‘active_only’ (default), ‘deleted_only’, and ‘all’.

--max-results <max_results>: Maximum number of experiments to return. If not provided, returns all experiments.

update

Update experiment trace archival policy controls.

The trace archival options configure or request server-owned archival behavior. They do not execute archival work directly from the client.

Usage

mlflow experiments update [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required

--trace-archival-retention <trace_archival_retention>: Set the experiment-level trace archival retention override as a duration like ‘30d’ or ‘12h’. This only configures server-owned archival policy.

--clear-trace-archival-retention: Clear the experiment-level trace archival retention override so broader policy applies.

--trace-archive-now: Request archive-now processing for this experiment on the next scheduler pass. This only marks the experiment; it does not execute archival directly.

--trace-archive-now-older-than <trace_archive_now_older_than>: Request archive-now processing for traces older than the given duration on the next scheduler pass. This only marks the experiment; it does not execute archival directly.

--clear-trace-archive-now: Clear a pending archive-now request for this experiment.

gateway

Manage the MLflow Gateway service

Usage

mlflow gateway [OPTIONS] COMMAND [ARGS]...

start

Start the MLflow Gateway service

Usage

mlflow gateway start [OPTIONS]

Options

--config-path <config_path>: Required The path to the gateway configuration file.

--host <host>: The network address to listen on (default: 127.0.0.1).

--port <port>: The port to listen on (default: 5000).

--workers <workers>: The number of workers.

Environment variables

MLFLOW_GATEWAY_CONFIG: Provide a default for --config-path

gc

Permanently delete runs in the deleted lifecycle stage from the specified backend store. This command deletes all artifacts and metadata associated with the specified runs. If the provided artifact URL is invalid, the artifact deletion will be bypassed, and the gc process will continue.

Attention

If you are running an MLflow tracking server with artifact proxying enabled, you must set the MLFLOW_TRACKING_URI environment variable before running this command. Otherwise, the gc command will not be able to resolve artifact URIs and will not be able to delete the associated artifacts.

What gets deleted:

This command permanently removes:

Run metadata: Parameters, metrics, tags, and all other run information from the backend store
Artifacts: All files stored in the run’s artifact location (models, plots, data files, etc.)
Experiment metadata: When deleting experiments, removes the experiment record and all associated data
Job records: When using the –jobs flag, removes historical job records from the jobs table

Note

This command only considers lifecycle stage and the specified deletion criteria. It does not check for pinned runs, registered models, or tags. Pinning is a UI-only feature that has no effect on garbage collection. Runs must be in the deleted lifecycle stage before they can be permanently deleted.

Examples:

# Delete all runs that have been in the deleted state for more than 30 days
mlflow gc --older-than 30d

# Delete specific runs by ID (they must be in deleted state)
mlflow gc --run-ids 'run1,run2,run3'

# Delete all runs in specific experiments (experiments must be in deleted state)
mlflow gc --experiment-ids 'exp1,exp2'

# Combine criteria: delete runs older than 7 days in specific experiments
mlflow gc --older-than 7d --experiment-ids 'exp1,exp2'

# Delete deleted resources across all workspaces
mlflow gc --all-workspaces --older-than 30d

# Delete all finalized jobs older than 7 days (requires --jobs flag)
mlflow gc --jobs --older-than 7d

# Delete specific jobs by ID
mlflow gc --job-ids 'job1,job2,job3'

Usage

mlflow gc [OPTIONS]

Options

--older-than <older_than>: Optional. Remove run(s) older than the specified time limit. Specify a string in #d#h#m#s format. Float values are also supported. For example: –older-than 1d2h3m4s, –older-than 1.2d3h4m5s

--backend-store-uri <PATH>: URI of the backend store from which to delete runs. Acceptable URIs are SQLAlchemy-compatible database connection strings (e.g. ‘sqlite:///path/to/file.db’) or local filesystem URIs (e.g. ‘file:///absolute/path/to/directory’). By default, data will be deleted from the ./mlruns directory.

--artifacts-destination <URI>: The base artifact location from which to resolve artifact upload/download/list requests (e.g. ‘s3://my-bucket’). This option only applies when the tracking server is configured to stream artifacts and the experiment’s artifact root location is http or mlflow-artifacts URI. Otherwise, the default artifact location will be used.

--run-ids <run_ids>: Optional comma separated list of runs to be permanently deleted. If run ids are not specified, data is removed for all runs in the deleted lifecycle stage.

--experiment-ids <experiment_ids>: Optional comma separated list of experiments to be permanently deleted including all of their associated runs. If experiment ids are not specified, data is removed for all experiments in the deleted lifecycle stage.

--logged-model-ids <logged_model_ids>: Optional comma separated list of logged model IDs to be permanently deleted. If logged model IDs are not specified, data is removed for all logged models in the deleted lifecycle stage.

--jobs: Enable job cleanup. Without this flag, no jobs will be deleted. When enabled, all jobs are deleted unless filtered by –older-than or –job-ids. This option only works with database backends.

--job-ids <job_ids>: Optional comma separated list of job IDs to be permanently deleted. Can be used with or without –jobs flag. If –older-than is also specified, only jobs matching both filters are deleted.

--tracking-uri <tracking_uri>: Tracking URI to use for deleting ‘deleted’ runs e.g. http://127.0.0.1:8080

--workspace <workspace>: Target workspace for deletions when workspaces are enabled. Defaults to the active workspace (MLFLOW_WORKSPACE).

--all-workspaces: Delete deleted resources across all workspaces (workspace mode only).

Environment variables

MLFLOW_ARTIFACTS_DESTINATION: Provide a default for --artifacts-destination

MLFLOW_WORKSPACE: Provide a default for --workspace

mcp

Model Context Protocol (MCP) server for MLflow. MCP enables LLM applications to interact with MLflow traces programmatically.

Usage

mlflow mcp [OPTIONS] COMMAND [ARGS]...

run

Run the MLflow MCP server. This starts a server that exposes MLflow trace operations to MCP-compatible clients like Claude Desktop or other AI assistants.

Usage

mlflow mcp run [OPTIONS]

migrate-filestore

Migrate MLflow FileStore data to a SQLite database.

Usage

mlflow migrate-filestore [OPTIONS]

Options

--source <source>: Required Root directory containing mlruns/ FileStore data.

--target <target>: Required SQLite URI (e.g. sqlite:///mlflow.db).

--progress, --no-progress: Show per-experiment progress messages during migration.

models

Deploy MLflow models locally.

To deploy a model associated with a run on a tracking server, set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

Usage

mlflow models [OPTIONS] COMMAND [ARGS]...

build-docker

Builds a Docker image whose default entrypoint serves an MLflow model at port 8080, using the python_function flavor. The container serves the model referenced by --model-uri, if specified when build-docker is called. If --model-uri is not specified when build_docker is called, an MLflow Model directory must be mounted as a volume into the /opt/ml/model directory in the container.

Building a Docker image with --model-uri:

# Build a Docker image named 'my-image-name' that serves the model from run 'some-run-uuid'
# at run-relative artifact path 'my-model'
mlflow models build-docker --model-uri "runs:/some-run-uuid/my-model" --name "my-image-name"
# Serve the model
docker run -p 5001:8080 "my-image-name"

Building a Docker image without --model-uri:

# Build a generic Docker image named 'my-image-name'
mlflow models build-docker --name "my-image-name"
# Mount the model stored in '/local/path/to/artifacts/model' and serve it
docker run --rm -p 5001:8080 -v /local/path/to/artifacts/model:/opt/ml/model "my-image-name"

Important

Since MLflow 2.10.1, the Docker image built with --model-uri does not install Java for improved performance, unless the model flavor is one of ["johnsnowlabs", "h2o", "spark"]. If you need to install Java for other flavors, e.g. custom Python model that uses SparkML, please specify the --install-java flag to enforce Java installation.

NB: by default, the container will start nginx and uvicorn processes. If you don’t need the nginx process to be started (for instance if you deploy your container to Google Cloud Run), you can disable it via the DISABLE_NGINX environment variable:

docker run -p 5001:8080 -e DISABLE_NGINX=true "my-image-name"

By default, the number of uvicorn workers is set to CPU count. If you want to set a custom number of workers, you can set the MLFLOW_MODELS_WORKERS environment variable:

docker run -p 5001:8080 -e MLFLOW_MODELS_WORKERS=4 "my-image-name"

See https://www.mlflow.org/docs/latest/python_api/mlflow.pyfunc.html for more information on the ‘python_function’ flavor.

Usage

mlflow models build-docker [OPTIONS]

Options

-m, --model-uri <URI>: [Optional] URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-n, --name <name>: Name to use for built image

--env-manager <env_manager>

If specified, create an environment for MLmodel using the specified environment manager. The following values are supported:

- local: use the local environment
- virtualenv: use venv (and pyenv for Python version management)
- uv: use uv
- conda: use conda

If unspecified, default to virtualenv.

--mlflow-home <PATH>: Path to local clone of MLflow project. Use for development only.

--install-java <install_java>: Installs Java in the image if needed. Default is None, allowing MLflow to determine installation. Flavors requiring Java, such as Spark, enable this automatically. Note: This option only works with the UBUNTU base image; Python base images do not support Java installation.

--install-mlflow: If specified and there is a conda, virtualenv, or uv environment to be activated mlflow will be installed into the environment after it has been activated. The version of installed mlflow will be the same as the one used to invoke this command.

generate-dockerfile

Generates a directory with Dockerfile whose default entrypoint serves an MLflow model at port 8080 using the python_function flavor. The generated Dockerfile is written to the specified output directory, along with the model (if specified). This Dockerfile defines an image that is equivalent to the one produced by mlflow models build-docker.

Usage

mlflow models generate-dockerfile [OPTIONS]

Options

-m, --model-uri <URI>: [Optional] URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-d, --output-directory <output_directory>: Output directory where the generated Dockerfile is stored.

--env-manager <env_manager>

If specified, create an environment for MLmodel using the specified environment manager. The following values are supported:

- local: use the local environment
- virtualenv: use venv (and pyenv for Python version management)
- uv: use uv
- conda: use conda

If unspecified, default to None, then MLflow will automatically pick the env manager based on the model’s flavor configuration. If model-uri is specified: if python version is specified in the flavor configuration and no java installation is required, then we use local environment. Otherwise we use virtualenv. If no model-uri is provided, we use virtualenv.

--mlflow-home <PATH>: Path to local clone of MLflow project. Use for development only.

--install-java <install_java>: Installs Java in the image if needed. Default is None, allowing MLflow to determine installation. Flavors requiring Java, such as Spark, enable this automatically. Note: This option only works with the UBUNTU base image; Python base images do not support Java installation.

--install-mlflow: If specified and there is a conda, virtualenv, or uv environment to be activated mlflow will be installed into the environment after it has been activated. The version of installed mlflow will be the same as the one used to invoke this command.

predict

Generate predictions in json format using a saved MLflow model. For information about the input data formats accepted by this function, see the following documentation: https://www.mlflow.org/docs/latest/models.html#built-in-deployment-tools.

Usage

mlflow models predict [OPTIONS]

Options

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-i, --input-path <input_path>: CSV containing pandas DataFrame to predict against.

-o, --output-path <output_path>: File to output results to as json file. If not provided, output to stdout.

-t, --content-type <content_type>: Content type of the input file. Can be one of {‘json’, ‘csv’}.

--env-manager <env_manager>

If specified, create an environment for MLmodel using the specified environment manager. The following values are supported:

- local: use the local environment
- virtualenv: use venv (and pyenv for Python version management)
- uv: use uv
- conda: use conda

If unspecified, default to virtualenv.

--install-mlflow: If specified and there is a conda, virtualenv, or uv environment to be activated mlflow will be installed into the environment after it has been activated. The version of installed mlflow will be the same as the one used to invoke this command.

-r, --pip-requirements-override <pip_requirements_override>: Specify packages and versions to override the dependencies defined in the model. Must be a comma-separated string like x==y,z==a.

--env <env>: Extra environment variables to set when running the model. Must be key value pairs, e.g. –env key=value.

prepare-env

Performs any preparation necessary to predict or serve the model, for example downloading dependencies or initializing a conda environment. After preparation, calling predict or serve should be fast.

Usage

mlflow models prepare-env [OPTIONS]

Options

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

--env-manager <env_manager>

If specified, create an environment for MLmodel using the specified environment manager. The following values are supported:

- local: use the local environment
- virtualenv: use venv (and pyenv for Python version management)
- uv: use uv
- conda: use conda

If unspecified, default to virtualenv.

--install-mlflow: If specified and there is a conda, virtualenv, or uv environment to be activated mlflow will be installed into the environment after it has been activated. The version of installed mlflow will be the same as the one used to invoke this command.

serve

Serve a model saved with MLflow by launching a webserver on the specified host and port. The command supports models with the python_function or crate (R Function) flavor. For information about the input data formats accepted by the webserver, see the following documentation: https://www.mlflow.org/docs/latest/models.html#built-in-deployment-tools.

Warning

Models built using MLflow 1.x will require adjustments to the endpoint request payload if executed in an environment that has MLflow 2.x installed. In 1.x, a request payload was in the format: {'columns': [str], 'data': [[...]]}. 2.x models require payloads that are defined by the structural-defining keys of either dataframe_split, instances, inputs or dataframe_records. See the examples below for demonstrations of the changes to the invocation API endpoint in 2.0.

Note

Requests made in pandas DataFrame structures can be made in either split or records oriented formats. See https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.to_json.html for detailed information on orientation formats for converting a pandas DataFrame to json.

Example:

$ mlflow models serve -m runs:/my-run-id/model-path &

# records orientation input format for serializing a pandas DataFrame
$ curl http://127.0.0.1:5000/invocations -H 'Content-Type: application/json' -d '{
    "dataframe_records": [{"a":1, "b":2}, {"a":3, "b":4}, {"a":5, "b":6}]
}'

# split orientation input format for serializing a pandas DataFrame
$ curl http://127.0.0.1:5000/invocations -H 'Content-Type: application/json' -d '{
    "dataframe_split": {"columns": ["a", "b"],
                        "index": [0, 1, 2],
                        "data": [[1, 2], [3, 4], [5, 6]]}
}'

# inputs format for List submission of array, tensor, or DataFrame data
$ curl http://127.0.0.1:5000/invocations -H 'Content-Type: application/json' -d '{
    "inputs": [[1, 2], [3, 4], [5, 6]]
}'

# instances format for submission of Tensor data
curl http://127.0.0.1:5000/invocations -H 'Content-Type: application/json' -d '{
    "instances": [
        {"a": "t1", "b": [1, 2, 3]},
        {"a": "t2", "b": [4, 5, 6]},
        {"a": "t3", "b": [7, 8, 9]}
    ]
}'

Usage

mlflow models serve [OPTIONS]

Options

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-p, --port <port>: The port to listen on (default: 5000).

-h, --host <HOST>: The network interface to bind the server to (default: 127.0.0.1). This controls which network interfaces accept connections. Use ‘127.0.0.1’ for local-only access, or ‘0.0.0.0’ to allow connections from any network. NOTE: This is NOT a security setting - it only controls network binding. To restrict which clients can connect, use –allowed-hosts.

-t, --timeout <timeout>: Timeout in seconds to serve a request (default: 60).

-w, --workers <workers>: Number of uvicorn workers to handle requests when serving mlflow models (default: 1).

--env-manager <env_manager>

If specified, create an environment for MLmodel using the specified environment manager. The following values are supported:

- local: use the local environment
- virtualenv: use venv (and pyenv for Python version management)
- uv: use uv
- conda: use conda

If unspecified, default to virtualenv.

--no-conda: If specified, use local environment.

--install-mlflow: If specified and there is a conda, virtualenv, or uv environment to be activated mlflow will be installed into the environment after it has been activated. The version of installed mlflow will be the same as the one used to invoke this command.

Environment variables

MLFLOW_PORT: Provide a default for --port

MLFLOW_HOST: Provide a default for --host

MLFLOW_SCORING_SERVER_REQUEST_TIMEOUT: Provide a default for --timeout

MLFLOW_MODELS_WORKERS: Provide a default for --workers

update-pip-requirements

Add or remove requirements from a model’s conda.yaml and requirements.txt files. If using a remote tracking server, please make sure to set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

REQUIREMENT_STRINGS is a list of pip requirements specifiers. See below for examples.

Sample usage:

# Add requirements using the model's "runs:/" URI

mlflow models update-pip-requirements -m runs:/<run_id>/<model_path> \
    add "pandas==1.0.0" "scikit-learn" "mlflow >= 2.8, != 2.9.0"

# Remove requirements from a local model

mlflow models update-pip-requirements -m /path/to/local/model \
    remove "torchvision" "pydantic"

Note that model registry URIs (i.e. URIs in the form models:/) are not supported, as artifacts in the model registry are intended to be read-only. Editing requirements is read-only artifact repositories is also not supported.

If adding requirements, the function will overwrite any existing requirements that overlap, or else append the new requirements to the existing list.

If removing requirements, the function will ignore any version specifiers, and remove all the specified package names. Any requirements that are not found in the existing files will be ignored.

Usage

mlflow models update-pip-requirements [OPTIONS] {add|remove}
                                      [REQUIREMENT_STRINGS]...

Options

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

Arguments

OPERATION: Required argument

REQUIREMENT_STRINGS: Optional argument(s)

run

Run an MLflow project from the given URI.

For local runs, the run will block until it completes. Otherwise, the project will run asynchronously.

If running locally (the default), the URI can be either a Git repository URI or a local path. If running on Databricks, the URI must be a Git repository.

By default, Git projects run in a new working directory with the given parameters, while local projects run from the project’s root directory.

Usage

mlflow run [OPTIONS] URI

Options

-e, --entry-point <NAME>: Entry point within project. [default: main]. If the entry point is not found, attempts to run the project file with the specified name as a script, using ‘python’ to run .py files and the default shell (specified by environment variable $SHELL) to run .sh files

-v, --version <VERSION>: Version of the project to run, as a Git commit reference for Git projects.

-P, --param-list <NAME=VALUE>: A parameter for the run, of the form -P name=value. Provided parameters that are not in the list of parameters for an entry point will be passed to the corresponding entry point as command-line arguments in the form –name value

-A, --docker-args <NAME=VALUE>: A docker run argument or flag, of the form -A name=value (e.g. -A gpus=all) or -A name (e.g. -A t). The argument will then be passed as docker run –name value or docker run –name respectively.

--experiment-name <experiment_name>: Name of the experiment under which to launch the run. If not specified, ‘experiment-id’ option will be used to launch run.

--experiment-id <experiment_id>: ID of the experiment under which to launch the run.

-b, --backend <BACKEND>: Execution backend to use for run. Supported values: ‘local’, ‘databricks’, kubernetes (experimental). Defaults to ‘local’. If running against Databricks, will run against a Databricks workspace determined as follows: if a Databricks tracking URI of the form ‘databricks://profile’ has been set (e.g. by setting the MLFLOW_TRACKING_URI environment variable), will run against the workspace specified by <profile>. Otherwise, runs against the workspace specified by the default Databricks CLI profile. See https://github.com/databricks/databricks-cli for more info on configuring a Databricks CLI profile.

-c, --backend-config <FILE>: Path to JSON file (must end in ‘.json’) or JSON string which will be passed as config to the backend. The exact content which should be provided is different for each execution backend and is documented at https://www.mlflow.org/docs/latest/projects.html.

--env-manager <env_manager>

If specified, create an environment for MLproject using the specified environment manager. The following values are supported:

- local: use the local environment
- virtualenv: use venv (and pyenv for Python version management)
- uv: use uv
- conda: use conda

If unspecified, the appropriate environment manager is automatically selected based on the project configuration. For example, if MLproject.yaml contains a python_env key, virtualenv is used.

--storage-dir <storage_dir>: Only valid when backend is local. MLflow downloads artifacts from distributed URIs passed to parameters of type ‘path’ to subdirectories of storage_dir.

--run-id <RUN_ID>: If specified, the given run ID will be used instead of creating a new run. Note: this argument is used internally by the MLflow project APIs and should not be specified.

--run-name <RUN_NAME>: The name to give the MLflow Run associated with the project execution. If not specified, the MLflow Run name is left unset.

--build-image

Only valid for Docker projects. If specified, build a new Docker image that’s based on the image specified by the image field in the MLproject file, and contains files in the project directory.

Default: False

Arguments

URI: Required argument

Environment variables

MLFLOW_EXPERIMENT_NAME: Provide a default for --experiment-name

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

MLFLOW_TMP_DIR: Provide a default for --storage-dir

runs

Manage runs. To manage runs of experiments associated with a tracking server, set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

Usage

mlflow runs [OPTIONS] COMMAND [ARGS]...

create

Create a new MLflow run and immediately end it with the specified status.

This command is useful for creating runs programmatically for testing, scripting, or recording completed experiments. The run will be created and immediately closed with the specified status (FINISHED, FAILED, or KILLED).

Usage

mlflow runs create [OPTIONS]

Options

--experiment-id <experiment_id>: ID of the experiment under which to create the run. Must specify either this or –experiment-name.

--experiment-name <experiment_name>: Name of the experiment under which to create the run. Must specify either this or –experiment-id.

--run-name <run_name>: Optional human-readable name for the run (e.g., ‘baseline-model-v1’).

--description <description>: Optional longer description of what this run represents.

-t, --tags <tags>: Key-value pairs to categorize and filter runs. Use multiple times for multiple tags. Format: key=value (e.g., env=prod, model=xgboost, version=1.0).

--status <status>

Final status of the run. Options: FINISHED (default), FAILED, or KILLED.

Options: FINISHED | FAILED | KILLED

--parent-run-id <parent_run_id>: Optional ID of a parent run to create a nested run under.

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

MLFLOW_EXPERIMENT_NAME: Provide a default for --experiment-name

delete

Mark a run for deletion. Return an error if the run does not exist or is already marked. You can restore a marked run with restore_run, or permanently delete a run in the backend store.

Usage

mlflow runs delete [OPTIONS]

Options

--run-id <run_id>: Required

describe

All of run details will print to the stdout as JSON format.

Usage

mlflow runs describe [OPTIONS]

Options

--run-id <run_id>: Required

link-traces

Link traces to a run.

This command links one or more traces to an existing run. Traces can be linked to runs to establish relationships between traces and runs. Maximum 100 traces can be linked in a single command.

Usage

mlflow runs link-traces [OPTIONS]

Options

--run-id <run_id>: Required ID of the run to link traces to.

-t, --trace-id <trace_ids>: Required Trace ID to link to the run. Can be specified multiple times (maximum 100 traces).

list

List all runs of the specified experiment in the configured tracking server.

Usage

mlflow runs list [OPTIONS]

Options

--experiment-id <experiment_id>: Required Specify the experiment ID for list of runs.

-v, --view <view>: Select view type for list experiments. Valid view types are ‘active_only’ (default), ‘deleted_only’, and ‘all’.

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

restore

Restore a deleted run. Returns an error if the run is active or has been permanently deleted.

Usage

mlflow runs restore [OPTIONS]

Options

--run-id <run_id>: Required

sagemaker

Serve models on SageMaker.

To serve a model associated with a run on a tracking server, set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

Usage

mlflow sagemaker [OPTIONS] COMMAND [ARGS]...

build-and-push-container

Build new MLflow Sagemaker image, assign it a name, and push to ECR.

This function builds an MLflow Docker image. The image is built locally and it requires Docker to run. The image is pushed to ECR under current active AWS account and to current active AWS region.

Usage

mlflow sagemaker build-and-push-container [OPTIONS]

Options

--build, --no-build: Build the container if set.

--push, --no-push: Push the container to AWS ECR if set.

-c, --container <container>: image name

--network <network>: Set the networking mode for the RUN instructions during docker build. For example, use ‘–network sagemaker’ when building in SageMaker JupyterLab.

--install-java <install_java>: Installs Java in the image if needed. Default is None, allowing MLflow to determine installation. Flavors requiring Java, such as Spark, enable this automatically. Note: This option only works with the UBUNTU base image; Python base images do not support Java installation.

--env-manager <env_manager>

If specified, create an environment for MLmodel using the specified environment manager. The following values are supported:

- local: use the local environment
- virtualenv: use venv (and pyenv for Python version management)
- uv: use uv
- conda: use conda

If unspecified, default to virtualenv.

--mlflow-home <PATH>: Path to local clone of MLflow project. Use for development only.

deploy-transform-job

Deploy model on Sagemaker as a batch transform job. Current active AWS account needs to have correct permissions setup.

By default, unless the --async flag is specified, this command will block until either the batch transform job completes (definitively succeeds or fails) or the specified timeout elapses.

Usage

mlflow sagemaker deploy-transform-job [OPTIONS]

Options

-n, --job-name <job_name>: Required Transform job name

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

--input-data-type <input_data_type>: Required Input data type for the transform job

-u, --input-uri <input_uri>: Required S3 key name prefix or manifest of the input data

--content-type <content_type>: Required The multipurpose internet mail extension (MIME) type of the data

-o, --output-path <output_path>: Required The S3 path to store the output results of the Sagemaker transform job

--compression-type <compression_type>: The compression type of the transform data

-s, --split-type <split_type>: The method to split the transform job’s data files into smaller batches

-a, --accept <accept>: The multipurpose internet mail extension (MIME) type of the output data

--assemble-with <assemble_with>: The method to assemble the results of the transform job as a single S3 object

--input-filter <input_filter>: A JSONPath expression used to select a portion of the input data for the transform job

--output-filter <output_filter>: A JSONPath expression used to select a portion of the output data from the transform job

-j, --join-resource <join_resource>: The source of the data to join with the transformed data

-e, --execution-role-arn <execution_role_arn>: SageMaker execution role

-b, --bucket <bucket>: S3 bucket to store model artifacts

-i, --image-url <image_url>: ECR URL for the Docker image

--region-name <region_name>: Name of the AWS region in which to deploy the transform job

-t, --instance-type <instance_type>: The type of SageMaker ML instance on which to perform the batch transform job. For a list of supported instance types, see https://aws.amazon.com/sagemaker/pricing/instance-types/.

-c, --instance-count <instance_count>: The number of SageMaker ML instances on which to perform the batch transform job

-v, --vpc-config <vpc_config>: Path to a file containing a JSON-formatted VPC configuration. This configuration will be used when creating the new SageMaker model associated with this application. For more information, see https://docs.aws.amazon.com/sagemaker/latest/dg/API_VpcConfig.html

-f, --flavor <flavor>: The name of the flavor to use for deployment. Must be one of the following: [‘python_function’]. If unspecified, a flavor will be automatically selected from the model’s available flavors.

--archive: If specified, any SageMaker resources that become inactive after the finished batch transform job are preserved. These resources may include the associated SageMaker models and model artifacts. Otherwise, if –archive is unspecified, these resources are deleted. –archive must be specified when deploying asynchronously with –async.

--async: If specified, this command will return immediately after starting the deployment process. It will not wait for the deployment process to complete. The caller is responsible for monitoring the deployment process via native SageMaker APIs or the AWS console.

--timeout <timeout>: If the command is executed synchronously, the deployment process will return after the specified number of seconds if no definitive result (success or failure) is achieved. Once the function returns, the caller is responsible for monitoring the health and status of the pending deployment via native SageMaker APIs or the AWS console. If the command is executed asynchronously using the –async flag, this value is ignored.

push-model

Push an MLflow model to Sagemaker model registry. Current active AWS account needs to have correct permissions setup.

Usage

mlflow sagemaker push-model [OPTIONS]

Options

-n, --model-name <model_name>: Required Sagemaker model name

-m, --model-uri <URI>: Required URI to the model. A local path, a ‘runs:/’ URI, or a remote storage URI (e.g., an ‘s3://’ URI). For more information about supported remote URIs for model artifacts, see https://mlflow.org/docs/latest/tracking.html#artifact-stores

-e, --execution-role-arn <execution_role_arn>: SageMaker execution role

-b, --bucket <bucket>: S3 bucket to store model artifacts

-i, --image-url <image_url>: ECR URL for the Docker image

--region-name <region_name>: Name of the AWS region in which to push the Sagemaker model

-v, --vpc-config <vpc_config>: Path to a file containing a JSON-formatted VPC configuration. This configuration will be used when creating the new SageMaker model. For more information, see https://docs.aws.amazon.com/sagemaker/latest/dg/API_VpcConfig.html

-f, --flavor <flavor>: The name of the flavor to use for deployment. Must be one of the following: [‘python_function’]. If unspecified, a flavor will be automatically selected from the model’s available flavors.

terminate-transform-job

Terminate the specified Sagemaker batch transform job. Unless --archive is specified, all SageMaker resources associated with the batch transform job are deleted as well.

By default, unless the --async flag is specified, this command will block until either the termination process completes (definitively succeeds or fails) or the specified timeout elapses.

Usage

mlflow sagemaker terminate-transform-job [OPTIONS]

Options

-n, --job-name <job_name>: Required Transform job name

-r, --region-name <region_name>: Name of the AWS region in which the transform job is deployed

--archive: If specified, resources associated with the application are preserved. These resources may include unused SageMaker models and model artifacts. Otherwise, if –archive is unspecified, these resources are deleted. –archive must be specified when deleting asynchronously with –async.

--async: If specified, this command will return immediately after starting the termination process. It will not wait for the termination process to complete. The caller is responsible for monitoring the termination process via native SageMaker APIs or the AWS console.

--timeout <timeout>: If the command is executed synchronously, the termination process will return after the specified number of seconds if no definitive result (success or failure) is achieved. Once the function returns, the caller is responsible for monitoring the health and status of the pending termination via native SageMaker APIs or the AWS console. If the command is executed asynchronously using the –async flag, this value is ignored.

scorers

Manage scorers, including LLM judges. To manage scorers associated with a tracking server, set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

Usage

mlflow scorers [OPTIONS] COMMAND [ARGS]...

list

List registered scorers for an experiment, or list all built-in scorers.

Examples:

# List built-in scorers (table format)
mlflow scorers list --builtin
mlflow scorers list -b

# List built-in scorers (JSON format)
mlflow scorers list --builtin --output json

# List registered scorers in table format (default)
mlflow scorers list --experiment-id 123

# List registered scorers in JSON format
mlflow scorers list --experiment-id 123 --output json

# Using environment variable for experiment ID
export MLFLOW_EXPERIMENT_ID=123
mlflow scorers list

Usage

mlflow scorers list [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Experiment ID for which to list scorers. Can be set via MLFLOW_EXPERIMENT_ID env var.

-b, --builtin: List built-in scorers instead of registered scorers for an experiment.

--output <output>

Output format: ‘table’ for formatted table (default) or ‘json’ for JSON format

Options: table | json

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

register-llm-judge

Register an LLM judge scorer in the specified experiment.

This command creates an LLM judge using natural language instructions and registers it in an experiment for use in evaluation workflows. The instructions must contain at least one template variable ({{ inputs }}, {{ outputs }}, {{ expectations }}, or {{ trace }}) to define what the judge will evaluate.

Examples:

# Register a basic quality judge
mlflow scorers register-llm-judge -n quality_judge \
    -i "Evaluate if {{ outputs }} answers {{ inputs }}. Return yes or no." -x 123

# Register a judge with custom model
mlflow scorers register-llm-judge -n custom_judge \
    -i "Check whether {{ outputs }} is professional and formal. Rate pass, fail, or na" \
    -m "openai:/gpt-4" -x 123

# Register a judge with description
mlflow scorers register-llm-judge -n quality_judge \
    -i "Evaluate if {{ outputs }} answers {{ inputs }}. Return yes or no." \
    -d "Evaluates response quality and relevance" -x 123

# Using environment variable
export MLFLOW_EXPERIMENT_ID=123
mlflow scorers register-llm-judge -n my_judge \
    -i "Check whether {{ outputs }} contains PII"

Usage

mlflow scorers register-llm-judge [OPTIONS]

Options

-n, --name <name>: Required Name for the judge scorer

-i, --instructions <instructions>: Required Instructions for evaluation. Must contain at least one template variable: {{ inputs }}, {{ outputs }}, {{ expectations }}, or {{ trace }}. See the make_judge documentation for variable interpretations.

-m, --model <model>: Model identifier to use for evaluation (e.g., openai:/gpt-4). If not provided, uses the default model.

-x, --experiment-id <experiment_id>: Required Experiment ID to register the judge in. Can be set via MLFLOW_EXPERIMENT_ID env var.

-d, --description <description>: Description of what the judge evaluates.

--base-url <base_url>: Base URL to route requests through. Useful for enterprise environments requiring LLM access through internal gateways or security proxies. Note: This value is not persisted when the judge is registered.

--extra-headers <extra_headers>: JSON string of additional HTTP headers to include in requests to the LLM provider. Example: ‘{{“X-API-Key”: “secret”}}’. Note: This value is not persisted when the judge is registered.

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

server

Run the MLflow tracking server with built-in security middleware.

The server listens on http://localhost:5000 by default and only accepts connections from the local machine. To let the server accept connections from other machines, you will need to pass --host 0.0.0.0 to listen on all network interfaces (or a specific interface address).

See https://mlflow.org/docs/latest/tracking/server-security.html for detailed documentation and guidance on security configurations for the MLflow tracking server.

Usage

mlflow server [OPTIONS]

Options

--backend-store-uri <PATH>: URI to which to persist experiment and run data. Acceptable URIs are SQLAlchemy-compatible database connection strings (e.g. ‘sqlite:///path/to/file.db’) or local filesystem URIs (e.g. ‘file:///absolute/path/to/directory’). By default, data will be logged to the ./mlruns directory.

--read-replica-backend-store-uri <URI>: URI for a read-only database replica. When specified, read operations (e.g. search_runs, get_experiment) are routed to this URI while write operations use –backend-store-uri. Enables horizontal scaling via database read replicas. If not specified, all operations use –backend-store-uri. Note: there is no automatic failover to the primary if the replica becomes unavailable. Cloud-managed databases (Aurora, RDS) handle this at the DNS level. For self-hosted setups, use a connection proxy (PgBouncer, HAProxy) for failover.

--registry-store-uri <URI>: URI to which to persist registered models. Acceptable URIs are SQLAlchemy-compatible database connection strings (e.g. ‘sqlite:///path/to/file.db’). If not specified, backend-store-uri is used.

--default-artifact-root <URI>: Directory in which to store artifacts for any new experiments created. For tracking server backends that rely on SQL, this option is required in order to store artifacts. Note that this flag does not impact already-created experiments with any previous configuration of an MLflow server instance. By default, data will be logged to the mlflow-artifacts:/ uri proxy if the –serve-artifacts option is enabled. Otherwise, the default location will be ./mlruns.

--serve-artifacts, --no-serve-artifacts: Enables serving of artifact uploads, downloads, and list requests by routing these requests to the storage location that is specified by ‘–artifacts-destination’ directly through a proxy. The default location that these requests are served from is a local ‘./mlartifacts’ directory which can be overridden via the ‘–artifacts-destination’ argument. To disable artifact serving, specify –no-serve-artifacts. Default: True

--artifacts-only: If specified, configures the mlflow server to be used only for proxied artifact serving. With this mode enabled, functionality of the mlflow tracking service (e.g. run creation, metric logging, and parameter logging) is disabled. The server will only expose endpoints for uploading, downloading, and listing artifacts. Default: False

--artifacts-destination <URI>: The base artifact location from which to resolve artifact upload/download/list requests (e.g. ‘s3://my-bucket’). Defaults to a local ‘./mlartifacts’ directory. This option only applies when the tracking server is configured to stream artifacts and the experiment’s artifact root location is http or mlflow-artifacts URI.

-h, --host <HOST>: The network interface to bind the server to (default: 127.0.0.1). This controls which network interfaces accept connections. Use ‘127.0.0.1’ for local-only access, or ‘0.0.0.0’ to allow connections from any network. NOTE: This is NOT a security setting - it only controls network binding. To restrict which clients can connect, use –allowed-hosts.

-p, --port <port>: The port to listen on (default: 5000).

-w, --workers <workers>: Number of worker processes to handle requests (default: 4).

--allowed-hosts <allowed_hosts>: Comma-separated list of allowed Host headers to prevent DNS rebinding attacks (default: localhost + private IPs). DNS rebinding allows attackers to trick your browser into accessing internal services. Examples: ‘mlflow.company.com,10.0.0.100:5000’. Supports wildcards: ‘mlflow.company.com,192.168.*,app-.internal.com’. Use ‘’ to allow ALL hosts (not recommended for production). Default allows: localhost (all ports), private IPs (10.*, 192.168.*, 172.16-31.*). Set this when exposing MLflow beyond localhost to prevent host header attacks.

--cors-allowed-origins <cors_allowed_origins>: Comma-separated list of allowed CORS origins to prevent cross-site request attacks (default: localhost origins on any port). CORS attacks allow malicious websites to make requests to your MLflow server using your credentials. Examples: ‘https://app.company.com,https://notebook.company.com’. Default allows: http://localhost:* (any port), http://127.0.0.1:, http://[::1]:. Set this when you have web applications on different domains that need to access MLflow. Use ‘*’ to allow ALL origins (DANGEROUS - only for development!).

--disable-security-middleware: DANGEROUS: Disable all security middleware including CORS protection and host validation. This completely removes security protections and should only be used for testing. When disabled, your MLflow server is vulnerable to CORS attacks, DNS rebinding, and clickjacking. Instead, prefer configuring specific security settings with –cors-allowed-origins and –allowed-hosts.

--x-frame-options <x_frame_options>: X-Frame-Options header value for clickjacking protection. Options: ‘SAMEORIGIN’ (default - allows embedding only from same origin), ‘DENY’ (prevents all embedding), ‘NONE’ (disables header - allows embedding from anywhere). Set to ‘NONE’ if you need to embed MLflow UI in iframes from different origins.

--static-prefix <static_prefix>: A prefix which will be prepended to the path of all static paths.

--gunicorn-opts <gunicorn_opts>: Additional command line options forwarded to gunicorn processes.

--waitress-opts <waitress_opts>: Additional command line options for waitress-serve.

--uvicorn-opts <uvicorn_opts>: Additional command line options forwarded to uvicorn processes (used by default).

--expose-prometheus <expose_prometheus>: Path to the directory where metrics will be stored. If the directory doesn’t exist, it will be created. Activate prometheus exporter to expose metrics on /metrics endpoint.

--app-name <app_name>

Application name to be used for the tracking server. If not specified, ‘mlflow.server:app’ will be used.

Options: custom_app | basic-auth

--trace-archival-config <PATH>: Path to the YAML config file for server-owned trace archival.

--dev

If enabled, run the server with debug logging and auto-reload. Should only be used for development purposes. Cannot be used with ‘–gunicorn-opts’ or ‘–uvicorn-opts’. Unsupported on Windows.

Default: False

--secrets-cache-ttl <secrets_cache_ttl>

Server-side secrets cache time-to-live in seconds. Controls how long decrypted secrets are cached in memory (encrypted with AES-GCM-256). Lower values (10-30s) are more secure but impact performance. Higher values (120-300s) improve performance but increase exposure window. Range: 10-300 seconds.

Default: 60

--secrets-cache-max-size <secrets_cache_max_size>

Server-side secrets cache maximum entries. When exceeded, least recently used entries are evicted. Range: 1-10000 entries.

Default: 1000

--workspace-store-uri <URI>: Workspace provider backend URI used for workspace CRUD APIs and request routing. When unspecified, defaults to the backend store URI. This only needs to be specified when using a workspace store plugin leveraging externally managed workspaces (e.g. Kubernetes namespaces).

--enable-workspaces, --disable-workspaces

Enable backwards compatible workspaces mode for logical isolation of experiments, registered models, and prompts.

Default: False

Environment variables

MLFLOW_BACKEND_STORE_URI: Provide a default for --backend-store-uri

MLFLOW_READ_REPLICA_BACKEND_STORE_URI: Provide a default for --read-replica-backend-store-uri

MLFLOW_REGISTRY_STORE_URI: Provide a default for --registry-store-uri

MLFLOW_DEFAULT_ARTIFACT_ROOT: Provide a default for --default-artifact-root

MLFLOW_SERVE_ARTIFACTS: Provide a default for --serve-artifacts

MLFLOW_ARTIFACTS_ONLY: Provide a default for --artifacts-only

MLFLOW_ARTIFACTS_DESTINATION: Provide a default for --artifacts-destination

MLFLOW_HOST: Provide a default for --host

MLFLOW_PORT: Provide a default for --port

MLFLOW_WORKERS: Provide a default for --workers

MLFLOW_SERVER_ALLOWED_HOSTS: Provide a default for --allowed-hosts

MLFLOW_SERVER_CORS_ALLOWED_ORIGINS: Provide a default for --cors-allowed-origins

MLFLOW_SERVER_DISABLE_SECURITY_MIDDLEWARE: Provide a default for --disable-security-middleware

MLFLOW_SERVER_X_FRAME_OPTIONS: Provide a default for --x-frame-options

MLFLOW_STATIC_PREFIX: Provide a default for --static-prefix

MLFLOW_GUNICORN_OPTS: Provide a default for --gunicorn-opts

MLFLOW_UVICORN_OPTS: Provide a default for --uvicorn-opts

MLFLOW_EXPOSE_PROMETHEUS: Provide a default for --expose-prometheus

MLFLOW_TRACE_ARCHIVAL_CONFIG: Provide a default for --trace-archival-config

MLFLOW_WORKSPACE_STORE_URI: Provide a default for --workspace-store-uri

skills

Inspect the MLflow skills bundled with this installation.

Usage

mlflow skills [OPTIONS] COMMAND [ARGS]...

list

List the MLflow skills bundled with this installation.

Usage

mlflow skills list [OPTIONS]

view

View the details of an MLflow skill.

Usage

mlflow skills view [OPTIONS] SKILL_NAME

Arguments

SKILL_NAME: Required argument

traces

Manage traces. To manage traces associated with a tracking server, set the MLFLOW_TRACKING_URI environment variable to the URL of the desired server.

TRACE SCHEMA: info.trace_id # Unique trace identifier info.experiment_id # MLflow experiment ID info.request_time # Request timestamp (milliseconds) info.execution_duration # Total execution time (milliseconds) info.state # Trace status: OK, ERROR, etc. info.client_request_id # Optional client-provided request ID info.request_preview # Truncated request preview info.response_preview # Truncated response preview info.trace_metadata.mlflow.* # MLflow-specific metadata info.trace_metadata.* # Custom metadata fields info.tags.mlflow.traceName # Trace name tag info.tags.<key> # Custom tags info.assessments.*.assessment_id # Assessment identifiers info.assessments.*.feedback.name # Feedback names info.assessments.*.feedback.value # Feedback scores/values info.assessments.*.feedback.rationale # Feedback explanations info.assessments.*.expectation.name # Ground truth names info.assessments.*.expectation.value # Expected values info.assessments.*.source.source_type # HUMAN, LLM_JUDGE, CODE info.assessments.*.source.source_id # Source identifier info.token_usage # Token usage (property, not searchable via fields) data.spans.*.span_id # Individual span IDs data.spans.*.name # Span operation names data.spans.*.parent_id # Parent span relationships data.spans.*.start_time # Span start timestamps data.spans.*.end_time # Span end timestamps data.spans.*.status_code # Span status codes data.spans.*.attributes.mlflow.spanType # AGENT, TOOL, LLM, etc. data.spans.*.attributes.<key> # Custom span attributes data.spans.*.events.*.name # Event names data.spans.*.events.*.timestamp # Event timestamps data.spans.*.events.*.attributes.<key> # Event attributes

For additional details, see: https://mlflow.org/docs/latest/genai/tracing/concepts/trace/#traceinfo-metadata-and-context

FIELD SELECTION:

Use –extract-fields with dot notation to select specific fields.

Examples:

info.trace_id                           # Single field
info.assessments.*                      # All assessment data
info.assessments.*.feedback.value       # Just feedback scores
info.assessments.*.source.source_type   # Assessment sources
info.trace_metadata.mlflow.traceInputs  # Original inputs
info.trace_metadata.mlflow.source.type  # Source type
info.tags.`mlflow.traceName`            # Trace name (backticks for dots)
data.spans.*                            # All span data
data.spans.*.name                       # Span operation names
data.spans.*.attributes.mlflow.spanType # Span types
data.spans.*.events.*.name              # Event names
info.trace_id,info.state,info.execution_duration  # Multiple fields

Usage

mlflow traces [OPTIONS] COMMAND [ARGS]...

delete

Delete traces from an experiment.

Either –trace-ids or timestamp criteria can be specified, but not both.

Examples:
# Delete specific traces
mlflow traces delete –experiment-id 1 –trace-ids tr-abc123,tr-def456

# Delete traces older than a timestamp

mlflow traces delete –experiment-id 1 –max-timestamp-millis 1700000000000

# Delete up to 100 old traces

mlflow traces delete –experiment-id 1 –max-timestamp-millis 1700000000000 –max-traces 100

Usage

mlflow traces delete [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required Experiment ID to search within. Can be set via MLFLOW_EXPERIMENT_ID env var.

--trace-ids <trace_ids>: Comma-separated list of trace IDs to delete

--max-timestamp-millis <max_timestamp_millis>: Delete traces older than this timestamp (milliseconds since epoch)

--max-traces <max_traces>: Maximum number of traces to delete

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

delete-assessment

Delete an assessment from a trace.

Example:

mlflow traces delete-assessment –trace-id tr-abc123 –assessment-id asmt-def456

Usage

mlflow traces delete-assessment [OPTIONS]

Options

--trace-id <trace_id>: Required

--assessment-id <assessment_id>: Required Assessment ID to delete

delete-tag

Delete a tag from a trace.

Example:

mlflow traces delete-tag –trace-id tr-abc123 –key environment

Usage

mlflow traces delete-tag [OPTIONS]

Options

--trace-id <trace_id>: Required

--key <key>: Required Tag key to delete

evaluate

Evaluate one or more traces using specified scorers and display the results.

This command runs MLflow’s genai.evaluate() on specified traces, applying the specified scorers and displaying the evaluation results in table or JSON format.

Examples:
# Evaluate a single trace with built-in scorers
mlflow traces evaluate –trace-ids tr-abc123 –scorers Correctness,Safety

# Evaluate multiple traces

mlflow traces evaluate –trace-ids tr-abc123,tr-def456,tr-ghi789

–scorers RelevanceToQuery

# Evaluate with JSON output

mlflow traces evaluate –trace-ids tr-abc123

–scorers Correctness –output json

# Evaluate with custom registered scorer

mlflow traces evaluate –trace-ids tr-abc123,tr-def456

–scorers my_custom_scorer,Correctness

Available built-in scorers (use either PascalCase or snake_case):
- Correctness / correctness: Ensures responses are correct and accurate
- Safety / safety: Ensures responses don’t contain harmful/toxic content
- RelevanceToQuery / relevance_to_query: Ensures response addresses user input directly
- Guidelines / guidelines: Evaluates adherence to specific constraints
- ExpectationsGuidelines / expectations_guidelines: Row-specific guidelines evaluation
- RetrievalRelevance / retrieval_relevance: Measures chunk relevance to input request
- RetrievalSufficiency / retrieval_sufficiency: Evaluates if retrieved docs provide
necessary info
- RetrievalGroundedness / retrieval_groundedness: Assesses response alignment with
retrieved context

Usage

mlflow traces evaluate [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required Experiment ID to search within. Can be set via MLFLOW_EXPERIMENT_ID env var.

--trace-ids <trace_ids>: Required Comma-separated list of trace IDs to evaluate.

--scorers <scorers>: Required Comma-separated list of scorer names. Can be built-in scorers (e.g., Correctness, Safety, RelevanceToQuery) or registered custom scorers.

--output <output_format>

Output format: ‘table’ for formatted table (default) or ‘json’ for JSON format

Options: table | json

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

get

All trace details will print to stdout as JSON format.

Examples:
# Get full trace
mlflow traces get –trace-id tr-1234567890abcdef

# Get specific fields only

mlflow traces get –trace-id tr-1234567890abcdef

–extract-fields “info.trace_id,info.assessments.*,data.spans.*.name”

Usage

mlflow traces get [OPTIONS]

Options

--trace-id <trace_id>: Required

--extract-fields <extract_fields>: Filter and select specific fields using dot notation. Examples: ‘info.trace_id’, ‘info.assessments.*’, ‘data.spans.*.name’. Comma-separated for multiple fields. If not specified, returns all trace data.

--verbose: Show all available fields in error messages when invalid fields are specified.

get-assessment

Get assessment details as JSON.

Example:

mlflow traces get-assessment –trace-id tr-abc123 –assessment-id asmt-def456

Usage

mlflow traces get-assessment [OPTIONS]

Options

--trace-id <trace_id>: Required

--assessment-id <assessment_id>: Required Assessment ID

log-expectation

Log an expectation (ground truth label) to a trace.

Examples:

# Simple expected answer

mlflow traces log-expectation –trace-id tr-abc123

–name expected_answer –value “Paris”

# Human-annotated ground truth

mlflow traces log-expectation –trace-id tr-abc123

–name ground_truth –value “positive”

–source-type HUMAN –source-id annotator@example.com

# Complex expected output with metadata

mlflow traces log-expectation –trace-id tr-abc123

–name expected_response 
–value ‘{“answer”: “42”, “confidence”: 0.95}’ 
–metadata ‘{“dataset”: “test_set_v1”, “difficulty”: “hard”}’

Usage

mlflow traces log-expectation [OPTIONS]

Options

--trace-id <trace_id>: Required

--name <name>: Required Expectation name (e.g., ‘expected_answer’, ‘ground_truth’)

--value <value>: Required Expected value (string or JSON for complex values)

--source-type <source_type>

Source type of the expectation

Options: HUMAN | LLM_JUDGE | CODE

--source-id <source_id>: Source identifier

--metadata <metadata>: Additional metadata as JSON string

--span-id <span_id>: Associate expectation with a specific span ID

log-feedback

Log feedback (evaluation score) to a trace.

Examples:

# Simple numeric feedback

mlflow traces log-feedback –trace-id tr-abc123

–name relevance –value 0.9

–rationale “Highly relevant response”

# Human feedback with source

mlflow traces log-feedback –trace-id tr-abc123

–name quality –value good

–source-type HUMAN –source-id reviewer@example.com

# Complex feedback with JSON value and metadata

mlflow traces log-feedback –trace-id tr-abc123

–name metrics 
–value ‘{“accuracy”: 0.95, “f1”: 0.88}’ 
–metadata ‘{“model”: “gpt-4”, “temperature”: 0.7}’

# LLM judge feedback

mlflow traces log-feedback –trace-id tr-abc123

–name faithfulness –value 0.85 
–source-type LLM_JUDGE –source-id gpt-4 
–rationale “Response is faithful to context”

Usage

mlflow traces log-feedback [OPTIONS]

Options

--trace-id <trace_id>: Required

--name <name>: Required Feedback name

--value <value>: Feedback value (number, string, bool, or JSON for complex values)

--source-type <source_type>

Source type of the feedback

Options: HUMAN | LLM_JUDGE | CODE

--source-id <source_id>: Source identifier (e.g., email for HUMAN, model name for LLM)

--rationale <rationale>: Explanation/justification for the feedback

--metadata <metadata>: Additional metadata as JSON string

--span-id <span_id>: Associate feedback with a specific span ID

search

Search for traces in the specified experiment.

Examples:

# Search all traces in experiment 1

mlflow traces search –experiment-id 1

# Using environment variable
export MLFLOW_EXPERIMENT_ID=1
mlflow traces search –max-results 50

# Filter traces by run ID

mlflow traces search –experiment-id 1 –run-id abc123def

# Use filter string for complex queries

mlflow traces search –experiment-id 1

–filter-string “run_id = ‘abc123’ AND timestamp_ms > 1700000000000”

# Order results and use pagination

mlflow traces search –experiment-id 1

–order-by “timestamp_ms DESC” 
–max-results 10 
–page-token <token_from_previous>

# Search without span data (faster for metadata-only queries)

mlflow traces search –experiment-id 1 –no-include-spans

Usage

mlflow traces search [OPTIONS]

Options

-x, --experiment-id <experiment_id>: Required Experiment ID to search within. Can be set via MLFLOW_EXPERIMENT_ID env var.

--filter-string <filter_string>

Filter string for trace search.

Examples: - Filter by run ID: “run_id = ‘123abc’” - Filter by status: “status = ‘OK’” - Filter by timestamp: “timestamp_ms > 1700000000000” - Filter by metadata: “metadata.`mlflow.modelId` = ‘model123’” - Filter by tags: “tags.environment = ‘production’” - Multiple conditions: “run_id = ‘123’ AND status = ‘OK’”

Available fields: - run_id: Associated MLflow run ID - status: Trace status (OK, ERROR, etc.) - timestamp_ms: Trace timestamp in milliseconds - execution_time_ms: Trace execution time in milliseconds - name: Trace name - metadata.<key>: Custom metadata fields (use backticks for keys with dots) - tags.<key>: Custom tag fields

--max-results <max_results>: Maximum number of traces to return (default: 100)

--order-by <order_by>: Comma-separated list of fields to order by (e.g., ‘timestamp_ms DESC, status’)

--page-token <page_token>: Token for pagination from previous search

--run-id <run_id>: Filter traces by run ID (convenience option, adds to filter-string)

--include-spans, --no-include-spans: Include span data in results (default: include)

--model-id <model_id>: Filter traces by model ID

--sql-warehouse-id <sql_warehouse_id>: DEPRECATED. Use the MLFLOW_TRACING_SQL_WAREHOUSE_ID environment variable instead.SQL warehouse ID (only needed when searching for traces by model stored in Databricks Unity Catalog)

--output <output>

Output format: ‘table’ for formatted table (default) or ‘json’ for JSON format

Options: table | json

--extract-fields <extract_fields>: Filter and select specific fields using dot notation. Examples: “info.trace_id”, “info.assessments.*”, “data.spans.*.name”. For field names with dots, use backticks: “info.tags.`mlflow.traceName`”. Comma-separated for multiple fields. Defaults to standard columns for table mode, all fields for JSON mode.

--verbose: Show all available fields in error messages when invalid fields are specified.

Environment variables

MLFLOW_EXPERIMENT_ID: Provide a default for --experiment-id

set-tag

Set a tag on a trace.

Example:

mlflow traces set-tag –trace-id tr-abc123 –key environment –value production

Usage

mlflow traces set-tag [OPTIONS]

Options

--trace-id <trace_id>: Required

--key <key>: Required Tag key

--value <value>: Required Tag value

update-assessment

Update an existing assessment.

NOTE: Assessment names cannot be changed once set. Only value, rationale, and metadata can be updated.

Examples:

# Update feedback value and rationale

mlflow traces update-assessment –trace-id tr-abc123 –assessment-id asmt-def456

–value ‘{“accuracy”: 0.98}’ –rationale “Updated after review”

# Update only the rationale

mlflow traces update-assessment –trace-id tr-abc123 –assessment-id asmt-def456

–rationale “Revised evaluation”

Usage

mlflow traces update-assessment [OPTIONS]

Options

--trace-id <trace_id>: Required

--assessment-id <assessment_id>: Required Assessment ID to update

--value <value>: Updated assessment value (JSON)

--rationale <rationale>: Updated rationale

--metadata <metadata>: Updated metadata as JSON