MLflow

MLflow 3.3.0

August 19, 2025 · 2 min read

MLflow 3.3.0 includes several major features and improvements

Eval UI

🪝 Model Registry Webhooks: MLflow now supports webhooks for model registry events, enabling automated notifications and integrations with external systems. (#16583, @harupy)
🧭 Agno Tracing Integration: Added Agno tracing integration for enhanced observability of AI agent workflows. (#16995, @joelrobin18)
🧪 GenAI Evaluation in OSS: MLflow open-sources the new evaluation capability for LLM applications. This suite enables systematic measurement and improvement of LLM application quality, with tight integration into MLflow's observability, feedback collection, and experiment tracking capabilities. (#17161, #17159, @B-Step62)
🖥️ Revamped Trace Table View: The new trace view in MLflow UI provides a streamlined interface for exploring, filtering, and monitoring traces, with enhanced search capabilities including full-text search across requests.(#17092, @daniellok-db)
⚡️ FastAPI + Uvicorn Server: MLflow Tracking Server now defaults to FastAPI + Uvicorn for improved performance, while maintaining Flask compatibility. (#17038, @dbczumar)

New features:

[Tracking] Add a Docker compose file to quickly start a local MLflow server with recommended minimum setup (#17065, @joelrobin18)
[Tracing] Add memory span type for agentic workflows (#17034, @B-Step62)
[Prompts] Enable custom prompt optimizers in optimize_prompt including DSPy support (#17052, @TomeHirata)
[Model Registry / Prompts] Proper support for the @latest alias (#17146, @B-Step62)
[Metrics] Allow custom tokenizer encoding in token_count function (#16253, @joelrobin18)

Bug fixes:

[Tracking] Fix Databricks secret scope check to reduce audit log errors (#17166, @harupy)
[Tracking] Fix Databricks SDK error code mapping in retry logic (#17095, @harupy)
[Tracking] Fix Databricks secret scope check to reduce error rates (#17166, @harupy)
[Tracing] Remove API keys from CrewAI traces to prevent credential leakage (#17082, @diy2learn)
[Tracing] Fix LiteLLM span association issue by making callbacks synchronous (#16982, @B-Step62)
[Tracing] Fix OpenAI Agents tracing (#17227, @B-Step62)
[Evaluation] Fix issue with get_label_schema has no attribute (#17163, @smoorjani)
[Docs] Fix version selector on API Reference page by adding missing CSS class and versions.json generation (#17247, @copilot-swe-agent)

Documentation updates:

[Docs] Document custom optimizer usage with optimize_prompt (#17084, @TomeHirata)
[Docs] Fix built-in scorer documentation for expectation parameter (#17075, @smoorjani)
[Docs] Add comprehensive documentation for scorers (#17258, @B-Step62)

For a comprehensive list of changes, see the release change log, and check out the latest documentation on mlflow.org.