MLflow 3
The open source MLflow community has reached a major milestone. Today, we're releasing MLflow 3, which brings production-ready generative AI capabilities to the platform that millions of developers trust for ML operations.
This isn't just another feature update. MLflow 3 fundamentally expands what's possible with open source ML tooling, addressing the observability and quality challenges that have made GenAI deployment feel like a leap of faith.
Major Updatesโ
๐ฏ MLflow LoggedModel
โ
MLflow 3 introduces a refined architecture with the new LoggedModel
entity as a first-class citizen, moving beyond the traditional run-centric approach. This enables better organization and comparison of GenAI agents, deep learning checkpoints, and model variants across experiments.
Learn more about MLflow LoggedModel
in the documentation.
๐ Strong Lineage Supportโ
Enhanced model tracking provides comprehensive lineage between models, runs, traces, prompts, and evaluation metrics. The new model-centric design allows you to group traces and metrics from interactive queries and automated evaluation jobs, enabling rich comparisons across model versions.
New GenAI Evaluation Suiteโ
MLflow's evaluation and monitoring capabilities help you systematically measure, improve, and maintain the quality of your GenAI applications throughout their lifecycle. From development through production, use the same quality scorers to ensure your applications deliver accurate, reliable responses while managing cost and latency.
Learn more about the new GenAI evaluation suite in the documentation.
The new evaluation suite is available only in Managed MLflow on Databricks, with open source support coming soon. Interested in trying it out? Start a free Databricks trial to explore these features today.
โก Prompt Optimizationโ
The MLflow Prompt Registry now includes prompt optimization capabilities, allowing you to automatically improve prompts using evaluation feedback and labeled datasets. This includes versioning, tracking, and systematic prompt engineering workflows.
Learn more about prompt optimization in the documentation.