4 posts tagged with "databricks"
Evaluating Databricks Genie Spaces
A complete pipeline for tracing, evaluating, and improving a Databricks Genie space using MLflow.databricksgenieevaluationtracingagents
Genie Evaluation with LLM Judges
Score Genie traces with built-in and custom judges to find quality issues in responses and SQL generation.databricksgenieevaluationagents
Genie Space Improvement Generator
Take traces that failed evaluation, combine them with your Genie space config, and generate copy-paste-ready fixes with an LLM.databricksgenieevaluationagents
Genie Conversation Tracing Pipeline
Pull conversations from a Genie space and log each one as an MLflow trace for inspection and evaluation.databricksgenietracingagents