Coding Agents & AI Gateway
The MLflow AI Gateway supports popular AI coding agents such as Claude Code, OpenAI Codex, and Gemini CLI out of the box.
AI coding agents can make dozens or even hundreds of LLM calls per session, often running autonomously for extended periods. Without visibility and controls, it's easy to lose track of what they're doing, how much they're spending, and whether they're operating within your organization's policies. Routing coding agents through the gateway gives your team three key capabilities:
Observability
Every request is automatically captured as an MLflow trace, giving you full visibility into inputs, outputs, token counts, and latency, without any code changes.
Budget Control
Set spending limits globally or per workspace to prevent runaway costs. Configure alerts and hard limits to keep coding agent sessions within budget.
Guardrails
Enforce content policies on all coding agent requests. Block sensitive topics, redact PII, and ensure responses meet your organization's standards.
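To make the budget and guardrail ideas concrete, here is a minimal conceptual sketch in Python. It is not the MLflow AI Gateway API or configuration format: the names `Budget`, `record`, `redact_emails`, and the 80% alert threshold are all hypothetical, chosen only to illustrate how an alert threshold, a hard limit, and a simple PII redaction rule might behave.

```python
import re
from dataclasses import dataclass

# Hypothetical illustration only: none of these names come from MLflow.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def redact_emails(text: str) -> str:
    """Replace anything that looks like an email address with a placeholder."""
    return EMAIL_PATTERN.sub("[REDACTED_EMAIL]", text)


@dataclass
class Budget:
    hard_limit_usd: float
    alert_fraction: float = 0.8  # assumed threshold: alert at 80% of the limit
    spent_usd: float = 0.0

    def record(self, cost_usd: float) -> str:
        """Add a request's cost and report the resulting budget state."""
        if self.spent_usd + cost_usd > self.hard_limit_usd:
            return "blocked"  # hard limit: reject and leave spend unchanged
        self.spent_usd += cost_usd
        if self.spent_usd >= self.alert_fraction * self.hard_limit_usd:
            return "alert"  # soft limit: allow the request but raise an alert
        return "ok"
```

In this sketch, a request that would push spend past the hard limit is rejected outright, while crossing the alert fraction only changes the returned status. A real gateway would typically apply such checks per workspace or globally and would use far more robust PII detection than a single regular expression.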

