Routing Configuration UI

The Routing section of the MultiRoute dashboard lets you configure how requests to the OpenAI-compatible /openai/v1 API are distributed across models and providers, without changing your application code.

Conceptually, it is a human-friendly layer on top of the same configuration that the Providers page and the /v1/configs API expose.

Routing Profiles and Rules

Routing is typically organized into profiles or rulesets. Each profile describes how to handle a class of requests, such as:

“Default chat completions”
“High-accuracy reasoning”
“Low-latency responses for UI”

Within each profile you can define:

Target models: Which models are eligible to receive traffic.
Weights or priorities: How traffic should be distributed across models.
Fallback behavior: What to do when a provider or model is unavailable or failing.

Your application then references these profiles via configuration or by selecting the appropriate model or routing key when calling /openai/v1 endpoints.

Managing Model Priorities

For each routing profile, the UI allows you to:

Choose a primary model: The first-choice model for normal operation.
Set secondary models: Ordered or weighted fallbacks that can receive traffic when needed.
Adjust weights: For example, send 80% of traffic to one model and 20% to another for canarying or A/B testing.

Changes are applied at the configuration level, so you can:

Shift traffic gradually between models.
Test new models with a small percentage of traffic.
Roll back to a known-good configuration if issues arise.

Timeouts, Retries, and Failover

The UI exposes high-level controls for reliability behavior, such as:

Request timeouts: How long MultiRoute should wait for a provider before treating it as failed.
Retry policies: Whether and how often to retry on specific error types (e.g. network errors vs. validation errors).
Failover settings: When to send the request to an alternate model or provider.

The intent is to give you sensible defaults out of the box, while still allowing fine-grained tuning for latency-sensitive or mission-critical workloads.

Drafting and Reviewing Changes

Routing changes often affect production traffic, so the dashboard is designed to support a safe workflow:

Edit in draft: Make changes to profiles and rules without immediately affecting live traffic.
Preview impact: See which models and providers will receive traffic under the new configuration.
Review and confirm: Optionally require a second approval before applying changes (in progress).
Apply changes: Promote the draft to active configuration for the selected project or environment.

This workflow is especially helpful when multiple teams collaborate on routing policies.

Relationship to the Providers page and /v1/configs API

All routing configuration managed in the UI is stored in the same configuration layer that the Providers page and the /v1/configs API read and write.

This means you can:

Use the API for automation (CI/CD, infrastructure-as-code).
Use the UI for inspection, quick edits, and debugging.

The recommended pattern is:

Source-of-truth in version-controlled configuration via /v1/configs for production-critical setups.
Use the dashboard UI to visualize, verify, and occasionally adjust routing behavior, especially during incident response or experiments.