Solutions

Change the model. Keep the harness.

Model-agnostic AI agents with BYOM — Anthropic-native, OpenAI-compatible, or Gemini-compatible. Switch providers or run them side by side by config, never rewrite the Harness.

You run several models in production and tune for cost, quality, and latency — and you’re allergic to anything that hard-couples the work to one provider. Oraclous treats the LLM as a resource the Agent uses, not the Agent itself: with BYOM you move between providers, or run them side by side, without touching the work definition.

See BYOM → Read the BYOM docs →

Why does switching LLM vendors mean rewriting your agents?

Because most stacks bake the model into the agent. The agent code is the vendor — so when a provider shifts on price or quality, or you want to add a local open-weight model to the mix, switching means a rewrite. There’s no clean separation between the work and the model that does it, and your provider credentials end up scattered and hard to govern. That’s a poor place to negotiate from, and a worse place to be when a better model ships next week.

Oraclous removes the bind by drawing the line where it belongs: the work is one thing, the model is another. (This is vendor lock-in in full — model, platform, and exit; see vendor lock-in.)

How does Oraclous help multi-model teams?

The model is a dial, not a wall. Oraclous resolves LLM configuration at three levels — agent, workspace, or organisation — across three protocol shapes: Anthropic-native, OpenAI-compatible, and Gemini-compatible. The Harness describes what gets done; the model config describes which engine reasons over it. So you switch providers, or run two side by side on the same task, by editing config — the orchestration spec, actor roster, policy envelope, and task board never move. There’s no agent code to rewrite, because in Oraclous you describe goals, not agent code.

Citable answer — How does Oraclous help multi-model teams? Oraclous makes the model a resource the Agent uses, not the Agent itself. Through BYOM it resolves LLM config at agent, workspace, or organisation level across three protocol shapes — Anthropic-native, OpenAI-compatible, and Gemini-compatible — so teams move between providers, or run them side by side, by changing config rather than rewriting the Harness.

What is BYOM? → · What is an Agent? → · Vendor lock-in →

How would this work for my team?

Two capabilities carry the multi-model story, each a deeper page on the platform:

Run frontier and local models under one operational model, route work to the best or cheapest engine per task, and keep every provider credential under the org’s control — without the work definition ever knowing which vendor answered.

How do I know it holds up?

The separation is architectural and documented. ADR-007 specifies the three protocol shapes and the envelope-encryption of provider credentials; §2 of the architecture states the principle that “the LLM is a resource the Agent uses.” The harness-runtime-service carries the LLM client factory and the agent → workspace → organisation config resolution, and the Credential Broker holds BYOM credentials, resolved per invocation and never cached outside it. It’s open source, so you can verify the model is genuinely decoupled from the work — read the code.

One honest boundary, stated plainly: support is for the three named protocol shapes. A provider fits if it speaks the Anthropic-native, OpenAI-compatible, or Gemini-compatible wire protocol — which covers the major frontier APIs and most local runtimes (including Ollama via its OpenAI-compatible endpoint), but we don’t claim universal coverage beyond those three. If your provider speaks one of the three shapes, it works; if it doesn’t, it’s a question of the adapter, and we say so rather than over-claim.

Frequently asked questions

Q: What does BYOM mean? A: BYOM (Bring Your Own Model) means you supply the model provider and credentials, and Oraclous runs against them, treating the LLM as a resource an Agent uses rather than the Agent itself. Model configuration resolves at agent, workspace, or organisation level — so model choice stays yours, and switching is a config change, not a rewrite.

Q: Does my provider fit the three protocol shapes? A: If it speaks Anthropic-native, OpenAI-compatible, or Gemini-compatible wire protocol, yes. Between them the three shapes cover Claude, GPT-class models, Gemini, and any local or self-hosted model that exposes one of those interfaces. Support is scoped to those three shapes — stated honestly rather than claiming universal coverage.

Q: Can I run a local model like Ollama? A: Yes, if it exposes one of the three supported wire protocols. Local and self-hosted runtimes that expose an OpenAI-compatible endpoint — Ollama among them — work through the OpenAI-compatible shape, so you can run frontier and local models under one operational model, side by side in the same Harness.

Q: Where do my model keys live? A: In the Credential Broker, envelope-encrypted under your organisation’s KMS-controlled wrapping key — never in the Harness and never in plaintext. Keys are resolved per invocation and never cached outside the broker. In cloud mode the wrapping key is customer-controlled, so Oraclous cannot unilaterally decrypt your model keys.

Q: Does switching models change the harness? A: No. The Harness describes what gets done; model config describes which engine reasons over it. Switching providers is a configuration change resolved at runtime — the orchestration spec, actor roster, policy envelope, and task board stay unchanged. There is no agent code to rewrite, because you describe goals, not code.