Debating Bots is built around structured debate: two frontier models argue opposing sides, reject weak proposals, and only converge when they've actually earned it — supervised by an independent judge with no allegiance to either side. When you just need a fast read, Ask All shows multiple labs side by side. When you need it inside your own product, the API runs the same debate engine for you.
The engine doesn't follow a script. It reacts to what the models actually say — escalating when they disagree, resolving when they converge, and guaranteeing you always get a final answer. No two debates take the same path.
Inspired by "AI Safety via Debate" (Irving, Christiano & Amodei, 2018), which proposed that two AI agents debating adversarially produce more truthful answers than either could alone.
The engine is designed for genuine disagreement. Each backstop catches a specific failure and escalates to the next. They fire in order, and each one protects against the previous one being insufficient.
The debate engine is a numbered-step state machine. Each turn, models respond in parallel via server-sent events with real-time streaming to your browser. The engine tracks rejection counts, convergence scores, budget consumption, vote state, merge rounds, and revision history — reacting dynamically to what the models actually produce.
Provider-agnostic by design. Each debater can be any combination of OpenAI GPT, Anthropic Claude, Google Gemini, xAI Grok, Alibaba Qwen, DeepSeek, or Mistral. The engine automatically picks the best model pair based on question complexity — routing simple questions to faster models and complex ones to heavier reasoning tiers. The judge is always from a different provider than either debater. Real-time cost tracking keeps every debate within budget, and models have live web search and code execution so they argue with current data and verifiable calculations.
Not every question needs a debate. Sometimes you just want to see what seven different AI providers think. Sometimes you want an AI that decides for you. Use the right tool for the question.
A real debate about consciousness. Two AI models with genuinely irreconcilable philosophical positions, fighting through backstops, judge challenges, and status checks until the judge steps in.
The public API gives you the same debate system behind the app: bearer-token auth, debate start and status polling, live server-sent events, cancellation, usage balance, model discovery, optional webhooks, repo zip uploads, and idempotent starts so retries don't double-charge.
Generate an API key from your account, add balance, then choose the integration pattern that fits your stack.
/api/v1/models.php and /api/v1/balance.php as the live source of truthDocs are public. Keys are self-serve. The API is versioned. Browser and API usage share the same balance, so you can top up once and use it anywhere.
No subscriptions. Flat pricing per debate. The engine automatically picks the right models based on your question's complexity — you just ask and pay the same price every time.
AI models are confident. They're articulate. They're often wrong. The only reliable way to find the truth is the same way humans have always done it — put two smart minds in a room and let them argue until what's left is what actually holds up.
Asking one model to double-check itself searches the same training data twice. Different providers means different training — different blind spots, different gaps. What one misses, another was trained on. The debate is what filters the signal from the noise.
Start a DebateLeading the development of cross-model deliberation systems — orchestrating structured debate between AI models from competing providers (OpenAI, Google, Anthropic, xAI, Alibaba, DeepSeek, Mistral) through a single, unified interface.