Agent Shortlist

Issue 01 · The Shortlist · April 2026

The AI agent platforms worth your team's onboarding time in 2026.

We tested five categories across identical builder workflows: open-source harnesses and autonomous agents, no-code SaaS, workflow builders, the coding agents reshaping how developers work in 2026, and voice AI agents handling phone and web conversations. Plus the one enterprise platform operators keep asking about.

~25 min read·22 platforms reviewed·Updated April 2026

Most shortlists about AI agents are written for people who want to be impressed. This one is written for people who have to decide.

We tested four categories of tools: open-source harnesses (OpenClaw, Hermes, Paperclip), no-code SaaS platforms (Lindy, Relevance AI), developer-oriented workflow builders (n8n, Stack AI), and the coding agent most builders already pay for and underuse (Claude Code). We added Microsoft Copilot Studio because a meaningful number of operators are evaluating it by default. Nine platforms total, put through identical test protocols.

The single most useful thing we can tell you: there is no universal best platform. The right answer depends on your team's technical capacity, the workflow you're automating, and whether data control matters to you. The decision matrix at the bottom of this page maps those variables to our recommendations.

How we tested

Every platform was put through four identical operator workflows:

  1. 01
    Sales follow-up. Trigger an email sequence when a lead goes cold. Log the contact, draft a personalized follow-up, send it, update the CRM record.
  2. 02
    Support triage. Read an incoming support ticket, classify it by urgency and type, draft a first response, route it to the right queue.
  3. 03
    Internal Q&A. Answer questions about internal documentation — product specs, SOPs, past decision notes.
  4. 04
    Spreadsheet automation. Pull data from an external source, clean it, update a Google Sheet, send a Slack summary.

We measured: time to working agent · failure rate · customization ceiling · maintenance burden

Before the rankings

These platforms are not all competing for the same job.

Pick the category first, then pick the platform. Using a harness for a use case that needs a SaaS tool, or vice versa, is the single most common mistake builders make.

Open-source harnesses

Software you run on your own infrastructure. Maximum control, maximum flexibility, maximum setup. Requires technical capacity.

OpenClaw, Hermes, Paperclip

No-code SaaS

Managed platforms accessible via browser. Pay monthly, nothing to maintain, working in an afternoon.

Lindy, Relevance AI

Workflow builders

Visual interfaces with serious power. Usually require a developer for setup, then accessible to non-technical users.

n8n, Stack AI

Coding agents

AI that lives in your codebase. Reads files, edits code, runs tests. Seven real options now — from the autocomplete-default everyone already pays for to the open-source CLI alternative.

Claude Code, Cursor, Aider, GitHub Copilot, Augment, Amp, OpenAI Codex

Voice AI agents

AI that talks to your customers on the phone or web. Handles outbound sales, inbound support, appointment booking. Pay-per-minute pricing, sub-second latency.

Retell AI, Vapi, Bland AI, ElevenLabs Conversational

Enterprise platforms

Designed for large organisations with existing vendor relationships. Fit inside procurement processes, not head-to-head feature comparisons.

Microsoft Copilot Studio, Azure AI Agent Service, Vertex AI Agent Builder

01

Open-source harnesses & autonomous agents

For builders who want control over their data and infrastructure, plus the autonomous-agent products gaining attention with non-technical users.

Open-source harness · 365k ★ GitHub · MIT

OpenClaw

377,771+12,473 in 45d2,507 contributors

The most mature open-source agent harness. If you want one AI doing things across your tools and devices, start here.

365,000 GitHub stars. One of the most popular AI projects ever built. OpenClaw is the standard for personal AI harnesses — install it on Mac, Windows, or Linux, connect it to a messaging platform, and you have an AI with hands: it browses the web, reads email, executes scripts, fills forms, and talks to your APIs.

It supports 20+ messaging platforms — WhatsApp, Telegram, iMessage, Discord, Slack, and more — and works with any AI model. Claude, GPT-4, local alternatives: you're not locked in. The community is the main reason to choose it. 365k stars means 500+ contributors, thousands of community-built skills, and documentation that's been written, corrected, and improved by people who actually use it.

The limit worth knowing: OpenClaw is single-user at its core. It's built for one AI serving one person. If you need a coordinated fleet of agents across a team, you need Paperclip as the orchestration layer — OpenClaw slots in underneath it.

Best for

Technical operators who want a self-hosted personal AI running 24/7. Individuals automating their own workflows. Teams evaluating harnesses for the first time.

Skip if

You need a managed platform with no infrastructure overhead, or you're deploying for a non-technical team who won't maintain it.

Full review: OpenClaw

Open-source harness · 119k ★ GitHub · MIT · Nous Research

Hermes

188,226+67,817 in 45d1,490 contributors

The most technically sophisticated open-source agent. If you want an AI that gets better at your specific workflows over time, Hermes is the only real option.

Built by Nous Research — one of the few AI labs with real research credibility outside the big players. The defining feature is the learning loop: Hermes creates skills from experience, improves them during use, and builds a deepening model of who you are and how you work. Most agents reset every session. Hermes compounds.

Use it to prepare sales briefings for a month and it gets better at preparing sales briefings — not because you re-prompted it, but because it created a skill from watching itself succeed and fail. Persistent memory means it remembers your preferences, your context, and what worked last time.

Hermes runs on a server (Docker, SSH, Modal, Singularity) rather than your local machine — it operates 24/7 without your laptop needing to be on. Supports 200+ models via OpenRouter. The Atropos RL integration connects it to frontier research methods: it's the only production harness built by people actively advancing agent capabilities, not just packaging them.

Best for

Developers and technical operators who want a server-deployed agent that builds institutional memory and improves from experience.

Skip if

You want something up and running in an afternoon. Hermes rewards sustained use — it's not the right choice for quick evaluation.

Full review: Hermes

Agent orchestration · 59k ★ GitHub · MIT

Paperclip

69,721+10,155 in 45d110 contributors

The only serious open-source platform for orchestrating teams of agents. If you're past one agent doing one thing, Paperclip is the layer you need.

Paperclip's own framing is the most accurate description we've found: if OpenClaw is an employee, Paperclip is the company. It's an orchestration layer — org charts, reporting lines, budget limits, approval gates, audit logs — for a workforce of AI agents.

It works with any agent runtime: OpenClaw, Claude Code, Cursor, custom HTTP agents. Paperclip doesn't care what's underneath. What it provides is the management infrastructure: assign tasks, track costs, enforce spending limits, require human approval for high-stakes actions, log every decision immutably. The heartbeat system wakes agents on schedule, tells them what to check, and escalates anything that needs a human.

Hard per-agent monthly budget limits are the feature that earns its place: runaway API spend is a real failure mode when agents work autonomously. Paperclip prevents it structurally rather than relying on you to catch it.

Best for

Teams running multiple AI agents who need org structure, budget controls, and approval workflows. Autonomous operations that need governance.

Skip if

You're just getting started with AI agents. Paperclip is infrastructure — it assumes you already have agents to manage.

Full review: Paperclip

Autonomous Agent · Manus AI · ~$39/month

Manus AI

The autonomous AI agent that went viral in early 2025. Genuinely capable for browser-based research tasks; less differentiated for builders who already have Claude Code or a coding agent setup.

Manus from Butterfly Effect AI launched in early 2025 and went viral on social media for autonomous demos: "do this multi-step research task" and the agent would open a browser, navigate sites, fill forms, write reports.

The product's capabilities have been real but uneven — some tasks land impressively, others fail silently or take long enough that watching is painful. Pricing has shifted multiple times. Strong for non-technical users who want an agent that "just does the thing" without prompt engineering.

For builders evaluating autonomous agents, Manus is worth a free-tier trial. For builders already invested in Claude Code or another agentic stack, the differentiator gets thinner.

Best for

Non-technical builders who want an autonomous agent that can browse the web, research, and produce structured deliverables — without setting up a CLI or writing prompts repeatedly.

Skip if

You're a developer who already has Claude Code, OpenAI Codex, or a similar agentic setup. The autonomous-browser angle is less useful when you already have a code-aware agent that can browse via MCP.

Full review: Manus AI

02

No-code SaaS

For operators without developers who need something working this week. You trade control for convenience.

No-code SaaS · From $49/month

Lindy

The best no-code AI agent platform for operators. Non-technical teams can ship real automations in hours.

The most operator-friendly platform we tested. No code required — describe what you want your agent to do in plain English and Lindy builds it. The pre-built templates for sales follow-up, meeting prep, and support triage are polished in a way that matters: not technically works polished, but you could ship this to customers polished.

The tradeoff is the usual SaaS bargain: you're on their infrastructure, their pricing, and their data policies. For standard B2B workflows with non-sensitive data, that tradeoff is worth it. For anything touching customer PII, compliance requirements, or workflows that need to do things Lindy hasn't anticipated — you'll hit the ceiling.

Best for

Sales and support teams at B2B companies who need working agents in days, with no developer on staff and no tolerance for server management.

Skip if

You have complex custom integration requirements, deal with regulated data, or need the customization ceiling of a developer tool.

Full review: Lindy

No-code SaaS · From $19/month

Relevance AI

The most powerful no-code agent builder. More complex than Lindy, but gives skilled non-developers real control.

Relevance AI sits between Lindy (pure no-code) and n8n (developer-required) on the complexity spectrum. The tool-building interface is the best we've seen in the no-code category: chain tasks, set conditional logic, build multi-step research workflows, configure how the agent reasons about edge cases — all without writing code.

If your team has a builder persona — the person who built the company's Notion system or manages Zapier — Relevance gives them more power than any other no-code platform. It's particularly strong for outbound research and sales intelligence workflows where the logic is complex but the stakes don't require a custom engineering solution.

Best for

Ops teams with one skilled builder who needs more than templates — someone who thinks in processes and isn't afraid of a workflow editor.

Skip if

You need something genuinely no-code (use Lindy) or genuinely powerful with a developer available (use n8n). Relevance is best in the middle.

Full review: Relevance AI

03

Workflow builders

For teams with one developer. The most flexible options — and the highest ceiling short of building from scratch.

Workflow builder · Open-source · From $24/month cloud

n8n

191,721+5,871 in 45d667 contributors

The best workflow automation platform for teams with a developer. Beats every no-code tool for complex automations.

n8n is what technical operators reach for when Zapier or Make isn't flexible enough. 400+ integrations, self-hostable for full data control, and a native AI agent layer that has improved significantly. The visual workflow builder handles conditional logic, loops, error handling, webhooks, and custom HTTP calls without writing code — once it's set up.

The ceiling is the point. If your team has even one developer who can spend a day configuring it, n8n will outperform any no-code platform for complex, multi-step automations involving AI agents. Self-hosted means your data never leaves your infrastructure. The cloud version removes the maintenance overhead if that tradeoff works for you.

Best for

Teams with at least one developer who need flexible, powerful workflow automation with AI agents built in — and want the option of full data control via self-hosting.

Skip if

You have zero technical resources. The initial setup requires someone comfortable configuring a server. Lindy or Relevance are better starting points.

Full review: n8n

AI application builder · SOC 2 · From $199/month teams

Stack AI

Best for internal knowledge base and document Q&A. Strong in its lane, expensive outside it.

Stack AI shines on document-heavy use cases: ingest your internal knowledge base and make it queryable by AI. If your team spends hours looking through Notion, Confluence, or shared drives for answers, Stack AI can turn those documents into an AI that responds in plain English. The RAG implementation is one of the better ones in the no-code category.

The SOC 2 compliance and enterprise data connectors make it defensible to IT and legal in a way that some cheaper alternatives aren't. Where it falls short is flexibility — multi-step automations with conditional logic are better served by n8n or Relevance AI. The $199/month team price is hard to justify unless document Q&A is a core use case.

Best for

Ops teams who want AI agents over internal documents — SOPs, contracts, product specs, Notion wikis. Teams where compliance matters.

Skip if

You need multi-step process automation or external-facing workflows. n8n handles those better at lower cost.

Full review: Stack AI

04

Coding agents

AI that works inside your codebase. The category most builders are already paying for and using a fraction of.

Coding Agent · Included with Claude Pro ($20/mo) and above

Claude Code

131,238+12,779 in 45d53 contributors

Most builders pay for Claude and use 5% of what it can do. Claude Code is the rest. The biggest productivity step most builders haven't taken yet.

If you've used Claude in the browser, you've seen what it can do with code when you paste it in. Claude Code removes the paste step. It reads your files directly. It runs commands. It commits to git. It opens pull requests. The work happens in your project, not in a chat window.

The thing that separates it from GitHub Copilot or Cursor: it's not completing your sentences. It's completing your tasks. Copilot suggests the next line. Claude Code handles the next hour of work. You give it a brief in plain English; it reads the relevant files, writes a plan, makes coordinated edits across multiple files, runs your tests, and tells you what it did.

The economic case is the unusual part. If you already pay $20/month for Claude, Claude Code is included. There's no separate subscription. The same tool you use to draft email is the tool that can rewrite your codebase. Most people miss this entirely. The biggest gap on this list isn't between Claude Code and its competitors — it's between builders who use Claude Code and builders who don't know it exists.

Best for

Builders with a Claude subscription who want to go further — developers automating engineering work, founders building internal tools, non-developers who've realised Claude can write and run code given the right environment.

Skip if

You haven't yet hit the ceiling of what Claude can do in the browser. Start there. Once you've maxed out chat-based workflows, Claude Code is the next step.

Full review: Claude Code

Coding Agent · Cursor · From $20/month (free tier available)

Cursor

The most-used AI coding tool by paying customers — $2B in annualised revenue, 360k paying users. Multi-model flexibility is a real edge. The June 2025 pricing change burned some early adopters.

Cursor is a VS Code fork with AI baked into the editor. The headline is multi-model: Claude Sonnet 4.6, GPT-5, and Gemini 2.0 are all available, and you can switch between them mid-session for the same task. Claude Code only runs Claude. Codex only runs OpenAI. Cursor lets you mix and match.

The IDE-first approach is faster than CLI workflows for exploration and rapid prototyping — builders shipping MVPs report 5–10× faster time to first working prototype. The trade-off is VS Code lock-in: no JetBrains, no Vim, no terminal-first workflows. The June 2025 pricing pivot replaced fixed fast-request quotas with a $20 credit pool, which effectively cut monthly requests by ~55% without an announcement. Existing users felt blindsided.

Despite that, Cursor still hit $2B annualised revenue by early 2026. The market has voted, even if the vote is "we'll tolerate questionable pricing changes for the multi-model UX."

Best for

Builders who want an IDE-first AI experience and the ability to switch between Claude, GPT, and Gemini mid-session. Strong for rapid prototyping and exploration.

Skip if

You're committed to JetBrains, Vim, or any non-VS Code editor. You want CLI-first workflows. You're sensitive to SaaS pricing changes.

Full review: Cursor

Coding Agent · Aider · Free (BYOK) · 41.6k ★ GitHub · Apache 2.0

Aider

45,921+1,902 in 45d182 contributors

The open-source pick. Bring your own API keys, switch models mid-session, and use 4× fewer tokens than Claude Code on identical work. Trade-off: lower accuracy and a smaller community.

Aider is the only tool on this list that's fully open source and model-agnostic. Bring your own API keys: Claude, GPT, DeepSeek, Gemini, local models — Aider works with all of them, and you can switch mid-session. The economics are different from every other tool here. There's no subscription. You pay only for the model API calls you make.

The token efficiency claim is the sleeper feature: Aider uses 4.2× fewer tokens than Claude Code on identical tasks (verified via independent benchmarks). For high-volume teams, that's real money. A team running heavy AI-assisted workflows can cut their model bill by 70%+ vs running everything through Claude Code on Opus.

The git-native workflow is the other quiet win: every change auto-commits with a clean message. Perfect audit trail, easy rollback. The trade-off — and it's a real one — is accuracy. Aider lands around 85% on technical benchmarks vs 91%+ for Claude Code or Cursor. For mainstream languages and well-trodden patterns, you won't notice. For exotic frameworks, you will.

Best for

Cost-conscious developers, open-source purists, anyone who wants to mix Claude, GPT, DeepSeek, and Gemini in one workflow. Strong for surgical refactoring and audit-friendly git workflows.

Skip if

You need maximum accuracy on complex tasks (Aider lands around 85%) or you rely on enterprise-grade vendor support.

Full review: Aider

Coding Agent · GitHub · From free, Pro $10/month, Business $19/user/month

GitHub Copilot

The default AI coding tool by reach — most developers already have it. Started as autocomplete, now has real agent mode. Best for builders who want one tool that lives where they already work.

Copilot is the most-installed AI coding tool by a wide margin — it ships with most GitHub plans, integrates natively into VS Code and JetBrains, and benefits from Microsoft's enterprise procurement reach. The product has evolved meaningfully: started as line-completion (2021), added Chat (2023), then Agent Mode (2025) for multi-file edits.

As of 2026 it can run coordinated changes across files, write tests, and create pull requests — though it's still catching up to Claude Code and Cursor on raw agentic depth. The model layer is multi-vendor: Copilot now uses GPT-5.5, Claude Sonnet 4.6, and others depending on the task.

The headline trade-off is breadth vs depth. Copilot covers more workflows than any competitor (chat, autocomplete, agent, code review) but isn't the strongest at any single one. For builders who want one tool inside their existing IDE and don't want to adopt a separate CLI or fork of VS Code, Copilot is the path of least resistance.

Best for

Teams already on GitHub Enterprise or Business. Developers who want autocomplete-plus-agent in one tool without leaving VS Code or JetBrains.

Skip if

You want the most agentic tool on the market — Claude Code and Cursor are further along on multi-file autonomous workflows. You're not on the GitHub stack.

Full review: GitHub Copilot

Coding Agent · Augment Code · ~$50/user/month

Augment Code

Strong agentic coding tool with deep codebase context. Best for builders working in large monorepos where tools like Cursor or Copilot lose the thread. Pricing is higher than most competitors.

Augment raised over $250M to build an agentic coding tool optimised for large codebases. The differentiator is the Augment Engine — a codebase indexing layer that gives the AI agent deep, real-time context about every file in your repo, not just the ones you have open.

For teams in monorepos with hundreds of thousands of files, this is a meaningful edge. The agent understands cross-cutting concerns, can trace dependency graphs, and makes refactors that respect existing architecture. Built for IDE workflows (VS Code, JetBrains) plus a chat interface.

Pricing is higher than Claude Code or Cursor — the Augment team is positioning toward engineering organisations that can justify per-seat costs against the productivity gain. For small teams or solo developers, the codebase-context advantage doesn't pay back the price. For large engineering orgs, it can.

Best for

Engineering teams in large codebases (100k+ files, multi-million lines) where context-awareness across the repo matters more than raw model speed. Strong for refactoring legacy systems.

Skip if

You're a solo developer or working in a small project — the Augment Engine's codebase indexing is overkill for a 50-file repo. Cursor or Claude Code give better value at smaller scale.

Full review: Augment Code

Coding Agent · Amp by Sourcegraph · Free tier + paid

Amp

Sourcegraph's agentic coding tool, leveraging years of investment in code search and indexing. Strong context-awareness story for teams already running Sourcegraph; less compelling as a standalone purchase.

Amp is Sourcegraph's agentic coding tool, launched in 2025. The differentiator is leverage. Sourcegraph has spent years building Cody (their AI coding assistant) and the Sourcegraph Code Search engine, which indexes large codebases for fast semantic search. Amp builds on that infrastructure.

When the agent needs context about a function, a class, a usage pattern, it queries Sourcegraph's pre-built index rather than scanning files at runtime. For teams already running Sourcegraph, this means faster and more accurate context retrieval. The free tier is real and useful for individual evaluation.

Where Amp is less compelling is for teams not already on Sourcegraph — the standalone value proposition vs Claude Code, Augment, or Cursor isn't as differentiated. The product is solid; the strategic moat is the Sourcegraph install base.

Best for

Engineering teams already paying for Sourcegraph Code Search who want to add an AI agent that reuses the existing codebase index. Free tier is generous enough for individual evaluation.

Skip if

Your team isn't on Sourcegraph. Amp's standalone story is less differentiated than Claude Code or Augment, and the indexing layer is its main edge.

Full review: Amp

Coding Agent · OpenAI · $20/month base + usage credits

OpenAI Codex

89,847+11,565 in 45d473 contributors

The fastest-growing coding agent on the market — 3 million weekly active users, 70%+ MoM token growth. Rolling 5-hour credit limits are a real operational pain. Best if you're already locked into OpenAI's ecosystem.

Codex CLI is OpenAI's answer to Claude Code: a Rust-based terminal coding agent with 75.6k GitHub stars and 3 million weekly active users. Token usage is growing 70%+ month over month, and Codex is currently outselling Cursor in some metrics. The product ships fast — multi-agent v2 workflows with inter-agent messaging, integrated terminal feedback (it can read your dev server output and build logs in-thread), Windows native plus WSL2 support.

The community's loudest criticism is the rolling 5-hour credit window: heavy Monday morning use can block you out of Codex by Monday afternoon. The June 2025 pricing overhaul kept the $20 base but moved to usage-based credits — predictable for light users, painful for heavy ones. If your team works in bursts, plan around it.

The structural trade-off is OpenAI lock-in. Codex only runs OpenAI models. If you want to mix Claude into your workflow, Cursor or Aider are better picks. If you're already committed to GPT-5+, Codex is the strongest CLI option in OpenAI's ecosystem.

Best for

Developers committed to GPT-5+ models who want a Claude Code equivalent without leaving the OpenAI ecosystem. Teams that prioritise the most recent OpenAI features.

Skip if

You need predictable monthly costs (rolling credit limits cause unpredictable workflow blocks) or you want to use Claude or Gemini in your workflow.

Full review: OpenAI Codex

05

Voice AI agents

The newest category on this list — AI agents that handle phone and web voice conversations. Pay-per-minute pricing, sub-second latency, and a real shift in what's possible for sales and support workflows.

Voice AI · Retell AI · ~$0.07–0.10 per minute

Retell AI

The most builder-friendly voice agent platform — clean SDK, predictable pricing, low latency. Best for teams shipping production voice agents without building infrastructure from scratch.

Retell handles the hard parts — sub-second latency for natural conversation, telephony integration, call routing, transcription, function-calling — and exposes a clean SDK on top. You bring the prompt and the business logic; Retell handles the voice infrastructure.

Pricing is straightforward per-minute, which most teams can model directly. Used by sales teams running outbound, support teams handling tier-1 tickets, and operations teams automating appointment scheduling.

The trade-off is lock-in to Retell's stack — voice quality, latency profile, and routing logic are all theirs. For teams that need ownership at that level, ElevenLabs Conversational or building on Twilio + a separate LLM is the alternative.

Best for

B2B teams deploying voice agents for outbound sales, customer support, or appointment booking. Strong for builders who want managed voice infrastructure without owning the telephony stack.

Skip if

You need full control over the voice synthesis pipeline (use ElevenLabs Conversational), or your call volume is too low to justify per-minute pricing.

Full review: Retell AI

Voice AI · Vapi · ~$0.05–0.08 per minute

Vapi

Developer-first voice agent infrastructure with strong customisation hooks. Best for teams that want more control over the voice pipeline than Retell allows but don't want to build from scratch.

Where Retell hides the voice stack, Vapi exposes it: pick your LLM (Claude, GPT, Gemini), pick your voice synthesis provider (ElevenLabs, PlayHT, Cartesia), pick your transcription service. The flexibility comes with a steeper learning curve.

For teams shipping voice products at scale or with specific quality requirements, the control is meaningful. Slightly cheaper than Retell on per-minute pricing, with stronger webhook and API surface for integrations. Used by voice-product companies that want to white-label and customise.

Best for

Engineering teams building production voice products who need fine control over the model, voice synthesis provider, and call routing. Strong API and webhook story.

Skip if

You're a non-technical team — Retell's SDK is more accessible. You don't need the customisation depth Vapi offers.

Full review: Vapi

Voice AI · Bland AI · ~$0.09–0.12 per minute

Bland AI

Phone-call-focused voice agents with strong throughput. Best for high-volume telephony use cases like outbound sales campaigns. Less polished for nuanced conversational work.

Bland is the most phone-native of the major voice agent platforms — built first for outbound and inbound phone calls at high volume. Strong infrastructure for running campaigns: 1,000 simultaneous calls, automatic retry logic, transcript storage, CRM webhooks.

The voice quality is acceptable but less natural than ElevenLabs-based platforms; latency is slightly higher than Retell. For B2B outbound campaigns where the goal is throughput and qualification rather than premium conversational UX, Bland is the right tool.

Best for

Teams running high-volume outbound phone campaigns — SDR fleets, lead qualification, appointment setting at scale. Direct phone integration is its strength.

Skip if

You need natural-sounding conversational voice for premium customer experiences. Bland skews toward throughput over polish.

Full review: Bland AI

Voice AI · ElevenLabs Conversational · ~$0.10–0.15 per minute

ElevenLabs Conversational AI

The best-sounding voice agent platform on the market. Built on ElevenLabs' industry-leading voice synthesis. Worth the premium when voice quality is part of the product.

The voice quality is the differentiator. In blind tests, ElevenLabs voices are consistently rated more natural and emotionally accurate than competitors. The conversational layer adds the agent infrastructure — telephony, function-calling, multi-turn memory — but the headline feature is voice.

Used by premium brands (luxury, healthcare, financial services) where the customer's first impression is the voice. Multilingual support is also strong — ElevenLabs handles 30+ languages with quality that rivals native-language alternatives. Pricing is the highest in the category, justified by quality.

Best for

Premium customer experiences where voice quality is part of the brand — concierge services, high-end B2B sales, healthcare conversations. Also strong for multilingual deployments.

Skip if

You're running high-volume cost-sensitive use cases — Bland or Vapi are cheaper at scale. You don't need premium voice quality.

Full review: ElevenLabs Conversational AI

06

Enterprise

For large organisations where vendor relationships, compliance certifications, and existing IT infrastructure drive the decision.

Enterprise platform · Microsoft · $200/month per 25k messages

Microsoft Copilot Studio

Best AI agent platform for Microsoft-first organizations. Outside a Teams/SharePoint/Dynamics environment, there's no reason to use it.

The most straightforward recommendation on this list: if your organization runs on Microsoft 365, Teams, SharePoint, and Dynamics — evaluate Copilot Studio seriously. If it doesn't, don't.

The native integrations are genuinely good. An agent built in Copilot Studio that reads SharePoint files, answers questions in Teams, and updates Dynamics records will outperform most alternatives in that specific stack. The enterprise compliance story — security, data residency, audit trails, Microsoft's certification relationships with regulated industries — is the best on this list.

Outside a Microsoft environment, the overhead isn't worth it. The platform has improved significantly in 2025 but it's designed for organizations where Microsoft controls the IT relationship. Speed and flexibility are not its strengths.

Best for

Large organizations on Microsoft 365 running Teams, SharePoint, and Dynamics. Regulated industries where Microsoft's compliance certifications are a requirement.

Skip if

You're not primarily a Microsoft shop. The integration advantage disappears entirely outside the Microsoft stack.

Full review: Microsoft Copilot Studio

Enterprise · Microsoft Azure · Pay-as-you-go

Azure AI Agent Service

Microsoft's developer-grade agent service running on Azure AI Foundry. Different audience from Copilot Studio — for engineering teams building production agents, not ops teams configuring no-code workflows.

Where Copilot Studio is the no-code path for ops teams, Agent Service is the SDK-and-API path for engineering teams. Built on Azure AI Foundry, it gives you OpenAI models, Azure-native security, function-calling, knowledge connectors to Azure SharePoint and Fabric, and the same enterprise compliance story as the rest of Azure.

Used by enterprise dev teams that have Azure procurement in place and need to ship production agents inside Microsoft's data boundary. Pricing is usage-based — no flat subscription, just per-token model costs plus standard Azure infrastructure. The trade-off is the usual Azure trade-off: powerful and enterprise-ready, but slower to iterate on than independent platforms.

Best for

Engineering teams already on Azure who want to build production AI agents with full code control, Azure-native security, and integration with Azure data services.

Skip if

You're a non-developer (Copilot Studio is the no-code path). You're not on Azure — the integration depth doesn't pay off elsewhere.

Full review: Azure AI Agent Service

Enterprise · Google Cloud · Pay-as-you-go

Vertex AI Agent Builder

Google's enterprise agent platform on Vertex AI. Strong if you're already on Google Cloud and want Gemini-native agents with BigQuery integration. Less compelling outside the Google stack.

Built on Vertex AI (Google's ML platform) and integrated with the Gemini model family, Vertex AI Agent Builder offers grounding via Google Search, knowledge ingestion from Google Workspace and BigQuery, and the same enterprise compliance story as the rest of Google Cloud.

The differentiator is Gemini's long-context window — agents can hold 1M+ tokens of context, useful for codebase analysis or long-document workflows. Where Vertex falls short is model flexibility (Gemini-only) and the usual cross-cloud friction if your stack isn't already on Google Cloud.

For teams already invested in Google Workspace and BigQuery, it's a natural extension. For everyone else, the integration advantage doesn't pay off.

Best for

Engineering teams on Google Cloud who want to build agents using Gemini's long-context capabilities and integrate directly with BigQuery, Cloud Storage, and Google Workspace.

Skip if

You're not on Google Cloud — Vertex's value proposition is integration depth that doesn't transfer. You want model flexibility — Vertex is Gemini-only.

Full review: Vertex AI Agent Builder

The skip list

Platforms we'd pass on, and why.

Make Agents (Make.com). Good if your team is already using Make for automation — the agent upgrade makes sense as an incremental improvement. Not worth switching for. The agent layer is still catching up to Lindy and Relevance for orchestration.
CrewAI. An excellent Python framework for developers building multi-agent systems. Not on the shortlist for the same reason Next.js isn't: it's a building block, not a deployable product for operators. If you have engineers building custom agent pipelines, evaluate it. If you're an operator looking to deploy, it's not for you.
Zapier Agents. Not ready. The AI layer on top of Zapier's workflow automation is currently too limited for serious use. Check back in 12 months.
AutoGPT and legacy autonomous agents. These projects pioneered the category but have been largely superseded by more capable and reliable harnesses. OpenClaw and Hermes are better in every dimension that matters for operator use.

Decision matrix

Which platform for which operator.

Non-technical team, need it working this week

Lindy

Non-technical, need more power than templates

Relevance AI

Have one developer, want maximum flexibility

n8n

Want a self-hosted personal AI on your own hardware

OpenClaw

Want an agent that improves over time, technical team

Hermes

Running multiple agents and need governance

Paperclip

Microsoft 365 / Teams / Dynamics shop

Copilot Studio

Need an internal document Q&A agent

Stack AI

Already pay for Claude and want to do more with it

Claude Code

Want an AI-first IDE with multi-model flexibility

Cursor

Open-source-first or cost-conscious about API spend

Aider

Already locked into the OpenAI / GPT-5 ecosystem

OpenAI Codex

Already on GitHub Business / Enterprise; want one tool everywhere

GitHub Copilot

Working in a large monorepo (100k+ files); refactoring legacy code

Augment Code

Already paying for Sourcegraph; want agent that reuses the index

Amp

Building a production voice agent (sales calls, support, booking)

Retell AI

Voice product where premium voice quality matters

ElevenLabs Conversational

High-volume outbound phone campaigns (SDR replacement)

Bland AI

Want an autonomous browser agent without writing code

Manus AI

Not sure? Use the Agent Picker →

Common questions

What builders ask about the AI agent landscape.

What are the best AI agents in 2026?

The right answer depends on what you're building. Our top picks by category:

  • Best no-code platform: Lindy — operator-friendly, fast time to first value.
  • Best workflow builder: n8n — flexible, self-hostable, strong AI agent layer.
  • Best coding agent: Claude Code — the highest-leverage tool most builders already pay for.
  • Best open-source harness: OpenClaw — the largest open-source agent community.
  • Best for orchestration: Paperclip — the only serious open-source platform for managing teams of agents.

The full ranking with verdicts, ratings, and context for each is in the sections above. If you'd rather skip the read, run the five-question picker.

What is an AI agent platform?

An AI agent platform is software that lets you deploy AI to actually do things, not just answer questions. The platform provides three layers: a connection to a large language model (Claude, GPT, Gemini), a way to give the AI access to your tools (email, CRM, databases, browsers), and a workflow layer that decides when the agent runs and what it does.

Without a platform, you have a chatbot — useful but not active. With a platform, the AI can read your support tickets, draft responses, send emails, update your CRM, and report back when it finishes. The platform is what makes the AI an "agent" rather than just a smart text generator.

We cover this distinction in more depth in the FAQ.

Are open-source AI agents better than SaaS platforms?

Better at different things. Open-source agents (OpenClaw, Hermes, Paperclip, Aider) give you full control: self-hosted, model-agnostic, no monthly subscription, no data leaving your infrastructure. The trade-off is setup time, maintenance burden, and a smaller polished-template ecosystem.

SaaS platforms (Lindy, Relevance AI, Stack AI) give you fast time-to-value: working agents in hours, polished templates, no infrastructure to manage. The trade-off is monthly subscription costs, your data sitting on their servers, and a ceiling on customisation.

For most teams without a developer: SaaS. For technical teams that care about data control or who need to run at scale: open-source. Many run both — SaaS for quick wins, open-source for the high-volume use cases where margins matter.

Should I use multiple AI agent platforms?

Most teams should start with one and add more only when they hit a specific gap. The most common pattern we see in mature AI-using teams:

  • One coding agent (Claude Code or Cursor) for the engineering team
  • One workflow tool (n8n or Lindy) for ops and sales automations
  • Sometimes a specialised tool for document Q&A (Stack AI) or orchestration (Paperclip)

Three platforms is plenty for most companies. More than that and you're paying for overlap or fragmenting your team across tools that don't share data. Use the picker to find your starting point and resist adding a second platform until you can name a specific workflow the first one can't do.

How often does the AI agent landscape change?

Fast — and faster on the model layer than the platform layer.

Pricing changes on the underlying models (Claude, GPT, Gemini) happen quarterly. We re-verify them weekly via an automated audit and update the calculator immediately. Platform features change every few months: new agent modes, new integrations, new tiers. Major platform launches happen ~3–4 times a year.

We update this shortlist quarterly to reflect new platforms, shifts in our verdicts, and pricing changes. Any platform marked on the skip list might come off it next quarter; any platform on the recommended list might drop down if the team behind it stops shipping.

To track shifts in real time: the GitHub stats on each platform page update weekly, and the cost calculator reflects current per-token pricing across 15 models.

Which AI agents are free?

Free in the "no platform subscription" sense:

  • OpenClaw — open-source, self-hosted, free forever
  • Hermes — open-source, server-deployed, free
  • Paperclip — open-source orchestration layer
  • Aider — open-source CLI coding agent
  • n8n — open-source self-hosted (cloud version is paid)

You still pay for the underlying model API calls (Claude, GPT, etc.) — see the cost calculator for what those actually cost. The platform itself is free; the AI behind it is per-token.

Some SaaS platforms also have free tiers: Lindy, Relevance AI, Cursor, GitHub Copilot, Amp. Limited usage but enough for individual evaluation.

Continue exploring

Open dataset

The reviews on this page are also published as a free, open dataset.

Every platform on the shortlist — verdict, rating, pros, cons, best-for, not-for — is exported to a structured platforms.json file on GitHub. CC-BY-4.0. No API key. Use it in your own comparison tools, research, or internal "what should we use" docs.