Five things we've learned about how to implement AI agents successfully.
The pressure to deploy is real, and so is the pattern of projects that stall or fail to reach production. Here are five things we've learned, across sectors and scales, about what makes implementing AI agents successful.
1. Not everything that looks like an agent problem is one.
We often see "agentic AI projects" that are, in reality, a retrieval problem. The challenge is often finding the right information and producing a coherent response. A well-designed RAG pipeline or carefully prompted LLM will solve that more reliably, more cheaply, and with far less operational overhead than an agent.
Agents add value when the system needs to decide what to do next — and act on it: calling tools, triggering steps, changing the environment. Reasoning alone doesn't require an agent. Action does.
The distinction sounds obvious, but in practice there is a thin line between a well-designed project and an over-engineered one. Before architecture, before tooling, ask yourself what the system actually needs to do. If the answer is "find and generate", adding an agent creates complexity without adding value.
The task is retrieval and synthesis? A RAG pipeline or prompted LLM will serve you more reliably. The task requires deciding what to do next and executing? That's where agents justify their complexity.
What type of problem do you have?
Do you need to...
↓
|
Find & generate ▼ moreYour AI needs to retrieve and synthesize, find the right information or produce a coherent response. Examples
Q&A
Summarisation
Document synthesis
Search
Use LLM and RAG. An agent adds complexity without improving the output. |
Act on the world ▼ moreYour AI needs to identify what to do next and execute: calling tools, triggering multi-step processes, changing state. Examples
Multi-step execution
Tool use
State changes
Real-world effects
Use agentic AI. An agent is the right solution. |
2. Workflow automation has a hard limit. You designed it.
When you automate a workflow, you codify how work gets done. Every step is designed by you, and the intelligence lives within that design. You can optimize every process and find ways to run faster through every process but your system won't find smarter ways to reach the same outcome.
Agentic systems can break this limitation by finding smarter ways to reach the goal you define. But you do need to give away some control. For well-defined tasks, workflows provide predictability, consistency, and better auditability. On the other hand, tasks with high variability, which genuinely can't be reduced to a fixed path, but still have a clear goal, can benefit from the flexibility of an agentic solution.
Fixed path vs flexible routes
Workflow automation
Cannot exceed the process you designed.
Agentic AI
Discovers paths you didn’t design.
"When you give an agent a goal and guardrails, it decides the path. That freedom is where it finds cross-process synergies, uncovers patterns, and surfaces opportunities you wouldn't have found by designing every step. The value of agents isn't pure efficiency. It's discovery."
Embracing this flexibility requires your organisation to be ready to act on what the agent finds, and willing to change how work gets done as a result.
3. The real risk is that it succeeds in ways you're not ready for.
The agent will find things no one asked it to look for. That's the point. But most organisations are built to execute on known plans instead of unexpected discoveries.
A diagnostics agent might identify a new combination of signals that predicts a technical issue earlier and more accurately than the indicators engineers currently rely on. The insight is only useful if an engineer is willing to test it — and that requires a different mindset than monitoring a known dashboard. It means trusting an unfamiliar pattern, designing a validation approach, and being open to changing how diagnosis is done.
Even if the agent succeeds, the organisation needs to be ready to incorporate the change it brings.
How ready is your organisation?
|
✓
|
We are willing to let agent outcomes reshape our processes, not just speed them up |
▼ |
This is the hardest condition to meet. It means being willing to act on what the agent discovers, even if it challenges how work is currently structured. Organisations that deploy agents on processes they refuse to change get the worst of both worlds.
|
✓
|
Leadership is open to outcomes that look different from what was planned |
▼ |
Agents will surface patterns and paths no one designed for. If leadership expects the output to match the original plan, those discoveries will be dismissed rather than acted on. The agent succeeds, but the value disappears.
|
✓
|
We have a mechanism to incorporate what the agent discovers back into how we work |
▼ |
Discovery without a feedback loop is wasted. The organisations getting compounding value from agents have a clear path from agent output to operational change. This is what turns a deployment into a learning system.
4. The more durable advantage is in the skill library.
Rather than programming every task inside of system prompts, it is worth investing in a properly governed library of discrete, reusable capabilities (like search, summarise, calculate, classify, etc.). This allows any agent to select and combine these reliable actions dynamically to reach their goals.
Skills built for one use case become reusable assets across the organisation. A search-and-retrieve capability built for a client briefing agent serves a risk screening agent and a proposal drafting agent. This is where the economics of agentic AI compound — making the skill layer the foundational investment to get right first.
One library, any number of agents
|
Skill library Search & retrieve
Summarise
Calculate
Classify
Generate draft
|
Any agent Client briefing
Risk screening
Proposal drafting
|
The same skill serves multiple agents across entirely different contexts, without reprogramming.
"A common pitfall: teams lock skills and knowledge inside system prompts — one monolithic prompt per agent, per use case. This creates silos. The knowledge can't be reused, the logic can't be shared, and every new agent starts from scratch. The result is a fragile, expensive architecture that doesn't scale."
The solution is modular skills: discrete, reusable capabilities that any agent can draw from, without reprogramming.
5. Safety relies on conscious choices between telling the agent what to do and having guardrails defining what it can't.
The freedom that makes agents powerful is exactly where the risk lives. Agentic failures are difficult to foresee, since an agent can take paths you didn't anticipate, combine actions in ways that weren't tested, and produce consequences that are hard to reverse.
Your control relies on one thing: the quality of your guardrails. These are the boundaries within which if the agent does something wrong, the consequence is acceptable (maybe creates accepted rework or delays). Mistakes can then become learning events that allows the system to correct, sharpen, and improve over time.
"We cannot solve guardrails by throwing more AI at it. We need deterministic, infrastructure-level boundaries."
The instinct is to solve this with more AI: a second model reviewing the first, a prompt telling the agent to be careful. These are soft guardrails, where a sufficiently complex action sequence can circumvent any instruction. The only boundary you can fully trust are set by hard guardrails — deterministic, set at infrastructure-level, and enforced before the model makes any decision.
Soft vs. hard guardrails
|
Soft guardrail At the prompt level An instruction telling the agent what not to do. Clear and easy to set up, but a sufficiently complex sequence of actions can work around it. Example "Please don't use tool X." Can be circumvented by complex action sequences. |
Hard guardrail At the architecture level A technical boundary enforced by infrastructure. The agent has no access to begin with — there is nothing to circumvent. Example "API access to tool X is blocked." Enforced by architecture — the agent simply cannot cross it. |
The technology is ready. The harder question is whether your organisation is.
This article was written by Bruna Di Bisceglie (Program Manager), Gerben Rijpkema (Data Scientist), and Niels Mateijsen (Data scientist) at Rewire.
Agents reward organisations that ask harder questions first.
The organisations getting compounding value from agentic AI share one trait: they invested in getting the foundations right before scaling.
We've built agent systems across financial services, energy, telecom, and public sector, from early proofs of concept to production systems running at scale. We've seen what separates the deployments that deliver from the ones that don't.