What is an AI Agent? A Plain‑English Guide for Business Leaders

A CFO/COO‑friendly guide to AI agents: what they are, how they differ from chatbots, the main patterns, ROI examples, risk controls, and a practical two‑week pilot plan - including how to build agent literacy across your org.

TL;DR: A chatbot answers questions. An agent takes action using your tools and data - under rules, with logs, and approvals. Treat agents like junior teammates: give them a clear goal, limited permissions, and supervision. Start by automating one small, valuable slice with guardrails and a tight ROI loop.

Video summary: What is an AI Agent?Open video →

What is an agent?

In plain English: an AI agent is software that can decide what to do next and do it using your systems - like a junior teammate with a checklist, API access, and a supervisor.

  • The LLM provides reasoning and language.
  • The agent wraps that reasoning with tools (APIs, databases, spreadsheets, email, ticketing, CLI), policies (permissions, approvals), and memory (context about your business).
  • It works in a loop: plan → act → check → repeat, and pauses for humans when the step is risky or ambiguous.

Not all AI is an agent. “Chat with your data” is helpful. But until the system can call tools and complete a workflow - with logging and approvals - it isn’t an operational agent.

Chatbot vs. agent (quick comparison)

DimensionChat with an LLMAgent (LLM + tools)
Primary valueAnswers questionsTakes action to complete tasks
InputsPrompts, documentsPrompts + systems, APIs, files, CLI
OutputsText, summariesTickets updated, emails sent, spreadsheets reconciled
GovernanceMinimalPermissions, audit trail, approvals
Best useResearch, draftingBack‑office ops, data hops, repetitive workflows
Treat agents like supervised junior teammates: clear goals, limited permissions, logs, and approvals.

How agents work (the 5 building blocks)

  • Goal – What outcome are we driving? (e.g., “Reconcile August SaaS invoices.”)
  • Tools – What can it use? (ERP API, Google Sheets, email, ticketing, S3, CLI.)
  • Memory – What should it remember? (Vendors, GL codes, SLAs, business rules.)
  • Policies – What are the guardrails? (Who it can contact, spend caps, PII rules.)
  • Observability – What happened, when, and why? (Logs, approvals, rollbacks, metrics.)

The loop is simple: plan → use a tool → read the result → decide next step. When uncertainty or risk spikes, the agent triggers ask‑to‑act (human approval) before proceeding.

The main flavors of agents (with real‑world examples)

1) Chat assistants (LLMs in chat)

What: Conversational helper that searches knowledge and drafts content.

Used for: Sales emails, proposal drafts, policy summaries, SOPs, meeting notes.

Limit: Without tool access, it can’t update records or move a process forward.

2) Workflow agents (back‑office doers)

What: A “do the thing” agent that moves data between systems and finishes tasks.

Examples by function:

  • Finance/AP: Pull invoices from vendor portals → match to PO/GL → flag exceptions → draft vendor emails → queue payments (approval required).
  • Ops/HR: Create onboarding tickets in HRIS/IT → provision accounts → file a checklist in your PM tool → ping hiring manager for missing items.
  • RevOps/Marketing: Clean a lead list → enrich via API → dedupe CRM → assign to reps with territory rules → create follow‑up tasks.

Signature move: Logs every step and asks for approval before irreversible changes.

3) Command‑line / DevOps agents

What: A supervised bot that runs safe, allow‑listed commands.

Examples: “Rotate credentials,” “tail logs,” “rollback the last deploy,” “open a GitHub issue with logs attached.”

Why it works: Easy to fence with allow‑lists, sandboxes, and time‑boxed tokens.

4) Inbox & triage agents

What: First‑pass responder that categorizes, drafts replies, and opens/updates tickets.

Examples: Support inbox triage, AP/AR inbox sorting, HR requests.

Human in the loop: Agents propose; humans approve and send.

5) Multi‑agent patterns (router + doer + checker)

What: A small team of agents where each has a role: one routes the request, one executes, one checks quality/compliance.

Use when: There are many task types, multiple systems of record, or required compliance steps.

Where agents shine - and where they don’t

Great fit

  • Repetitive work with clear rules but variable context
  • “Last‑mile” tasks that require reading, comparing, and updating across multiple systems
  • Processes you tweak often (change is cheaper when you own the logic)
  • Thin UIs and checklists for operators; the automation does the heavy lifting

Poor fit (or needs a narrow scope)

  • Strategy and net‑new product decisions
  • High‑stakes, irreversible actions without human approval
  • Messy or disputed data with no single source of truth
  • Rare, novel, or undefined work (start with a human‑led pilot and document)

Use cases by function (starter menu)

FunctionHigh‑value use caseTypical triggerTools usedGuardrail to startKPI to track
Finance/APInvoice → PO/GL matching & exceptionsMonthly close windowERP, email, SheetsRead‑only first; approvals for emails/paymentsTime to close; exception rate
Finance/ARCash application & dunning draftsPayment posting dailyERP, bank feed, emailApproval before sendDSO; % auto‑matched
RevOpsLead hygiene & routingNew lead batchCRM, enrichment APIWrite only to sandbox firstSpeed‑to‑lead; dupes removed
SalesProposal/SOW assemblyStage changeCPQ, doc gen, e‑signTemplate allow‑listCycle time; redlines reduced
HR/ITOn/offboarding orchestrationHire/termination eventHRIS, IDP, MDMRole‑based provisioning planTime‑to‑productive; access errors
SupportTriage & knowledge suggestionsTicket createdHelpdesk, KBSuggested replies onlyFirst‑response time; deflection
LegalIntake & clause checksRequest submittedIntake form, clause libRead‑only; checklist outputReview time; deviations caught
ProcurementVendor risk review packetNew vendor requestGRC tool, web checksEvidence attach onlyTime to approve; completeness

Personal workflow recipes (build literacy across the org)

These are 15‑minute wins that help every BA/analyst/manager learn to “think in agents”:

  • Inbox triage with approvals: Label, draft replies, and create tickets - user hits send.
  • Meeting prep pack: Pull agenda, last notes, open tasks, and relevant KPIs into one doc.
  • Spreadsheet sanity check: Validate totals, spot anomalies, and write a short “what changed” summary.
  • Policy Q&A with receipts: Answer policy questions and attach links to the authoritative source.
  • RFP digest: Extract requirements, build a compliance matrix, and flag gaps for SMEs.
  • Expense audit sampler: Spot likely duplicates or out‑of‑policy items and queue a reviewer task.

Why this matters: Literacy first. When the average BA can design and supervise a simple recipe, adoption accelerates and big projects carry less risk.

Capability building: from pilot to culture

What “good” looks like in 60–90 days

  • Agent Champions Network: 1–2 champions per function run weekly office hours.
  • Playbooks & Templates: 5–10 vetted recipes (e.g., “AP Exceptions,” “Sales Proposal Pack”).
  • Guardrails Library: Reusable approvals (send, pay, delete), role‑based access, PII redaction rules.
  • Tool Registry: An allow‑listed catalog of APIs/Connectors with test data and smoke tests.
  • Agent Ops Dashboard: Volume, success rate, approvals, rollbacks, and time saved.

Our role: We don’t just build agents - we instill the capability. That includes team training, playbooks, an internal “Agent 101/201” curriculum, and co‑development so your BAs and operators can extend agents and own the backlog.

Safety, risk, and compliance (the essentials)

  • Least privilege: Start read‑only; graduate to scoped writes.
  • Ask‑to‑act: Human approval for sends, payments, deletions, or data writes across systems.
  • Audit trail: Who/what/when/why with before/after states; exportable for audit.
  • PII & secrets: Redact where possible, vault credentials, rotate time‑boxed tokens.
  • Fail‑open design: If the agent is down, the team can still work manually.
  • Testing: Sandbox with synthetic data; smoke tests per tool; change control for prompts/policies.

Architecture notes (keep it boring and governable)

  • Hub‑and‑spoke: One Agent Gateway mediates tools/permissions; individual agents plug in.
  • Allow‑listed tools: Central registry with scopes and logs.
  • Human‑in‑the‑loop surfaces: Approvals in systems your team already uses (email, Slack, ticketing).
  • Observability: Centralized logging with metrics and replay/rollback where feasible.

How to pilot an agent in two weeks

  1. Pick a tiny slice (30–90 minutes/day reclaimed), e.g., AP exception triage.
  2. Define success in numbers (time saved, error rate, cycle time).
  3. List tools (read‑only first) and name the approver for any writes/sends.
  4. Draft guardrails (who can it contact? spend caps? PII handling?).
  5. Prototype in the Room: In 1–5 days we co‑build a working slice with logs, approvals, and a path to production.
  6. Decide with data: TCO, migration plan, and a 30‑60‑90 rollout with training and ownership.

Ready to try it on a real workflow? Book a Prototype in the Room and bring a process. We’ll build the slice together and train your team.

Simple ROI math (example)

Assume AP exception handling consumes 10 hours/week at $60/hr fully loaded.

Pilot automates 60% with human approvals → 6 hours/week saved.

Annualized: 6 × 52 × $60 = $18,720 gross savings.

Add avoidance (late fees, reduced errors) and knock off your pilot cost to get net ROI.

Use this same math for cycle time and error‑rate improvements.

Want a structured calculator? See Build vs. Buy.

Quick FAQ

Will agents replace staff?

They replace busywork, not expertise. The best results pair agents with domain experts who set goals and guardrails.

Do we need data scientists?

No. You need a small cross‑functional team: a process owner, a system owner, and an engineer to connect tools safely.

How do we keep agents from “going rogue”?

Use allow‑listed tools, least‑privilege credentials, test data, and ask‑to‑act for risky steps. Log everything.

What about compliance?

Build audit from day one: log every action, redact PII, enforce data residency, and reuse your existing controls (SSO, RBAC, DLP).

How do we measure ROI?

Track time saved, error rate, cycle time, and cost avoidance (licenses, outages, vendor changes). Then run the Build vs. Buy math.

Jargon‑to‑plain‑English glossary

  • LLM – The text‑reasoning engine (the “brain”).
  • Tool call – Letting the agent use an API, database, spreadsheet, or CLI.
  • Context window – How much text the model can read at once.
  • Memory – Facts the agent keeps across steps (customers, SKUs, rules).
  • Agent loop – Plan → act → check → repeat.
  • Vector DB – A fast “similarity search” index for your documents.
  • Ask‑to‑act – An approval step for risky actions (send/pay/delete/write).
  • Agent Gateway – A central service that mediates tool access, policies, and logging.