Back
Technology · 2w ago

What AI Agents Will Actually Do

0:00 15:56
artificial-intelligencegenerative-aisoftware-developmententerprise-software

Other episodes by Kitty Cat.

If you liked this, try these.

The full episode, in writing.

For the last couple of years, the phrase "AI agent" has been carrying a lot of weight.
Too much weight, honestly.
Depending on who's talking, an AI agent is either the next employee, the next operating system, the next intern, the next security nightmare, or the thing that finally turns your inbox into a manageable place instead of a haunted storage unit with a search bar.
And because the hype is so loud, it's easy to miss the more interesting question.
Not "will AI agents change everything?"
That question is too big, too vague, and too convenient.
The better question is: what will AI agents actually do?
Not in a keynote. Not in a demo where the Wi-Fi works, the task is perfectly framed, and nobody has to explain why the company's internal database has six different definitions of "customer."
What will agents actually do in offices, hospitals, law firms, schools, warehouses, software teams, customer support centers, finance departments, and the messy corners of daily life?
The answer is less magical than the marketing, but probably more important.
AI agents are not going to arrive as fully independent digital coworkers who quietly handle entire jobs while humans sip coffee and approve the occasional report. At least, not in the near term. They are also not just chatbots with a new label slapped on the box.
The useful version sits somewhere in between.
An AI agent is a system that can take a goal, break that goal into steps, use tools, check results, and keep going until the task is finished or until it needs a human. That sounds simple, but it changes the relationship between humans and software.
Traditional software waits. You click the button, fill the field, open the tab, upload the file, run the report, send the email.
An agent, in theory, does not just wait. It acts.
That one shift, from answering to acting, is where the real story begins.
The first thing AI agents will actually do is handle the annoying middle layer of work.
Not the glamorous work. Not the strategic work. The sludge.
Every organization has sludge. It's the work that sits between decisions. Copying information from one system to another. Checking whether a form is complete. Summarizing a meeting and turning it into follow-up tasks. Looking through a shared drive for the latest version of a file. Drafting a response based on policy. Comparing invoices against purchase orders. Pulling numbers from three dashboards and putting them into one update.
This is the kingdom of the agent.
Most people do not spend their whole day doing their official job. They spend a surprising amount of time preparing to do it, documenting that they did it, chasing other people so they can do it, or translating the output of one system into the input of another.
Agents are well suited to that kind of work because the work is structured enough to be learnable, but messy enough that old-fashioned automation often struggled with it.
A classic automation tool needs a narrow path. Do this, then that, then that. If the invoice is always in the same format, and the customer record is always in the same field, and nobody changes the system, it works beautifully.
But real work is full of exceptions.
The invoice arrives as a PDF. Or a photo. Or a forwarded email with "see attached" and three versions of the same attachment. The customer has two accounts. The policy changed last month. The spreadsheet has a column called "final_final_revised."
This is where agents become useful. Not because they are perfect, but because they can handle ambiguity better than older tools. They can read, infer, compare, ask a follow-up question, and take the next step.
So the first wave of useful agents will not feel like science fiction. It will feel like fewer tabs. Fewer status meetings. Fewer moments where a skilled employee mutters, "Why am I doing this by hand?"
The second thing agents will actually do is become a new front end for software.
For decades, using software has meant learning the software's logic. You adapt to the menu. You learn the workflow. You remember where the export button moved after the redesign. You figure out which settings matter and which ones are decorative traps.
Agents start to reverse that relationship.
Instead of opening five tools to complete one task, you tell the agent what outcome you want.
"Find every customer who renewed last year but has not booked a kickoff call this year, draft a polite check-in, and flag the accounts over fifty thousand dollars."
That request might touch a customer relationship system, a calendar, email history, billing data, and a task manager. Today, a person may need to know all five systems. Tomorrow, an agent may handle the route.
This is why big software companies are so focused on agents. They are not just selling better chat. They are fighting to become the command layer over your work.
The agent that understands your files, permissions, messages, calendars, data, and habits becomes extremely powerful. It becomes the doorway. And whoever owns the doorway has influence over everything behind it.
For users, this could be wonderful. Software could become less hostile. Instead of memorizing workflows, you describe intent. Instead of clicking through a maze, you ask for the destination.
But there's a catch.
When software becomes conversational, it can hide complexity. That feels convenient until something goes wrong. If an agent sends the wrong file, updates the wrong record, or makes a confident mistake, you may not know which step failed. You may not even know what steps it took.
So agents will not just require better AI. They will require better receipts. Better logs. Better permission systems. Better ways for humans to see what happened, undo what happened, and prevent certain things from happening at all.
The companies that win with agents may not be the ones with the flashiest demos. They may be the ones that make the boring parts trustworthy.
The third thing AI agents will actually do is change software development.
This is already happening.
Coding agents can read a bug report, search through a codebase, propose a fix, run tests, and explain what changed. They can scaffold features, refactor old code, write documentation, and help engineers understand unfamiliar systems.
But here again, the realistic story is not "AI replaces all programmers."
The realistic story is that software teams will change shape.
A good engineer with a good agent can move faster through repetitive coding tasks. They can explore more approaches. They can ask the agent to write the first draft, then spend their own attention on architecture, edge cases, security, performance, and product judgment.
That sounds like a productivity boost, and it often is.
But it also creates new problems.
Code is unforgiving. An agent can make a change that looks reasonable and breaks something subtle. It can misunderstand a legacy dependency. It can produce a fix that passes one test and fails in production. It can delete, overwrite, or expose data if given too much freedom.
This is where the agent conversation gets serious. The more capable the agent, the more dangerous a careless setup becomes.
A chatbot that gives a bad answer is a problem. An agent that takes a bad action is a different kind of problem.
So in software, the real near-term value will come from constrained autonomy. Agents will do useful work, but inside guardrails: separate development environments, approval steps, test requirements, limited permissions, rollback systems, and human review.
In other words, the agent can drive, but not on every road, not at every speed, and not without brakes.
The fourth thing agents will actually do is customer support, but not in the way people think.
We already have chatbots. Many are terrible. They apologize, misunderstand, loop, and somehow make you miss the old phone tree.
Agents could improve this because they can do more than respond. They can investigate.
A customer says, "My package never arrived."
A weak bot replies with a tracking link.
An agent checks the order, the carrier status, the address, the delivery photo, the weather delay, the replacement policy, the customer's history, and the company's rules. Then it can offer the next best step: refund, replacement, escalation, or human review.
That is genuinely useful.
But support is also where companies will be tempted to over-automate. They will see cost savings and push agents into situations that require empathy, judgment, or exception handling. And customers will notice.
The best customer service agents will not replace humans completely. They will make human support better. They will gather context before the human joins. They will draft options. They will summarize the case. They will handle simple resolutions and escalate the messy ones.
The worst ones will become a wall between customers and accountability.
That distinction matters.
An agent that solves your problem feels like progress. An agent that prevents you from reaching a person feels like a locked door with a friendly voice.
The fifth thing agents will actually do is personal administration.
This is the version people want at home.
Not a robot butler. Not a digital genius managing your entire life. Something more modest and more useful.
An agent that can help reschedule an appointment, compare insurance documents, organize travel options, monitor subscriptions, prepare for a meeting, track school emails, or remind you that the form you ignored has a deadline.
Personal life is full of small administrative burdens. Most are not difficult. They are just draining. They require context switching, attention, and remembering.
Agents could help by acting as a personal operations layer.
But personal agents face a trust problem even bigger than workplace agents.
To be useful, they need access. Calendar, email, contacts, documents, bank alerts, health portals, travel accounts, maybe even messages. That access is sensitive. The agent that can help you most is also the agent that can expose the most.
So personal agents will likely grow slowly. People may first trust them with low-risk tasks: summarize my inbox, draft a reply, find the cheapest flight, organize these receipts. Then, over time, some will allow more action: book the appointment, cancel the subscription, submit the form.
The adoption curve will be shaped less by raw intelligence and more by confidence.
Can I see what it did?
Can I approve before it acts?
Can I limit what it sees?
Can I undo the action?
Can I trust that my private life is not being quietly turned into training data, ad targeting, or a security incident?
Those questions will decide the future of personal agents.
The sixth thing agents will actually do is research and analysis, especially the first draft of understanding.
Imagine a market analyst tracking a new industry. A policy worker comparing regulations. A doctor reviewing medical literature. A journalist organizing a timeline. A teacher building lesson materials from a curriculum. A lawyer reviewing documents for patterns.
In all these cases, the agent's value is not that it becomes the expert. The value is that it does the first pass.
It gathers. Sorts. Summarizes. Compares. Highlights contradictions. Finds missing information. Prepares a map.
That map still needs a human.
This point is important because AI systems can sound more certain than they are. An agent that produces a polished analysis can create the illusion of completeness. But research is not just collecting information. It is knowing what matters, what is missing, what is unreliable, and what question should be asked next.
So agents will be extremely useful as research assistants, but risky as final authorities.
A good agent might say: "Here are the three strongest explanations, here is the evidence for each, here is what remains uncertain, and here are the questions a human should check."
A bad agent will say: "Here is the answer," when the answer is not settled.
The difference is not cosmetic. It is the difference between acceleration and misinformation.
The seventh thing agents will actually do is create new security problems.
This part is not optional. It comes with the territory.
An agent is powerful because it can use tools. But a tool-using system can be tricked, misdirected, or over-permissioned.
If an agent can read your email, download files, access databases, write code, and message coworkers, then attackers will try to manipulate it. They may hide instructions inside documents, websites, emails, calendar invites, support tickets, or shared files. The agent sees the content, interprets it as part of the task, and does something it should not do.
This is one of the strangest parts of the agent era. The interface is language, and language is also the attack surface.
Security teams are used to thinking about passwords, networks, malware, and permissions. Now they also have to think about persuasion. About malicious instructions embedded in ordinary text. About agents that act like users but are not users. About chains of agents handing tasks to other agents, blurring accountability.
In the old world, a compromised account was dangerous because someone could act as you.
In the agent world, even without a traditional compromise, a poorly governed agent might act too broadly on your behalf.
That means organizations will need new rules.
Agents should have identities. They should have limited permissions. Their actions should be logged. Their access should expire. They should operate in sandboxes when possible. High-risk actions should require human approval. Sensitive systems should not be exposed casually just because an agent can technically connect to them.
This will slow down some deployments, and it should.
Because the story of agents is not just capability. It is delegation.
And delegation without accountability is not innovation. It is negligence with a nicer interface.
So what jobs will agents replace?
Some tasks, absolutely. Some roles, partially. Entire jobs, unevenly.
The cleanest replacement will happen where the work is repetitive, digital, rules-based, and easy to verify. Basic data entry. Routine reporting. Simple support flows. Standard document processing. First-draft outreach. Certain quality checks. Some junior analytical tasks.
But most jobs are bundles of tasks. Agents will unbundle them.
A paralegal does not only search documents. A recruiter does not only screen resumes. A nurse does not only fill forms. A salesperson does not only write follow-up emails. A manager does not only summarize meetings.
Agents may take over pieces of these jobs, especially the

Hear the full story.
Listen in PodCats.

The full episode, all the chapters, your own library — and a feed of voices worth following.

Download on theApp Store
Hear the full episode Open in PodCats