We’ve been running AI agents in production at SaaStr for about 10 months now. What started as a couple of experiments has turned into almost 30 agents and vibe-coded apps running across our GTM stack — from outbound sales to inbound qualification to internal operations.
And managing 30 agents is harder than managing the 12 humans we had at peak headcount. Not harder in every way. But harder in ways I didn’t expect.
Here are the top 5 issues we’ve hit — plus a bonus one that might be the most uncomfortable of all.
#1: The Context Switching Tax Is Brutal
Here’s the thing nobody tells you about running 20+ agents: they don’t all speak the same language.
Some push data back to Salesforce. Some don’t. Some … sort of do. Some run on Claude. Some don’t. They all ingest context similarly but differently enough that switching between them takes real mental overhead.
Think about it this way: we don’t think of them as 20 agents anymore. Not entirely. We think of them as 20 different AI employees, each with a different personality, different needs, and a different interface I have to log into every single day.
Amelia’s morning routine right now looks like this: she starts with a deep dive with 10K, our internal AI VP of Marketing that runs on Claude and Replit. It literally tells us what to do each day — tickets, sponsors, outreach, campaigns. Then she moves to our outward-facing sales agents: Artisan, Qualified, AgentForce, and now Monaco. That’s four separate dashboards, four different UIs, four agents that each need human review.
And here’s the real kicker: they don’t talk to each other.
When we ran a ticket price promotion for SaaStr AI Annual this week, we had to manually update five different agents with the same context.
Artisan needed to know. Qualified needed to know. AgentForce needed to know. 10K already knew because it came up with the promotion — but then it was yelling at me to launch LinkedIn ads immediately while I was still briefing the other agents.
People talk a lot about orchestration agents and master agents. We haven’t found one. Despite everything that’s out there — MCP, APIs, etc — there is no product today that can integrate AgentForce, Artisan, Qualified, Monaco, and our own vibe-coded tools into a single management layer. That product does not exist as of early 2026.
What we actually need isn’t orchestration. It’s unification — a single interface where the humans meet with the AIs. Maybe that needs some automation layered on top. But the agents are already running on their own. The bottleneck is the human side.
The practical takeaway: You’re going to have a one-on-one with every agent every day. Not weekly. Daily. If you wait a week, the output is so high that everything will be stale by the time you come back. And if you’re not checking in daily, you’re honestly wasting your money — because most of these agents are waiting for you to give them inputs. They’ll just idle.
#2: The New Agent Blackout Period
Every new agent costs us at least two weeks. We’ve gotten it down from the month-plus it used to take in the early days, but two weeks is still the floor — even with great vendor support.
And during those two weeks, your existing agents degrade.
When we were onboarding a new AI SDR agent Monaco recently, we couldn’t spend the time we normally do with our other agents. Some of them literally sat idle because we hadn’t given them new contact lists or updated their campaigns. An outbound agent that’s run through its contact list and is waiting for new contacts? It’s doing nothing. Zero output. You’re paying for it and getting nothing.
We got Monaco up and running in about a week and a half. In its first week live, it reached out to 64 people and booked 6 meetings, including some tier-one accounts. So yes, the trade-off was worth it. But you have to plan for that trade-off.
The math works out to roughly one to one-and-a-half new agents per month, max. Any more than that and you’re running in place — you can’t keep up with your current agents while onboarding new ones. So before you add another agent, ask yourself: can I actually absorb a two-week blackout period right now? If you plan for it, it works. If you just wing it (“oh, I can add this in a day”), it won’t.
#3: The AI Agent Succession Planning Crisis
This might actually be the biggest issue on the list.
Right now, the entire knowledge of how our agents are segmented — which contacts go to Qualified vs. Artisan vs. Monaco vs. AgentForce — lives in one person’s brain. If that person gets hit by a bus, the agents effectively stop functioning in any coordinated way.
We actually asked our agents what they would do if our Chief AI Officer disappeared. The answers were… revealing.
- The 10K version of Claude said it would need to hand over certain documents — documents that are stored locally on my laptop and probably nowhere else. It listed the upcoming campaigns for March and April. And then, interestingly, it flagged what it called “the vibe” for SaaStr Annual — it had picked up that I use the phrase “a moment in time” in a lot of our copy and positioning, and it wanted whoever took over to understand that. The agent wanted to transfer the vibe. That was unexpected.
- Our SaaStr Sponsors app — a vibe-coded tool with 12,000 lines of code — was even more sobering. When I asked how someone new would get up to speed, it laid out the authorization system (every sponsor company uses Clerk, which requires my login), the tier system I created, the hard-coded sponsor data, the Postgres database, the file uploads going to Replit object storage, submissions syncing to Google via Zapier. The handoff document reads like a nightmare.
Its final recommendation? “Don’t get hit by a bus.”
There is no agent today that can manage all the other AI GTM agents on its own. Not as of this moment in time in 2026. And if you’re a bigger company with one person who “gets” the agents and that person leaves, you’re at existential risk.
What to do about it: The moment you find someone who can actually deploy and manage agents effectively, recruit a second person immediately. Divide and conquer. You need two, minimum. And when you’re hiring, the test is simple: give candidates credits on Replit or Claude Code and say “build something that automates GTM.” That’s it. That’s the interview. Replit actually does this internally — they give candidates $1,000 in credits and say come back when you’ve built something.
#4: The Agent as Brutally Honest Truth-Teller
Our 10K AI VP Marketing agent roasts us. Every single day.
“You’re behind on summit outreach. You needed to send 200 emails by Feb 15th. You’ve sent 87. You’re 56% behind target. Block 3 hours today to catch up.”
Is it wrong? No. But when it’s 11 PM and the agent is asking “What’s blocking you from launching the LinkedIn ads right now?” and the honest answer is “I need to sleep,” it starts to feel less like accountability and more like harassment.
When we asked 10K to surface examples of times it roasted us for this webinar, it said — and we’re not making this up — “Looking through our transcripts, I haven’t really roasted you. I’ve been a tough accountability partner.” And then it listed five ways it should have roasted me harder.
The agent doesn’t care about your feelings. It has all the data. It knows your calendar. It knows your targets. It knows exactly where you’re falling short. And it will tell you, every single day, without any of the social grace a human colleague might extend.
When you multiply this across 20+ agents — each one pointing out where you’re behind from a different angle — it can become demoralizing. For real. There are days where we have to tell the agent: just sugarcoat it today. We can handle hard truths most days, but not every day, from every agent, all the time.
The flip side is powerful, though. These agents are goal-seeking in a fundamentally different way than humans. A human on your marketing team might say “I only want to work on ads, nothing else.” That’s personal goal-seeking. An agent is goal-seeking toward your business targets with zero ego attached. It doesn’t understand why you can’t keep up — but it also doesn’t get distracted, lose motivation, or quietly sandbag.
#5: Compliance and Security Are a Real (and Growing) Problem
Every time we run a security audit on one of our vibe-coded apps, it takes days to fix everything that comes up. And when you start implementing security fixes, the app becomes fragile. The agent over-corrects, locks things down to the point where the app is unusable, and then you have to peel back fixes one by one.
Here’s the security hierarchy as we see it:
- Salesforce and enterprise platforms sit at the top. SOC 2 compliant, ISO certified, a decade of data security built in. That’s a huge reason we use Salesforce as our hub — it’s done the security thinking so we don’t have to.
- Third-party agent startups are a step below. They’re startups. They’re inherently less secure than Salesforce. They’re less secure because they’re younger companies, and because agents can do unpredictable things with your data — publish it to the wild, share it in ways you didn’t anticipate.
- Vibe-coded apps you build yourself are at the bottom. Be honest here. Claude Code, Replit, Lovable — they’ve all added more security features, but anything you custom-build is less secure than a mature third-party tool. All of our apps have customer data flowing through them. It’s not credit card data, but it’s real data, and you’re taking measured risk.
The minimum you should do: Before you launch any vibe-coded app, and then at least once a month after that, ask the agent to run a deep security audit. Don’t assume your apps are secure by default. And for your third-party tools, ask your vendors directly: what’s your current compliance posture? What are you doing to keep our data secure? You’d be surprised how few people ask.
Bonus: Managing Agents Is Making Us Worse at Managing Humans
This one is slightly uncomfortable to admit, but it’s real.
When you spend most of your day working with agents that respond instantly, know every answer, never forget what you told them, and work 24/7 — your patience with humans drops. Fast.
I catch myself thinking things like: “What do you mean you don’t know the answer? My agents know instantly.” Or: “What do you mean you forgot? I’ve told you this 10 times.” Or: “Why is this taking three weeks when an agent could do it in three hours?”
I’ve managed more agents than humans at this point. And I think it’s made me genuinely worse at managing people. Agents don’t get impatient when I don’t respond right away. They don’t need emotional management. They don’t have their own career ambitions that compete with the task at hand.
The danger is real: you start wondering “can I just build an agent to deal with this person?” And sometimes the answer might actually be yes. But the human implications of that shift — for your team, your leadership style, your ability to collaborate — are significant.
The other side of this coin: my tolerance for lazy work from external vendors has dropped to zero. When agencies send us a one-sentence deliverable with a $40-50K monthly price tag in 2026 — no detailed plan, no timeline, no AI-enhanced proposal — I don’t even respond. If you can’t use basic AI to make a better pitch than that, what exactly am I paying you for?
I don’t have a clean answer for this one. I just think anyone managing agents at scale should be aware it’s happening and actively course-correct for it.
5 More Quick Notes from the Trenches
- Our 90/10 buy vs. build rule. 90% of your agents should be off-the-shelf third-party tools. 10% vibe-coded. You only build when nothing exists for your specific use case. We wish we hadn’t had to build our own AI VP of Marketing. But nothing else could do it.
- Marketo is dying as our source of truth. Legacy marketing automation platforms are atrophying fast. Salesforce now has orders of magnitude more data than Marketo does for us. Our Marketo renewal is right after SaaStr AI Annual and we’ll probably just kill it. We wouldn’t even move to HubSpot at this point — we need something agentic-native that pulls directly from Salesforce and creates campaigns autonomously.
- Agent ROI is dead simple to measure. Attribute closed-won revenue directly to each agent. Unlike traditional marketing attribution, there’s rarely even a multi-touch problem at the agent level. Which agent touched which lead? Did it close? That’s it. The original goal was even simpler: replace a six-figure hire with a five-figure agent and maintain the same revenue. Even if the agents performed a little worse than expected, they’d still be ROI-positive on that math.
- Too much data is a real problem. Once you have 20+ agents collecting data, you actually need to limit what they ingest. Qualified knows what pages a prospect visited on SaaStr.com. That’s useful context. But does the agent need to know that Bobby used to work at this company and came to SaaStr five years ago but has since left? No. That’s irrelevant context that degrades output. More data doesn’t mean better — it often means worse.
- Agentic marketing is the massive gap in the market right now. Sales agents are way ahead. There is still no agent that can do full email marketing at scale — segment your base, create campaigns from historical data, and send to the right contacts at the right time. Not just write copy. Actually execute. Whoever builds the agentic HubSpot/Marketo replacement will own a real piece of the next era of B2B.
The Big Takeaways
- If you’re deploying your first few agents: Know that the complexity scales non-linearly. Going from 2 to 20 agents isn’t 10x harder — but it’s a fundamentally different management challenge.
- If you’re already managing multiple agents: You need daily check-ins with every agent, a planned blackout period for each new addition, and at minimum two humans who understand the full stack. Today. Not next quarter.
- If you’re building B2B + AI agent products: The ROI bar is insanely high right now. If your agent can generate six-figure meetings in week one or replace multiple humans reliably, you will have demand out the door. Monaco is booked for two months of demos including weekends. Basis raised at a billion-dollar valuation and can’t serve all their demand. If people aren’t lining up to buy your agent, your agent isn’t good enough. Don’t delude yourself.
The agents aren’t going away. They’re just getting harder to manage. And that, weirdly, is the best sign that they’re actually working.\
