High inference costs are OK—if they make your product so viral and so competitive it almost sells itself


Here’s the counterintuitive insight that’s reshaping how the smartest AI founders think about unit economics:

Your inference costs aren’t your gross margin problem. They’re your CAC replacement.  If … You Have a Product That Can Truly Deliver with AI.

The companies growing fastest right now—Cursor crossing $1B ARR with ~300 employees and no traditional marketing, Lovable and Replit both hitting ~$300M ARR with zero paid acquisition, Harvey and Legora in legal, OpenEvidence in healthcare—aren’t sweating inference costs. They’re optimizing them for sure, but they are also leaning into them. They’re treating compute as their primary growth investment, not their primary margin drag.

This is a fundamental reframe. And if you’re still optimizing for gross margin while your AI-native competitors are optimizing for virality, you’re playing the wrong game.  The 2022-2024 game.

 

You Can’t Do Both Well — Spend a Ton on Sales & Marketing AND a Ton on Inference

On a recent 20VC x SaaStr episode, we discussed Anthropic’s inference costs coming in 23% higher than expected. My immediate reaction was pessimistic for old school, pre-AI, mid-market B2B SaaS:

“I worry this is the final nail in the coffin. You did everything right—got profitable, built an agent—and now you just can’t afford the inference to compete.”

Here’s the scenario: You’re a $50M ARR B2B company. You built the agent your board demanded. Your agent costs $2.50 per interaction. You need 50 million interactions to stay competitive. That’s $125 million in inference costs on $50M in revenue.

Game over, right?

Potentially — if you don’t radically change. The question isn’t whether you can afford the inference. It’s whether the inference makes your product so good that sales and marketing become irrelevant.

The Cursor Playbook: Inference as Distribution

Cursor crossed $1B ARR by late 2025—roughly 24 months from launch—with about 300 employees and minimal traditional marketing. They went from $100M ARR in January 2025 to $500M by June to $1B+ by November. The fastest SaaS growth curve ever recorded.

How? They spent aggressively on inference to create what Andrej Karpathy called the “vibe coding” experience—the moment when developers forget they’re writing code and just describe what they want. That experience is computationally expensive. It requires reasoning tokens, multiple model calls, context management across entire codebases.

Traditional SaaS math would call this margin suicide. But here’s what actually happened:

  1. The “wow moment” converted instantly. Developers tried Cursor, experienced something magical, and became evangelists within hours.
  2. User-generated content became their entire marketing funnel. Every tweet about “I built an app in a day with Cursor” was free distribution worth thousands in CAC.
  3. The viral loop compounded. Engineers at OpenAI, Midjourney, Shopify, and Instacart started spreading it organically. No sales team required.
  4. Conversion was frictionless. $20/month is an impulse buy when the product makes you demonstrably faster.

The inference spend wasn’t a cost center. It was the marketing budget. It just showed up on a different line item.

Lovable’s and Replit’s Rocket Ships to ~$300M

Lovable hit $300M ARR in January 2026—roughly 14 months after launch—with fewer than 200 employees and zero paid acquisition. That’s still $1.5M+ revenue per employee, nearly 8x the industry benchmark.

Their secret? They engineered virality into the product itself. When users build apps with Lovable, the outputs are shareable. The AI-generated code is good enough that users want to show it off. Every app becomes a piece of marketing collateral.

The underlying inference cost to generate these apps is significant. But look at what they avoided:

  • No enterprise sales team (zero)
  • No paid acquisition (zero)
  • No SDRs cold-calling (zero)
  • No expensive conference sponsorships (zero)

The inference is the go-to-market motion. The product is the marketing.

You Can’t Have It Both Ways.  At Least Not For Very Long.

Here’s the brutal math that too many founders are ignoring:

You can’t have high inference costs AND high sales & marketing costs. At least not for long. It has to come from somewhere.

Traditional SaaS could absorb 40-50% S&M spend because gross margins were 80%+. There was room. The unit economics worked.

But when your gross margin drops to 50-60% because of inference costs, that room disappears. You’re now choosing between two paths:

Path A: Inference-First (Cursor, Lovable/Replit, Harvey/Legora)

  • Gross margin: 50-60%
  • S&M: <10%
  • Growth driver: Product virality
  • Requires: Magical product that sells itself

Path B: Sales-First (Traditional Enterprise SaaS)

  • Gross margin: 75-80%
  • S&M: 40-50%
  • Growth driver: Sales efficiency
  • Requires: Lower inference costs, less AI magic

What you cannot do is run 50% gross margins AND 40% S&M. That’s negative operating margin before you pay a single engineer. That’s burning cash with no path to profitability. That’s a company that dies.

The trap I see founders falling into: they build an AI product with significant inference costs, then layer a traditional enterprise sales motion on top of it because “that’s how you sell to enterprises.” Now they’re paying for compute AND paying for a sales team AND paying for marketing—and wondering why their burn rate is out of control.

Pick a lane. Either your inference spend IS your sales & marketing (because the product is so good it creates its own distribution), or you optimize inference costs aggressively and invest in traditional go-to-market.

The companies trying to do both are the ones that will run out of runway.

The Third Path: Build Something So Valuable They’ll Pay $50-100K+ (More)

There’s another way to solve the inference cost problem that doesn’t require virality or zero sales spend.

Build an AI agent so valuable that businesses will gladly pay $50,000 to $100,000+ for it. Even small businesses.  This is what we are seeing across the AI Agent stack we use at SaaStr itself.

I talked to a sales leader at an agentic AI company recently. I asked him how many figures are in his mid-market deals. I was thinking $20K to $50K, maybe $75K on the high end.

He said they’re all seven figures.

Seven figures. For mid-market.

When you price based on labor replacement instead of software seats, the math completely changes. You’re not competing with Salesforce’s $150/seat. You’re competing with a $120K SDR salary, a $180K AE salary, a $95K customer support rep.

The current market rate for agentic GTM tools: $100K+ per agent as a starting point. Some companies are paying $50-70K just to get started, plus another $25K for a forward-deployed engineer to set things up.

And here’s the thing: smaller businesses will pay this too if the ROI is obvious enough.

Think about it from the buyer’s perspective:

  • Human SDR loaded cost: $100K-$130K annually
  • AI SDR: $10K-$15K for basic, or $50K-$100K for enterprise-grade with support
  • Even at $100K, that’s still cheaper than a human—and the AI never has a bad day, never forgets the product, never needs ramp time

At $50K-$100K per agent, your inference costs become a rounding error. You’re not optimizing tokens. You’re delivering ROI measured in weeks, not quarters.

This is the premium escape hatch from the inference cost trap: Don’t try to win on efficiency. Win on value so overwhelming that price becomes irrelevant.

The Inference Squeeze on Mid-Sized B2B

Here’s where it gets brutal for traditional SaaS.

If you’re a $50-100M ARR company that spent years building a sales org, optimizing your marketing funnel, and perfecting your enterprise motion—you now face a new type of competitor. One that has no or a very small sales team to pay, treats inference as their primary distribution channel, and can invest every marginal dollar into making the product more magical.

You can’t out-spend them on ads because they don’t run ads. You can’t out-execute them on sales because they don’t have salespeople to compete against. Your moat was go-to-market efficiency. Their moat is product virality powered by unlimited compute.

The existential question becomes: Can you afford NOT to spend on inference?

The Framework: Inference as CAC

Here’s how to think about this:

Traditional SaaS Model:

  • Gross margin: 75-80%
  • S&M spend: 30-50% of revenue
  • CAC payback: 12-24 months
  • Growth driver: Sales & marketing efficiency

AI-Native Model:

  • Gross margin: 40-60% (temporarily)
  • S&M spend: <10% of revenue
  • CAC payback: Near-instant (product is the acquisition)
  • Growth driver: Product virality via inference quality

The question isn’t “What’s your gross margin?” The question is “What’s your blended customer acquisition cost when you include inference spend that drives organic growth?”

If spending an extra $1M on inference generates 10x that in organic revenue—because your product creates its own distribution—you should spend that million dollars every time. Even if it temporarily crushes your gross margin.

The New Competitive Moat

The winners in the AI era won’t be the companies with the best gross margins. They’ll be the companies that figured out how to turn inference into distribution.

This requires:

  1. A product with a shareable “wow moment.” Not just good—magical. The kind of thing users can’t help but tweet about.
  2. Viral mechanics built into the core experience. Every output should be a potential piece of marketing content.
  3. Pricing that makes conversion frictionless. $20-50/month that’s an immediate no-brainer when the value is obvious.
  4. The financial runway to absorb inference costs while the viral loop compounds. This is where well-capitalized AI startups have an edge—Open Evidence, Harvey, Lovable all raised massive rounds specifically to fund this strategy.

The Hard Truth for Incumbents

Here’s what I worry about for traditional B2B:

You did everything your board asked. You got profitable. You built an agent. It works. Your customers like it.

But your agent costs $2.50 per interaction. You need 50 million interactions this year. That’s $125M in inference costs.

Meanwhile, your AI-native competitor raised $300M specifically to fund inference spend. They have no sales team. Their product is so good it creates its own distribution. They’re growing 300% year-over-year while your growth is decelerating.

If you walked into your board meeting and said “Good news guys, inference costs are going down 30% this year because our IT team’s really good at managing costs,” I would throw my mouse at the monitor.

The only way out is to build an agent so good you can charge $10-20K per month because it replaces 20 people with ROI measured in weeks—not a sales pitch claiming it’s that good, but literally that good.

Or: accept that inference isn’t your enemy. It’s the new battlefield. And the companies willing to spend aggressively on compute to create magical, viral products will win—even if their gross margins look terrible by traditional SaaS standards.

Three Takeaways for Founders:

  1. Reframe inference as acquisition cost, not gross margin drag. If spending $1 on inference generates $10 in organic revenue through virality, that’s a 10x return on your “marketing” spend.
  2. Build shareable “wow moments” first, optimize margins later. The companies growing fastest right now prioritized magical user experiences over unit economics—knowing the economics would follow.
  3. If you can’t afford to compete on inference, find a different wedge. Either charge enough that inference costs don’t matter ($10K+/month with clear ROI), or accept that you’re competing against companies with infinite compute budgets and no sales teams to support.

Related Posts

Pin It on Pinterest

Share This