Teahose.
SIGN IN
NEW HERE — WHAT TEAHOSE DOES
We read the entire AI & tech firehose — so you don't have to.
PODPodcastsAll-In, No Priors, Acquired…
NEWNewslettersStratechery, Newcomer…
PAPPapersPhysical AI research
PHProduct Huntdaily launches
VCInvestor ScoutSequoia, a16z, Benchmark…
CLAUDE DISTILLS →
7 reads, 30 sec each — free, 6 AM ET.
+ a live graph of the companies, people & themes underneath.
HOME/SOURCERY/Benchmark's AI Bets: Cerebras, S…
POD
// EPISODE
SOURCERY

Benchmark's AI Bets: Cerebras, Sierra, Legora, Fireworks, Starcloud, Gumloop..

DATE June 29, 2026SOURCE SOURCERYPARTICIPANTS EVERETT RANDLE, MOLLY O'SHEA
// KEY TAKEAWAYS6 ITEMS
  1. 01The Death of Spreadsheet Investing: All the Old Golden Rules Are Inverted
  2. 02Scale No Longer De-Risks a Company
  3. 03Inference Is the Waterfall: Every New Business Model Traces Back to It
  4. 04The Agent Economy Is the Most Important Business Model Shift Since SaaS
  5. 05AI Business Models Are All Structurally Different From Each Other
  6. 06The Frontier vs. Open Source Question Hinges Entirely on Whether RSI Is Real

Sourcery | Everett Randle & Molly O'Shea


1. Key Themes

The Death of Spreadsheet Investing: All the Old Golden Rules Are Inverted

The classic SaaS metrics — 70–90%+ gross margins, capital-light models, high gross retention, no CapEx — were the first-principles foundation for why software companies deserved high revenue multiples. In AI, every single one of those rules is now inverted: high gross margins signal nobody is using your AI features, services-heavy implementations are celebrated ("FDE is the new PLG"), and training your own models requires standing up an internal research lab with massive GPU spend.

"All of the golden rules of the past that defined like the spreadsheet investing era are all gone. And actually, the most popular companies and categories are almost the inverse of all these golden rules." 00:00:00 — Everett Randle

Scale No Longer De-Risks a Company — It May Actually Correlate Positively With Risk

In the old paradigm, scaling sequentially de-risked product-market fit, unit economics, TAM, and market leadership. A company that kept growing was becoming safer. That relationship has broken. Today you can have a company with over $1B in revenue that still hasn't proven durable unit economics or product differentiation, meaning impairment risk may actually increase with scale.

"It feels like the risk of that over time is actually sort of flat or even like maybe there's like a weird positive, you know, correlated relationship with scale and risk. And it's a very disorienting thing because you're just very used to the fact that the bigger you are, kind of the safer the company is. And it doesn't feel like that anymore." 00:03:04 — Everett Randle

Inference Is the Waterfall: Every New Business Model Traces Back to It

The explosion in revenue growth rates — companies going 1 to 20 to 100 rather than 1 to 3 to 9 — is directly enabled by inference-based business models. Whether outcome-based pricing (Sierra), broker-style resale, or platform abstraction (Fireworks), the common thread is monetizing inference. This removes the rate limit on growth.

"When you hear about all of these companies and you hear about all of these businesses that are going not like 1 to 3 to 9 to 20 like they used to, but are going 1 to 20 to 100 or 1 to 30 to 300, all of that can be traced back to the enablement of a business model via inference." 00:17:41 — Everett Randle

The Agent Economy Is the Most Important Business Model Shift Since SaaS — Despite Being Over-Marketed

Everett is openly annoyed at how "agents" has been marketed to death, but still calls it the most significant product and business model shift in the history of technology. The reason: it moves the buyer's mental frame from "I buy a license" to "I buy white-collar work on tap," and it obliterates the rate limit on per-customer spend. A single developer spending $3,000/month on Claude Code equals $36,000/year per seat — versus a typical $50,000 total ACV in SaaS.

"It represents the most important, both product, like product slash technology, and then also just like raw revenue and business model shifts we've ever seen probably in, in, in like maybe the history of technology." 00:22:05 — Everett Randle

AI Business Models Are All Structurally Different From Each Other — Unlike SaaS, Which "Tasted Like Chicken"

Robert Smith of Vista Equity coined "software tastes like chicken" — every SaaS P&L looks the same at maturity. AI destroys that. An inference platform leasing GPUs (Fireworks) vs. a data center constructor acquiring land and power (Crusoe) vs. a foundation model lab vs. an AI app company — these are more different than alike in capital intensity, margin profile, and business model, making cross-company benchmarking nearly impossible.

"Software tastes like chicken. And that's why it's beautiful. Every software company, when you look under... at the balance sheet, at the P&L, they're all the same... And like that is also not the case now. Like there's so many different business models that exist in AI." 00:10:41 — Everett Randle

The Frontier vs. Open Source Question Hinges Entirely on Whether RSI Is Real

The long-term pricing power of frontier labs (Anthropic, OpenAI) depends on one binary: does recursive self-improvement (RSI) happen, creating "geniuses in a data center," or do capabilities hit a ceiling while open source distillation closes the gap to 95%? If the latter, frontier labs lose pricing power dramatically, though their consumer products have enough brand stickiness that it wouldn't be a death blow.

"If you do end up, you know, at some point topping out on capabilities sometime in the next few years and distillation continues, I think it's much harder to garner a premium margin if you're a frontier model company." 00:34:31 — Everett Randle

The Anthropic Liquidity Event Will Be 35x the Scale of the Largest Prior Pre-IPO Returns — Ecosystem Shock Is Coming

Everett ran an analysis comparing the Anthropic $380B round to the best pre-IPO rounds of the last decade (Slack, DoorDash, Snowflake, NuBank). The Snowflake pre-IPO turned ~$500M into ~$2.5B over four years. If Anthropic IPOs at $1.5T, it returns 35 Snowflake pre-IPO rounds in a single round. He knows people with $3–4B invested in Anthropic who may 5x that in under five years. The knock-on effects — on SF real estate, new company formation, reinvestment behavior — are barely being discussed.

"When you think about the Anthropic $380 billion round, if Anthropic was to go public and get liquid at a $1.5 trillion valuation... they would, the gross return of their $380 billion round, the 30 billion that they raised at 380, it would return 35 times that of the Snowflake pre IPO round." 00:00:30 — Everett Randle

Benchmark's Edge Is Founder-First, Not Thematic — and the Portfolio Proves It

The entire Benchmark AI portfolio — StarCloud, Sierra, Legora, Fireworks, HeyGen, Gumloop, Exa, Manus, Langchain, Cerebras — was built with zero thematic intent. Every investment was founder-out. Several companies pivoted after investment. The StarCloud investment closed weeks before Elon Musk publicly endorsed orbital data centers. The portfolio accidentally looks like a thematic sweep because great founders find great categories.

"I realized like, oh wow, that like there was literally no thought of, of, of like, oh, this is the category that we're going into. It was truly kind of what we talked about before. It was all founder out... great founders are always in style, whereas like these business models can go in and out of style." 00:50:36 — Everett Randle

"Venture Capital" Is Now a Product, Not a Firm Type — Nomenclature Has Failed the Industry

Andreessen Horowitz, General Catalyst, and Iconiq are alternative asset managers with venture products — not venture capital firms. Calling them all "venture" obscures fundamentally different risk profiles, time horizons, and LP relationships. The industry has failed to update its terminology as firms have evolved, creating confusion for LPs, founders, and the press.

"General Catalyst and Andreessen Horowitz, those are alternative asset managers because they have so many products. You know, many of these places have like... General Catalyst is still has a venture product and they also have a growth product and they also have a debt product... So I think that like venture in many ways is still the same, but it's a product now for many of these firms." 00:41:39 — Everett Randle


2. Contrarian Perspectives

The "Mom Test" for AI: Most Users Need Nothing Close to a Frontier Model

While the consensus narrative focuses on frontier model dominance, Everett argues that the vast majority of everyday AI queries — and a rapidly growing share of enterprise tasks — can be handled by highly cost-effective open source models with no perceptible quality loss. His mom (non-tech-savvy, rural Colorado) needs nothing frontier. The important insight is that this isn't zero-sum: demand for both frontier and non-frontier is growing in parallel.

"For my mom, what is the amount of queries or the amount of things that she needs out of AI that can't be done by a really, really, really cost effective open source model?... Now it's like 100%. Like there's nothing that my mom actually asks of her AI products that needs to be done by the frontier or even a nearer frontier model." 00:28:06 — Everett Randle

High Gross Margins in an AI Product Are Actually a Red Flag

This directly inverts one of the most sacred SaaS metrics. In the old world, 90% gross margins were a sign of excellence. Today, if your AI product has high gross margins, it signals that nobody is actually using your AI features — because AI inference costs money. The "good" AI companies should have structurally lower margins than legacy SaaS.

"Now gross margins, if your gross margins are high, that's actually a bad thing because, you know, AI inference costs a lot of money. And if you have an AI product with high gross margins, that means that no one's using your AI features." 00:07:05 — Everett Randle

Late-Stage AI Companies Can Have More Upside Than Series C — The Risk/Return Curve Is Broken

The conventional VC wisdom is early stage = high risk/high return, late stage = low risk/low return. Everett argues this is no longer true. Because markets are so large and companies are staying private so long, a late-stage company can offer dramatically more upside than a Series C. His first investment at Kleiner was SpaceX at $100B+, which then "reborn" via Starlink — the majority of SpaceX's business today is consumer broadband, not launch.

"Because these markets are so big in AI and elsewhere, and because these companies are staying private for much longer, you actually have these situations where a late stage company can have much higher upside than like a Series C, which is super weird." 00:38:13 — Everett Randle

Independent Third-Party AI Vendors Are Essential Precisely Because the Underlying Models Are "Jagged"

The conventional view is that foundation labs will capture the entire stack. Everett argues the opposite for enterprise: models are jagged (Gemini best for multimodal, Claude best for coding, open source best on cost), and enterprises actively need a neutral third party to route intelligently and prevent employees from, e.g., using the most expensive frontier model to check the weather. This is the structural case for Gumloop and similar platforms.

"The models at any given point are good at different things... you do need to have a third-party vendor that's sort of watching out for the customer and not just saying, you know, like in some of these enterprises, you have employees checking the weather with Opus 4.8." 00:26:04 — Everett Randle


3. Companies Identified

Fireworks AI

AI inference cloud platform. Leases GPU/inference capacity from partners rather than building data centers; value-add is software on top of inference to reduce cost and latency. Benchmark portfolio company cited as a leading inference platform alongside Modal, FAL, and Base10.

"We're huge investors in Fireworks. But if you think about the other inference platforms like FAL or Base10, Modal, a lot of app companies that are using a usage-based model or an outcome-based model, all of these are different derivations of monetizing inference." 00:17:17 — Everett Randle

Sierra

Outcome-based AI customer support platform; charges per completed deflection of a customer query. Described as the "horizontal AI winner" in Benchmark's portfolio and a leading example of abstracted inference monetization.

"If you think about like a Sierra that is, you know, outcome-based pricing for like actual completed deflections of customer support queries and things like that. That's largely abstracted, but you're still monetizing this thing which is based on inference." 00:18:36 — Everett Randle

Gumloop

Enterprise AI agent and automation canvas platform. Enables every employee (not just developers) to build, iterate, and collaborate on automations and full AI agents, triggered via Slack/Teams or running autonomously.

"Gumloop is an independent third-party software platform that does a collaborative AI agent and AI automation canvas for enterprises... every single employee within an organization can create, can iterate on, and can collaborate on both just like very simple automations... but then also full-on AI agents." 00:23:56 — Everett Randle

Legora

Described as the "vertical AI winner" in Benchmark's portfolio and also cited as a "data infrastructure play." (Note: also referred to briefly in context of legal AI, consistent with Legora's known positioning as an AI platform for law firms.)

"They have like the vertical AI winner in Legora. They have like the horizontal AI winner in Sierra. They have like a data infrastructure play in Legora." 00:50:07 — Everett Randle

StarCloud

Orbital data center company — operates data centers in space. Named in the SpaceX S1. Had already proven a GPU working in space before Benchmark invested, making them the only company at the time with a GPU in orbit. Investment closed weeks before Elon Musk publicly endorsed orbital data centers.

"When Chathan invested in StarCloud, it was like weeks, like we closed the investment weeks before Elon started like, you know, professing his love and bullishness on overall data centers." 00:51:32 — Everett Randle

Cerebras

AI chip company that recently went public. Benchmark portfolio company mentioned as a marquee AI infrastructure bet.

"You have StarCloud, data centers in space, named in the SpaceX S1. Then you have Cerebras that just went public, Sierra, Langchain, Legora, Fireworks..." 00:49:03 — Molly O'Shea

Langchain

Developer infrastructure platform for building LLM-powered applications. Described as Benchmark's "developer play" in the AI portfolio.

"They have like a developer play in Langchain." 00:50:07 — Everett Randle

HeyGen

AI video generation platform. Described as a "prosumer play" in the Benchmark AI portfolio.

"You know, they have like a prosumer play in HeyGen." 00:50:07 — Everett Randle

Manus

AI agent company. Benchmark portfolio company, mentioned in passing as part of the AI portfolio.

"Mercore. Manus." 00:49:35 — Molly O'Shea; confirmed by Everett Randle

Exa

AI-native search/web retrieval company. Part of the Benchmark AI portfolio.

"HeyGen, Gumloop, that we talked about, and Exa." 00:49:46 — Molly O'Shea

Mercore

Benchmark portfolio company; Everett noted the founder recently "cleared a lot of air" on a Harry Stebbings podcast following a data breach, with positive things happening at the company.

"I just listened to his podcast with Harry Stebbings, and I was really surprised about the positive things that are going on there... He cleared a lot of air." 00:49:36 — Molly O'Shea / Everett Randle

Crusoe

Data center construction company that acquires power, land, and permits and builds data centers, often for hyperscaler counterparties. Contrasted with Fireworks as a completely different business model despite surface-level categorization as "inference."

"Crusoe is actually going and constructing data centers. They're going and acquiring power, land, permits. And they're building data centers oftentimes for hyperscaler counterparties who become their customers." 00:09:27 — Everett Randle

Cognition

AI coding agent company. Cited for publishing research on post-training an open source model on low-complexity tasks seen within their platform, demonstrating ~95% cost savings vs. frontier for those tasks.

"Cognition published some work on this as well where they actually, you know, trained or I think they post-trained an open source model on tasks that they saw done within Cognition where there was low task complexity... you'd actually save an immense amount of money if you took it away from the frontier." 00:29:30 — Everett Randle

Anthropic

Foundation model lab. Raised at $380B valuation ($30B round). Everett's analysis shows if it IPOs at $1.5T (4x current valuation), the return on that single round would equal 35 Snowflake pre-IPO rounds. Claude Code cited as having gone "parabolic" following the Opus 4.5 coding breakthrough.

"I know several people that have like three to $4 billion invested into Anthropic... they might return five times their money in less than five years on $4 billion. Like it's unbelievable." 00:44:42 — Everett Randle

FAL

AI inference platform. Mentioned alongside Fireworks, Base10, and Modal as a leading derivation of inference monetization.

"If you think about the other inference platforms like FAL or Base10, Modal..." 00:17:41 — Everett Randle

Base10 Partners

AI inference platform / venture firm with inference infrastructure. Mentioned as part of the emerging inference platform landscape.

"If you think about the other inference platforms like FAL or Base10, Modal..." 00:17:41 — Everett Randle

Modal

Cloud compute platform for AI inference. Part of the inference platform ecosystem cited.

"If you think about the other inference platforms like FAL or Base10, Modal..." 00:17:41 — Everett Randle

SpaceX / Starlink

Everett's first investment at Kleiner Perkins, made at over $100B valuation. Key lesson: the majority of SpaceX's business today is Starlink (consumer and B2B broadband), not launch — a complete business rebirth at late stage that generated enormous returns for late-stage investors.

"If you look at like the P&L today and in the S1, the vast majority of the business is their consumer broadband business and B2B broadband business via Starlink. It's not even a launch business." 00:39:01 — Everett Randle


4. People Identified

Eric Vishria

General Partner at Benchmark. Ranked #3 on the Midas List in the year of recording. Described by Everett as a "Mount Rushmore venture capitalist" — exceptional results combined with unusual intellectual humility, kindness, and lack of ego. His tweet on the parallel growth of on-device, open source, and proprietary inference cited as a key framework.

"Eric Vichry, I think, is just like an incredible role model and leader within Benchmark. I think he is someone who obviously when you look at his results, he's, you know, best of the best. Like he is a Mount Rushmore venture capitalist at this point. But you spend an hour or two hours with him. And he is just an extremely kind, hardworking, you know, completely normal person." 00:54:24 — Everett Randle

Napoleon Ta

GP at Founders Fund, runs their growth investing practice. Mentored Everett — specifically credited not for investing tactics but for modeling a simple, focused life: work hard, spend time with family, avoid networking and podcasts. Described as possessing a clarity of priorities that reshaped Everett's own values.

"The most important thing that I learned from Napoleon was his focus on family and work. And he had a very simple life. He doesn't, you know, he doesn't do podcasts. He doesn't, you know, he doesn't network... He goes to work, he works really, really hard. And then he spends time with his family." 00:53:57 — Everett Randle

Robert Smith

CEO of Vista Equity Partners, where Everett started his career. Coined the phrase "software tastes like chicken" — meaning all SaaS P&Ls look identical at maturity. Cited as formative in Everett's understanding of software business model consistency (and, by contrast, how AI has shattered that).

"The CEO, Robert Smith, would always tell us software tastes like chicken. And that's why it's beautiful. Every software company, when you look under... at the balance sheet, at the P&L, they're all the same." 00:10:41 — Everett Randle

Brad Gerstner

Founder/CEO of Altimeter Capital. Referenced for his framework that we are now in "the age of inference" and that companies downstream of inference will be most rewarded.

"Brad Gersner thinks it's, you know, he says it's the age of inference and the companies that are going to be rewarded most now are downstream of that." 00:15:30 — Molly O'Shea

Philip (StarCloud founder)

Founder of StarCloud. Named specifically as the person-based reason Benchmark invested — had already proven a GPU working in space before the investment, making StarCloud the only company with that achievement at the time.

"The investment was about people. It was about Philip and the team that he had built. And it's like, wow, like these are like one that they already had, you know, a GPU working in space." 00:52:16 — Everett Randle

Chathan (Benchmark Partner)

Benchmark partner who led the StarCloud investment, closing weeks before Elon Musk's public endorsement of orbital data centers.

"When Chathan invested in StarCloud, it was like weeks, like we closed the investment weeks before Elon started like, you know, professing his love and bullishness on overall data centers." 00:51:32 — Everett Randle

Michelle Del Bono

Runs the multifamily office / wealth management division for Andreessen Horowitz principals (Mark Andreessen, Ben Horowitz). Cited as a key resource for thinking about wealth concentration risk as the coming AI liquidity event approaches — specifically how to manage a situation where 90%+ of net worth is in a single position.

"I've done now two episodes with Michelle Del Bono, who, to your point, Andreessen Horowitz has multi-products. He runs their multifamily office of Mark and Ben... And each time I talk to him, I'm asking because there's going to be a huge wealth event." 00:46:40 — Molly O'Shea


5. Operating Insights

Use P×Q×M as Your AI Business Quality Framework Instead of SaaS Metrics

The old SaaS quality shorthand (Rule of 40, gross margin %) doesn't translate. Everett's replacement framework is basic economics: Price × Quantity × Margin. In AI, Q (customers) may be similar to SaaS; M (margins) is almost definitively lower for 99% of AI app companies; but P (price/contract size) can be transformatively larger — nine-figure contracts with startups, which barely existed in SaaS. Operators should use this decomposition to locate where their company has genuine leverage and where it has structural weakness.

"What it comes down to in my mind is this idea of like P times Q times M... In SaaS land, you had like price was your ACV... Your Q is like how many customers... And then your M was... 70 to 90%. Well, now in AI, if we take like an AI app company... The M is almost definitively lower... But the P can be immensely high." 00:11:31 — Everett Randle

Walk Under the Waterfall Before Optimizing the Bucket

For companies with strong product traction but unclear monetization, stop optimizing the revenue model in a vacuum — get under the inference-based revenue "waterfall" first. Revenue model questions (holes in the bucket, bucket size) only become answerable once you're actually receiving the demand signal. The operational lesson: prioritize distribution/demand capture over premature revenue architecture.

"You're on this riverbank and you have this bucket... But you're sitting on the riverbank and over there, there's a fucking waterfall... you should just go walk under the waterfall. And like you don't know until you're under the waterfall whether there's like holes in your bucket or how big the bucket is or anything like that. But the first thing you should do is get under the waterfall." 00:16:47 — Everett Randle

Route AI Tasks by Complexity — Don't Default Everything to Frontier Models

Cognition's published work and Gumloop's enterprise use case both point to the same operational insight: systematically audit your inference spend by task complexity. Many high-volume, low-complexity tasks inside enterprises (and in your own product) are being run on frontier models purely by default. Post-training an open source model on those specific tasks can yield ~95% cost savings with no quality degradation. This is an immediate, actionable cost optimization lever.

"You'd actually save an immense amount of money if you took it away from the frontier because it's a very simple kind of gated task that doesn't need frontier intelligence. And you're going to save, you know, 95% on that action or that query." 00:29:56 — Everett Randle


6. Overlooked Insights

The Coming Anthropic/AI Lab IPO Liquidity Is a Macro Event That Will Reshape the Entire Startup Ecosystem — and Almost Nobody Is Modeling the Second-Order Effects

Everett briefly raises this and then moves on, but it deserves far more attention than it received. He knows individuals with $3–4B invested in Anthropic alone who may 5x in under five years. The Anthropic round alone is 35x the scale of the Snowflake pre-IPO in terms of expected gross returns. When this liquidity hits — across Anthropic, OpenAI, SpaceX, xAI simultaneously — it will represent the largest concentrated wealth creation event in technology history, faster than anything prior. The only place people are predicting the effect is SF real estate (2x asking, all cash). But the downstream effects — which new companies get funded, which causes get endowed, how many new funds get launched, how many employees stay vs. leave to found companies — are almost entirely unmodeled. Any investor or operator thinking about the 2026–2028 startup environment should be building frameworks around this now.

"I just don't know if people really understand and are ready for the impact that this much liquidity could have on the ecosystem... it's going to be a shock that's going to impact every single aspect of life, both practical life in Silicon Valley, but then also the lives of employees, investors, and founders in the ecosystem as well." 00:00:30 — Everett Randle

StarCloud (Orbital Data Centers) May Be the Most Asymmetric Infrastructure Bet in the Benchmark Portfolio — and It Was Made on Pure Founder Conviction Before the Category Existed

Everett mentions almost in passing that the StarCloud investment closed weeks before Elon Musk went public with his enthusiasm for orbital data centers — meaning Benchmark underwrote a thesis that had essentially zero public market validation at the time. The company had already achieved the only GPU in space, which is the kind of technical proof point that rarely gets mentioned casually. Orbital data centers avoid land acquisition, permitting, power grid constraints, and geopolitical jurisdiction — all of the bottlenecks currently strangling terrestrial data center buildout. The fact that it was named in the SpaceX S1 suggests it is already being taken seriously at the highest level of the infrastructure stack. This is a category that most infrastructure investors have dismissed as science fiction, yet it has already cleared the hardest technical hurdle and attracted the most credible infrastructure validator on the planet.

"When Chathan invested in StarCloud, it was like weeks, like we closed the investment weeks before Elon started like, you know, professing his love and bullishness on overall data centers... the investment was about people. It was about Philip and the team that he had built. And it's like, wow, like these are like one that they already had, you know, a GPU working in space. And so like they had proven out their metal and they were the only company that had a GPU in space." 00:51:32 — Everett Randle