Ali Ansari (Micro1) — The Human Data Engine
- 01The Infinite Last Mile: Why Human Data Demand Never Ends
- 02Compounding Errors Are the Core Weakness of Current LLMs
- 03Real-World Data Is a New Scaling Law
- 04Company Data Partnerships: An Untapped Multi-Billion Dollar Market
- 05The Fragmentation of AI into Niche Models, Not One Winner
- 06The Strategic Project Lead (SPL): A New High-Value Role Nobody Is Talking About
1. Key Themes
The Infinite Last Mile: Why Human Data Demand Never Ends
Ali argues that the need for human-generated training data is structurally infinite — not because models fail to learn, but because human job functions evolve upward as models absorb lower-level tasks, constantly creating new domains to train on. Three forces compound this: increasing task horizon length, domain drift (e.g., laws changing), and the emergence of net-new human functions enabled by AI leverage.
"When models get very good at the, let's call it the current action space of all things in law or any other domain, that means that humans in those functions will be able to use models really nicely for their job, which means that their job sort of like action space will change and they will rely on models a lot for everything they do. So they'll have free time to do sort of higher level, more nuanced tasks. And in a lot of cases, they're going to sort of invent new functions that will be very useful for their job, for their customer. And those net new functions will also require distilling into models for the models to get good on. So there's this concept of sort of like infinite last mile that emerges." [00:11:54]
Compounding Errors Are the Core Weakness of Current LLMs
The reason humans remain indispensable is not raw accuracy at individual steps — it is the catastrophic compounding of errors across multi-step tasks. Even 90% accuracy per step, across 20 steps, mathematically approaches zero. Long-horizon human-generated data directly addresses this failure mode.
"Even if you get very high accuracy on like the first part of any given task, like 90% or something, and if it's like 20 parts to that task and you just do 0.9 to the power of 20, I mean, you pretty much approach zero. And so this idea of like compounding errors is what models are really bad at, but humans are much better at." [00:15:44]
Real-World Data Is a New Scaling Law
Ali frames "real worldness" as a distinct scaling dimension alongside task horizon length — essentially a second axis on which training data quality is measured. Synthetic or bot-generated data trains models on too-clean distributions, causing performance collapse when deployed on actual messy workplace inputs.
"There's like two really important qualities of data which we sort of think of as the two new dimensions of scale, or like two sort of new scaling laws, quote unquote. One is the horizon thing we talked about earlier. The other is this notion of real worldness, which is like how real world is each data point or each task that we're creating." [00:34:05]
Company Data Partnerships: An Untapped Multi-Billion Dollar Market
Micro1 is actively purchasing anonymized corporate data corpora — Slack logs, Google Drive documents, internal databases — from companies of 30–300 employees and reselling processed versions to model labs. Ali states that if they could fulfill current demand, it would add billions in run rate immediately.
"If we were able to fulfill this demand right now, we would add a couple of billion in run rates like right away. So we're very much — the demand is high on the model side, our customers, they really demand this notion of like real world data, which comes from companies." [00:34:05]
The Fragmentation of AI into Niche Models, Not One Winner
Ali believes foundation model companies will not dominate every domain. The future is many specialized models built on top of baseline reasoning engines, each owning a vertical's "intelligence layer" through evaluation — not pre-training.
"I don't think the large providers are going to solve all the problems. I think it's very likely that what I call the sort of baseline reasoning model companies will be probably the largest companies in the AI ecosystem. But I don't think they're going to solve every domain and go sort of the last mile in every single domain. Because I think there's simply too many domains to do that." [00:18:37]
The Strategic Project Lead (SPL): A New High-Value Role Nobody Is Talking About
A new operational role — Strategic Project Lead — is commanding compensation approaching $1M/year for recent graduates, because it requires managing 100+ domain experts per pipeline, interfacing with lab researchers, and owning data quality and throughput end to end. No prior experience exists for this role.
"There are like a few people here in the office that are like two years out of school, they're making like a few hundred thousand a year, at like sometimes even like close to like a million dollars a year, because their job is like very important. And you can't like buy experience because it doesn't exist." [00:31:06]
Data, Not Algorithms or Compute, Is the Most Important Scaling Ingredient
Ali is explicitly positioning data as the single most important of the three AI scaling ingredients (data, algorithms, compute), and Micro1's singular focus on it is both a product thesis and a talent recruiting thesis.
"There's like these three ingredients, obviously data, algorithms, and compute, and the labs sort of try to focus on all three to build these really powerful models. But what we're seeing is like data is increasingly the most important ingredient. And that's the only thing that we focus on, which I think a lot of researchers really align with." [00:30:08]
Human Taste and Creativity Are Structurally Unconsolidatable
Ali offers a specific mechanism for why AI cannot replicate taste: the data labeling process requires expert consensus to produce training signal, and taste domains are ones where the more experts deliberate, the more they diverge rather than converge — making a training rubric impossible to construct.
"Things that require like human taste and human creativity is what I think models are going to have a really tough time with. The reason for this is like every time we create data sets to help models improve, we have a bunch of experts that we try to get them to come to consensus on what good looks like... I think taste, or like art, and what good videos and good images look like — there actually isn't a way that you can get to consensus. As you debate more and more in these areas, you actually diverge more." [00:26:20]
2. Contrarian Perspectives
AI Will Create Far More Jobs Than It Destroys — And It's Already Happening
Against the dominant public narrative (Ali cites ~70% U.S. disapproval of AI, partly driven by job-loss fears), Ali argues the opposite is already empirically true: net new job categories are being created at scale, they just aren't being counted as "jobs" by conventional metrics.
"There's going to be a lot of new jobs created by AI and there's going to be orders of magnitude of net new jobs that are created that are well above the jobs that are lost... you can just look at the reality of now, which is there's in fact way more jobs being created than before. And there's this new category of jobs which we don't call jobs, we don't count as jobs because they're like opportunities, but it literally is like amazing opportunities for hundreds of thousands of experts." [00:49:28]
Most Companies Should Not Pre-Train Their Own Models — But Should Do Post-Training
Conventional wisdom is split between "just use APIs" and "build your own model." Ali's contrarian take is more precise: pre-training from scratch is wasteful, but every AI application company will and should develop its own post-training layer — and that distinction is strategically critical.
"I don't think it's smart to like run these massive pre-training clusters to kind of get to similar outcomes. So I think there's going to be a lot of post training that happens. And I think probably every company that builds any sort of AI application is going to do their version of post training." [00:20:28]
Flexibility Plus Hardcore Culture Is Not Contradictory — It's a Talent Strategy
Conventional Silicon Valley wisdom assumes you must choose between demanding work hours and employee autonomy. Ali's model — no mandatory weekend work, but leadership works 7 days visibly — produces voluntary weekend effort across the team, with no mandate required.
"We actually don't require weekend work from any of our team members, but we try to, and it sounds a bit cheesy, but I think it works, which is we try to inspire it by everyone in leadership is constantly working and trying to do useful work, not just for the sake of working. And because of that, there's a lot of exciting work to do for the rest of the team as well, so everyone ends up working weekends. But we don't require it, which I think is part of why everyone sort of does it." [00:05:09]
Studying Math and Physics Beats Studying AI Itself
When asked what skills to develop to thrive in an AI-first world, Ali's advice is counterintuitive — skip AI tooling and go to foundational principles. Learning the tools themselves is a lower-leverage activity than building the mathematical intuition that lets you understand and wield them deeply.
"I would say you probably want to skip the levels of abstraction. You don't want to pick where on the level of abstraction you study. And instead you go into like the most basic principles, which is like math and physics. So like, on my free time, I like to study as much as I can math and physics, because I think that helps you actually understand these tools in a better way and in a more fundamental way." [00:24:25]
3. Companies Identified
Micro1
A human data and AI infrastructure company that recruits domain experts (lawyers, doctors, engineers, finance professionals) to create structured training data for foundational model companies, and also helps enterprises build and evaluate production AI agents. Also operates an AI recruiter product. Why mentioned: Primary subject of the episode; described as a breakout player in AI data with Reuters reporting a multi-hundred million dollar valuation.
"We help the foundational model companies improve their capabilities. And then we also help enterprises evaluate their way into making real agents, production level agents." [00:01:44]
Cursor
An AI-native coding tool cited as the leading example of a company that has successfully developed its own post-training intelligence layer on top of foundational models. Why mentioned: Used as proof that application-layer companies can and should build proprietary post-training capabilities.
"Cursor is a perfect example where they — I think they've done — I mean, now they're doing more than post training, but I think there's going to be a lot of companies like that that end up having their own sort of intelligence layer." [00:20:28]
Amazon
Mentioned as the platform Ali used as a teenager to flip resold textbooks — an early entrepreneurial venture. Why mentioned: Origin story context.
"I would buy textbooks from garage sales and Goodwills and I would like take those and like resell them on Amazon." [00:46:33]
4. People Identified
Ali Ansari
Founder and CEO of Micro1. Born in Iran, immigrated to the U.S. at age 10 via the green card lottery. Started his first business at age 13–14 flipping textbooks. Spent roughly 12 years building companies that failed before Micro1 began scaling meaningfully over the last two years. Why mentioned: Primary guest; building one of the most strategically positioned companies in the AI training data space.
"The time that Micro1 has really grown in the major ways, like really just the last two years. And the sort of 12 years before that was like all failures. And so it took a while — it took like maybe close to 13, 14 years or something for the first product to actually work." [00:47:32]
Elon Musk
Entrepreneur; CEO of Tesla, SpaceX, and other ventures. Named as Ali's most admired person outside family. Why mentioned: Ali credits Musk's "think in limits" product philosophy — specifically the principle of deleting parts every time you add them — as a core decision-making framework at Micro1.
"Every time you add parts, you should think about deleting parts at the same time, because if you don't in the limit, you have infinite parts and you have an infinitely complex system that you can't use. So there's this notion of thinking in limits, which I think Elon does in a really good way." [00:48:32]
5. Operating Insights
The "Think in Limits" Decision-Making Framework
Ali applies Elon Musk's product simplicity heuristic as a live operating practice: every time a feature, role, or process is added, simultaneously force the question of what to delete. Taking any decision "to the limit" — imagining the endpoint of a trend — reveals whether it produces an unmanageable system or a defensible one.
"Every time you add parts, you should think about deleting parts at the same time, because if you don't in the limit, you have infinite parts and you have an infinitely complex system that you can't use. So thinking in limits — I sort of like take it to the limit and I think it helps make good decisions faster." [00:48:32]
Protect Deep Work by Structuring the Day in Two Distinct Modes
Ali explicitly separates reactive work (Slack, small actions, keeping things moving — daytime) from generative work (product reviews, design reviews, pipeline architecture — 8 p.m. to 1 a.m.). This prevents the reactive layer from consuming all cognitive bandwidth and preserves a dedicated window for the highest-leverage work.
"Around six or seven is when I try to shut down Slack a bit... from 8 p.m. all the way to roughly 1 a.m. or so is more of the focused work where I do product reviews, check a bunch of big files, review designs, try to set up some new pipelines for customers — which is the fun part of the day, but the rest keeps the company moving forward." [00:03:41]
Weekly Pillar Review as a Qualitative Operating System
Rather than tracking KPIs or OKRs, Ali runs a five-minute Sunday qualitative scan across three to four personal and company pillars (Micro1: product/engineering/research, growth, talent; plus family and health). This surfaces misalignment early and directly drives goal-setting for the coming week — a lightweight but structured operating rhythm.
"I go through these few pillars that I really care about. And I think about those pillars at a qualitative, high level, like how each of them are going... And then this helps me kind of think about the current state of things and how I feel about all these things that I care about. And it also helps me determine the goals for the week." [00:53:20]
6. Overlooked Insights
Corporate Data Sales (30–300 Person Companies) Are an Underpriced, Immediately Monetizable Asset
Ali mentions almost in passing that companies with 30 to 300 employees can sell anonymized data corpora — Slack conversations, Google Drive documents, internal databases — and receive anywhere from a few hundred thousand to $10M+ depending on volume and diversity. This is not discussed anywhere in mainstream media as a legitimate revenue stream for small-to-mid-size businesses, yet Ali states demand is so intense that fulfilling it would add billions to Micro1's run rate immediately. For any operator of a company in that size range sitting on years of diverse internal communications and documents, this is an overlooked near-term liquidity event that requires no product change — only data anonymization and a partnership agreement.
"Usually companies can make anywhere from like a couple hundred thousand for a corporate snapshot of their data all the way to like a few million. And in some cases it could be even 10 million plus for really big companies... the company range is like usually from like 30 people to like 200, 300 people. It's like usually the best range." [00:36:59]
Frontier Post-Training Researchers Are Measured in the Hundreds — Not Thousands
Ali quietly reveals an extraordinary supply constraint that has enormous implications for anyone trying to build or invest in model capability improvement: the number of researchers who genuinely understand the reinforcement learning recipe and data pipeline design to improve model capabilities at the frontier is not thousands — it may be only a few hundred people globally. This is a far tighter bottleneck than almost anyone outside the labs publicly acknowledges, and it means that whoever captures this talent pipeline (as Micro1 is attempting) holds strategic leverage over the entire AI capability improvement ecosystem.
"Frontier researchers that know the RL recipe, that know what data pipelines should look like to really improve model capabilities — there isn't that many of them. There's like basically a couple thousand at most, maybe even a few hundred that really know how to improve model capabilities, especially in the post-training context." [00:29:10]