Teahose.
SIGN IN
NEW HERE — WHAT TEAHOSE DOES
We read the entire AI & tech firehose — so you don't have to.
PODPodcastsAll-In, No Priors, Acquired…
NEWNewslettersStratechery, Newcomer…
PAPPapersPhysical AI research
PHProduct Huntdaily launches
VCInvestor ScoutSequoia, a16z, Benchmark…
CLAUDE DISTILLS →
7 reads, 30 sec each — free, 6 AM ET.
+ a live graph of the companies, people & themes underneath.
HOME/THEMES/REINFORCEMENT LEARNING
// THEME

Reinforcement Learning

COMPANIES 13VELOCITY ▲ RISINGCAPITAL 28D $106.0M · 5 DEALS
TOP INVESTORS: 776 (2) · abstract ventures (2) · andreessen horowitz (2) · catl (2) · inspired capital (2)

CAPITAL FIGURES ARE MEDIA-EXTRACTED ESTIMATES, NOT VERIFIED FILINGS.

Capital raised, weekly
◀ $106M · wk of 05-252026-05-25 ── 2026-05-25 · WEEKLY
Deals by stage
unknown
$106M · 3 DEALS
seed
$0M · 1 DEAL
series a
$0M · 1 DEAL
Mention momentum
MENTIONS / WEEK · PEAK 52

EXTRACTED FROM 25+ PODCASTS & VC NEWSLETTERS · MEDIA-REPORTED FIGURES, NOT VERIFIED FILINGS

// THEME ANALYSIS
UPDATED JUN 1, 2026

Market Context

Reinforcement learning is undergoing a rapid commercial translation from academic research to production-grade AI systems, with two distinct vectors gaining momentum simultaneously: RL environments for training software agents across enterprise workflows, and RL-powered robotic foundation models targeting industrial dexterity. The convergence of large vision-language model backbones with RL post-training pipelines is producing measurable benchmark leaps — RLWRLD's RLDX-1 achieving 86.8% success on the ALLEX humanoid suite, roughly doubling the performance of Physical Intelligence's π₀.₅ (~40%) and NVIDIA GR00T N1.6 (~40%). Top-tier investors including Andreessen Horowitz, 776, and a16z are concentrating capital into this space, signaling conviction that RL infrastructure is a durable category rather than a research curiosity.

Investment Activity

  • Deeptune raised a $43M Series A led by Andreessen Horowitz, with participation from 776, Abstract Ventures, and Inspired Capital, to build high-fidelity RL environments simulating enterprise software workflows.
  • Deeptune also received a separate Series A investment from a16z and Felicis Ventures, underscoring broad investor appetite for RL environment infrastructure.

Key Players

  • RLWRLD: Builds the RLDX-1 dexterity-first foundation model integrating vision, force sensing, and memory across single-arm, dual-arm, and humanoid robot embodiments, achieving world-first performance on the RoboCasa Kitchen benchmark in collaboration with KAIST.
  • Deeptune: Creates high-fidelity RL environments simulating day-to-day workflows across tools like Slack and Salesforce, enabling AI agents to learn complex multi-step enterprise tasks; backed by $43M from Andreessen Horowitz.
  • UC Berkeley: Home to Sergey Levine's lab, which produced foundational algorithms — IQL, SERL, RLDG, AWAC — that form the theoretical substrate of leading robotic RL systems including LWD and RLDX-1.
  • Covariant: Cited alongside Physical Intelligence, Figure, and Apptronik as a leading builder of multi-task diffusion policy systems for warehouse and logistics manipulation, directly affected by emerging factored diffusion policy research.
  • Google DeepMind: Originators of RT-2, PaLM-E, and Open X-Embodiment — benchmark precedents for the VLA systems space — and the institutional source of the MCTS policy distillation loop behind AlphaGo's self-improvement flywheel.

Market Signals

  • South Korea is emerging as a serious robotics RL hub, with KAIST and RLWRLD co-developing RLDX-1 and achieving top scores on competitive manipulation benchmarks.
  • France's Inria and researchers including Cordelia Schmid (recipient of the Körber European Science Prize) are active contributors to VLA research, indicating European academic momentum in embodied RL.
  • Deal velocity is accelerating: 5 deals in the last 28 days with $4.09B in capital deployed across the theme, led by repeat investors 776, Abstract Ventures, Andreessen Horowitz, and Inspired Capital each appearing in 2 deals.
  • Open-source RL tooling is maturing — SERL's sample-efficient off-policy RL framework from UC Berkeley is now used as a baseline in production VLA ablation studies, lowering the barrier to real-world robotic RL deployment.
  • Benchmark saturation on RLBench and RoboTwin is driving teams toward proprietary hardware evaluation suites (e.g., ALLEX), suggesting the competitive frontier is moving from simulation to real-world generalization.
// COMPANIES
13 COMPANIES
01
Tencent
tencent.com
$7.5B · GROWTH · LIANG WENFENG + TENCENT · JUN 17
34 SIGNALS · LAST SEEN JUN 19, 2026
02
Parallel
parallel.com
$0M · SEED · KHOSLA VENTURES + CHARLOTTE INDEX · MAY 31
4 SIGNALS · LAST SEEN MAY 31, 2026
03
Deeptune
deeptune.ai
$0M · SERIES A · A16Z + FELICIS VENTURES · MAY 31
3 SIGNALS · LAST SEEN JUN 1, 2026
04
ENPIRE
2 SIGNALS · LAST SEEN JUN 18, 2026
05
DeepMind
deepmind.com
12 SIGNALS · LAST SEEN JUN 18, 2026
06
Google DeepMind
deepmind.com
86 SIGNALS · LAST SEEN JUN 17, 2026
07
Ideogram
ideogram.ai
10 SIGNALS · LAST SEEN JUN 15, 2026
08
Physical Intelligence
physicalintelligence.company
70 SIGNALS · LAST SEEN JUN 15, 2026
09
UC Berkeley
berkeley.edu
15 SIGNALS · LAST SEEN JUN 15, 2026
10
SenseNova
sensenova.sensetime.com
14 SIGNALS · LAST SEEN JUN 15, 2026
11
Carnegie Mellon University
cmu.edu
30 SIGNALS · LAST SEEN JUN 11, 2026
12
Prime Intellect
primeintellect.ai
4 SIGNALS · LAST SEEN MAY 18, 2026
13
RLWRLD
rlwrld.ai
16 SIGNALS · LAST SEEN MAY 15, 2026