Data Infrastructure
CAPITAL FIGURES ARE MEDIA-EXTRACTED ESTIMATES, NOT VERIFIED FILINGS.
EXTRACTED FROM 25+ PODCASTS & VC NEWSLETTERS · MEDIA-REPORTED FIGURES, NOT VERIFIED FILINGS
Market Context
Data infrastructure is experiencing a supercycle driven by AI's insatiable demand for clean, structured, and accessible data at every layer of the stack — from physical data centers to vector databases to unified investment data platforms. Institutional capital is flooding in at scale, with $26.4B deployed across 22 deals in the last 28 days alone, and traditional financial giants like BlackRock, JPMorgan, and Goldman Sachs leading the charge alongside specialist tech investors. The competitive edge is shifting from raw data collection toward proprietary scoring, semantic layers, and unified data foundations — a structural evolution confirmed by practitioners at firms like Earlybird and Ontario Teachers' Pension Plan (OTPP).
Investment Activity
- Databricks raised a $12B growth round led by JPMorgan, BlackRock, and Goldman Sachs at a $41B valuation, confirming its status as the defining private data-and-AI infrastructure company.
- Supabase raised $500M led by Coatue, backing its open-source backend-as-a-service platform built for the agentic AI era.
- Hightouch raised $150M in a Series D, reinforcing demand for data activation and reverse ETL platforms connecting warehouses to business workflows.
Key Players
- Databricks is the anchor company in this theme, raising $12B at a $41B valuation and described by investors as a quasi-public company more liquid in private markets than some public biotechs.
- Supabase is an open-source backend platform that raised $500M from Coatue, with its MCP server integration positioning it directly in the agentic AI workflow stack.
- Hightouch is a data activation and reverse ETL platform that raised $150M in Series D funding, helping enterprises push warehouse data into operational systems and AI agents.
- Digital Edge is a Singapore-headquartered hyperscale edge data center operator across Asia Pacific with over 21 facilities and 500 MW of critical IT load, reportedly exploring a sale at a ~$10B valuation.
- Foresight is an emerging platform for unifying public and private investment data, founded by Jason Miller — a practitioner with prior data infrastructure roles at BlackRock, Point72, and Greycroft — targeting the structural data gaps inside investment firms.
Market Signals
- BlackRock leads all investors with 9 deals in the last 28 days, participating in the $12B Databricks growth round alongside JPMorgan and Goldman Sachs, signaling that the largest asset managers are now direct infrastructure backers.
- Snowflake appears as a top investor with 4 deals, underscoring the data warehouse ecosystem's role in funding complementary infrastructure companies.
- The data layer is bifurcating: raw startup signal data from providers like Harmonic and Evertrace has become commoditized, while proprietary scoring and unified data platforms command premium positioning.
- Institutional investors — including OTPP — are adopting private data infrastructure to pressure-test live deals in real time, representing a new enterprise use case beyond analytics.
- Earlybird Venture Capital achieved 86% entity resolution accuracy in one week during its data infrastructure rebuild, demonstrating that high data quality is now achievable rapidly without large engineering teams.
- Firecrawl launched Prometheus, a forward-deployed agent for automated web scraping, capturing 231 Product Hunt votes and pointing to rising demand for agentic data collection pipelines.
- Digital Edge and Csquare signal physical infrastructure momentum, with Asia-Pacific data center build-out and a confidential US IPO filing respectively reflecting hyperscaler demand spillover.