0 Results for ""

AI
Podcast

How do millions of simulated worlds teach AI to think? | Patronus AI

How do we systematically evaluate and improve increasingly autonomous AI systems? Rebecca Qian is a former fundamental NLP researcher at Meta, now she's building Patronus AI.
Glenn Solomon
Dan Cahana
Glenn Solomon
Dan Cahana
April 10, 2026

How do we systematically evaluate and improve increasingly autonomous AI systems?

In this episode of Notable Perspectives, Glenn Solomon and Dan Cahana sit down with Rebecca Qian, co-founder and CTO of Patronus AI. A former fundamental NLP researcher at Facebook AI, Rebecca and her team are now creating millions of adaptive, simulated environments — “intelligent worlds” — that teach AI agents to reason, plan, and make decisions like humans.

Join us as we explore:

  • The Eval Problem: Too few evals on too narrow a slice of reality; benchmark and leaderboard culture has created the wrong incentive.
  • Reality of Simulations: The shift from single-turn classification to agents that reason and decide the way people do. And the stakes if you get it wrong — reward hacking, deception, misalignment.
  • Building the Factory: The insight that general intelligence requires generalizable capabilities — not domain-specific data.

Chapters:

00:00 — The Eval Problem: Why current AI evaluation methods are fundamentally broken—and what it means for building reliable, autonomous systems.

00:58 — The ChatGPT Turning Point: The moment that signaled AI was ready to move beyond research into real-world decision-making and enterprise use.

02:44 — The Alignment Challenge: Why teaching AI to reason like humans in messy, unpredictable environments is the core problem to solve.

04:08 — Why Simulations Matter: How simulated environments unlock scalable training, safer experimentation, and better real-world performance.

06:19 — The Shift to AI Agents: From static models to dynamic systems that take actions over time and make complex decisions.

07:35 — Reward Hacking Risks: How AI systems learn to “cheat” evaluations—and why that poses a serious risk if left unchecked.

09:05 — Choosing What to Simulate: Why focusing on transferable human capabilities (not job-specific tasks) is key to general intelligence.

11:30 — Building Adaptive Worlds: The power of curriculum learning and environments that evolve alongside increasingly capable models.

12:38 — The Simulation Factory: Inside the vision to scale millions of intelligent, adaptive environments for training AI.

13:59 — The Future of AI: What it looks like to simulate reality itself—and the massive opportunity ahead.

14:58 — Founder Mindset: How to stay grounded, resilient, and adaptive while building at the cutting edge of AI.

16:35 — Closing: Final thoughts on the future of evaluation, intelligence, and building trustworthy AI systems.

Share

Subscribe to Worth Noting

A newsletter from the team at Notable Capital