John Schulman on dead ends, scaling RL, and building research institutions
John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
John Schulman (OpenAI Cofounder) — Reasoning, RLHF, & plan for 2027 AGI
S3 E18 John Schulman of OpenAI on ChatGPT: invention, capabilities and limitations
Deep Reinforcement Learning (John Schulman, OpenAI)
ChatGPT Creator John Schulman on OpenAI | Ray Summit 2023
John Schulman 3: Deep Reinforcement Learning
The inside story of how ChatGPT was built – OpenAI cofounder John Schulman
Scale AI Leadership Summit 2024: Lightning Talk with John Schulman
7. Deep Reinforcement Learning John Schulman, OpenAI
John Schulman: OpenAI and recent advances in Artificial Intelligence - #16
John Schulman 2: Deep Reinforcement Learning
Deep RL Bootcamp Lecture 6: Nuts and Bolts of Deep RL Experimentation
John Schulman 4: Deep Reinforcement Learning
John Schulman on Data wall
Deep RL Bootcamp Lecture 5: Natural Policy Gradients, TRPO, PPO
John Schulman 1: Deep Reinforcement Learning
John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI
John Schulman on Post-training
[07] John Schulman - Optimizing Expectations: From Deep RL to Stochastic Computation Graphs (9/2020)