Generally Intelligent

Share on

A podcast interviewing machine learning researchers studying how to build intelligence. Made for researchers, by researchers.

Untitled AI

Aug 9, 2023 LATEST EPISODE
infrequent NEW EPISODES
1h 23m AVG DURATION
33 EPISODES

Search for episodes from Generally Intelligent with a specific topic:

Latest episodes from Generally Intelligent

Episode 33: Tri Dao, Stanford: On FlashAttention and sparsity, quantization, and efficient inference

Play Episode Listen Later Aug 9, 2023 80:29

Tri Dao is a PhD student at Stanford, co-advised by Stefano Ermon and Chris Re. He'll be joining Princeton as an assistant professor next year. He works at the intersection of machine learning and systems, currently focused on efficient training and long-range context. About Generally Intelligent We started Generally Intelligent because we believe that software with human-level intelligence will have a transformative impact on the world. We're dedicated to ensuring that that impact is a positive one. We have enough funding to freely pursue our research goals over the next decade, and our backers include Y Combinator, researchers from OpenAI, Astera Institute, and a number of private individuals who care about effective altruism and scientific research. Our research is focused on agents for digital environments (ex: browser, desktop, documents), using RL, large language models, and self supervised learning. We're excited about opportunities to use simulated data, network architecture search, and good theoretical understanding of deep learning to make progress on these problems. We take a focused, engineering-driven approach to research. Learn more about us Website: https://generallyintelligent.com/ LinkedIn: linkedin.com/company/generallyintelligent/ Twitter: @genintelligent

phd stanford efficient openai y combinator rl inference

Episode 32: Jamie Simon, UC Berkeley: On theoretical principles for how neural networks learn and generalize

Play Episode Listen Later Jun 22, 2023 61:54

Jamie Simon is a 4th year Ph.D. student at UC Berkeley advised by Mike DeWeese, and also a Research Fellow with us at Generally Intelligent. He uses tools from theoretical physics to build fundamental understanding of deep neural networks so they can be designed from first-principles. In this episode, we discuss reverse engineering kernels, the conservation of learnability during training, infinite-width neural networks, and much more. About Generally Intelligent We started Generally Intelligent because we believe that software with human-level intelligence will have a transformative impact on the world. We're dedicated to ensuring that that impact is a positive one. We have enough funding to freely pursue our research goals over the next decade, and our backers include Y Combinator, researchers from OpenAI, Astera Institute, and a number of private individuals who care about effective altruism and scientific research. Our research is focused on agents for digital environments (ex: browser, desktop, documents), using RL, large language models, and self supervised learning. We're excited about opportunities to use simulated data, network architecture search, and good theoretical understanding of deep learning to make progress on these problems. We take a focused, engineering-driven approach to research. Learn more about us Website: https://generallyintelligent.com/ LinkedIn: linkedin.com/company/generallyintelligent/ Twitter: @genintelligent

principles uc berkeley openai research fellows y combinator theoretical rl neural networks mike deweese

Episode 31: Bill Thompson, UC Berkeley, on how cultural evolution shapes knowledge acquisition

Play Episode Listen Later Mar 29, 2023 75:24

Bill Thompson is a cognitive scientist and an assistant professor at UC Berkeley. He runs an experimental cognition laboratory where he and his students conduct research on human language and cognition using large-scale behavioral experiments, computational modeling, and machine learning. In this episode, we explore the impact of cultural evolution on human knowledge acquisition, how pure biological evolution can lead to slow adaptation and overfitting, and much more. About Generally Intelligent We started Generally Intelligent because we believe that software with human-level intelligence will have a transformative impact on the world. We're dedicated to ensuring that that impact is a positive one. We have enough funding to freely pursue our research goals over the next decade, and our backers include Y Combinator, researchers from OpenAI, Astera Institute, and a number of private individuals who care about effective altruism and scientific research. Our research is focused on agents for digital environments (ex: browser, desktop, documents), using RL, large language models, and self supervised learning. We're excited about opportunities to use simulated data, network architecture search, and good theoretical understanding of deep learning to make progress on these problems. We take a focused, engineering-driven approach to research. Learn more about us Website: https://generallyintelligent.com/ LinkedIn: linkedin.com/company/generallyintelligent/ Twitter: @genintelligent

acquisition uc berkeley shapes openai y combinator rl cultural evolution bill thompson

Episode 30: Ben Eysenbach, CMU, on designing simpler and more principled RL algorithms

Play Episode Listen Later Mar 23, 2023 105:56

Ben Eysenbach is a PhD student from CMU and a student researcher at Google Brain. He is co-advised by Sergey Levine and Ruslan Salakhutdinov and his research focuses on developing RL algorithms that get state-of-the-art performance while being more simple, scalable, and robust. Recent problems he's tackled include long horizon reasoning, exploration, and representation learning. In this episode, we discuss designing simpler and more principled RL algorithms, and much more. About Generally Intelligent We started Generally Intelligent because we believe that software with human-level intelligence will have a transformative impact on the world. We're dedicated to ensuring that that impact is a positive one. We have enough funding to freely pursue our research goals over the next decade, and our backers include Y Combinator, researchers from OpenAI, Astera Institute, and a number of private individuals who care about effective altruism and scientific research. Our research is focused on agents for digital environments (ex: browser, desktop, documents), using RL, large language models, and self supervised learning. We're excited about opportunities to use simulated data, network architecture search, and good theoretical understanding of deep learning to make progress on these problems. We take a focused, engineering-driven approach to research. Learn more about us Website: https://generallyintelligent.com/ LinkedIn: linkedin.com/company/generallyintelligent/ Twitter: @genintelligent

phd designing algorithms openai y combinator simpler rl cmu principled google brain

Episode 29: Jim Fan, NVIDIA, on foundation models for embodied agents, scaling data, and why prompt engineering will become irrelevant

Play Episode Listen Later Mar 9, 2023 86:45

Jim Fan is a research scientist at NVIDIA and got his PhD at Stanford under Fei-Fei Li. Jim is interested in building generally capable autonomous agents, and he recently published MineDojo, a massively multiscale benchmarking suite built on Minecraft, which was an Outstanding Paper at NeurIPS. In this episode, we discuss the foundation models for embodied agents, scaling data, and why prompt engineering will become irrelevant. About Generally Intelligent We started Generally Intelligent because we believe that software with human-level intelligence will have a transformative impact on the world. We're dedicated to ensuring that that impact is a positive one. We have enough funding to freely pursue our research goals over the next decade, and our backers include Y Combinator, researchers from OpenAI, Astera Institute, and a number of private individuals who care about effective altruism and scientific research. Our research is focused on agents for digital environments (ex: browser, desktop, documents), using RL, large language models, and self supervised learning. We're excited about opportunities to use simulated data, network architecture search, and good theoretical understanding of deep learning to make progress on these problems. We take a focused, engineering-driven approach to research. Learn more about us Website: https://generallyintelligent.com/ LinkedIn: linkedin.com/company/generallyintelligent/ Twitter: @genintelligent

phd data foundation engineering stanford scaling models minecraft nvidia openai embodied y combinator irrelevant prompt rl neurips fei fei li

Episode 28: Sergey Levine, UC Berkeley, on the bottlenecks to generalization in reinforcement learning, why simulation is doomed to succeed, and how to pick good research problems

Play Episode Listen Later Mar 1, 2023 94:49

Sergey Levine, an assistant professor of EECS at UC Berkeley, is one of the pioneers of modern deep reinforcement learning. His research focuses on developing general-purpose algorithms for autonomous agents to learn how to solve any task. In this episode, we talk about the bottlenecks to generalization in reinforcement learning, why simulation is doomed to succeed, and how to pick good research problems.

research succeed uc berkeley simulation doomed levine sergey bottlenecks generalization reinforcement learning eecs

Episode 27: Noam Brown, FAIR, on achieving human-level performance in poker and Diplomacy, and the power of spending compute at inference time

Play Episode Listen Later Feb 9, 2023 104:54

Noam Brown is a research scientist at FAIR. During his Ph.D. at CMU, he made the first AI to defeat top humans in No Limit Texas Hold 'Em poker. More recently, he was part of the team that built CICERO which achieved human-level performance in Diplomacy. In this episode, we extensively discuss ideas underlying both projects, the power of spending compute at inference time, and much more.

ai performance spending achieving poker diplomacy cicero cmu compute inference noam brown

Episode 26: Sugandha Sharma, MIT, on biologically inspired neural architectures, how memories can be implemented, and control theory

Play Episode Listen Later Jan 17, 2023 104:00

Sugandha Sharma is a Ph.D. candidate at MIT advised by Prof. Ila Fiete and Prof. Josh Tenenbaum. She explores the computational and theoretical principles underlying higher cognition in the brain by constructing neuro-inspired models and mathematical tools to discover how the brain navigates the world, or how to construct memory mechanisms that don't exhibit catastrophic forgetting. In this episode, we chat about biologically inspired neural architectures, how memory could be implemented, why control theory is underrated and much more.

mit memories prof architecture sharma neural implemented biologically control theory

Episode 25: Nicklas Hansen, UCSD, on long-horizon planning and why algorithms don't drive research progress

Play Episode Listen Later Dec 16, 2022 109:18

Nicklas Hansen is a Ph.D. student at UC San Diego advised by Prof Xiaolong Wang and Prof Hao Su. He is also a student researcher at Meta AI. Nicklas' research interests involve developing machine learning systems, specifically neural agents, that have the ability to learn, generalize, and adapt over their lifetime. In this episode, we talk about long-horizon planning, adapting reinforcement learning policies during deployment, why algorithms don't drive research progress, and much more!

research planning drive progress horizon algorithms hansen uc san diego ucsd nicklas

Episode 24: Jack Parker-Holder, DeepMind, on open-endedness, evolving agents and environments, online adaptation, and offline learning

Play Episode Listen Later Dec 6, 2022 116:42

Jack Parker-Holder recently joined DeepMind after his Ph.D. with Stephen Roberts at Oxford. Jack is interested in using reinforcement learning to train generally capable agents, especially via an open-ended learning process where environments can adapt to constantly challenge the agent's capabilities. Before doing his Ph.D., Jack worked for 7 years in finance at JP Morgan. In this episode, we chat about open-endedness, evolving agents and environments, online adaptation, offline learning with world models, and much more.

learning online oxford evolving adaptation jp morgan offline holder environments deepmind jack parker stephen roberts

Episode 23: Celeste Kidd, UC Berkeley, on attention and curiosity, how we form beliefs, and where certainty comes from

Play Episode Listen Later Nov 22, 2022 112:35

Celeste Kidd is a professor of psychology at UC Berkeley. Her lab studies the processes involved in knowledge acquisition; essentially, how we form our beliefs over time and what allows us to select a subset of all the information we encounter in the world to form those beliefs. In this episode, we chat about attention and curiosity, beliefs and expectations, where certainty comes from, and much more.

attention curiosity beliefs uc berkeley kidd

Episode 22: Archit Sharma, Stanford, on unsupervised and autonomous reinforcement learning

Play Episode Listen Later Nov 17, 2022 98:13

Archit Sharma is a Ph.D. student at Stanford advised by Chelsea Finn. His recent work is focused on autonomous deep reinforcement learning—that is, getting real world robots to learn to deal with unseen situations without human interventions. Prior to this, he was an AI resident at Google Brain and he interned with Yoshua Bengio at Mila. In this episode, we chat about unsupervised, non-episodic, autonomous reinforcement learning and much more.

ai stanford sharma autonomous unsupervised reinforcement learning google brain yoshua bengio chelsea finn

Episode 21: Chelsea Finn, Stanford, on the biggest bottlenecks in robotics and reinforcement learning

Play Episode Listen Later Nov 3, 2022 40:07

Chelsea Finn is an Assistant Professor at Stanford and part of the Google Brain team. She's interested in the capability of robots and other agents to develop broadly intelligent behavior through learning and interaction at scale. In this episode, we chat about some of the biggest bottlenecks in RL and robotics—including distribution shifts, Sim2Real, and sample efficiency—as well as what makes a great researcher, why she aspires to build a robot that can make cereal, and much more.

stanford assistant professor robotics rl bottlenecks reinforcement learning google brain chelsea finn

Episode 20: Hattie Zhou, Mila, on supermasks, iterative learning, and fortuitous forgetting

Play Episode Listen Later Oct 14, 2022 107:28

Hattie Zhou is a Ph.D. student at Mila working with Hugo Larochelle and Aaron Courville. Her research focuses on understanding how and why neural networks work, starting with deconstructing why lottery tickets work and most recently exploring how forgetting may be fundamental to learning. Prior to Mila, she was a data scientist at Uber and did research with Uber AI Labs. In this episode, we chat about supermasks and sparsity, coherent gradients, iterative learning, fortuitous forgetting, and much more.

learning uber forgetting zhou iterative fortuitous

Episode 19: Minqi Jiang, UCL, on environment and curriculum design for general RL agents

Play Episode Listen Later Jul 19, 2022 114:32

Minqi Jiang is a Ph.D. student at UCL and FAIR, advised by Tim Rocktäschel and Edward Grefenstette. Minqi is interested in how simulators can enable AI agents to learn useful behaviors that generalize to new settings. He is especially focused on problems at the intersection of generalization, human-AI coordination, and open-ended systems. In this episode, we chat about environment and curriculum design for reinforcement learning, model-based RL, emergent communication, open-endedness, and artificial life.

ai environment ucl rl jiang curriculum design

Episode 18: Oleh Rybkin, UPenn, on exploration and planning with world models

Play Episode Listen Later Jul 11, 2022 121:21

Oleh Rybkin is a Ph.D. student at the University of Pennsylvania and a student researcher at Google. He is advised by Kostas Daniilidis and Sergey Levine. Oleh's research focus is on reinforcement learning, particularly unsupervised and model-based RL in the visual domain. In this episode, we discuss agents that explore and plan (and do yoga), how to learn world models from video, what's missing from current RL research, and much more!

university google planning pennsylvania models exploration oleh upenn rl

Episode 17: Andrew Lampinen, DeepMind, on symbolic behavior, mental time travel, and insights from psychology

Play Episode Listen Later Feb 28, 2022 119:34

Andrew Lampinen is a Research Scientist at DeepMind. He previously completed his Ph.D. in cognitive psychology at Stanford. In this episode, we discuss generalization and transfer learning, how to think about language and symbols, what AI can learn from psychology (and vice versa), mental time travel, and the need for more human-like tasks. [Podcast errata: Susan Goldin-Meadow accidentally referred to as Susan Gelman @00:30:34]

ai mental psychology behavior stanford time travel symbolic research scientist deepmind

Episode 16: Yilun Du, MIT, on energy-based models, implicit functions, and modularity

Play Episode Listen Later Dec 21, 2021 85:24

Yilun Du is a graduate student at MIT advised by Professors Leslie Kaelbling, Tomas Lozano-Perez, and Josh Tenenbaum. He's interested in building robots that can understand the world like humans and construct world representations that enable task planning over long horizons.

energy mit models functions implicit modularity

Episode 15: Martín Arjovsky, INRIA, on benchmarks for robustness and geometric information theory

Play Episode Listen Later Oct 15, 2021 86:41

Martín Arjovsky did his Ph.D. at NYU with Leon Bottou. Some of his well-known works include the Wasserstein GAN and a paradigm called Invariant Risk Minimization. In this episode, we discuss out-of-distribution generalization, geometric information theory, and the importance of good benchmarks.

mart nyu benchmarks geometric information theory robustness inria

Episode 14: Yash Sharma, MPI-IS, on generalizability, causality, and disentanglement

Play Episode Listen Later Sep 24, 2021 87:01

Yash Sharma is a Ph.D. student at the International Max Planck Research School for Intelligent Systems. He previously studied electrical engineering at Cooper Union and has spent time at Borealis AI and IBM Research. Yash's early work was on adversarial examples and his current research interests span a variety of topics in representation disentanglement. In this episode, we discuss robustness to adversarial examples, causality vs. correlation in data, and how to make deep learning models generalize better.

sharma causality yash cooper union intelligent systems ibm research generalizability

Episode 13: Jonathan Frankle, MIT, on the lottery ticket hypothesis and the science of deep learning

Play Episode Listen Later Sep 10, 2021 81:05

Jonathan Frankle (Google Scholar) (Website) is finishing his PhD at MIT, advised by Michael Carbin. His main research interest is using experimental methods to understand the behavior of neural networks. His current work focuses on finding sparse, trainable neural networks. **Highlights from our conversation:**

science phd mit ml deep learning hypothesis gpus lottery tickets

Episode 12: Jacob Steinhardt, UC Berkeley, on machine learning safety, alignment and measurement

Play Episode Listen Later Jun 18, 2021 60:07

Jacob Steinhardt (Google Scholar) (Website) is an assistant professor at UC Berkeley. His main research interest is in designing machine learning systems that are reliable and aligned with human values. Some of his specific research directions include robustness, rewards specification and reward hacking, as well as scalable alignment. Highlights:

ai safety alignment machine learning uc berkeley measurement steinhardt

Episode 11: Vincent Sitzmann, MIT, on neural scene representations for computer vision and more general AI

Play Episode Listen Later May 20, 2021 70:46

Vincent Sitzmann (Google Scholar) (Website) is a postdoc at MIT. His work is on neural scene representations in computer vision. Ultimately, he wants to make representations that AI agents can use to solve the same visual tasks humans solve regularly, but that are currently impossible for AI. **Highlights from our conversation:**

vision ai mit 3d neural computer vision representations

Episode 10: Dylan Hadfield-Menell, UC Berkeley/MIT, on the value alignment problem in AI

Play Episode Listen Later May 12, 2021 92:05

Dylan Hadfield-Menell (Google Scholar) (Website) recently finished his PhD at UC Berkeley and is starting as an assistant professor at MIT. He works on the problem of designing AI algorithms that pursue the intended goal of their users, designers, and society in general. This is known as the value alignment problem. Highlights from our conversation:

ai phd mit consequences alignment uc berkeley hadfield

Episode 9: Drew Linsley, Brown, on inductive biases for vision and generalization

Play Episode Listen Later Apr 2, 2021 72:22

Drew Linsley (Google Scholar) (Website) is a Paul J. Salem senior research associate at Brown, advised by Thomas Serre. He is working on building computational models of the visual system that serve the dual purpose of (1) explaining biological function and (2) extending artificial vision. Prior to his work in the Serre lab, he completed a PhD in computational neuroscience at Boston College and a BA in Psychology at Hamilton College. His most recent paper at NeurIPS is Stable and expressive recurrent vision models. It presents an alternative to back-propagation through time (BPTT) for recurrent vision models called "contractor recurrent back-propagation" (C-RBP), which has O(1) complexity for an N step model vs. O(N) for BPTT, and which learns long-range spatial dependencies in cases where BPTT cannot. Drew is also organizing an ICLR 2021 workshop named Generalization Beyond the Training Distribution in Brains and Machines on Friday, May 7th, 2021. Find them on the website and @ICLR_brains. Lastly, Drew is looking to work with collaborators in robotics, so feel free to reach out! Highlights from our conversation:

vision phd building psychology salem machines brains boston college stable biases generalization hamilton college serre paul j inductive neurips iclr

Episode 08: Giancarlo Kerg, Mila, on approaching deep learning from mathematical foundations

Play Episode Listen Later Mar 27, 2021 69:50

Giancarlo Kerg (Google Scholar) is a PhD student at Mila, supervised by Yoshua Bengio and Guillaume Lajoie. He is working on out-of-distribution generalization and modularity in memory-augmented neural networks. Prior to his PhD, he studied pure mathematics at Cambridge and Université Libre de Bruxelles. His most recent paper at NeurIPS is Untangling tradeoffs between recurrence and self-attention in neural networks. It presents a proof for how self-attention mitigates the gradient vanishing problem when trying to capture long-term dependencies. Building on this, it proposes a way to scalably use sparse self-attention with recurrence, via a relevancy screening mechanism that mirrors the cognitive process of memory consolidation. Highlights from our conversation:

phd building pure cambridge foundations bruxelles deep learning untangling giancarlo mathematical yoshua bengio universit libre neurips rnns

Episode 07: Yujia Huang, Caltech, on neuro-inspired generative models

Play Episode Listen Later Mar 18, 2021 65:42

Yujia Huang (@YujiaHuangC) is a PhD student at Caltech, working at the intersection of deep learning and neuroscience. She worked on optics and biophotonics before venturing into machine learning. Now, she hopes to design “less artificial” artificial intelligence. Her most recent paper at NeurIPS is Neural Networks with Recurrent Generative Feedback, introducing Convolutional Neural Networks with Feedback (CNN-F). Yujia is open to working with collaborators from many areas: neuroscience, signal processing, and control experts — in addition to those interested in generative models for classification. Feel free to reach out to her! Highlights from our conversation:

ai phd tips adapting neuro huang turing caltech neural networks neurips generative models convolutional neural networks

Episode 06: Julian Chibane, MPI-INF, on 3D reconstruction using implicit functions

Play Episode Listen Later Mar 5, 2021 49:45

Our next guest, Julian Chibane, is a PhD student at the Real Virtual Humans group at the Max Planck Institute for Informatics in Germany. His recent work centers around intrinsic functions for 3D reconstruction, and his most recent paper at NeurIPS is Neural Unsigned Distance Fields for Implicit Function Learning. He also introduced Implicit Feature Networks (IF-Nets) in Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion. Highlights

germany phd navigating 3d reconstruction completion functions implicit informatics max planck institute highlights how neurips

Episode 05: Katja Schwarz, MPI-IS, on GANs, implicit functions, and 3D scene understanding

Play Episode Listen Later Feb 24, 2021 51:06

Katja Schwartz came to machine learning from physics, and is now working on 3D geometric scene understanding at the Max Planck Institute for Intelligent Systems. Her most recent work, “Generative Radiance Fields for 3D-Aware Image Synthesis,” revealed that radiance fields are a powerful representation for generative image synthesis, leading to 3D consistent models that render with high fidelity. We discuss the ideas in Katja’s work and more:

3d schwarz functions katja gan implicit max planck institute gans intelligent systems

Episode 04: Joel Lehman, OpenAI, on evolution, open-endedness, and reinforcement learning

Play Episode Listen Later Feb 17, 2021 78:25

Joel Lehman was previously a founding member at Uber AI Labs and assistant professor at the IT University of Copenhagen. He's now a research scientist at OpenAI, where he focuses on open-endedness, reinforcement learning, and AI safety. Joel’s PhD dissertation introduced the novelty search algorithm. That work inspired him to write the popular science book, “Why Greatness Cannot Be Planned”, with his PhD advisor Ken Stanley, which discusses what evolutionary algorithms imply for how individuals and society should think about objectives. We discuss this and much more: - How discovering novelty search totally changed Joel’s philosophy of life - Sometimes, can you reach your objective more quickly by not trying to reach it? - How one might evolve intelligence - Why reinforcement learning is a natural framework for open-endedness

ai phd evolution copenhagen openai lehman reinforcement learning it university ken stanley

Episode 03: Cinjon Resnick, NYU, on activity and scene understanding

Play Episode Listen Later Feb 1, 2021 60:09

Cinjon Resnick was formerly from Google Brain and now doing his PhD at NYU. We talk about why he believes scene understanding is critical to out of distribution generalization, and how his theses have evolved since he started his PhD. Some topics we over: How Cinjon started his research by trying to grow a baby through language and games, before running into a wall with this approach How spending time at circuses

phd nyu activity resnick google brain

Episode 02: Sarah Jane Hong, Latent Space, on neural rendering & research process

Play Episode Listen Later Jan 7, 2021 35:56

Sarah Jane Hong is the co-founder of Latent Space, a startup building the first fully AI-rendered 3D engine in order to democratize creativity. We touch on what it was like taking classes under Geoff Hinton in 2013, the trouble with using natural language prompts to render a scene, why a model’s ability to scale is more important than getting state-of-the-art results, and more.

ai space 3d hong neural sarah jane rendering latent research process geoff hinton

Episode 01: Kelvin Guu, Google AI, on language models & overlooked research problems

Play Episode Listen Later Dec 15, 2020 47:47

We interview Kelvin Guu, a researcher at Google AI and the creator of REALM. The conversation is a wide-ranging tour of language models, how computers interact with world knowledge, and much more.

research language models realm overlooked google ai

Claim Generally Intelligent

In order to claim this podcast we'll send an email to with a verification link. Simply click the link and you will be able to edit tags, request a refresh, and other features to take control of your podcast page!

Claim Cancel