POPULARITY
Abhishek Naik was a student at University of Alberta and Alberta Machine Intelligence Institute, and he just finished his PhD in reinforcement learning, working with Rich Sutton. Now he is a postdoc fellow at the National Research Council of Canada, where he does AI research on Space applications. Featured References Reinforcement Learning for Continuing Problems Using Average Reward Abhishek Naik Ph.D. dissertation 2024 Reward Centering Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton 2024 Learning and Planning in Average-Reward Markov Decision Processes Yi Wan, Abhishek Naik, Richard S. Sutton 2020 Discounted Reinforcement Learning Is Not an Optimization Problem Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton 2019 Additional References Explaining dopamine through prediction errors and beyond, Gershman et al 2024 (proposes Differential-TD-like learning mechanism in the brain around Box 4)
Erin Talvitie of Harvey Mudd College spoke with us about machine learning, hallucinating data, and making good decisions based on imperfect predictions. Paper we discussed: Self-Correcting Models for Model-Based Reinforcement Learning Erin’s grant: Using Imperfect Predictions to Make Good Decisions For a reinforcement learning book, Erin suggests Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto or the lecture series by David Silver. For a machine learning book, Elecia likes Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems by Aurélien Géron
Hadelin first spoke about his entrepreneurship mindset. This was already in his mind early on, and never left him. We spoke about his interest for math, science and technology and how it drove him to one of the best French engineering schools. We talked about him discovering Data Science and how he changed his major at the very last minute. We then talked about his professional life. How he got to work at google, and why he didn't reconduct his contract and chose to create online courses instead. We spoke about his further business ventures, all the way to BlueLife, his current company. We finished talking about AI as a whole and finding your purpose to do good in life.Hadelin is the co-founder and CEO of BlueLife AI which leverages AI for optimizing processes, maximizing efficiency and increasing profitability. Hadelin is also an online entrepreneur, who has created educational e-courses about Machine Learning, Deep Learning, AI and Blockchain, which have reached over half a million customers worldwide.Here are the links of the show:LinkedIn https://www.linkedin.com/in/hadelin-de-ponteves-1425ba5bTwitter https://www.twitter.com/hadelin2pCourse https://www.udemy.com/course/machinelearningBook "AI Crash Course" https://www.amazon.com/Crash-Course-hands-introduction-reinforcement/dp/1838645357?Conference https://www.datasciencego.comReinforcement Learning by Francis Bach, Richard S. Sutton & Andrew G. Barto https://amzn.to/2TczFk5Data Science from Scratch by from Joel Grus https://amzn.to/2vVsggWData Science for Business by Foster Provost & Tom Fawcett https://amzn.to/2SVMcJPPython Machine Learning by Sebastian Raschka & Vahid Mirjalili https://www.amazon.com/Python-Machine-Learning-scikit-learn-TensorFlow/dp/1789955750?The big leap by Gay Hendricks https://amzn.to/2v9LzDcMillionaire Success Habits by Dean Graziozi https://amzn.to/2vYdyWqCreditsMusic Aye by Yung Kartz is licensed CC BY-NC-ND 4.0.Your hostSoftware Developer‘s Journey is hosted and produced by Timothée (Tim) Bourguignon, a crazy frenchman living in Germany who dedicated his life to helping others learn & grow. More about him at timbourguignon.fr.Gift the podcast a ratingPlease do me and your fellow listeners a favor by spreading the good word about this podcast. And please leave a rating (excellent of course) on the major podcasting platforms, this is the best way to increase the visibility of the podcast:Apple PodcastsStitcherGoogle PlayPatreonFinally, if you want to help produce the podcast, support me on
AI researchers around the world are trying to create a general purpose learning system that can learn to solve a broad range of problems without being taught how. Koray Kavukcuoglu, DeepMind’s Director of Research, describes the journey to get there, and takes Hannah on a whistle-stop tour of DeepMind’s HQ and its research. If you have a question or feedback on the series, message us on Twitter (@DeepMindAI using the hashtag #DMpodcast) or emailing us at podcast@deepmind.com. Further reading: OpenAI: An overview of neural networks and the progress that has been made in AI Shane Legg, DeepMind co-founder: Measuring machine intelligence at the 2010 Singularity Summit Shane Legg and Marcus Hutter: Paper on defining machine intelligence Demis Hassabis: Talk on the history, frontiers and capabilities of AI Robert Wiblin: Positively shaping the development of artificial intelligence Asilomar AI Principles Richard S. Sutton and Andrew G. Barto: Reinforcement Learning: An Introduction Interviewees: Koray Kavukcuoglu, Director of Research; Trevor Back, Product Manager for DeepMind’s science research; research scientists Raia Hadsell and Murray Shanahan; and DeepMind CEO and co-founder, Demis Hassabis. Credits: Presenter: Hannah Fry Editor: David Prest Senior Producer: Louisa Field Producers: Amy Racs, Dan Hardoon Binaural Sound: Lucinda Mason-Brown Music composition: Eleni Shaw (with help from Sander Dieleman and WaveNet) Commissioned by DeepMind