Mixture of Experts

Welcome to Mixture of Experts, your weekly deep dive into the ever-evolving landscape of artificial intelligence—bringing you insightful discussions on the latest AI trends, innovations, and their impact on business. From breakthrough research to practical applications, each episode offers a balanced blend of expertise and analysis. Explore how AI is reshaping industries, driving efficiency, and unlocking new opportunities for growth. Whether you're a seasoned professional seeking to stay ahead of the curve or an enthusiast curious about the future of technology, Mixture of Experts delivers the perfect mix of insights and practical knowledge. Tune in and stay informed as we navigate the dynamic intersection of AI and business.

IBM


    • May 16, 2025 LATEST EPISODE
    • weekly NEW EPISODES
    • 38m AVG DURATION
    • 57 EPISODES


    Latest episodes from Mixture of Experts

    Mistral Medium 3, OpenAI HealthBench and AI chips to Saudi Arabia

    May 16, 2025 37:02


    Can Mistral make Europe a global AI contender? In episode 55 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Volkmar Uhlig and Kaoutar El Maghraoui to discuss the drop of Mistral Medium 3. Next, we analyze the AI chip sales both NVIDIA and AMD made to Saudi Arabia. Then, with IBM's new ITBench and OpenAI's HealthBench, we dive deeper into benchmarks for AI evaluation. Tune in to this week's Mixture of Experts for more! 00:01 – Intro 00:47 -- Mistral Medium 3 12:26 -- AI chips to Saudi Arabia 21:21 -- AI evaluation benchmarks 31:47 -- Amazon's AI-generated pause ads The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    IBM Think 2025, OpenAI Windsurf acquisition, reasoning models and hallucinations

    May 9, 2025 28:23


    Has AI hallucination gotten out of control? In episode 54 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Skyler Speakman and Kaoutar El Maghraoui to analyze reasoning models and rising hallucinations. Next, as IBM Think 2025 wraps, the experts unpack the biggest highlights from IBM's biggest show of the year: new AI agents, Ferraris and ... penguins? Then, OpenAI is making moves with its acquisition of Windsurf. What does this mean? Tune in to this week's Mixture of Experts for more! 00:01 – Intro 01:12 – IBM Think 2025 09:27 – Reasoning models and hallucinations 19:23 – OpenAI Windsurf acquisition The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    LlamaCon, Qwen3, DeepSeek-R2 rumors and JP Morgan's open letter on AI

    May 2, 2025 39:54


    We are celebrating the MoE podcast's one-year anniversary! In episode 53 of Mixture of Experts, host Tim Hwang is joined by the O.G. panel of experts from our pilot: Chris Hay, Shobhit Varshney and Kush Varshney. This week, we cover some exciting announcements at LlamaCon. Then, we discuss some new Chinese AI models from Qwen3 to the rumored DeepSeek-R2. Next, J.P. Morgan's CISO, Patrick Opet, released “An open letter to our third-party suppliers,” covering the need for AI security. Are we doomed? Finally, we look back at some of the topics we discussed in episode 1: the Rabbit AI device, GPT-2 chatbot and Apple Intelligence. After all that, who was the first person to say “agents” on the podcast? Tune in to find out, on today's one-year celebration of Mixture of Experts. 00:00 -- Intro 00:38 -- LlamaCon 10:34 -- Qwen3 and DeepSeek-R2 23:23 -- J.P. Morgan's open letter 39:45 -- One year of MoE The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    OpenAI social network, Anthropic's reasoning study and humanoid half-marathon

    Apr 25, 2025 38:25


    Is OpenAI going to enter the social media game? In episode 52 of Mixture of Experts, host Tim Hwang is joined by Gabe Goodhart, Kate Soule and Marina Danilevsky. First, Sam Altman is rumored to be testing an internal prototype social network; why is this a potential next move for the AI giant? Next, for our paper of the week, we analyze Anthropic's study on chain-of-thought reasoning, “Reasoning Models Don't Always Say What They Think.” Then, AI scraping puts a strain on Wikimedia; what's the impact of this? Finally, China held a humanoid robot half-marathon, where humans raced alongside robot competitors. Who wins this AI race? All that and more on today's Mixture of Experts. 00:41 -- OpenAI social network 10:02 -- Anthropic's reasoning study 20:56 -- AI bots strain Wikimedia 31:33 -- Humanoid half-marathon The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    o3 and o4-mini, Google Gemini on-prem and NVIDIA's U.S. chip manufacturing

    Apr 18, 2025 42:15


    OpenAI just dropped o3 and o4-mini! In episode 51 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Vyoma Gajjar and special guest John Willis, Owner of Botchagalupe Technologies. Today, we analyze Sam Altman's new AI models, o3 and o4-mini. Next, Google announced that by Q3 you can run Gemini on-prem; what does this mean for enterprise AI adoption? Then, John is on the show today to take us through AI evaluation tools and why we need them. Finally, NVIDIA is planning to move AI chip manufacturing to the U.S. Can they pull this off? All that and more on today's Mixture of Experts. 00:01 – Intro 00:56 – OpenAI o3 and o4-mini 14:57 – Google Gemini on-prem 23:43 – AI evaluation tools 34:59 – NVIDIA's U.S. chip manufacturing The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    AI on IBM z17, Meta's Llama 4 and Google Cloud Next 2025

    Apr 11, 2025 37:33


    IBM z17 is here! In episode 50 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Shobhit Varshney and Hillery Hunter to debrief the launch of a new mainframe with robust AI infrastructure. Next, Meta dropped Llama 4 over the weekend; how's it going? Then, Shobhit is recording live from Google Cloud Next in Las Vegas, along with Gemini 2.5 Pro. What are some of the most exciting announcements? Finally, the Pew Research Center released research on the perception of AI; how does this impact the industry? All that and more on today's 50th Mixture of Experts. 00:01 -- Intro 00:55 -- IBM z17 11:42 -- Llama 4 25:02 -- Google Cloud Next 2025 34:29 -- Pew's research on perception of AI The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Explore the new features of IBM z17: https://www.ibm.com/products/z17 Read the Pew Research: https://www.pewresearch.org/internet/2025/04/03/how-the-us-public-and-ai-experts-view-artificial-intelligence/ Subscribe for AI updates: https://ibm.biz/Think_newsletter Visit Mixture of Experts podcast page to learn more AI content: https://www.ibm.com/think/podcasts/mixture-of-experts

    OpenAI goes open, Anthropic on interpretability, Apple Intelligence updates and Amazon AI agents

    Apr 4, 2025 43:25


    Will OpenAI be fully open source by 2027? In episode 49 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Ash Minhas and Chris Hay to analyze Sam Altman's latest move towards open source. Next, we explore Anthropic's mechanistic interpretability results and the progress the AI research community is making. Then, can Apple catch up? We analyze the latest critiques on Apple Intelligence. Finally, Amazon enters the chat with AI agents. How does this elevate the competition? All that and more on today's Mixture of Experts. 00:01 -- Introduction 00:48 -- OpenAI goes open 11:36 -- Anthropic interpretability results 24:55 -- Daring Fireball on Apple Intelligence 34:22 -- Amazon's AI agents The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe for AI updates: https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to learn more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts

    DeepSeek-V3-0324, Gemini Canvas and GPT-4o image generation

    Mar 28, 2025 41:43


    What's the best open-source model? In episode 48 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Kush Varshney and Skyler Speakman to explore the future of open-source AI models. First, we chat about the release of DeepSeek-V3-0324. Then, more announcements coming out of Google including Gemini Canvas and Gemini 2.5. Next, Extropic has entered the chat with a thermodynamic chip. Finally, AI image generation is on the rise as OpenAI released GPT-4o image generation. All that and more on today's Mixture of Experts. 00:01 – Intro 00:42 – DeepSeek-V3-0324 09:48 – Gemini 2.5 and Canvas 21:27 – Extropic's thermodynamic chip 30:20 – OpenAI image generation The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    NVIDIA GTC, Baidu reasoning models, and Gemini AI image generation

    Mar 21, 2025 39:16


    What's the most exciting announcement coming out of NVIDIA GTC? In episode 47 of Mixture of Experts, host Tim Hwang is joined by Nathalie Baracaldo, Kaoutar El Maghraoui and Vyoma Gajjar. First, we dive into the latest announcements from NVIDIA GTC, including the Groot N1 model for humanoid robotics. Next, Baidu released some new AI reasoning models, and they're not open source? Then, for our paper of the week we discuss the flaws of Chain-of-Thought reasoning. Finally, Gemini Flash 2.0 has released image generation models for developer experimentation. Is Google catching up in the AI game? Tune in to today's Mixture of Experts to find out! 00:01 – Intro 01:27 – NVIDIA GTC 14:18 – New Baidu AI models 21:19 – Chain-of-Thought reasoning 32:18 – Gemini image generation The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 46: Manus, vibe coding, scaling laws and Perplexity's AI phone

    Mar 14, 2025 50:11


    Is Manus a second DeepSeek moment? In episode 46 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Kaoutar El Maghraoui and Vyoma Gajjar to talk Manus! Next, the rise of vibe coding: what started as a joke has now become a thing? Then, we dive deep into the future of scaling laws. Finally, Perplexity is teaming up with Deutsche Telekom to release an AI phone. What's the motivation here? Tune in to today's Mixture of Experts to find out more! 00:01 – Intro 00:37 – Manus 14:09 – Vibe coding 30:13 – Scaling laws 39:07 – Perplexity's AI phone The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 45: Quantum leap, Model Context Protocol, CoreWeave IPO and an AI voice companion

    Mar 7, 2025 45:25


    When can we expect quantum to reach consumer devices? In episode 45 of Mixture of Experts, host Tim Hwang is joined by special guest, Blake Johnson, to debrief the quantum noise in the news. Blake helps us understand the intersection between quantum and AI and how far we are from this technology. Then, veteran experts Chris Hay and Volkmar Uhlig hash out some other news in AI this week. We cover Anthropic's Model Context Protocol, CoreWeave filing for an IPO and Sesame AI's new voice companion. All that and more on today's Mixture of Experts! 00:01 – Intro 01:06 – Quantum leap 20:08 -- Model Context Protocol 28:24 -- CoreWeave IPO 40:12 -- Sesame AI voice companion The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Bonus: OpenAI GPT-4.5: And the future of pre-training is...

    Mar 1, 2025 24:07


    Is pre-training dead? In this bonus episode of Mixture of Experts, guest host Bryan Casey is joined by Kate Soule and Chris Hay. On Thursday, Sam Altman dropped GPT-4.5 just after we wrapped our weekly recording. We got a few of our veteran experts on the podcast to analyze OpenAI's largest and “best” chat model yet. What's the hype? Tune in to this bonus episode to find out! 00:01 – Intro 00:25 – GPT-4.5 The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 44: Claude 3.7 Sonnet, BeeAI agents, Granite 3.2, and emergent misalignment

    Feb 28, 2025 39:45


    Granite 3.2 is officially here! In episode 44 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Maya Murad and Kaoutar El Maghraoui to debrief a few big AI announcements. Last week we covered small vision-language models (VLMs), and this week Granite 3.2 dropped with new VLMs, enhanced reasoning capabilities, and more! Kate takes us under the hood to understand the new features and how they were created. Next, Anthropic dropped a new intelligence model, Claude 3.7 Sonnet, and a new agentic coding tool, Claude Code. Why did Anthropic release these separately? Then, as we cannot have an episode without covering agents, Maya takes us through the new BeeAI agents! Finally, can fine-tuning on a malicious task lead to much broader misalignment? Our experts analyze a new paper released on “Emergent misalignment.” All that and more on this week's episode! 00:01 – Intro 00:41 – Claude 3.7 Sonnet 11:58 – BeeAI agents 20:11 – Granite 3.2 29:23 – Emergent misalignment The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 43: Deep Research, OpenAI inference chip, small VLMs, and AI agent job posting

    Feb 21, 2025 45:51


    What is all the hype around Deep Research? In episode 43 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Volkmar Uhlig and Shobhit Varshney. This week, we discuss reasoning model features such as OpenAI's Deep Research, along with releases from Google Gemini, Perplexity, xAI's Grok-3 and more! Next, OpenAI is rumored to release an inference chip, but how likely is this to be a success in the AI chip game? Then, we analyze the capabilities of small vision-language models (VLMs). Finally, a startup, Firecrawl, released a job posting in search of an AI agent. Is this the future for AI tools in the workforce? Tune in to today's Mixture of Experts to find out. 00:01 – Intro 00:35 – Deep Research 11:58 – OpenAI inference chip 22:17 – Small VLMs 32:31 – AI agent job posting The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 42: Paris AI Summit, Altman's "Three Observations," and Anthropic's Economic Index

    Feb 14, 2025 39:56


    Live from Paris, Tim Hwang is at the AI Action Summit 2025. In episode 42 of Mixture of Experts, we welcome Anastasia Stasenko, CEO and Co-Founder of pleias, along with our veteran experts Marina Danilevsky and Chris Hay. Last week, we touched on some potential conversations at the Paris AI Summit; this week we recap what actually happened. Is AI safety improving globally? Next, for our paper of the week, we break down s1: Simple test-time scaling. Then, Sam Altman is back with another blog, “Three Observations.” What do our experts have to say? Finally, what can we learn from Anthropic's Economic Index? All that and more on today's Mixture of Experts. 00:01 – Intro 00:42 – Paris AI Summit 11:10 – s1: Simple test-time scaling 19:32 – Sam Altman's “Three Observations” 30:41 – Anthropic's Economic Index The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Resources: Read the paper about s1: Simple test-time scaling: https://arxiv.org/abs/2501.19393 Read Sam Altman's "Three Observations": https://blog.samaltman.com/three-observations Read Anthropic's Economic Index: https://www.anthropic.com/economic-index Read more about AGI: https://www.ibm.com/think/topics/artificial-general-intelligence

    Episode 41: OpenAI deep research, o3-mini, AI Action Summit, and Anthropic's Constitutional Classifiers

    Feb 7, 2025 38:08


    What does Sam Altman have up his sleeve? In episode 41 of Mixture of Experts, join host Tim Hwang along with experts Nathalie Baracaldo, Marina Danilevsky and Chris Hay. Last week, we covered all things DeepSeek, and this week OpenAI has some new releases to share. Today, the experts dissect deep research and o3-mini. Next, our host Tim Hwang is travelling to the AI Action Summit, and he asks our experts what we can expect coming out of the event. Then, we talk about Anthropic's Constitutional Classifiers. Finally, Microsoft is creating a unit to study AI's impact; what does this mean? Find out all this and more on Mixture of Experts. 00:01 – Intro 00:41 – OpenAI deep research and o3-mini 13:51 – AI Action Summit 20:17 – Anthropic's Constitutional Classifiers 28:54 – Microsoft AI Impact team The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 40: DeepSeek facts vs hype, model distillation, and open source competition

    Jan 31, 2025 39:17


    Let's bust some early myths about DeepSeek. In episode 40 of Mixture of Experts, join host Tim Hwang along with experts Aaron Baughman, Chris Hay and Kate Soule. Last week, we covered the release of DeepSeek-R1; now that the entire world is up to speed, let's separate the facts from the hype. Next, what is model distillation and why does it matter for competition in AI? Finally, Sam Altman, among other tech CEOs, shared his response to DeepSeek. Will R1 radically change the open-source strategy of other tech giants? Find out all this and more on Mixture of Experts. 00:01 – Intro 00:41 – DeepSeek facts vs hype 21:00 – Model distillation 31:21 – Open source and OpenAI The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 39: DeepSeek-R1, Mistral IPO, FrontierMath controversy, and IDC code assistant report

    Jan 24, 2025 39:45


    What does the future hold for DeepSeek? In episode 39 of Mixture of Experts, join host Tim Hwang along with experts Abraham Daniels, Kaoutar El Maghraoui and Skyler Speakman to discuss the release of DeepSeek-R1. Next, Mistral indicates it is heading for an IPO. Then, FrontierMath's new benchmark is particularly difficult; the experts debrief. Finally, IDC released a report on code assistants; what do we need to know about generalist and specialized coding assistants? Tune in to this week's episode to find out. 00:01 – Intro 01:08 – DeepSeek-R1 14:08 – Mistral indicates IPO 20:54 – FrontierMath controversy 30:04 – IDC code assistants report The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 38: Anthropic valuation rumors, Microsoft CoreAI, NotebookLM upgrades, and AI agents in finance

    Jan 17, 2025 44:49


    What would you do with $2 billion? In episode 38 of Mixture of Experts, join host Tim Hwang along with experts Chris Hay, Kaoutar El Maghraoui and Vyoma Gajjar to discuss the Anthropic valuation rumors. Next, Microsoft CEO Nadella created a new CoreAI group to build and run apps for customers. Then, NotebookLM upgraded some of its features, including podcast intervention. Finally, AI agents are making their way into the financial services industry. Can an agent invest all of your money? Tune in to this week's episode to find out. 00:01 -- What would you do with $2 billion? 00:51 -- Anthropic valuation 12:14 -- Microsoft CoreAI 25:01 -- NotebookLM upgrades 35:17 -- AI agents in finance The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 37: CES 2025, NVIDIA DIGITS, Apple Intelligence fails, and Sam Altman's reflections

    Jan 10, 2025 35:38


    What's the most exciting CES AI announcement? In episode 37 of Mixture of Experts, host Tim Hwang is joined by Skyler Speakman, Volkmar Uhlig and Shobhit Varshney to debrief CES 2025. Specifically, the experts dive into NVIDIA's Project DIGITS, among other announcements from the AI hardware giant. Next, a new enterprise AI development survey came out detailing how developers really feel about AI implementation. Then, Apple Intelligence experienced some major hallucination fails; what does this tell us about Apple's stake in the AI game? Finally, Sam Altman of OpenAI released a reflection blog; what does he say about the future of AI? All that and more on today's Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 36: OpenAI o3, DeepSeek-V3, and the Brundage/Marcus AI bet

    Jan 3, 2025 39:19


    Is deep learning hitting a wall? It's 2025 and Mixture of Experts is back and better than ever. In episode 36, host Tim Hwang is joined by Chris Hay, Kate Soule and Kush Varshney to debrief one of the biggest releases of 2024, OpenAI o3. Next, DeepSeek-V3 is here! Finally, will AI exist in 2027? The experts dissect the AI bet between Miles Brundage and Gary Marcus. All that and more on the first Mixture of Experts of 2025. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 00:00 – Intro 00:49 – OpenAI o3 14:40 – DeepSeek-V3 28:00 – The Brundage/Marcus bet

    Episode 35: 2024 Rewind: Breakthroughs in AI models, agents, hardware and products

    Dec 27, 2024 61:43


    Will 2025 be the year of AI agents? In Episode 35 of Mixture of Experts, host Tim Hwang is joined by some show veterans to debrief 2024 in AI. This week, we review AI models, agents, hardware and product releases with some of the top industry experts. What was the best model of 2024? Is NVIDIA king? What are some of the AI trends in 2025? All that and more on this special edition of Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 34: Granite 3.1, NVIDIA Jetson, stealing AI models, and is pre-training over?

    Dec 20, 2024 40:30


    Is pre-training a thing of the past? In Episode 34 of Mixture of Experts, host Tim Hwang is joined by Abraham Daniels, Vagner Santana and Volkmar Uhlig to debrief this week in AI. First, OpenAI cofounder Ilya Sutskever said that “peak data” was achieved; does this mean there is no longer a need for model pre-training? Next, IBM released Granite 3.1 with a slew of features, and we cover them all. Then, there is a new way to steal AI models; how do we protect against model exfiltration? Finally, can NVIDIA Jetson for AI developers really increase hardware accessibility? Tune in for more! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 00:01 – Intro 00:49 – Is pre-training over? 10:25 – Granite 3.1 22:23 – AI model stealing 33:38 – NVIDIA Jetson

    Episode 33: 12 Days of OpenAI, NeurIPS, ARC Prize, and Llama 3.3 70B

    Dec 13, 2024 40:50


    Is o1 Pro worth the cost? In Episode 33 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Kate Soule and Vyoma Gajjar. First, the experts debrief the 12 Days of OpenAI. Next, we review some of the top papers at NeurIPS; how are the experts keeping up with all these research papers? Then, we are back with another benchmark: can ARC Prize make AGI more tractable? Finally, Meta announced the launch of Llama 3.3 70B with the promise of 405B performance; can we have our cake and eat it too? Find out more on today's Mixture of Experts! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 32: Inside AWS re:Invent 2024, LLM Flowbreaking, and David Mayer

    Dec 6, 2024 37:55


    What's the mystery behind the name ChatGPT refuses to discuss? In Episode 32 of Mixture of Experts, host Tim Hwang dives into the hottest topics shaping the AI landscape with an all-star panel: Aaron Baughman, Vagner Figueredo de Santana, and Shobhit Varshney. First, they dissect the biggest announcements and takeaways from AWS re:Invent 2024, Amazon's premier AI event. Next, they talk about overcoming architectural vulnerabilities in AI systems, and finally, they uncover the curious case of a name ChatGPT won't discuss, and the questions this raises about privacy and transparency in AI. Get ready for an episode packed with insights, debates, and forward-thinking perspectives! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 31: AI in education: Safety, literacy, and predictions

    Nov 27, 2024 36:31


    How much future learning will be done with an AI assistant? In Episode 31 of Mixture of Experts, host Tim Hwang is joined by Phaedra Boinodiris, Marina Danilevsky and Skyler Speakman for the AI in education special episode. First, the experts give an update on the state of AI within education. Next, we cover concerns around AI safety and literacy; what do students and teachers need to be aware of? Finally, the panel gives their predictions on what the future of education holds as it relates to AI. Tune in to this special episode for an in-depth analysis! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 30: “Near-infinite memory,” Microsoft Ignite, FrontierMath, and AlphaFold3

    Nov 22, 2024 43:12


    Should your AI assistant remember everything about you? In Episode 30 of Mixture of Experts, host Tim Hwang is joined by Vagner Santana, Vyoma Gajjar and Shobhit Varshney. First, the experts break down claims of “near-infinite memory” within AI models. Next, Shobhit is fresh off the plane from Microsoft Ignite, and he shares some of the exciting new announcements following the event. Then, a new benchmark has entered the chat; what do we know about FrontierMath? Finally, AlphaFold3 is now more open; why does this matter? Find out more on today's episode! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 29: Scaling AI, agent-led future, and race to AGI

    Nov 15, 2024 39:11


    Is 2024 the year scaling AI officially breaks? In Episode 29 of Mixture of Experts, host Tim Hwang is joined by Anthony Annunziata, Kate Soule and Naveen Rao. First, the experts discuss whether we are living in a post-scale world. Next, we can't have an episode without chatting AI agents, but what does the future hold for this technology? Finally, is AGI here to stay? Tune in to this week's Mixture of Experts to find out. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 28: SearchGPT, from Naptime to Big Sleep, and GitHub Octoverse updates

    Nov 8, 2024 39:49


    Could AI wipe out software engineers? In Episode 28 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Kaoutar El Maghraoui, and Shobhit Varshney. First, the experts discuss GitHub reporting a rise of developers driven by AI code assistant tools. Next, Big Sleep finds a vulnerability in SQLite; what is the future for these kinds of AI agents? Finally, OpenAI released SearchGPT; what is the future of AI search? Tune in today to find out! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 27: The future of agents, AI energy consumption, Anthropic's computer use, and Google watermarking AI

    Nov 1, 2024 32:59


    Agents, agents, and more agents! In Episode 27 of Mixture of Experts, host Tim Hwang is joined by Volkmar Uhlig and Vyoma Gajjar. First, the experts chat about Marc Benioff's spicy tweet, and what this means for the future of AI agents. Next, how much energy is needed to power AI models, and should we be concerned? Then, the experts debrief Anthropic's release of computer use. Finally, Google is integrating SynthID-Text into Gemini to help watermark AI-generated text; do we need this feature? Learn more on today's Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 26: Granite 3.0, NVIDIA's Nemotron AI model, and Perplexity's fundraising

    Oct 25, 2024 37:18


    Can chat replace search? In Episode 26 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Kush Varshney and Petros Zerfos for IBM TechXchange week! First, the experts describe how the team created the Granite 3.0 models. Next, NVIDIA enters the open source model game; what does this mean for the competition? Finally, Perplexity AI is seeking over double their valuation in new funding rounds; what does this mean for start-ups? All that and more on today's episode. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 25: Machines of Loving Grace, Entropix, AI and elections, GSM8K

    Oct 18, 2024 41:17


    Can AI solve infectious disease? In Episode 25 of Mixture of Experts, host Tim Hwang is joined by Kaoutar El Maghraoui, Maya Murad, and Ruben Boonen. Today we analyze some papers. First, the experts dissect Machines of Loving Grace, a 15,000-word essay written by Anthropic's CEO making some major AI predictions. Then, Apple generated a new benchmark based on GSM8K in a recent paper, and the findings were intriguing. Next, we talk Entropix, a sampler intending to replicate chain of thought features. Finally, OpenAI disclosed they are seeing an increase in AI models faking articles; what can we do to fix this? All this and more, on today's Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 24: AI in the Nobels, DGX B200 arrival, and Unstructured's $40M funding round

    Play Episode Listen Later Oct 11, 2024 37:19


    Could AI win a Nobel Prize in the future? In Episode 24 of Mixture of Experts, host Tim Hwang is joined by Chris Hay and Edward Calvesbert. First, the experts debrief the 'Godfather of AI' sharing a Nobel Prize. Next, we talk AI platforms and the hype around DGX B200. Finally, unstructured data is becoming usable for LLMs. Why are companies like NVIDIA so interested in this data? Tune in today to find out! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 23: NotebookLM, OpenAI DevDay, and will AI prevent phishing attacks?

    Play Episode Listen Later Oct 4, 2024 39:15


    Will DeepDive replace the Mixture of Experts podcast? In Episode 23, host Tim Hwang is joined by IBM Researchers Marina Danilevsky, Nathalie Baracaldo and Vagner Santana to dissect this week's AI news. First, the experts talk about the hype around Google's NotebookLM, specifically regarding the DeepDive podcast feature. Next, OpenAI DevDay sparks some interesting conversation around vision fine-tuning and multimodality. Finally, it's Cybersecurity Awareness Month and IBM X-Force released the Cloud Threat Landscape Report. Will AI prevent phishing attacks? Tune in to this week's episode to learn more! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 22: Llama 3.2, AI Snake Oil, and gen AI for sustainability

    Play Episode Listen Later Sep 27, 2024 33:51


    Meta releases Llama 3.2! In Episode 22 of Mixture of Experts, host Tim Hwang is joined by Maryam Ashoori, Skyler Speakman, and Shobhit Varshney to debrief an exciting week of AI news. First, Meta is back with the release of Llama 3.2 and its lightweight (1B/3B) models. Next, it's Climate Week NYC, and we chat about the use of gen AI in achieving sustainable development goals, specifically IBM and NASA's AI model for weather and climate. Finally, the book version of “AI Snake Oil” officially dropped and the authors claim they will be wrong in 2.5 years. What do our experts think? Tune in today to find out! The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 21: OpenAI o1 preview, Agentforce, AI in fantasy football, and machine unlearning

    Play Episode Listen Later Sep 20, 2024 47:45


    Strawberry is officially here! In Episode 21 of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Nathalie Baracaldo, and Aaron Baughman to chat about the hype around OpenAI's o1 preview. Additionally, we cover AI agents again, with the launch of Agentforce. Next, Aaron, our resident AI-in-sports expert, analyzes the AI-powered insights for fantasy football. Finally, what is “machine unlearning,” and why should we care? All this and more, on today's episode of Mixture of Experts. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 20: Apple Intelligence, Reflection 70B, open-source AI agents, and LLM research ideas

    Play Episode Listen Later Sep 13, 2024 38:29


    Can Apple Intelligence compete with the AI market offerings? In Episode 20 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Kate Soule and Maya Murad. Today, the experts chat about Apple Intelligence, the performance of Reflection's 70B, and a new paper released on LLMs generating novel research ideas. Additionally, IBM soft launched the Bee Agent Framework to help build agentic workflows with leading open-source and proprietary models. Tune in to hear our expert panel break down this week's AI news. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 19: NEO 1X robot, OpenAI chips, The AI Scientist, and the future of prompt engineering

    Play Episode Listen Later Sep 6, 2024 37:58


    Will prompt engineering ever die? In Episode 19 of Mixture of Experts, host Tim Hwang is joined by Kaoutar El Maghraoui, Kate Soule and Shobhit Varshney. Today, the experts chat about the future of prompt engineering, a new paper released about The AI Scientist, NEO 1X's humanoid robot, and OpenAI's in-house AI chips. Will AI take over scientific discovery? Will everyone have at-home AI assistants? Why is OpenAI investing in chip production? Tune in for our experts' takes. 0:00 - Intro 1:17 - Future of Prompt Engineering 11:18 - NEO 1X Robot 21:56 - AI for Scientific Discovery 31:48 - OpenAI's in-house Chips The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 18: Cursor hype, Perplexity introduces ads, and AI at the US Open

    Play Episode Listen Later Aug 30, 2024 40:29


    Is search less trustworthy? In Episode 18 of Mixture of Experts, host Tim Hwang is joined by the IBM Fellows Aaron Baughman, Kush Varshney, and Trent Gray-Donald. Today, the experts chat about how AI is being integrated at the US Open. Next, Perplexity is introducing ads in Q4. What is the effect on search? Finally, what's all the hype with Cursor? Tune in to today's episode for all this and more. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 17: Agent Q, no AI in art, and AMD acquires ZT Systems

    Play Episode Listen Later Aug 23, 2024 46:56


    What's new with AI agents? In Episode 17 of Mixture of Experts, guest host Bryan Casey is joined by Chris Hay, Skyler Speakman, and Volkmar Uhlig. Today, the experts chat about Agent Q and the improvements in reasoning and planning. Next, the CEO of Procreate came out with a statement that there will be no gen AI integrated into their products. Can art avoid the AI wave? Finally, AMD acquired ZT Systems. Can they now compete with NVIDIA? All this and more on today's episode. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Segments: 0:01 - Intro 00:51 - Agent Q 14:21 - No AI in Art 29:12 - AMD Acquires ZT Systems

    Episode 16: Cost of a Data Breach 2024 and OpenAI's Project Strawberry

    Play Episode Listen Later Aug 16, 2024 22:56


    Is OpenAI about to release their biggest AI project? In Episode 16 of Mixture of Experts, host Tim Hwang is joined by Nathalie Baracaldo, Kate Soule, and Shobhit Varshney. Today, the experts chat about IBM's 2024 Cost of a Data Breach Report and analyze how gen AI could reduce the cost of cyber threats. Next, rumors are circulating on the internet about OpenAI dropping “Project Strawberry,” what they internally reference as a “level 2” model. Are the rumors true? Tune in for more. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Segments: 0:01 - Intro 00:52 - Cost of a Data Breach 2024 12:33 - Project Strawberry

    Episode 15: OpenAI Structured Outputs, character.ai “acquisition,” and is it an AI bubble?

    Play Episode Listen Later Aug 9, 2024 32:08


    Is it an AI bubble? In Episode 15 of Mixture of Experts, host Tim Hwang is joined by our veteran panel: Marina Danilevsky, Kush Varshney, and Shobhit Varshney. Today, the experts chat about the stock market crash and the involvement of AI companies. Then, OpenAI released Structured Outputs, and we analyze how this can support enterprise implementation of AI. Finally, Google "acquires" character.ai. Does this make any sense? Tune in for the breakdown. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. 0:01 - Intro 1:07 - AI Bubble? 11:49 - Structured Outputs 22:41 - character.ai "Acquisition"

    Episode 14: SAM 2, friend.com and will gen AI projects be abandoned?

    Play Episode Listen Later Aug 2, 2024 28:48


    Meta releases SAM 2! In Episode 14 of Mixture of Experts, host Tim Hwang is joined by Ambhi Ganesan, Kate Soule and Vagner Santana. Today, the experts chat about the next generation of Meta's Segment Anything Model (SAM). Then, with another AI companion attempt via friend.com, we analyze whether startups can effectively compete in the AI hardware space. Finally, we get expert opinions on various topics: Will gen AI projects be abandoned? Which is bigger, 9.11 or 9.9? Tune in today to find out. 0:01 - Intro 1:00 - SAM 2 10:49 - Friend.com 20:53 - Abandoned gen AI projects 25:38 - Which is bigger, 9.11 or 9.9? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 13: Meta's Llama 3.1, Mistral Large 2 and big interest in small models

    Play Episode Listen Later Jul 26, 2024 20:20


    Meta strikes back with the launch of Llama 3.1! In Episode 13 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Shobhit Varshney, and Maryam Ashoori. Today, the experts analyze the business of AI in relation to the launch of Llama 3.1, including Llama 405B. Then, Mistral Large 2 sparks conversation about the open source wave. Finally, the experts talk GPT-4o mini and the model price war. Are little models having their moment? Tune in to find out. 0:01 - Intro 1:33 - Llama 3.1 and Mistral Large 2 10:08 - Are Little Models Having a Moment? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 12: Goldman Sachs Gen AI report, Claude 2.0 Engineer, and RIAA lawsuits

    Play Episode Listen Later Jul 19, 2024 31:15


    In Episode 12 of Mixture of Experts, host Tim Hwang is joined by Chris Hay, Marina Danilevsky, and Brent Smolinski. Today, we review Goldman Sachs' report on investment in Gen AI, “too much spend, too little benefit.” Next, the experts break down Claude 2.0 Engineer and the future of coding agents. Finally, the Recording Industry Association of America (RIAA) files lawsuits against two generative music companies. Will modern AI break the music industry? Tune in today to find out. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 11: AI at Wimbledon, ChatGPT for coding, and scaling with AI personas

    Play Episode Listen Later Jul 12, 2024 41:21


    In Episode 11 of Mixture of Experts, host Tim Hwang is joined by Aaron Baughman, Kaoutar El Maghraoui, and Skyler Speakman, and it's Wimbledon finals week! Today, we review how AI is providing insights throughout one of the most prestigious tennis tournaments and the future of AI in sports. Next, the experts break down the quality of ChatGPT for coding. Finally, how did scaling synthetic data create one billion personas? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 10: AI Hardware: Training, Inference, Devices and Model Optimization

    Play Episode Listen Later Jul 3, 2024 38:26


    In Episode 10 of Mixture of Experts we are talking all hardware all the time. Guest host Bryan Casey is joined by Volkmar Uhlig, Chris Hay, and Kaoutar El Maghraoui to explore the intricacies of AI hardware. Is Apple creating a pattern for the industry with their on-device and cloud architecture? Tune in to hear the experts debate the details. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 9: Claude 3.5 Sonnet, BIRD-SQL, and the latest in AI Slop

    Play Episode Listen Later Jun 28, 2024 39:03


    In Episode 9 of Mixture of Experts, guest host Bryan Casey is joined by Shobhit Varshney, Marina Danilevsky, and Michael Glass. The experts analyze the release of both Anthropic's Claude 3.5 and BIRD-SQL. Finally, we talk about the latest in AI slop: is shrimp Jesus the best use case of AI content creation? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 8: NVIDIA's Nemotron-4 340B models, Safe Superintelligence Inc. and AI agents

    Play Episode Listen Later Jun 21, 2024 43:47


    In Episode 8 of Mixture of Experts, host Tim Hwang is joined by Kush Varshney, Kate Soule, and Maya Murad. First, the experts react to NVIDIA's Nemotron-4 340B model launch and the future of LLM training. Next, new developments in enterprise agents create a great discussion around the reality of AI agents. Finally, will Safe Superintelligence Inc. become a major player in the AI space? The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 7: Apple's WWDC24 reactions and mechanistic interpretability

    Play Episode Listen Later Jun 14, 2024 39:41


    In Episode 7 of Mixture of Experts, host Tim Hwang is joined by Shobhit Varshney, Skyler Speakman, and Kaoutar El Maghraoui. Today, the experts react to Apple's WWDC24 announcements. Is Apple late to the AI game? Then, part 2 on interpretability this week, as OpenAI released their study on mechanistic interpretability. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

    Episode 6: AI safety, RAG benchmarking and responsible AI at ACM FAccT Conference

    Play Episode Listen Later Jun 7, 2024 40:29


    In Episode 6 of Mixture of Experts, host Tim Hwang is joined by Vagner Figueredo de Santana, Marina Danilevsky, and Shobhit Varshney. Today, what's the future of AGI? The experts unpack Leopold Aschenbrenner's AI safety screed, Situational Awareness. We also break down the state of responsible AI amid the annual ACM Fairness, Accountability, and Transparency (FAccT) Conference. Finally, we chat about RAG benchmarking and what it tells us about the industry as a whole. The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.
